Blog

AboutBlogPublications CV

Implementing UL2 for Decoder-Only Language Models

An in-depth look at modeling considerationsRead More →

Sun Oct 20 2024

How does torch.compile speed up a transformer?

A case study of kernel fusion for a vision transformerRead More →

Fri Jul 12 2024

Transformer FLOPs

How to count FLOPs and why it's useful.Read More →

Tue May 16 2023

2024 © Adam Casson.