Rabbit Holes

Rabbit Holes, Learning, thinking, and writing.

  • Blog
  • About
  • RSS
  • Search

A hitchhiker’s guide to CUDA programming

May 5, 2024

How to write a CUDA kernel to achieve 95% cuBLAS performance

State Space Models lack sequence-crossing

Mar 2, 2024

My opinions on SSMs

DDIM

Feb 24, 2024

Takeaways from DDIM

Interactive Fréchet Distance Visualization

Nov 23, 2023

Walk the walk and walk the dog

Your typical HF architecture

Apr 22, 2022

If you were to design an HF

EM in a nutshell

Aug 2, 2019

A quick rundown on expectation maximization


© Sean Zhang 2021 - 2025