Ramblings on Deep Learning

A hitchhiker’s guide to CUDA programming

CUDA

programming

technical

Walkthrough of writing a SGEMM kernel that achieves 95% of cuBLAS performance

Mar 5, 2025

Sean Zhang

SSM lacks sequence mixing

deep learning

sequence modeling

technical

Every architecture contains some implicit trade-offs. My impression is SSMs are a good sequential architecture for modalities where interactions within a sequence matters less than a good compression of past states.

Mar 2, 2024

Sean Zhang

Interactive FID

deep learning

image generation

Imagine that you have 2 curves in a 2-D space, how would you measure the similarity of these 2 curves?

Nov 23, 2023

Sean Zhang

EM in a nutshell

deep learning

probability

technical

Explaining the EM algorithm in a nutshell

Aug 2, 2019

Sean Zhang

More Ramblings?

Categories

A hitchhiker’s guide to CUDA programming

SSM lacks sequence mixing

Interactive FID

EM in a nutshell