←
Home
Blog
About
Press
Media
Subscribe
Matt Suiche
Hacker · Founder of OnDB
X
LinkedIn
2025
Oct 15
AMD GPU Support in Triton Gluon Framework
GPU
Triton
Gluon
AMD
ROCm
HIP
CUDA
Performance
Oct 15
RustBPE: High-Performance BPE Tokenizer Training in Rust
Rust
Machine Learning
Natural Language Processing
Tokenization
Performance Engineering
BPE
PyO3
Sep 30
Optimizing AlphaFold's Triangle Multiplicative Update: A First Look at GPU Performance Engineering
GPU Optimization
PyTorch
Triton
AlphaFold
Machine Learning
Performance Engineering
H100
Tensor Cores
Sep 23
Gluon: When Triton Isn't Low-Level Enough
GPU
Triton
Gluon
Performance
CUDA
Deep Learning
PyTorch
Sep 14
The Hidden Math Bug That Makes AI Unpredictable
determinism
floating-point
neural-networks
pytorch
mlx