- 🔭 I’m currently working on: Scaling machine learning (ML) models, post-training, and agents
- 🌱 I’m currently learning: Post-training techniques, GPU kernels, distributed training, and inference engines
- 💬 Ask me about: AI, ML, LLMs, computer vision, startups
- 📫 How to reach me: Twitter, Email
⏩
GPU Mode
Research Engineer @ Google DeepMind. Interested in post-training and ML systems
- Google DeepMind
- Toronto, Canada
- shashankshekhar.com
- @sshkhr16
- in/sshkhr
- @sshkhr.bsky.social
Pinned
- awesome-mlss/awesome-mlss: 🤖 Machine Learning Summer School Guide
- safeguarding-llms: TMLS 2024 Workshop: A Practitioner's Guide To Safeguarding Your LLM Applications
- mla-pytorch: Minimal PyTorch implementation of DeepSeek's Multi-Head Latent Attention + benchmarks
Python 5




