-
深圳大学
- shenzhen
-
19:25
(UTC +08:00)
Highlights
- Pro
Popular repositories Loading
-
assignment1-basics
assignment1-basics PublicForked from stanford-cs336/assignment1-basics
Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch
Python
-
OLMo
OLMo PublicForked from allenai/OLMo
Modeling, training, eval, and inference code for OLMo
Python
-
gated_attention
gated_attention PublicForked from qiuzh20/gated_attention
The official implementation for [NeurIPS2025 Oral] Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free
Jupyter Notebook
-
open-r1
open-r1 PublicForked from huggingface/open-r1
Fully open reproduction of DeepSeek-R1
Python
If the problem persists, check the GitHub status page or contact support.