terminator123

terminator123

3 followers · 17 following

Stars

NVIDIA / kvpress

LLM KV cache compression made easy

Python 1,091 147 Updated May 18, 2026

DSXiangLi / DecryptPrompt

总结Prompt&LLM论文，开源数据&模型，AIGC应用

3,408 322 Updated May 6, 2026

PaperDecision / PaperDecision

Python 186 3 Updated Jan 19, 2026

skindhu / How-To-Scale-Your-Model-CN

《How to Scale Your Model》中文翻译项目 - 智能技术文档翻译工具。专为大语言模型扩展技术书籍设计，突破长文档翻译瓶颈，完美保留数学公式、代码块格式。采用占位符机制+分层翻译策略，基于Gemini API提供高质量翻译。Python+crawl4ai技术栈，支持批量处理和增量更新。

HTML 167 16 Updated Aug 30, 2025

linkedin / Liger-Kernel

Efficient Triton Kernels for LLM Training

Python 6,382 532 Updated May 25, 2026

Infrasys-AI / AIInfra

AIInfra（AI 基础设施）指AI系统从底层芯片等硬件，到上层软件栈支持AI大模型训练和推理。

Jupyter Notebook 7,118 916 Updated Dec 22, 2025

meituan-longcat / LongCat-Flash-Chat

1,335 66 Updated May 25, 2026

yanring / Megatron-MoE-ModelZoo

Best practices for training DeepSeek, Mixtral, Qwen and other MoE models using Megatron Core.

Python 192 32 Updated May 20, 2026

BBuf / how-to-optim-algorithm-in-cuda

how to optimize some algorithm in cuda.

Cuda 3,021 277 Updated May 25, 2026

AngleMAXIN / llm-application-interview

333 18 Updated Jun 2, 2025

stas00 / ml-engineering

Machine Learning Engineering Open Book

Python 17,980 1,140 Updated May 18, 2026

meta-pytorch / torchtune

PyTorch native post-training library

Python 5,760 723 Updated May 25, 2026

Victarry / PP-Schedule-Visualization

Pipeline Parallelism Emulation and Visualization

Python 83 9 Updated Jan 8, 2026

MooreThreads / SimuMax

a static analytical model for LLM distributed training

Python 133 19 Updated May 11, 2026

sail-sg / zero-bubble-pipeline-parallelism

Forked from NVIDIA/Megatron-LM

Zero Bubble Pipeline Parallelism

Python 456 34 Updated May 7, 2025

deepseek-ai / open-infra-index

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,999 288 Updated May 15, 2025

liu-xiao-guo / semantic_search_es

Jupyter Notebook 9 5 Updated Nov 5, 2024

yangjianxin1 / Firefly

Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 6,643 586 Updated Oct 24, 2024

RUC-NLPIR / LLM4IR-Survey

This is the repo for the survey of LLM4IR.

535 44 Updated Nov 13, 2025

mli / paper-reading

深度学习经典、新论文逐段精读

33,330 2,803 Updated Mar 22, 2025

shuxueslpi / chatGLM-6B-QLoRA

使用peft库，对chatGLM-6B/chatGLM2-6B实现4bit的QLoRA高效微调，并做lora model和base model的merge及4bit的量化（quantize）。

Python 358 45 Updated Aug 22, 2023

liucongg / ChatGLM-Finetuning

基于ChatGLM-6B、ChatGLM2-6B、ChatGLM3-6B模型，进行下游具体任务微调，涉及Freeze、Lora、P-tuning、全参微调等

Python 2,778 308 Updated Dec 12, 2023

necrophagists / ChatGLM2_Lora

This repo was a simple way to implement Lora to fine-tuning ChatGLM2.这个项目是用LORA微调chatglm2的简单实现。

Python 9 1 Updated Aug 21, 2023

DUTIR-Emotion-Group / CCL2019-Chinese-Humor-Computation

CCL2019，“小牛杯”中文幽默计算任务的数据集及baseline

Jupyter Notebook 25 4 Updated Aug 27, 2024

esbatmop / MNBVC

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化，也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

4,199 288 Updated May 23, 2026

THUDM / WebGLM

WebGLM: An Efficient Web-enhanced Question Answering System (KDD 2023)

Python 1,604 134 Updated Mar 25, 2025

kamalkraj / BERT-NER

Pytorch-Named-Entity-Recognition-with-BERT

Python 1,249 271 Updated May 6, 2021

mlfoundations / open_clip

An open source implementation of CLIP.

Python 13,855 1,283 Updated May 25, 2026

brightmart / nlp_chinese_corpus

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

9,893 1,555 Updated Feb 6, 2026

Werneror / Poetry

非常全的古诗词数据，收录了从先秦到现代的共计85万余首古诗词。

Python 1,735 417 Updated Aug 8, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly