Skip to content
View okfang's full-sized avatar
  • Guangzhou China

Block or report okfang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 19,238 1,767 Updated Jan 30, 2026

MM-Eureka V0 also called R1-Multimodal-Journey, Latest version is in MM-Eureka

Python 325 10 Updated Jun 21, 2025

Witness the aha moment of VLM with less than $3.

Python 4,059 284 Updated May 19, 2025

A fork to add multimodal model training to open-r1

Python 1,550 72 Updated Feb 8, 2025

Fully open reproduction of DeepSeek-R1

Python 26,019 2,419 Updated Apr 2, 2026

Chinese safety prompts for evaluating and improving the safety of LLMs. 中文安全prompts,用于评估和提升大模型的安全性。

1,165 89 Updated Feb 27, 2024

NTK scaled version of ALiBi position encoding in Transformer.

69 3 Updated Aug 16, 2023

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

4,199 288 Updated May 23, 2026

ChatGPT 国粹版,和 GPT 一起学习地道的中国话吧

TypeScript 811 91 Updated Jul 28, 2023

TigerBot: A multi-language multi-task LLM

Python 2,264 189 Updated Dec 28, 2024

Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。

Python 4,134 379 Updated Aug 13, 2024

CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)

Python 8,789 690 Updated Aug 13, 2024

Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合

Python 5,564 511 Updated May 15, 2026

ChatGPT 中文指南🔥,ChatGPT 中文调教指南,指令指南,应用开发指南,精选资源清单,更好的使用 chatGPT 让你的生产力 up up up! 🚀

Python 11,542 938 Updated Nov 5, 2024

收集了目前为止中文领域的MRC抽取式数据集

123 15 Updated Jun 20, 2024

MOSS 003 WebSearchTool: A simple but reliable implementation

Python 45 9 Updated May 24, 2023

GLM (General Language Model)

Python 3,501 352 Updated Nov 3, 2023

An open-source tool-augmented conversational language model from Fudan University

Python 12,122 1,134 Updated Jul 13, 2024

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Python 5,658 503 Updated Jul 18, 2024

开源SFT数据集整理,随时补充

579 41 Updated Jun 2, 2023

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

22,578 2,127 Updated May 10, 2026

Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…

Python 38,089 6,210 Updated Nov 10, 2025

WebGLM: An Efficient Web-enhanced Question Answering System (KDD 2023)

Python 1,604 134 Updated Mar 25, 2025

Official codes for ACL 2023 paper "WebCPM: Interactive Web Search for Chinese Long-form Question Answering"

HTML 913 73 Updated Nov 25, 2023

行业内关于智能客服、聊天机器人的应用和架构、算法分享和介绍

1,368 252 Updated Aug 26, 2025

【干货】史上最全的PyTorch学习资源汇总

Python 4,731 830 Updated Aug 14, 2019

Using GPT to organize and access information, and generate questions. Long term goal is to make an agent-like research assistant.

Jupyter Notebook 693 51 Updated Oct 21, 2025

Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 6,643 586 Updated Oct 24, 2024

A quick guide (especially) for trending instruction finetuning datasets

3,385 237 Updated Nov 28, 2023

Awesome-LLM: a curated list of Large Language Model

26,853 2,544 Updated Jul 31, 2025
Next