Stars
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
A backend management system based on vue3, typescript, element-plus, and vite
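🔥 An admin dashboard front-end template built with Vue 3 + Vite 7 + TypeScript + element-plus (with companion back-end source code); the Vue 3 version of vue-element-admin.
An admin management system built with Vue3, Element Plus, and TypeScript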
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…
GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Optimized BERT transformer inference on NVIDIA GPUs. https://arxiv.org/abs/2210.03052
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
Transformer-related optimization, including BERT, GPT
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
FlashMLA: Efficient Multi-head Latent Attention Kernels
DeepEP: an efficient expert-parallel communication library
Fast and memory-efficient exact attention
Fully open reproduction of DeepSeek-R1
【TMM 2025🔥】 Mixture-of-Experts for Large Vision-Language Models
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Puzzles for learning Triton
Similarities: a toolkit for similarity calculation and semantic search. Supports text-to-text, text-to-image, and image-to-image search over hundreds of millions of records; written in Python 3 and ready to use out of the box.
CLIP as a service - Embed images and sentences, object recognition, visual reasoning, image classification and reverse image search
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
Large Language Model Text Generation Inference
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Qwen3 is the large language model series developed by the Qwen team, Alibaba Cloud.
GLM-4 series: Open Multilingual Multimodal Chat LMs