- Beijing, China
Stars
Optimized primitives for collective multi-GPU communication
Knowhere is an open-source vector search engine, integrating FAISS, HNSW, etc.
RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing …
LlamaIndex is the leading document agent and OCR platform
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
Reference models for Intel(R) Gaudi(R) AI Accelerator
Chinese LLaMA-2 & Alpaca-2 LLMs, phase-two project, with 64K long-context models
Accessible large language models via k-bit quantization for PyTorch.
Example models using DeepSpeed
Fast inference engine for Transformer models
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
PyTorch implementation of MAE https://arxiv.org/abs/2111.06377
A curated list of awesome papers related to pre-trained models for information retrieval (a.k.a., pretraining for IR).
CUDA Templates and Python DSLs for High-Performance Linear Algebra
Development repository for the Triton language and compiler
The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.
A fast JSON parser/generator for C++ with both SAX/DOM style API
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
Robust Speech Recognition via Large-Scale Weak Supervision
Unsupervised text tokenizer for Neural Network-based text generation.
A library for efficient similarity search and clustering of dense vectors.
💫 Industrial-strength Natural Language Processing (NLP) in Python
🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy

