View minyoungg's full-sized avatar

🧱 Modula software package

Python 327 31 Updated Aug 18, 2025
Python 681 59 Updated Apr 12, 2025

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 2,375 450 Updated Apr 13, 2026

Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using Llama mode…

Jupyter Notebook 18,288 2,722 Updated Apr 1, 2026

PyTorch native post-training library

Python 5,732 714 Updated Apr 14, 2026

Minimalistic large language model 3D-parallelism training

Python 2,650 293 Updated Apr 7, 2026

A framework for few-shot evaluation of language models.

Python 12,171 3,182 Updated Apr 8, 2026
Python 71 8 Updated Jul 11, 2024

Lightweight, standalone C++ inference engine for Google's Gemma models.

C++ 6,852 624 Updated Apr 14, 2026

PyTorch native quantization and sparsity for training and inference

Python 2,772 481 Updated Apr 14, 2026

Mamba SSM architecture

Python 17,969 1,687 Updated Apr 13, 2026

Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

Python 6,190 572 Updated Aug 22, 2025

Python toolbox for optimization on Riemannian manifolds with support for automatic differentiation

Python 889 166 Updated Jun 2, 2025

Train transformer language models with reinforcement learning.

Python 18,039 2,643 Updated Apr 14, 2026

Fast and memory-efficient exact attention

Python 23,352 2,614 Updated Apr 14, 2026

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…

Python 3,271 692 Updated Apr 13, 2026

PyTorch extensions for high performance and large scale training.

Python 3,405 296 Updated Apr 26, 2025

Ongoing research training transformer models at scale

Python 16,037 3,825 Updated Apr 14, 2026

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Python 6,080 520 Updated Jul 1, 2025

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 32,198 6,676 Updated Sep 30, 2025

A fast, clean, responsive Hugo theme.

HTML 13,361 3,362 Updated Apr 11, 2026

Inference Llama 2 in one file of pure C

C 19,396 2,503 Updated Aug 6, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 56,656 9,690 Updated Nov 12, 2025

Foundation Architecture for (M)LLMs

Python 3,136 225 Updated Apr 11, 2024

Hugging Face-compatible implementation of RetNet (Retentive Networks, https://arxiv.org/pdf/2307.08621.pdf), including parallel, recurrent, and chunkwise forward.

Jupyter Notebook 227 26 Updated Mar 12, 2024

Generative Models by Stability AI

Python 27,085 3,069 Updated Dec 16, 2025

Accessible large language models via k-bit quantization for PyTorch.

Python 8,116 840 Updated Apr 14, 2026

Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".

Python 111 12 Updated Jun 8, 2023

Open source code for paper "On the Learning and Learnability of Quasimetrics".

C++ 32 1 Updated Nov 28, 2022