Skip to content
View rwightman's full-sized avatar

Sponsoring

@borisdayma

Highlights

  • Pro

Block or report rwightman

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

HeadTTS: Free neural text-to-speech (Kokoro) with timestamps and visemes for lip-sync. Runs in-browser (WebGPU/WASM) or on local Node.js WebSocket/REST server (CPU).

JavaScript 121 16 Updated Dec 8, 2025

Talking Head (3D): A JavaScript class for real-time lip-sync using full-body 3D avatars.

JavaScript 1,130 282 Updated Mar 28, 2026

A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.

Python 35,031 2,369 Updated Mar 22, 2026
Python 2 Updated Jan 27, 2026

An embedded key-value database in pure Rust

Rust 4,370 207 Updated Mar 18, 2026

Copper is an operating system for robots - build, run, and replay your entire robot deterministically.

Rust 1,239 78 Updated Mar 30, 2026

PyTorch Single Controller

Rust 1,002 156 Updated Mar 31, 2026

The best ChatGPT that $100 can buy.

Python 50,721 6,652 Updated Mar 27, 2026
Python 26 1 Updated Mar 30, 2026

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,949 2,061 Updated Mar 27, 2026

An implementation of PSGD Kron in JAX for distributed training in JAX or Flax

Python 10 Updated Nov 6, 2025

Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation preconditioner and more)

Python 193 13 Updated Mar 22, 2026

An implementation of PSGD Kron second-order optimizer for PyTorch

Python 99 6 Updated Jul 24, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,704 2,235 Updated Feb 1, 2025

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, an…

TypeScript 22,590 1,295 Updated Mar 30, 2026

Official repository of the xLSTM.

Python 2,139 175 Updated Nov 4, 2025

[TIP2024] MWFormer: Multi-Weather Image Restoration Using Degradation-Aware Transformers

Python 76 6 Updated Dec 6, 2024

[ICLR 2026] When it comes to optimizers, it's always better to be safe than sorry

Python 410 13 Updated Sep 26, 2025

Experimental CUDA kernel framework unifying typed dimensions, NVRTC JIT specialization, and ML‑guided tuning.

Python 46 2 Updated Feb 9, 2026

Supercharge Your PyTorch Image Models: Bag of Tricks to 8x Faster Inference with ONNX Runtime & Optimizations

Jupyter Notebook 24 Updated Oct 4, 2024

Scenic: A Jax Library for Computer Vision Research and Beyond

Python 3,785 477 Updated Mar 25, 2026

A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. Stored as pure Python. All in a modern, AI-native editor.

Python 20,028 1,004 Updated Mar 31, 2026

Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

Python 15,291 1,709 Updated Jun 25, 2025

Robot Utility Models are trained on a diverse set of environments and objects, and then can be deployed in novel environments with novel objects without any further data or training.

Python 243 16 Updated Jan 19, 2026

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

Python 12,716 1,703 Updated Mar 29, 2026

Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 1,328 71 Updated Jan 27, 2026

MambaOut: Do We Really Need Mamba for Vision? (CVPR 2025)

Python 2,677 47 Updated Mar 9, 2025

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,995 137 Updated Nov 7, 2025

A PyTorch native platform for training generative AI models

Python 5,200 766 Updated Mar 31, 2026
Next