Stars
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
ChatDev 2.0: Dev All through LLM-powered Multi-Agent Collaboration
Lets make video diffusion practical!
A framework to enable multimodal models to operate a computer.
The first "code-first" agent framework for seamlessly planning and executing data analytics tasks.
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching
Twitter API Scraper | Without an API key | Twitter Internal API | Free | Twitter scraper | Twitter Bot
Reliable Multi-Agent Orchestration Framework
Toutatis is a tool that allows you to extract information from instagrams accounts such as e-mails, phone numbers and more
From Images to High-Fidelity 3D Assets with Production-Ready PBR Material
An environmental monitoring and regulation system
HAAS = Hierarchical Autonomous Agent Swarm - "Resistance is futile!"
GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code
[NeurIPS 2025] PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers
AI powered speech denoising and enhancement
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)
A Simple Implementation of Qwen3-TTS's ComfyUI
Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured 3D Assets
GPU Poor Version of Hunyuan3D-2
cjeen / LoRAEdit
Forked from tdrussell/diffusion-pipeWe achieves high-quality first-frame guided video editing given a reference image, while maintaining flexibility for incorporating additional reference conditions.
[ICCV 2025] Official code for AnimateAnyMesh: A Feed-Forward 4D Foundation Model for Text-Driven Universal Mesh Animation
Autoforge takes a picture and generates a 3D layer STL file that you can print with a 3d printer




