Stars
短剧平台 AI Short Film Motion Comic Generation Platform Industrial AI Motion Comic & Video Workbench
🚀 AI 全自动短视频引擎 | AI Fully Automated Short Video Engine
Production-grade engineering skills for AI coding agents.
🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl!
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning
An open-source alternative to Claude Cowork (powered by opencode)
The 500 AI Agents Projects is a curated collection of AI agent use cases across various industries. It showcases practical applications and provides links to open-source projects for implementation…
Extracted system prompts from Anthropic - Opus 4.7, Opus 4.6, Sonnet 4.6. OpenAI - ChatGPT 5.5 Thinking, GPT 5.5 Instant, Codex. Google Gemini - 3.5 Flash, 3.1 Pro, 3 Flash, Antigravity. xAI - Grok…
This API provides programmatic access to the AlphaGenome model developed by Google DeepMind.
An open-source AI agent that brings the power of Gemini directly into your terminal.
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
[ICCV 2025] LayerAnimate: Layer-specific Control for Animation
A Conversational Speech Generation Model
StableSwarmUI, A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.
Genome modeling and design across all domains of life
Wan: Open and Advanced Large-Scale Video Generative Models
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …
SkyReels V1: The first and most advanced open-source human-centric video foundation model
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
A feature-rich command-line audio/video downloader
Easily fine-tune, evaluate and deploy Gemma 4, Qwen3.5, Qwen3.6, gpt-oss, DeepSeek-R1, or any open source LLM / VLM!
Janus-Series: Unified Multimodal Understanding and Generation Models