ywdong

Starred repositories

Fast, small, and fully autonomous AI assistant infrastructure — deploy anywhere, swap anything 🦀

Rust 25,872 3,314 Updated Mar 11, 2026

"🐈 nanobot: The Ultra-Lightweight OpenClaw"

Python 32,045 5,264 Updated Mar 10, 2026

A lightweight alternative to OpenClaw that runs in containers for security. Connects to WhatsApp, Telegram, Slack, Discord, Gmail and other messaging apps, has memory, scheduled jobs, and runs dir…

TypeScript 21,464 4,037 Updated Mar 10, 2026

An open-source SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skills and subagents, it handles different levels of tasks that could take minute…

Python 28,719 3,411 Updated Mar 11, 2026

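🎬 Kaka Subtitle Assistant | VideoCaptioner - An LLM-based intelligent subtitle assistant - end-to-end processing for video subtitle generation, sentence segmentation, correction, and subtitle translation! - A powered tool for easy and efficient video subtitling.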

Python 13,501 1,104 Updated Feb 26, 2026

Anthropic's Interactive Prompt Engineering Tutorial

Jupyter Notebook 33,222 3,387 Updated Mar 1, 2026

A free, open source, and extensible speech-to-text application that works completely offline.

Rust 17,305 1,294 Updated Mar 11, 2026

We write your reusable computer vision tools. 💜

Python 36,671 3,117 Updated Mar 11, 2026

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 9,772 766 Updated Mar 10, 2026

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 7,524 454 Updated Feb 10, 2026

[NeurIPS 2025] Official implementation of "XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation".

Python 623 45 Updated Oct 22, 2025

A unified inference and post-training framework for accelerated video generation.

Python 3,140 278 Updated Mar 11, 2026

An inference and training framework for multiple image input in Flux Kontext dev

Jupyter Notebook 438 33 Updated Sep 1, 2025

A PyTorch native platform for training generative AI models

Python 5,125 732 Updated Mar 11, 2026

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.

Python 5,344 402 Updated Apr 21, 2025

Collection of ComfyUI Workflows

23 1 Updated Jul 24, 2025

The ultimate training toolkit for finetuning diffusion models

Python 9,692 1,166 Updated Mar 10, 2026
Python 1,730 245 Updated Mar 6, 2026

[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing

Python 3,678 251 Updated Oct 17, 2025

This node preserves image quality by selectively merging only the changed regions from AI-generated edits back into the original image.

Python 92 7 Updated Aug 12, 2025
Python 387 12 Updated Jul 13, 2025

Simple, Efficient, and Effective Negative Guidance in Few-Step Image Generation Models By Value Sign Flip

Jupyter Notebook 37 1 Updated Jan 27, 2026

Repo for SeedVR2 (ICLR2026) & SeedVR (CVPR2025 Highlight)

Python 1,074 63 Updated Jan 27, 2026

Use Claude Code as the foundation for coding infrastructure, allowing you to decide how to interact with the model while enjoying updates from Anthropic.

TypeScript 29,396 2,258 Updated Mar 4, 2026

ComfyUI implementation for NAG

Python 305 36 Updated Nov 3, 2025

LBM: Latent Bridge Matching for Fast Image-to-Image Translation ✨ (ICCV 2025 Highlight)

Python 822 55 Updated Jul 24, 2025

Context engineering is the new vibe coding - it's the way to actually make AI coding assistants work. Claude Code is the best for this so that's what this repo is centered around, but you can apply…

Python 12,718 2,645 Updated Feb 13, 2026

[NeurIPS' 2025] JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent

Python 773 32 Updated Feb 22, 2026

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 97,190 12,119 Updated Mar 11, 2026

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 3,774 315 Updated Nov 28, 2025