-
Guangzhou University
- Beijing
- https://github.com/Guanyuansheng
Stars
Give your AI agent eyes to see the entire internet. Read & search Twitter, Reddit, YouTube, GitHub, Bilibili, XiaoHongShu — one CLI, zero API fees.
"🐈 nanobot: The Ultra-Lightweight Personal AI Assistant"
An agentic skills framework & software development methodology that works.
Fully autonomous & self-evolving research from idea to paper. Chat an Idea. Get a Paper. 🦞
Chrome DevTools for coding agents
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
lihaoyun6 / FlashVSR_plus
Forked from OpenImagingLab/FlashVSRTowards Real-Time Diffusion-Based Streaming Video Super-Resolution — An efficient one-step diffusion framework for streaming VSR with locality-constrained sparse attention and a tiny conditional de…
More practical frame interpolation approach.
[NeurIPS 2024] Generalizable Implicit Motion Modeling for Video Frame Interpolation
[NeurIPS'25] DOVE: Efficient One-Step Diffusion Model for Real-World Video Super-Resolution
A curated list of resources for video super-resolution using diffusion models.
🔥(CVPR 2025 Highlight) Dyn-HaMR: Recovering 4D Interacting Hand Motion from a Dynamic Camera
📹 A more flexible framework that can generate videos at any resolution and creates videos from images.
TransNet V2: Shot Boundary Detection Neural Network
[NeurIPS 2024] Neural Localizer Fields for Continuous 3D Human Pose and Shape Estimation
[ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis
Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment
(ECCV 2024) SignAvatars: A Large-scale 3D Sign Language Holistic Motion Dataset and Benchmark
UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer
Lets make video diffusion practical!
[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing
SkyReels V1: The first and most advanced open-source human-centric video foundation model
【Accepted by TPAMI】Human Motion Video Generation: A Survey (https://ieeexplore.ieee.org/document/11106267)
wip - running some training with overfitting - https://wandb.ai/snoozie/vasa-overfitting
Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"
Measures and metrics for image2image tasks. PyTorch.
