Skip to content
View bsdcfp's full-sized avatar

Block or report bsdcfp

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
JavaScript 2,074 115 Updated Apr 2, 2026

A framework for efficient model inference with omni-modality models

Python 4,122 674 Updated Apr 2, 2026

Zotero MCP: Connects your Zotero research library with Claude and other AI assistants via the Model Context Protocol to discuss papers, get summaries, analyze citations, and more.

Python 2,231 212 Updated Mar 26, 2026

The awesome collection of OpenClaw skills. 5,400+ skills filtered and categorized from the official OpenClaw Skills Registry.🦞

43,871 4,205 Updated Apr 1, 2026

Public repository for Agent Skills

Python 109,324 12,264 Updated Mar 25, 2026

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 6,308 851 Updated Mar 22, 2026

FlashInfer: Kernel Library for LLM Serving

Python 5,260 846 Updated Apr 3, 2026

Official inference repo for FLUX.2 models

Python 2,070 134 Updated Mar 12, 2026

High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.

Python 3,831 453 Updated Apr 3, 2026

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

Python 76,950 8,630 Updated Apr 2, 2026

Ongoing research training transformer models at scale

Python 15,891 3,783 Updated Apr 3, 2026

A curated list of research papers, resources, and advancements on Diffusion Cache and related efficient diffusion model acceleration techniques.

79 3 Updated Nov 4, 2025

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Python 34,889 4,293 Updated Aug 6, 2024

A curated list of materials on AI efficiency

215 20 Updated Feb 22, 2026

ValueCell is a community-driven, multi-agent platform for financial applications.

Python 10,187 1,739 Updated Mar 9, 2026

Self Refining Diffusion Samplers

4 Updated Dec 11, 2024

[ICCV 2025] CHORDS: Diffusion Sampling Accelerator with Multi-core Hierarchical ODE Solvers

Python 16 1 Updated Mar 3, 2026

[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment

Python 3,534 263 Updated Jul 31, 2025

[MLSys'25] QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving; [MLSys'25] LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention

C++ 820 61 Updated Mar 6, 2025

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 107,614 12,429 Updated Apr 2, 2026

Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high …

Python 720 224 Updated Mar 6, 2026

💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.

1,156 67 Updated Aug 17, 2025

Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.

Python 337 77 Updated Apr 3, 2026

[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

Python 3,761 237 Updated Mar 7, 2026

compiler learning resources collect.

Python 2,702 368 Updated Mar 19, 2025

how to optimize some algorithm in cuda.

Cuda 2,905 267 Updated Apr 1, 2026

Official inference repo for FLUX.1 models

Python 25,375 1,873 Updated Jul 31, 2025

📝A simple and elegant markdown editor, available for Linux, macOS and Windows.

JavaScript 54,855 4,080 Updated Mar 4, 2026
Next