The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 18,851 2,413 Updated Mar 20, 2026

huangb23 / VTimeLLM

[CVPR'2024 Highlight] Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments".

Python 295 13 Updated Jun 13, 2024

Atomic-man007 / Awesome_Multimodel_LLM

Awesome_Multimodel is a curated GitHub repository that provides a comprehensive collection of resources for Multimodal Large Language Models (MLLM). It covers datasets, tuning techniques, in-contex…

365 23 Updated Mar 19, 2025

wgwang / awesome-LLMs-In-China

中国大模型

6,430 557 Updated Nov 30, 2024

microsoft / Cream

This is a collection of our NAS and Vision Transformer work.

Python 1,829 241 Updated Jul 25, 2024

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 75,302 15,179 Updated Apr 5, 2026

zai-org / CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 6,734 452 Updated May 29, 2024

jackaduma / awesome_LLMs_interview_notes

LLMs interview notes and answers:该仓库主要记录大模型（LLMs）算法工程师相关的面试题和参考答案

1,312 327 Updated Dec 14, 2023

OFA-Sys / Chinese-CLIP

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Jupyter Notebook 5,852 549 Updated Mar 31, 2026

toggle1995 / RIS-DMMI

Python 45 Updated Oct 3, 2023

Huntersxsx / MGPN

source code of our MGPN in SIGIR 2022

Python 18 1 Updated Jun 8, 2022

datawhalechina / competition-baseline

数据挖掘、计算机视觉、自然语言处理、推荐系统竞赛知识、代码、思路

Jupyter Notebook 4,736 1,088 Updated Oct 22, 2025

TheLastBen / fast-stable-diffusion

fast-stable-diffusion + DreamBooth

Python 7,896 1,373 Updated Nov 29, 2025

XavierXiao / Dreambooth-Stable-Diffusion

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion

Jupyter Notebook 7,742 803 Updated Dec 8, 2022

983632847 / All-in-One

All in One: Exploring Unified Vision-Language Tracking with Multi-Modal Alignment

Python 19 3 Updated Feb 11, 2025

KyanChen / RSPrompter

This is the pytorch implement of our paper "RSPrompter: Learning to Prompt for Remote Sensing Instance Segmentation based on Visual Foundation Model"

Python 656 43 Updated Jun 29, 2024

ttgeng233 / UnAV

Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline (CVPR 2023)

Python 72 6 Updated Jan 4, 2026

OpenGVLab / InternGPT

InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editin…

Python 3,211 235 Updated Aug 20, 2024

OptimalScale / DetGPT

Jupyter Notebook 787 75 Updated Aug 7, 2024

MasterBin-IIAU / UNINEXT

[CVPR'23] Universal Instance Perception as Object Discovery and Retrieval

Python 1,280 122 Updated Jul 18, 2023

amazon-science / polygon-transformer

Python 162 13 Updated Jul 19, 2023

jxhe / unify-parameter-efficient-tuning

Implementation of paper "Towards a Unified View of Parameter-Efficient Transfer Learning" (ICLR 2022)

Python 544 42 Updated Mar 24, 2022

microsoft / LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 13,392 894 Updated Dec 17, 2024

tianrun-chen / SAM-Adapter-PyTorch

Adapting Meta AI's Segment Anything to Downstream Tasks with Adapters and Prompts

Python 1,502 122 Updated Dec 1, 2025

seanzhuh / SeqTR

SeqTR: A Simple yet Universal Network for Visual Grounding

Python 144 15 Updated Oct 30, 2024

Huntersxsx / AVVP-Learning-List

Related papers about Weakly-supervised Audio-Visual Video Parsing (AVVP) & Audio-Visual Event Localization (AVE)

5 Updated Jun 11, 2024

Huntersxsx / RIS-Learning-List

Related papers about Referring Image Segmentation (RIS)

16 Updated Dec 26, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sun Xin Huntersxsx

Achievements