University of Electronic Science and Technology of China
Stars
Medical image processing for AI applications.
Align Anything: Training All-modality Model with Feedback
ICCV 2023, "GraphEcho: Graph-Driven Unsupervised Domain Adaptation for Echocardiogram Video Segmentation"
A Pocket-Sized MLLM for Ultra-Efficient Image and Video Understanding on Your Phone
MiniCPM5-1B: A SOTA 1B on-device LLM, small yet powerful.
[TMLR] Public code repo for paper "A Single Transformer for Scalable Vision-Language Modeling"
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
BiomedGPT: A Generalist Vision-Language Foundation Model for Diverse Biomedical Tasks
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
👾 A Python API wrapper for Poe.com. With this, you will have free access to GPT-4, Claude, Llama, Gemini, Mistral and more! 🚀
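LLM API management and distribution system. Supports mainstream models including OpenAI, Azure, Anthropic Claude, Google Gemini, DeepSeek, ByteDance Doubao, ChatGLM, ERNIE Bot (文心一言), iFlytek Spark, Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan, with unified API adaptation; usable for key management and redistribution. Ships as a single executable with a Docker image for one-click, out-of-the-box deployment. LLM API management & k…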
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical large language models, implementing continued pretraining (PT), supervised fine-tuning (SFT), RLHF, DPO, ORPO, and GRPO.
The Medical Image Registration ToolKit (MIRTK), the successor of the IRTK, contains common CMake build configuration files, core libraries, and basic command-line tools. Extension packages are host…
OHIF zero-footprint DICOM viewer and oncology specific Lesion Tracker, plus shared extension packages
Official repository for "AM-RADIO: Reduce All Domains Into One"
Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, B…
[npj Digital Medicine] The official repository for "Large-Vocabulary Segmentation for Medical Images with Text Prompts"
[ICLR 2024 oral; top 1.2%] Supervised Pre-Trained 3D Models for Medical Image Analysis (9,262 CT volumes + 25 annotated classes)
RadImageNet: pre-trained convolutional neural networks trained solely on medical images, intended as the basis for transfer learning in medical imaging applications.
A library for calculating the FLOPs in the forward() process based on torch.fx
Machine Learning Agility (MLAgility) benchmark and benchmarking tools
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models
Brain-Conditional Multimodal Synthesis: A Survey and Taxonomy
[ICML 2024] TrustLLM: Trustworthiness in Large Language Models
