- Alibaba.com
- Hangzhou, China
Stars
The fullstack MCP framework to develop MCP Apps for ChatGPT / Claude & MCP Servers for AI Agents.
8-bit quantization for PyTorch on Apple Silicon (M1/M2/M3/M4)
Mobile-Agent: The Powerful GUI Agent Family
Official implementation of UI-Ins: Enhancing GUI Grounding with Multi-Perspective Instruction-as-Reasoning
The awesome collection of OpenClaw skills. 5,400+ skills filtered and categorized from the official OpenClaw Skills Registry.🦞
Open Cowork - Open-source Claude Cowork for Windows & macOS.
[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.
openvla / openvla
Forked from TRI-ML/prismatic-vlms
OpenVLA: An open-source vision-language-action model for robotic manipulation.
Independent technology for modern publishing, memberships, subscriptions and newsletters.
Diffusion model (SD, Flux, Wan, Qwen Image, Z-Image, ...) inference in pure C/C++
Browser automation CLI for AI agents
Fast, small, and fully autonomous AI personal assistant infrastructure, ANY OS, ANY PLATFORM — deploy anywhere, swap anything 🦀
LibVLC-based media player for the Universal Windows Platform
The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
Pioneering Automated GUI Interaction with Native Agents
🐈 nanobot: The Ultra-Lightweight OpenClaw
Payload is the open-source, fullstack Next.js framework, giving you instant backend superpowers. Get a full TypeScript backend and admin panel instantly. Use Payload as a headless CMS or for buildi…
BrowserWing turns your browser actions into MCP commands or Claude Skill, allowing AI agents to control browsers efficiently and reliably. Say goodbye to slow, token-heavy LLM interactions — let ag…
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
Successor of Undetected-Chromedriver. Providing a blazing fast framework for web automation, webscraping, bots and any other creative ideas which are normally hindered by annoying anti bot systems …
Chrome DevTools for coding agents
Python version of the Playwright testing and automation library.
Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.
MAI-UI: Real-World Centric Foundation GUI Agents ranging from 2B to 235B
A simple screen parsing tool towards pure vision based GUI agent
A natural language interface for computers

