Starred repositories
An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone
💫 Toolkit to help you get started with Spec-Driven Development
Generate high-definition short videos with one click using large AI models (LLMs).
Recursive-Open-Meta-Agent v0.1 (Beta). A meta-agent framework to build high-performance multi-agent systems.
This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive A…
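Xiaohongshu note | comment crawler, Douyin video | comment crawler, Kuaishou video | comment crawler, Bilibili video | comment crawler, Weibo post | comment crawler, Baidu Tieba post | Baidu Tieba comment-reply crawler, Zhihu Q&A article | comment crawler
😼 Elegantly use a clash/mihomo-based proxy environment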
AI-Powered Python & Python-Powered AI (Python-Use)
In-depth tutorials on LLMs, RAGs and real-world AI agent applications.
API and WebSocket server for SenseVoice. It adds several enhanced features, such as voice activity detection (VAD), real-time streaming recognition, and speaker verification.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
A voiceprint recognition classifier for audio datasets
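Many container images are hosted outside China (gcr, for example). Downloads within China are slow and need acceleration. This project aims to provide a stable, reliable, and secure container image service that connects to the whole world.
Aix-DB is built on the LangChain/LangGraph framework and combines it with an MCP Skills multi-agent collaboration architecture to deliver end-to-end conversion from natural language to data insights.
"One-Person Business Methodology", second edition; also suitable for non-technical people pursuing other side businesses (such as self-media, e-commerce, or digital goods).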
A Python-based Xiaozhi AI for users who want the full Xiaozhi experience without owning specialized hardware.
LangGPT: Empowering everyone to become a prompt expert! 🚀 📌 Originator of the Structured Prompt 📌 Initiator of the Meta-Prompt 📌 The most popular paradigm for putting prompts into practice | Language of GPT The pioneering framework for structured & meta-prompt…
Backend service for xiaozhi-esp32 that helps you quickly build an ESP32 device control server.
Regularly sharing high-quality, interesting, and practical open-source technical tutorials, developer tools, programming websites, and tech news from GitHub. A list of cool, interesting projects on GitHub.
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
Agent-driven automation, starting with the web. Try it: https://www.emergence.ai/web-automation-api
Android real-time display control software
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Fay is an agent framework that helps digital humans (2.5D, 3D, mobile, PC, web) or large language models (OpenAI-compatible, DeepSeek) connect to business systems.
A browser extension for automating your browser by connecting blocks
