Parser
Extract the main article from a given URL with Node.js
📜 Extract meaningful content from the chaos of a web page
Vision utilities for web interaction agents 👀
Flexible Node.js AI-assisted crawler library
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.
🔥 The API to search, scrape, and interact with the web for AI
Composio powers 1000+ toolkits, tool search, context management, authentication, and a sandboxed workbench to help you build AI agents that turn intent into action.
Parse, inspect, transform, and serialize content with syntax trees
⚡ A CLI tool for structural code search, linting, and rewriting. Written in Rust
A web tool to explore the ASTs generated by various parsers.
OCR, layout analysis, reading order, table recognition in 90+ languages
AI-powered, vision-driven UI automation for every platform.
TypeScript-first schema validation with static type inference
🔥 The open-source no-code platform for web scraping, crawling, search and AI data extraction • Turn websites into structured APIs in minutes 🔥
A language for constraint-guided and efficient LLM programming.
A Bulletproof Way to Generate Structured JSON from Language Models
Convert Word documents (.docx files) to HTML
Extract frontmatter from markdown code blocks using remark, and do interesting things!
A Unified/Remark plugin that injects a DOCX compiler using [`mdast2docx`](https://github.com/tiny-md/mdast2docx) and outputs `.docx` files from Markdown.
Remark plugin to turn code blocks into carbon.now.sh screenshots.
Plugin to generate a table of contents (TOC)
Utility to generate a table of contents from an mdast tree
Get the main content of any page as Markdown.






