-
AP Consulting
-
04:17
(UTC +02:00) - https://www.apconsulting.cz/
- in/peteadamek
-
Speechmatics Public
A simple, browser-based application for transcribing audio files using the Speechmatics API.
-
-
invoice_extractor Public
Script extracting data from the Czech invoices in JSON and checking validity of VAT tax payer
-
pdf2md Public
Convert complex structured PDF documents with tables, formulas without OCR to clear markdown using Google's vision models. Markdown files are suitable for RAG pipeline.
-
Uctenka Public
This Python script uses the Google Gemini API to extract structured information from PDF or image files representing financial receipts.
-
PDFSplitter Public
A simple Node.js script to split a PDF into individual pages and create JPEG thumbnails.
-
Receipt_Data_Extractor Public
Extract structured data from receipt images using Groq's Vision API. Automatically processes financial receipts and outputs VAT information, prices, and transaction details in JSON format.
-
image2md Public
Convert batch of pictures with structured data like tables, formulas, charts to markdown using vision capable models. Markdown files are highly suitable for RAG pipeline.
-
MistralOCR Public
A versatile command-line tool for processing PDFs and images with Mistral's OCR API.
-
markitdown Public
Forked from microsoft/markitdownPython tool for converting files and office documents to Markdown.
Python MIT License UpdatedDec 19, 2024