Skip to content
View brunobrocai's full-sized avatar

Highlights

  • Pro

Block or report brunobrocai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

Python 2,181 245 Updated Oct 16, 2025

Python port for IWNLP.Lemmatizer

Python 19 3 Updated Apr 13, 2026

A curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.

502 56 Updated Oct 25, 2022

Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German

521 66 Updated Oct 30, 2024

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Python 14,376 2,116 Updated Oct 27, 2025

Python NLP tools like sentence segmenter, cleaner, sentence language identification and word tokenizer with support for various file formats

Python 6 1 Updated Jun 13, 2024

A tokenizer and sentence splitter for German and English web and social media texts.

Python 153 21 Updated Dec 9, 2024

[NeurIPS '25] Knowledge Graph Generation from Any Text

Python 1,149 169 Updated Mar 24, 2026

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,424 119 Updated Apr 17, 2026

Area-weighted venn-diagrams for Python/matplotlib

Jupyter Notebook 570 72 Updated Feb 25, 2025

Python Implementation of Dawid and Skene's EM Algorithm.

Python 6 1 Updated Apr 28, 2021

Implementation of the estimator for combining noisy observations from Dawid and Skene (1979)

Python 41 11 Updated Aug 23, 2014

Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.

Go 171,652 16,171 Updated May 15, 2026
Jupyter Notebook 3 Updated Jan 3, 2025

💫 Industrial-strength Natural Language Processing (NLP) in Python

Python 33,584 4,682 Updated Mar 28, 2026

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

Python 5,969 372 Updated Sep 12, 2025

Automated identification of text structures related to moralization.

Python 2 3 Updated Mar 16, 2026