Pull requests: openai/evals

Pull requests list

Fix OpenAI completion args routing
#1653 opened Apr 23, 2026 by kayametehan
Add explain mode to HumanCliSolver
#1652 opened Apr 23, 2026 by kayametehan
Handle nested token usage details in oaieval
#1650 opened Apr 23, 2026 by kayametehan
Add Turkish proverbs eval
#1649 opened Apr 23, 2026 by kayametehan
eval: add RAIL Score responsible AI evaluation across 8 dimensions
#1640 opened Apr 2, 2026 by SumitVermakgp
12 tasks done
README: fix Evals starter guide link
#1623 opened Feb 19, 2026 by dcol91863
Add Logic Stress Stress-test Suite (v2, v3)
#1622 opened Feb 16, 2026 by 14H034160212 Contributor
fix: correct typos in evals
#1621 opened Feb 7, 2026 by thecaptain789
Improving CI
#1617 opened Feb 5, 2026 by fsdavi
13 tasks
Pritiks23 patch 1
#1613 opened Feb 3, 2026 by Pritiks23
13 tasks done
Refactor JSONL file loading logic in data.py
#1612 opened Feb 3, 2026 by Pritiks23
13 tasks done
Add powershell-encoding-basics eval
#1611 opened Jan 28, 2026 by TheodorNEngoy
Update to python 3.12
#1607 opened Dec 21, 2025 by omonimus1
Update custom-eval.md
#1598 opened Aug 19, 2025 by rajeshkp
13 tasks
Fix specifying API arguments from the CLI
#1505 opened Mar 27, 2024 by LoryPack Contributor
6 tasks done
[Evals] Add eval for Dhivehi diacritical marks
#1495 opened Mar 16, 2024 by aanaseer
11 of 12 tasks
Add **kwargs to OpenAIChatCompletionFn
#1494 opened Mar 15, 2024 by ezraporter
Extending to Azure OpenAI implementation
#1470 opened Feb 23, 2024 by pkt1583
Now I have the change in place, it seems wrong.
#1209 opened Jun 21, 2023 by CholoTook Contributor