Skip to content

Pull requests: openai/evals

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Extending to Azure OpenAI implementation
#1470 opened Feb 23, 2024 by pkt1583 Loading…
Add **kwargs to OpenAIChatCompletionFn
#1494 opened Mar 15, 2024 by ezraporter Loading…
[Evals] Add eval for Dhivehi diacritical marks
#1495 opened Mar 16, 2024 by aanaseer Loading…
11 of 12 tasks
Fix specifying API arguments from the CLI
#1505 opened Mar 27, 2024 by LoryPack Contributor Loading…
6 tasks done
Update custom-eval.md
#1598 opened Aug 19, 2025 by rajeshkp Loading…
13 tasks
Update to python 3.12
#1607 opened Dec 21, 2025 by omonimus1 Loading…
Add powershell-encoding-basics eval
#1611 opened Jan 28, 2026 by TheodorNEngoy Loading…
Refactor JSONL file loading logic in data.py
#1612 opened Feb 3, 2026 by Pritiks23 Loading…
13 tasks done
Pritiks23 patch 1
#1613 opened Feb 3, 2026 by Pritiks23 Loading…
13 tasks done
Improving CI
#1617 opened Feb 5, 2026 by fsdavi Loading…
13 tasks
fix: correct typos in evals
#1621 opened Feb 7, 2026 by thecaptain789 Loading…
Add Logic Stress Stress-test Suite (v2, v3)
#1622 opened Feb 16, 2026 by 14H034160212 Contributor Loading…
README: fix Evals starter guide link
#1623 opened Feb 19, 2026 by dcol91863 Loading…
eval: add RAIL Score responsible AI evaluation across 8 dimensions
#1640 opened Apr 2, 2026 by SumitVermakgp Loading…
12 tasks done
Add Turkish proverbs eval
#1649 opened Apr 23, 2026 by kayametehan Loading…
Handle nested token usage details in oaieval
#1650 opened Apr 23, 2026 by kayametehan Loading…
Add explain mode to HumanCliSolver
#1652 opened Apr 23, 2026 by kayametehan Loading…
Fix OpenAI completion args routing
#1653 opened Apr 23, 2026 by kayametehan Loading…
ProTip! Updated in the last three days: updated:>2026-04-25.