Skip to content

Commit 0608de0

Browse files
committed
Add benchmarks CLI
1 parent 474105c commit 0608de0

File tree

4 files changed

+963
-2
lines changed

4 files changed

+963
-2
lines changed

README.md

Lines changed: 38 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -19,6 +19,7 @@ TUI and CLI for browsing AI models, benchmarks, and coding agents.
1919

2020
- **Models tab redesign** — capability indicators, adaptive provider panel, and detailed model info at a glance
2121
- **Benchmark compare mode** — head-to-head tables, scatter plots, and radar charts for selected models
22+
- **Benchmarks CLI** — list and inspect benchmark data directly from the terminal
2223
- **Linux packages** — native .deb and .rpm packages for x86_64 and aarch64
2324
- **Agents CLI** — track agent releases, view changelogs, and compare versions from the terminal
2425

@@ -56,6 +57,13 @@ TUI and CLI for browsing AI models, benchmarks, and coding agents.
5657
- **Dual entry point** — use as `models agents` or create an `agents` symlink for standalone usage
5758
- **Fast** — concurrent GitHub fetching and version detection
5859

60+
### Benchmarks CLI
61+
- **Live benchmark queries** — fetch the current benchmark dataset without launching the TUI
62+
- **List and detail views** — use `models benchmarks list` for filtered tables and `models benchmarks show` for full breakdowns
63+
- **Filtering** — narrow by search text, creator, open/closed source, and reasoning status
64+
- **Sorting** — sort by any supported metric, including intelligence, coding, math, GPQA, speed, pricing, and release date
65+
- **JSON output** — pipe structured benchmark data into shell scripts and other tools
66+
5967
## Installation
6068

6169
### Cargo (from crates.io)
@@ -277,6 +285,33 @@ See [Custom Agents](docs/custom-agents.md) for the full reference.
277285

278286
## CLI Usage
279287

288+
### Benchmarks CLI
289+
290+
Query benchmark data from the command line using the same live benchmark feed as the Benchmarks tab.
291+
292+
#### List benchmark entries
293+
294+
```bash
295+
models benchmarks list
296+
models benchmarks list --sort speed --limit 10
297+
models benchmarks list --creator openai --reasoning
298+
models benchmarks list --open --sort price-input --asc
299+
```
300+
301+
#### Show benchmark details
302+
303+
```bash
304+
models benchmarks show gpt-4o
305+
models benchmarks show "Claude Sonnet 4"
306+
```
307+
308+
#### JSON output
309+
310+
```bash
311+
models benchmarks list --creator anthropic --json
312+
models benchmarks show gpt-4o --json
313+
```
314+
280315
### Agents CLI
281316

282317
Track AI coding agent releases from the command line. Install the `agents` alias during setup, or use `models agents` as a fallback.
@@ -396,9 +431,11 @@ models search "claude opus"
396431

397432
#### JSON output
398433

399-
All model commands support `--json` for scripting:
434+
All models and benchmarks commands support `--json` for scripting:
400435

401436
```bash
437+
models benchmarks list --json
438+
models benchmarks show gpt-4o --json
402439
models list providers --json
403440
models show claude-opus-4-5 --json
404441
models search "llama" --json

0 commit comments

Comments
 (0)