
Adding prompt caching #1293

Merged
jamesbraza merged 6 commits into main from caching on Feb 20, 2026

Conversation

@jamesbraza (Collaborator) commented Feb 19, 2026

This PR adds cache_breakpoint throughout the code base and cache_control_injection_points to the LiteLLM Router, so we can get some prompt caching and save money.
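For context, a hedged sketch of the underlying mechanism: Anthropic's prompt caching is triggered by attaching a `cache_control` block of type `ephemeral` to a message's content, which tells the API to cache the prompt prefix up to that point. The helper name `apply_cache_breakpoints` and the message shapes below are hypothetical illustrations, not this PR's actual implementation.

```python
# Illustrative sketch (not paper-qa's actual code): translate a cache_breakpoint
# flag on a message into an Anthropic-style cache_control block before sending.
# The helper name and message shapes here are assumptions for illustration.

def apply_cache_breakpoints(messages: list[dict]) -> list[dict]:
    """Attach an Anthropic-style cache_control block to flagged messages."""
    out = []
    for msg in messages:
        msg = dict(msg)  # shallow copy; don't mutate the caller's messages
        if msg.pop("cache_breakpoint", False):
            # Anthropic caches the prompt prefix up to each cache_control marker
            msg["content"] = [
                {
                    "type": "text",
                    "text": msg["content"],
                    "cache_control": {"type": "ephemeral"},
                }
            ]
        out.append(msg)
    return out


messages = [
    {"role": "system", "content": "Long, reusable instructions...", "cache_breakpoint": True},
    {"role": "user", "content": "What does this paper conclude?"},
]
prepared = apply_cache_breakpoints(messages)
```

The payoff comes from marking only content that is byte-identical across requests (e.g. a long system prompt), since any change upstream of the marker invalidates the cached prefix.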

@jamesbraza jamesbraza self-assigned this Feb 19, 2026
Copilot AI review requested due to automatic review settings February 19, 2026 00:08
@jamesbraza jamesbraza added the enhancement New feature or request label Feb 19, 2026

@dosubot dosubot bot added the size:L This PR changes 100-499 lines, ignoring generated files. label Feb 19, 2026
@dosubot dosubot bot commented Feb 19, 2026

Related Documentation

Checked 1 published document(s) in 1 knowledge base(s). No updates required.


Copilot AI (Contributor) left a comment

Pull request overview

This PR implements prompt caching support to reduce LLM API costs by adding cache_breakpoint=True to system messages that are reused across multiple requests.

Changes:

  • Added cache_breakpoint=True to system messages in evidence gathering, query processing, and agent operations
  • Updated dependency fhaviary from version 0.27 to 0.33 to support the cache_breakpoint feature
  • Enhanced test coverage with assertions to verify prompt caching behavior for Anthropic models

Reviewed changes

Copilot reviewed 5 out of 6 changed files in this pull request and generated 1 comment.

Summary per file:

  • src/paperqa/core.py — Added cache_breakpoint to system messages in the evidence-gathering function
  • src/paperqa/docs.py — Added cache_breakpoint to the reusable system message in query operations
  • src/paperqa/agents/main.py — Added cache_breakpoint to the agent system prompt that's reused on every turn
  • pyproject.toml — Updated the fhaviary dependency to v0.33 for cache_breakpoint support
  • tests/test_paperqa.py — Extended the Anthropic test with caching assertions and padding to meet minimum token requirements
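On the test padding: Anthropic only caches prompts above a minimum token count (on the order of 1024 tokens for some models), so a short test prompt must be padded before caching kicks in. The sketch below illustrates the idea; `pad_prompt`, the threshold default, and the rough 4-characters-per-token heuristic are assumptions for illustration, not the repo's actual test code.

```python
# Hypothetical sketch of the "padding" idea: grow a prompt with filler text
# until a rough token estimate clears the provider's caching minimum.
# The 4-chars-per-token heuristic is a crude approximation, not a tokenizer.

def pad_prompt(prompt: str, min_tokens: int = 1024, chars_per_token: int = 4) -> str:
    """Pad a prompt with filler until its estimated token count meets min_tokens."""
    filler = " This sentence is padding to reach the caching minimum."
    while len(prompt) // chars_per_token < min_tokens:
        prompt += filler
    return prompt


padded = pad_prompt("Short system prompt.")
assert len(padded) // 4 >= 1024  # estimate now meets the assumed minimum
```

A real test would then assert on the response's usage fields (e.g. cache-read token counts on the second request) to verify the cache was actually hit.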


@sidnarayanan sidnarayanan (Collaborator) left a comment

Approving, but I haven't looked at why CI is failing.

@dosubot dosubot bot added the lgtm This PR has been approved by a maintainer label Feb 19, 2026
@jamesbraza jamesbraza merged commit 7cdc9ac into main Feb 20, 2026
6 of 7 checks passed
@jamesbraza jamesbraza deleted the caching branch February 20, 2026 23:58