Non-destructive retrying on `nemotron-parse` API by jamesbraza · Pull Request #1281 · Future-House/paper-qa

jamesbraza · 2026-01-29T22:21:29Z

Previously we could have a destructive interference with retrying in nemotron-parse:

First retry 3X API timeouts
Then retry 3X bounding box errors

This PR tightens up retrying to just retry 3X cumulatively

Note

Medium Risk
Touches external API retry/backoff behavior, which can change throughput and error handling under load; logic is small and covered by a new unit test but still affects request timing.

Overview
Tightens Nvidia nemotron-parse retry behavior so failures across bbox validation errors, rate-limit TimeoutErrors, and LiteLLM 408 timeouts share a single stop_after_attempt(3) budget instead of potentially retrying in multiple stacked phases.

Adds a custom tenacity wait strategy that applies exponential backoff only for the rate-limit TimeoutError path (not for inference-timeout litellm.Timeout), and updates the LiteLLM timeout type check accordingly; includes a focused unit test for the new wait/backoff logic.

^{Written by Cursor Bugbot for commit 84efd89. Configure here.}

Copilot

Pull request overview

This PR adjusts retry behavior for the nemotron-parse Nvidia API calls to avoid “stacked” retry loops that could previously result in more than 3 total retries across different failure modes.

Changes:

Combine multiple Tenacity retry decorators into a single retry policy capped at 3 total attempts.
Introduce a custom Tenacity wait function to apply exponential backoff only for the TimeoutError (rate-limit) failure mode.
Add a unit test validating the conditional backoff behavior.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 1 comment.

File	Description
`packages/paper-qa-nemotron/src/paperqa_nemotron/api.py`	Consolidates retry logic and adds conditional exponential backoff helper.
`packages/paper-qa-nemotron/tests/test_api.py`	Adds test coverage for the new conditional wait/backoff logic.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

packages/paper-qa-nemotron/src/paperqa_nemotron/api.py

jamesbraza added 2 commits January 29, 2026 14:19

Better consolidated retrying on nemotron-parse

0260fbd

Removed '.exceptions' from litellm import

84efd89

jamesbraza requested review from MicPie, mskarlin, sidnarayanan and whitead January 29, 2026 22:21

jamesbraza self-assigned this Jan 29, 2026

Copilot AI review requested due to automatic review settings January 29, 2026 22:21

jamesbraza added the bug Something isn't working label Jan 29, 2026

dosubot bot added the size:M This PR changes 30-99 lines, ignoring generated files. label Jan 29, 2026

Copilot started reviewing on behalf of jamesbraza January 29, 2026 22:21 View session

dosubot bot added the enhancement New feature or request label Jan 29, 2026

Copilot AI reviewed Jan 29, 2026

View reviewed changes

packages/paper-qa-nemotron/src/paperqa_nemotron/api.py Show resolved Hide resolved

Fixing backoff counter being heightened due to other retries

329d363

sidnarayanan approved these changes Jan 29, 2026

View reviewed changes

dosubot bot added the lgtm This PR has been approved by a maintainer label Jan 29, 2026

jamesbraza merged commit c153af1 into main Jan 29, 2026
5 of 7 checks passed

jamesbraza deleted the better-retrying branch January 29, 2026 23:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Non-destructive retrying on `nemotron-parse` API#1281

Non-destructive retrying on `nemotron-parse` API#1281
jamesbraza merged 3 commits intomainfrom
better-retrying

jamesbraza commented Jan 29, 2026 •

edited by cursor bot

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

jamesbraza commented Jan 29, 2026 • edited by cursor bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

jamesbraza commented Jan 29, 2026 •

edited by cursor bot

Loading