Skip to content
Permalink

Comparing changes

Choose two branches to see what’s changed or to start a new pull request. If you need to, you can also or learn more about diff comparisons.

Open a pull request

Create a new pull request by comparing changes across two branches. If you need to, you can also . Learn more about diff comparisons here.
base repository: NVIDIA/TensorRT-LLM
Failed to load repositories. Confirm that selected base ref is valid, then try again.
Loading
base: 2d45d4a
Choose a base ref
...
head repository: NVIDIA/TensorRT-LLM
Failed to load repositories. Confirm that selected head ref is valid, then try again.
Loading
compare: 5db5f93
Choose a head ref
  • 8 commits
  • 27 files changed
  • 7 contributors

Commits on Aug 4, 2025

  1. [None][chore] add online help to build_wheel.py and fix a doc link (#…

    …6391)
    
    Signed-off-by: Zhenhua Wang <zhenhuaw@nvidia.com>
    zhenhuaw-me authored Aug 4, 2025
    Configuration menu
    Copy the full SHA
    59d91b8 View commit details
    Browse the repository at this point in the history
  2. test: move ministral_8b_fp8 to fp8_specific gpu list(exclude Ampere) (#…

    …6533)
    
    Signed-off-by: ruodil <200874449+ruodil@users.noreply.github.com>
    Co-authored-by: Larry <197874197+LarryXFly@users.noreply.github.com>
    ruodil and LarryXFly authored Aug 4, 2025
    Configuration menu
    Copy the full SHA
    6459725 View commit details
    Browse the repository at this point in the history
  3. [TRTLLM-5563][infra] Move test_rerun.py to script folder (#6571)

    Signed-off-by: Yiqing Yan <yiqingy@nvidia.com>
    yiqingy0 authored Aug 4, 2025
    Configuration menu
    Copy the full SHA
    4763e94 View commit details
    Browse the repository at this point in the history
  4. [None][infra] Enable accuracy test for eagle3 and chunked prefill (#6386

    )
    
    Signed-off-by: leslie-fang25 <leslief@nvidia.com>
    leslie-fang25 authored Aug 4, 2025
    Configuration menu
    Copy the full SHA
    a601908 View commit details
    Browse the repository at this point in the history
  5. [None][infra] Enable test of chunked prefill with logit post processor (

    #6483)
    
    Signed-off-by: leslie-fang25 <leslief@nvidia.com>
    leslie-fang25 authored Aug 4, 2025
    Configuration menu
    Copy the full SHA
    b9fe0fa View commit details
    Browse the repository at this point in the history
  6. [TRTLLM-4406][feat] LLM sleep & wakeup Part 1: virtual device memory (#…

    …5034)
    
    Signed-off-by: Yuan Tong <13075180+tongyuantongyu@users.noreply.github.com>
    tongyuantongyu authored Aug 4, 2025
    Configuration menu
    Copy the full SHA
    a2f271c View commit details
    Browse the repository at this point in the history
  7. [None][fix] remove closed bugs (#6576)

    Signed-off-by: xinhe-nv <200704525+xinhe-nv@users.noreply.github.com>
    Co-authored-by: Larry <197874197+LarryXFly@users.noreply.github.com>
    xinhe-nv and LarryXFly authored Aug 4, 2025
    Configuration menu
    Copy the full SHA
    a54972e View commit details
    Browse the repository at this point in the history
  8. waive failed tests

    Signed-off-by: Xin He (SW-GPU) <200704525+xinhe-nv@users.noreply.github.com>
    xinhe-nv committed Aug 4, 2025
    Configuration menu
    Copy the full SHA
    5db5f93 View commit details
    Browse the repository at this point in the history
Loading