Comparing changes

base repository: NVIDIA/TensorRT-LLM
base: 6994f9c
head repository: NVIDIA/TensorRT-LLM
compare: 6ec395d
  • 14 commits
  • 159 files changed
  • 13 contributors

Commits on Oct 29, 2025

  1. [None][infra] Check in most recent lock file from nightly pipeline (#8739)

    Signed-off-by: TensorRT LLM <90828364+tensorrt-cicd@users.noreply.github.com>
    Signed-off-by: Yuanjing Xue <197832395+yuanjingx87@users.noreply.github.com>
    Co-authored-by: TensorRT LLM <90828364+tensorrt-cicd@users.noreply.github.com>
    yuanjingx87 and tensorrt-cicd authored Oct 29, 2025
    45b36cc
  2. [https://nvbugs/5599086][fix] Fix FP8 Linear module for spark (#8707)

    Signed-off-by: Simeng Liu <simengl@nvidia.com>
    SimengLiu-nv authored Oct 29, 2025
    834a780

Commits on Oct 30, 2025

  1. [None][doc] Minor doc update to disagg-serving (#8768)

    Signed-off-by: Sharan Chetlur <116769508+schetlur-nv@users.noreply.github.com>
    schetlur-nv authored Oct 30, 2025
    a2e964d
  2. [https://nvbugs/5547414][fix] Use cached models (#8755)

    Signed-off-by: Hui Gao <huig@nvidia.com>
    HuiGao-NV authored Oct 30, 2025
    ae57738
  3. [None][fix] Fix UnboundLocalError. (#8756)

    Signed-off-by: Yuxian Qiu <142763828+yuxianq@users.noreply.github.com>
    yuxianq authored Oct 30, 2025
    3176bd3
  4. [TRTLLM-8971][infra] Update gpu key for B300/GB300 (#8724)

    Signed-off-by: qqiao <qqiao@nvidia.com>
    EmmaQiaoCh authored Oct 30, 2025
    db99a93
  5. [None][doc] Add doc for torch.compile & piecewise cuda graph (#8527)

    Signed-off-by: yizhang-nv <187001205+yizhang-nv@users.noreply.github.com>
    yizhang-nv authored Oct 30, 2025
    496b419
  6. [None][infra] Unwaive the tests passed in latest CI and disable a perf stage (#8775)

    Signed-off-by: qqiao <qqiao@nvidia.com>
    EmmaQiaoCh authored Oct 30, 2025
    7d3cebf
  7. [None][infra] fix slurm results path (#8751)

    Signed-off-by: Yuanjing Xue <197832395+yuanjingx87@users.noreply.github.com>
    yuanjingx87 authored Oct 30, 2025
    e689a73
  8. [None][fix] fix runtime error that bf16 input is not quantized to nvfp4 when use bf16 dispatch (#8507)

    Signed-off-by: Yilin Zhang <18275976+yilin-void@users.noreply.github.com>
    yilin-void authored Oct 30, 2025
    6b755fd
  9. [https://nvbugs/5608461][fix] exclude InductorSubproc from thread leak check (#8704)

    Signed-off-by: leslie-fang25 <leslief@nvidia.com>
    leslie-fang25 authored Oct 30, 2025
    2072185
  10. [https://nvbugs/5481206][fix] update waives (#8774)

    Signed-off-by: Xin He (SW-GPU) <200704525+xinhe-nv@users.noreply.github.com>
    xinhe-nv authored Oct 30, 2025
    a4f7539
  11. [None][feat] Refactor scaffolding streaming feature and fix openai wo… (#8622)

    Signed-off-by: Fred Wei <20514172+WeiHaocheng@users.noreply.github.com>
    WeiHaocheng authored Oct 30, 2025
    cc28668
  12. feat: sampling using FlashInfer.sampling

    Signed-off-by: ixlmar <206748156+ixlmar@users.noreply.github.com>
    ixlmar committed Oct 30, 2025
    6ec395d