-
Notifications
You must be signed in to change notification settings - Fork 753
fix: 0.4.1 disable kvbm tests (CP #2611) #2635
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fix: 0.4.1 disable kvbm tests (CP #2611) #2635
Conversation
|
Caution Review failedThe pull request is closed. WalkthroughBulk updates switch default model references to Qwen/Qwen3-0.6B, bump TensorRT-LLM pins to 1.0.0rc6 with container/version alignments, adjust UCX refs to v1.19.0, remove the local async-openai-macros crate in favor of a published dependency, and add SGLang runtime behavior tweaks (frontend tokenization default, decode error guard). Tests and docs updated accordingly. Changes
Sequence Diagram(s)sequenceDiagram
autonumber
participant Client
participant Frontend (Dynamo)
participant Worker (SGLang)
rect rgb(245,248,255)
note over Frontend (Dynamo): Tokenization enforced (skip_tokenizer_init=True)
Client->>Frontend (Dynamo): /v1/chat/completions
Frontend (Dynamo)->>Frontend (Dynamo): Tokenize input
Frontend (Dynamo)->>Worker (SGLang): Prefill/Decode request (no tokenizer init)
end
alt Streaming tokens
Worker (SGLang)-->>Frontend (Dynamo): res with output_ids
Frontend (Dynamo)->>Client: stream tokens
else Missing output_ids
Worker (SGLang)-->>Frontend (Dynamo): res without output_ids
Frontend (Dynamo)->>Frontend (Dynamo): raise ValueError with keys hint
Frontend (Dynamo)-->>Client: error response
end
Frontend (Dynamo)->>Frontend (Dynamo): Detokenize (if needed)
Frontend (Dynamo)-->>Client: final response
Estimated code review effort🎯 3 (Moderate) | ⏱️ ~25 minutes Possibly related PRs
Poem
Tip 🔌 Remote MCP (Model Context Protocol) integration is now available!Pro plan users can now connect to remote MCP servers from the Integrations page. Connect with popular remote MCPs such as Notion and Linear to add more context to your reviews and chats. 📜 Recent review detailsConfiguration used: Path: .coderabbit.yaml Review profile: CHILL Plan: Pro 💡 Knowledge Base configuration:
You can enable these sources in your CodeRabbit configuration. ⛔ Files ignored due to path filters (2)
📒 Files selected for processing (45)
Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. 🪧 TipsChatThere are 3 ways to chat with CodeRabbit:
SupportNeed help? Create a ticket on our support page for assistance with any issues or questions. CodeRabbit Commands (Invoked using PR/Issue comments)Type Other keywords and placeholders
Status, Documentation and Community
|
Overview:
Cherry-pick for #2611
Details:
Where should the reviewer start?
Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)
Summary by CodeRabbit
New Features
Bug Fixes
Documentation
Chores
Tests