@hhzhang16
Contributor

Overview:

This MR builds on the vLLM v1 profiling work to add proper Kubernetes service account configuration for running SLA profiling jobs in a cluster environment.

Details:

  • Adds a service account configuration with the necessary permissions
  • Adds an RBAC role and a role binding for the service account
  • Modifies dynamo_deployment.py to support service account-based authentication and authorization
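
For reviewers less familiar with how these three pieces fit together, here is a minimal sketch of a ServiceAccount, a Role, and a RoleBinding built as plain Python dicts. It is not the configuration added in this MR: the function name `build_rbac_manifests`, the default name `profiling-sa`, and the rule set (read-only access to pods, pod logs, and services) are all hypothetical placeholders chosen for illustration.

```python
import json


def build_rbac_manifests(namespace: str, name: str = "profiling-sa") -> list[dict]:
    """Return ServiceAccount, Role, and RoleBinding manifests as plain dicts.

    All names and permissions here are illustrative assumptions,
    not the values used by this MR.
    """
    role_name = f"{name}-role"

    service_account = {
        "apiVersion": "v1",
        "kind": "ServiceAccount",
        "metadata": {"name": name, "namespace": namespace},
    }

    # Hypothetical rule set: read-only access to core resources.
    role = {
        "apiVersion": "rbac.authorization.k8s.io/v1",
        "kind": "Role",
        "metadata": {"name": role_name, "namespace": namespace},
        "rules": [
            {
                "apiGroups": [""],
                "resources": ["pods", "pods/log", "services"],
                "verbs": ["get", "list", "watch"],
            }
        ],
    }

    # The binding grants the Role's permissions to the ServiceAccount.
    role_binding = {
        "apiVersion": "rbac.authorization.k8s.io/v1",
        "kind": "RoleBinding",
        "metadata": {"name": f"{name}-binding", "namespace": namespace},
        "subjects": [
            {"kind": "ServiceAccount", "name": name, "namespace": namespace}
        ],
        "roleRef": {
            "apiGroup": "rbac.authorization.k8s.io",
            "kind": "Role",
            "name": role_name,
        },
    }

    return [service_account, role, role_binding]


if __name__ == "__main__":
    # Print each manifest as JSON, which kubectl also accepts as input.
    for manifest in build_rbac_manifests("default"):
        print(json.dumps(manifest, indent=2))
```

A job that should run with these permissions would then reference the account through `serviceAccountName` in its pod spec. The actual permissions a profiling job needs depend on what it creates and watches in the cluster, so the rules above should be read only as a shape, not a recommendation.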

Where should the reviewer start?

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

  • closes GitHub issue: #xxx

@hhzhang16 force-pushed the hannahz/dep-239-explore-use-of-service-accounts-in-k8s branch from 01b6df3 to 1193c4e on July 16, 2025 21:45
@hhzhang16 merged commit 8e292f6 into hzhou/profile_vllmv1_k8s on July 17, 2025
6 of 10 checks passed
@hhzhang16 deleted the hannahz/dep-239-explore-use-of-service-accounts-in-k8s branch on July 17, 2025 00:59
