Closed
Labels
- enhancement (New feature or request)
- module: runner (issues related to the torchx.runner and torchx.scheduler modules)
- scheduler-request (New scheduler requests)
Description
It would be nice to have GCP + TPU support in addition to our existing schedulers. Currently you can run on GCP via Kubernetes and the Kubernetes scheduler, but it would be handy to have direct support for GCP's training platform.
Example scheduler: AWS Batch https://github.com/pytorch/torchx/blob/main/torchx/schedulers/aws_batch_scheduler.py
Scheduler documentation: https://pytorch.org/torchx/main/schedulers
GCP Docs:
- https://cloud.google.com/ai-platform/training/docs/training-jobs
- https://cloud.google.com/ai-platform/training/docs/reference/rest/v1/projects.jobs
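To make the request more concrete, here is a rough sketch of the kind of translation such a scheduler might do: turning one role of an app into a request body for the `projects.jobs` create call. The `Role` dataclass and `role_to_job_body` function are hypothetical names for illustration, not TorchX APIs. The field names (`trainingInput`, `scaleTier`, `masterConfig`, `imageUri`, and so on) are my reading of the REST reference linked above and should be checked against it before relying on them.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List


@dataclass
class Role:
    """Hypothetical stand-in for one role of an app (not the real torchx type)."""
    name: str
    image: str
    entrypoint: str
    args: List[str] = field(default_factory=list)
    num_replicas: int = 1
    machine_type: str = "n1-standard-4"  # assumed default, for illustration only


def role_to_job_body(job_id: str, region: str, role: Role) -> Dict[str, Any]:
    """Build a dict shaped like a projects.jobs create request for a custom container."""
    replica_config = {
        "imageUri": role.image,
        "containerCommand": [role.entrypoint],
        "containerArgs": list(role.args),
    }
    training_input: Dict[str, Any] = {
        "scaleTier": "CUSTOM",
        "region": region,
        "masterType": role.machine_type,
        "masterConfig": replica_config,
    }
    # Treat the first replica as the master and any remaining replicas as workers.
    workers = role.num_replicas - 1
    if workers > 0:
        training_input["workerType"] = role.machine_type
        training_input["workerCount"] = workers
        training_input["workerConfig"] = dict(replica_config)
    return {"jobId": job_id, "trainingInput": training_input}


if __name__ == "__main__":
    trainer = Role(
        name="trainer",
        image="gcr.io/my-project/trainer:latest",  # placeholder image
        entrypoint="python",
        args=["train.py", "--epochs", "1"],
        num_replicas=3,
    )
    print(role_to_job_body("torchx_demo_job", "us-central1", trainer))
```

This only builds the payload. Submitting it, polling job state, and fetching logs would be the rest of the scheduler, and TPU settings are left out entirely.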
Stretch Goal: