This repo contains information on how to deploy ollama on OpenShift.
- OpenShift >= 4.15
- A GPU worker node with at least 16GB of GPU memory.
- AWS
g4dn.2xlargeg5.2xlarge
- AWS
Use CPU only
# setup ollama
until oc apply -k deploy; do : ; doneUse Nvidia GPU
# setup nvidia gpu nodes (prerequisite)
until oc apply -k deploy/nvidia-gpu-autoscale; do : ; done# setup ollama w/ gpu
until oc apply -k deploy; do : ; done
until oc apply -k deploy/ollama-gpu; do : ; doneSetup Web Terminal (optional)
until oc apply -k deploy/web-terminal; do : ; done