Helm chart for Ollama #774


Merged · 17 commits into opea-project:main · Feb 14, 2025

Conversation

jonminkin97 (Contributor)

Description

Add Helm chart to deploy Ollama as a model server.

Issues

n/a

Type of change

List the type of change like below. Please delete options that are not relevant.

  • New feature (non-breaking change which adds new functionality)

Dependencies

List any newly introduced third-party dependencies, if they exist.

Tests

  • Manual deployment of Helm chart, verifying installation and inference
  • Helm test pod to automatically test the Helm release using helm test [RELEASE]
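The `helm test` flow described above relies on a pod annotated with the `helm.sh/hook: test` hook; Helm runs it and reports pass/fail from its exit status. A minimal sketch of such a test pod follows. The file path, helper names (`ollama.fullname`), value keys, and the use of `curl` against Ollama's `/api/tags` endpoint are illustrative assumptions, not necessarily this chart's actual files:

```yaml
# templates/tests/test-connection.yaml (illustrative sketch)
# Helm test hook pod that probes the Ollama service endpoint.
apiVersion: v1
kind: Pod
metadata:
  name: "{{ include "ollama.fullname" . }}-test-connection"
  annotations:
    "helm.sh/hook": test
spec:
  containers:
    - name: curl
      image: curlimages/curl
      # --fail makes curl exit non-zero on HTTP errors, failing the test.
      command: ["curl", "--fail"]
      args: ["http://{{ include "ollama.fullname" . }}:{{ .Values.service.port }}/api/tags"]
  restartPolicy: Never
```

With such a template in place, `helm test <release>` after `helm install` exercises the deployed service end to end.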

Signed-off-by: Jonathan Minkin <minkinj@amazon.com>
lianhao (Collaborator) left a comment


Thanks very much for contributing this. Besides the embedded comments, some things are missing from the Helm templates (e.g. HPA, ServiceMonitor). I would suggest you check https://github.com/opea-project/GenAIInfra/tree/main/helm-charts/common/lvm-serve to fill the gaps. Thanks!
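An HPA template of the kind the review asks for typically looks like the following. The value keys under `autoscaling.*` and the `ollama.fullname` helper are conventional assumptions borrowed from common chart layouts, not necessarily what this chart ended up using:

```yaml
# templates/horizontal-pod-autoscaler.yaml (illustrative sketch)
{{- if .Values.autoscaling.enabled }}
apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: {{ include "ollama.fullname" . }}
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: {{ include "ollama.fullname" . }}
  minReplicas: {{ .Values.autoscaling.minReplicas }}
  maxReplicas: {{ .Values.autoscaling.maxReplicas }}
  metrics:
    - type: Resource
      resource:
        name: cpu
        target:
          type: Utilization
          averageUtilization: {{ .Values.autoscaling.targetCPUUtilizationPercentage }}
{{- end }}
```

Gating the whole resource on `.Values.autoscaling.enabled` keeps the HPA opt-in, so default installs stay at a fixed replica count.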

jonminkin97 (Contributor, Author)

@lianhao Thank you for the feedback. I've made the requested changes, and they are now pending your approval. Let me know if there are any other enhancements you would like to see.

lianhao (Collaborator) left a comment


Two more things besides the embedded comments:

  • Add a cpu-values.yaml file to trigger the CI, just as lvm-serve does. These xxx-values files contain additional configuration needed for CI to pass (they can be empty if there are no additional requirements).

  • Could you name this chart ollama instead of ollama-service? One of our 1.3 tasks is to make sure each Helm chart name is consistent with the corresponding name in GenAIComps.
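A cpu-values.yaml of the kind described can be very small. The resource figures below are illustrative assumptions only (as the review notes, the file may legitimately be empty):

```yaml
# cpu-values.yaml (illustrative sketch)
# Extra values CI overlays when running on CPU-only nodes.
# May be left empty if the chart's defaults already work there.
resources:
  limits:
    cpu: "4"
    memory: 8Gi
  requests:
    cpu: "1"
    memory: 4Gi
```

CI would apply it on top of the defaults, e.g. `helm install ollama . -f cpu-values.yaml`.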

lianhao (Collaborator) left a comment


Thanks for your contribution @jonminkin97

@lianhao lianhao requested a review from poussa February 12, 2025 01:38
@lianhao lianhao merged commit 7d66afb into opea-project:main Feb 14, 2025
10 checks passed
4 participants