retriever-usvc

Helm chart for deploying OPEA retriever-usvc microservice.

Installing the chart

retriever-usvc will use TEI for embedding service, and support different vector DB backends.

  • TEI: please refer to the tei for more information.

  • Redis vector DB: please refer to redis-vector-db for more information.

  • Milvus DB: please refer to milvus-helm for more information.

First, you need to install the tei helm chart and one of the vector DB service, i.e. redis-vector-db chart.

After you’ve deployed dependency charts successfully, please run kubectl get svc to get the service endpoint URL respectively, i.e. http://tei:80, redis://redis-vector-db:6379.

To install retriever-usvc chart, run the following:

cd GenAIInfra/helm-charts/common/retriever-usvc
helm dependency update
export HFTOKEN="insert-your-huggingface-token-here"
export TEI_EMBEDDING_ENDPOINT="http://tei"

# Install retriever-usvc with Redis DB backend
export RETRIEVER_BACKEND="REDIS"
export DB_HOST="redis-vector-db"
helm install retriever-usvc . --set TEI_EMBEDDING_ENDPOINT=${TEI_EMBEDDING_ENDPOINT} --set global.HUGGINGFACEHUB_API_TOKEN=${HF_TOKEN} --set RETRIEVER_BACKEND=${RETRIEVER_BACKEND} --set REDIS_HOST=${DB_HOST}

# Install retriever-usvc with Milvus DB backend
# export RETRIEVER_BACKEND="MILVUS"
# export DB_HOST="milvus"
# helm install retriever-usvc . --set TEI_EMBEDDING_ENDPOINT=${TEI_EMBEDDING_ENDPOINT} --set global.HUGGINGFACEHUB_API_TOKEN=${HF_TOKEN} --set RETRIEVER_BACKEND=${RETRIEVER_BACKEND} --set MILVUS_HOST=${DB_HOST}

Verify

To verify the installation, run the command kubectl get pod to make sure all pods are running.

Then run the command kubectl port-forward svc/retriever-usvc 7000:7000 to expose the retriever-usvc service for access.

Open another terminal and run the following command to verify the service if working:

export your_embedding=$(python3 -c "import random; embedding = [random.uniform(-1, 1) for _ in range(768)]; print(embedding)")
curl http://localhost:7000/v1/retrieval  \
    -X POST \
    -d "{\"text\":\"What is the revenue of Nike in 2023?\",\"embedding\":${your_embedding}}" \
    -H 'Content-Type: application/json'

Values

Key

Type

Default

Description

global.HUGGINGFACEHUB_API_TOKEN

string

""

Your own Hugging Face API token

service.port

string

"7000"

RETRIEVER_BACKEND

string

"REDIS"

vector DB backend to use, one of “REDIS”, “MILVUS”

REDIS_HOST

string

""

Redis service URL host, only valid for Redis, please see values.yaml for other Redis configuration

MILVUS_HOST

string

""

Milvus service URL host, only valid for Milvus, please see values.yaml for other Milvus configuration

TEI_EMBEDDING_ENDPOINT

string

""

global.monitoring

bool

false

Milvus support

Refer to the milvus-values.yaml for Milvus configurations.