Example AvatarChatbot Deployment on Intel® Gaudi® Platform¶

This document outlines the deployment process for a AvatarChatbot application utilizing the GenAIComps microservice pipeline on Intel Gaudi server. This example includes the following sections:

AvatarChatbot Quick Start Deployment: Demonstrates how to quickly deploy a AvatarChatbot service/pipeline on Intel® Gaudi® platform.
AvatarChatbot Docker Compose Files: Describes some example deployments and their docker compose files.
AvatarChatbot Service Configuration: Describes the service and possible configuration changes.

AvatarChatbot Quick Start Deployment¶

This section describes how to quickly deploy and test the AvatarChatbot service manually on Intel® Gaudi® platform. The basic steps are:

Access the Code
Generate a HuggingFace Access Token
Configure the Deployment Environment
Deploy the Service Using Docker Compose
Check the Deployment Status
Test the Pipeline
Cleanup the Deployment

Access the Code¶

Clone the GenAIExamples repository and access the AvatarChatbot Intel® Gaudi® platform Docker Compose files and supporting scripts:

git clone https://github.com/opea-project/GenAIExamples.git
cd GenAIExamples/AvatarChatbot/docker_compose/intel/hpu/gaudi/

Checkout a released version, such as v1.3:

git checkout v1.3

Generate a HuggingFace Access Token¶

Some HuggingFace resources, such as some models, are only accessible if you have an access token. If you do not already have a HuggingFace access token, you can create one by first creating an account by following the steps provided at HuggingFace and then generating a user access token.

Configure the Deployment Environment¶

To set up environment variables for deploying AvatarChatbot service, source the set_env.sh script in this directory:

source set_env.sh

The set_env.sh script will prompt for required and optional environment variables used to configure the AvatarChatbot service. If a value is not entered, the script will use a default value for the same. It will also generate a env file defining the desired configuration. Consult the section on AvatarChatbot Service configuration for information on how service specific configuration parameters affect deployments.

Deploy the Service Using Docker Compose¶

To deploy the AvatarChatbot service, execute the docker compose up command with the appropriate arguments. For a default deployment, execute:

docker compose up -d

The AvatarChatbot docker images should automatically be downloaded from the OPEA registry and deployed on the Intel® Gaudi® Platform:

[+] Running 7/7
 ✔ Network gaudi_default                         Created0.1s
 ✔ Container animation-gaudi-server              Started0.9s
 ✔ Container speecht5-service                    Started0.9s
 ✔ Container whisper-service                     Started0.9s
 ✔ Container tgi-gaudi-server                    Started0.7s
 ✔ Container wav2lip-service                     Started1.0s
 ✔ Container avatarchatbot-gaudi-backend-server  Started1.1s

Check the Deployment Status¶

After running docker compose, check if all the containers launched via docker compose have started:

docker ps -a

For the default deployment, the following 5 containers should be running:

CONTAINER ID   IMAGE                                 COMMAND                  CREATED         STATUS                          PORTS                                       NAMES
3a28b3f45f28   opea/avatarchatbot:latest             "python avatarchatbo…"   2 minutes ago   Up 2 minutes                    0.0.0.0:3009->8888/tcp, :::3009->8888/tcp   avatarchatbot-gaudi-backend-server
308eba6b90b8   opea/whisper-gaudi:latest             "python whisper_serv…"   2 minutes ago   Up 2 minutes                                               whisper-service
4cd4df4827e7   opea/wav2lip-gaudi:latest             "/usr/local/bin/entr…"   2 minutes ago   Up 2 minutes                    0.0.0.0:7860->7860/tcp, :::7860->7860/tcp   wav2lip-service
8e029d0ff462   ghcr.io/huggingface/tgi-gaudi:2.0.6   "text-generation-lau…"   2 minutes ago   Up 2 minutes                    0.0.0.0:3006->80/tcp, :::3006->80/tcp       tgi-gaudi-server
6fcc695f027e   opea/speecht5-gaudi:latest            "python speecht5_ser…"   2 minutes ago   Up 2 minutes                                               speecht5-service
3e0c598210d0   opea/animation:latest                 "python3 opea_animat…"   2 minutes ago   Up 2 minutes                    0.0.0.0:3008->9066/tcp, :::3008->9066/tcp   animation-gaudi-server

Test the Pipeline¶

Once the AvatarChatbot service are running, test the pipeline using the following command:

curl http://${host_ip}:3009/v1/avatarchatbot \
  -X POST \
  -d @assets/audio/sample_whoareyou.json \
  -H 'Content-Type: application/json'

If the megaservice is running properly, you should see the following output:

"/outputs/result.mp4"

The output file will be saved in the current working directory, as ${PWD} is mapped to /outputs inside the wav2lip-service Docker container.

Note The value of host_ip was set using the set_env.sh script and can be found in the .env file.

Cleanup the Deployment¶

To stop the containers associated with the deployment, execute the following command:

docker compose -f compose.yaml down

[+] Running 7/7
 ✔ Container wav2lip-service                     Removed                                                                                                                                    10.9s
 ✔ Container speecht5-service                    Removed                                                                                                                                     0.0s
 ✔ Container tgi-gaudi-server                    Removed                                                                                                                                     4.2s
 ✔ Container whisper-service                     Removed                                                                                                                                     0.0s
 ✔ Container avatarchatbot-gaudi-backend-server  Removed                                                                                                                                    10.4s
 ✔ Container animation-gaudi-server              Removed                                                                                                                                    10.3s
 ✔ Network gaudi_default                         Removed                                                                                                                                     0.4s

All the AvatarChatbot containers will be stopped and then removed on completion of the “down” command.

AvatarChatbot Docker Compose Files¶

The compose.yaml is default compose file using tgi as serving framework

Service Name	Image Name
tgi-service	ghcr.io/huggingface/tgi-gaudi:2.0.6
whisper-service	opea/whisper-gaudi:latest
speecht5-service	opea/speecht5-gaudi:latest
wav2lip-service	opea/wav2lip-gaudi:latest
animation	opea/animation:latest
avatarchatbot-gaudi-backend-server	opea/avatarchatbot:latest

AvatarChatbot Service Configuration¶

The table provides a comprehensive overview of the AvatarChatbot service utilized across various deployments as illustrated in the example Docker Compose files. Each row in the table represents a distinct service, detailing its possible images used to enable it and a concise description of its function within the deployment architecture.

Service Name	Possible Image Names	Optional	Description
tgi-service	ghcr.io/huggingface/tgi-gaudi:2.0.6	No	Specific to the TGI deployment, focuses on text generation inference using Gaudi hardware.
whisper-service	opea/whisper-gaudi:latest	No	Provides automatic speech recognition (ASR), converting spoken audio input into text.
speecht5-service	opea/speecht5-gaudi:latest	No	Performs text-to-speech (TTS) synthesis, generating natural-sounding speech from text.
wav2lip-service	opea/wav2lip-gaudi:latest	No	Generates realistic lip-sync animations by aligning speech audio with a video of a face.
animation	opea/animation:latest	No	Handles avatar animation, rendering facial expressions and movements for the chatbot avatar.
avatarchatbot-gaudi-backend-server	opea/avatarchatbot:latest	No	Orchestrates the overall AvatarChatbot pipeline, managing requests and integrating all services.