Deploying GenAI¶
GenAIInfra is the containerization and cloud native suite for OPEA, including artifacts to deploy GenAI Examples in a cloud native way so enterprise users can deploy to their own cloud.
We’re building this documentation from content in the GenAIInfra GitHub repository.
Installation Guides¶
Helm Charts¶
- Helm charts for deploying GenAI Components and Examples
- HorizontalPodAutoscaler (HPA) support
- asr
- chathistory-usvc
- data-prep
- embedding-usvc
- guardrails-usvc
- llm-uservice
- mongodb
- prompt-usvc
- redis-vector-db
- reranking-usvc
- retriever-usvc
- speecht5
- tei
- teirerank
- tgi
- tts
- vllm
- web-retriever
- whisper
- ChatQnA
- ChatQnA Troubleshooting
- CodeGen
- CodeTrans
- DocSum