Deploying GenAI¶
GenAIInfra is the containerization and cloud native suite for OPEA, including artifacts to deploy GenAI Examples in a cloud native way so enterprise users can deploy to their own cloud.
We’re building this documentation from content in the GenAIInfra GitHub repository.
Installation Guides¶
Helm Charts¶
- Helm charts for deploying GenAI Components and Examples
- CI guidelines for helm charts
- HorizontalPodAutoscaler (HPA) support
- Monitoring support
- agent
- asr
- chathistory-usvc
- data-prep
- embedding-usvc
- gpt-sovits
- guardrails-usvc
- llm-uservice
- lvm-uservice
- mongodb
- prompt-usvc
- redis-vector-db
- reranking-usvc
- retriever-usvc
- speecht5
- tei
- teirerank
- tgi
- tts
- vllm
- web-retriever
- whisper
- AgentQnA
- AudioQnA
- ChatQnA
- ChatQnA Troubleshooting
- CodeGen
- CodeTrans
- DocSum
- FaqGen
- VisualQnA