# OPEA 2024 - 2025 Roadmap

## May 2024

### Contribution

- **Components**
  - ASR
  - Data Prep
  - Embedding
  - Guardrails
  - LLM (Gaudi TGI)
  - Rerank
  - Retrieval
  - TTS
  - VectorDB
- **Use Cases/Examples**
  - ChatQnA
  - CodeGen
  - CodeTrans
- **Cloud Native**
  - OneClick OPEA on ChatQnA
  - OneClick OPEA on CodeGen
  - GenAI microservice connector
- **Evaluation & Others**
  - CICD & Validation
  - Eval: E2E (GenAIComps & GenAIExamples), lm-eval-harness, bigcode-eval-harness
  - RAGAS evaluation service

### AI Models

- LLM: llama2 (7b, 13b, 70b), llama3 (8b, 70b), code-llama, Llama guard
- Embedding: BGE-base

### AI Tools Integration

- VectorDB: Chroma
- Framework: Langchain

### Deployment Type

- On Prem, IDC (Xeon, Gaudi)

## June 2024

### Contribution

- **Components**
  - LLM (Xeon vLLM & Ray, Ollama)
  - OVMS
  - prompting
  - user feedback management
  - Mega Component (MI6 RAG service)
- **Use Cases/Examples**
  - DocSum
  - SearchQnA
- **Cloud Native**
  - OneClick OPEA for 2 more examples
  - GMC with switch support (dynamic pipelines)
  - Helm charts/templates for custom yamls (refactoring)
- **Evaluation & Others**
  - CICD & Validation
  - Eval: E2E (GenAIComps & GenAIExamples) Gaudi (2) and CPUs in CICD cluster

### AI Models

- LLM: mistral-7B, mixtral-8x7B
- Embedding: E5-mistral-7b-instruct, all-mpnet-base-v2

### AI Tools Integration

- VectorDB: Pinecone, Redis
- Framework: Llamaindex, Haystack

### Deployment Type

- On Prem, IDC (Xeon, Gaudi)

## July 2024

### Contribution

- **Components**
  - LVM (Gaudi vLLM & Ray)
  - vectordb (svs)
  - Gateway guardrail, Auth Z/N
- **Use Cases/Examples**
  - FAQGen
- **Cloud Native**
  - OpenShift enablement for OPEA
  - OneClick OPEA for 3 more examples
  - Security (Service Mesh, guardrails)
- **Evaluation & Others**
  - CICD & Validation
  - Eval: E2E (GenAIComps & GenAIExamples)

### AI Models

- LLM: Phi, Gemma
- Embedding: all-MiniLM-L6-v2, paraphrase-albert-small-v2

### AI Tools Integration

- VectorDB: PGVector, Qdrant

### Deployment Type

## Aug 2024

### Contribution

- **Components**
  - Documentation
  - Test automation script
  - Telemetry
- **Use Cases/Examples**
  - Documentation
  - Test automation script
- **Cloud Native**
  - Demo K8s resource management
  - Documentation on autoscaler analysis
- **Evaluation & Others**
  - CICD & Validation
  - Eval: E2E (GenAIComps & GenAIExamples)

### AI Models

- Vision: llava
- Mixtral-8x22B

### AI Tools Integration

- VectorDB: Milvus

### Deployment Type

- Public Cloud AWS (Xeon CPU & NV GPU)

## Sep 2024

### Contribution

- **Components**
  - Microservice for Image and Video
- **Use Cases/Examples**
  - Text to Image generation
  - Image to Video generation
  - Playground (composable and configurable)
- **Cloud Native**
- **Evaluation & Others**
  - CICD & Validation
  - Eval: E2E (GenAIComps & GenAIExamples)

### AI Models

- Diffusion model:
  - Stable Diffusion XL
  - Stable Diffusion 3M
  - Stable Video Diffusion

### AI Tools Integration

- VectorDB: Weaviate

### Deployment Type

## Q4 2024

### Contribution

- **Components**
  - Fine-tuning E2E pipeline
  - Knowledge Graph
- **Use Cases/Examples**
  - Fine-tuning (LoRA)
  - AI Agent (single Agent with text and Audio as user interface)
  - Closed source LLM
  - GraphRAG
- **Cloud Native**
  - Static tuning on Resource management for deployment
- **Evaluation & Others**
  - CICD & Validation
  - Eval: E2E (GenAIComps & GenAIExamples)

### AI Models

- LLM open: Grok 1
- LLM closed: GPT3.5/4/4o, Claude 3/3.5
- AWS Bedrock endpoint

### AI Tools Integration

- Knowledge graph: Neo4j
- Agent: LangGraph

### Deployment Type

- Public Cloud (Azure, GCP, Oracle, AWS)
- AI PC (Intel)

## Q1 2025

### Contribution

- **Components**
  - More microservice requests from the community
  - Confidential Container
- **Use Cases/Examples**
  - AI Agent (Multi Agent)
  - Fine-tuning (Adaptive)
  - Long context window (>1M)
  - GenAI Studio
- **Cloud Native**
  - Dynamic tuning on Resource management through K8s
- **Evaluation & Others**
  - CICD & Validation
  - Eval: E2E (GenAIComps & GenAIExamples)

### AI Models

- LLM: SetFit
- More to be defined

### AI Tools Integration

- AutoGen, CrewAI

### Deployment Type

- Public Cloud (tier2 CSP)
- AI PC (others)