# OPEA Release Notes v1.3
We are excited to announce the release of OPEA version 1.3, which includes significant contributions from the open-source community. This release addresses over 520 pull requests.

More information about how to get started with OPEA v1.3 can be found on the [Getting Started](https://opea-project.github.io/latest/index.html) page. All project source code is maintained in the [opea-project organization](https://github.com/opea-project). To pull Docker images, please access the [Docker Hub](https://hub.docker.com/u/opea). For instructions on deploying Helm Charts, please refer to the [guide](https://github.com/opea-project/GenAIInfra/tree/v1.3/helm-charts#readme).

## Table of Contents
- [What's New in OPEA v1.3](#whats-new-in-opea-v13)
- [Deprecations](#deprecations)
- [Updated Dependencies](#updated-dependencies)
- [Changes to Default Behavior](#changes-to-default-behavior)
- [Validated Hardware](#validated-hardware)
- [Validated Software](#validated-software)
- [Known Issues](#known-issues)
- [Full Changelogs](#full-changelogs)
- [Contributors](#contributors)

## What's New in OPEA v1.3

This release introduces exciting capabilities, optimizations, and user-centric enhancements:

### Advanced Agent Capabilities
- <b>Multi-Turn Conversation</b>: Enhanced the OPEA agent framework for dynamic, context-aware dialogues. ([GenAIComps#1248](https://github.com/opea-project/GenAIComps/pull/1248)) 
- <b>Finance Agent Example</b>: A financial agent [example](https://github.com/opea-project/GenAIExamples/tree/v1.3/FinanceAgent) for automating financial data aggregation and leveraging LLMs to generate insights, forecasts, and strategic recommendations. ([GenAIExamples#](https://github.com/opea-project/GenAIExamples/pull/1539))

### Performance and Scalability
- <b>vLLM Enhancement</b>: Integrated [vLLM](https://github.com/vllm-project/vllm) as the default LLM serving backend for key GenAI examples across Intel® Xeon® processors, Intel® Gaudi® accelerators, and AMD® GPUs. ([GenAIExamples#](https://github.com/opea-project/GenAIExamples/pull/1436))
- <b>KubeAI Operator for OPEA (Alpha release)</b>: Simplified OPEA inference operations in cloud environment and enabled optimal out-of-the-box performance for specific models and hardware using profiles. ([GenAIInfra#945](https://github.com/opea-project/GenAIInfra/pull/945)) 

### Ecosystem Integrations
- <b>Haystack Integration</b>: Enabled OPEA as a backend of [Haystack](https://haystack.deepset.ai). ([Haystack-OPEA#](https://github.com/opea-project/Haystack-OPEA#))
- <b>Cloud Readiness</b>: Expanded automated Terraform deployment for ChatQnA to include support for Azure, and enabled CodeGen deployments on AWS and GCP. ([GenAIExamples#1731](https://github.com/opea-project/GenAIExamples/pull/1731))

### New GenAI Capabilities
- <b>OPEA Store</b>: Delivered a unified data store access API and a robust data store integration layer that streamlines data store integration. ArangoDB was integrated. ([GenAIComps#1493](https://github.com/opea-project/GenAIComps/pull/1493))
- <b>CodeGen using RAG and Agent</b>: Leveraged RAG and code agent to provide an additional layer of intelligence and adaptability for CodeGen example. ([GenAIExamples#1757](https://github.com/opea-project/GenAIExamples/pull/1757))
- <b>Enhanced Multimodality</b>: Added support for additional audio file types (.mp3) and supported spoken audio captions with image ingestion. ([GenAIExamples#1549](https://github.com/opea-project/GenAIExamples/issues/1549))
- <b>Struct to Graph</b>: Supported transforming structured data to graphs using Neo4j graph database. ([GenAIComps#1502](https://github.com/opea-project/GenAIComps/pull/1502))
- <b>Text to Graph</b>: Supported creating graphs from text by extracting graph triplets. ([GenAIComps#1357](https://github.com/opea-project/GenAIComps/pull/1357), [GenAIComps#](https://github.com/opea-project/GenAIComps/pull/1472))
- <b>Text to Cypher</b>: Supported generating and executing Cypher queries from natural language for graph database retrieval. ([GenAIComps#1319](https://github.com/opea-project/GenAIComps/pull/1319))

### Enhanced Evaluation
- <b>Enhanced Long-Context Model Evaluation</b>: Supported evaluating long-context model on Intel® Gaudi® with vLLM. ([HELMET#20](https://github.com/princeton-nlp/HELMET/pull/20))
- <b>TAG-Bench for SQL Agents</b>: Integrated [TAG-Bench](https://github.com/TAG-Research/TAG-Bench) to evaluate complex SQL query generation ([GenAIEval#](https://github.com/opea-project/GenAIEval/pull/230)).
- <b>DocSum Support</b>: GenAIEval now supports evaluating the performance of DocSum. ([GenAIEval#252](https://github.com/opea-project/GenAIEval/pull/252))
- <b>Toxicity Detection Evaluation</b>: Introduced a workflow to evaluate the capability of detecting toxic language based on LLMs. ([GenAIEval#241](https://github.com/opea-project/GenAIEval/pull/241))
- <b>Model Card</b>: Added a model card generator for generating reports containing model performance and fairness metrics. ([GenAIEval#236](https://github.com/opea-project/GenAIEval/pull/236))

### Observability
- <b>OpenTelemetry Tracing</b>: Leveraged OpenTelemetry to enable tracing for ChatQnA and AgentQnA along with TGI and TEI. ([GenAIExamples#1542](https://github.com/opea-project/GenAIExamples/issues/1542))
- <b>Application dashboards</b>: Helm installed application E2E performance dashboard(s). ([GenAIInfra#800](https://github.com/opea-project/GenAIInfra/pull/800))
- <b>E2E (end-to-end) metric improvements</b>: E2E metrics are summed together for applications that use multiple megaservice instances. Tests for the E2E metrics + fixes. ([GenAIComps#1301](https://github.com/opea-project/GenAIComps/pull/1301), ([GenAIComps#](https://github.com/opea-project/GenAIComps/pull/1343))

### Better User Experience
- <b>GenAIStudio</b>: Supported drag-and-drop creation of agentic applications. ([GenAIStudio#50](https://github.com/opea-project/GenAIStudio/pull/50))
- <b>Documentation Refinement</b>: Refined READMEs for key examples to help readers easily locate documentation tailored to deployment, customization, and hardware. ([GenAIExamples#1741](https://github.com/opea-project/GenAIExamples/issues/1741))
- <b>Optimized Dockerfiles</b>: Simplified application Dockerfiles for faster image builds. ([GenAIExamples#1585](https://github.com/opea-project/GenAIExamples/pull/1585))

### Exploration
- <b>SQFT</b>: Supported low-precision sparse parameter-efficient fine-tuning on LLMs. ([GenAIResearch#1](https://github.com/opea-project/GenAIResearch/pull/1))

### Newly Supported Models
OPEA introduced the support for the following models in this release.

| Model                                       | TGI-Gaudi | vLLM-CPU | vLLM-Gaudi | vLLM-ROCm | OVMS     | Optimum-Habana | PredictionGuard |
| ------------------------------------------- | --------- | -------- | ---------- | --------- | -------- | -------------- | --------------- |
| deepseek-ai/DeepSeek-R1-Distill-Llama-8B    | ✓         | ✓       | ✓          | ✓         | -        | ✓             | -               |   
| deepseek-ai/DeepSeek-R1-Distill-Llama-70B   | ✓         | ✓       | ✓          | ✓         | -        | ✓             | -               |                 
| deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B   | ✓         | ✓       | ✓          | ✓         | -        | ✓             | -               |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-7B     | ✓         | ✓       | ✓          | ✓         | -        | ✓             | -               |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-14B    | ✓         | ✓       | ✓          | ✓         | -        | ✓             | -               |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-32B    | ✓         | ✓       | ✓          | ✓         | -        | ✓             | -               |
| deepseek-ai/Deepseek-v3                     | ✓         | -        | ✓         | ✓         | -        | ✓              | -              | 
| Hermes-3-Llama-3.1-8B                       | -         | -        | -          | ✓         | -        | -              | ✓              |
| ibm-granite/granite-3.2-8b-instruct         | -         | -        | ✓          | ✓         | -        | -             | -               |
| Phi-4-mini                                  | x         | x        | x          | ✓         | x        | ✓             | -               |
| Phi-4-multimodal-instruct                   | x         | x        | x          | ✓         | x        | ✓             | -               |
| mistralai/Mistral-Small-24B-Instruct-2501   | ✓         | -        | ✓         | ✓         | -        | ✓              | -              |
| mistralai/Mistral-Large-Instruct-2411       | x         | -        | ✓          | ✓         | -        | ✓             | -              |

(✓: supported; -: not validated; x: unsupported)


### Newly Supported Hardware
- AMD® GPU using AMD® ROCm™ for 9 examples. ([GenAIExamples#1613](https://github.com/opea-project/GenAIExamples/pull/1613) and 8 more.)

### Other Notable Changes

Expand the following lists to read:

<details><summary>GenAIExamples</summary> 

- Functionalities
    - [AgentQnA] Added web search tool support and simplify the run instructions. ([#1656](https://github.com/opea-project/GenAIExamples/pull/1656)) (e8f2313)
    - [ChatQnA] Added support for latest deepseek models on Gaudi ([#1491](https://github.com/opea-project/GenAIExamples/pull/1491)) (9adf7a6)
    - [EdgeCraftRAG] A sleek new UI based on Vue and Ant Design for enhanced user experience, supporting concurrent multi-requests on vLLM, JSON pipeline configuration, and API-based prompt modification. ([#1665](https://github.com/opea-project/GenAIExamples/pull/1665)) (5a50ae0)
    - [EdgeCraftRAG] Supported multi-card deployment of Intel ARC GPU for vllm inference ([#1729](https://github.com/opea-project/GenAIExamples/pull/1729)) (1a0c5f0)
    - [FaqGen] Merged FaqGen into ChatQnA for unified Chatbot experience. ([#1654](https://github.com/opea-project/GenAIExamples/pull/1654)) (6d24c1c)

- Benchmark
    - [ChatQnA] Provided unified scalable deployment and benchmarking support for examples ([#1315](https://github.com/opea-project/GenAIExamples/pull/1315)) (ed16308)

- Deployment
    - Sync values yaml file for 1.3 release ([#1748](https://github.com/opea-project/GenAIExamples/pull/1748)) (46ebb78)

- Bug Fixes
    - [AgentQnA] Fixed errors for running AgentQnA on xeon with openai and update readme ([#1664](https://github.com/opea-project/GenAIExamples/pull/1664)) (fecc227)
    - [AudioQnA] Fixed the LLM model field for inputs alignment ([#1611](https://github.com/opea-project/GenAIExamples/pull/1611)) (2dfcfa0)

- Documentation
    - Updated README.md for OPEA OTLP tracing ([#1406](https://github.com/opea-project/GenAIExamples/pull/1406)) (4c41a5d)
    - Updated README.md for Agent UI ([#1495](https://github.com/opea-project/GenAIExamples/pull/1495)) (88a8235)
    - Refactored AudioQnA README ([#1508](https://github.com/opea-project/GenAIExamples/pull/1508)) (9f36e84)
    - Added a new section to change LLM model such as deepseek based on validated model table in LLM microservice ([#1501](https://github.com/opea-project/GenAIExamples/pull/1501)) (970b869)
    - Updated README.md of AIPC quick start ([#1578](https://github.com/opea-project/GenAIExamples/pull/1578)) (852bc70)
    - Added short descriptions to the images OPEA publishes on Docker Hub ([#1637](https://github.com/opea-project/GenAIExamples/pull/1637)) (68747a9)

- CI/CD/UT
    - Added UT for rerank finetuning on Gaudi ([#1472](https://github.com/opea-project/GenAIExamples/pull/1472)) (5f4b182)
    - Enabled Gaudi 3, Rocm and Arc on manually release test. ([#1615](https://github.com/opea-project/GenAIExamples/pull/1615)) (63b789a)
    - Enabled base image build in CI/CD ([#1669](https://github.com/opea-project/GenAIExamples/pull/1669)) (2204fe8)
    - ChatQnA run CI with latest base image, group logs in GHA outputs.  ([#1736](https://github.com/opea-project/GenAIExamples/pull/1736)) (c48cd65)
<!-- - Fix Multimodal & ProductivitySuite Issue (#1820) (b80449b) -->

</details>

<details><summary>GenAIComps</summary> 

- Functionalities
    - [agent] Enabled custom prompt for react_llama and react_langgraph ([#1391](https://github.com/opea-project/GenAIComps/pull/1391)) (558a2f6)
    - [dataprep] Added Multimodal support for Milvus for dataprep component ([#1380](https://github.com/opea-project/GenAIComps/pull/1380)) (006bd91)
    - [dataprep]: New Arango integration ([#1558](https://github.com/opea-project/GenAIComps/pull/1558))
    - [dataprep]: Added ability to customize Dataprep unique input parameters by way of subclassing the DataprepRequest pydantic model. Avoids having to introduce parameters unique to a few Dataprep integrations across all Dataprep providers ([#1525](https://github.com/opea-project/GenAIComps/pull/1525))
    - [retrieval]: New Arango integration ([#1558](https://github.com/opea-project/GenAIComps/pull/1558))
    - [cores/mega] Added remote endpoint support ([#1399](https://github.com/opea-project/GenAIComps/pull/1399)) (1871dec)
    - [docsum] Enlarged DocSum prompt buffer ([#1471](https://github.com/opea-project/GenAIComps/pull/1471)) (772ef6e)
    - [embeddings] Refined CLIP embedding microservice by leveraging the third-party CLIP ([#1298](https://github.com/opea-project/GenAIComps/pull/1298)) (7727235)
    - [finetuning] Added xtune to finetuning for Intel ARC GPU ([#1432](https://github.com/opea-project/GenAIComps/pull/1432)) (80ef317)
    - [guardrails] Added native support for toxicity detection guardrail microservice ([#1258](https://github.com/opea-project/GenAIComps/pull/1258)) (625aec9)   
    - [llm/text-generation] Added support for string message in Bedrock textgen ([#1291](https://github.com/opea-project/GenAIComps/pull/1291)) (364ccad)
    - [ipex] Added native LLM microservice using IPEX ([#1337](https://github.com/opea-project/GenAIComps/pull/1337)) (d51a136)
    - [lvm] Integrated vLLM to lvm as a backend ([#1362](https://github.com/opea-project/GenAIComps/pull/1362)) (831c5a3)
    - [lvm] Integrated UI-TARS vLLM in lvm component ([#1458](https://github.com/opea-project/GenAIComps/pull/1458)) (4a15795)
    - [nubula] Docker deployment support for Nebula graph database ([#1396](https://github.com/opea-project/GenAIComps/pull/1396)) (342c1ed)
    - [OVMS] Text generation, Embeddings and Reranking microservices based on [OVMS](https://github.com/openvinotoolkit/model_server) component ([#](https://github.com/opea-project/GenAIComps/pull/1318)) (78b94fc)
    - [retriever/milvus] Added Multimodal support for Milvus for retriever component ([#1381](https://github.com/opea-project/GenAIComps/pull/1381)) (40d431a)
    - [text2image & image2image] Enriched input parameters of text2image and image2image. ([#1339](https://github.com/opea-project/GenAIComps/pull/1339)) (42f323f)
    - Refined synchronized I/O in asynchronous functions ([#1300](https://github.com/opea-project/GenAIComps/pull/1300)) (b08571f)

- Bug Fixes
    - Docsum error by HuggingFaceEndpoint ([#1246](https://github.com/opea-project/GenAIComps/pull/1246)) (30e3dea)
    - Fixed tei embedding and tei reranking bug ([#1256](https://github.com/opea-project/GenAIComps/pull/1256)) (fa01f46)
    - Fixed web-retrievers hub client and tei endpoint issue ([#1270](https://github.com/opea-project/GenAIComps/pull/1270)) (ecb7f7b)
    - Fixed Dataprep Ingest Data Issue. ([#1271](https://github.com/opea-project/GenAIComps/pull/1271)) (b777db7)
    - Fixed metric id issue when init multiple Orchestrator instance ([#1280](https://github.com/opea-project/GenAIComps/pull/1280)) (f8e6216)
    - Bug Fix neo4j dataprep ingest error handling and skip_ingestion argument passing ([#1288](https://github.com/opea-project/GenAIComps/pull/1288)) (4a90692)
    - Fixed the retriever issue of Milvus ([#1286](https://github.com/opea-project/GenAIComps/pull/1286)) (47f68a4)
    - Fixed Qdrant retriever RAG issue. ([#1289](https://github.com/opea-project/GenAIComps/pull/1289)) (c3c8497)
    - Fixed agent message format. ([#1297](https://github.com/opea-project/GenAIComps/pull/1297)) (022d052)
    - Fixed milvus dataprep ingest files failure ([#1299](https://github.com/opea-project/GenAIComps/pull/1299)) (a033c05)
    - Fixed docker image security issues ([#1321](https://github.com/opea-project/GenAIComps/pull/1321)) (589587a)
    - Megaservice / orchestrator metric testing + fixes ([#1348](https://github.com/opea-project/GenAIComps/pull/1348)) (1064b2b)
    - Fixed finetuning python regex syntax error ([#1446](https://github.com/opea-project/GenAIComps/pull/1446)) (380f95c)
    - Upgraded Optimum Habana version to fix security check issue ([#1571](https://github.com/opea-project/GenAIComps/pull/1571)) (83350aa)
    - Make llamaguard compatible with both TGI and vLLM ([#1581](https://github.com/opea-project/GenAIComps/pull/1581)) (4024302)

- Documentation
    - GraphRAG README/compose fixes post refactor ([#1221](https://github.com/opea-project/GenAIComps/pull/1221)) (b38d9f3)
    - Updated docs for LLamaGuard & WildGuard Microservice ([#1259](https://github.com/opea-project/GenAIComps/pull/1259)) (0df374b)
    - Fixed Readme errors in dataprep component for all VectorDBs ([#1377](https://github.com/opea-project/GenAIComps/pull/1377)) (492f028)
    - Refined the README for llms/doc-summarization ([#1437](https://github.com/opea-project/GenAIComps/pull/1437)) (559ebb2)


- CI/CD/UT
    - Refine dataprep test scripts ([#1305](https://github.com/opea-project/GenAIComps/pull/1305)) (a4f6af1)

<!-- - Make llamaguard compatible with both TGI and vLLM (#1581) (4024302) -->
</details>

<details><summary>GenAIEval</summary>

- Auto Tuner
    - RAG Pilot - A RAG pipeline tuning tool allowing fine-grained control over key aspects of parsing, chunking, postprocessing, and generating selection, enabling better retrieval and response generation. ([#243](https://github.com/opea-project/GenAIEval/pull/243)) (97da8f2)

- Monitoring
    - Integrate with memory bandwidth exporter to support collection and reporting of memory bandwidth, cpu, mem metrics. ([#218](https://github.com/opea-project/GenAIEval/pull/218)) (df5fd3e)
    - Add benchmark docker image to support getting metrics among microservices and  fixed a missing package for benchmarking with Dockerfile  ([#249](https://github.com/opea-project/GenAIEval/pull/249)) (dc3409f)

- Metrics
    - Collect vllm latency metric for e2e test ([#244](https://github.com/opea-project/GenAIEval/pull/244)) (1b6a91d)

- Bug Fixes
    - Fix relative path issue for possion. ([#234](https://github.com/opea-project/GenAIEval/pull/234)) (3b9981a)
    - Add the missed file in release package ([#233](https://github.com/opea-project/GenAIEval/pull/233)) (28ed0db)
    - fix the error  of TTFT and TPOT while the bench target is chatqna_qlist_pubmed ([#238](https://github.com/opea-project/GenAIEval/pull/238)) (da04a9f)
    - Fix performance benchmark with pubmed ([#239](https://github.com/opea-project/GenAIEval/pull/239)) (5c8ab6e)

- Documentation
    - Add recommendations to platform optimization documentation (ea086a6)

<!-- - add N/A option (#264) (4224928) -->

</details>

<details><summary>GenAIInfra</summary> 

- HelmChart
    - [TDX] Added Intel TDX support to helm charts ([#799](https://github.com/opea-project/GenAIInfra/pull/799)) (040860e)
    - Add helm starter chart for developing new charts ([#776](https://github.com/opea-project/GenAIInfra/pull/776)) (6154b6c)
    - HPA enabling usability improvement ([#770](https://github.com/opea-project/GenAIInfra/pull/770)) (3016f5f)
    - Helm chart for Ollama ([#774](https://github.com/opea-project/GenAIInfra/pull/774)) (7d66afb)
    - Helm: Added Qdrant support ([#796](https://github.com/opea-project/GenAIInfra/pull/796)) (99ccf0c)
    - Chatqna: Added Qdrant DB support ([#813](https://github.com/opea-project/GenAIInfra/pull/813)) (5576cfd)
    - Helm installed application metric Grafana dashboards ([#800](https://github.com/opea-project/GenAIInfra/pull/800)) (f46e8c1)
    - LLM TextGen Bedrock Support ([#811](https://github.com/opea-project/GenAIInfra/pull/811)) (da37b9f)
    - codegen: Add rag pipeline and change default UI ([#985](https://github.com/opea-project/GenAIInfra/pull/985)) (46b1b6b)
    - dataprep/retriever: Support airgap offline environment ([#980](https://github.com/opea-project/GenAIInfra/pull/980)) (b9b10e9)

- CSP
    - Added automated provisioning of CosmosDB and App Insights for OPEA applications ([#657](https://github.com/opea-project/GenAIInfra/pull/657)) (d29bd2d)

- Bug Fixes
    - Fixed the helm chart release dependency update ([#842](https://github.com/opea-project/GenAIInfra/pull/842)) (f121edd)

- CI/CD/UT
    - CI: Enabled milvus related test ([#767](https://github.com/opea-project/GenAIInfra/pull/767)) (5b2cca9)
<!-- - chatqna: add VLLM_CPU_KVCACHE_SPACE setting (#981) (591772a) -->
</details>

<details><summary>GenAIStudio</summary> 

- Updated studio fe table UI and updated studio be according to the dataprep refactor ([#32](https://github.com/opea-project/GenAIStudio/pull/32)) (1168507)
- [Feat] Added GenAI Studio UI improvement ([#48](https://github.com/opea-project/GenAIStudio/pull/48)) (ad64f7c)
- Enabled LLM Traces for sandbox ([#51](https://github.com/opea-project/GenAIStudio/pull/51)) (df6b73e)
- Migrated to internal k8 mysql and enable deployment package generation for agentqna ([#52](https://github.com/opea-project/GenAIStudio/pull/52)) (0cddbe0)

</details>


## Deprecations
### Deprecated Examples
The following GenAI examples are deprecated, and were removed since OPEA v1.3:
| Example | Migration Solution | Reasons for Deprecation |
|--|--|--|
|[FaqGen](https://github.com/opea-project/GenAIExamples/tree/v1.2/FaqGen)|Use the example [ChatQnA](https://github.com/opea-project/GenAIExamples/tree/v1.3/ChatQnA) instead.| Provide users with a unified chatbot experience and reduce redundancy. |

### Deprecated Docker Images
The following Docker images are deprecated, and not updated / tagged for OPEA v1.3 release:
| Deprecated Docker Image | Migration Solution | Reasons for Deprecation |
|--|--|--|
|[opea/agent-ui](https://hub.docker.com/r/opea/agent-ui)|Use [opea/agent-openwebui](https://hub.docker.com/r/opea/agent-openwebui) instead.|Open WebUI based UI for better user experience.|
|[opea/chathistory-mongo-server](https://hub.docker.com/r/opea/chathistory-mongo-server)|Use [opea/chathistory-mongo](https://hub.docker.com/r/opea/chathistory-mongo) instead.| Follow the OPEA naming rules |
|[opea/faqgen](https://hub.docker.com/r/opea/faqgen)|Use [opea/chatqna](https://hub.docker.com/r/opea/chatqna) or [opea/chatqna-without-rerank](https://hub.docker.com/r/opea/chatqna-without-rerank) instead.| FaqGen is deprecated. |
|[opea/faqgen-ui](https://hub.docker.com/r/opea/faqgen-ui)|Use [opea/chatqna-ui](https://hub.docker.com/r/opea/chatqna-ui) instead.| FaqGen is deprecated. | 
|[opea/faqgen-react-ui](https://hub.docker.com/r/opea/faqgen-react-ui)|Use [opea/chatqna-ui](https://hub.docker.com/r/opea/chatqna-ui) instead.| FaqGen is deprecated. |
|[opea/feedbackmanagement](https://hub.docker.com/r/opea/feedbackmanagement)|Use [opea/feedbackmanagement-mongo](https://hub.docker.com/r/opea/feedbackmanagement-mongo) instead.| Follow the OPEA naming rules |
|[opea/promptregistry-mongo-server](https://hub.docker.com/r/opea/promptregistry-mongo-server)|Use [opea/promptregistry-mongo](https://hub.docker.com/r/opea/promptregistry-mongo) instead.| Follow the OPEA naming rules |

The following Docker images are deprecated, and will not be updated / tagged since OPEA v1.4 release:
| Deprecated Docker Image | Migration Solution | Reasons for Deprecation |
|--|--|--|
|[opea/chathistory-mongo](https://hub.docker.com/r/opea/chathistory-mongo)|Use [opea/chathistory](https://hub.docker.com/r/opea/chathistory) instead. The Docker image will be released with the `latest` tag before the v1.4 release. | OPEA introduced OPEAStore to decouple chathistory component from MongoDB.  |
|[opea/feedbackmanagement-mongo](https://hub.docker.com/r/opea/feedbackmanagement-mongo)|Use [opea/feedbackmanagement](https://hub.docker.com/r/opea/feedbackmanagement) instead. The Docker image will be released with the `latest` tag before the v1.4 release.| OPEA introduced OPEAStore to decouple feedback management component from MongoDB. |
|[opea/promptregistry-mongo](https://hub.docker.com/r/opea/promptregistry-mongo)|Use [opea/promptregistry](https://hub.docker.com/r/opea/promptregistry) instead. The Docker image will be released with the `latest` tag before the v1.4 release.| OPEA introduced OPEAStore to decouple prompt registry component from MongoDB. |

[All OPEA docker images](https://github.com/opea-project/GenAIExamples/blob/v1.3/docker_images_list.md)

### Deprecated GenAIExample Variables
| Example | Type | Variable | Migration Solution |
|--|--|--|--|
|[ChatQnA](https://github.com/opea-project/GenAIExamples/tree/v1.3/ChatQnA)|environment variable|`your_hf_api_token`|Removed from Intel AIPC deployment. Use the environment variable `HUGGINGFACEHUB_API_TOKEN` instead. This change aligns with the standardized naming conventions for environment variables.|
|[ChatQnA](https://github.com/opea-project/GenAIExamples/tree/v1.3/ChatQnA)|environment variable|`OLLAMA_HOST`|Removed from Intel AIPC deployment. Instead, users can customize `LLM_SERVER_HOST_IP` in `ChatQnA/docker_compose/intel/cpu/aipc/compose.yaml`.|
|[DocIndexRetriever](https://github.com/opea-project/GenAIExamples/tree/v1.3/DocIndexRetriever)|environment variable|`TGI_LLM_ENDPOINT`|Removed due to no uses.|
|[DocIndexRetriever](https://github.com/opea-project/GenAIExamples/tree/v1.3/DocIndexRetriever)|environment variable|`MEGA_SERVICE_HOST_IP`|Removed due to no uses.|
|[DocIndexRetriever](https://github.com/opea-project/GenAIExamples/tree/v1.3/DocIndexRetriever)|environment variable|`LLM_SERVICE_HOST_IP`|Removed due to no uses.|
|[GraphRAG](https://github.com/opea-project/GenAIExamples/tree/v1.3/GraphRAG)|environment variable|`MAX_OUTPUT_TOKENS`|Instead, it has been split into two new environment variables: `MAX_INPUT_TOKENS` (default: 4096) and `MAX_TOTAL_TOKENS` (default: 8192) to control the maximum token limits.|

### Deprecated GenAIComps Parameters
| Component | Parameter | Migration Solution |
|--|--|--|
|[agent](https://github.com/opea-project/GenAIComps/tree/v1.3/comps/agent)|`with_store` of `agent_config` in the Assistants APIs|Its functionality is now fully covered by the new `memory_type` parameter. In v1.3, please use `"with_memory": true` and `"memory_type": persistent` as its replacement. The `with_memory` parameter in `agent_config` of APIs is now enabled by default (true) for enabling multi-turn conversations. Please refer to the [guide](https://github.com/opea-project/GenAIComps/blob/v1.3/comps/agent/src/README.md#agent-memory) for more details.|

## Updated Dependencies

| Dependency | Hardware | Scope | Version | Version in OPEA v1.2 | Comments |
|--|--|--|--|--|--|
|gradio|-|all examples|5.11.0|5.5.0||
|huggingface/text-generation-inference|AMD GPU|all examples|2.4.1-rocm|2.3.1-rocm||
|huggingface/text-embeddings-inference|all|all examples|cpu-1.6|cpu-1.5||
|langchain<br/>langchain_community|-|llms/doc-summarization<br/>llms/faq-generation|0.3.14|0.3.15|Avoid bugs in FaqGen and DocSum.|
|optimum-habana|Gaudi|lvms/llama-vision|1.17.0|-||
|pytorch|Gaudi|all components|2.5.1|2.4.0||
|transformers|-|lvms/llama-vision|4.48.0|4.45.1||
|vllm|Xeon|all supported examples except EdgeCraftRAG|v0.8.3|-||
|vllm|Gaudi|all supported examples except EdgeCraftRAG|v0.6.6.post1+Gaudi-1.20.0|v0.6.4.post2+Gaudi-1.19.0||
|vllm|AMD GPU|all supported examples|rocm6.3.1_instinct_vllm0.8.3_20250410|-||

## Changes to Default Behavior
- [[agent](https://github.com/opea-project/GenAIComps/tree/v1.3/comps/agent)] The default model changed from `meta-llama/Meta-Llama-3-8B-Instruct` to `meta-llama/Llama-3.3-70B-Instruct`.


## Validated Hardware
- Intel® Arc™ Graphics GPU (A770)
- Intel® Gaudi® Al Accelerators (2nd, 3rd)
- Intel® Xeon® Scalable processor (4th, 5th, 6th)
- AMD® Instinct™ MI300X Accelerators (CDNA3)


## Validated Software
- AMD® ROCm™ Software [v6.3.3](https://rocm.docs.amd.com/projects/install-on-linux/en/docs-6.3.2/install/quick-start.html)
- Docker 28.0.4
- Docker Compose v2.34.0
- Intel® Gaudi® software and drivers [v1.20](https://docs.habana.ai/en/v1.20.0/Installation_Guide/index.html)
- Kubernetes v1.29.15
- TEI v1.6
- TGI v2.4.0 (Xeon), v2.3.1(Gaudi), v2.4.1 (ROCm)
- Torch v2.5.1
- Ubuntu 22.04
- vLLM v0.8.3 (Xeon/ROCm), v0.6.6 (Gaudi)

## Known Issues
- [AvatarChatbot](https://github.com/opea-project/GenAIExamples/tree/v1.3/AvatarChatbot) can not work in K8s environment because of a functional gap in wav2clip service. ([GenAIExamples#](https://github.com/opea-project/GenAIExamples/pull/1506))

## Full Changelogs
- GenAIExamples: [v1.2...v1.3](https://github.com/opea-project/GenAIExamples/compare/v1.2...v1.3)
- GenAIComps: [v1.2...v1.3](https://github.com/opea-project/GenAIComps/compare/v1.2...v1.3)
- GenAIInfra: [v1.2...v1.3](https://github.com/opea-project/GenAIInfra/compare/v1.2...v1.3)
- GenAIEval: [v1.2...v1.3](https://github.com/opea-project/GenAIEval/compare/v1.2...v1.3)
- GenAIStudio: [v1.2...v1.3](https://github.com/opea-project/GenAIStudio/compare/v1.2...v1.3)
- docs: [v1.2...v1.3](https://github.com/opea-project/docs/compare/v1.2...v1.3)

## Contributors
This release would not have been possible without the contributions of the following organizations and individuals.

### Contributing Organizations
- `Amazon`: Ollama deployment, Bedrock integration, OVMS integration and bug fixes.
- `AMD`: vLLM enablement on AMD GPUs for key examples, AMD GPUs enabling on more examples, AMD OPEA blogs.
- `ArangoDB`: OPEA Store and ArangoDB integration.
- `Intel`: Development and improvements to GenAI examples, components, infrastructure, and evaluation.
- `Infosys`: Azure support and documentation updates.
- `National Chiao Tung University`: Documentation updates.
- `Prediction Guard`: Maintenance of Prediction Guard components.

### Individual Contributors
For a comprehensive list of individual contributors, please refer to the [Full Changelogs](#full-changelogs) section.