# OPEA Release Notes v0.9

## What’s New in OPEA v0.9

- Broaden functionality
  - Provide telemetry functionalities for metrics and tracing using Prometheus, Grafana, and Jaeger
  - Initialize two Agent examples: AgentQnA and DocIndexRetriever
  - Support for authentication and authorization
  - Add Nginx Component to strengthen backend security
  - Provide Toxicity Detection Microservice
  - Support the experimental Fine-tuning microservice
- Enhancement
  - Align the Microservice format with the standards of OpenAI (Chat Completions, Fine-tuning, etc.)
  - Enhance the performance benchmarking and evaluation for GenAI Examples (e.g., TGI and resource allocation)
  - Enable support for launching container images as a non-root user
  - Use Llama-Guard-2-8B as the default Guardrails model, bge-large-zh-v1.5 as the default embedding model, and mistral-7b-grok as the default CodeTrans model
  - Add ProductivitySuite to provide access management and maintain user context
- Deployment
  - Support Red Hat OpenShift Container Platform (RHOCP)
  - GenAI Microservices Connector (GMC) successfully tested on NVIDIA GPUs
  - Add Kubernetes support for AudioQnA and VisualQnA examples
- OPEA Docker Hub: https://hub.docker.com/u/opea
- GitHub IO: https://opea-project.github.io/latest/index.html
- Thanks to the external contributors [Sharan Shirodkar](https://github.com/sharanshirodkar7), [Aishwarya Ramasethu](https://github.com/aramasethu), [Michal Nicpon](https://github.com/michalnicp) and [Jacob Mansdorfer](https://github.com/jmansdorfer)

## Details

### GenAIExamples

- ChatQnA
  - Update port in set_env.sh([040d2b7](https://github.com/opea-project/GenAIExamples/commit/040d2b7))
  - Fix minor issue in ChatQnA Gaudi docker README([a5ed223](https://github.com/opea-project/GenAIExamples/commit/a5ed223))
  - update chatqna dataprep-redis port([02a1536](https://github.com/opea-project/GenAIExamples/commit/02a1536))
  - Add support for .md file in file upload in the chatqna-ui([7a67298](https://github.com/opea-project/GenAIExamples/commit/7a67298))
  - Added the ChatQnA delete feature, and updated the corresponding README([09a3196](https://github.com/opea-project/GenAIExamples/commit/09a3196))
  - fixed ISSUE-528([45cf553](https://github.com/opea-project/GenAIExamples/commit/45cf553))
  - Fix vLLM and vLLM-on-Ray UT bug([cfcac3f](https://github.com/opea-project/GenAIExamples/commit/cfcac3f))
  - set OLLAMA_MODEL env to docker container([c297155](https://github.com/opea-project/GenAIExamples/commit/c297155))
  - Update guardrail docker file path([06c4484](https://github.com/opea-project/GenAIExamples/commit/06c4484))
  - remove ray serve([c71bc68](https://github.com/opea-project/GenAIExamples/commit/c71bc68))
  - Refine docker_compose for dataprep param settings([3913c7b](https://github.com/opea-project/GenAIExamples/commit/3913c7b))
  - fix chatqna guardrails([db2d2bd](https://github.com/opea-project/GenAIExamples/commit/db2d2bd))
  - Support ChatQnA pipeline without rerank microservice([a54ffd2](https://github.com/opea-project/GenAIExamples/commit/a54ffd2))
  - Update the number of microservice replicas for OPEA v0.9([e6b4fff](https://github.com/opea-project/GenAIExamples/commit/e6b4fff))
  - Update set_env.sh([9657f7b](https://github.com/opea-project/GenAIExamples/commit/9657f7b))
  - add env for chatqna vllm([f78aa9e](https://github.com/opea-project/GenAIExamples/commit/f78aa9e))
- Deployment
  - update manifests for v0.9([ba78b4c](https://github.com/opea-project/GenAIExamples/commit/ba78b4c))
  - Update K8S manifest for ChatQnA/CodeGen/CodeTrans/DocSum([01c1b75](https://github.com/opea-project/GenAIExamples/commit/01c1b75))
  - Update benchmark manifest to fix errors([4fd3517](https://github.com/opea-project/GenAIExamples/commit/4fd3517))
  - Update env for manifest([4fa37e7](https://github.com/opea-project/GenAIExamples/commit/4fa37e7))
  - update manifests for v0.9([08f57fa](https://github.com/opea-project/GenAIExamples/commit/08f57fa))
  - Add AudioQnA example via GMC([c86cf85](https://github.com/opea-project/GenAIExamples/commit/c86cf85))
  - add k8s support for audioqna([0a6bad0](https://github.com/opea-project/GenAIExamples/commit/0a6bad0))
  - Update mainifest for FaqGen([80e3e2a](https://github.com/opea-project/GenAIExamples/commit/80e3e2a))
  - Add kubernetes support for VisualQnA([4f7fc39](https://github.com/opea-project/GenAIExamples/commit/4f7fc39))
  - Add dataprep microservice to chatQnA example and the e2e test([1c23d87](https://github.com/opea-project/GenAIExamples/commit/1c23d87))
- Documentation
  - [doc] Update README.md([c73e4e0](https://github.com/opea-project/GenAIExamples/commit/c73e4e0))
  - doc fix: Update README.md to remove specific dicscription of paragraph-1([5a9c109](https://github.com/opea-project/GenAIExamples/commit/5a9c109))
  - doc: fix markdown in docker_image_list.md([9277fe6](https://github.com/opea-project/GenAIExamples/commit/9277fe6))
  - doc: fix markdown in Translation/README.md([d645305](https://github.com/opea-project/GenAIExamples/commit/d645305))
  - doc: fix markdown in SearchQnA/README.md([c461b60](https://github.com/opea-project/GenAIExamples/commit/c461b60))
  - doc: fix FaqGen/README.md markdown([704ec92](https://github.com/opea-project/GenAIExamples/commit/704ec92))
  - doc: fix markdown in DocSum/README.md([83712b9](https://github.com/opea-project/GenAIExamples/commit/83712b9))
  - doc: fix markdown in CodeTrans/README.md([076bca3](https://github.com/opea-project/GenAIExamples/commit/076bca3))
  - doc: fix CodeGen/README.md markdown([33f8329](https://github.com/opea-project/GenAIExamples/commit/33f8329))
  - doc: fix markdown in ChatQnA/README.md([015a2b1](https://github.com/opea-project/GenAIExamples/commit/015a2b1))
  - doc: fix headings in markdown files([21fab71](https://github.com/opea-project/GenAIExamples/commit/21fab71))
  - doc: missed an H1 in the middle of a doc([4259240](https://github.com/opea-project/GenAIExamples/commit/4259240))
  - doc: remove use of HTML for table in README([e81e0e5](https://github.com/opea-project/GenAIExamples/commit/e81e0e5))
  - Update ChatQnA readme with OpenShift instructions([ed48371](https://github.com/opea-project/GenAIExamples/commit/ed48371))
  - Convert HTML to markdown format.([14621f8](https://github.com/opea-project/GenAIExamples/commit/14621f8))
  - Fix typo {your_ip} to {host_ip}([ad8ca88](https://github.com/opea-project/GenAIExamples/commit/ad8ca88))
  - README fix typo([abc02e1](https://github.com/opea-project/GenAIExamples/commit/abc02e1))
  - fix script issues in MD file([acdd712](https://github.com/opea-project/GenAIExamples/commit/acdd712))
  - Minor documentation improvements in the CodeGen README([17b9676](https://github.com/opea-project/GenAIExamples/commit/17b9676))
  - Refine Main README([08eb269](https://github.com/opea-project/GenAIExamples/commit/08eb269))
  - [Doc]Add a micro/mega service WorkFlow for DocSum([343d614](https://github.com/opea-project/GenAIExamples/commit/343d614))
  - Update README for k8s deployment([fbb81b6](https://github.com/opea-project/GenAIExamples/commit/fbb81b6))
- Other examples
  - Clean deprecated VisualQnA code([87617e7](https://github.com/opea-project/GenAIExamples/commit/87617e7))
  - Using TGI official release docker image for intel cpu([b2771ad](https://github.com/opea-project/GenAIExamples/commit/b2771ad))
  - Add VisualQnA UI([923cf69](https://github.com/opea-project/GenAIExamples/commit/923cf69))
  - fix container name([5ac77f7](https://github.com/opea-project/GenAIExamples/commit/5ac77f7))
  - Add VisualQnA docker for both Gaudi and Xeon using TGI serving([2390920](https://github.com/opea-project/GenAIExamples/commit/2390920))
  - Remove LangSmith from Examples([88eeb0d](https://github.com/opea-project/GenAIExamples/commit/88eeb0d))
  - Modify the language variable to match language highlight.([f08d411](https://github.com/opea-project/GenAIExamples/commit/f08d411))
  - Remove deprecated folder.([7dd9952](https://github.com/opea-project/GenAIExamples/commit/7dd9952))
  - update env for manifest([4fa37e7](https://github.com/opea-project/GenAIExamples/commit/4fa37e7))
  - AgentQnA example([67df280](https://github.com/opea-project/GenAIExamples/commit/67df280))
  - fix tgi xeon tag([6674832](https://github.com/opea-project/GenAIExamples/commit/6674832))
  - Add new DocIndexRetriever example([566cf93](https://github.com/opea-project/GenAIExamples/commit/566cf93))
  - Add env params for chatqna xeon test([5d3950](https://github.com/opea-project/GenAIExamples/commit/5d3950))
  - ProductivitySuite Combo Application with REACT UI and Keycloak Authen([947cbe3](https://github.com/opea-project/GenAIExamples/commit/947cbe3))
  - change codegen tgi model([06cb308](https://github.com/opea-project/GenAIExamples/commit/06cb308))
  - change searchqna prompt([acbaaf8](https://github.com/opea-project/GenAIExamples/commit/acbaaf8))
  - minor fix mismatched hf token([ac324a9](https://github.com/opea-project/GenAIExamples/commit/ac324a9))
  - fix translation gaudi env([4f3be23](https://github.com/opea-project/GenAIExamples/commit/4f3be23))
  - Minor fixes for CodeGen Xeon and Gaudi Kubernetes codegen.yaml ([c25063f](https://github.com/opea-project/GenAIExamples/commit/c25063f))
- CI/CD/UT
  - update deploy_gmc logical in cd workflow([c016d82](https://github.com/opea-project/GenAIExamples/commit/c016d82))
  - fix ghcr.io/huggingface/text-generation-inference tag([503a1a9](https://github.com/opea-project/GenAIExamples/commit/503a1a9))
  - Add GMC e2e in CD workflow([f45e4c6](https://github.com/opea-project/GenAIExamples/commit/f45e4c6))
  - Fix CI test changed file detect issue([5dcadf3](https://github.com/opea-project/GenAIExamples/commit/5dcadf3))
  - update cd workflow name([3363a37](https://github.com/opea-project/GenAIExamples/commit/3363a37))
  - Change microservice tags in CD workflow([71363a6](https://github.com/opea-project/GenAIExamples/commit/71363a6))
  - Fix manual freeze images workflow([c327972](https://github.com/opea-project/GenAIExamples/commit/c327972))
  - open chatqna guardrails test([db2d2bd](https://github.com/opea-project/GenAIExamples/commit/db2d2bd))
  - Add gmc build, scan and deploy workflow([a39f23a](https://github.com/opea-project/GenAIExamples/commit/a39f23a))
  - Enhance CI/CD infrastructure([c26d0f6](https://github.com/opea-project/GenAIExamples/commit/c26d0f6))
  - Fix typo in CI workflow([e12baca](https://github.com/opea-project/GenAIExamples/commit/e12baca))
  - Fix ChatQnA Qdrant CI issues([e71aba0](https://github.com/opea-project/GenAIExamples/commit/e71aba0))
  - remove continue-on-error: true to stop the test when image build failed([6296e9f](https://github.com/opea-project/GenAIExamples/commit/6296e9f))
  - Fix CD workflow typos([039014f](https://github.com/opea-project/GenAIExamples/commit/039014f))
  - Freeze base images([c9f9aca](https://github.com/opea-project/GenAIExamples/commit/c9f9aca))
  - support multiple test cases for ChatQnA([939502d](https://github.com/opea-project/GenAIExamples/commit/939502d))
  - set action back to pull_request_target([1c07a38](https://github.com/opea-project/GenAIExamples/commit/1c07a38))
  - Add BoM collect workflow and image publish workflow([e93146b](https://github.com/opea-project/GenAIExamples/commit/e93146b))
  - Fix left issues in CI/CD structure refactor([a6385bc](https://github.com/opea-project/GenAIExamples/commit/a6385bc))
  - Add composable manifest e2e test for cd workflow([d68be05](https://github.com/opea-project/GenAIExamples/commit/d68be05))
  - Add secrets for CI test([3c9e2aa](https://github.com/opea-project/GenAIExamples/commit/3c9e2aa))
  - Build up docker images CD workflow([8c384e0](https://github.com/opea-project/GenAIExamples/commit/8c384e0))
  - fix corner issue in CI test([64bfea9](https://github.com/opea-project/GenAIExamples/commit/64bfea9))
  - Rename github workflow files([ebc165a](https://github.com/opea-project/GenAIExamples/commit/ebc165a))
  - Improve manifest chaqna test([a072441](https://github.com/opea-project/GenAIExamples/commit/a072441))
  - Refactor build image workflows with common action.yml([e22d413](https://github.com/opea-project/GenAIExamples/commit/e22d413))
  - Automatic create issue to GenAIInfra when docker compose files changed([8bdb598](https://github.com/opea-project/GenAIExamples/commit/8bdb598))
  - Add components owner([ab98795](https://github.com/opea-project/GenAIExamples/commit/ab98795))
  - Fix code scan warning([ac89855](https://github.com/opea-project/GenAIExamples/commit/ac89855))
  - Check url of docker image list.([cf021ee](https://github.com/opea-project/GenAIExamples/commit/cf021ee))
  - change namespace surfix to random string ([46af6f3](https://github.com/opea-project/GenAIExamples/commit/46af6f3))
  - chatqna k8s manifest: Fixed retriever-redis v0.9 image issue([7719755](https://github.com/opea-project/GenAIExamples/commit/7719755))
  - Adding Trivy and SBOM actions([f3ffcd5](https://github.com/opea-project/GenAIExamples/commit/f3ffcd5))
  - optimize CI log format([dfaf479](https://github.com/opea-project/GenAIExamples/commit/dfaf479))

### GenAIComps

- Cores
  - Refine parameter in api_protocol.py([0584b45](https://github.com/opea-project/GenAIComps/commit/0584b45))
  - Revert the default value of max_new_tokens to 1024([f2497c5](https://github.com/opea-project/GenAIComps/commit/f2497c5))
  - Fixed Orchestrator schedule method([76877c1](https://github.com/opea-project/GenAIComps/commit/76877c1))
  - fix wrong indent([9b0edf2](https://github.com/opea-project/GenAIComps/commit/9b0edf2))
  - Allow downstream of streaming nodes([90e367e](https://github.com/opea-project/GenAIComps/commit/90e367e))
  - Add Retrieval gateway in core to support IndexRetrivel Megaservice([56daf95](https://github.com/opea-project/GenAIComps/commit/56daf95))
  - add telemetry doc([2a2a93](https://github.com/opea-project/GenAIComps/commit/2a2a93))
- LLM/embedding/reranking/retrieval
  - Using habana docker 1.16.1 everywhere([5deb383](https://github.com/opea-project/GenAIComps/commit/5deb383))
  - adding entrypoint.sh to faq-generation comp ([4a7b8f4](https://github.com/opea-project/GenAIComps/commit/4a7b8f4))
  - Fix image in docker compose yaml to use the built docker image tag from the README([72a2553](https://github.com/opea-project/GenAIComps/commit/72a2553))
  - Refine LLM Native Microservice([b16b14a](https://github.com/opea-project/GenAIComps/commit/b16b14a))
  - Fix Retriever qdrant issue([7aee7e4](https://github.com/opea-project/GenAIComps/commit/7aee7e4))
  - Change /root/ to /home/user/.([4a67d42](https://github.com/opea-project/GenAIComps/commit/4a67d42))
  - Fix embeddings_langchain-mosec issue.([87905ad](https://github.com/opea-project/GenAIComps/commit/87905ad))
  - fix HuggingFaceEmbedding deprecated in favor of HuggingFaceInferenceAPIEmbedding([2891cc6](https://github.com/opea-project/GenAIComps/commit/2891cc6))
  - align vllm-ray response format to tgi response format([ac4a777](https://github.com/opea-project/GenAIComps/commit/ac4a777))
  - build new images for llms([ed99d47](https://github.com/opea-project/GenAIComps/commit/ed99d47))
  - LLM micro service input data does not have input model name([761f7e0](https://github.com/opea-project/GenAIComps/commit/761f7e0))
  - Fix OpenVINO vLLM build scripts and update unit test case([91d825c](https://github.com/opea-project/GenAIComps/commit/91d825c))
  - Refine the instructions to run the retriever example with qdrant([eb51018](https://github.com/opea-project/GenAIComps/commit/eb51018))
  - Add cmds to restart ollama service and add proxy settings while launching docker([8eb8b6a](https://github.com/opea-project/GenAIComps/commit/8eb8b6a))
  - Vllm and vllm-ray bug fix (add opea for vllm, update setuptools version)([0614fc2](https://github.com/opea-project/GenAIComps/commit/0614fc2))
  - remove deprecated langchain imports and switch to langchain-huggingface([055404a](https://github.com/opea-project/GenAIComps/commit/055404a))
  - [Enhence] Increase mosec_embedding forward timeout to support high concurrency cases([b61f61b](https://github.com/opea-project/GenAIComps/commit/b61f61b))
  - Fix issues in updating embedding & reranking model to bge-large-zh-v1.5([da19c5d](https://github.com/opea-project/GenAIComps/commit/da19c5d))
  - refact embedding/ranking/llm request/response by referring to openai format([7287caa](https://github.com/opea-project/GenAIComps/commit/7287caa))
  - align VLLM micro-service output format with UI([c1887ed](https://github.com/opea-project/GenAIComps/commit/c1887ed))
  - fix vllm docker command([c1a5883](https://github.com/opea-project/GenAIComps/commit/c1a5883))
  - Update Embedding Mosec Dockerfile to use BAAI/bge-large-zh-v1.5([bbdc1f0](https://github.com/opea-project/GenAIComps/commit/bbdc1f0))
  - remove length limitation of embedding([edcd1e8](https://github.com/opea-project/GenAIComps/commit/edcd1e8))
  - Support SearchedDoc input type in LLM for No Rerank Pipeline ([3c29fb4](https://github.com/opea-project/GenAIComps/commit/3c29fb4))
  - Add local_embedding return 768 length to align with chatqna example([a234db](https://github.com/opea-project/GenAIComps/commit/a234db))
  - Refine LLM for No Rerank([fe8ef3f](https://github.com/opea-project/GenAIComps/commit/fe8ef3f))
  - Remove redundant dependency from 'vllm-ray' comps([068527d](https://github.com/opea-project/GenAIComps/commit/068527d))
- LVM/TTS/ASR
  - Revise TTS, SpeechT5Model to end the last audio chunk at the correct punctuation mark location([20fc8ca](https://github.com/opea-project/GenAIComps/commit/20fc8ca))
  - Support llava-next using TGI([e156101](https://github.com/opea-project/GenAIComps/commit/e156101))
  - whisper: Fix container build failure([d5b8cdf](https://github.com/opea-project/GenAIComps/commit/d5b8cdf))
  - support whisper long-form generation ([daec680](https://github.com/opea-project/GenAIComps/commit/daec680))
  - Support multiple image sources for LVM microservice([ed776ac](https://github.com/opea-project/GenAIComps/commit/ed776ac))
  - fix ffmpeg build on hpu([ac3909d](https://github.com/opea-project/GenAIComps/commit/ac3909d))
  - Support streaming output for LVM microservice([c5a0344](https://github.com/opea-project/GenAIComps/commit/c5a0344))
  - Add video-llama LVM microservice under lvms([db8c893](https://github.com/opea-project/GenAIComps/commit/db8c893))
  - add torchvision into requirements([1566047](https://github.com/opea-project/GenAIComps/commit/1566047))
  - Use Gaudi base images from Dockerhub([33db504](https://github.com/opea-project/GenAIComps/commit/33db504))
  - update the requirements.txt for tts and asr([5ba2561](https://github.com/opea-project/GenAIComps/commit/5ba2561))
- DataPrep
  - Fix Dataprep qdrant issues and add Test Script([a851abf](https://github.com/opea-project/GenAIComps/commit/a851abf))
  - Refine robustness of Dataprep Redis([04986c1](https://github.com/opea-project/GenAIComps/commit/04986c1))
  - Address testcase failure([075e84f](https://github.com/opea-project/GenAIComps/commit/075e84f))
  - Added support for Unified Port, GET/DELETE endpoints in pgvector Dataprep([8a62bac](https://github.com/opea-project/GenAIComps/commit/8a62bac))
  - Update dataprep default mosec embedding model in config.py([8f0f2b0](https://github.com/opea-project/GenAIComps/commit/8f0f2b0))
  - unify port in one microservice.([f8d45e5](https://github.com/opea-project/GenAIComps/commit/f8d45e5))
  - Pinecone update to OPEA([7c9f77b](https://github.com/opea-project/GenAIComps/commit/7c9f77b))
  - Refine Dataprep Code & UT([867e9d7](https://github.com/opea-project/GenAIComps/commit/867e9d7))
  - Support delete for Milvus vector db in Dataprep([767a14c](https://github.com/opea-project/GenAIComps/commit/767a14c))
  - Redis-dataprep: Make Redis connection consistent([cfaf5f0](https://github.com/opea-project/GenAIComps/commit/cfaf5f0))
  - Update Dataprep with Parameter Settings([55b457b](https://github.com/opea-project/GenAIComps/commit/55b457b))
  - Fix Dataprep Potential Error in get_file([04ff8bf](https://github.com/opea-project/GenAIComps/commit/04ff8bf))
  - Add dependency for pdf2image and OCR processing([9397522](https://github.com/opea-project/GenAIComps/commit/9397522))
  - Fix the data load issue for structured files ([40f1463](https://github.com/opea-project/GenAIComps/commit/40f1463))
  - Fix deps #568([c541d1d](https://github.com/opea-project/GenAIComps/commit/c541d1d))
- Other Components
  - Remove 'langsmith' per code review([dcf68a0](https://github.com/opea-project/GenAIComps/commit/dcf68a0))
  - Refine Nginx Component([69f9895](https://github.com/opea-project/GenAIComps/commit/69f9895))
  - Add logging for unified debug([fab1fbd](https://github.com/opea-project/GenAIComps/commit/fab1fbd))
  - Add Nginx Component for Service Forwarding([60cc0b0](https://github.com/opea-project/GenAIComps/commit/60cc0b0))
  - Fix line endings to LF([fecf4ac](https://github.com/opea-project/GenAIComps/commit/fecf4ac))
  - Add Assistant API for agent([f3a8935](https://github.com/opea-project/GenAIComps/commit/f3a8935))
  - doc: remove use of unknown highlight language([5bd8bda](https://github.com/opea-project/GenAIComps/commit/5bd8bda))
  - Update README.md([b271739](https://github.com/opea-project/GenAIComps/commit/b271739))
  - doc: fix multiple H1 headings([77e0e7b](https://github.com/opea-project/GenAIComps/commit/77e0e7b))
  - Add RagAgentDocGrader to agent comp([368c833](https://github.com/opea-project/GenAIComps/commit/368c833))
  - Update Milvus docker-compose.yaml([d3eefea](https://github.com/opea-project/GenAIComps/commit/d3eefea))
  - prompt_registry: Unifying API endpoint port([27a01ee](https://github.com/opea-project/GenAIComps/commit/27a01ee))
  - Minor SPDX header update([4712545](https://github.com/opea-project/GenAIComps/commit/4712545))
  - Modification to toxicity plugin PR ([63650d0](https://github.com/opea-project/GenAIComps/commit/63650d0))
  - Optional container build instructions([be4833f](https://github.com/opea-project/GenAIComps/commit/be4833f))
  - Add Uvicorn dependency([b2e2b1a](https://github.com/opea-project/GenAIComps/commit/b2e2b1a))
  - Support launch as Non-Root user in all published container images.([1eaf6b7](https://github.com/opea-project/GenAIComps/commit/1eaf6b7))
  - Update readme and remove empty readme([a61e434](https://github.com/opea-project/GenAIComps/commit/a61e434))
  - Refine Guardrails README and update model([7749ce3](https://github.com/opea-project/GenAIComps/commit/7749ce3))
  - Add codeowner([fb0ea3d](https://github.com/opea-project/GenAIComps/commit/fb0ea3d))
  - Remove unnecessary langsmith dependency([cc8cd70](https://github.com/opea-project/GenAIComps/commit/cc8cd70))
  - doc: add .gitignore([d39fee9](https://github.com/opea-project/GenAIComps/commit/d39fee9))
  - Add output evaluation for guardrails([62ca5bc](https://github.com/opea-project/GenAIComps/commit/62ca5bc))
  - Add ML detection strategy to PII detection guardrail([de27e6b](https://github.com/opea-project/GenAIComps/commit/de27e6b))
  - Add finetuning list job, cancel job, retrieve finetuning job feature([7bbbdaf](https://github.com/opea-project/GenAIComps/commit/7bbbdaf))
  - update finetuning api with openai format.([1ff81da](https://github.com/opea-project/GenAIComps/commit/1ff81da))
  - Add finetuning component ([ad0bb7c](https://github.com/opea-project/GenAIComps/commit/ad0bb7c))
  - Add toxicity detection microservice([97fdf54](https://github.com/opea-project/GenAIComps/commit/97fdf54))
  - fix searchqna readme([66cbbf3](https://github.com/opea-project/GenAIComps/commit/66cbbf3))
  - Fix typos and add definitions for toxicity detection microservice([9b8798a](https://github.com/opea-project/GenAIComps/commit/9b8798a))
- CI/CD/UT
  - Fix tts image build error([8b9dcdd](https://github.com/opea-project/GenAIComps/commit/8b9dcdd))
  - Add CD workflow.([5dedd04](https://github.com/opea-project/GenAIComps/commit/5dedd04))
  - Fix CI test changed file detect issue([cd83854](https://github.com/opea-project/GenAIComps/commit/cd83854))
  - add sudo in wf remove([1043336](https://github.com/opea-project/GenAIComps/commit/1043336))
  - adapt GenAIExample test structure refine([7ffaf24](https://github.com/opea-project/GenAIComps/commit/7ffaf24))
  - Freeze base images([61dba72](https://github.com/opea-project/GenAIComps/commit/61dba72))
  - Fix image build check waring.([2b14c63](https://github.com/opea-project/GenAIComps/commit/2b14c63))
  - Modify validate result check.([8a6079d](https://github.com/opea-project/GenAIComps/commit/8a6079d))
  - Fix requirement actions([2207503](https://github.com/opea-project/GenAIComps/commit/2207503))
  - Add validate result detection.([cf15b91](https://github.com/opea-project/GenAIComps/commit/cf15b91))
  - Check build fail and change port 8008 to 5025/5026.([5159aac](https://github.com/opea-project/GenAIComps/commit/5159aac))
  - Freeze requirements([5d9a855](https://github.com/opea-project/GenAIComps/commit/5d9a855))
  - Fix vllm-ray issue([0bd8215](https://github.com/opea-project/GenAIComps/commit/0bd8215))
  - Standardize image build.([a56a847](https://github.com/opea-project/GenAIComps/commit/a56a847))
  - clean local images before test([f36629a](https://github.com/opea-project/GenAIComps/commit/f36629a))
  - update test files([ab8ebc4](https://github.com/opea-project/GenAIComps/commit/ab8ebc4))
  - Fix validation failure without exit.([f46f1f3](https://github.com/opea-project/GenAIComps/commit/f46f1f3))
  - Update Microservice CI trigger path([3ffcff4](https://github.com/opea-project/GenAIComps/commit/3ffcff4))
  - Add E2E example test([ec4143e](https://github.com/opea-project/GenAIComps/commit/ec4143e))
  - Added unified ports for Chat History Microservice.([2098b91](https://github.com/opea-project/GenAIComps/commit/2098b91))
  - add secrets for test([cafcf1b](https://github.com/opea-project/GenAIComps/commit/cafcf1b))
  - [tests] normalize embedding and reranking endpoint docker image name([e3f29c3](https://github.com/opea-project/GenAIComps/commit/e3f29c3))
  - fix asr ut on hpu([9580298](https://github.com/opea-project/GenAIComps/commit/9580298))
  - update image build list([7185d6b](https://github.com/opea-project/GenAIComps/commit/7185d6b))
  - Add path check for dockerfiles in compose.yaml and change workflow name.([c45f8f0](https://github.com/opea-project/GenAIComps/commit/c45f8f0))
  - enhance docker image build([75d6bc9](https://github.com/opea-project/GenAIComps/commit/75d6bc9))
  - refactor build image with common action.yml([ee5b0f6](https://github.com/opea-project/GenAIComps/commit/ee5b0f6))
  - Fix '=' miss issues.([eb5cc8a](https://github.com/opea-project/GenAIComps/commit/eb5cc8a))
  - fix freeze workflow([945b9e4](https://github.com/opea-project/GenAIComps/commit/945b9e4))

### GenAIEval

- remove useless code.([1004d5b](https://github.com/opea-project/GenAIEval/commit/1004d5b))
- Unify benchmark tool based on stresscli library([71637c0](https://github.com/opea-project/GenAIEval/commit/71637c0))
- Fixed query list id out-of-range issue([7b719de](https://github.com/opea-project/GenAIEval/commit/7b719de))
- Add GMC chatqna benchmark script([6a390da](https://github.com/opea-project/GenAIEval/commit/6a390da))
- Add test example prompts for codegen([ebee50c](https://github.com/opea-project/GenAIEval/commit/ebee50c))
- doc: fix language on codeblock in README([85aef83](https://github.com/opea-project/GenAIEval/commit/85aef83))
- Fix metrics issue of CRUD([82c1654](https://github.com/opea-project/GenAIEval/commit/82c1654))
- Add benchmark stresscli scripts([9998cd7](https://github.com/opea-project/GenAIEval/commit/9998cd7))
- enhance multihop dataset accuracy([dfc2c1e](https://github.com/opea-project/GenAIEval/commit/dfc2c1e))
- doc: add Kubernetes platform-optimization README([7600db4](https://github.com/opea-project/GenAIEval/commit/7600db4))
- doc: fix platform optimization README based on PR#73 feedback([8c7eb1b](https://github.com/opea-project/GenAIEval/commit/8c7eb1b))
- update for faq benchmark([d754a84](https://github.com/opea-project/GenAIEval/commit/d754a84))
- Support e2e and first token P90 statistics([b07cd12](https://github.com/opea-project/GenAIEval/commit/b07cd12))

### GenAIInfra

- GMC
  - update GMC e2e and Doc([8a85364](https://github.com/opea-project/GenAIInfra/commit/8a85364))
  - Fixed some bugs for GMC yaml files([112295a](https://github.com/opea-project/GenAIInfra/commit/112295a))
  - Set up CD workflow for GMC([3d94844](https://github.com/opea-project/GenAIInfra/commit/3d94844))
  - GMC: Add GPU support for GMC.([119941e](https://github.com/opea-project/GenAIInfra/commit/119941e))
  - authN-authZ: add oauth2-proxy support for authentication and authorization together with GMC([488a1ca](https://github.com/opea-project/GenAIInfra/commit/488a1ca))
  - Output streaming support for the whole pipeline in GMC router([c412aa3](https://github.com/opea-project/GenAIInfra/commit/c412aa3))
  - re-org k8s manifests files for GMC and examples([d39b315](https://github.com/opea-project/GenAIInfra/commit/d39b315))
  - GMC: resource management([81060ab](https://github.com/opea-project/GenAIInfra/commit/81060ab))
  - Enable GMC helm installation test in CI([497ff61](https://github.com/opea-project/GenAIInfra/commit/497ff61))
  - Add helm chart for deploying GMC itself([a76c90f](https://github.com/opea-project/GenAIInfra/commit/a76c90f))
  - Add multiple endpoints for GMC pipeline via gmcrouter([da4f091](https://github.com/opea-project/GenAIInfra/commit/da4f091))
  - GMC: fix unsafe quoting([aa2730a](https://github.com/opea-project/GenAIInfra/commit/aa2730a))
  - fix: update doc for authN-authZ with oauth([54cd66f](https://github.com/opea-project/GenAIInfra/commit/54cd66f))
  - Troubleshooting guide for the validating webhook.([b47ec0c](https://github.com/opea-project/GenAIInfra/commit/b47ec0c))
  - Fix router bugs on max_new_tokens and dataprep gaudi yaml file([5735dd3](https://github.com/opea-project/GenAIInfra/commit/5735dd3))
  - Add dataprep microservice to chatQnA example([d9a0271](https://github.com/opea-project/GenAIInfra/commit/d9a0271))
  - Add HPA support to ChatQnA([cab7a88](https://github.com/opea-project/GenAIInfra/commit/cab7a88))
- HelmChart
  - Add manual helm e2e test flow([3b5f62e](https://github.com/opea-project/GenAIInfra/commit/3b5f62e))
  - Add script to generate manifests from helm charts([273cb1d](https://github.com/opea-project/GenAIInfra/commit/273cb1d))
  - ui: update chatqna helm chart readme and env name([a1d6d70](https://github.com/opea-project/GenAIInfra/commit/a1d6d70))
  - Update helm chart readme([656dcc6](https://github.com/opea-project/GenAIInfra/commit/656dcc6))
  - helm: fix tei/tgi/docsum([a270726](https://github.com/opea-project/GenAIInfra/commit/a270726))
  - helm: update data-prep to latest changes([625899b](https://github.com/opea-project/GenAIInfra/commit/625899b))
  - helm: Update helm manifest to address user raised issues([4319660](https://github.com/opea-project/GenAIInfra/commit/4319660))
  - helm: Support local embedding([73b5b65](https://github.com/opea-project/GenAIInfra/commit/73b5b65))
  - ui: add helm chart/manifests for conversational UI([9dbe550](https://github.com/opea-project/GenAIInfra/commit/9dbe550))
  - helm: Add K8S probes to retriever-usvc([af47b3c](https://github.com/opea-project/GenAIInfra/commit/af47b3c))
  - Enable google secrets in helm chart e2e workflow([7079049](https://github.com/opea-project/GenAIInfra/commit/7079049))
  - Helm/Manifest: Add K8S probe([d3fc939](https://github.com/opea-project/GenAIInfra/commit/d3fc939))
  - Enable helm/common tests in CI([fa8ef35](https://github.com/opea-project/GenAIInfra/commit/fa8ef35))
  - Helm: Add Nvidia GPU support for ChatQnA([868103b](https://github.com/opea-project/GenAIInfra/commit/868103b))
  - misc changes([b1182c4](https://github.com/opea-project/GenAIInfra/commit/b1182c4))
  - tgi: Update tgi version on xeon to latest-intel-cpu([c06bcea](https://github.com/opea-project/GenAIInfra/commit/c06bcea))
  - Fix typos in README([faa976b](https://github.com/opea-project/GenAIInfra/commit/faa976b))
  - Support HF_ENDPOINT([cf28da4](https://github.com/opea-project/GenAIInfra/commit/cf28da4))
  - Set model-volume default to tmp volume([b5c14cd](https://github.com/opea-project/GenAIInfra/commit/b5c14cd))
  - Enable using PV as model cache directory([c0d2ba6](https://github.com/opea-project/GenAIInfra/commit/c0d2ba6))
  - helm/manifest: Update to release v0.9([182183e](https://github.com/opea-project/GenAIInfra/commit/182183e))
- Others
  - Rename workflows to get better readable([cb31d05](https://github.com/opea-project/GenAIInfra/commit/cb31d05))
  - Add manual job to freeze image tags and versions after code freeze([c0f5e2f](https://github.com/opea-project/GenAIInfra/commit/c0f5e2f))
  - tgi: revert xeon version to 2.2.0([076e81e](https://github.com/opea-project/GenAIInfra/commit/076e81e))
  - Initial commit for Intel Gaudi Base Operator([c2a13d1](https://github.com/opea-project/GenAIInfra/commit/c2a13d1))
  - Add AudioQnA example and e2e test([1b50b73](https://github.com/opea-project/GenAIInfra/commit/1b50b73))
  - Reorg and rename CI workflows to follow the rules([2bf648c](https://github.com/opea-project/GenAIInfra/commit/2bf648c))
  - Fix errors in ci workflow([779e526](https://github.com/opea-project/GenAIInfra/commit/779e526))
  - Add e2e test for chatqna with switch mode enable([7b20273](https://github.com/opea-project/GenAIInfra/commit/7b20273))
  - Validating webhook implementation([df5f6f3](https://github.com/opea-project/GenAIInfra/commit/df5f6f3))
  - Enhance manually run image build workflow([e983c32](https://github.com/opea-project/GenAIInfra/commit/e983c32))
  - Add image build process on manual event([833dcec](https://github.com/opea-project/GenAIInfra/commit/833dcec))
  - CI: change chart e2e to support tag replacing([739788a](https://github.com/opea-project/GenAIInfra/commit/739788a))
  - Add e2e test for chatQnA with dataprep microservice([c1fd27f](https://github.com/opea-project/GenAIInfra/commit/c1fd27f))
  - Fix a bug of chart e2e workflow([86dd739](https://github.com/opea-project/GenAIInfra/commit/86dd739))
  - Improve chart e2e test workflow and scripts([70205e5](https://github.com/opea-project/GenAIInfra/commit/70205e5))
  - Correct TGI image tag for NV platform([629033b](https://github.com/opea-project/GenAIInfra/commit/629033b))
  - authN-authZ: change folder and split support([0c39b7b](https://github.com/opea-project/GenAIInfra/commit/0c39b7b))
  - fix errors of manual helm workflow([bd46dfd](https://github.com/opea-project/GenAIInfra/commit/bd46dfd))
  - update freeze tag manual workflow([c565909](https://github.com/opea-project/GenAIInfra/commit/c565909))
  - Update README([9480afc](https://github.com/opea-project/GenAIInfra/commit/9480afc))
  - improve cd workflows and add release document ([a4398b0](https://github.com/opea-project/GenAIInfra/commit/a4398b0))
  - Add some NVIDIA platform support docs and scripts([cad2fc3](https://github.com/opea-project/GenAIInfra/commit/cad2fc3))