Skip to content

Commit

Permalink
doc: update 2024-2025 roadmap
Browse files Browse the repository at this point in the history
Replace roadmap with updated content approved by OPEA TSC

Signed-off-by: David B. Kinder <[email protected]>
  • Loading branch information
dbkinder authored and chickenrae committed Aug 5, 2024
1 parent f5b14a7 commit 02550bb
Showing 1 changed file with 244 additions and 122 deletions.
366 changes: 244 additions & 122 deletions roadmap/2024-2025.md
Original file line number Diff line number Diff line change
@@ -1,130 +1,252 @@
# OPEA 2024 - 2025 Roadmap

## Milestone 1 (May, Done)

- [x] Components contribution
- [x] ASR
- [x] Data Prep
- [x] Embedding
- [x] Guardrails
- [x] LLM(Gaudi TGI)
- [x] RAG Rerank
- [x] RAG Retrieval
- [x] TTS
- [x] RAG VectorDB
- [x] Open Telemetry support

- [x] UseCases/Examples
- [x] ChatQnA
- [x] CodeGen
- [x] CodeTrans

- [x] Cloud Native
- [x] OneClickOPEA on ChatQnA
- [x] OneClickOPEA on CodeGen

- [x] Evaluation & Others
- [x] CICD & Validation
- [x] lm-eval-harness
- [x] bigcode-eval-harness
- [x] End2End evaluation on GenAIComps & GenAIExamples

## Milestone 2 (June)

- [ ] Components contribution
- [ ] LLM on Xeon by vLLM + Ray, Ollama
- [ ] OVMS
- [ ] Prompting
- [ ] User Feedback Management
- [ ] MI6 Mega Components(MI6 RAG Service)

- [ ] UseCases/Examples
- [ ] DocSum
- [ ] SearchQnA
- [ ] FAQGen
- [ ] End-to-end RAG example using OPEA on Xeon and cloud

- [ ] Cloud Native
- [ ] OneClickOPEA on 2 or more examples

- [ ] Evaluation & Others
- [ ] CICD & Validation
- [ ] End2End evaluation on GenAIComps & GenAIExamples
- [ ] RAG evaluation

## Milestone 3 (July)

- [ ] Components contribution
- [ ] LLM on Gaudi by vLLM + Ray
- [ ] LVM on Gaudi by vLLM + Ray
- [ ] VectorDB(svs)
- [ ] Telemetry

- [ ] UseCases/Examples
- [ ] VisualQnA
- [ ] Windows Desktop App for AIPC

- [ ] Cloud Native
- [ ] OpenShift enablement
- [ ] GenAI Microservice Connector
- [ ] OneClickOPEA on 3 or more examples

- [ ] Evaluation & Others
- [ ] CICD & Validation
- [ ] End2End evaluation on GenAIComps & GenAIExamples

## Milestone 4 (Aug)

- [ ] Components contribution
- [ ] Documentation
- [ ] Automation test script

- [ ] UseCases/Examples
- [ ] Documentation
- [ ] Automation test script

- [ ] Cloud Native
- [ ] K8s Resource Management
- [ ] Documentation
- [ ] AutoScaler Analysis

- [ ] Evaluation & Others
- [ ] CICD & Validation
- [ ] End2End evaluation on GenAIComps & GenAIExamples

## Milestone 5 (from Sep to Dec)

- [ ] Components contribution
- [ ] More micro service components for image and video
- [ ] Fine-tuning support
- [ ] Knowledge graph support
- [ ] OPEA Playground support

- [ ] UseCases/Examples
- [ ] More use cases like language translation and AudioQnA

- [ ] Cloud Native
- [ ] Docker Containerization through Docker Composer
- [ ] Static tuning on Resource management for deployment

- [ ] Evaluation & Others
- [ ] CICD & Validation
- [ ] End2End evaluation
## May 2024

### Contribution

## Milestone 6 (2025)
- **Components**
- ASR
- Data Prep
- Embedding
- Guardrails
- LLM (Gaudi TGI)
- Rerank
- Retrieval
- TTS
- VectorDB

- [ ] Components contribution
- [ ] More micro service components per community request
- [ ] AI Agent support
- **Use Cases/Examples**
- ChatQnA
- CodeGen
- CodeTrans

- [ ] UseCases/Examples
- [ ] No code solution (GenAIStudio)
- [ ] OPEA Model Hub
- **Cloud Native**
- OneClick OPEA on ChatQnA
- OneClick OPEA on CodeGen
- GenAI microservice connector

- [ ] Cloud Native
- [ ] Dynamic tuning on Resource management through K8s
- **Evaluation & Others**
- CICD & Validation
- Eval: E2E (GenAIComps & GenAIExamples), lm-eval-harness, bigcode-eval-harness
- RAGAS evaluation service

- [ ] Evaluation & Others
- [ ] CICD & Validation
- [ ] End2End evaluation
### AI Models

- LLM: llama2 (7b, 13b, 70b), llama3 (8b, 70b), code-llama, Llama guard
- Embedding: BGE-base

### AI Tools Integration

- VectorDB: Chroma
- Framework: Langchain

### Deployment Type

- On Prem,IDC (Xeon, Gaudi)

## June 2024

### Contribution

- **Components**
- LLM (Xeon vLLM & Ray, Ollama)
- OVMS
- prompting
- user feedback management
- Mega Component (MI6 RAG service)

- **Use Cases/Examples**
- DocSum
- SearchQnA

- **Cloud Native**
- OneClick OPEA for 2 more examples
- GMC with switch support (dynamic pipelines)
- Helm charts/templates for custom yamls (refactoring)

- **Evaluation & Others**
- CICD & Validation
- Eval: E2E (GenAIComps & GenAIExamples) Gaudi (2) and CPUs in CICD cluster

### AI Models

- LLM: mistral-7B, mixtral-8x7B
- Embedding: E5-mistral-7b-instruct, all-mpnet-base-v2

### AI Tools Integration

- VectorDB: Pinecone, Redis
- Framework: Llamaindex, Haystack

### Deployment Type

- On Prem,IDC (Xeon, Gaudi)

## July 2024

### Contribution

- **Components**
- LVM (Gaudi vLLM & Ray)
- vectordb (svs)
- Gateway guardrail, Auth Z/N

- **Use Cases/Examples**
- FAQGen

- **Cloud Native**
- OpenShift enablement for OPEA
- OneClick OPEA for 3 more examples
- Security (Service Mesh, guardrails)

- **Evaluation & Others**
- CICD & Validation
- Eval: E2E (GenAIComps & GenAIExamples)

### AI Models

- LLM: Phi, Gemma
- Embedding: all-MiniLM-L6-v2, paraphrase-albert-small-v2

### AI Tools Integration

- VectorDB: PGVector, Qdrant

### Deployment Type

## Aug 2024

### Contribution

- **Components**
- Documentation
- Test automation script
- Telemetry

- **Use Cases/Examples**
- Documentation
- Test automation script

- **Cloud Native**
- Demo K8s resource management
- Documentation on autoscaler analysis

- **Evaluation & Others**
- CICD & Validation
- Eval: E2E (GenAIComps & GenAIExamples)

### AI Models

- Vision: llava
- Mixtral-8x22B

### AI Tools Integration

- VectorDB: Milvus

### Deployment Type

- Public Cloud AWS (Xeon CPU & NV GPU)

## Sep 2024

### Contribution

- **Components**
- Microservice for Image and Video

- **Use Cases/Examples**
- Text to Image generation
- Image to Video generation
- Playground (composable and configurable)

- **Cloud Native**

- **Evaluation & Others**
- CICD & Validation
- Eval: E2E (GenAIComps & GenAIExamples)

### AI Models

- Diffusion model:
- Stable Diffusion XL
- Stable Diffusion 3M
- Stable Video Diffusion

### AI Tools Integration

- VectorDB: Weaviate

### Deployment Type

## Q4 2024

### Contribution

- **Components**
- Fine-tuning E2E pipeline
- Knowledge Graph

- **Use Cases/Examples**
- Fine-tuning (Lora)
- AI Agent (single Agent with text and Audio as user interface)
- Closed source LLM
- GraphRAG

- **Cloud Native**
- Static tuning on Resource management for deployment

- **Evaluation & Others**
- CICD & Validation
- Eval: E2E (GenAIComps & GenAIExamples)

### AI Models

- LLM open: Grok 1
- LLM Close: GPT3.5/4/4o, Claude 3/3.5
- AWS Bedrock endpoint

### AI Tools Integration

- Knowledge graph: Neo4j
- Agent: LangGraph

### Deployment Type

- Public Cloud (Azure, GCP, Oracle, AWS)
- AI PC (Intel)

## Q1 2025

### Contribution

- **Components**
- more Microservice request from community
- Confidential Container

- **Use Cases/Examples**
- AI Agent (Multi Agent)
- Fine-tuning (Adpative)
- Long context window (>1M)
- GenAI Studio

- **Cloud Native**
- Dynamic tuning on Resource management through K8s

- **Evaluation & Others**
- CICD & Validation
- Eval: E2E (GenAIComps & GenAIExamples)

### AI Models

- LLM: SetFit
- More to be defined

### AI Tools Integration

- AutoGen, CrewAI

### Deployment Type

- Public Cloud (tier2 CSP)
- AI PC (others)

0 comments on commit 02550bb

Please sign in to comment.