vac:bi:rag:2025q3-rag-evaluation-pipeline
Description
Setting up a evaluation process for the RAG responses of predefined questions. This will help to determind the gain or regretion from different change in the pipelines.
Task List
Setup an evaluation process
- fully qualified name:
vac:bi:rag:2025q4-rag-evaluation-pipeline
- owner: nickninov
- status: todo
- start-date:
- end-date:
Description
https://github.com/status-im/data-docs/issues/99
Deliverables
- List of questions to use as evaluation process
- Pipeline to test the response
- Evaluation process: can be manual or automatic