vac:bi:rag:2025q4-rag-evaluation

Description

Setting up a evaluation process for the RAG responses of predefined questions. This will help to determind the gain or regretion from different change in the pipelines.

Task List

Setup an evaluation process

  • fully qualified name: vac:bi:rag:2025q4-rag-evaluation-pipeline
  • owner: nickninov
  • status: in progress (30%)
  • start-date: 2025/10/01
  • end-date: 2025/12/31

Description

https://github.com/status-im/data-docs/issues/99

Schedule note: Dates reflect quarter bounds; update when actual timing is known.

Deliverables

  • Monitoring data transformed for easier evaluation tracking.
  • List of evaluation questions drafted.