vac:bi:rag:2025q4-rag-context-improvement

Description

Extract the transcript from Youtube video to use it for RAG context and other possibility

Task List

Add Code Chunking to the RAG

  • fully qualified name: vac:bi:rag:2025q4-rag-context-code
  • owner: nickninov
  • status: in progress (5%)
  • start-date: 2025/10/01
  • end-date: 2025/12/31

Description

https://github.com/status-im/data-docs/issues/82

Schedule note: Dates reflect quarter bounds; update when actual timing is known.

Deliverables

  • Add task to dagster ETL to include code repository to the RAG context
  • Write documentation in Data-docs.

Google Meeting transcript

  • fully qualified name: vac:bi:rag:2025q4-rag-context-google-meet
  • owner: nickninov
  • status: in progress (5%)
  • start-date: 2025/10/01
  • end-date: 2025/12/31

Description

Include transcript from Google Meeting to the RAG context. https://github.com/status-im/data-docs/issues/68

Schedule note: Dates reflect quarter bounds; update when actual timing is known.

Deliverables

  • Add task to dagster ETL to include meeting transcript to the RAG context.
  • Write documentation in Data-docs.