vac:dst:codex:2026q1-codex-evaluation

Description

Test Codex on each new version or requested feature and look for regressions if this is required. Help Codex with testing and benchmarking new features.

Background

We want to learn specific, actionable information about Codex’s behaviour and how it is evolving over time with each new release and with each thing we are specifically asked to check and test.

We will use a combination of real world testing, theoretical analysis and experiments.

Task List

Filehsharing client

  • fully qualified name: vac:dst:codex:2026q1-codex-evaluation:filesharing-client
  • owner: TBD
  • status: not started
  • start-date: 2026/01/01
  • end-date: 2026/03/31

Description

Assist the Codex team to check the functionality of the filesharing client implementation under heavy/big workloads.

Deliverables

  • Reports:
  • Related PRs if apply:

Filehsharing client + mix

  • fully qualified name: vac:dst:codex:2026q1-codex-evaluation:filesharing-client-mix
  • owner: TBD
  • status: not started
  • start-date: 2026/01/01
  • end-date: 2026/03/31

Description

Assist the Codex team to check the functionality of the filesharing client implementation under heavy/big workloads using mix protocol.

Deliverables

  • Reports:
  • Related PRs if apply: