Second Workshop on Video Large Language Models

Challenge Tracks

Challenge track details for CVPR 2026 are being finalized. Stay tuned for announcements!

  1. Dense Modification Composed Video Retrieval
     Reasoning-based video retrieval requiring models to interpret causal, temporal, and semantic modifications across video pairs.

  2. See Beyond Frames: The Implicit Video Reasoning Challenge
     Testing models' ability to infer spatial, narrative, and causal relationships that extend beyond what is directly visible in video frames.

  3. TimeLogic QA: Evaluating Temporal Reasoning in Videos
     Assessing temporal logical understanding including operators like Before, After, Until, Since, and event ordering across video sequences.

Each track tests a different capability of video large language models (VidLLMs). Challenge links will be provided when available.

Challenge Prizes

Prizes will be awarded across the challenge tracks; details are to be determined. Winners will be decided by leaderboard ranking and review.

Key Dates