Multi-Step Reasoning Demo

Evaluate how an LLM breaks down and solves multi-step problems. Watch the chain-of-thought process unfold step by step.

Configuration
Challenge
Select a model and challenge, then click "Run".
Reasoning Chain

Building reasoning chain...
Answer
Final answer will appear here
Quality Breakdown
MetricScoreAssessment
— Run a test to see results —