Multi-Step Reasoning Demo
Evaluate how an LLM breaks down and solves multi-step problems. Watch the chain-of-thought process unfold step by step.
Configuration
Challenge
Select a model and challenge, then click "Run".
Reasoning Chain
Building reasoning chain...
Answer
Final answer will appear here
Quality Breakdown
MetricScoreAssessment
— Run a test to see results —