Tool Use & Function Calling Demo
Evaluate how well an LLM selects and uses tools from correct invocation to parameter accuracy and error recovery.
🔧
Coming Soon
This demo will present tool-use scenarios where an LLM must choose the correct function, pass proper parameters, and handle errors gracefully.
In Progress
Planned Features
✓ Tool Selection
Model must choose between 3-5 available tools to solve a given task.
✓ Parameter Accuracy
Check that tool arguments are correct types and within valid ranges.
✓ Error Recovery
Evaluate how the model handles tool failures and retries.
✓ Tool Composition
Multi-step workflows requiring sequential tool calls with chaining.