LLM Benchmark Rankings

Compare and rate the top 10 Large Language Models across coding, agentic ability, reasoning, math and language capabilities.

Coding

Code generation and understanding

Agentic Ability

Autonomous decision making

Reasoning

Logical problem solving

Math

Mathematical computation

Language

Natural language processing

Rate Models

Share your experience with these language models by rating their performance across different dimensions.

Start Rating

View Leaderboard

Explore detailed comparisons and rankings of the top language models based on various performance metrics.

View Rankings

Current Rankings

ModelCodingAgenticReasoningMathLanguageAverage
GPT-4o98.598.59.58.9
Claude 3.7 Sonnet8.588.5898.4
Gemini 2.5 Pro88.587.58.58.1