Use full model names in infographic and add sources

2025-12-25 00:55:02 +04:00
parent 1b9d4bf7fb
commit 0c12539a51
1 changed files with 37 additions and 31 deletions
--- a/README.md
+++ b/README.md
@@ -27,7 +27,7 @@ The global landscape for AI-powered development is shifting. While Western tools
 GLM-4.7 demonstrates competitive performance against the newest generation of flagship models, including **Claude Sonnet 4.5** and **GPT-5.1 High**, based on the official Z.ai Technical Report (Dec 2025).

 ### 📊 2025 AI Coding Model Performance Comparison
-*Note: Best scores per category are highlighted in $\color{green}{\text{green}}$. Data sourced from [Z.ai Official Blog](https://z.ai/blog/glm-4.7).*
+*Note: Best scores per category are highlighted in $\color{green}{\text{green}}$.*

 <div align="center">

@@ -39,67 +39,73 @@ GLM-4.7 demonstrates competitive performance against the newest generation of fl
 ║  ┌────────────────────────────────────────────────────────────────────────────┐        ║
 ║  │  🧮 MATH (AIME 25)                                                 │        ║
 ║  │  ┌─────────────────────────────────────────────────────────────────────┐       │        ║
-║  │  │  GLM-4.7  ████████████████████ 95.7% 🥇                     │       │        ║
-║  │  │  Gemini   ███████████████████░ 95.0%                          │       │        ║
-║  │  │  GPT-5.1  ██████████████████░░ 94.0%                          │       │        ║
-║  │  │  DeepSeek ███████████████░░░░ 93.1%                          │       │        ║
-║  │  │  Claude   ███████████░░░░░░░ 87.0%                          │       │        ║
+║  │  │  GLM-4.7           ████████████████████ 95.7% 🥇           │       │        ║
+║  │  │  Gemini 3.0 Pro     ███████████████████░ 95.0%                │       │        ║
+║  │  │  GPT-5.1 High       ██████████████████░░ 94.0%                │       │        ║
+║  │  │  DeepSeek-V3.2      ███████████████░░░░ 93.1%                │       │        ║
+║  │  │  Claude Sonnet 4.5  ███████████░░░░░░░ 87.0%                │       │        ║
 ║  │  └─────────────────────────────────────────────────────────────────────┘       │        ║
+║  │  Source: [Z.ai](https://z.ai/blog/glm-4.7)                              │        ║
 ║  └────────────────────────────────────────────────────────────────────────────┘        ║
 ║                                                                              ║
 ║  ┌────────────────────────────────────────────────────────────────────────────┐        ║
 ║  │  💻 CODING (LiveCodeBench v6)                                        │        ║
 ║  │  ┌─────────────────────────────────────────────────────────────────────┐       │        ║
-║  │  │  Gemini   ████████████████████ 90.7% 🥇                     │       │        ║
-║  │  │  GPT-5.1  ███████████████████░ 87.0%                          │       │        ║
-║  │  │  GLM-4.7  ████████████████░░░ 84.9%                          │       │        ║
-║  │  │  DeepSeek ███████████████░░░░ 83.3%                          │       │        ║
-║  │  │  Claude   ██████████░░░░░░░░ 64.0%                          │       │        ║
+║  │  │  Gemini 3.0 Pro     ████████████████████ 90.7% 🥇           │       │        ║
+║  │  │  GPT-5.1 High       ███████████████████░ 87.0%                │       │        ║
+║  │  │  GLM-4.7           ████████████████░░░ 84.9%                │       │        ║
+║  │  │  DeepSeek-V3.2      ███████████████░░░░ 83.3%                │       │        ║
+║  │  │  Claude Sonnet 4.5  ██████████░░░░░░░░ 64.0%                │       │        ║
 ║  │  └─────────────────────────────────────────────────────────────────────┘       │        ║
+║  │  Source: [Z.ai](https://z.ai/blog/glm-4.7)                              │        ║
 ║  └────────────────────────────────────────────────────────────────────────────┘        ║
 ║                                                                              ║
 ║  ┌────────────────────────────────────────────────────────────────────────────┐        ║
 ║  │  🔬 SCIENCE (GPQA-Diamond)                                           │        ║
 ║  │  ┌─────────────────────────────────────────────────────────────────────┐       │        ║
-║  │  │  Gemini   ████████████████████ 91.9% 🥇                     │       │        ║
-║  │  │  GPT-5.1  ███████████████████░ 88.1%                          │       │        ║
-║  │  │  GLM-4.7  ████████████████░░░ 85.7%                          │       │        ║
-║  │  │  Claude   ██████████████░░░░░ 83.4%                          │       │        ║
-║  │  │  DeepSeek ██████████████░░░░░░ 82.4%                          │       │        ║
+║  │  │  Gemini 3.0 Pro     ████████████████████ 91.9% 🥇           │       │        ║
+║  │  │  GPT-5.1 High       ███████████████████░ 88.1%                │       │        ║
+║  │  │  GLM-4.7           ████████████████░░░ 85.7%                │       │        ║
+║  │  │  Claude Sonnet 4.5  ██████████████░░░░░ 83.4%                │       │        ║
+║  │  │  DeepSeek-V3.2      ██████████████░░░░░░ 82.4%                │       │        ║
 ║  │  └─────────────────────────────────────────────────────────────────────┘       │        ║
+║  │  Source: [Z.ai](https://z.ai/blog/glm-4.7)                              │        ║
 ║  └────────────────────────────────────────────────────────────────────────────┘        ║
 ║                                                                              ║
 ║  ┌────────────────────────────────────────────────────────────────────────────┐        ║
 ║  │  🧠 LOGIC (HLE w/Tools)                                             │        ║
 ║  │  ┌─────────────────────────────────────────────────────────────────────┐       │        ║
-║  │  │  Gemini   ██████████░░░░░░░░ 45.8% 🥇                     │       │        ║
-║  │  │  GLM-4.7  ██████████░░░░░░░░ 42.8%                          │       │        ║
-║  │  │  GPT-5.1  ██████████░░░░░░░░ 42.7%                          │       │        ║
-║  │  │  DeepSeek █████████░░░░░░░░░ 40.8%                          │       │        ║
-║  │  │  Claude   ███████░░░░░░░░░░ 32.0%                          │       │        ║
+║  │  │  Gemini 3.0 Pro     ██████████░░░░░░░░ 45.8% 🥇           │       │        ║
+║  │  │  GLM-4.7           ██████████░░░░░░░░ 42.8%                │       │        ║
+║  │  │  GPT-5.1 High       ██████████░░░░░░░░ 42.7%                │       │        ║
+║  │  │  DeepSeek-V3.2      █████████░░░░░░░░░ 40.8%                │       │        ║
+║  │  │  Claude Sonnet 4.5  ███████░░░░░░░░░░ 32.0%                │       │        ║
 ║  │  └─────────────────────────────────────────────────────────────────────┘       │        ║
+║  │  Source: [Z.ai](https://z.ai/blog/glm-4.7)                              │        ║
 ║  └────────────────────────────────────────────────────────────────────────────┘        ║
 ║                                                                              ║
 ║  ┌────────────────────────────────────────────────────────────────────────────┐        ║
 ║  │  ⚙️ ENGINEERING (SWE-bench)                                          │        ║
 ║  │  ┌─────────────────────────────────────────────────────────────────────┐       │        ║
-║  │  │  Claude   ███████████████████░ 77.2% 🥇                     │       │        ║
-║  │  │  GPT-5.1  █████████████████░░░ 76.3%                          │       │        ║
-║  │  │  Gemini   ███████████████░░░░ 76.2%                          │       │        ║
-║  │  │  GLM-4.7  ██████████████░░░░░ 73.8%                          │       │        ║
-║  │  │  DeepSeek █████████████░░░░░░ 73.1%                          │       │        ║
+║  │  │  Claude Sonnet 4.5  ███████████████████░ 77.2% 🥇           │       │        ║
+║  │  │  GPT-5.1 High       █████████████████░░░ 76.3%                │       │        ║
+║  │  │  Gemini 3.0 Pro     ███████████████░░░░ 76.2%                │       │        ║
+║  │  │  GLM-4.7           ██████████████░░░░░ 73.8%                │       │        ║
+║  │  │  DeepSeek-V3.2      █████████████░░░░░░ 73.1%                │       │        ║
 ║  │  └─────────────────────────────────────────────────────────────────────┘       │        ║
+║  │  Source: [SWE-bench](https://github.com/princeton-nlp/SWE-bench)          │        ║
 ║  └────────────────────────────────────────────────────────────────────────────┘        ║
 ║                                                                              ║
 ║  ┌────────────────────────────────────────────────────────────────────────────┐        ║
 ║  │  🤖 AGENTIC (τ²-Bench)                                              │        ║
 ║  │  ┌─────────────────────────────────────────────────────────────────────┐       │        ║
-║  │  │  Gemini   ████████████████████ 90.7% 🥇                     │       │        ║
-║  │  │  GLM-4.7  ██████████████████░░ 87.4%                          │       │        ║
-║  │  │  Claude   ██████████████████░░ 87.2%                          │       │        ║
-║  │  │  DeepSeek ███████████████░░░░ 85.3%                          │       │        ║
-║  │  │  GPT-5.1  ███████████░░░░░░░ 82.7%                          │       │        ║
+║  │  │  Gemini 3.0 Pro     ████████████████████ 90.7% 🥇           │       │        ║
+║  │  │  GLM-4.7           ██████████████████░░ 87.4%                │       │        ║
+║  │  │  Claude Sonnet 4.5  ██████████████████░░ 87.2%                │       │        ║
+║  │  │  DeepSeek-V3.2      ███████████████░░░░ 85.3%                │       │        ║
+║  │  │  GPT-5.1 High       ███████████░░░░░░░ 82.7%                │       │        ║
 ║  │  └─────────────────────────────────────────────────────────────────────┘       │        ║
+║  │  Source: [Z.ai](https://z.ai/blog/glm-4.7)                              │        ║
 ║  └────────────────────────────────────────────────────────────────────────────┘        ║
 ║                                                                              ║
 ║  🎯 Key Wins: Math (1st) | Agentic (2nd) | Logic (2nd) | Coding (3rd)           ║