Model: Groq-DeepSeek-70b

Date Tested: 05/17/2025 | Config: master_prompts_benchmark_config.json

Aggregate Scores

Overall Weighted Score: 76.67%
Avg. Prompt Score: 75.73%
Prompts Analyzed: 10 / 10
Total Technical Quality: 1513.7 / 1900.0
Total Prompt Adherence: 186.0 / 245.0
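The report does not say how the Overall Weighted Score is derived from the two totals. A minimal sketch, assuming a 20% weight on technical quality and an 80% weight on prompt adherence, reproduces the reported 76.67%. The weights are inferred from the published figures, not read from master_prompts_benchmark_config.json, and the function name `weighted_score` is hypothetical.

```python
def weighted_score(tq_earned, tq_max, adh_earned, adh_max, tq_weight=0.2):
    """Blend technical quality (TQ) and prompt adherence into one percentage.

    tq_weight=0.2 is an assumption inferred from the reported numbers;
    the real benchmark config may define the weighting differently.
    """
    tq_ratio = tq_earned / tq_max
    adh_ratio = adh_earned / adh_max
    return 100.0 * (tq_weight * tq_ratio + (1.0 - tq_weight) * adh_ratio)


# Totals taken from the Aggregate Scores section.
overall = weighted_score(1513.7, 1900.0, 186.0, 245.0)
print(f"{overall:.2f} %")  # 76.67 %
```

Under this assumption the weighted score is computed from the summed points, which is why it differs slightly from the Avg. Prompt Score (the plain mean of the ten per-prompt percentages).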

Individual Prompt Results (10)

Prompt | Status | TQ (Earned/Max) | TQ % | Adherence (Earned/Max) | Adherence % | Overall (%)
Personal Portfolio Landing Page (FPC001) | SUCCESS | 170.9 / 190.0 | 90% | 29.0 / 29.0 | 100% | 97.99
Interactive Recipe Page (FPC002) | SUCCESS | 133.3 / 190.0 | 70% | 22.0 / 26.0 | 85% | 81.72
Dashboard UI with Chart and Table (FPC003) | SUCCESS | 163.8 / 190.0 | 86% | 14.0 / 22.0 | 64% | 68.15
ECommerce Product Page With Image Zoom and Reviews (FPC004) | SUCCESS | 128.1 / 190.0 | 67% | 18.0 / 25.0 | 72% | 71.08
Interactive Kanban Board UI (FPC005) | SUCCESS | 154.8 / 190.0 | 81% | 18.0 / 25.0 | 72% | 73.89
Social Media Feed Infinite Scroll (FPC006) | SUCCESS | 171.6 / 190.0 | 90% | 8.0 / 22.0 | 36% | 47.15
Multi Step Wizard Form with Validation and Summary (FPC007) | SUCCESS | 162.4 / 190.0 | 85% | 13.0 / 21.0 | 62% | 66.62
Realtime Collaborative Text Editor Simulation (FPC008) | SUCCESS | 161.0 / 190.0 | 85% | 20.0 / 25.0 | 80% | 80.95
Flight Booking System Interface (FPC009) | SUCCESS | 131.2 / 190.0 | 69% | 20.0 / 26.0 | 77% | 75.35
WYSIWYG Rich Text Editor with Image Upload and Tables (FPC010) | SUCCESS | 136.6 / 190.0 | 72% | 24.0 / 24.0 | 100% | 94.38
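The per-prompt Overall (%) values can be cross-checked against the earned and maximum points in each row. The sketch below assumes the same 20% technical quality / 80% adherence split that the reported figures appear to follow; that split is an inference from the numbers, not a documented setting, and `RESULTS` and `overall_pct` are hypothetical names used only for illustration.

```python
from statistics import mean

TQ_WEIGHT = 0.2  # assumed weight, inferred from the table; not from the config file

# (prompt id, TQ earned, TQ max, adherence earned, adherence max, reported overall %)
RESULTS = [
    ("FPC001", 170.9, 190.0, 29.0, 29.0, 97.99),
    ("FPC002", 133.3, 190.0, 22.0, 26.0, 81.72),
    ("FPC003", 163.8, 190.0, 14.0, 22.0, 68.15),
    ("FPC004", 128.1, 190.0, 18.0, 25.0, 71.08),
    ("FPC005", 154.8, 190.0, 18.0, 25.0, 73.89),
    ("FPC006", 171.6, 190.0, 8.0, 22.0, 47.15),
    ("FPC007", 162.4, 190.0, 13.0, 21.0, 66.62),
    ("FPC008", 161.0, 190.0, 20.0, 25.0, 80.95),
    ("FPC009", 131.2, 190.0, 20.0, 26.0, 75.35),
    ("FPC010", 136.6, 190.0, 24.0, 24.0, 94.38),
]


def overall_pct(tq, tq_max, adh, adh_max):
    """Weighted blend of the two ratios, expressed as a percentage."""
    return 100.0 * (TQ_WEIGHT * tq / tq_max + (1.0 - TQ_WEIGHT) * adh / adh_max)


computed = {pid: overall_pct(tq, tq_max, adh, adh_max)
            for pid, tq, tq_max, adh, adh_max, _ in RESULTS}

# Prompts whose recomputed score disagrees with the reported one at two decimals.
mismatches = [pid for pid, *_, reported in RESULTS
              if abs(round(computed[pid], 2) - reported) > 1e-9]

avg_prompt_score = mean(computed.values())
print(mismatches)                 # []
print(f"{avg_prompt_score:.2f}")  # 75.73
```

With these weights every row matches its reported score, and the mean of the ten values matches the Avg. Prompt Score. The heavy adherence weighting explains why FPC006, despite one of the highest technical quality totals (171.6 / 190.0), has the lowest overall score.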