Latest Benchmark Benchmark Archive Tesslate AI

Community

Hugging Face Discord

TFrameX

Docs GitHub

Model: Gemini-2.5-Flash

Date Tested: 07/16/2025 | Config: master_prompts_benchmark_config.json

Aggregate Scores

Overall Weighted Score

78.57 %

Avg. Prompt Score

78.09 %

Prompts Analyzed

9 / 10

Total Technical Quality

697.0 / 990.0

Total Prompt Adherence

183.0 / 223.0

Individual Prompt Results (10)

Prompt Status TQ (Earned/Max) Adherence (Earned/Max) Overall (%) Details
Personal Portfolio Landing Page (FPC001) SUCCESS 88.0 / 110.0
80%
26.0 / 29.0
90%
86.76 Full Report
Interactive Recipe Page (FPC002) SUCCESS 63.0 / 110.0
57%
26.0 / 26.0
100%
87.18 Full Report
Dashboard UI with Chart and Table (FPC003) WEBDRIVER_ERROR
Error (hover)
0.0 / 0.0
0%
0.0 / 0.0
0%
0.00 Full Report
ECommerce Product Page With Image Zoom and Reviews (FPC004) SUCCESS 95.0 / 110.0
86%
21.0 / 25.0
84%
84.71 Full Report
Interactive Kanban Board UI (FPC005) SUCCESS 57.0 / 110.0
52%
18.0 / 25.0
72%
65.95 Full Report
Social Media Feed Infinite Scroll (FPC006) SUCCESS 96.0 / 110.0
87%
12.0 / 22.0
55%
64.36 Full Report
Multi Step Wizard Form with Validation and Summary (FPC007) SUCCESS 78.0 / 110.0
71%
15.0 / 21.0
71%
71.27 Full Report
Realtime Collaborative Text Editor Simulation (FPC008) SUCCESS 76.0 / 110.0
69%
20.0 / 25.0
80%
76.73 Full Report
Flight Booking System Interface (FPC009) SUCCESS 57.0 / 110.0
52%
21.0 / 26.0
81%
72.08 Full Report
WYSIWYG Rich Text Editor with Image Upload and Tables (FPC010) SUCCESS 87.0 / 110.0
79%
24.0 / 24.0
100%
93.73 Full Report