Latest Benchmark Benchmark Archive Tesslate AI

Community

Hugging Face Discord

TFrameX

Docs GitHub

Model: Groq-Llama4-Scout

Date Tested: 05/17/2025 | Config: master_prompts_benchmark_config.json

Aggregate Scores

Overall Weighted Score

75.92 %

Avg. Prompt Score

74.98 %

Prompts Analyzed

10 / 10

Total Technical Quality

1349.1 / 1900.0

Total Prompt Adherence

189.0 / 245.0

Individual Prompt Results (10)

Prompt Status TQ (Earned/Max) Adherence (Earned/Max) Overall (%) Details
Personal Portfolio Landing Page (FPC001) SUCCESS 103.3 / 190.0
54%
29.0 / 29.0
100%
90.87 Full Report
Interactive Recipe Page (FPC002) SUCCESS 108.1 / 190.0
57%
24.0 / 26.0
92%
85.23 Full Report
Dashboard UI with Chart and Table (FPC003) SUCCESS 151.4 / 190.0
80%
13.0 / 22.0
59%
63.21 Full Report
ECommerce Product Page With Image Zoom and Reviews (FPC004) SUCCESS 152.1 / 190.0
80%
20.0 / 25.0
80%
80.01 Full Report
Interactive Kanban Board UI (FPC005) SUCCESS 145.8 / 190.0
77%
17.0 / 25.0
68%
69.75 Full Report
Social Media Feed Infinite Scroll (FPC006) SUCCESS 171.9 / 190.0
90%
12.0 / 22.0
55%
61.73 Full Report
Multi Step Wizard Form with Validation and Summary (FPC007) SUCCESS 143.0 / 190.0
75%
12.0 / 21.0
57%
60.77 Full Report
Realtime Collaborative Text Editor Simulation (FPC008) SUCCESS 122.0 / 190.0
64%
20.0 / 25.0
80%
76.84 Full Report
Flight Booking System Interface (FPC009) SUCCESS 116.2 / 190.0
61%
20.0 / 26.0
77%
73.77 Full Report
WYSIWYG Rich Text Editor with Image Upload and Tables (FPC010) SUCCESS 135.3 / 190.0
71%
22.0 / 24.0
92%
87.58 Full Report