Latest Benchmark Benchmark Archive Tesslate AI

Community

Hugging Face Discord

TFrameX

Docs GitHub

Model: Groq-Llama4-Scout

Date Tested: 07/16/2025 | Config: master_prompts_benchmark_config.json

Aggregate Scores

Overall Weighted Score

69.71 %

Avg. Prompt Score

68.89 %

Prompts Analyzed

10 / 10

Total Technical Quality

576.0 / 1100.0

Total Prompt Adherence

189.0 / 245.0

Individual Prompt Results (10)

Prompt Status TQ (Earned/Max) Adherence (Earned/Max) Overall (%) Details
Personal Portfolio Landing Page (FPC001) SUCCESS 30.0 / 110.0
27%
29.0 / 29.0
100%
78.18 Full Report
Interactive Recipe Page (FPC002) SUCCESS 35.0 / 110.0
32%
24.0 / 26.0
92%
74.16 Full Report
Dashboard UI with Chart and Table (FPC003) SUCCESS 79.0 / 110.0
72%
13.0 / 22.0
59%
62.91 Full Report
ECommerce Product Page With Image Zoom and Reviews (FPC004) SUCCESS 77.0 / 110.0
70%
20.0 / 25.0
80%
77.00 Full Report
Interactive Kanban Board UI (FPC005) SUCCESS 72.0 / 110.0
65%
17.0 / 25.0
68%
67.24 Full Report
Social Media Feed Infinite Scroll (FPC006) SUCCESS 56.0 / 110.0
51%
12.0 / 22.0
55%
53.45 Full Report
Multi Step Wizard Form with Validation and Summary (FPC007) SUCCESS 71.0 / 110.0
65%
12.0 / 21.0
57%
59.36 Full Report
Realtime Collaborative Text Editor Simulation (FPC008) SUCCESS 50.0 / 110.0
45%
20.0 / 25.0
80%
69.64 Full Report
Flight Booking System Interface (FPC009) SUCCESS 44.0 / 110.0
40%
20.0 / 26.0
77%
65.85 Full Report
WYSIWYG Rich Text Editor with Image Upload and Tables (FPC010) SUCCESS 62.0 / 110.0
56%
22.0 / 24.0
92%
81.08 Full Report