Model: Groq-Llama3.3-70b

Date Tested: 05/17/2025 | Config: master_prompts_benchmark_config.json

Aggregate Scores

Overall Weighted Score: 76.30%

Avg. Prompt Score: 75.47%

Prompts Analyzed: 10 / 10

Total Technical Quality: 1416.5 / 1900.0

Total Prompt Adherence: 188.0 / 245.0
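
The overall weighted score is consistent with a 20/80 blend of the technical quality and prompt adherence percentages. This page does not state the weights; they are inferred from the reported numbers, so the sketch below is a sanity check under that assumption, not the benchmark's documented formula. The function and constant names are hypothetical.

```python
# Assumed weights: inferred from the reported figures, not documented on this page.
TQ_WEIGHT = 0.2
ADHERENCE_WEIGHT = 0.8


def weighted_score(tq_earned, tq_max, adh_earned, adh_max):
    """Blend the two percentages using the assumed 20/80 weights."""
    tq_pct = 100.0 * tq_earned / tq_max
    adh_pct = 100.0 * adh_earned / adh_max
    return TQ_WEIGHT * tq_pct + ADHERENCE_WEIGHT * adh_pct


# Totals taken from the aggregate figures above.
overall = weighted_score(1416.5, 1900.0, 188.0, 245.0)
print(f"{overall:.2f}%")  # prints 76.30%
```

Because adherence carries most of the weight under this assumption, the headline score tracks the 188.0 / 245.0 adherence ratio (about 76.7%) more closely than the technical quality ratio (about 74.6%).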

Individual Prompt Results (10)

| Prompt | Status | TQ (Earned / Max) | TQ % | Adherence (Earned / Max) | Adherence % | Overall (%) |
|---|---|---|---|---|---|---|
| Personal Portfolio Landing Page (FPC001) | SUCCESS | 136.3 / 190.0 | 72% | 29.0 / 29.0 | 100% | 94.35 |
| Interactive Recipe Page (FPC002) | SUCCESS | 132.3 / 190.0 | 70% | 19.0 / 26.0 | 73% | 72.39 |
| Dashboard UI with Chart and Table (FPC003) | SUCCESS | 174.2 / 190.0 | 92% | 14.0 / 22.0 | 64% | 69.25 |
| ECommerce Product Page With Image Zoom and Reviews (FPC004) | SUCCESS | 136.3 / 190.0 | 72% | 20.0 / 25.0 | 80% | 78.35 |
| Interactive Kanban Board UI (FPC005) | SUCCESS | 136.2 / 190.0 | 72% | 20.0 / 25.0 | 80% | 78.34 |
| Social Media Feed Infinite Scroll (FPC006) | SUCCESS | 130.5 / 190.0 | 69% | 12.0 / 22.0 | 55% | 57.37 |
| Multi Step Wizard Form with Validation and Summary (FPC007) | SUCCESS | 163.2 / 190.0 | 86% | 12.0 / 21.0 | 57% | 62.89 |
| Realtime Collaborative Text Editor Simulation (FPC008) | SUCCESS | 161.0 / 190.0 | 85% | 20.0 / 25.0 | 80% | 80.95 |
| Flight Booking System Interface (FPC009) | SUCCESS | 114.2 / 190.0 | 60% | 20.0 / 26.0 | 77% | 73.56 |
| WYSIWYG Rich Text Editor with Image Upload and Tables (FPC010) | SUCCESS | 132.3 / 190.0 | 70% | 22.0 / 24.0 | 92% | 87.26 |
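
The per-prompt rows can be cross-checked against the aggregate figures. The sketch below recomputes each prompt's overall percentage, the totals, and the average prompt score from the earned and maximum points. It assumes a 20% technical quality / 80% adherence weighting, which is inferred from the reported values and not stated in this report. The variable and function names are hypothetical.

```python
# (prompt id, TQ earned, adherence earned, adherence max, reported overall %)
RESULTS = [
    ("FPC001", 136.3, 29.0, 29.0, 94.35),
    ("FPC002", 132.3, 19.0, 26.0, 72.39),
    ("FPC003", 174.2, 14.0, 22.0, 69.25),
    ("FPC004", 136.3, 20.0, 25.0, 78.35),
    ("FPC005", 136.2, 20.0, 25.0, 78.34),
    ("FPC006", 130.5, 12.0, 22.0, 57.37),
    ("FPC007", 163.2, 12.0, 21.0, 62.89),
    ("FPC008", 161.0, 20.0, 25.0, 80.95),
    ("FPC009", 114.2, 20.0, 26.0, 73.56),
    ("FPC010", 132.3, 22.0, 24.0, 87.26),
]

TQ_MAX = 190.0      # every prompt has the same technical quality maximum
TQ_WEIGHT = 0.2     # assumption: inferred from the data, not documented
ADH_WEIGHT = 0.8    # assumption: inferred from the data, not documented


def prompt_score(tq, adh, adh_max):
    """Weighted blend of the TQ and adherence percentages for one prompt."""
    return TQ_WEIGHT * 100.0 * tq / TQ_MAX + ADH_WEIGHT * 100.0 * adh / adh_max


scores = [prompt_score(tq, adh, adh_max) for _, tq, adh, adh_max, _ in RESULTS]

# Largest gap between a recomputed score and the reported one.
max_gap = max(abs(s - row[4]) for s, row in zip(scores, RESULTS))

total_tq = round(sum(row[1] for row in RESULTS), 1)
total_adh = sum(row[2] for row in RESULTS)
total_adh_max = sum(row[3] for row in RESULTS)
avg_prompt_score = sum(scores) / len(scores)

print(f"TQ total: {total_tq} / {TQ_MAX * len(RESULTS)}")
print(f"Adherence total: {total_adh} / {total_adh_max}")
print(f"Avg. prompt score: {avg_prompt_score:.2f}%")
```

The average prompt score is the unweighted mean of the ten per-prompt percentages, while the overall weighted score is computed from pooled totals. Prompts with larger adherence maxima therefore count slightly more in the latter, which is why the two headline numbers differ.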