Latest Benchmark Benchmark Archive Tesslate AI

Community

Hugging Face Discord

TFrameX

Docs GitHub

Model: Groq-Llama3.3-70b

Date Tested: 07/16/2025 | Config: master_prompts_benchmark_config.json

Aggregate Scores

Overall Weighted Score

72.10 %

Avg. Prompt Score

71.37 %

Prompts Analyzed

10 / 10

Total Technical Quality

674.0 / 1100.0

Total Prompt Adherence

188.0 / 245.0

Individual Prompt Results (10)

Prompt Status TQ (Earned/Max) Adherence (Earned/Max) Overall (%) Details
Personal Portfolio Landing Page (FPC001) SUCCESS 62.0 / 110.0
56%
29.0 / 29.0
100%
86.91 Full Report
Interactive Recipe Page (FPC002) SUCCESS 57.0 / 110.0
52%
19.0 / 26.0
73%
66.70 Full Report
Dashboard UI with Chart and Table (FPC003) SUCCESS 100.0 / 110.0
91%
14.0 / 22.0
64%
71.82 Full Report
ECommerce Product Page With Image Zoom and Reviews (FPC004) SUCCESS 61.0 / 110.0
55%
20.0 / 25.0
80%
72.64 Full Report
Interactive Kanban Board UI (FPC005) SUCCESS 62.0 / 110.0
56%
20.0 / 25.0
80%
72.91 Full Report
Social Media Feed Infinite Scroll (FPC006) SUCCESS 56.0 / 110.0
51%
12.0 / 22.0
55%
53.45 Full Report
Multi Step Wizard Form with Validation and Summary (FPC007) SUCCESS 87.0 / 110.0
79%
12.0 / 21.0
57%
63.73 Full Report
Realtime Collaborative Text Editor Simulation (FPC008) SUCCESS 86.0 / 110.0
78%
20.0 / 25.0
80%
79.45 Full Report
Flight Booking System Interface (FPC009) SUCCESS 44.0 / 110.0
40%
20.0 / 26.0
77%
65.85 Full Report
WYSIWYG Rich Text Editor with Image Upload and Tables (FPC010) SUCCESS 59.0 / 110.0
54%
22.0 / 24.0
92%
80.26 Full Report