Model: Groq-Llama4-Scout
Date Tested: 05/17/2025 | Config: master_prompts_benchmark_config.json
Aggregate Scores
Overall Weighted Score: 75.92 %
Avg. Prompt Score: 74.98 %
Prompts Analyzed: 10 / 10
Total Technical Quality: 1349.1 / 1900.0
Total Prompt Adherence: 189.0 / 245.0
Individual Prompt Results (10)
Prompt | Status | Technical Quality (Earned/Max) | Prompt Adherence (Earned/Max) | Overall (%) | Details |
---|---|---|---|---|---|
Personal Portfolio Landing Page (FPC001) | SUCCESS | 103.3 / 190.0 | 29.0 / 29.0 | 90.87 | Full Report |
Interactive Recipe Page (FPC002) | SUCCESS | 108.1 / 190.0 | 24.0 / 26.0 | 85.23 | Full Report |
Dashboard UI with Chart and Table (FPC003) | SUCCESS | 151.4 / 190.0 | 13.0 / 22.0 | 63.21 | Full Report |
ECommerce Product Page With Image Zoom and Reviews (FPC004) | SUCCESS | 152.1 / 190.0 | 20.0 / 25.0 | 80.01 | Full Report |
Interactive Kanban Board UI (FPC005) | SUCCESS | 145.8 / 190.0 | 17.0 / 25.0 | 69.75 | Full Report |
Social Media Feed Infinite Scroll (FPC006) | SUCCESS | 171.9 / 190.0 | 12.0 / 22.0 | 61.73 | Full Report |
Multi Step Wizard Form with Validation and Summary (FPC007) | SUCCESS | 143.0 / 190.0 | 12.0 / 21.0 | 60.77 | Full Report |
Realtime Collaborative Text Editor Simulation (FPC008) | SUCCESS | 122.0 / 190.0 | 20.0 / 25.0 | 76.84 | Full Report |
Flight Booking System Interface (FPC009) | SUCCESS | 116.2 / 190.0 | 20.0 / 26.0 | 73.77 | Full Report |
WYSIWYG Rich Text Editor with Image Upload and Tables (FPC010) | SUCCESS | 135.3 / 190.0 | 22.0 / 24.0 | 87.58 | Full Report |
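
The report does not state how the Overall (%) column or the Overall Weighted Score is derived. The figures above are, however, consistent with a weighting of 20 % technical quality and 80 % prompt adherence. The sketch below recomputes the aggregates under that assumption. The weights, the `prompt_score` and `aggregate` helpers, and the `RESULTS` layout are illustrative and inferred from the numbers; they are not taken from master_prompts_benchmark_config.json.

```python
# Sketch: recompute the report's aggregate scores from the per-prompt table.
# ASSUMPTION: the 0.2 / 0.8 weights are inferred from the published numbers,
# not read from the benchmark config. All names here are hypothetical.

TQ_MAX = 190.0           # technical quality maximum, identical for every prompt
TQ_WEIGHT = 0.2          # assumed weight of technical quality
ADHERENCE_WEIGHT = 0.8   # assumed weight of prompt adherence

# prompt id -> (technical quality earned, adherence earned, adherence max)
RESULTS = {
    "FPC001": (103.3, 29.0, 29.0),
    "FPC002": (108.1, 24.0, 26.0),
    "FPC003": (151.4, 13.0, 22.0),
    "FPC004": (152.1, 20.0, 25.0),
    "FPC005": (145.8, 17.0, 25.0),
    "FPC006": (171.9, 12.0, 22.0),
    "FPC007": (143.0, 12.0, 21.0),
    "FPC008": (122.0, 20.0, 25.0),
    "FPC009": (116.2, 20.0, 26.0),
    "FPC010": (135.3, 22.0, 24.0),
}


def prompt_score(tq, adherence, adherence_max):
    """Weighted percentage for one prompt under the assumed weights."""
    return 100.0 * (
        TQ_WEIGHT * tq / TQ_MAX
        + ADHERENCE_WEIGHT * adherence / adherence_max
    )


def aggregate(results):
    """Totals, weighted overall score, and mean per-prompt score."""
    tq_total = sum(tq for tq, _, _ in results.values())
    adherence_total = sum(a for _, a, _ in results.values())
    adherence_max_total = sum(m for _, _, m in results.values())
    tq_max_total = TQ_MAX * len(results)
    overall = 100.0 * (
        TQ_WEIGHT * tq_total / tq_max_total
        + ADHERENCE_WEIGHT * adherence_total / adherence_max_total
    )
    average = sum(prompt_score(*row) for row in results.values()) / len(results)
    return {
        "tq_total": tq_total,
        "tq_max_total": tq_max_total,
        "adherence_total": adherence_total,
        "adherence_max_total": adherence_max_total,
        "overall_weighted": overall,
        "avg_prompt_score": average,
    }


if __name__ == "__main__":
    for name, value in aggregate(RESULTS).items():
        print(f"{name}: {value:.2f}")
```

Under these assumed weights, the Overall Weighted Score is computed from the summed totals, while the Avg. Prompt Score is the unweighted mean of the ten per-prompt percentages. That difference would explain why the two aggregate figures do not match.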