Model: Groq-Llama4-Scout
Date Tested: 07/16/2025 | Config: master_prompts_benchmark_config.json
Aggregate Scores
Overall Weighted Score
69.71 %
Avg. Prompt Score
68.89 %
Prompts Analyzed
10 / 10
Total Technical Quality
576.0 / 1100.0
Total Prompt Adherence
189.0 / 245.0
Individual Prompt Results (10)
Prompt | Status | TQ (Earned/Max) | Adherence (Earned/Max) | Overall (%) | Details |
---|---|---|---|---|---|
Personal Portfolio Landing Page (FPC001) | SUCCESS | 30.0 / 110.0 | 29.0 / 29.0 | 78.18 | Full Report |
Interactive Recipe Page (FPC002) | SUCCESS | 35.0 / 110.0 | 24.0 / 26.0 | 74.16 | Full Report |
Dashboard UI with Chart and Table (FPC003) | SUCCESS | 79.0 / 110.0 | 13.0 / 22.0 | 62.91 | Full Report |
ECommerce Product Page With Image Zoom and Reviews (FPC004) | SUCCESS | 77.0 / 110.0 | 20.0 / 25.0 | 77.00 | Full Report |
Interactive Kanban Board UI (FPC005) | SUCCESS | 72.0 / 110.0 | 17.0 / 25.0 | 67.24 | Full Report |
Social Media Feed Infinite Scroll (FPC006) | SUCCESS | 56.0 / 110.0 | 12.0 / 22.0 | 53.45 | Full Report |
Multi Step Wizard Form with Validation and Summary (FPC007) | SUCCESS | 71.0 / 110.0 | 12.0 / 21.0 | 59.36 | Full Report |
Realtime Collaborative Text Editor Simulation (FPC008) | SUCCESS | 50.0 / 110.0 | 20.0 / 25.0 | 69.64 | Full Report |
Flight Booking System Interface (FPC009) | SUCCESS | 44.0 / 110.0 | 20.0 / 26.0 | 65.85 | Full Report |
WYSIWYG Rich Text Editor with Image Upload and Tables (FPC010) | SUCCESS | 62.0 / 110.0 | 22.0 / 24.0 | 81.08 | Full Report |