Model: Groq-Llama4-Scout

Date Tested: 07/16/2025 | Config: master_prompts_benchmark_config.json

Aggregate Scores

Overall Weighted Score: 69.71%
Average Prompt Score: 68.89%
Prompts Analyzed: 10 / 10
Total Technical Quality: 576.0 / 1100.0
Total Prompt Adherence: 189.0 / 245.0

Individual Prompt Results (10)

Prompt	Status	Technical Quality (Earned / Max)	Prompt Adherence (Earned / Max)	Overall (%)
Personal Portfolio Landing Page (FPC001)	SUCCESS	30.0 / 110.0 (27%)	29.0 / 29.0 (100%)	78.18
Interactive Recipe Page (FPC002)	SUCCESS	35.0 / 110.0 (32%)	24.0 / 26.0 (92%)	74.16
Dashboard UI with Chart and Table (FPC003)	SUCCESS	79.0 / 110.0 (72%)	13.0 / 22.0 (59%)	62.91
ECommerce Product Page With Image Zoom and Reviews (FPC004)	SUCCESS	77.0 / 110.0 (70%)	20.0 / 25.0 (80%)	77.00
Interactive Kanban Board UI (FPC005)	SUCCESS	72.0 / 110.0 (65%)	17.0 / 25.0 (68%)	67.24
Social Media Feed Infinite Scroll (FPC006)	SUCCESS	56.0 / 110.0 (51%)	12.0 / 22.0 (55%)	53.45
Multi Step Wizard Form with Validation and Summary (FPC007)	SUCCESS	71.0 / 110.0 (65%)	12.0 / 21.0 (57%)	59.36
Realtime Collaborative Text Editor Simulation (FPC008)	SUCCESS	50.0 / 110.0 (45%)	20.0 / 25.0 (80%)	69.64
Flight Booking System Interface (FPC009)	SUCCESS	44.0 / 110.0 (40%)	20.0 / 26.0 (77%)	65.85
WYSIWYG Rich Text Editor with Image Upload and Tables (FPC010)	SUCCESS	62.0 / 110.0 (56%)	22.0 / 24.0 (92%)	81.08
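
The report does not say how the Overall (%) column or the aggregate scores are derived. The sketch below is one reading that fits the listed numbers: each overall score is taken as 30% of the Technical Quality (TQ) percentage plus 70% of the Prompt Adherence percentage. The 0.3 / 0.7 weights are an assumption inferred from the table, not values read from master_prompts_benchmark_config.json, and the names `weighted_score`, `RESULTS`, `TQ_WEIGHT`, and `ADHERENCE_WEIGHT` are hypothetical.

```python
# Assumed weights: NOT stated in the report. They are inferred because they
# reproduce the Overall (%) values listed in the table to two decimals.
TQ_WEIGHT = 0.3
ADHERENCE_WEIGHT = 0.7

# (prompt id, TQ earned, TQ max, adherence earned, adherence max),
# copied from the Individual Prompt Results table.
RESULTS = [
    ("FPC001", 30.0, 110.0, 29.0, 29.0),
    ("FPC002", 35.0, 110.0, 24.0, 26.0),
    ("FPC003", 79.0, 110.0, 13.0, 22.0),
    ("FPC004", 77.0, 110.0, 20.0, 25.0),
    ("FPC005", 72.0, 110.0, 17.0, 25.0),
    ("FPC006", 56.0, 110.0, 12.0, 22.0),
    ("FPC007", 71.0, 110.0, 12.0, 21.0),
    ("FPC008", 50.0, 110.0, 20.0, 25.0),
    ("FPC009", 44.0, 110.0, 20.0, 26.0),
    ("FPC010", 62.0, 110.0, 22.0, 24.0),
]


def weighted_score(tq_earned, tq_max, adh_earned, adh_max):
    """Blend the two percentages using the assumed 30/70 weighting."""
    tq_pct = 100.0 * tq_earned / tq_max
    adh_pct = 100.0 * adh_earned / adh_max
    return TQ_WEIGHT * tq_pct + ADHERENCE_WEIGHT * adh_pct


# Per-prompt overall scores, keyed by prompt id.
per_prompt = {pid: weighted_score(*vals) for pid, *vals in RESULTS}

# Average Prompt Score: plain mean of the per-prompt overall scores.
avg_prompt_score = sum(per_prompt.values()) / len(per_prompt)

# Column totals, matching the Total Technical Quality / Total Prompt
# Adherence lines in the aggregate section.
tq_earned_total = sum(r[1] for r in RESULTS)
tq_max_total = sum(r[2] for r in RESULTS)
adh_earned_total = sum(r[3] for r in RESULTS)
adh_max_total = sum(r[4] for r in RESULTS)

# Overall Weighted Score: the same blend applied to the pooled totals.
overall_weighted = weighted_score(
    tq_earned_total, tq_max_total, adh_earned_total, adh_max_total
)

for pid, score in per_prompt.items():
    print(f"{pid}: {score:.2f}")
print(f"Average Prompt Score: {avg_prompt_score:.2f}")
print(f"Overall Weighted Score: {overall_weighted:.2f}")
```

Under this assumed weighting the pooled totals come to 576.0 / 1100.0 and 189.0 / 245.0, and the two aggregates round to 68.89 and 69.71, in line with the report. The two aggregates differ because the average treats every prompt equally, while the pooled score gives more weight to prompts with a larger adherence maximum.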