Model: Groq-Llama3.3-70b

Date Tested: 07/16/2025 | Config: master_prompts_benchmark_config.json

Aggregate Scores

Overall Weighted Score

72.10 %

Avg. Prompt Score

71.37 %

Prompts Analyzed

10 / 10

Total Technical Quality

674.0 / 1100.0

Total Prompt Adherence

188.0 / 245.0

Individual Prompt Results (10)

Prompt	Status	TQ (Earned/Max)	Adherence (Earned/Max)	Overall (%)	Details
Personal Portfolio Landing Page (FPC001)	SUCCESS	62.0 / 110.0 56%	29.0 / 29.0 100%	86.91	Full Report
Interactive Recipe Page (FPC002)	SUCCESS	57.0 / 110.0 52%	19.0 / 26.0 73%	66.70	Full Report
Dashboard UI with Chart and Table (FPC003)	SUCCESS	100.0 / 110.0 91%	14.0 / 22.0 64%	71.82	Full Report
ECommerce Product Page With Image Zoom and Reviews (FPC004)	SUCCESS	61.0 / 110.0 55%	20.0 / 25.0 80%	72.64	Full Report
Interactive Kanban Board UI (FPC005)	SUCCESS	62.0 / 110.0 56%	20.0 / 25.0 80%	72.91	Full Report
Social Media Feed Infinite Scroll (FPC006)	SUCCESS	56.0 / 110.0 51%	12.0 / 22.0 55%	53.45	Full Report
Multi Step Wizard Form with Validation and Summary (FPC007)	SUCCESS	87.0 / 110.0 79%	12.0 / 21.0 57%	63.73	Full Report
Realtime Collaborative Text Editor Simulation (FPC008)	SUCCESS	86.0 / 110.0 78%	20.0 / 25.0 80%	79.45	Full Report
Flight Booking System Interface (FPC009)	SUCCESS	44.0 / 110.0 40%	20.0 / 26.0 77%	65.85	Full Report
WYSIWYG Rich Text Editor with Image Upload and Tables (FPC010)	SUCCESS	59.0 / 110.0 54%	22.0 / 24.0 92%	80.26	Full Report