Model: Groq-DeepSeek-70b
Date Tested: 05/17/2025 | Config: master_prompts_benchmark_config.json
Aggregate Scores
Overall Weighted Score: 76.67 %
Avg. Prompt Score: 75.73 %
Prompts Analyzed: 10 / 10
Total Technical Quality: 1513.7 / 1900.0
Total Prompt Adherence: 186.0 / 245.0
Individual Prompt Results (10)
Prompt | Status | Technical Quality (Earned/Max) | Prompt Adherence (Earned/Max) | Overall (%) |
---|---|---|---|---|
Personal Portfolio Landing Page (FPC001) | SUCCESS | 170.9 / 190.0 | 29.0 / 29.0 | 97.99 |
Interactive Recipe Page (FPC002) | SUCCESS | 133.3 / 190.0 | 22.0 / 26.0 | 81.72 |
Dashboard UI with Chart and Table (FPC003) | SUCCESS | 163.8 / 190.0 | 14.0 / 22.0 | 68.15 |
ECommerce Product Page With Image Zoom and Reviews (FPC004) | SUCCESS | 128.1 / 190.0 | 18.0 / 25.0 | 71.08 |
Interactive Kanban Board UI (FPC005) | SUCCESS | 154.8 / 190.0 | 18.0 / 25.0 | 73.89 |
Social Media Feed Infinite Scroll (FPC006) | SUCCESS | 171.6 / 190.0 | 8.0 / 22.0 | 47.15 |
Multi Step Wizard Form with Validation and Summary (FPC007) | SUCCESS | 162.4 / 190.0 | 13.0 / 21.0 | 66.62 |
Realtime Collaborative Text Editor Simulation (FPC008) | SUCCESS | 161.0 / 190.0 | 20.0 / 25.0 | 80.95 |
Flight Booking System Interface (FPC009) | SUCCESS | 131.2 / 190.0 | 20.0 / 26.0 | 75.35 |
WYSIWYG Rich Text Editor with Image Upload and Tables (FPC010) | SUCCESS | 136.6 / 190.0 | 24.0 / 24.0 | 94.38 |
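
The report does not state how the Overall (%) column is derived from technical quality (TQ) and prompt adherence. The figures above are consistent with a weighting of 20 % TQ and 80 % adherence, applied both per prompt and to the totals. The sketch below checks that reading against the reported numbers. The 0.2 / 0.8 weights are an assumption inferred from the data, not values read from master_prompts_benchmark_config.json, and the names `PromptResult`, `weighted_percent`, `per_prompt_scores`, and `aggregate_scores` are hypothetical.

```python
from typing import NamedTuple


class PromptResult(NamedTuple):
    prompt_id: str
    tq_earned: float
    tq_max: float
    adherence_earned: float
    adherence_max: float
    reported_overall: float  # "Overall (%)" as printed in the table


# Values copied from the Individual Prompt Results table.
RESULTS = [
    PromptResult("FPC001", 170.9, 190.0, 29.0, 29.0, 97.99),
    PromptResult("FPC002", 133.3, 190.0, 22.0, 26.0, 81.72),
    PromptResult("FPC003", 163.8, 190.0, 14.0, 22.0, 68.15),
    PromptResult("FPC004", 128.1, 190.0, 18.0, 25.0, 71.08),
    PromptResult("FPC005", 154.8, 190.0, 18.0, 25.0, 73.89),
    PromptResult("FPC006", 171.6, 190.0, 8.0, 22.0, 47.15),
    PromptResult("FPC007", 162.4, 190.0, 13.0, 21.0, 66.62),
    PromptResult("FPC008", 161.0, 190.0, 20.0, 25.0, 80.95),
    PromptResult("FPC009", 131.2, 190.0, 20.0, 26.0, 75.35),
    PromptResult("FPC010", 136.6, 190.0, 24.0, 24.0, 94.38),
]

# Assumption: weights inferred from the reported scores, not taken
# from the benchmark configuration file.
TQ_WEIGHT = 0.2
ADHERENCE_WEIGHT = 0.8


def weighted_percent(tq_earned: float, tq_max: float,
                     adherence_earned: float, adherence_max: float) -> float:
    """Combine the two earned/max ratios into a single percentage."""
    tq_ratio = tq_earned / tq_max
    adherence_ratio = adherence_earned / adherence_max
    return 100.0 * (TQ_WEIGHT * tq_ratio + ADHERENCE_WEIGHT * adherence_ratio)


def per_prompt_scores(results: list[PromptResult]) -> dict[str, float]:
    """Recompute each prompt's overall score, rounded to two decimals."""
    return {
        r.prompt_id: round(
            weighted_percent(r.tq_earned, r.tq_max,
                             r.adherence_earned, r.adherence_max), 2)
        for r in results
    }


def aggregate_scores(results: list[PromptResult]) -> dict[str, float]:
    """Recompute the aggregate figures from the per-prompt rows."""
    total_tq = sum(r.tq_earned for r in results)
    total_tq_max = sum(r.tq_max for r in results)
    total_adh = sum(r.adherence_earned for r in results)
    total_adh_max = sum(r.adherence_max for r in results)
    return {
        # Weighting applied to the summed totals, not averaged per prompt.
        "overall_weighted": round(
            weighted_percent(total_tq, total_tq_max,
                             total_adh, total_adh_max), 2),
        # Plain mean of the reported per-prompt overall percentages.
        "avg_prompt_score": round(
            sum(r.reported_overall for r in results) / len(results), 2),
        "total_tq": round(total_tq, 1),
        "total_tq_max": round(total_tq_max, 1),
        "total_adherence": round(total_adh, 1),
        "total_adherence_max": round(total_adh_max, 1),
    }


if __name__ == "__main__":
    print(per_prompt_scores(RESULTS))
    print(aggregate_scores(RESULTS))
```

Under this assumed weighting, every recomputed per-prompt value agrees with the Overall (%) column to two decimals, and the totals reproduce the reported 76.67 % and 75.73 %. It also explains why FPC006 scores lowest overall despite having the highest technical quality: its adherence of 8.0 / 22.0 dominates the result.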