Model: Groq-DeepSeek-70b
Date Tested: 07/16/2025 | Config: master_prompts_benchmark_config.json
Aggregate Scores
Overall Weighted Score: 71.11%
Avg. Prompt Score: 70.27%
Prompts Analyzed: 10 / 10
Total Technical Quality: 711.0 / 1100.0
Total Prompt Adherence: 181.0 / 245.0
Individual Prompt Results (10)
Prompt | Status | Technical Quality (Earned/Max) | Adherence (Earned/Max) | Overall (%) | Details |
---|---|---|---|---|---|
Personal Portfolio Landing Page (FPC001) | SUCCESS | 95.0 / 110.0 | 29.0 / 29.0 | 95.91 | Full Report |
Interactive Recipe Page (FPC002) | SUCCESS | 59.0 / 110.0 | 22.0 / 26.0 | 75.32 | Full Report |
Dashboard UI with Chart and Table (FPC003) | SUCCESS | 51.0 / 110.0 | 13.0 / 22.0 | 55.27 | Full Report |
ECommerce Product Page With Image Zoom and Reviews (FPC004) | SUCCESS | 33.0 / 110.0 | 14.0 / 25.0 | 48.20 | Full Report |
Interactive Kanban Board UI (FPC005) | SUCCESS | 81.0 / 110.0 | 18.0 / 25.0 | 72.49 | Full Report |
Social Media Feed Infinite Scroll (FPC006) | SUCCESS | 96.0 / 110.0 | 8.0 / 22.0 | 51.64 | Full Report |
Multi Step Wizard Form with Validation and Summary (FPC007) | SUCCESS | 87.0 / 110.0 | 13.0 / 21.0 | 67.06 | Full Report |
Realtime Collaborative Text Editor Simulation (FPC008) | SUCCESS | 86.0 / 110.0 | 20.0 / 25.0 | 79.45 | Full Report |
Flight Booking System Interface (FPC009) | SUCCESS | 59.0 / 110.0 | 20.0 / 26.0 | 69.94 | Full Report |
WYSIWYG Rich Text Editor with Image Upload and Tables (FPC010) | SUCCESS | 64.0 / 110.0 | 24.0 / 24.0 | 87.45 | Full Report |
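
The report does not state how the Overall (%) column is derived. The sketch below is one way to reproduce the published figures from the table. It assumes a 30% weight on Technical Quality and a 70% weight on Prompt Adherence. Those weights are inferred from the numbers in the table, not read from master_prompts_benchmark_config.json, and the names `TQ_WEIGHT`, `ADHERENCE_WEIGHT`, `weighted_score`, and `summarize` are hypothetical.

```python
# Assumed weights: inferred from the published scores, not taken from the
# benchmark config file.
TQ_WEIGHT = 0.3
ADHERENCE_WEIGHT = 0.7

# (prompt id, TQ earned, TQ max, adherence earned, adherence max),
# copied from the results table.
RESULTS = [
    ("FPC001", 95.0, 110.0, 29.0, 29.0),
    ("FPC002", 59.0, 110.0, 22.0, 26.0),
    ("FPC003", 51.0, 110.0, 13.0, 22.0),
    ("FPC004", 33.0, 110.0, 14.0, 25.0),
    ("FPC005", 81.0, 110.0, 18.0, 25.0),
    ("FPC006", 96.0, 110.0, 8.0, 22.0),
    ("FPC007", 87.0, 110.0, 13.0, 21.0),
    ("FPC008", 86.0, 110.0, 20.0, 25.0),
    ("FPC009", 59.0, 110.0, 20.0, 26.0),
    ("FPC010", 64.0, 110.0, 24.0, 24.0),
]


def weighted_score(tq, tq_max, adh, adh_max):
    """Return a percentage combining the two ratios under the assumed weights."""
    return 100.0 * (TQ_WEIGHT * tq / tq_max + ADHERENCE_WEIGHT * adh / adh_max)


def summarize(results):
    """Recompute the per-prompt and aggregate figures from the raw points."""
    per_prompt = {
        pid: weighted_score(tq, tq_max, adh, adh_max)
        for pid, tq, tq_max, adh, adh_max in results
    }
    total_tq = sum(r[1] for r in results)
    total_tq_max = sum(r[2] for r in results)
    total_adh = sum(r[3] for r in results)
    total_adh_max = sum(r[4] for r in results)
    return {
        "per_prompt": per_prompt,
        "total_tq": (total_tq, total_tq_max),
        "total_adherence": (total_adh, total_adh_max),
        # Weighted score computed on the pooled point totals.
        "overall_weighted": weighted_score(
            total_tq, total_tq_max, total_adh, total_adh_max
        ),
        # Plain mean of the per-prompt percentages.
        "avg_prompt_score": sum(per_prompt.values()) / len(per_prompt),
    }


summary = summarize(RESULTS)
```

Under these assumed weights, the recomputed values agree with the report to two decimal places. The two aggregate percentages differ because the overall weighted score pools points across prompts, while the average prompt score is the mean of the ten per-prompt percentages.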