# Data Quality Dashboard

**Last Updated:** 2026-01-11  
**Auto-Generated:** Run `php v2/scripts/blog/validate-api-data-quality.php --all` to refresh

## Overview

This dashboard summarizes data collection quality across all 99 blog posts: coverage per data source, data freshness, and any open issues.

## Coverage Statistics

| Data Source | Posts with Data | Coverage | Status |
|-------------|----------------|----------|--------|
| **GSC** | 99 / 99 | 100% | ✅ Complete |
| **GA4** | 99 / 99 | 100% | ✅ Complete |
| **SISTRIX** | 99 / 99 | 100% | ✅ Complete |
| **All Sources** | 99 / 99 | 100% | ✅ Complete |

## Data Freshness

| Data Source | Average Age | Stale Posts (>7 days) | Status |
|-------------|-------------|----------------------|--------|
| **GSC** | < 1 day | 0 | ✅ Fresh |
| **GA4** | < 1 day | 0 | ✅ Fresh |
| **SISTRIX** | Varies | 0 | ✅ Fresh |
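
The "Stale Posts (>7 days)" column counts posts whose collected data is older than a week. A minimal sketch of such a check follows. It assumes each post's collected data is stored as its own file and that the file's modification time reflects the last collection. The directory layout and the helper name `count_stale_files` are hypothetical and are not taken from the validation script.

```bash
#!/usr/bin/env bash
# Hypothetical helper: count files under a directory whose modification
# time lies more than N days in the past (default: 7 days).
count_stale_files() {
  local dir="$1" max_age_days="${2:-7}"
  # find's -mmin works in minutes; 1440 minutes = 1 day.
  find "$dir" -type f -mmin +"$((max_age_days * 1440))" | wc -l | tr -d ' '
}
```

Calling `count_stale_files path/to/data 7` prints the number of files older than seven days. A result of `0` corresponds to the "Fresh" status above.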

## Quality Metrics

### GSC Data Quality
- **Posts with Zero Clicks/Impressions:** 0 (all posts have data)
- **Posts with Suspicious Zeros:** 0 (no posts with high GA4 traffic but zero GSC data)
- **Data Collection Success Rate:** 100%
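
The "suspicious zeros" check can be sketched as a cross-source comparison: a post with notable GA4 traffic but no GSC impressions more likely points to a collection problem than to a real absence of search traffic. The example below is only an illustration. The CSV layout (`slug,gsc_impressions,ga4_page_views` with a header row), the helper name `find_suspicious_zeros`, and the default threshold of 100 page views are assumptions and are not taken from the validation script.

```bash
# Hypothetical helper: list posts that have zero GSC impressions
# although their GA4 page views reach a minimum threshold.
# Assumed CSV columns: slug,gsc_impressions,ga4_page_views (with header row).
find_suspicious_zeros() {
  local csv_file="$1" min_page_views="${2:-100}"
  # "+ 0" forces numeric comparison; an empty field counts as zero.
  awk -F',' -v min="$min_page_views" \
    'NR > 1 && $2 + 0 == 0 && $3 + 0 >= min { print $1 }' "$csv_file"
}
```

An empty result matches the "0" reported above. Any slug printed would be a candidate for re-collecting its GSC data.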

### GA4 Data Quality
- **Posts with Zero Page Views:** 0 (all posts have data)
- **Date Range Mapping:** ✅ Correct (both the 90-day and the year range are collected)
- **Data Collection Success Rate:** 100%

### SISTRIX Data Quality
- **Posts with Keywords:** 99 / 99 (100%)
- **Average Keywords per Post:** ~7
- **Data Collection Success Rate:** 100%

## Issue Summary

### Critical Issues
- **None** ✅

### Warnings
- **None** ✅

### Information
- All data sources are operational and collecting data correctly
- No data quality issues detected

## Recent Fixes Applied

### GSC Collection Fixes (2026-01-11)
- ✅ Fixed site URL format (changed from `sc_domain:ordio.com` to `https://www.ordio.com/`)
- ✅ Added dynamic site property detection
- ✅ Improved URL format matching with fallback strategies
- ✅ Added comprehensive error logging
- ✅ Fixed silent exception handling
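
Search Console distinguishes URL-prefix properties (such as `https://www.ordio.com/`) from domain properties (such as `sc_domain:ordio.com`), which is why the site URL format matters. One way a fallback strategy could look is to generate candidate identifiers and try them in order. The sketch below only illustrates that idea. The helper name `gsc_property_candidates` and the candidate order are assumptions, and the actual detection logic in the collection script may differ.

```bash
# Hypothetical helper: print candidate Search Console property
# identifiers for a domain, URL-prefix forms first, domain property last.
gsc_property_candidates() {
  local domain="$1"
  domain="${domain#www.}"   # normalize: strip a leading "www."
  printf '%s\n' \
    "https://www.${domain}/" \
    "https://${domain}/" \
    "sc_domain:${domain}"
}
```

A caller would compare each candidate against the properties the account can access and use the first match. That step is omitted here because it depends on the API client in use.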

### GA4 Collection Fixes (2026-01-11)
- ✅ Fixed date range mapping (it was hardcoded to the first range only)
- ✅ Mapped multiple date ranges to their corresponding results
- ✅ Added error logging
- ✅ Verified both 90-day and year data collection

## Data Collection Status

### Last Collection Runs

| Data Source | Last Run | Posts Processed | Success Rate |
|-------------|----------|----------------|--------------|
| **GSC** | 2026-01-11 18:27 | 99 | 100% |
| **GA4** | 2026-01-11 18:27 | 99 | 100% |
| **SISTRIX** | 2026-01-11 | 99 | 100% |

## Top Performing Posts (GSC)

| Post | Clicks (90d) | Impressions (90d) | Position |
|------|--------------|-------------------|----------|
| zuschlage-berechnen-rechner | 7,343 | 94,092 | 8.5 |
| 24-stunden-schicht | 759 | - | 4.2 |
| arbeitsbescheinigung | 201 | - | 6.9 |

## Top Performing Posts (GA4)

| Post | Page Views (90d) | Sessions (90d) | Engagement (s) |
|------|------------------|----------------|----------------|
| zuschlage-berechnen-rechner | 29,765 | 27,181 | 157 |
| 24-stunden-schicht | 1,759 | - | - |
| arbeitsbescheinigung | 713 | - | - |

## Recommendations

1. ✅ **All systems operational** - No immediate action is required
2. **Monitor weekly** - Run the validation script once a week to catch issues early
3. **Refresh data monthly** - Consider a full data refresh for all posts once a month
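
The weekly validation run could be scheduled with cron. The sketch below builds a crontab line for it. The schedule (Mondays at 06:00), the helper name `weekly_validation_cron_entry`, and both path arguments are assumptions for illustration. Only the `php ... --all` command comes from this document.

```bash
# Hypothetical helper: print a crontab line that runs the full validation
# every Monday at 06:00 and appends all output to a log file.
weekly_validation_cron_entry() {
  local repo_dir="$1" log_file="$2"
  # Cron fields: minute hour day-of-month month day-of-week (1 = Monday).
  printf '0 6 * * 1 cd %s && php v2/scripts/blog/validate-api-data-quality.php --all >> %s 2>&1\n' \
    "$repo_dir" "$log_file"
}
```

To install the entry, append it to the current crontab, for example with `( crontab -l 2>/dev/null; weekly_validation_cron_entry /path/to/repo /path/to/validation.log ) | crontab -`. Both paths are placeholders and should not contain spaces.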

## Validation Commands

```bash
# Run full validation
php v2/scripts/blog/validate-api-data-quality.php --all

# Validate specific category
php v2/scripts/blog/validate-api-data-quality.php --category=ratgeber

# Validate specific post
php v2/scripts/blog/validate-api-data-quality.php --post=zuschlage-berechnen-rechner --category=ratgeber
```

## Related Documentation

- [Data Collection Guide](DATA_COLLECTION_GUIDE.md)
- [Troubleshooting Guide](TROUBLESHOOTING_DATA_COLLECTION.md)
- [Validation Report](DATA_QUALITY_VALIDATION_REPORT.md)
