# Advanced SISTRIX Data Collection Report

**Last Updated:** 2026-01-11

## Overview

This report documents the advanced SISTRIX API data collection strategy implemented to maximize value from remaining credits and provide comprehensive SEO insights.

## Implementation Status

✅ **All 8 collection scripts created and operational**

## Collection Scripts

### Phase 1: SERP Features ✅

**Script:** `v2/scripts/blog/collect-post-serp-features.php`

**Purpose:** Identify featured snippet opportunities, knowledge panel eligibility, and People Also Ask optimization

**Endpoint:** `keyword.seo.serpfeatures` (1 credit per keyword)

**Target:** Top 50 keywords (volume > 500, position < 10)

**Estimated Cost:** 50 credits

**Output:** `docs/content/blog/posts/{category}/{slug}/data/serp-features.json`

**Value:**
- Identify featured snippet opportunities
- Optimize for knowledge panels
- Understand SERP feature competition
- AEO/GEO optimization insights

### Phase 2: Search Intent ✅

**Script:** `v2/scripts/blog/collect-post-search-intent.php`

**Purpose:** Classify search intent to optimize content strategy and keyword targeting

**Endpoint:** `keyword.seo.searchintent` (1 credit per keyword)

**Target:** All 99 primary keywords + top 50 secondary keywords

**Estimated Cost:** 149 credits

**Output:** `docs/content/blog/posts/{category}/{slug}/data/search-intent.json`

**Value:**
- Optimize content for correct search intent
- Identify informational vs transactional keywords
- Improve conversion optimization
- Content strategy alignment

### Phase 3: Competition Levels ✅

**Script:** `v2/scripts/blog/collect-post-competition-levels.php`

**Purpose:** Understand competition levels for all keywords to prioritize optimization efforts

**Endpoint:** `keyword.seo.competition` (1 credit per keyword, supports batch)

**Target:** All ~700 keywords across all posts

**Estimated Cost:** 700 credits (batch mode)

**Output:** Updates `keywords-sistrix.json` with `competition_level` field

**Value:**
- Prioritize low-competition keywords
- Identify quick-win opportunities
- Competitive difficulty assessment
- Keyword prioritization matrix

### Phase 4: Competitor Keywords ✅

**Script:** `v2/scripts/blog/collect-competitor-keywords.php`

**Purpose:** Identify keyword opportunities by analyzing what competitors rank for

**Endpoint:** `keyword.domain.seo` with `domain` parameter (1 credit per keyword returned)

**Target:** Top 5 competitors, 50 keywords each

**Estimated Cost:** 250 credits

**Output:** 
- `docs/content/blog/domain-level-data/competitor-keywords.json`
- `docs/content/blog/domain-level-data/competitive-gaps.json`

**Value:**
- Identify keyword gaps vs competitors
- Discover new keyword opportunities
- Competitive positioning insights
- Content gap analysis

### Phase 5: Content Ideas ✅

**Script:** `v2/scripts/blog/collect-domain-content-ideas.php`

**Purpose:** Get AI-generated content ideas based on domain performance

**Endpoint:** `domain.ideas` (1 credit per idea returned)

**Target:** ordio.com domain, 100 ideas

**Estimated Cost:** 100 credits

**Output:** `docs/content/blog/domain-level-data/content-ideas.json`

**Value:**
- Content gap identification
- New topic ideas
- Content strategy insights
- Blog post opportunities

### Phase 6: Domain Opportunities ✅

**Script:** `v2/scripts/blog/collect-domain-opportunities.php`

**Purpose:** Identify keyword opportunities where domain could rank better

**Endpoint:** `domain.opportunities` (1 credit per opportunity returned)

**Target:** ordio.com domain, 100 opportunities

**Estimated Cost:** 100 credits

**Output:** `docs/content/blog/domain-level-data/domain-opportunities.json`

**Value:**
- Quick-win keyword opportunities
- Ranking improvement potential
- Content optimization priorities

### Phase 7: Backlink Analysis ✅

**Script:** `v2/scripts/blog/collect-domain-backlinks.php`

**Purpose:** Understand backlink profile for competitive advantage

**Endpoints:** `links.overview`, `links.linktargets`, `links.linktexts`

**Target:** ordio.com domain

**Estimated Cost:** 201 credits (1 + 100 + 100)

**Output:** `docs/content/blog/domain-level-data/backlinks.json`

**Value:**
- Backlink profile analysis
- Anchor text optimization
- Link building opportunities
- Competitive backlink comparison

### Phase 8: High-Value SERP Data ✅

**Script:** `v2/scripts/blog/collect-high-value-serp-data.php`

**Purpose:** Get top 10 SERP results for highest-value keywords only

**Endpoint:** `keyword.domain.seo` with `kw` parameter (100 credits per keyword)

**Target:** Top 10 keywords (volume > 2000, position 1-5)

**Estimated Cost:** 1,000 credits (selective collection)

**Output:** `docs/content/blog/domain-level-data/serp-results.json`

**Value:**
- Detailed competitive analysis
- Content gap identification
- Ranking factor analysis
- SERP feature opportunities

## Credit Budget

**Total Estimated Cost:** 2,550 credits

**Breakdown:**
- SERP Features: 50 credits
- Search Intent: 149 credits
- Competition Levels: 700 credits
- Competitor Keywords: 250 credits
- Content Ideas: 100 credits
- Domain Opportunities: 100 credits
- Backlink Analysis: 201 credits
- High-Value SERP: 1,000 credits

**Remaining Credits After Collection:** ~4,194 credits (from 6,744 starting)

## Documentation Updates

### Templates Updated

1. **SEO_REPORT.md** - Added sections for:
   - Search Intent Analysis
   - Competition Analysis
   - Enhanced SERP Features section

2. **SERP_ANALYSIS.md** - Enhanced with:
   - Detailed SERP features data
   - Optimization recommendations

3. **COMPETITIVE_ANALYSIS.md** - New template created for:
   - Competitor keyword comparison
   - Keyword gap analysis
   - Competitive positioning insights

### Documentation Generation Updated

**Script:** `v2/scripts/blog/generate-post-documentation.php`

**Changes:**
- Loads SERP features data (`serp-features.json`)
- Loads search intent data (`search-intent.json`)
- Integrates competition levels from `keywords-sistrix.json`
- Loads high-value SERP data from domain-level directory
- Populates new template placeholders

## Usage

### Run All Advanced Collection

```bash
php v2/scripts/blog/run-all-advanced-collection.php [--dry-run] [--skip-phase=N]
```

### Run Individual Phases

```bash
# Phase 1: SERP Features
php v2/scripts/blog/collect-post-serp-features.php --limit=50

# Phase 2: Search Intent
php v2/scripts/blog/collect-post-search-intent.php --all

# Phase 3: Competition Levels
php v2/scripts/blog/collect-post-competition-levels.php --all

# Phase 4: Competitor Keywords
php v2/scripts/blog/collect-competitor-keywords.php --limit=50

# Phase 5: Content Ideas
php v2/scripts/blog/collect-domain-content-ideas.php --limit=100

# Phase 6: Domain Opportunities
php v2/scripts/blog/collect-domain-opportunities.php --limit=100

# Phase 7: Backlink Analysis
php v2/scripts/blog/collect-domain-backlinks.php --limit=100

# Phase 8: High-Value SERP (selective)
php v2/scripts/blog/collect-high-value-serp-data.php --limit=10
```

## Expected Outcomes

1. **SERP Features Data:** Identify 10-20 featured snippet opportunities
2. **Search Intent Data:** Classify all primary keywords for content optimization
3. **Competition Data:** Prioritize 50-100 low-competition keywords
4. **Competitor Keywords:** Identify 100+ keyword gaps vs competitors
5. **Content Ideas:** Generate 20-30 new content opportunities
6. **Domain Opportunities:** Identify 20-30 quick-win keywords
7. **Backlink Data:** Understand backlink profile for strategy
8. **SERP Data:** Detailed competitive analysis for top 10 keywords

## Data File Locations

**Per-Post Data:**
- `docs/content/blog/posts/{category}/{slug}/data/serp-features.json`
- `docs/content/blog/posts/{category}/{slug}/data/search-intent.json`
- `docs/content/blog/posts/{category}/{slug}/data/keywords-sistrix.json` (updated with competition_level)

**Domain-Level Data:**
- `docs/content/blog/domain-level-data/competitor-keywords.json`
- `docs/content/blog/domain-level-data/competitive-gaps.json`
- `docs/content/blog/domain-level-data/content-ideas.json`
- `docs/content/blog/domain-level-data/domain-opportunities.json`
- `docs/content/blog/domain-level-data/backlinks.json`
- `docs/content/blog/domain-level-data/serp-results.json`

## Next Steps

1. **Run Collection:** Execute `run-all-advanced-collection.php` to collect all data
2. **Regenerate Documentation:** Run `generate-post-documentation.php --all` to update SEO reports
3. **Review Data:** Check domain-level data files for insights
4. **Manual Review:** Use collected data to inform manual review process
5. **Implement Improvements:** Act on identified opportunities and gaps

## Notes

- All scripts include credit management and weekly limit tracking
- Batch processing used where available to optimize speed
- Domain-level data collected once and reused across all posts
- High-value SERP collection is optional due to high cost (100 credits/keyword)
- All scripts support `--dry-run` mode for testing
