# FAQ Quality Improvement Guide

**Last Updated:** 2026-01-15

Complete guide for improving FAQ content quality, identifying and fixing common issues, and maintaining high standards.

## Overview

This guide covers:

- Common FAQ quality issues and how to fix them
- Topic relevance validation process
- Pattern detection and prevention
- Ordering best practices
- Review checklist
- Examples of good vs bad FAQs

## Common Issues and Fixes

### Issue 1: Off-Topic FAQs

**Problem:** FAQs that don't relate to the post topic (e.g., "Was ist ordio erfahrungen?" on a post about "ueberstunden-erfassen").

**Detection:**

```bash
php v2/scripts/blog/validate-faq-topic-relevance.php --post=slug --category=category
php v2/scripts/blog/comprehensive-faq-analysis.php --post=slug --category=category
```

**Fix:**

1. Check topic relevance score (must be ≥ 0.3)
2. Remove FAQs below threshold
3. Replace with topic-relevant FAQs if needed

**Example:**

- ❌ **BAD:** "Was ist ordio support?" on post about "ueberstunden-erfassen" (relevance: 0.15)
- ✅ **GOOD:** "Wie funktioniert die Überstundenerfassung?" (relevance: 0.85)

### Issue 2: Repetitive Nonsensical Patterns

**Problem:** Questions following nonsensical patterns like "Was kostet [abstract concept]?" or "Wie lange dauert [abstract concept]?".

**Common Patterns:**

- "Was kostet Freizeitausgleich?" (abstract concept, doesn't have a cost)
- "Was kostet motivieren?" (action verb, doesn't have a cost)
- "Wie lange dauert beispiele?" (abstract concept, no duration)
- "Wie lange dauert bedeutung?" (abstract concept, no duration)

**Detection:**

```bash
php v2/scripts/blog/detect-faq-patterns.php --post=slug --category=category
```

**Fix:**

1. Identify pattern violations
2. Remove high-severity violations
3. Rewrite questions if they can be salvaged (e.g., "Was kostet Freizeitausgleich?" → "Wie funktioniert Freizeitausgleich?")

**Example:**

- ❌ **BAD:** "Was kostet Freizeitausgleich?" (nonsensical - Freizeitausgleich doesn't cost anything)
- ✅ **GOOD:** "Wie funktioniert Freizeitausgleich?" (makes sense - explains how it works)

### Issue 3: Duplicate Questions

**Problem:** Multiple questions asking the same thing with slight variations.

**Common Duplicates:**

- "Was ist Accord-Arbeit?" vs "Was ist Accordarbeit?" vs "Was ist Akkord-Arbeit?"
- "Wie berechnet man X?" vs "Wie funktioniert X?" (if similarity > 0.7)

**Detection:**

```bash
php v2/scripts/blog/check-faq-uniqueness.php --post=slug --category=category
php v2/scripts/blog/comprehensive-faq-analysis.php --post=slug --category=category
```

**Fix:**

1. Identify duplicate pairs (similarity > 0.7)
2. Choose best FAQ (better answer quality, closer to 40-80 words)
3. Remove duplicates
4. Keep one canonical version

**Example:**

- ❌ **BAD:**
  - FAQ 1: "Was ist Accord-Arbeit?"
  - FAQ 2: "Was ist Accordarbeit?"
  - FAQ 3: "Was ist Akkord-Arbeit?"
- ✅ **GOOD:** Keep one: "Was ist Akkordarbeit?" (standard spelling)

### Issue 4: Random/Scattered Ordering

**Problem:** FAQs not following logical flow, appearing random and scattered.

**Detection:**

```bash
php v2/scripts/blog/analyze-faq-ordering.php --post=slug --category=category
```

**Fix:**

1. Review current ordering
2. Reorder following logical flow:
   - Definitions first ("Was ist...?")
   - How-to questions second ("Wie funktioniert...?")
   - Requirements/details third ("Was muss ich...?")
   - Edge cases last
3. Prioritize high-volume queries (from GSC data)

**Example:**

- ❌ **BAD:** Random order with definitions scattered throughout
- ✅ **GOOD:**
  1. "Was sind Überstunden?" (definition)
  2. "Wie funktioniert die Überstundenerfassung?" (how-to)
  3. "Was muss ich bei der Überstundenerfassung beachten?" (requirements)
  4. "Wie lange dauert die Bearbeitung?" (details)

### Issue 5: Brand Questions on Non-Brand Posts

**Problem:** Ordio-related questions appearing on posts not about Ordio.

**Common Issues:**

- "Was ist ordio erfahrungen?" on post about "ueberstunden-erfassen"
- "Was ist ordio support?" on post about "arbeitszeitmodelle"
- "Was sind Ordio Kosten?" on post about "lohnabrechnung"

**Detection:**

```bash
php v2/scripts/blog/detect-faq-patterns.php --post=slug --category=category
php v2/scripts/blog/comprehensive-faq-analysis.php --post=slug --category=category
```

**Fix:**

1. Identify brand questions on non-brand posts
2. Remove unless naturally relevant to topic
3. Brand questions should only appear on:
   - Posts in "inside-ordio" category
   - Posts with "ordio" in slug

**Example:**

- ❌ **BAD:** "Was ist ordio erfahrungen?" on post about "ueberstunden-erfassen"
- ✅ **GOOD:** Brand questions only on Ordio-related posts

### Issue 6: Malformed Questions

**Problem:** Questions with fragments, incorrect grammar, or malformed structure.

**Common Issues:**

- Questions ending with fragments ("Was ist Überstunden erfassen: Vergütung,?")
- Questions with nested fragments ("Was ist gibt es ein...?")
- Multiple question marks incorrectly ("Was ist X??")

**Detection:**

```bash
php v2/scripts/blog/detect-faq-patterns.php --post=slug --category=category
```

**Fix:**

1. Remove fragments at the end
2. Fix grammar and structure
3. Ensure single question mark
4. Make questions complete sentences

**Example:**

- ❌ **BAD:** "Was ist Überstunden erfassen: Vergütung,?"
- ✅ **GOOD:** "Was bedeutet Überstunden erfassen?"

## Topic Relevance Validation Process

### Step 1: Run Validation

```bash
php v2/scripts/blog/validate-faq-topic-relevance.php --post=slug --category=category
```

### Step 2: Review Results

- Check relevance scores for each FAQ
- Identify FAQs below 0.3 threshold
- Review why FAQs are off-topic

### Step 3: Fix Issues

- Remove off-topic FAQs
- Replace with topic-relevant FAQs if needed
- Ensure all remaining FAQs have relevance ≥ 0.3

### Step 4: Validate Fixes

```bash
php v2/scripts/blog/validate-faq-topic-relevance.php --post=slug --category=category
```

## Pattern Detection and Prevention

### Detection Process

```bash
php v2/scripts/blog/detect-faq-patterns.php --post=slug --category=category
```

### Common Patterns to Watch For

1. **"Was kostet [abstract concept]?"**

   - Abstract concepts don't have costs
   - Action verbs don't have costs
   - Only concrete products/services have costs

2. **"Wie lange dauert [abstract concept]?"**

   - Abstract concepts don't have duration
   - Only processes/actions have duration

3. **Brand questions on non-brand posts**

   - Check category and slug
   - Remove unless naturally relevant

4. **Malformed questions**
   - Check for fragments
   - Check for multiple question marks
   - Check grammar

### Prevention

- Use comprehensive analysis before adding FAQs
- Validate questions during generation
- Review manually before publishing
- Use fix script with dry-run first

## Ordering Best Practices

### Logical Flow Order

1. **Definition Questions** - "Was ist...?", "Was bedeutet...?"
2. **How-To Questions** - "Wie funktioniert...?", "Wie erstellt man...?"
3. **Requirements** - "Was muss ich...?", "Was sollte...?"
4. **When/Why** - "Wann...?", "Warum...?"
5. **Which** - "Welche...?"
6. **Yes/No** - "Ist...?", "Darf...?"
7. **Cost/Duration** - "Was kostet...?", "Wie lange dauert...?"
8. **Edge Cases** - Troubleshooting, exceptions

### Priority Order (Within Types)

1. High-volume queries (from GSC) - highest clicks/impressions
2. People Also Ask questions (from SISTRIX)
3. Related keywords
4. Standard questions

### Validation

```bash
php v2/scripts/blog/analyze-faq-ordering.php --post=slug --category=category
```

## Review Checklist

### Pre-Review

- [ ] Run comprehensive analysis
- [ ] Review post title, slug, primary keyword
- [ ] Understand post topic and context

### Topic Relevance

- [ ] All FAQs relevant to post topic (relevance ≥ 0.3)
- [ ] No off-topic FAQs
- [ ] No brand questions on non-brand posts

### Patterns

- [ ] No nonsensical cost patterns ("Was kostet [abstract]?")
- [ ] No nonsensical duration patterns ("Wie lange dauert [abstract]?")
- [ ] No malformed questions

### Duplicates

- [ ] No duplicate questions (similarity < 0.7)
- [ ] No repetitive answers (similarity < 0.6)

### Ordering

- [ ] Definitions come before how-to questions
- [ ] High-volume queries appear first
- [ ] Related questions grouped together
- [ ] Logical flow maintained

### Content Quality

- [ ] 10-15 FAQs per post
- [ ] Answers 40-80 words
- [ ] Primary keyword in 3-5 FAQs
- [ ] Du tone consistent
- [ ] No template language

## Examples

### Good FAQ Set

**Post:** "Überstunden erfassen: Vergütung, Abrechnung und Freizeitausgleich"

1. "Was bedeutet Überstunden erfassen?" (definition, relevance: 0.92)
2. "Wie funktioniert die Überstundenerfassung?" (how-to, relevance: 0.88)
3. "Was muss ich bei der Überstundenerfassung beachten?" (requirements, relevance: 0.85)
4. "Wie funktioniert die Abrechnung von Überstunden?" (how-to, relevance: 0.87)
5. "Ist Freizeitausgleich gesetzlich vorgeschrieben?" (yes/no, relevance: 0.82)
6. "Wie lange dauert die Bearbeitung von Überstunden?" (duration, relevance: 0.79)

**Why it's good:**

- All FAQs relevant to topic (relevance > 0.75)
- Logical ordering (definition → how-to → requirements)
- No duplicates
- No pattern violations
- Covers key aspects of topic

### Bad FAQ Set

**Post:** "Überstunden erfassen: Vergütung, Abrechnung und Freizeitausgleich"

1. "Was ist ordio support?" (off-topic, relevance: 0.12)
2. "Was kostet Freizeitausgleich?" (pattern violation, relevance: 0.45)
3. "Was ist ordio erfahrungen?" (off-topic, relevance: 0.08)
4. "Wie lange dauert beispiele?" (pattern violation, relevance: 0.23)
5. "Was ist Accord-Arbeit?" (duplicate of #6, relevance: 0.31)
6. "Was ist Accordarbeit?" (duplicate of #5, relevance: 0.32)

**Why it's bad:**

- Off-topic FAQs (relevance < 0.3)
- Pattern violations (nonsensical patterns)
- Duplicates (similarity > 0.7)
- Random ordering
- Brand questions on non-brand post

## Tools Reference

### Analysis Tools

- `comprehensive-faq-analysis.php` - Complete quality analysis
- `validate-faq-topic-relevance.php` - Topic relevance validation
- `detect-faq-patterns.php` - Pattern detection
- `analyze-faq-ordering.php` - Ordering analysis
- `check-faq-uniqueness.php` - Duplicate detection

### Fix Tools

- `fix-faq-quality-issues.php` - Automated fixes (with manual review)

### Review Tools

- `review-faq-quality-post-by-post.php` - Interactive review workflow

## Related Documentation

- `docs/content/blog/FAQ_QUALITY_STANDARDS.md` - Quality standards and thresholds
- `docs/content/blog/FAQ_QUALITY_RESEARCH.md` - Research findings
- `.cursor/rules/blog-faq-optimization.mdc` - Complete FAQ optimization rules
