How Content is Graded
Score Range: 0-100 points across 7 categories
Letter Grades:
- A+ (97-100), A (93-96), A- (90-92)
- B+ (87-89), B (83-86), B- (80-82)
- C+ (77-79), C (73-76), C- (70-72)
- D+ (67-69), D (63-66), D- (60-62)
- F (0-59)
Category Breakdown:
- Substance (22 points) - Content depth, accuracy, value delivery
- Voice (17 points) - Brand consistency, tone, conversational markers
- Structure (17 points) - H2/H3 hierarchy, section balance, flow
- Clarity (17 points) - Readability, jargon-free language, sentence length
- Engagement (17 points) - Hooks, CTAs, reader connection
- SEO (10 points) - Keyword placement, meta elements, search optimization
- Visual (10 points) - Formatting, scannability (not penalized - images generated separately)
What Gets Detected
53 Issue Types Across 8 Categories:
Hook & Opening (5 types)
- Weak opening - Generic first sentence
- Generic intro - "In today's world..." phrases
- Buried lede - Key point not in first 50 words
- No promise - Opening doesn't state reader value
- Slow start - Too much preamble
Structure (6 types)
- Poor hierarchy - H3 without H2, skipped levels
- Unbalanced sections - Sections with wildly different lengths
- Missing H2 headers - No H2 tags at all (knocks Structure to 0)
- Too few sections - Fewer than 3 H2 sections
- Wall of text - Paragraphs exceeding 200 words
- Overly thin sections - Sections under 100 words
Substance (7 types)
- Thin content - Total word count too low
- Lack of specifics - Vague claims without evidence
- No examples - Missing concrete examples
- Unsupported claims - Claims without backing
- Missing data - No statistics or facts
- Surface level - Doesn't go deep enough
- Repetitive - Redundant points
Voice (12 types)
- AI-isms - "Delve," "navigate," "realm," "landscape," "robust," "streamline," etc.
- Clichés - "At the end of the day," "game-changer," "cutting-edge"
- Conversational markers - Excessive "you," "your," "we"
- Corporate jargon - "Leverage," "synergy," "holistic"
- Fake specificity - "5 easy steps" without real specificity
- Formulaic openings - "When it comes to," "In today's world"
- Repetitive voice framing - Overuse of same framing device
- Negative phrasing - "Don't miss," "avoid," "prevent"
- Vague language - "Some," "many," "various"
- Passive voice - Exceeding 10% of sentences
- Em-dashes - Prohibited punctuation (highest priority AI signal)
- Fabrication patterns - Phrases that signal made-up content
Clarity (7 types)
- Long sentences - Sentences exceeding 35 words on average
- Dense paragraphs - Paragraphs over 150 words
- Unclear antecedents - Ambiguous pronouns
- Run-on sentences - Multiple clauses without breaks
- Complex vocabulary - Unnecessarily difficult words
- Jargon heavy - Too much industry terminology
- Poor transitions - Abrupt section changes
Engagement (6 types)
- No hook - Missing engaging opening
- Weak CTA - Call-to-action is generic or missing
- No questions - Content doesn't engage reader thinking
- Rhetorical questions - Overused rhetorical devices
- No reader benefit - Doesn't explain "what's in it for me"
- Boring subheads - H2/H3 tags don't create curiosity
SEO (5 types)
- Missing keyword - Target keyword not in first 100 words
- Keyword stuffing - Keyword density exceeds 3%
- No keyword in title - Target keyword missing from H1/title
- No bold keywords - Target keyword never bolded
- Thin meta - Missing or poor meta description
Formatting (5 types)
- BR tags - Using
instead of tags - Markdown remnants -
**bold** instead of - Placeholders - [INSERT X], [ADD Y] still present
- H1 tags in body - Title should be separate
- Inconsistent formatting - Mixed styles
Total: 53 issue types monitored
Auto-Fixing Process
Content goes through a three-phase auto-fixing pipeline before grading:
Phase 1: Programmatic Fixes (No AI cost)
- BR tags →
paragraphs - Markdown → HTML conversion
- Placeholder removal (
[INSERT], [ADD]) - H1 tag stripping
Phase 2: Em-Dash Fixing (Highest priority)
- Separate AI call
- Context-aware replacements
- Critical signal of AI-generated text
Phase 3: Style & Word Count (Combined call)
- Negative phrasing removal
- Rhetorical question removal
- Cliché elimination
- Formulaic opening fixes
- Word count reduction (if over target by 10%+)
What is NOT Auto-Fixed:
- Structure issues (heading hierarchy)
- Substance issues (missing examples, thin content)
- Most engagement issues (weak CTAs, no hook)
- SEO issues (keyword placement)
These require regeneration with updated prompt instructions.
Penalty System
Issue Severity:
- Critical: 3 points per occurrence (no cap)
- High: 2 points per occurrence (capped at 10)
- Medium: 1 point per occurrence (capped at 6)
- Low: 0.5 points per occurrence (capped at 4)
Knockout Issues (Zero out entire category):
no_h2_headers - No H2 tags → Structure = 0 points
Example Scoring:
- Content has 5 AI-isms (critical): -15 points from Voice
- Content has 8 clichés (high): -10 points (capped) from Voice
- Content has weak opening (high): -2 points from Engagement
- Missing keyword in title (medium): -1 point from SEO
- Total penalty: -28 points
- Final score: 72/100 (C)
Where You See Grades
Content Hub - Grade badge next to each content piece
Content Detail Page - Full breakdown:
- Overall grade (letter + score)
- Category scores (7 individual scores)
- Issues detected (full list with severity)
- Issues fixed (what was auto-corrected)
- Issues remaining (what needs manual attention)
Brief Detail Page - If content generated from brief
Email Reports - Content generation completion emails include grade
Improving Your Grade
For Substance Issues:
- Regenerate with "Add specific examples" instruction
- Manually add data, statistics, case studies
- Deepen analysis in weak sections
For Voice Issues:
- Use Learning Quality System (platform learns from your edits)
- Mark false positives as "Not an issue" to suppress detection
- Regenerate with updated brand voice samples
For Structure Issues:
- Manually fix heading hierarchy (H2 → H3 nesting)
- Add more H2 sections if too few
- Break up long sections into subsections
For Clarity Issues:
- Break long sentences into shorter ones
- Simplify complex vocabulary
- Add transitions between sections
For Engagement Issues:
- Add strong opening hook (first 2 sentences)
- Include clear CTAs in conclusion
- Make subheadings more curiosity-driven
For SEO Issues:
- Manually bold target keyword 2-3 times
- Add keyword to first 100 words
- Ensure keyword in title/H1
Learning Quality System
The platform learns from your edits to improve over time:
How It Works:
- Content is graded with issues detected
- You edit the content or mark issues as "Not an issue"
- Platform tracks which issues you consistently ignore or fix
- Future content auto-prevents recurring issues per tenant
- False-positive detections get suppressed
Result: Quality improves 60% in first month of use
Example:
- Week 1: "Rhetorical questions" flagged 12 times
- You mark 8 as "Not an issue" (these are strategic questions)
- Week 4: Platform stops flagging strategic questions in your content
- Only excessive rhetorical questions get flagged
Quality Monitoring System
ILLIXIS runs background monitoring to detect quality issues across your content generation pipeline.
Brief Quality Monitoring
Schedule: Daily
Checks for:
- Invariant violations: Briefs missing required fields (keyword, title, content strategy)
- SERP pending: Briefs stuck waiting for SERP analysis
- Incomplete data: Briefs with partial analysis data
- Error rates: Percentage of briefs failing during creation
What happens when issues are detected:
- Dashboard notification appears in Strategy Hub
- If error rate exceeds 10%, system alert is logged
- Admins can view details in Settings > System Health
Content Generation Monitoring
Schedule: Daily
Monitors:
- Truncation rate: Content cut off due to token limits
- Chart failures: Charts that failed to render
- Image failures: Hero images that didn't generate
- Success rates by brief type: Which brief types produce best results
Quality metrics tracked:
| Metric | Threshold | Action if exceeded |
|--------|-----------|-------------------|
| Truncation rate | >5% | Prompt optimization review |
| Chart failure rate | >10% | Playwright health check |
| Image failure rate | >15% | AI image generation investigation |
| Overall error rate | >10% | System alert generated |
Viewing quality metrics: Settings > System Health shows:
- 7-day rolling quality metrics
- Breakdown by brief type (keyword, trend, link magnet, etc.)
- Comparison to previous week
Automation Schedule
Content grading runs automatically at key points to ensure your content quality is always tracked:
On Content Generation:
- Grading runs immediately when new content is generated
- Auto-fixing pipeline (Phase 1-3) executes before grading
- Grade is assigned and stored with the content piece
- Improvement suggestions are generated alongside the grade
Weekly Bulk Re-Grading:
- All existing content is re-graded every Saturday at 2:00 AM UTC
- This catches improvements from Learning Quality System updates
- Re-grading uses latest detection rules and thresholds
- Only content that has changed since last grading is re-processed
Grade History:
- Every grading run is stored for trend analysis
- View grade changes over time in Content Detail page
- Track how your content quality evolves month-over-month
- Identify patterns in which categories improve or decline
Improvement Suggestions:
- Fresh suggestions generate with each grading run
- Suggestions prioritized by impact (highest point gains first)
- Suggestions become more refined as platform learns your preferences
- Stale suggestions (issues already fixed) are automatically cleared
Grade vs. Publish Decision
Grade ≠ Ready to Publish
A C-grade article with strong substance may outperform an A+ article with thin value.
Consider:
- B+ or higher - Usually ready with minor tweaks
- B to C+ - Review issues, fix critical ones
- C to D - Significant issues, consider regeneration
- F - Major problems, regenerate with better prompt
Strategic Exception:
- Pillar content targeting high-competition keywords: aim for A- or higher
- Quick blog posts for long-tail keywords: B is often sufficient
- Social repurposing content: C+ is fine (brevity over depth)
Viewing Grade Details
Navigate: Content Hub → [Content Piece] → View Details
You'll see:
- Overall grade badge
- Score breakdown by category
- Full issue list (detected, fixed, remaining)
- Recommendations for improvement
- Grading checklist (what was checked)
To regrade after manual edits:
- Click "Regrade Content" button
- Platform re-runs detection on current content
- New grade calculated based on remaining issues
FAQs
Q: Why did my content get a C when it looks good? A: Check the category breakdown. You may have high substance but poor structure (no H2 headers) or excessive AI-isms in voice.
Q: Can I disable certain issue detections? A: Use the Learning Quality System. Mark false positives as "Not an issue" and the platform will stop flagging them for your tenant.
Q: Why aren't all issues auto-fixed? A: Some issues (structure, substance, engagement) require regeneration with updated prompts. Auto-fixing is limited to style and formatting issues that the AI can safely correct without changing meaning.
Q: Does a higher grade mean better SEO performance? A: Not necessarily. Grade measures quality, not keyword targeting or search intent match. A B-grade article perfectly matching search intent will outrank an A+ article targeting the wrong intent.
Q: What's the average grade for generated content? A: First-generation content typically scores B to B+ (83-89). After auto-fixing and learning from your edits, average grade improves to A- (90-92) within 30 days.
Q: Why is Visual worth 10 points if it's not penalized? A: Visual scoring was removed December 2024 because images are generated separately. The 10 points were redistributed to other categories (Substance, Voice, Structure, Clarity, Engagement each gained 2 points).
Q: Can I export grade data? A: Yes. Export grade data via Content Hub filters or API access.