Back to research

ML/NLP / 2026-03-20

IAB 3.0 Content Classifier Training Report

This report summarizes a cascading hierarchical classifier for assigning media content to IAB 3.0 taxonomy categories. The model is designed to replace an external classification API with an in-house system trained on existing labeled data.

Published Research Native report page
Tier 1 categories 37
Tier 2 categories 332
Tier 3 categories 250
Best micro F1 74.9%

Tier 1

Problem

Replace an external content classification dependency while preserving hierarchical IAB 3.0 parent-child consistency.

Approach

Trained a cascading taxonomy classifier where Tier 1 narrows Tier 2 candidates and Tier 2 narrows Tier 3 candidates.

Result

Tier 1 reached 74.9% micro F1, with clear next steps around rare-category balancing and deeper-tier thresholds.

Technologies
  • Python
  • NLP
  • IAB 3.0
  • Hierarchical Classification
  • Model Evaluation
  • Taxonomy Modeling

Overview

The model classifies media records into IAB 3.0 taxonomy categories using a cascading hierarchy. Tier 1 predictions narrow the allowed Tier 2 predictions, and Tier 2 predictions narrow the allowed Tier 3 predictions.

This structure keeps parent-child consistency intact and reduces impossible category combinations.

Performance

LevelMicro F1Macro F1
Tier 174.9%55.4%
Tier 265.9%21.4%
Tier 358.9%21.8%
MetricValue
Primary category accuracy18.4%

Modeling implication

The Tier 1 result is strong enough to support production experimentation. The lower macro F1 values at Tier 2 and Tier 3 suggest that rare categories need more balanced training data, additional sampling controls, or category-specific thresholds before the model should be treated as final.

Detailed methodology and results

Supporting methodology, figures, and tables are rendered here as native page content with the same visual system as the rest of this website.

Generated: 2026-03-20 13:29

Overview

This model classifies media content into IAB 3.0 taxonomy categories using a cascading hierarchical approach. It replaces the external classification API with an in-house solution trained on existing labeled data.

How It Works

The model uses a cascading architecture : Tier 1 predictions narrow down the possible Tier 2 categories, and Tier 2 narrows down Tier 3. This ensures parent-child consistency and reduces errors.

How It Works
How It Works

Training Data

Training Data
Training Data
Training Data
Training Data
Training Data
Training Data

Model Performance

Model Performance
Model Performance
Model Performance
Model Performance
Model Performance
Model Performance
Model Performance
Model Performance
Model Performance
Model Performance