On-line Hierarchical Multi-label Classification
last 6 months
Jesse Read
jesse.read@gmail.com
University of Waikato
Outline
  Multi-label classification (review)
  Problem Transformation (review)
  Multi-labeled data (a closer look)
  PPT: A new Problem Transformation method
  Experiments I
  PPT: An extension
  Experiments II
  Experiments III
  PPT: Some related applications
  Summary, current and planned work
Single-label (Multi-class) Classification
  Set of documents D. Set of labels L.
  For each d ∈ D, select a label l ∈ L.
  Single-label representation: (d, l)
  e.g. L = {Sport, Environment, Science, Politics}:

  Document (d)                                                 Label (l ∈ L)
  “NZ scientists help discover solar system in our galaxy...”  Science
  “Antarctic food chain in danger...”                          Science
  “Top sports stars fuelling success...”                       Sport
  “Steeled for ironman...”                                     Sport
  “Greens claim report doctored...”                            Politics
  “Revealed: Polluting impact of humans on the oceans...”      Environment
  “Union muzzled while awaiting poll watchdog’s ruling...”     Politics
  “Technology pushes sporting boundaries...”                   Science
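The single-label representation (d, l) can be sketched in a few lines of Python. This is only an illustration of the definition on the slide, not code from the talk: the names LABELS, DATASET, validate_single_label and label_counts are assumptions introduced here, and the document strings are the truncated headlines from the example.

```python
from collections import Counter

# Label set L from the slide.
LABELS = {"Sport", "Environment", "Science", "Politics"}

# Single-label data: one (d, l) pair per document, copied from the slide.
DATASET = [
    ("NZ scientists help discover solar system in our galaxy...", "Science"),
    ("Antarctic food chain in danger...", "Science"),
    ("Top sports stars fuelling success...", "Sport"),
    ("Steeled for ironman...", "Sport"),
    ("Greens claim report doctored...", "Politics"),
    ("Revealed: Polluting impact of humans on the oceans...", "Environment"),
    ("Union muzzled while awaiting poll watchdog's ruling...", "Politics"),
    ("Technology pushes sporting boundaries...", "Science"),
]


def validate_single_label(dataset, labels):
    """Return True if every document carries exactly one label l with l in L."""
    for d, l in dataset:
        if l not in labels:
            raise ValueError(f"label {l!r} for document {d!r} is not in L")
    return True


def label_counts(dataset):
    """Count how many documents were assigned each label."""
    return Counter(l for _, l in dataset)
```

Calling validate_single_label(DATASET, LABELS) checks the constraint l ∈ L for each pair, and label_counts(DATASET) gives the class distribution of the eight example documents.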
Multi-label Classification
  Set of documents D. Set of labels L.
  For each d ∈ D, select a label subset S ⊆ L.
  Multi-label representation: (d, S)
  e.g. L = {Sport, Environment, Science, Politics}:

  Document (d)                                                 Label (S ⊆ L)
  “NZ scientists help discover solar system in our galaxy...”  {Science}
  “Antarctic food chain in danger...”
  “Top sports stars fuelling success...”
  “Steeled for ironman...”
  “Greens claim report doctored...”
  “Revealed: Polluting impact of humans on the oceans...”
  “Union muzzled while awaiting poll watchdog’s ruling...”
  “Technology pushes sporting boundaries...”
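The multi-label representation (d, S) differs from the single-label case only in that each document carries a subset S ⊆ L instead of one label. The sketch below is an illustration under stated assumptions, not code from the talk: the binary indicator vector is a common encoding that the slide does not mention, the names LABEL_ORDER, is_valid_label_set and to_indicator are invented here, and the second example pair is hypothetical (only the {Science} assignment comes from the slide).

```python
# Label set L from the slide, in a fixed order so that vector positions are stable.
LABEL_ORDER = ["Sport", "Environment", "Science", "Politics"]


def is_valid_label_set(S, labels=LABEL_ORDER):
    """Return True if S is a subset of L (the multi-label constraint S ⊆ L)."""
    return set(S) <= set(labels)


def to_indicator(S, labels=LABEL_ORDER):
    """Encode a label subset S as a 0/1 vector with one position per label in L."""
    if not is_valid_label_set(S, labels):
        raise ValueError(f"{sorted(set(S) - set(labels))} not in L")
    return [1 if label in S else 0 for label in labels]


# (d, S) pair taken from the slide.
from_slide = (
    "NZ scientists help discover solar system in our galaxy...",
    frozenset({"Science"}),
)

# Hypothetical (d, S) pair, NOT from the slide, showing a document with two labels.
hypothetical = (
    "an invented article about sports technology",
    frozenset({"Science", "Sport"}),
)
```

Under this encoding, a single-label example is the special case |S| = 1: to_indicator(from_slide[1]) has exactly one non-zero entry, while the hypothetical two-label document has two.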