7

The meaning of multi-class classification rules

Example: I have two classification rules (Refund is a predictor and Cheat is a binary response):
(Refund, No) → (Cheat, No) Support = 0.4, Confidence = 0.57
(Refund, No) → (Cheat, Yes) Support = 0.3, Confidence = 0.43

=> multi-class classification rules:

(Refund, No) → (Cheat, No) v (Cheat, Yes)

When predicted classification for test data, (Cheat, No) will be selected priority so why we need to have (Cheat, Yes) in multi-class classification rules here?

Nitesh
  • 1,615
  • 1
  • 12
  • 22
Xuan Dung
  • 153
  • 1
  • 6

2 Answers2

3
(Cheat, No) will be selected (cheat will be classified as No) for the rule (Refund, No). 

In a binary response variable like cheat, all the information can be inferred from just the first rule:

(Refund, No) → (Cheat, No) Support = 0.4, Confidence = 0.57 

The other rule is redundant.

However, in the case of a multi-class response variable, we would like to have all the rules written out so we exactly know the likelihood of the rule implying each of the different classes in the response variable. To keep things consistent, this is also done for the case when the response variable is binary.

Nitesh
  • 1,615
  • 1
  • 12
  • 22
  • Thanks Nitesh, The purpose (Cheat, Yes) is retained in the multi-layer classification rules to do? – Xuan Dung Nov 08 '14 at 23:03
  • @XuanDung: For classification where the response is not binary (multi-class), the confidence of other classes cannot be derived from the confidence of the predicted class (unlike in the binary class version). – Nitesh Nov 08 '14 at 23:05
  • I want to know why need to have (Cheat, yes) in this multi-class classification rule, while the predicted classification (Cheat, No) is selected, so (Cheat, Yes) is redundant. – Xuan Dung Nov 08 '14 at 23:12
  • Its not just the predicted class, the probability with which its predicted is also important. – Nitesh Nov 08 '14 at 23:20
  • I still do not understand why need to have (Cheat, yes) in this multi-class classification rule, you can more explain clean, thanks. – Xuan Dung Nov 08 '14 at 23:26
0

It is important to see all the rules if one of the states of target column is more important than others. For example, if you are predicting fraudulent transactions, you might want to flag something as fraud even if is has 5% probability.

Rubens
  • 4,107
  • 5
  • 23
  • 42