CIFAR 10 vs. CIFAR 100 is the most popular benchmark dataset for Out-of-Distribution (OOD) performance evaluation. Google in their 2022 post "towards-reliability-in-deep-learning"[1] used CIFAR 10 vs. CIFAR 100 to demo their new state-of-the-art model pixel. The main feature of CIFAR 10 vs. CIFAR 100 is mutual exclusivity, meaning that CIFAR 100 only includes Negative-Examples of CIFAR 10. For example, CIFAR 10 has classes "automobiles" and "trucks" but not the class "pickup trucks" which only CIFAR 100 does. However, the class "pickup trucks" in CIFAR 100 is under the superclass "vehicle 2" not "automobiles" or "trucks". So how to use CIFAR 100 to test the OOD of CIFAR 10?
[1] https://ai.googleblog.com/2022/07/towards-reliability-in-deep-learning.html