Содержание
- 2. Evaluation Why? What? How? Measures Training and test data Significance
- 3. Confusion matrix
- 4. Two classes Two classes: T/F, Positive/Negative
- 5. Two classes Two classes: T/F, Positive/Negative
- 6. Two class measures True positive / false positive / true negative / false negative Accuracy (TP+TN)
- 7. Multi-class measures? True positive / false positive / true negative / false negative Accuracy (TP+TN) /(P+N)
- 8. Evaluation Why? What? How? Measures Training and test data Significance
- 9. Training en test data 1: same data for training en testing Bad idea => why?
- 10. Training en test data 2: holdout / percentage split Complete data set Randomly select x% as
- 11. Training en test data 3: k-fold cross-validation Complete data set Fold 1: Fold 2: Fold 3:
- 12. More cross-validation Leave-one-out Stratified cross-validation
- 13. Evaluation Why? What? How? Measures Training and test data Significance
- 14. Method M1 significantly better than M2? 10-fold cross-validation => n=10 Paired t-test H0: performance M1 same
- 16. Other aspects of performance Efficiency Scalability Robustness Interpretability
- 18. Скачать презентацию