Computing Optimal Cut-Offs
Probabilities from classification models can have two problems:
1. Miscalibration: A p of .9 often doesn’t mean a 90% chance of 1 (assuming a dichotomous y). (You can calibrate it using isotonic regression.)
2. Optimal cut-offs: For multi-class classifiers, we do not know what probability value will maximize the