Ngevaluasi model ML apa artinya ngukur lan-lan model kuwi bisa kerja — nggunakake metrics yang sesuai (accuracy, precision, recall, etc.) ing test data sing ora diweruhi model. Evaluasi sing bener-bener penting supaya ngerti apa model kuwi bener-bener bisa kerja lan bisa dipercaya.
Ngevaluasi ing data sing durung diweruhi
→ evaluate on a TEST set the model did NOT train on → measures GENERALIZATION (real performance)
→ training accuracy alone is misleading (a model can memorize training data)
→ train/validation/test split; cross-validation → reliable performance estimates
