过拟合(模型记住训练数据但在新数据上失败)和欠拟合(模型过于简单无法捕捉模式)是机器学习中的两个基本问题。平衡两者——实现良好的泛化——是构建有效模型的中心。
过拟合与欠拟合
text
OVERFITTING → the model learns the training data TOO well (including noise) →
→ performs great on training data but POORLY on new/unseen data (doesn't generalize)
→ too complex; memorizes rather than learns general patterns
→ like memorizing answers vs understanding the concept
UNDERFITTING → the model is TOO SIMPLE to capture the underlying patterns →
→ performs poorly on BOTH training and new data
→ not enough complexity/capacity to learn the patterns
→ the goal is GENERALIZATION: learn real patterns → perform well on NEW data
