split (partition) data up into major chunks maximizing change in explained deviance when Gaussian error, maximizing between group SS minimizing SSerror
recursively partition (split) decision tree simple trees tend to overfit. error is fitted along with the model
R packages simple CART library(tree) an extension that facilitates (some) non-gaussian errors library(rpart)
Limitations crude overfitting protection low resolution limited error distributions little scope for random effects
Boosting machine learning meets predictive modelling ensemble models sequence of simple Trees (10,000+ trees) built to predict residuals of previous tree shrinkage produce excellent fit
minimizing square error loss test (validation) data 75% train, 25% test out of bag 50% in, 50% out cross validation 3 folds