Tech Notes

My notes on Statistics, Big Data, Cloud Computing, Cyber Security

Tag Archives: training set

Decision Trees and Prediction with trees

A decision tree is a kind of flowchart — a graphical representation of the process for making a decision or a series of decisions. Read more of this post

Advertisements

Prediction Study Design

Steps in building a prediction
1. Find the right data
2. Define your error rate
3. Split data into:

  • Training
  • Testing
  • Validation (Optional)

4. On the training set pick features
5. On the training set pick prediction function
6. On the training set cross-validate
7. If no validation – apply 1x to test set
8. If validation – apply to test set and refine
9. If validation – apply 1x to validation Read more of this post