• Welcome!
  • Blogs

Matrix, Control and Robotics

Matrix, Control and Robotics

Category Archives: Statistics

Terminologies in statistics

27 Monday Apr 2015

Posted by junkailu in Statistics

≈ Leave a comment

Tags

blog

  1. k-fold cross-validation

In k-fold cross-validation, the original sample is randomly partitioned into k equal size subsamples. Of the k subsamples, a single subsample is retained as the validation data for testing the model, and the remaining k − 1 subsamples are used as training data. The cross-validation process is then repeated k times (the folds), with each of the k subsamples used exactly once as the validation data. The k results from the folds can then be averaged (or otherwise combined) to produce a single estimation. The advantage of this method over repeated random sub-sampling (see below) is that all observations are used for both training and validation, and each observation is used for validation exactly once. 10-fold cross-validation is commonly used, but in general k remains an unfixed parameter.

When k = n (the number of observations), the k-fold cross-validation is exactly the leave-one-out cross-validation.

In stratified k-fold cross-validation, the folds are selected so that the mean response value is approximately equal in all the folds. In the case of a dichotomous classification, this means that each fold contains roughly the same proportions of the two types of class labels.

Reference: Wikipedia

Categories

  • Hardware
  • Kalman Filter
  • LaTeX
  • Linear Systems
  • MATLAB
  • Robotics
  • Statistics

Blog at WordPress.com.

  • Subscribe Subscribed
    • Matrix, Control and Robotics
    • Already have a WordPress.com account? Log in now.
    • Matrix, Control and Robotics
    • Subscribe Subscribed
    • Sign up
    • Log in
    • Report this content
    • View site in Reader
    • Manage subscriptions
    • Collapse this bar