Skip to content

Commit df8902b

Browse files
author
Michael Galarnyk
committed
Added things to the list
1 parent 10d5f82 commit df8902b

File tree

2 files changed

+4
-6
lines changed

2 files changed

+4
-6
lines changed

Statistics/Sample_With_Replacement/.ipynb_checkpoints/SampleWithReplacement-checkpoint.ipynb

Lines changed: 2 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -502,7 +502,6 @@
502502
"metadata": {},
503503
"source": [
504504
"![](images/SampleWithoutReplacement.png)\n",
505-
"Caption: Sampling without replacement. Image by [Michael Galarnyk](https://twitter.com/GalarnykMichael).\n",
506505
"\n",
507506
"Sampling without replacement can be defined as random sampling that DOES NOT allow sampling units to occur more than once. Let's now go over a quick example of how sampling <b>without</b> replacement works.\n",
508507
"\n",
@@ -606,14 +605,13 @@
606605
"cell_type": "markdown",
607606
"metadata": {},
608607
"source": [
609-
"Sampling without replacement is used throughout data science. One very common use is in model validation procedures like [train test split](https://towardsdatascience.com/understanding-train-test-split-scikit-learn-python-ea676d5e3d1) and [cross validation](https://scikit-learn.org/stable/modules/cross_validation.html). In short, each of these procedures allows you to simulate how a machine learning model would perform on new/unseen data. \n",
608+
"Sampling without replacement is used throughout data science. One very common use is in model validation procedures like [train test split](https://builtin.com/data-science/train-test-split) and [cross validation](https://scikit-learn.org/stable/modules/cross_validation.html). In short, each of these procedures allows you to simulate how a machine learning model would perform on new/unseen data. \n",
610609
"\n",
611610
"The image below shows the train test split procedure which consists of splitting a dataset into two pieces: a training set and a testing set. This consists of randomly sampling WITHOUT replacement about 75% (you can vary this) of the rows and putting them into your training set and putting the remaining 25% to your test set. Note that the colors in “Features” and “Target” indicate where their data will go (“X_train”, “X_test”, “y_train”, “y_test”) for a particular train test split.\n",
612611
"\n",
613612
"![](images/TrainTestSplit.png)\n",
614-
"Caption: train test split procedure. Image by [Michael Galarnyk](https://twitter.com/GalarnykMichael).\n",
615613
"\n",
616-
"If you would like to learn more about train test split, you can check out my blog post [Understanding Train Test Split](https://towardsdatascience.com/understanding-train-test-split-scikit-learn-python-ea676d5e3d1)."
614+
"If you would like to learn more about train test split, you can check out my blog post [Understanding Train Test Split](https://builtin.com/data-science/train-test-split)."
617615
]
618616
},
619617
{

Statistics/Sample_With_Replacement/SampleWithReplacement.ipynb

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -605,13 +605,13 @@
605605
"cell_type": "markdown",
606606
"metadata": {},
607607
"source": [
608-
"Sampling without replacement is used throughout data science. One very common use is in model validation procedures like [train test split](https://towardsdatascience.com/understanding-train-test-split-scikit-learn-python-ea676d5e3d1) and [cross validation](https://scikit-learn.org/stable/modules/cross_validation.html). In short, each of these procedures allows you to simulate how a machine learning model would perform on new/unseen data. \n",
608+
"Sampling without replacement is used throughout data science. One very common use is in model validation procedures like [train test split](https://builtin.com/data-science/train-test-split) and [cross validation](https://scikit-learn.org/stable/modules/cross_validation.html). In short, each of these procedures allows you to simulate how a machine learning model would perform on new/unseen data. \n",
609609
"\n",
610610
"The image below shows the train test split procedure which consists of splitting a dataset into two pieces: a training set and a testing set. This consists of randomly sampling WITHOUT replacement about 75% (you can vary this) of the rows and putting them into your training set and putting the remaining 25% to your test set. Note that the colors in “Features” and “Target” indicate where their data will go (“X_train”, “X_test”, “y_train”, “y_test”) for a particular train test split.\n",
611611
"\n",
612612
"![](images/TrainTestSplit.png)\n",
613613
"\n",
614-
"If you would like to learn more about train test split, you can check out my blog post [Understanding Train Test Split](https://towardsdatascience.com/understanding-train-test-split-scikit-learn-python-ea676d5e3d1)."
614+
"If you would like to learn more about train test split, you can check out my blog post [Understanding Train Test Split](https://builtin.com/data-science/train-test-split)."
615615
]
616616
},
617617
{

0 commit comments

Comments
 (0)