Skip to content

Commit 9ba1799

Browse files
committed
Crosslinking
1 parent a0e47dd commit 9ba1799

File tree

2 files changed

+12
-5
lines changed

2 files changed

+12
-5
lines changed

_posts/2015-12-03-rpart-coursera-har.html

Lines changed: 7 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -70,7 +70,11 @@
7070
<h2>Abstract</h2>
7171
<p>‘Practical Machine Learning’ course project. The course is kindly provided by Johns Hopkins University and Coursera. The project requires the use of machine learning techniques to analyze <a href="http://groupware.les.inf.puc-rio.br/har">Human Activity Recognition (HAR) data</a> and predict the activity ‘quality’ (classe column) performed by the wired user.</p>
7272
<p>An important and useful goal of the project is nonetheless to practice a mix of techniques such as k-fold cross validation, features selection, identification of correlated and zero variance predictors, that can be used to build a forecasting algorithm.</p>
73-
<p>For this case, we use trees to predict the ‘classe’ feature and we report the Out of Sample prediction accuracy, which is not particularly good (0.47). In order to better the accuracy, we therefore resort to a different algorithm, random forest, tailoring the model on the features present in the testing set.</p>
73+
<p>For this case, we use trees to predict the ‘classe’ feature and we report the
74+
Out of Sample prediction accuracy, which is not particularly good (0.47). In
75+
order to better the accuracy, we therefore <a
76+
href="http://giuppo.github.io/random-forests-coursera-har/">resort to a
77+
different algorithm</a>, random forest, tailoring the model on the features present in the testing set.</p>
7478
<ul>
7579
<li><a href="https://d396qusza40orc.cloudfront.net/predmachlearn/pml-training.csv">Training data</a></li>
7680
<li><a href="https://d396qusza40orc.cloudfront.net/predmachlearn/pml-testing.csv">Testing data</a></li>
@@ -153,7 +157,8 @@ <h3>Build model and predict from fold</h3>
153157
</div>
154158
<div id="expected-out-of-sample-accuracy" class="section level3">
155159
<h3>Expected out of sample accuracy</h3>
156-
<p>The estimated out of sample accuracy is the average (with standard deviation) over the results obtained with the different folds. Mean accuracy is 0.47, surely not a great results.</p>
160+
<p>The estimated out of sample accuracy is the average (with standard deviation) over the results obtained with the different folds. Mean accuracy is 0.47, surely not a great results.
161+
However, this was not the goal of this study. <a href="http://giuppo.github.io/random-forests-coursera-har/">Random forests reach 99% prediction accuracy</a></p>
157162
<pre class="r"><code>paste(round(mean(accs),2), round(sd(accs),2))</code></pre>
158163
<pre><code>## [1] &quot;0.47 0.06&quot;</code></pre>
159164
</div>

_posts/2015-12-18-random-forests-coursera-har.html

Lines changed: 5 additions & 3 deletions
Large diffs are not rendered by default.

0 commit comments

Comments
 (0)