CleanData_Coursera

Repository contains Coursera Getting nad Cleaning Data Course project.

The main script in this project is 'run_analysis.R' which performs the following tasks:

Downloads data from https://d396qusza40orc.cloudfront.net/getdata%2Fprojectfiles%2FUCI%20HAR%20Dataset.zip
Unzips all files into "./data" directory
Reads all the data for Subject, X, and Y, both training and test datasets using read.csv function.
Uses rbind to merge training and test data sets, and 'cbind' to merge Subject, Activity and Measurements.
Gives hard-coded names for Subject and Activity columns, reads "features.txt" to give meaningful names for other columns in the combined data set.
Uses grepl to find all column names containing "mean" or "std". Columns containing "meanFreq" are excluded since these contain rate of measurement but not sensor data.
Reads IDs and names of activities from "activity_labels.txt" and converts activity ids in the dataset to descriptive names of activities using factor.
Uses ddply to create a second data set with the average of each variable for each activity and each subject. Data is split by variables Subject and Activity, colMeans are used to calculate means on all columns except first two (Subject and Activity).
Uses write.csv to save the final tidy dataset as a CSV file names "./data/tidydata.txt".

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
data		data
.gitignore		.gitignore
CodeBook.md		CodeBook.md
README.md		README.md
run_analysis.R		run_analysis.R

Provide feedback