Nd002-Syllabus 2018-June v9
Nd002-Syllabus 2018-June v9
S C H O O L O F D ATA S C I E N C E
Data Analyst
Nanodegree Program Syllabus
Overview
This program prepares learners for a career as a data analyst by helping them learn to organize data, uncover patterns and
insights, draw meaningful conclusions, and clearly communicate critical findings. Learners will develop proficiency in Python
and its data analysis libraries (Numpy, pandas, Matplotlib) and SQL as they build a portfolio of projects to showcase in their job
search.
Depending on how quickly learners work through the material, the amount of time required is variable. We have included
an hourly estimation for each section of the program. The program covers one term of three month (approx. 13 weeks). If
learners spend about 10 hours per week working through the program, they should finish the term within 13 weeks. Learners
will have an additional four weeks beyond the end of the term to complete all projects. In order to succeed in this program, we
recommend having experience working with data in Python (Numpy and Pandas) and SQL.
Data Analyst 2
Program information
Prerequisites
Learners should have experience working with data in Python (specifically Numpy and Pandas) and SQL.
Required Hardware/Software
Learners need access to the internet and a 64-bit computer. Additional software such as Python and its common data analysis
libraries (e.g., Numpy and Pandas) will be required, but the program will guide learners on how to download once the course
has begun.
*The length of this program is an estimation of total hours the average student may take to complete all required
coursework, including lecture and project time. If you spend about 5-10 hours per week working through the program, you
should finish within the time provided. Actual hours may vary.
Data Analyst 3
Course 1
Course Project
Course Project
Investigate a Dataset
In this project, learners will choose one of Udacity’s curated datasets and investigate it using NumPy and
Pandas. They’ll complete the entire data analysis process, starting by posing a question and finishing by
sharing their findings.
Data Analyst 4
Lesson 1 • Learn to use Anaconda to manage packages and environments for use with
Python.
Anaconda
Lesson 2 • Learn to use this open-source web application to combine explanatory text,
math equations, code, and visualizations in one sharable document.
Jupyter Notebooks
Lesson 3 • Learn about the keys steps of the data analysis process.
Data Analysis Process • Investigate multiple datasets using Python and Pandas.
Pandas & NumPy: • Learn to use NumPy and Pandas to wrangle, explore, analyze, and visualize
Pandas & NumPy: • Learn more about NumPy and Pandas to wrangle, explore, analyze, and
Lesson 6
• Learn about how to carry out analysis outside Jupyter notebook using IPython
Programming Workflow for or the command line interface.
Data Analysis
Data Analyst 5
Course 2
Practical Statistics
Learn how to apply inferential statistics and probability to real-world scenarios, such as analyzing A/B tests and building
supervised learning models.
Course Project
Lesson 1
• Examine a case study to learn about Simpson’s Paradox.
Simpson’s Paradox
Lesson 2
• Learn the fundamental rules of probability.
Probability
Data Analyst 6
Lesson 4
• Learn about conditional probability, i.e., when events are not independent.
Conditional Probability
Lesson 6 • Convert distributions into the standard normal distribution using the Z-score.
Sampling Distributions & • Use the Z-table to look up the proportions of observations above, below, or in
Lesson 9 • Use critical values to make decisions on whether or not a treatment has
changed the value of a population parameter.
Hypothesis Testing
Lesson 10 • Test the effect of a treatment or compare the difference in means for two
groups when we have small sample sizes.
T-Tests & A/B Tests
Data Analyst 7
Lesson 12 • Use multiple linear regression results to interpret coefficients for several
predictors.
Multiple Linear Regression
Lesson 13 • Use logistic regression results to make a prediction about the relationship
between categorical dependent variables and predictors.
Logistic Regression
Course 3
Data Wrangling
Learn the data wrangling process of gathering, assessing, and cleaning data. Learn to use Python to wrangle data
programmatically and prepare it for analysis.
Course Project
Data Analyst 8
• Identify each step of the data wrangling process (gathering, assessing, and
Lesson 1 cleaning).
Intro to Data Wrangling • Wrangle a CSV file downloaded from Kaggle using fundamental gathering,
assessing, and cleaning code.
Lesson 3 • Distinguish between dirty data (content or “quality” issues) and messy data
(structural or “tidiness” issues).
Assessing Data
• Identify data quality issues and categorize them using metrics: validity,
accuracy, completeness, consistency, and uniformity.
• Identify each step of the data cleaning process (defining, coding, and testing).
Lesson 4
• Clean data using Python and Pandas.
Cleaning Data
• Test cleaning code visually and programmatically using Python.
Data Analyst 9
Course 4
Course Project
Data Visualization in Data • Know what distinguishes exploratory analysis from Explanatory analysis, and
Design of Visualizations • Understand various pitfalls that can affect the effectiveness and truthfulness of
visualizations.
Data Analyst 10
• Use bar charts to depict distributions of categorical variables.
Lesson 3
• Use histograms to depict distributions of numeric variables.
Univariate Exploration of Data
• Use axis limits and different scales to change how your data is interpreted.
• Use encodings like size, shape, and color to encode values of a third variable in
Lesson 5 a visualization.
Data Analyst 11
Meet your instructors.
Josh Bernhard
Data Scientist at Nerd Wallet
Josh has been sharing his passion for data for nearly a decade at all levels of university and as Lead
Data Science Instructor at Galvanize. He’s used data science for work ranging from cancer research
to process automation.
Sebastian Thrun
Founder and Executive Chairman of Udacity
As the founder and president of Udacity, Sebastian’s mission is to democratize education. He is also
the founder of Google X, where he led projects including the Self-Driving Car, Google Glass, and
more.
Derek Steer
CEO at MODE
Derek is the CEO of Mode Analytics. He developed an analytical foundation at Facebook and
Yammer and is passionate about sharing it with future analysts. He authored SQL School and is
a mentor at Insight Data Science.
Juno Lee
Curriculum Lead at Udacity
Juno is the curriculum lead for the School of Data Science. She has been sharing her passion
for data and teaching, building several courses at Udacity. As a data scientist, she built
recommendation engines, computer vision and NLP models, and tools to analyze user behavior.
Data Analyst 12
Mike Yi
Data Analyst Instructor
Mike is a content developer with a multidisciplinary academic background, including math, statistics,
physics, and psychology. Previously, he worked on Udacity’s Data Analyst Nanodegree program as a
support lead.
David Venturi
Data Analyst Instructor
Formerly a chemical engineer and data analyst, David created a personalized data science master’s
program using online resources. He has studied hundreds of online courses and is excited to bring
the best to Udacity students.
Sam Nelson
Product Lead
Sam is the product lead for Udacity’s Data Analyst, Business Analyst, and Data Foundations
programs. He’s worked as an analytics consultant on projects in several industries and is
passionate about helping others improve their data skills.
Data Analyst 13
Udacity’s learning
experience
Data Analyst 14
Our proven approach for building
job-ready digital skills.
Experienced Project Reviewers
• Project review cycle creates a feedback loop with multiple opportunities for
improvement—until the concept is mastered.
• Project reviewers leverage industry best practices and provide pro tips.
• Unlimited access to mentors means help arrives when it’s needed most.
• 2 hr or less average question response time assures that skills development stays on track.
Empower job-readiness.
• Access to a Github portfolio review that can give you an edge by highlighting your
strengths, and demonstrating your value to employers.*
• Get help optimizing your LinkedIn and establishing your personal brand so your profile
ranks higher in searches by recruiters and hiring managers.
Mentor Network
• Mentors work across more than 30 different industries and often complete a Nanodegree
program themselves.
Data Analyst 15
Learn more at
www.udacity.com/online-learning-for-individuals →
01.16.23 | V1.0