0% found this document useful (0 votes)
43 views16 pages

Nd002-Syllabus 2018-June v9

This program prepares learners for a career as a data analyst by helping them develop proficiency in Python, SQL, and data analysis techniques. The program covers four months of coursework focusing on data analysis, statistics, and wrangling skills.

Uploaded by

Bert Reinald
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
43 views16 pages

Nd002-Syllabus 2018-June v9

This program prepares learners for a career as a data analyst by helping them develop proficiency in Python, SQL, and data analysis techniques. The program covers four months of coursework focusing on data analysis, statistics, and wrangling skills.

Uploaded by

Bert Reinald
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 16

INDIVIDUAL LEARNERS

S C H O O L O F D ATA S C I E N C E

Data Analyst
Nanodegree Program Syllabus
Overview
This program prepares learners for a career as a data analyst by helping them learn to organize data, uncover patterns and
insights, draw meaningful conclusions, and clearly communicate critical findings. Learners will develop proficiency in Python
and its data analysis libraries (Numpy, pandas, Matplotlib) and SQL as they build a portfolio of projects to showcase in their job
search.

Depending on how quickly learners work through the material, the amount of time required is variable. We have included
an hourly estimation for each section of the program. The program covers one term of three month (approx. 13 weeks). If
learners spend about 10 hours per week working through the program, they should finish the term within 13 weeks. Learners
will have an additional four weeks beyond the end of the term to complete all projects. In order to succeed in this program, we
recommend having experience working with data in Python (Numpy and Pandas) and SQL.

Built in collaboration with:

Data Analyst 2
Program information

Estimated Time Skill Level

4 months at 10hrs/week* Intermediate

Prerequisites

Learners should have experience working with data in Python (specifically Numpy and Pandas) and SQL.

Required Hardware/Software

Learners need access to the internet and a 64-bit computer. Additional software such as Python and its common data analysis
libraries (e.g., Numpy and Pandas) will be required, but the program will guide learners on how to download once the course
has begun.

*The length of this program is an estimation of total hours the average student may take to complete all required
coursework, including lecture and project time. If you spend about 5-10 hours per week working through the program, you
should finish within the time provided. Actual hours may vary.

Data Analyst 3
Course 1

Introduction to Data Analysis


Learn the data analysis process of wrangling, exploring, analyzing, and communicating data. Work with data in Python, using
libraries like NumPy and Pandas.

Course Project

Explore Weather Trends


This project will introduce learners to the SQL and how to download data from a database. They’ll analyze
local and global temperature data and compare the temperature trends where they live to overall global
temperature trends.

Course Project

Investigate a Dataset
In this project, learners will choose one of Udacity’s curated datasets and investigate it using NumPy and
Pandas. They’ll complete the entire data analysis process, starting by posing a question and finishing by
sharing their findings.

Data Analyst 4
Lesson 1 • Learn to use Anaconda to manage packages and environments for use with
Python.
Anaconda

Lesson 2 • Learn to use this open-source web application to combine explanatory text,
math equations, code, and visualizations in one sharable document.
Jupyter Notebooks

Lesson 3 • Learn about the keys steps of the data analysis process.

Data Analysis Process • Investigate multiple datasets using Python and Pandas.

Lesson 4 • Perform the entire data analysis process on a dataset.

Pandas & NumPy: • Learn to use NumPy and Pandas to wrangle, explore, analyze, and visualize

Case Study 1 data.

Lesson 5 • Perform the entire data analysis process on a dataset.

Pandas & NumPy: • Learn more about NumPy and Pandas to wrangle, explore, analyze, and

Case Study 2 visualize data.

Lesson 6
• Learn about how to carry out analysis outside Jupyter notebook using IPython
Programming Workflow for or the command line interface.
Data Analysis

Data Analyst 5
Course 2

Practical Statistics
Learn how to apply inferential statistics and probability to real-world scenarios, such as analyzing A/B tests and building
supervised learning models.

Course Project

Analyze Experiment Results


In this project, learners will be provided a dataset reflecting data collected from an experiment.
They’ll use statistical techniques to answer questions about the data and report their conclusions and
recommendations in a report.

Lesson 1
• Examine a case study to learn about Simpson’s Paradox.
Simpson’s Paradox

Lesson 2
• Learn the fundamental rules of probability.
Probability

• Learn about binomial distribution where each observation represents one of


Lesson 3
two outcomes.
Binomial Distribution • Derive the probability of a binomial distribution.

Data Analyst 6
Lesson 4
• Learn about conditional probability, i.e., when events are not independent.
Conditional Probability

Lesson 5 • Build on conditional probability principles to understand the Bayes rule.

Bayes Rule • Derive the Bayes theorem.

Lesson 6 • Convert distributions into the standard normal distribution using the Z-score.

Standardizing • Compute proportions using standardized distributions.

Lesson 7 • Use normal distributions to compute probabilities.

Sampling Distributions & • Use the Z-table to look up the proportions of observations above, below, or in

Central Limit Theorem between values.

Lesson 8 • Estimate population parameters from sample statistics using confidence


intervals.
Confidence Intervals

Lesson 9 • Use critical values to make decisions on whether or not a treatment has
changed the value of a population parameter.
Hypothesis Testing

Lesson 10 • Test the effect of a treatment or compare the difference in means for two
groups when we have small sample sizes.
T-Tests & A/B Tests

• Build a linear regression model to understand the relationship between


Lesson 11
independent and dependent variables.
Regression • Use linear regression results to make a prediction.

Data Analyst 7
Lesson 12 • Use multiple linear regression results to interpret coefficients for several
predictors.
Multiple Linear Regression

Lesson 13 • Use logistic regression results to make a prediction about the relationship
between categorical dependent variables and predictors.
Logistic Regression

Course 3

Data Wrangling
Learn the data wrangling process of gathering, assessing, and cleaning data. Learn to use Python to wrangle data
programmatically and prepare it for analysis.

Course Project

Wrangle & Analyze Data


Real-world data rarely comes clean. Using Python, learners will gather data from a variety of sources, assess
its quality and tidiness, then clean it. They’ll document their wrangling efforts in a Jupyter Notebook, plus
showcase them through analyses and visualizations using Python and SQL.

Data Analyst 8
• Identify each step of the data wrangling process (gathering, assessing, and
Lesson 1 cleaning).

Intro to Data Wrangling • Wrangle a CSV file downloaded from Kaggle using fundamental gathering,
assessing, and cleaning code.

• Gather data from multiple sources, including gathering files, programmatically


downloading files, web-scraping data, and accessing data from APIs.
Lesson 2
• Import data of various file formats into Pandas, including flat files (e.g. TSV),
Gathering Data HTML files, TXT files, and JSON files.

• Store gathered data in a PostgreSQL database.

• Assess data visually and programmatically using Pandas.

Lesson 3 • Distinguish between dirty data (content or “quality” issues) and messy data
(structural or “tidiness” issues).
Assessing Data
• Identify data quality issues and categorize them using metrics: validity,
accuracy, completeness, consistency, and uniformity.

• Identify each step of the data cleaning process (defining, coding, and testing).
Lesson 4
• Clean data using Python and Pandas.
Cleaning Data
• Test cleaning code visually and programmatically using Python.

Data Analyst 9
Course 4

Data Visualization with Python


Learn to apply visualization principles to the data analysis process. Explore data visually at multiple levels to find insights and
create a compelling story.

Course Project

Communicate Data Findings


Learners will use Python’s data visualization tools to systematically explore a selected dataset for its
properties and relationships between variables. Then, they will create a presentation that communicates
their findings to others.

Lesson 1 • Understand why visualization is important in the practice of data analysis.

Data Visualization in Data • Know what distinguishes exploratory analysis from Explanatory analysis, and

Analysis the role of data visualization in each.

• Interpret features in terms of level of measurement.


Lesson 2 • Know different encodings that can be used to depict data in visualizations.

Design of Visualizations • Understand various pitfalls that can affect the effectiveness and truthfulness of
visualizations.

Data Analyst 10
• Use bar charts to depict distributions of categorical variables.
Lesson 3
• Use histograms to depict distributions of numeric variables.
Univariate Exploration of Data
• Use axis limits and different scales to change how your data is interpreted.

• Use scatterplots to depict relationships between numeric variables.

• Use clustered bar charts to depict relationships between categorical variables.


Lesson 4
• Use violin and bar charts to depict relationships between categorical and
Bivariate Exploration of Data numeric variables.

• Use faceting to create plots across different subsets of the data.

• Use encodings like size, shape, and color to encode values of a third variable in
Lesson 5 a visualization.

• Use plot matrices to explore relationships between multiple variables at the


Multivariate Exploration of
same time.
Data
• Use feature engineering to capture relationships between variables.

• Understand what it means to tell a compelling story with data.


Lesson 6
• Choose the best plot type, encodings, and annotations to polish your plots.
Explanatory Visualizations
• Create a slide deck using a Jupyter Notebook to convey your findings.

Lesson 7 • Apply your knowledge of data visualization to a dataset involving the


characteristics of diamonds and their prices.
Visualization Case Study

Data Analyst 11
Meet your instructors.

Josh Bernhard
Data Scientist at Nerd Wallet

Josh has been sharing his passion for data for nearly a decade at all levels of university and as Lead
Data Science Instructor at Galvanize. He’s used data science for work ranging from cancer research
to process automation.

Sebastian Thrun
Founder and Executive Chairman of Udacity

As the founder and president of Udacity, Sebastian’s mission is to democratize education. He is also
the founder of Google X, where he led projects including the Self-Driving Car, Google Glass, and
more.

Derek Steer
CEO at MODE

Derek is the CEO of Mode Analytics. He developed an analytical foundation at Facebook and
Yammer and is passionate about sharing it with future analysts. He authored SQL School and is
a mentor at Insight Data Science.

Juno Lee
Curriculum Lead at Udacity

Juno is the curriculum lead for the School of Data Science. She has been sharing her passion
for data and teaching, building several courses at Udacity. As a data scientist, she built
recommendation engines, computer vision and NLP models, and tools to analyze user behavior.

Data Analyst 12
Mike Yi
Data Analyst Instructor

Mike is a content developer with a multidisciplinary academic background, including math, statistics,
physics, and psychology. Previously, he worked on Udacity’s Data Analyst Nanodegree program as a
support lead.

David Venturi
Data Analyst Instructor

Formerly a chemical engineer and data analyst, David created a personalized data science master’s
program using online resources. He has studied hundreds of online courses and is excited to bring
the best to Udacity students.

Sam Nelson
Product Lead

Sam is the product lead for Udacity’s Data Analyst, Business Analyst, and Data Foundations
programs. He’s worked as an analytics consultant on projects in several industries and is
passionate about helping others improve their data skills.

Data Analyst 13
Udacity’s learning
experience

Hands-on Projects Quizzes


Open-ended, experiential projects are designed Auto-graded quizzes strengthen comprehension.
to reflect actual workplace challenges. They aren’t Learners can return to lessons at any time during
just multiple choice questions or step-by-step the course to refresh concepts.
guides, but instead require critical thinking.

Knowledge Custom Study Plans


Find answers to your questions with Knowledge, Create a personalized study plan that fits your
our proprietary wiki. Search questions asked by individual needs. Utilize this plan to keep track of
other students, connect with technical mentors, movement toward your overall goal.
and discover how to solve the challenges that
you encounter.

Workspaces Progress Tracker


See your code in action. Check the output and Take advantage of milestone reminders to stay
quality of your code by running it on interactive on schedule and complete your program.
workspaces that are integrated into the platform.

Data Analyst 14
Our proven approach for building
job-ready digital skills.
Experienced Project Reviewers

Verify skills mastery.


• Personalized project feedback and critique includes line-by-line code review from
skilled practitioners with an average turnaround time of 1.1 hours.

• Project review cycle creates a feedback loop with multiple opportunities for
improvement—until the concept is mastered.

• Project reviewers leverage industry best practices and provide pro tips.

Technical Mentor Support

24/7 support unblocks learning.


• Learning accelerates as skilled mentors identify areas of achievement and potential
for growth.

• Unlimited access to mentors means help arrives when it’s needed most.

• 2 hr or less average question response time assures that skills development stays on track.

Personal Career Services

Empower job-readiness.
• Access to a Github portfolio review that can give you an edge by highlighting your
strengths, and demonstrating your value to employers.*

• Get help optimizing your LinkedIn and establishing your personal brand so your profile
ranks higher in searches by recruiters and hiring managers.

Mentor Network

Highly vetted for effectiveness.


• Mentors must complete a 5-step hiring process to join Udacity’s selective network.

• After passing an objective and situational assessment, mentors must demonstrate


communication and behavioral fit for a mentorship role.

• Mentors work across more than 30 different industries and often complete a Nanodegree
program themselves.

*Applies to select Nanodegree programs only.

Data Analyst 15
Learn more at
www.udacity.com/online-learning-for-individuals →

01.16.23 | V1.0

You might also like