with Microsoft Excel
Learning outcomes
(Part 1 – Data Cleaning)
1. Recognize common data issues within a dataset: incomplete, inaccurate,
inconsistent, duplicate, and unstructured data.
2. Apply basic data cleaning techniques, including the TRIM and CLEAN
functions, to specific examples or scenarios.
3. Demonstrate an understanding of ethical considerations in the data
cleaning process.
Introduction to Data Cleaning
Data cleaning, also known as data cleansing, is the process of identifying
and correcting inconsistencies in datasets to improve their quality.
It also involves addressing inaccuracies, incompleteness, inconsistencies,
and other issues in the data.
Importance of Clean Data
Reliable Insights: Clean data leads to
more accurate analysis
Decision-Making: High-quality data
enhances informed decision-making
Data Integrity: Ensures the reliability
and trustworthiness of data
Common Data Issues
• Incomplete data: Missing values or blanks in the dataset.
• Inaccurate data: Incorrectly entered data, such as misspellings or typos.
• Inconsistent data: Varied data formats or units, or irregular spacing.
• Duplicate data: Repetition of data entries, often caused by data entry
errors or system issues.
• Unstructured data: Data that lacks a clear table-like structure, and the
presence of extra spaces or strange characters.
Data Cleaning Techniques
• Incomplete Data
  • Filter
    • Use filters to identify and filter out rows with missing data.
  • Fill or Delete
    • Fill in the blanks to enter missing data based on neighbouring cells.
    • Delete rows or columns with incomplete data if they are not critical.
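The filter, fill, and delete steps above can also be sketched outside Excel. The following is a minimal Python illustration, not an Excel feature: it assumes a small, hypothetical table stored as a list of dictionaries in which an empty string stands for a blank cell. The column names and function names are invented for this example.

```python
# Hypothetical sample table: each dictionary is one row; "" marks a blank cell.
rows = [
    {"name": "Aina", "city": "Kuala Lumpur"},
    {"name": "Ben", "city": ""},
    {"name": "", "city": "Penang"},
]


def is_blank(value):
    """Treat None and whitespace-only strings as missing."""
    return value is None or str(value).strip() == ""


def rows_with_missing(table):
    """Filter: return only the rows that contain at least one blank cell."""
    return [row for row in table if any(is_blank(v) for v in row.values())]


def fill_down(table, column):
    """Fill: copy the nearest non-blank value above into each blank cell."""
    filled, last_seen = [], ""
    for row in table:
        new_row = dict(row)  # copy so the original row is not modified
        if is_blank(new_row[column]):
            new_row[column] = last_seen
        else:
            last_seen = new_row[column]
        filled.append(new_row)
    return filled


def drop_incomplete(table):
    """Delete: keep only the rows in which every cell has a value."""
    return [row for row in table if not any(is_blank(v) for v in row.values())]


print(rows_with_missing(rows))
print(fill_down(rows, "city"))
print(drop_incomplete(rows))
```

Filling from a neighbouring cell is only appropriate when the neighbouring value really applies to the blank row; otherwise the blank should be left for review or the row removed.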
Data Cleaning Techniques (cont.)
• Inaccurate Data
  • Spelling Checkers
    • Many software applications, including word processors and spreadsheet
      tools like Microsoft Word or Excel, have built-in spelling checkers.
    • Use these tools to automatically identify and correct common
      misspellings.
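The idea behind a spelling check on a data column can be illustrated with a short Python sketch. This is not how Excel's spelling checker works internally; it is an assumption-based example that compares each entry with a hypothetical list of accepted spellings using the standard `difflib` module. The list `VALID_CITIES`, the function name, and the 0.8 similarity cutoff are all illustrative choices.

```python
from difflib import get_close_matches

# Hypothetical list of accepted spellings for one column.
VALID_CITIES = ["Kuala Lumpur", "Penang", "Johor Bahru"]


def suggest_correction(value, valid_values, cutoff=0.8):
    """Return the closest accepted spelling, or the value unchanged if none is close."""
    if value in valid_values:
        return value
    matches = get_close_matches(value, valid_values, n=1, cutoff=cutoff)
    return matches[0] if matches else value


entries = ["Penang", "Penagn", "Kuala Lumpr", "Melaka"]
corrected = [suggest_correction(e, VALID_CITIES) for e in entries]
print(corrected)
```

An entry with no close match, such as "Melaka" here, is left unchanged so that a person can decide whether it is an error or simply a value missing from the accepted list.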
Data Cleaning Techniques (cont.)
• Inconsistent Data
  • Format Cells
    • Standardize the format of cells using formatting options.
  • Convert Units
    • If there are different units, convert them to a common unit for
      consistency.
  • Data Cleaning Functions
    • Use functions like TRIM to remove extra spaces and CLEAN to remove
      non-printable characters.
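To make the effect of TRIM and CLEAN concrete, the sketch below approximates both in Python. It assumes the documented behaviour that TRIM removes leading and trailing spaces and reduces repeated spaces between words to one, and that CLEAN removes the non-printable ASCII characters with codes 0 to 31. The function names `excel_trim` and `excel_clean` are hypothetical, and the sample text is invented.

```python
import re


def excel_trim(text):
    """Approximate TRIM: strip leading/trailing spaces and collapse runs of spaces to one."""
    return re.sub(r" {2,}", " ", text.strip(" "))


def excel_clean(text):
    """Approximate CLEAN: drop the non-printable ASCII control characters (codes 0 to 31)."""
    return "".join(ch for ch in text if ord(ch) >= 32)


# Hypothetical messy entry containing extra spaces, a tab, and a line break.
raw = "  Ali\t   bin   Abu \n"
cleaned = excel_trim(excel_clean(raw))
print(repr(cleaned))
```

In a worksheet, the comparable approach is to nest the two functions, for example =TRIM(CLEAN(A2)), where A2 is a hypothetical cell holding the messy text.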
Data Cleaning Techniques (cont.)
• Duplicate Data
  • Remove Duplicates
    • Utilize the ‘Remove Duplicates’ feature to eliminate identical rows.
• Unstructured Data
  • Text to Columns
    • Use the ‘Text to Columns’ feature to split data into separate columns
      based on delimiters.
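The two features above can be illustrated with a small Python sketch. It is an approximation under stated assumptions, not a description of Excel's implementation: the sample values are invented, the function names are hypothetical, and the `strip()` call that removes spaces around each piece is an extra convenience of this sketch.

```python
def remove_duplicates(rows):
    """Keep the first occurrence of each identical row, preserving order."""
    seen = set()
    unique = []
    for row in rows:
        key = tuple(row)  # lists are not hashable, so compare rows as tuples
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique


def text_to_columns(values, delimiter=","):
    """Split each text value into a list of columns at the delimiter."""
    return [[part.strip() for part in value.split(delimiter)] for value in values]


# Hypothetical unstructured entries: three fields packed into one text value.
raw = ["Aina, Sales, 2023", "Ben, Finance, 2022", "Aina, Sales, 2023"]
table = text_to_columns(raw)
deduplicated = remove_duplicates(table)
print(deduplicated)
```

Splitting first and removing duplicates afterwards means rows are compared column by column, so entries that differ only in spacing around the delimiter are still treated as identical.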
Data Cleaning Ethics: Best Practices
Regular Backup: Save copies of the original data before the cleaning process
to keep it safe and maintain data integrity.
Documentation: Keep a record of changes made during cleaning to ensure
transparency and accountability.
Collaboration: Encourage teamwork to foster a shared responsibility for data
quality in the cleaning process.
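The backup and documentation practices can be sketched in a few lines of Python. This is only an illustration under assumptions: the data is a hypothetical in-memory table rather than a real workbook, and the structure of the change log (row, column, old value, new value) is one possible choice, not a prescribed format.

```python
import copy

# Hypothetical raw data; in practice this would come from the original file.
original = [{"name": "  Aina "}, {"name": "Ben"}]

# Regular backup: work on a copy so the original stays untouched.
working = copy.deepcopy(original)

# Documentation: record every change made during cleaning.
change_log = []

for index, row in enumerate(working):
    cleaned = row["name"].strip()
    if cleaned != row["name"]:
        change_log.append(
            {"row": index, "column": "name", "old": row["name"], "new": cleaned}
        )
        row["name"] = cleaned

print(change_log)
```

Because the original is never modified, any cleaning step recorded in the log can be reviewed or undone by comparing against it.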
Learning outcomes
(Part 2 – Introduction to Microsoft Excel)
By the end of this lesson, students will be able to:
1. Apply basic concepts to create a spreadsheet.
2. Identify various types of data.
3. Apply data entry techniques.
4. Manage data types and formats.
Introduction to Microsoft Excel
Microsoft Excel is a spreadsheet
program for data organization,
analysis, and visualization.
Getting Started with Excel
How to Open or Search for Excel on a Computer?
Excel Interface
Workbook Essentials
• Creating a workbook
• Opening an existing workbook
• Saving a workbook
• Closing a workbook
Workbook vs. Worksheet concept
[Figure: Workbook vs. Worksheet, with labels "Workbook" and "Worksheet"]
Understanding Data Types
Navigating a Worksheet
01 Clicking on a desired cell
02 Name Box
03 F5 or Go To
04 Arrow Keys
Editing Data in a Cell
01 Editing text in a cell
02 Selecting a range
03 Clearing a cell or range
04 Moving data
05 Copying and pasting data
Manipulating a Worksheet
01 Renaming sheet tab
02 Colouring sheet tab
03 Changing column width or row height
Manipulation of Rows, Columns, and Cells
01 Adding a single row
02 Adding multiple rows
03 Adding a single column
04 Adding multiple columns
05 Adding a cell
Formatting Cells in a Worksheet
• Format Text
• Format Number
• Format Dates
• Copy Cell Format Using Format Painter
Handling Rows, Columns and Cells
• Removing cells, rows, and columns
• Merging cells
• Hiding/Unhiding columns/rows
Basic Excel Functions
• A function is a built-in formula that performs a specific operation or
calculation.
• A formula must start with the equal sign (=).
Several examples of functions are:
• TRIM, CLEAN: Basic data cleaning functions
• *SUM, AVERAGE, MIN, MAX: Basic mathematical
functions
• *IF function: Conditional statements
• *VLOOKUP: Basic data lookup
• *Charts: Creating simple visualizations
*The contents will be covered from Week 10 to Week 12.
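As a rough preview of what the listed functions compute, the sketch below shows comparable operations in Python. It is an analogy rather than Excel syntax: the scores, the pass mark of 70, and the lookup table are hypothetical, and the `"#N/A"` fallback simply mimics the error value Excel shows when a lookup finds nothing.

```python
from statistics import mean

# Hypothetical worksheet data: scores in one column and a small lookup table.
scores = [72, 85, 90, 64]
grades_by_id = {"S01": "A", "S02": "B", "S03": "C"}

total = sum(scores)        # comparable to SUM
average = mean(scores)     # comparable to AVERAGE
lowest = min(scores)       # comparable to MIN
highest = max(scores)      # comparable to MAX

# Comparable to IF: choose one of two results based on a condition.
results = ["Pass" if s >= 70 else "Fail" for s in scores]

# Comparable to an exact-match VLOOKUP: find the value stored against a key.
grade = grades_by_id.get("S02", "#N/A")

print(total, average, lowest, highest, results, grade)
```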
Ethics of the day
Honesty and Integrity:
Quranic Principle: "And do not mix the truth with falsehood or conceal the
truth while you know [it]." (Quran, 2:42)
Hadith: The Prophet Muhammad (peace be upon him) emphasized the
importance of honesty and integrity in various sayings. For instance, in
Sahih Muslim, Book 1, Hadith 34, the Prophet said, "Be truthful, for indeed
truthfulness leads to righteousness, and righteousness leads to Paradise."
Additional Resources
• eBook
  • Ultimate Guide to Cleaning Data with Excel
• Website
  • Excel Help & Learning | Microsoft
  • Microsoft Excel Cheat Sheet
  • Excel Tutorial | W3Schools
  • Excel Basic Training Course
  • Top Ten Ways to Clean Your Data
• YouTube
  • Excel for Beginners – The Complete Course
  • Cleaning Data in Excel: Microsoft Excel Crash Course
  • Understanding Data Cleaning
  • What is Data Integrity?