LM #01-Introduction To ML
Introduction
11/17/2024 4
Preamble (contd.)
Traditional Programming
Data + Program → Computer → Output
Machine Learning
Data + Output → Computer → Program
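The contrast between the two diagrams can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the temperature-conversion task, the function names `fahrenheit_rule` and `learn_line`, and the sample values are all made up for this example and do not come from the slides. In the first half a person writes the program; in the second half the computer is given data and the desired outputs and produces the program (a fitted line) itself.

```python
# Traditional programming: a person writes the rule (the program).
def fahrenheit_rule(celsius):
    return 1.8 * celsius + 32.0


# Machine learning: the computer receives data and the desired outputs
# and returns a program (here, a line fitted by ordinary least squares).
def learn_line(inputs, outputs):
    n = len(inputs)
    mean_x = sum(inputs) / n
    mean_y = sum(outputs) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(inputs, outputs))
    sxx = sum((x - mean_x) ** 2 for x in inputs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return lambda x: slope * x + intercept


# Hypothetical example data: inputs and the outputs observed for them.
data = [-10.0, 0.0, 15.0, 30.0, 100.0]
observed = [fahrenheit_rule(c) for c in data]  # stands in for measurements

learned_program = learn_line(data, observed)
print(learned_program(37.0))
```

Because the example outputs here are noise-free, the learned program should agree closely with the hand-written rule; with real measurements it would only approximate it.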
Preamble:
Magic?
No, more like gardening
• Seeds = Algorithms
• Nutrients = Data
• Gardener = You
• Plants = Programs
What is Machine Learning?
• “The goal of machine learning is to build
computer systems that can adapt and learn
from their experience.”
– Tom Dietterich
What is Machine Learning? (contd.)
• Machine Learning
– Study of algorithms that
– improve their performance
– at some task
– with experience
• Optimize a performance criterion using example data or past
experience.
• Role of Statistics: Inference from a sample
• Role of computer science: efficient algorithms to
– Solve the optimization problem
– Represent and evaluate the model for inference
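As a rough illustration of "optimize a performance criterion using example data", the sketch below fits a one-parameter model y ≈ w·x by gradient descent on the mean squared error. Everything in it is an assumption chosen for the example: the four data points, the model form, the learning rate, and the names `mse`, `mse_gradient`, and `gradient_descent` are not taken from the slides, and gradient descent is just one of many possible optimization algorithms.

```python
# Hypothetical example data (inputs xs, observed outputs ys).
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.1, 3.9, 6.2, 7.8]


def mse(w):
    """Performance criterion: mean squared error of the model y = w * x."""
    return sum((w * x - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


def mse_gradient(w):
    """Derivative of the mean squared error with respect to w."""
    return sum(2.0 * x * (w * x - y) for x, y in zip(xs, ys)) / len(xs)


def gradient_descent(w=0.0, learning_rate=0.05, steps=10):
    """Repeatedly move w against the gradient, recording the error."""
    history = [mse(w)]
    for _ in range(steps):
        w -= learning_rate * mse_gradient(w)
        history.append(mse(w))
    return w, history


w, history = gradient_descent()
print("learned w:", w)
print("error before and after:", history[0], history[-1])
```

The recorded error values shrink from step to step, which is one concrete reading of "improve their performance at some task with experience".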
Another Definition of Machine Learning
• Machine Learning algorithms discover the
relationships between the variables of a system
(input, output and hidden) from direct samples of
the system
A Generic System
Inputs: x1, x2, …, xN
System with hidden variables: h1, h2, …, hK
Outputs: y1, y2, …, yM
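One simple way to look for relationships between system variables from direct samples is to measure how strongly each input varies together with the output. The sketch below does this with the Pearson correlation coefficient. It is an illustration under assumptions: the toy system (output = 2·x1 + h), the sample values, and the names `pearson`, `x1`, `x2`, `h`, and `y` are invented for this example, and correlation only detects linear relationships.

```python
from math import sqrt


def pearson(a, b):
    """Pearson correlation coefficient between two equal-length samples."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((p - mean_a) * (q - mean_b) for p, q in zip(a, b))
    var_a = sum((p - mean_a) ** 2 for p in a)
    var_b = sum((q - mean_b) ** 2 for q in b)
    return cov / sqrt(var_a * var_b)


# Hypothetical samples of a system with two inputs and one hidden variable.
x1 = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]    # input that drives the output
x2 = [1.0, -1.0, -1.0, 1.0, 0.0, 0.0]  # input that does not affect the output
h = [0.1, -0.1, 0.2, -0.2, 0.0, 0.0]   # hidden variable, never shown to the learner

# The system itself: the learner only ever sees x1, x2 and y.
y = [2.0 * a + c for a, c in zip(x1, h)]

print("corr(x1, y):", pearson(x1, y))
print("corr(x2, y):", pearson(x2, y))
```

In this toy setup the first input correlates strongly with the output while the second barely does, even though the hidden variable was never observed.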
• Economically efficient
• Can consider larger data spaces and hypothesis spaces than
people can
• Can formalize learning problem to explicitly
identify/describe goals and criteria
Facts Encouraging ML…
• Learning general models from data of particular examples
• Data is cheap and abundant (data warehouses, data marts);
Knowledge is expensive and scarce
• Example in retail: from customer transactions to consumer
behavior (recommender system):
customers who bought “Advances in Knowledge Discovery
and Data Mining” also bought “Data Mining: Practical Machine
Learning Tools and Techniques with Java Implementations”
(www.amazon.com)
• Build a model that is a good and useful approximation to
the data
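The "customers who bought X also bought Y" idea can be sketched by counting how often items appear together in the same transaction. This is a minimal co-occurrence count, assuming made-up baskets and placeholder item names such as `kdd_book` and `weka_book`; it is not a description of how any real retailer's recommender works.

```python
from collections import Counter
from itertools import combinations

# Hypothetical customer transactions (each set is one basket).
baskets = [
    {"kdd_book", "weka_book", "stats_book"},
    {"kdd_book", "weka_book"},
    {"kdd_book", "cookbook"},
    {"weka_book", "stats_book"},
]

# Count how often each pair of items was bought together.
pair_counts = Counter()
for basket in baskets:
    for a, b in combinations(sorted(basket), 2):
        pair_counts[(a, b)] += 1


def also_bought(item, top_n=3):
    """Items most often bought together with `item`, most frequent first."""
    scores = Counter()
    for (a, b), count in pair_counts.items():
        if a == item:
            scores[b] += count
        elif b == item:
            scores[a] += count
    return [other for other, _ in scores.most_common(top_n)]


print(also_bought("kdd_book"))
```

Raw counts favor popular items, so practical systems usually normalize them in some way; the sketch leaves that out to keep the idea visible.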
Successful Machine Learning Applications
• Speech recognition
– Telephone menu navigation
• Computer vision
– Mail sorting
• Bio-surveillance
– Identifying disease outbreaks
• Robot control
– Autonomous driving
• Empirical science
• Information extraction
• Social networks
• Debugging
• [Your favorite areas]…
Applications (contd.)
• Speech and hand-writing recognition
• Autonomous robot control
• Data mining and bioinformatics: motifs, alignment, …
• Playing games
• Fault detection
• Clinical diagnosis
• Spam email detection
• Retail: Market basket analysis, Customer relationship
management (CRM)
• Finance: Credit scoring, fraud detection
• Manufacturing: Optimization, troubleshooting
• Medicine: Medical diagnosis
• Telecommunications: Quality of service optimization
• Web mining: Search engines
• etc.
Applications are diverse but methods are generic
When are ML algorithms NOT needed?
• When the relationships between all system
variables (input, output, and hidden) are
completely understood!
Relevant disciplines
• Algorithms
• Artificial intelligence
• Control
• Statistics
• Information theory
• Dynamical systems
• Neurobiology
• Signal processing
• Linear algebra
• Etc.
Researchers in machine learning come from a variety of backgrounds.
+ Machine learning: features are extracted manually, typically from
simple data sets (e.g. text)
Relevant emerging Disciplines
Supervised Learning: Classification, Regression
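The difference between the two supervised tasks is the kind of output: classification predicts a category, regression predicts a number. The sketch below uses a small nearest-neighbour approach for both, purely as an illustration. The data sets (`emails`, `houses`), their features, and the function names are hypothetical, and nearest neighbours is only one of many methods that could be used here.

```python
def nearest(train, x, k):
    """Return the k training pairs whose input value is closest to x."""
    return sorted(train, key=lambda pair: abs(pair[0] - x))[:k]


def classify(train, x, k=3):
    """Classification: majority label among the k nearest examples."""
    labels = [label for _, label in nearest(train, x, k)]
    return max(set(labels), key=labels.count)


def regress(train, x, k=3):
    """Regression: average target value of the k nearest examples."""
    targets = [t for _, t in nearest(train, x, k)]
    return sum(targets) / len(targets)


# Hypothetical data: (number of suspicious words, label) for emails.
emails = [(0, "ham"), (1, "ham"), (2, "ham"),
          (7, "spam"), (8, "spam"), (9, "spam")]

# Hypothetical data: (floor area, price) for houses.
houses = [(50, 100.0), (60, 120.0), (70, 140.0), (80, 160.0)]

print(classify(emails, 6))       # a category
print(regress(houses, 65, k=2))  # a number
```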
+ Tools & Modules
Implementation:
The selected topic (the option chosen after discussing at least 3 alternatives during the
brainstorming stage)
Description of the topic and its benefits or advantages (what, why, how, and possible expected
outcomes)
Further justification for this topic, based on a brief literature review: is it a worthwhile topic?
If someone has already done it and you wish to improve on it, what improvement are you proposing?
Otherwise, why is such a project still needed?
References
Please consult your Professor as appropriate while you complete this task.
UK's Office for National Statistics: datasets published by the UK's Office for National Statistics
UK's Met Office Data: climate station records from the UK's National Weather Service