SlideShare a Scribd company logo
1 of 34
Download to read offline
Machine Learning for Language Technology 2015
http://stp.lingfil.uu.se/~santinim/ml/2015/ml4lt_2015.htm
What is Machine Learning?
Marina Santini
santinim@stp.lingfil.uu.se
Department of Linguistics and Philology
Uppsala University, Uppsala, Sweden
Autumn 2015
Acknowledgements
ā€¢ Thanks to Hal Daumeā€™ III, Andrew Y. Ng, Arthur
Samuel, Tom Mitchell, Ethem Alpaydin,
Michael Jordan, Michael Collins, Joakim Nivre,
Pedro Domingo, wikipedia, the web.
Lecture 1: What is Machine Learning? 2
What is Machine Learning?
http://www.umiacs.umd.edu/~hal/ciml/
ā€¢ Machine learning is the study of computer
systems that learn from data and experience.
ā€¢ It is applied in an incredibly wide variety of
application areas, from medicine to
advertising, from military to pedestrian.
ā€¢ Any area in which you need to make sense of
data is a potential customer of machine
learning.
[Hal Daumeā€™ III]
Lecture 1: What is Machine Learning? 3
Example: Rule-based systems
ā€¢ Parts-Of-Speech Tagger (Brillā€™s tagger)
ā€¢ tag1--> tag2 IF Condition
the Condition tests the preceding and/or following word tokens, or their tags
(the notation for such rules differs between implementations). For example,
in Brill's notation:
ā€¢ IN NN WDPREVTAG DT while
change the tag of a word from IN (preposition) to NN (common noun), if the
preceding word's tag is DT (determiner) and the word itself is "while". This
covers cases like "all the while" or "in a while", where "while" should be
tagged as a noun rather than its more common use as a preposition (many
rules are more general).
Lecture 1: What is Machine Learning? 4
Example: Machine Learning-Based Systems
(From: The Apache OpenNLP library, a machine learning based toolkit for the
processing of natural language text: https://opennlp.apache.org/ )
ā€¢ We do not have rules , but a training corpus/dataset: The POS Tagger can be
trained on annotated training material. The training material is a collection of
tokenized sentences where each token has the assigned part-of-speech tag. The
training material may look like this:
About_IN
10_CD
Euro_NNP
,_,
I_PRP
reckon_VBP
That_DT
sounds_VBZ
good_JJ
ā€¢ With this annotated material we train a (mathematical/statistical) model.
ā€¢ Then we evaluate the results: how well does this model perform? The accuracy can
be measured on a test dataset or via cross validation.
Lecture 1: What is Machine Learning? 5
Generally speaking: Deduction vs Induction
ā€¢ Deductive reasoning works from the more general to the more
specific. Sometimes this is informally called a "top-down" approach.
We might begin with thinking up a theory about our topic of
interest. We then narrow that down into more specific hypotheses
that we can test. This ultimately leads us to be able to test the
hypotheses with specific data -- a confirmation (or not) of our
original theories.
ā€¢ Inductive reasoning works the other way, moving from specific
observations to broader generalizations and theories. Informally,
we sometimes call this a "bottom up" approach. In inductive
reasoning, we begin with specific observations (the data) and
measures, begin to detect patterns and regularities, formulate
some tentative hypotheses that we can explore, and finally end up
developing a general model.
Lecture 1: What is Machine Learning? 6
Machine Learning is based on ...
ā€¢ Induction
ā€¢ Generalization from data
Lecture 1: What is Machine Learning? 7
In summary (by Hal Daumeā€™ III)
https://piazza.com/umd/fall2015/cmsc422/home
ā€¢ Machine learning is all about finding patterns in data.
The whole idea is to replace the "human writing code"
with a "human supplying data" and then let the system
figure out what it is that the person wants to do by
looking at the examples.
ā€¢ The most central concept in machine learning is
generalization: how to generalize beyond the examples
that have been provided at "training time" to new
examples that you see at "test time.ā€œ
ā€¢ A very large fraction of what we'll talk about has to do
with figuring out what generalization means.
Lecture 1: What is Machine Learning? 8
Why learning?
ā€¢ We do not know the exact method: speech
recognition, spam filters, robotics, etc.
ā€¢ The exact method is too expensive: statistical
physics, etc.
ā€¢ Task evolves over time...
ā€¢ There is no need to use machine learning for
computing a payroll: for this task we just need
an algorithm!
Lecture 1: What is Machine Learning? 9
Why is ML so fashionable?
ā€¢ Broad applicability:
ā€“ Finance, robotic, medicin, NLP, IT, MT etc.
ā€¢ Close connection between theory and practice
ā€¢ Open field, lots of room for new work.M
Lecture 1: What is Machine Learning? 10
Interdisciplinary Field
Machine learning is:
ā€¢ a subfield of computer science that evolved from the study
of pattern recognition and computational learning
theory in artificial intelligence.
ā€¢ Machine learning explores the study and construction of
algorithms that can LEARN from and make predictions on data.
ā€¢ Such algorithms operate by building a model from example inputs
in order to make data-driven predictions or decisions, rather than
following strictly static program instructions.
[wikipidia]
Lecture 1: What is Machine Learning? 11
Machine Learning & Statistics
ā€¢ Machine learning and statistics are closely
related fields.
ā€¢ According to Michael Jordan, the ideas of
machine learning, from methodological
principles to theoretical tools, have had a long
pre-history in statistics. He also suggested the
term data science as a placeholder to call the
overall field.
Lecture 1: What is Machine Learning? 12
Machine Learning and Data Mining
ā€¢ Machine Learning relates with the study, design
and development of models and algorithms that
give computers the capability to learn from data.
ā€¢ Data Mining can be defined as the process that
starting from apparently unstructured data tries
to extract knowledge and/or unknown interesting
patterns. During this process machine Learning
algorithms are used.
Lecture 1: What is Machine Learning? 13
Many Types of Learning
ā€¢ Supervised learning
ā€“ Supervised classification is the only focus of this
course
ā€¢ Unsupervised learning
ā€¢ Semi- and weakly supervised
ā€¢ Reinforcement learning
ā€¢ etc.
Lecture 1: What is Machine Learning? 14
In other words
ā€¢ Supervised learning: learning with a teacher
(ļƒ with class labels)
ā€¢ Unsupervised learning: learning without a
teacher (ļƒ  without class labels)
Lecture 1: What is Machine Learning? 15
Learning problems
ā€¢ Regression
ā€¢ Binary classification
ā€¢ Multiclass classification
ā€¢ Ranking
ā€¢ etc.
Lecture 1: What is Machine Learning? 16
Informal and Formal Definitions
ā€¢ There are plenty of definitions...
ā€¢ Informal: The field of study that gives computers the
ability to learn without being explicitly programmed
(Arthur Samuel, 1959)
ā€¢ Formal: A computer program is said to learn from
experience E, with respect to some task T, and some
performance measure P, if its performance on T as
measured by P improves with experience E (Tom
Mitchell, 1998).
Lecture 1: What is Machine Learning? 17
Example: Spam Filter (by Andrew Y. Ng)
1. Classifying emails as spam or not spam
2. Watching you label emails as spam or not spam.
3. The number (or fraction) of emails correctly classified as spam/not spam.
4. None of the aboveā€”this is not a machine learning problem.
Lecture 1: What is Machine Learning? 18
Spam Filter (by Andrew Y. Ng)
ā€¢ Answer:
Lecture 1: What is Machine Learning? 19
Classification: Questions
ā€¢ How would you write a program to
distinguish a picture of me from a picture of
someone else?
ā€¢ How would you write a program to determine
whether a sentence is grammatical or not?
ā€¢ How would you write a program to distinguish
cancerous cells from normal cells?
Lecture 1: What is Machine Learning? 20
Classification: Answers
ā€¢ How would you write a program to distinguish a picture of
me from a picture of someone else?
ā‡’ Provide examples pictures of me and pictures of other people
and let a classiļ¬er learn to distinguish the two.
ā€¢ How would you write a program to determine whether a
sentence is grammatical or not?
ā‡’ Provide examples of grammatical and ungrammatical
sentences and let a classiļ¬er learn to distinguish the two.
ā€¢ How would you write a program to distinguish cancerous
cells from normal cells?
ā‡’ Provide examples of cancerous and normal cells and let
a classiļ¬er learn to distinguish the two.
Lecture 1: What is Machine Learning? 21
Elements of Machine Learning
ā€¢ Generalization: how well a model performs on new data.
ā€¢ Data:
ā€“ Training data: specific examples to learn from.
ā€“ Test data: new specific examples to assess performance.
ā€¢ Models (theoretical assumptions)
ā€“ decision trees, naive bayes, perceptron, etc.
ā€¢ Algorithms:
ā€“ Learning altorighms that infer the model parameters from the data.
ā€“ Inference algorithms that infer prediction from a model.
Lecture 1: What is Machine Learning? 22
Generalization
ā€¢ Predicting the future based on the past
ā€¢ Our system needs to generalize beyond the
training data to some future data that it might
not have seen yet.
Lecture 1: What is Machine Learning? 23
Generalization: Overfitting & Underfitting
ā€¢ Overfitting occurs when the model fits the training data too well
and does not generalize so it performs badly on the test
data. Overfitting is often a result of an excessively complicated
model
ā€¢ Underfitting occurs when the model does not fit the data well
enough. Underfitting is often a result of an excessively simple
model.
ā€¢ Both overfitting and underfitting lead to poor predictions on new
data sets.
ā€¢ A learning model that overfits or underfits does not generalize well.
Lecture 1: What is Machine Learning? 24
Example: Letter vs non-letter classification
Training
set
Test set
Lecture 1: What is Machine Learning? 25
Data:
The iris
dataset
Three components:
1. Class label (aka ā€œlabelā€, denoted y)
2. Features (aka ā€œattributesā€)
3. Feature values (aka ā€œattribute valuesā€, denoted x) ā‡’ Features can be
binary, nominal or continuous
ā€¢ A labeled dataset is a collection of (x,y) pairs
Lecture 1: What is Machine Learning? 26
Task
ā€¢ Predict the class for this ā€testā€ example:
Sepal length ā€“ Sepal width ā€“ Petal length ā€“ Petal width - Type
5.2 3.7 1.7 0.3 ???
Lecture 1: What is Machine Learning? 27
Require us to
generalize from
the training data
Noise
ā€¢ Unexplained or random variation in the data
ā€¢ Anomaly
Lecture 1: What is Machine Learning? 28
Models
Models are theoretical assumptions
ā€“ decision trees, naive bayes, perceptron, etc.
Given:
ā€“ Domain X: descriptions
ā€“ Domain Y: predictions
ā€“ H: hypothesis space; the set of all possible hypotheses
ā€“ h: target hypothesis
ā€“ Idea: to extrapolate observed y's over all X.
ā€“ Hope: to predict well on future y's given x's.
ā€“ Require: there must be regularities to be found.
Lecture 1: What is Machine Learning? 29
Inductive bias
ā€¢ In ML, the inductive bias is the set of
theoretical assumptions that must be added
to the observed data to transform the
algorithm's outputs into logical deductions.
Lecture 1: What is Machine Learning? 30
Examples of inductive biases
ā€¢ Decision trees (ID3): Shorter trees are preferred
over larger trees. Trees that place high
information gain attributes close to the root are
preferred over those that do not.
ā€¢ Naive Bayes classifier: maximize conditional
independence.
ā€¢ Logistic regression: there exists a simple
boundary that splits one class from the other, and
the further you get from that boundary the more
confident you can be.
ā€¢ Perceptron: data must be linearly separable.
Lecture 1: What is Machine Learning? 31
Inductive Bias: definition
ā€¢ ā€œThe inductive bias of a learning algorithm is
the set of assumptions that the learner uses
to predict outputs given inputs that it has not
encounteredā€.
ā€“ Tom Mitchell, 1980
Lecture 1: What is Machine Learning? 32
Learning & Inference Algorithms
ā€¢ Traditionally, the goal of learning has been to find
a model for which prediction (i.e., inference)
accuracy is as high as possible.
ā€¢ More recently: find models for which prediction
(i.e., inference) is as efficient as possible.
ā€¢ In practical terms: recent interest in more
unconventional approaches to learning that
combine generalization accuracy with other
desiderata such as faster inference.
Lecture 3: Basic Concepts of ML 33
The End
Lecture 1: What is Machine Learning? 34

More Related Content

What's hot

Machine learning overview
Machine learning overviewMachine learning overview
Machine learning overviewprih_yah
Ā 
An introduction to Machine Learning
An introduction to Machine LearningAn introduction to Machine Learning
An introduction to Machine Learningbutest
Ā 
Machine Learning
Machine LearningMachine Learning
Machine LearningKumar P
Ā 
Basics of Machine Learning
Basics of Machine LearningBasics of Machine Learning
Basics of Machine Learningbutest
Ā 
Machine Learning
Machine LearningMachine Learning
Machine LearningVivek Garg
Ā 
Machine learning
Machine learningMachine learning
Machine learningSanjay krishne
Ā 
Types of Machine Learning
Types of Machine LearningTypes of Machine Learning
Types of Machine LearningSamra Shahzadi
Ā 
Machine Learning Basics
Machine Learning BasicsMachine Learning Basics
Machine Learning BasicsSuresh Arora
Ā 
Intro/Overview on Machine Learning Presentation
Intro/Overview on Machine Learning PresentationIntro/Overview on Machine Learning Presentation
Intro/Overview on Machine Learning PresentationAnkit Gupta
Ā 
Machine learning
Machine learningMachine learning
Machine learningTushar Nikam
Ā 
2.17Mb ppt
2.17Mb ppt2.17Mb ppt
2.17Mb pptbutest
Ā 
Introduction to-machine-learning
Introduction to-machine-learningIntroduction to-machine-learning
Introduction to-machine-learningBabu Priyavrat
Ā 
Machine learning
Machine learningMachine learning
Machine learningeonx_32
Ā 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningRahul Jain
Ā 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningKmPooja4
Ā 
Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learningKoundinya Desiraju
Ā 
Machine Learning presentation.
Machine Learning presentation.Machine Learning presentation.
Machine Learning presentation.butest
Ā 

What's hot (20)

Machine learning overview
Machine learning overviewMachine learning overview
Machine learning overview
Ā 
An introduction to Machine Learning
An introduction to Machine LearningAn introduction to Machine Learning
An introduction to Machine Learning
Ā 
Machine Learning
Machine LearningMachine Learning
Machine Learning
Ā 
Machine Learning
Machine LearningMachine Learning
Machine Learning
Ā 
Basics of Machine Learning
Basics of Machine LearningBasics of Machine Learning
Basics of Machine Learning
Ā 
Machine Learning
Machine LearningMachine Learning
Machine Learning
Ā 
Machine learning
Machine learningMachine learning
Machine learning
Ā 
Machine learning
Machine learningMachine learning
Machine learning
Ā 
Types of Machine Learning
Types of Machine LearningTypes of Machine Learning
Types of Machine Learning
Ā 
Machine learning
Machine learningMachine learning
Machine learning
Ā 
Machine Learning Basics
Machine Learning BasicsMachine Learning Basics
Machine Learning Basics
Ā 
Intro/Overview on Machine Learning Presentation
Intro/Overview on Machine Learning PresentationIntro/Overview on Machine Learning Presentation
Intro/Overview on Machine Learning Presentation
Ā 
Machine learning
Machine learningMachine learning
Machine learning
Ā 
2.17Mb ppt
2.17Mb ppt2.17Mb ppt
2.17Mb ppt
Ā 
Introduction to-machine-learning
Introduction to-machine-learningIntroduction to-machine-learning
Introduction to-machine-learning
Ā 
Machine learning
Machine learningMachine learning
Machine learning
Ā 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
Ā 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
Ā 
Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learning
Ā 
Machine Learning presentation.
Machine Learning presentation.Machine Learning presentation.
Machine Learning presentation.
Ā 

Viewers also liked

Data Science, Machine Learning and Neural Networks
Data Science, Machine Learning and Neural NetworksData Science, Machine Learning and Neural Networks
Data Science, Machine Learning and Neural NetworksBICA Labs
Ā 
Classification of learning profile based on categories of student preferences
Classification of learning profile based on categories of student preferencesClassification of learning profile based on categories of student preferences
Classification of learning profile based on categories of student preferencesLuciana Zaina
Ā 
Information processing in consumer behaviour
Information processing in consumer behaviourInformation processing in consumer behaviour
Information processing in consumer behaviourMadhu Verma
Ā 
Theory of reasoned action
Theory of reasoned actionTheory of reasoned action
Theory of reasoned actionpaulebuckley
Ā 
Five functions of effective management
Five functions of effective managementFive functions of effective management
Five functions of effective managementHealthcare87
Ā 
Active audience
Active audienceActive audience
Active audiencethemerch78
Ā 
Consumer behavior Values
Consumer behavior ValuesConsumer behavior Values
Consumer behavior ValuesMohamed Mousa
Ā 
Beliefs, attitudes, and behavior
Beliefs, attitudes, and behaviorBeliefs, attitudes, and behavior
Beliefs, attitudes, and behaviorDieu-Donne Bitoyi
Ā 
BB Chapter Nine : Learning and Memory
BB Chapter Nine : Learning and MemoryBB Chapter Nine : Learning and Memory
BB Chapter Nine : Learning and MemoryBBAdvisor
Ā 
Active and Passive audience theories
Active and Passive audience theoriesActive and Passive audience theories
Active and Passive audience theoriesCrispySharp
Ā 
Active and passive voice ppt
Active and passive voice pptActive and passive voice ppt
Active and passive voice pptidahisyam
Ā 
Cognitive Dissonance Theory
Cognitive Dissonance TheoryCognitive Dissonance Theory
Cognitive Dissonance TheoryArun Jacob
Ā 
Consumer Attitudes and Beliefs
Consumer Attitudes and BeliefsConsumer Attitudes and Beliefs
Consumer Attitudes and BeliefsAbhipsha Mishra
Ā 
Buying decision process in rural marketing
Buying decision process in rural marketingBuying decision process in rural marketing
Buying decision process in rural marketingkanikaguptauicet
Ā 
Consumer Behavior
Consumer BehaviorConsumer Behavior
Consumer Behaviornitindhingra26
Ā 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningLior Rokach
Ā 

Viewers also liked (20)

Data Science, Machine Learning and Neural Networks
Data Science, Machine Learning and Neural NetworksData Science, Machine Learning and Neural Networks
Data Science, Machine Learning and Neural Networks
Ā 
Machine Learning for Dummies
Machine Learning for DummiesMachine Learning for Dummies
Machine Learning for Dummies
Ā 
Classification of learning profile based on categories of student preferences
Classification of learning profile based on categories of student preferencesClassification of learning profile based on categories of student preferences
Classification of learning profile based on categories of student preferences
Ā 
Information processing in consumer behaviour
Information processing in consumer behaviourInformation processing in consumer behaviour
Information processing in consumer behaviour
Ā 
Theory of reasoned action
Theory of reasoned actionTheory of reasoned action
Theory of reasoned action
Ā 
Five functions of effective management
Five functions of effective managementFive functions of effective management
Five functions of effective management
Ā 
Fhs attitudes
Fhs attitudesFhs attitudes
Fhs attitudes
Ā 
Active audience
Active audienceActive audience
Active audience
Ā 
Consumer Learning
Consumer LearningConsumer Learning
Consumer Learning
Ā 
Consumer behavior Values
Consumer behavior ValuesConsumer behavior Values
Consumer behavior Values
Ā 
Beliefs, attitudes, and behavior
Beliefs, attitudes, and behaviorBeliefs, attitudes, and behavior
Beliefs, attitudes, and behavior
Ā 
BB Chapter Nine : Learning and Memory
BB Chapter Nine : Learning and MemoryBB Chapter Nine : Learning and Memory
BB Chapter Nine : Learning and Memory
Ā 
Active and Passive audience theories
Active and Passive audience theoriesActive and Passive audience theories
Active and Passive audience theories
Ā 
Active and passive voice ppt
Active and passive voice pptActive and passive voice ppt
Active and passive voice ppt
Ā 
Cognitive Dissonance Theory
Cognitive Dissonance TheoryCognitive Dissonance Theory
Cognitive Dissonance Theory
Ā 
Cognitive dissonance theory
Cognitive dissonance theoryCognitive dissonance theory
Cognitive dissonance theory
Ā 
Consumer Attitudes and Beliefs
Consumer Attitudes and BeliefsConsumer Attitudes and Beliefs
Consumer Attitudes and Beliefs
Ā 
Buying decision process in rural marketing
Buying decision process in rural marketingBuying decision process in rural marketing
Buying decision process in rural marketing
Ā 
Consumer Behavior
Consumer BehaviorConsumer Behavior
Consumer Behavior
Ā 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
Ā 

Similar to Lecture 1: What is Machine Learning?

Introduction AI ML& Mathematicals of ML.pdf
Introduction AI ML& Mathematicals of ML.pdfIntroduction AI ML& Mathematicals of ML.pdf
Introduction AI ML& Mathematicals of ML.pdfGandhiMathy6
Ā 
Machine Learning Basics - By Animesh Sinha
Machine Learning Basics - By Animesh Sinha Machine Learning Basics - By Animesh Sinha
Machine Learning Basics - By Animesh Sinha Animesh Sinha
Ā 
Machine Learning SPPU Unit 1
Machine Learning SPPU Unit 1Machine Learning SPPU Unit 1
Machine Learning SPPU Unit 1Amruta Aphale
Ā 
Machine Learning Landscape
Machine Learning LandscapeMachine Learning Landscape
Machine Learning LandscapeEng Teong Cheah
Ā 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceChristy Abraham Joy
Ā 
Training_Report_on_Machine_Learning.docx
Training_Report_on_Machine_Learning.docxTraining_Report_on_Machine_Learning.docx
Training_Report_on_Machine_Learning.docxShubhamBishnoi14
Ā 
ML.pptvdvdvdvdvdfvdfgvdsdgdsfgdfgdfgdfgdf
ML.pptvdvdvdvdvdfvdfgvdsdgdsfgdfgdfgdfgdfML.pptvdvdvdvdvdfvdfgvdsdgdsfgdfgdfgdfgdf
ML.pptvdvdvdvdvdfvdfgvdsdgdsfgdfgdfgdfgdfAvijitChaudhuri3
Ā 
Lec 6 learning
Lec 6 learningLec 6 learning
Lec 6 learningEyob Sisay
Ā 
CodeLess Machine Learning
CodeLess Machine LearningCodeLess Machine Learning
CodeLess Machine LearningSharjeel Imtiaz
Ā 
Machine_Learning.pptx
Machine_Learning.pptxMachine_Learning.pptx
Machine_Learning.pptxshubhamatak136
Ā 
Intro/Overview on Machine Learning Presentation -2
Intro/Overview on Machine Learning Presentation -2Intro/Overview on Machine Learning Presentation -2
Intro/Overview on Machine Learning Presentation -2Ankit Gupta
Ā 
Introduction to machine learning-2023-IT-AI and DS.pdf
Introduction to machine learning-2023-IT-AI and DS.pdfIntroduction to machine learning-2023-IT-AI and DS.pdf
Introduction to machine learning-2023-IT-AI and DS.pdfSisayNegash4
Ā 
Big data, big opportunities
Big data, big opportunitiesBig data, big opportunities
Big data, big opportunitiesChouaieb NEMRI
Ā 
chapter1-introduction1.ppt
chapter1-introduction1.pptchapter1-introduction1.ppt
chapter1-introduction1.pptSeshuSrinivas2
Ā 

Similar to Lecture 1: What is Machine Learning? (20)

Introduction AI ML& Mathematicals of ML.pdf
Introduction AI ML& Mathematicals of ML.pdfIntroduction AI ML& Mathematicals of ML.pdf
Introduction AI ML& Mathematicals of ML.pdf
Ā 
ML
MLML
ML
Ā 
Machine Learning Basics - By Animesh Sinha
Machine Learning Basics - By Animesh Sinha Machine Learning Basics - By Animesh Sinha
Machine Learning Basics - By Animesh Sinha
Ā 
Machine Learning SPPU Unit 1
Machine Learning SPPU Unit 1Machine Learning SPPU Unit 1
Machine Learning SPPU Unit 1
Ā 
Machine Learning Landscape
Machine Learning LandscapeMachine Learning Landscape
Machine Learning Landscape
Ā 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
Ā 
Training_Report_on_Machine_Learning.docx
Training_Report_on_Machine_Learning.docxTraining_Report_on_Machine_Learning.docx
Training_Report_on_Machine_Learning.docx
Ā 
ML.ppt
ML.pptML.ppt
ML.ppt
Ā 
ML.ppt
ML.pptML.ppt
ML.ppt
Ā 
ML.ppt
ML.pptML.ppt
ML.ppt
Ā 
ML.ppt
ML.pptML.ppt
ML.ppt
Ā 
ML.pptvdvdvdvdvdfvdfgvdsdgdsfgdfgdfgdfgdf
ML.pptvdvdvdvdvdfvdfgvdsdgdsfgdfgdfgdfgdfML.pptvdvdvdvdvdfvdfgvdsdgdsfgdfgdfgdfgdf
ML.pptvdvdvdvdvdfvdfgvdsdgdsfgdfgdfgdfgdf
Ā 
ML.ppt
ML.pptML.ppt
ML.ppt
Ā 
Lec 6 learning
Lec 6 learningLec 6 learning
Lec 6 learning
Ā 
CodeLess Machine Learning
CodeLess Machine LearningCodeLess Machine Learning
CodeLess Machine Learning
Ā 
Machine_Learning.pptx
Machine_Learning.pptxMachine_Learning.pptx
Machine_Learning.pptx
Ā 
Intro/Overview on Machine Learning Presentation -2
Intro/Overview on Machine Learning Presentation -2Intro/Overview on Machine Learning Presentation -2
Intro/Overview on Machine Learning Presentation -2
Ā 
Introduction to machine learning-2023-IT-AI and DS.pdf
Introduction to machine learning-2023-IT-AI and DS.pdfIntroduction to machine learning-2023-IT-AI and DS.pdf
Introduction to machine learning-2023-IT-AI and DS.pdf
Ā 
Big data, big opportunities
Big data, big opportunitiesBig data, big opportunities
Big data, big opportunities
Ā 
chapter1-introduction1.ppt
chapter1-introduction1.pptchapter1-introduction1.ppt
chapter1-introduction1.ppt
Ā 

More from Marina Santini

Can We Quantify Domainhood? Exploring Measures to Assess Domain-Specificity i...
Can We Quantify Domainhood? Exploring Measures to Assess Domain-Specificity i...Can We Quantify Domainhood? Exploring Measures to Assess Domain-Specificity i...
Can We Quantify Domainhood? Exploring Measures to Assess Domain-Specificity i...Marina Santini
Ā 
Towards a Quality Assessment of Web Corpora for Language Technology Applications
Towards a Quality Assessment of Web Corpora for Language Technology ApplicationsTowards a Quality Assessment of Web Corpora for Language Technology Applications
Towards a Quality Assessment of Web Corpora for Language Technology ApplicationsMarina Santini
Ā 
A Web Corpus for eCare: Collection, Lay Annotation and Learning -First Results-
A Web Corpus for eCare: Collection, Lay Annotation and Learning -First Results-A Web Corpus for eCare: Collection, Lay Annotation and Learning -First Results-
A Web Corpus for eCare: Collection, Lay Annotation and Learning -First Results-Marina Santini
Ā 
An Exploratory Study on Genre Classification using Readability Features
An Exploratory Study on Genre Classification using Readability FeaturesAn Exploratory Study on Genre Classification using Readability Features
An Exploratory Study on Genre Classification using Readability FeaturesMarina Santini
Ā 
Lecture: Semantic Word Clouds
Lecture: Semantic Word CloudsLecture: Semantic Word Clouds
Lecture: Semantic Word CloudsMarina Santini
Ā 
Lecture: Ontologies and the Semantic Web
Lecture: Ontologies and the Semantic WebLecture: Ontologies and the Semantic Web
Lecture: Ontologies and the Semantic WebMarina Santini
Ā 
Lecture: Summarization
Lecture: SummarizationLecture: Summarization
Lecture: SummarizationMarina Santini
Ā 
Relation Extraction
Relation ExtractionRelation Extraction
Relation ExtractionMarina Santini
Ā 
Lecture: Question Answering
Lecture: Question AnsweringLecture: Question Answering
Lecture: Question AnsweringMarina Santini
Ā 
IE: Named Entity Recognition (NER)
IE: Named Entity Recognition (NER)IE: Named Entity Recognition (NER)
IE: Named Entity Recognition (NER)Marina Santini
Ā 
Lecture: Vector Semantics (aka Distributional Semantics)
Lecture: Vector Semantics (aka Distributional Semantics)Lecture: Vector Semantics (aka Distributional Semantics)
Lecture: Vector Semantics (aka Distributional Semantics)Marina Santini
Ā 
Lecture: Word Sense Disambiguation
Lecture: Word Sense DisambiguationLecture: Word Sense Disambiguation
Lecture: Word Sense DisambiguationMarina Santini
Ā 
Lecture: Word Senses
Lecture: Word SensesLecture: Word Senses
Lecture: Word SensesMarina Santini
Ā 
Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment AnalysisMarina Santini
Ā 
Semantic Role Labeling
Semantic Role LabelingSemantic Role Labeling
Semantic Role LabelingMarina Santini
Ā 
Semantics and Computational Semantics
Semantics and Computational SemanticsSemantics and Computational Semantics
Semantics and Computational SemanticsMarina Santini
Ā 
Lecture 9: Machine Learning in Practice (2)
Lecture 9: Machine Learning in Practice (2)Lecture 9: Machine Learning in Practice (2)
Lecture 9: Machine Learning in Practice (2)Marina Santini
Ā 
Lecture 8: Machine Learning in Practice (1)
Lecture 8: Machine Learning in Practice (1) Lecture 8: Machine Learning in Practice (1)
Lecture 8: Machine Learning in Practice (1) Marina Santini
Ā 
Lecture 5: Interval Estimation
Lecture 5: Interval Estimation Lecture 5: Interval Estimation
Lecture 5: Interval Estimation Marina Santini
Ā 
Lecture 4 Decision Trees (2): Entropy, Information Gain, Gain Ratio
Lecture 4 Decision Trees (2): Entropy, Information Gain, Gain RatioLecture 4 Decision Trees (2): Entropy, Information Gain, Gain Ratio
Lecture 4 Decision Trees (2): Entropy, Information Gain, Gain RatioMarina Santini
Ā 

More from Marina Santini (20)

Can We Quantify Domainhood? Exploring Measures to Assess Domain-Specificity i...
Can We Quantify Domainhood? Exploring Measures to Assess Domain-Specificity i...Can We Quantify Domainhood? Exploring Measures to Assess Domain-Specificity i...
Can We Quantify Domainhood? Exploring Measures to Assess Domain-Specificity i...
Ā 
Towards a Quality Assessment of Web Corpora for Language Technology Applications
Towards a Quality Assessment of Web Corpora for Language Technology ApplicationsTowards a Quality Assessment of Web Corpora for Language Technology Applications
Towards a Quality Assessment of Web Corpora for Language Technology Applications
Ā 
A Web Corpus for eCare: Collection, Lay Annotation and Learning -First Results-
A Web Corpus for eCare: Collection, Lay Annotation and Learning -First Results-A Web Corpus for eCare: Collection, Lay Annotation and Learning -First Results-
A Web Corpus for eCare: Collection, Lay Annotation and Learning -First Results-
Ā 
An Exploratory Study on Genre Classification using Readability Features
An Exploratory Study on Genre Classification using Readability FeaturesAn Exploratory Study on Genre Classification using Readability Features
An Exploratory Study on Genre Classification using Readability Features
Ā 
Lecture: Semantic Word Clouds
Lecture: Semantic Word CloudsLecture: Semantic Word Clouds
Lecture: Semantic Word Clouds
Ā 
Lecture: Ontologies and the Semantic Web
Lecture: Ontologies and the Semantic WebLecture: Ontologies and the Semantic Web
Lecture: Ontologies and the Semantic Web
Ā 
Lecture: Summarization
Lecture: SummarizationLecture: Summarization
Lecture: Summarization
Ā 
Relation Extraction
Relation ExtractionRelation Extraction
Relation Extraction
Ā 
Lecture: Question Answering
Lecture: Question AnsweringLecture: Question Answering
Lecture: Question Answering
Ā 
IE: Named Entity Recognition (NER)
IE: Named Entity Recognition (NER)IE: Named Entity Recognition (NER)
IE: Named Entity Recognition (NER)
Ā 
Lecture: Vector Semantics (aka Distributional Semantics)
Lecture: Vector Semantics (aka Distributional Semantics)Lecture: Vector Semantics (aka Distributional Semantics)
Lecture: Vector Semantics (aka Distributional Semantics)
Ā 
Lecture: Word Sense Disambiguation
Lecture: Word Sense DisambiguationLecture: Word Sense Disambiguation
Lecture: Word Sense Disambiguation
Ā 
Lecture: Word Senses
Lecture: Word SensesLecture: Word Senses
Lecture: Word Senses
Ā 
Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment Analysis
Ā 
Semantic Role Labeling
Semantic Role LabelingSemantic Role Labeling
Semantic Role Labeling
Ā 
Semantics and Computational Semantics
Semantics and Computational SemanticsSemantics and Computational Semantics
Semantics and Computational Semantics
Ā 
Lecture 9: Machine Learning in Practice (2)
Lecture 9: Machine Learning in Practice (2)Lecture 9: Machine Learning in Practice (2)
Lecture 9: Machine Learning in Practice (2)
Ā 
Lecture 8: Machine Learning in Practice (1)
Lecture 8: Machine Learning in Practice (1) Lecture 8: Machine Learning in Practice (1)
Lecture 8: Machine Learning in Practice (1)
Ā 
Lecture 5: Interval Estimation
Lecture 5: Interval Estimation Lecture 5: Interval Estimation
Lecture 5: Interval Estimation
Ā 
Lecture 4 Decision Trees (2): Entropy, Information Gain, Gain Ratio
Lecture 4 Decision Trees (2): Entropy, Information Gain, Gain RatioLecture 4 Decision Trees (2): Entropy, Information Gain, Gain Ratio
Lecture 4 Decision Trees (2): Entropy, Information Gain, Gain Ratio
Ā 

Recently uploaded

Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
Ā 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docxPoojaSen20
Ā 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
Ā 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
Ā 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
Ā 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
Ā 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
Ā 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
Ā 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
Ā 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
Ā 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
Ā 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
Ā 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
Ā 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
Ā 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
Ā 
ā€œOh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
ā€œOh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...ā€œOh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
ā€œOh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
Ā 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991RKavithamani
Ā 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
Ā 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
Ā 

Recently uploaded (20)

Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
Ā 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docx
Ā 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
Ā 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
Ā 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
Ā 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
Ā 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
Ā 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
Ā 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
Ā 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
Ā 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
Ā 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
Ā 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
Ā 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
Ā 
Model Call Girl in Bikash Puri Delhi reach out to us at šŸ”9953056974šŸ”
Model Call Girl in Bikash Puri  Delhi reach out to us at šŸ”9953056974šŸ”Model Call Girl in Bikash Puri  Delhi reach out to us at šŸ”9953056974šŸ”
Model Call Girl in Bikash Puri Delhi reach out to us at šŸ”9953056974šŸ”
Ā 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
Ā 
ā€œOh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
ā€œOh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...ā€œOh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
ā€œOh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
Ā 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Ā 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Ā 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
Ā 

Lecture 1: What is Machine Learning?

  • 1. Machine Learning for Language Technology 2015 http://stp.lingfil.uu.se/~santinim/ml/2015/ml4lt_2015.htm What is Machine Learning? Marina Santini santinim@stp.lingfil.uu.se Department of Linguistics and Philology Uppsala University, Uppsala, Sweden Autumn 2015
  • 2. Acknowledgements ā€¢ Thanks to Hal Daumeā€™ III, Andrew Y. Ng, Arthur Samuel, Tom Mitchell, Ethem Alpaydin, Michael Jordan, Michael Collins, Joakim Nivre, Pedro Domingo, wikipedia, the web. Lecture 1: What is Machine Learning? 2
  • 3. What is Machine Learning? http://www.umiacs.umd.edu/~hal/ciml/ ā€¢ Machine learning is the study of computer systems that learn from data and experience. ā€¢ It is applied in an incredibly wide variety of application areas, from medicine to advertising, from military to pedestrian. ā€¢ Any area in which you need to make sense of data is a potential customer of machine learning. [Hal Daumeā€™ III] Lecture 1: What is Machine Learning? 3
  • 4. Example: Rule-based systems ā€¢ Parts-Of-Speech Tagger (Brillā€™s tagger) ā€¢ tag1--> tag2 IF Condition the Condition tests the preceding and/or following word tokens, or their tags (the notation for such rules differs between implementations). For example, in Brill's notation: ā€¢ IN NN WDPREVTAG DT while change the tag of a word from IN (preposition) to NN (common noun), if the preceding word's tag is DT (determiner) and the word itself is "while". This covers cases like "all the while" or "in a while", where "while" should be tagged as a noun rather than its more common use as a preposition (many rules are more general). Lecture 1: What is Machine Learning? 4
  • 5. Example: Machine Learning-Based Systems (From: The Apache OpenNLP library, a machine learning based toolkit for the processing of natural language text: https://opennlp.apache.org/ ) ā€¢ We do not have rules , but a training corpus/dataset: The POS Tagger can be trained on annotated training material. The training material is a collection of tokenized sentences where each token has the assigned part-of-speech tag. The training material may look like this: About_IN 10_CD Euro_NNP ,_, I_PRP reckon_VBP That_DT sounds_VBZ good_JJ ā€¢ With this annotated material we train a (mathematical/statistical) model. ā€¢ Then we evaluate the results: how well does this model perform? The accuracy can be measured on a test dataset or via cross validation. Lecture 1: What is Machine Learning? 5
  • 6. Generally speaking: Deduction vs Induction ā€¢ Deductive reasoning works from the more general to the more specific. Sometimes this is informally called a "top-down" approach. We might begin with thinking up a theory about our topic of interest. We then narrow that down into more specific hypotheses that we can test. This ultimately leads us to be able to test the hypotheses with specific data -- a confirmation (or not) of our original theories. ā€¢ Inductive reasoning works the other way, moving from specific observations to broader generalizations and theories. Informally, we sometimes call this a "bottom up" approach. In inductive reasoning, we begin with specific observations (the data) and measures, begin to detect patterns and regularities, formulate some tentative hypotheses that we can explore, and finally end up developing a general model. Lecture 1: What is Machine Learning? 6
  • 7. Machine Learning is based on ... ā€¢ Induction ā€¢ Generalization from data Lecture 1: What is Machine Learning? 7
  • 8. In summary (by Hal Daumeā€™ III) https://piazza.com/umd/fall2015/cmsc422/home ā€¢ Machine learning is all about finding patterns in data. The whole idea is to replace the "human writing code" with a "human supplying data" and then let the system figure out what it is that the person wants to do by looking at the examples. ā€¢ The most central concept in machine learning is generalization: how to generalize beyond the examples that have been provided at "training time" to new examples that you see at "test time.ā€œ ā€¢ A very large fraction of what we'll talk about has to do with figuring out what generalization means. Lecture 1: What is Machine Learning? 8
  • 9. Why learning? ā€¢ We do not know the exact method: speech recognition, spam filters, robotics, etc. ā€¢ The exact method is too expensive: statistical physics, etc. ā€¢ Task evolves over time... ā€¢ There is no need to use machine learning for computing a payroll: for this task we just need an algorithm! Lecture 1: What is Machine Learning? 9
  • 10. Why is ML so fashionable? ā€¢ Broad applicability: ā€“ Finance, robotic, medicin, NLP, IT, MT etc. ā€¢ Close connection between theory and practice ā€¢ Open field, lots of room for new work.M Lecture 1: What is Machine Learning? 10
  • 11. Interdisciplinary Field Machine learning is: ā€¢ a subfield of computer science that evolved from the study of pattern recognition and computational learning theory in artificial intelligence. ā€¢ Machine learning explores the study and construction of algorithms that can LEARN from and make predictions on data. ā€¢ Such algorithms operate by building a model from example inputs in order to make data-driven predictions or decisions, rather than following strictly static program instructions. [wikipidia] Lecture 1: What is Machine Learning? 11
  • 12. Machine Learning & Statistics ā€¢ Machine learning and statistics are closely related fields. ā€¢ According to Michael Jordan, the ideas of machine learning, from methodological principles to theoretical tools, have had a long pre-history in statistics. He also suggested the term data science as a placeholder to call the overall field. Lecture 1: What is Machine Learning? 12
  • 13. Machine Learning and Data Mining ā€¢ Machine Learning relates with the study, design and development of models and algorithms that give computers the capability to learn from data. ā€¢ Data Mining can be defined as the process that starting from apparently unstructured data tries to extract knowledge and/or unknown interesting patterns. During this process machine Learning algorithms are used. Lecture 1: What is Machine Learning? 13
  • 14. Many Types of Learning ā€¢ Supervised learning ā€“ Supervised classification is the only focus of this course ā€¢ Unsupervised learning ā€¢ Semi- and weakly supervised ā€¢ Reinforcement learning ā€¢ etc. Lecture 1: What is Machine Learning? 14
  • 15. In other words ā€¢ Supervised learning: learning with a teacher (ļƒ with class labels) ā€¢ Unsupervised learning: learning without a teacher (ļƒ  without class labels) Lecture 1: What is Machine Learning? 15
  • 16. Learning problems ā€¢ Regression ā€¢ Binary classification ā€¢ Multiclass classification ā€¢ Ranking ā€¢ etc. Lecture 1: What is Machine Learning? 16
  • 17. Informal and Formal Definitions ā€¢ There are plenty of definitions... ā€¢ Informal: The field of study that gives computers the ability to learn without being explicitly programmed (Arthur Samuel, 1959) ā€¢ Formal: A computer program is said to learn from experience E, with respect to some task T, and some performance measure P, if its performance on T as measured by P improves with experience E (Tom Mitchell, 1998). Lecture 1: What is Machine Learning? 17
  • 18. Example: Spam Filter (by Andrew Y. Ng) 1. Classifying emails as spam or not spam 2. Watching you label emails as spam or not spam. 3. The number (or fraction) of emails correctly classified as spam/not spam. 4. None of the aboveā€”this is not a machine learning problem. Lecture 1: What is Machine Learning? 18
  • 19. Spam Filter (by Andrew Y. Ng) ā€¢ Answer: Lecture 1: What is Machine Learning? 19
  • 20. Classification: Questions ā€¢ How would you write a program to distinguish a picture of me from a picture of someone else? ā€¢ How would you write a program to determine whether a sentence is grammatical or not? ā€¢ How would you write a program to distinguish cancerous cells from normal cells? Lecture 1: What is Machine Learning? 20
  • 21. Classification: Answers ā€¢ How would you write a program to distinguish a picture of me from a picture of someone else? ā‡’ Provide examples pictures of me and pictures of other people and let a classiļ¬er learn to distinguish the two. ā€¢ How would you write a program to determine whether a sentence is grammatical or not? ā‡’ Provide examples of grammatical and ungrammatical sentences and let a classiļ¬er learn to distinguish the two. ā€¢ How would you write a program to distinguish cancerous cells from normal cells? ā‡’ Provide examples of cancerous and normal cells and let a classiļ¬er learn to distinguish the two. Lecture 1: What is Machine Learning? 21
  • 22. Elements of Machine Learning ā€¢ Generalization: how well a model performs on new data. ā€¢ Data: ā€“ Training data: specific examples to learn from. ā€“ Test data: new specific examples to assess performance. ā€¢ Models (theoretical assumptions) ā€“ decision trees, naive bayes, perceptron, etc. ā€¢ Algorithms: ā€“ Learning altorighms that infer the model parameters from the data. ā€“ Inference algorithms that infer prediction from a model. Lecture 1: What is Machine Learning? 22
  • 23. Generalization ā€¢ Predicting the future based on the past ā€¢ Our system needs to generalize beyond the training data to some future data that it might not have seen yet. Lecture 1: What is Machine Learning? 23
  • 24. Generalization: Overfitting & Underfitting ā€¢ Overfitting occurs when the model fits the training data too well and does not generalize so it performs badly on the test data. Overfitting is often a result of an excessively complicated model ā€¢ Underfitting occurs when the model does not fit the data well enough. Underfitting is often a result of an excessively simple model. ā€¢ Both overfitting and underfitting lead to poor predictions on new data sets. ā€¢ A learning model that overfits or underfits does not generalize well. Lecture 1: What is Machine Learning? 24
  • 25. Example: Letter vs non-letter classification Training set Test set Lecture 1: What is Machine Learning? 25
  • 26. Data: The iris dataset Three components: 1. Class label (aka ā€œlabelā€, denoted y) 2. Features (aka ā€œattributesā€) 3. Feature values (aka ā€œattribute valuesā€, denoted x) ā‡’ Features can be binary, nominal or continuous ā€¢ A labeled dataset is a collection of (x,y) pairs Lecture 1: What is Machine Learning? 26
  • 27. Task ā€¢ Predict the class for this ā€testā€ example: Sepal length ā€“ Sepal width ā€“ Petal length ā€“ Petal width - Type 5.2 3.7 1.7 0.3 ??? Lecture 1: What is Machine Learning? 27 Require us to generalize from the training data
  • 28. Noise ā€¢ Unexplained or random variation in the data ā€¢ Anomaly Lecture 1: What is Machine Learning? 28
  • 29. Models Models are theoretical assumptions ā€“ decision trees, naive bayes, perceptron, etc. Given: ā€“ Domain X: descriptions ā€“ Domain Y: predictions ā€“ H: hypothesis space; the set of all possible hypotheses ā€“ h: target hypothesis ā€“ Idea: to extrapolate observed y's over all X. ā€“ Hope: to predict well on future y's given x's. ā€“ Require: there must be regularities to be found. Lecture 1: What is Machine Learning? 29
  • 30. Inductive bias ā€¢ In ML, the inductive bias is the set of theoretical assumptions that must be added to the observed data to transform the algorithm's outputs into logical deductions. Lecture 1: What is Machine Learning? 30
  • 31. Examples of inductive biases ā€¢ Decision trees (ID3): Shorter trees are preferred over larger trees. Trees that place high information gain attributes close to the root are preferred over those that do not. ā€¢ Naive Bayes classifier: maximize conditional independence. ā€¢ Logistic regression: there exists a simple boundary that splits one class from the other, and the further you get from that boundary the more confident you can be. ā€¢ Perceptron: data must be linearly separable. Lecture 1: What is Machine Learning? 31
  • 32. Inductive Bias: definition ā€¢ ā€œThe inductive bias of a learning algorithm is the set of assumptions that the learner uses to predict outputs given inputs that it has not encounteredā€. ā€“ Tom Mitchell, 1980 Lecture 1: What is Machine Learning? 32
  • 33. Learning & Inference Algorithms ā€¢ Traditionally, the goal of learning has been to find a model for which prediction (i.e., inference) accuracy is as high as possible. ā€¢ More recently: find models for which prediction (i.e., inference) is as efficient as possible. ā€¢ In practical terms: recent interest in more unconventional approaches to learning that combine generalization accuracy with other desiderata such as faster inference. Lecture 3: Basic Concepts of ML 33
  • 34. The End Lecture 1: What is Machine Learning? 34