Self Taught Learning

•Download as PPTX, PDF•

2 likes•2,166 views

Shao-Chuan Wang

ICML 2007 paper report

Education

Self-taught learning: Transfer learning from unlabelled data Authors: Rajat Raina et al. @Stanford University Presenter: Shao-Chuan Wang Some slides are from author’s work

Self-taught learning: Transfer learning from unlabelled data Objective: Use the unlabelled data to improve the performance on a classification task. Kea Ideas: Relax the assumption about the unlabelled data. Use unlabelled data to learn the best representation (dictionary)

Machine Learning Schemes ,[object Object]

Next: Self-taught learning?Cars Motorcycles Cars Motorcycles Cars Motorcycles Bus Tractor Aircraft Helicopter Motorcycle Car Natural scenes

Self-taught Learning Labeled examples: Unlabeled examples: The unlabeled and labeled data: Need not share labels y. Need not share a generative distribution. Advantage: Such unlabeled data is often easy to obtain.

Self-taught learning Use unlabelled data to learn the best representation.

Learning the structure Sparse coding: learning the dictionary Encoding (L1-regularized least square problem) Least angle regression (Efron et al. 2004) (very fast!!) Feature-sign search (Honglak Lee et al. 2006) Coordinate descent (Friedman et al. 2008) Update dictionary(L2-constrained least square problem) K-SVD (Aharon et al. 2006) Online dictionary learning (Mairal et al. 2009) Reconstruction error Sparsity penalty

Self-taught learning: flow K-SVD, online dictionary learning… Learning algorithm Unlabelled data Dictionary Encoded Training features Encoding SVM with Fisher kernel Training data Encoded Testing features Testing data The Lasso Predicted Labels

SVM with Fisher kernel Fisher kernel In Bayesian view, Laplace prior

Example atoms Natural images. “edges” “strokes” Handwritten characters. Sparse representation gives a higher level representation

Result: Character recognition Digits Handwritten English English font Handwritten English classification (20 labeled images per handwritten character) Bases learnt on digits English font classification (20 labeled images per font character) Bases learnt on handwritten English 8.2% error reduction 2.8% error reduction

Similar to Self Taught Learning

Final Presentation V3weichen

Query Linguistic Intent Detectionbutest

Adaptive membership functions for hand written character recognition by voron...JPINFOTECH JAYAPRAKASH

OntologySudarsun Santhiappan

Weakly Supervised Machine ReadingIsabelle Augenstein

AP Computer Science Test Prep - Part 2 - Object Oriented ProgrammingNR Computer Learning Center

课堂讲义(最后更新:2009-9-25)butest

NL based Object Oriented modeling - EJSR 35(1)IT Industry

Introduction to Pythontswr

sahuPPT.pptxKartikDutta10

CVPR2009: Object Detection Using a Max-Margin Hough Transformzukun

Similar to Self Taught Learning (11)

Final Presentation V3

Query Linguistic Intent Detection

Adaptive membership functions for hand written character recognition by voron...

Ontology

Weakly Supervised Machine Reading

AP Computer Science Test Prep - Part 2 - Object Oriented Programming

课堂讲义(最后更新:2009-9-25)

NL based Object Oriented modeling - EJSR 35(1)

Introduction to Python

sahuPPT.pptx

CVPR2009: Object Detection Using a Max-Margin Hough Transform

Recently uploaded

Influencing policy (training slides from Fast Track Impact)Mark Reed

Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfTechSoup

Judging the Relevance and worth of ideas part 2.pptxSherlyMaeNeri

YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxConquiztadors- the Quiz Society of Sri Venkateswara College

ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1

Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Celine George

Field Attribute Index Feature in Odoo 17Celine George

How to Add Barcode on PDF Report in Odoo 17Celine George

ANG SEKTOR NG agrikultura.pptx QUARTER 4MiaBumagat1

MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma

Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfErwinPantujan2

HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...Nguyen Thanh Tu Collection

Keynote by Prof. Wurzer at Nordex about IP-designMIPLM

Barangay Council for the Protection of Children (BCPC) Orientation.pptxCarlos105

Concurrency Control in Database Management systemChristalin Nelson

ACC 2024 Chronicles. Cardiology. Exam.pdfSpandanaRallapalli

Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝9953056974 Low Rate Call Girls In Saket, Delhi NCR

Transaction Management in Database Management SystemChristalin Nelson

Student Profile Sample - We help schools to connect the data they have, with ...Seán Kennedy

FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptxConquiztadors- the Quiz Society of Sri Venkateswara College

Recently uploaded (20)

Influencing policy (training slides from Fast Track Impact)

Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf

Judging the Relevance and worth of ideas part 2.pptx

YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx

ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...

Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17

Field Attribute Index Feature in Odoo 17

How to Add Barcode on PDF Report in Odoo 17

ANG SEKTOR NG agrikultura.pptx QUARTER 4

MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx

Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf

HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...

Keynote by Prof. Wurzer at Nordex about IP-design

Barangay Council for the Protection of Children (BCPC) Orientation.pptx

Concurrency Control in Database Management system

ACC 2024 Chronicles. Cardiology. Exam.pdf

Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝

Transaction Management in Database Management System

Student Profile Sample - We help schools to connect the data they have, with ...

FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx

Self Taught Learning

1. Self-taught learning: Transfer learning from unlabelled data Authors: Rajat Raina et al. @Stanford University Presenter: Shao-Chuan Wang Some slides are from author’s work

2. Self-taught learning: Transfer learning from unlabelled data Objective: Use the unlabelled data to improve the performance on a classification task. Kea Ideas: Relax the assumption about the unlabelled data. Use unlabelled data to learn the best representation (dictionary)

4. Semi-supervised learning.

5. Transfer learning.

6. Next: Self-taught learning?Cars Motorcycles Cars Motorcycles Cars Motorcycles Bus Tractor Aircraft Helicopter Motorcycle Car Natural scenes

7. Self-taught Learning Labeled examples: Unlabeled examples: The unlabeled and labeled data: Need not share labels y. Need not share a generative distribution. Advantage: Such unlabeled data is often easy to obtain.

8. Self-taught learning Use unlabelled data to learn the best representation.

9. Learning the structure Sparse coding: learning the dictionary Encoding (L1-regularized least square problem) Least angle regression (Efron et al. 2004) (very fast!!) Feature-sign search (Honglak Lee et al. 2006) Coordinate descent (Friedman et al. 2008) Update dictionary(L2-constrained least square problem) K-SVD (Aharon et al. 2006) Online dictionary learning (Mairal et al. 2009) Reconstruction error Sparsity penalty

10. Self-taught learning: flow K-SVD, online dictionary learning… Learning algorithm Unlabelled data Dictionary Encoded Training features Encoding SVM with Fisher kernel Training data Encoded Testing features Testing data The Lasso Predicted Labels

11. SVM with Fisher kernel Fisher kernel In Bayesian view, Laplace prior

12. Example atoms Natural images. “edges” “strokes” Handwritten characters. Sparse representation gives a higher level representation

13. Result: Character recognition Digits Handwritten English English font Handwritten English classification (20 labeled images per handwritten character) Bases learnt on digits English font classification (20 labeled images per font character) Bases learnt on handwritten English 8.2% error reduction 2.8% error reduction

14. Image classification Other reported results: Fei-Fei et al, 2004: 16% Berg et al., 2005: 17% Holub et al., 2005: 40% Serre et al., 2005: 35% Berg et al, 2005: 48% Zhang et al., 2006: 59% Lazebnik et al., 2006: 56% (15 labeled images per class) 36.0% error reduction Self-taught Learning

15. Discussion Why sparse coding? Can this idea (learning structure from unlabelled image) be applied to other encoding schemes? Probably because the nonlinear coding from x to alpha? Emulation of “end-stopping” phenomenon. (A feature is maximally activated by edges of only a specific orientation and length)

16.

17. Use unlabeled data to discover features.

18. Use sparse coding to construct an easy-to-classify, “higher-level” representation.Unlabeled images = 0.8 * + 0.3 * + 0.5 *

Self Taught Learning

Recommended

Recommended

More Related Content

Similar to Self Taught Learning

Similar to Self Taught Learning (11)

More from Shao-Chuan Wang

More from Shao-Chuan Wang (9)

Recently uploaded

Recently uploaded (20)

Self Taught Learning