1. Extreme learning machine: Theory and applications
G.-B. Huang, Q.-Y. Zhu, and C.-K. Siew
Neurocomputing, 2006
Presenter: James Chou
2012/03/15
2. Outline
Introduction
Single-hidden layer feed-forward neural networks
Neural Network Mathematical Model
Back Propagation algorithm
ELM Mathematical Model
Performance Evaluation
Conclusion
3. Introduction
For the past decades, gradient-descent-based methods have mainly
been used in learning algorithms for feed-forward neural
networks.
Traditionally, all the parameters of a feed-forward neural
network need to be tuned iteratively, which takes a very long
time to learn.
When the input weights and the hidden layer biases are
randomly assigned, SLFNs (single-hidden layer feed-forward
neural networks) can simply be considered a linear system,
and the output weights (linking the hidden layer to the output
layer) can be computed through a simple generalized inverse
operation.
4. Introduction (Cont.)
Based on this idea, this paper proposes a simple learning
algorithm for SLFNs called the extreme learning machine (ELM).
Different from traditional learning algorithms, ELM not only
provides smaller training error but also better generalization
performance.
5. Single-hidden layer feed-forward neural networks
Output $= F\left(\sum_{i=1}^{N} \omega_i x_i\right)$
θ is the threshold
F(·) is the activation function
Hard limiter function:
$f(x) = \begin{cases} 1, & \text{when } x \ge \theta \\ 0, & \text{when } x < \theta \end{cases}$
Sigmoid function:
$f(x) = \dfrac{1}{1 + e^{-x}}$
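As an illustration (my own sketch, not code from the paper), the neuron output and the two activation functions above translate directly into NumPy:

```python
import numpy as np

def hard_limiter(x, theta=0.0):
    # f(x) = 1 when x >= theta, 0 when x < theta
    return np.where(x >= theta, 1.0, 0.0)

def sigmoid(x):
    # f(x) = 1 / (1 + e^(-x))
    return 1.0 / (1.0 + np.exp(-x))

def neuron_output(w, x, f=sigmoid):
    # Output = F(sum_{i=1}^{N} w_i * x_i)
    return f(np.dot(w, x))
```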
10. Back Propagation algorithm
The BP algorithm is the classic gradient-based algorithm for
finding the best weight vectors by minimizing the cost function.
Demo: BP algorithm!
η is the learning rate.
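As a hedged sketch (not the paper's code) of a single gradient-descent update for an SLFN with sigmoid hidden units and a linear output, where the variable names W, b, and beta are my own:

```python
import numpy as np

def bp_step(W, b, beta, x, t, eta=0.01):
    # Forward pass through one hidden layer.
    h = 1.0 / (1.0 + np.exp(-(W @ x + b)))   # hidden activations
    y = beta @ h                              # network output
    e = y - t                                 # output error
    # Backward pass: gradients of the cost 0.5 * e^2.
    grad_beta = e * h
    grad_h = e * beta * h * (1.0 - h)         # back-propagated through sigmoid
    grad_W = np.outer(grad_h, x)
    grad_b = grad_h
    # Gradient-descent updates with learning rate eta.
    return W - eta * grad_W, b - eta * grad_b, beta - eta * grad_beta
```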
11. ELM Mathematical Model
For the linear system $H\beta = T$, the smallest-norm least-squares
solution of the output weights is $\hat{\beta} = H^{+}T$, where $H^{+}$ is the
Moore-Penrose generalized inverse of the hidden layer output matrix $H$.
When $H^{T}H$ is invertible, $H^{+} = (H^{T}H)^{-1}H^{T}$.
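As a sketch of this procedure (not the authors' code), the three ELM steps fit in a few lines of NumPy; the sigmoid hidden layer and the names W, b, and beta are my own choices, and np.linalg.pinv computes H+ numerically:

```python
import numpy as np

def elm_train(X, T, L, rng=None):
    # X: (N, d) inputs, T: (N,) targets, L: number of hidden nodes.
    if rng is None:
        rng = np.random.default_rng(0)
    # Step 1: randomly assign input weights W and hidden biases b.
    W = rng.standard_normal((L, X.shape[1]))
    b = rng.standard_normal(L)
    # Step 2: compute the hidden layer output matrix H (N x L).
    H = 1.0 / (1.0 + np.exp(-(X @ W.T + b)))
    # Step 3: output weights beta = H+ T, the least-squares
    # solution of H beta = T via the Moore-Penrose inverse.
    beta = np.linalg.pinv(H) @ T
    return W, b, beta

def elm_predict(X, W, b, beta):
    H = 1.0 / (1.0 + np.exp(-(X @ W.T + b)))
    return H @ beta
```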
16. Regression of SinC Function (Cont.)
100,000 training data with 5-20% noise.
100,000 testing data are noise free.
The results, averaged over 50 training runs, are in the following table. Demo: ELM!
Noise | Avg training time (s) | Avg training RMSE | Avg testing RMSE
5%    | 0.6462                | 0.0113            | 2.201e-04
10%   | 0.6306                | 0.0224            | 2.753e-04
15%   | 0.6427                | 0.0334            | 8.336e-04
20%   | 0.6452                | 0.0449            | 1.1541e-03
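To make the experiment concrete, here is a hedged sketch of the SinC setup, reusing elm_train/elm_predict from the previous slide's sketch; the data sizes follow the slide, while the hidden node count L=20 and the uniform noise model are my own assumptions:

```python
import numpy as np

def sinc(x):
    # SinC target: sin(x)/x, with sinc(0) = 1.
    return np.where(x == 0.0, 1.0, np.sin(x) / np.where(x == 0.0, 1.0, x))

rng = np.random.default_rng(0)
x_train = rng.uniform(-10, 10, (100_000, 1))
t_train = sinc(x_train[:, 0]) + rng.uniform(-0.2, 0.2, 100_000)  # noisy targets
x_test = rng.uniform(-10, 10, (100_000, 1))
t_test = sinc(x_test[:, 0])  # noise-free test targets

W, b, beta = elm_train(x_train, t_train, L=20, rng=rng)
pred = elm_predict(x_test, W, b, beta)
print("testing RMSE: %.3e" % np.sqrt(np.mean((pred - t_test) ** 2)))
```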
24. Conclusion
Advantages
ELM needs less training time than the popular BP and
SVM/SVR methods.
The prediction performance of ELM is usually slightly better
than BP and close to SVM/SVR in many applications.
Only one parameter needs to be tuned: L, the number of hidden layer nodes.
Nonlinear activation functions still work in ELM.
Disadvantages
How to find the optimal solution?
Local minima issue.
Prone to overfitting.