Generating Wikipedia summaries with deep learning

•

2 likes•624 views

1) The document summarizes recent papers on deep learning techniques for text summarization including neural attention models for sentence summarization and abstractive text summarization using sequence-to-sequence RNNs. 2) It also discusses papers on transformer models using attention mechanisms, mixture of experts layers in neural networks, and effective approaches to attention-based neural machine translation. 3) Recent advances in deep learning like attention mechanisms, transformer models, and mixture of experts layers have led to improvements in natural language tasks like text summarization and machine translation.

Technology

1
DEEP LEARNING JP
[DL Papers]
http://deeplearning.jp/
Generating Wikipedia by Summarizing Long Sequences
(ICLR 2018)
Toru Fujino, scalab, UTokyo

• . /0
• s ( -:: 8 2 I
• : p 2>2 > goo.gl/wSuuS9 k
• 1 2ILCG B iI rd i
e
• a Ird i I R l
• G ItW )
• B W DC noI2>> > : g

• , ) 1
• ,
•
• : ,
•
•
• :
•
•
• , , ( )
• , ,

1
• .G DC L N
•
• 51 3 2 : 0 6 4 1 (
•
• R
• // 1 2 : /1 1:1 4 1 )
•
•
•
1) Rush et al. “A Neural Attention Model for Sentence Summarization”, EMNLP 2015
2) Nallapati et al. “Abstractive Text Summarization using Sequence-to-Sequence RNNs and Beyond”, CoNLL 2016

• (,
,, ),( )
• e /
• S
•
/
• 2 A / /
2) Nallapati et al. “Abstractive Text Summarization using Sequence-to-Sequence RNNs and Beyond”, CoNLL 2016
2)

• 1 W R
a
•
• 2 G
• 00 . c
•
• 1 d
https://en.wikipedia.org/wiki/Deep_learning

• 3 43 4 Y p
• CD 4 M i
• ac c 43 , 4 a
• d ac c n nY
r Ly
• ot ldA CN m m
3 43 e
• 4 , 3 43 m u
• ) 24 : 42 3 43 m
• s e
3) A. Vaswani et al. “Attention is All You Need”, NIPS 2017
4) N. Shazzer et al. “Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer”, ICLR 2017

• . .
• 3 .)
• .) A
• ( 3 .) : :
3) A. Vaswani et al. “Attention is All You Need”, NIPS 2017

• ( 2
• E !"
= [!%
"
, !'
"
, … , !)"
"
]
• E !+
= [!%
+
, !'
+
, … , !)+
+
]
• ) E
• A : D
5) M.-T. Luong et al. “Effective Approaches to Attention-based Neural Machine translation”, EMNLP 2015
5)

) (
• )
•
/(
4) N. Shazzer et al. “Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer”, ICLR 2017
4)

• 9 L
• 2 / 02 2 / 02 2 /
• 9
• M - - - 9 =
• -1 5

• M
• / / -,/ p k lsr im e
• W lsr f :
• n s k
• a a > - - /
• - / y
• -2 2 - -, - / Wy
•
• ot C L c A
• L M d L C
• lsr : L

Similar to Generating Wikipedia summaries with deep learning

Survey of the current trends, and the future in Natural Language Generation Yu Sheng Su

An online semantic enhanced dirichlet model for short textJay Kumarr

Open-source tools for generating and analyzing large materials data setsAnubhav Jain

Text Analysis of Academic Papers Archived in Institutional RepositoriesOkamoto Laboratory, The University of Electro-Communications

Using Local Spectral Methods to Robustify Graph-Based LearningDavid Gleich

Model of semantic textual document clusteringSK Ahammad Fahad

DeepLabCut AI ResidencyVic Shao-Chih Chiang

How to make effective presentationمحمد طه أحمد

Promoting Science and Technology Exchange using Machine TranslationToshiaki Nakazawa

The Materials Project: overview and infrastructureAnubhav Jain

Dominik Kowald PhD Defense Recommender SystemsDominik Kowald

Software Sustainability: Better Software Better ScienceCarole Goble

2014 11-13-sbsm032-reproducible researchYannick Wurm

The data we wantElena Simperl

ET_with_EEGXuan Guo

Automatic Classification of Springer Nature Proceedings with Smart Topic MinerFrancesco Osborne

Observation of The Hot Research on Disruptive Science and Technology via Scie...Masatsura IGAMI

Advances in automating analysis of neural time seriesMainak Jas

R&D Halfyearly Report.pptxSaumya Acharya

Improving Semantic Search Using Query Log AnalysisStuart Wrigley

Similar to Generating Wikipedia summaries with deep learning (20)

Survey of the current trends, and the future in Natural Language Generation

An online semantic enhanced dirichlet model for short text

Open-source tools for generating and analyzing large materials data sets

Text Analysis of Academic Papers Archived in Institutional Repositories

Using Local Spectral Methods to Robustify Graph-Based Learning

Model of semantic textual document clustering

DeepLabCut AI Residency

How to make effective presentation

Promoting Science and Technology Exchange using Machine Translation

The Materials Project: overview and infrastructure

Dominik Kowald PhD Defense Recommender Systems

Software Sustainability: Better Software Better Science

2014 11-13-sbsm032-reproducible research

The data we want

ET_with_EEG

Automatic Classification of Springer Nature Proceedings with Smart Topic Miner

Observation of The Hot Research on Disruptive Science and Technology via Scie...

Advances in automating analysis of neural time series

R&D Halfyearly Report.pptx

Improving Semantic Search Using Query Log Analysis

Recently uploaded

WordPress Websites for Engineers: Elevate Your Brandgvaughan

Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3

Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3

Scale your database traffic with Read & Write split using MySQL RouterMydbops

Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro

How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe

"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays

New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada

What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett

Gen AI in Business - Global Trends Report 2024.pdfAddepto

Ryan Mahoney - Will Artificial Intelligence Replace Real Estate AgentsRyan Mahoney

The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3

New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada

What is Artificial Intelligence?????????blackmambaettijean

The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos

"ML in Production",Oleksandr BaganFwdays

Visualising and forecasting stocks using Dashnarutouzumaki53779

Time Series Foundation Models - current state and future directionsNathaniel Shimoni

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada

Recently uploaded (20)

WordPress Websites for Engineers: Elevate Your Brand

Digital Identity is Under Attack: FIDO Paris Seminar.pptx

Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx

Scale your database traffic with Read & Write split using MySQL Router

Unraveling Multimodality with Large Language Models.pdf

How AI, OpenAI, and ChatGPT impact business and software.

"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack

New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024

What's New in Teams Calling, Meetings and Devices March 2024

Gen AI in Business - Global Trends Report 2024.pdf

Ryan Mahoney - Will Artificial Intelligence Replace Real Estate Agents

The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx

New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024

What is Artificial Intelligence?????????

The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)

"ML in Production",Oleksandr Bagan

Visualising and forecasting stocks using Dash

Time Series Foundation Models - current state and future directions

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024

Generating Wikipedia summaries with deep learning

1. 1 DEEP LEARNING JP [DL Papers] http://deeplearning.jp/ Generating Wikipedia by Summarizing Long Sequences (ICLR 2018) Toru Fujino, scalab, UTokyo

2. • . /0 • s ( -:: 8 2 I • : p 2>2 > goo.gl/wSuuS9 k • 1 2ILCG B iI rd i e • a Ird i I R l • G ItW ) • B W DC noI2>> > : g

3. • , ) 1 • , • • : , • • • : • • • , , ( ) • , ,

4. 1 • .G DC L N • • 51 3 2 : 0 6 4 1 ( • • R • // 1 2 : /1 1:1 4 1 ) • • • 1) Rush et al. “A Neural Attention Model for Sentence Summarization”, EMNLP 2015 2) Nallapati et al. “Abstractive Text Summarization using Sequence-to-Sequence RNNs and Beyond”, CoNLL 2016

5. • (, ,, ),( ) • e / • S • / • 2 A / / 2) Nallapati et al. “Abstractive Text Summarization using Sequence-to-Sequence RNNs and Beyond”, CoNLL 2016 2)

6. • 2. 1 / • 2

7. • 1 W R a • • 2 G • 00 . c • • 1 d https://en.wikipedia.org/wiki/Deep_learning

8. • goo.gl/wSuuS9 ( )

9. • , 1 2 3 4

10. • • • - •

11. • 3 43 4 Y p • CD 4 M i • ac c 43 , 4 a • d ac c n nY r Ly • ot ldA CN m m 3 43 e • 4 , 3 43 m u • ) 24 : 42 3 43 m • s e 3) A. Vaswani et al. “Attention is All You Need”, NIPS 2017 4) N. Shazzer et al. “Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer”, ICLR 2017

12. • . . • 3 .) • .) A • ( 3 .) : : 3) A. Vaswani et al. “Attention is All You Need”, NIPS 2017

13. • ( 2 • E !" = [!% " , !' " , … , !)" " ] • E !+ = [!% + , !' + , … , !)+ + ] • ) E • A : D 5) M.-T. Luong et al. “Effective Approaches to Attention-based Neural Machine translation”, EMNLP 2015 5)

14. • ( - E • : D • : D • ) : D • E : A

15. • • A , - •

16. • K K V 2 5 /6 ) , • ( A 6 • 6

17. • L K 3 ASA • V = • . • 11,/1 / •

18. ) ( • ) • /( 4) N. Shazzer et al. “Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer”, ICLR 2017 4)

19. • 9 L • 2 / 02 2 / 02 2 / • 9 • M - - - 9 = • -1 5

20. •

21. • - - • - • - • :

22. •

23.

24. )& ( • ( ()

25.