Build Cutting Edge Biomedical & Clinical NLU Models
BioBERT for NLU
2
TRENDS IN NLP & SPEECH
NLP’s ImageNet Moment has Arrived
LOWER BARRIER TO ENTRY
You don’t need a PhD in ML to do industrial-strength NLP.
UNSTRUCTURED & UNTAPPED
Textual data is still largely not utilized in healthcare, despite its value.
CONVERSATIONAL AI NEEDS LARGE MODELS
Pre-train a very large language model once and fine-tune it many times for different use cases.
DOMAIN SPECIFIC BEATS GENERIC
BioBERT beats BERT on biomedical tasks.
ClinicalBERT beats BioBERT on clinical tasks.
GROWTH OF MULTI-MODAL DATASETS
EHR data, PubMed literature, Clinical Notes, Imaging, Devices, Patient Communications, Social Media.
DRAMATICALLY IMPROVING ALGORITHMS
The Transformer & its derivatives like BERT & XLNet produce game-changing performance improvements.
3
USE CASES IN HEALTHCARE
Text Classification
• Sentiment Analysis
• Intent Classification
• Message Triaging
• Claims Processing
Named Entity Recognition
• Information Extraction
• Features in ML models
• Knowledge Graphs
• Automatic Weak Labeling
• De-identification
Question-Answer
• Answer questions posed in natural language
• Chatbots
Text Summarization
• Summarize physician notes, radiology reports, etc.
Speech Recognition
• Call Center optimization
• Voice commands
Machine Translation
• Patient Engagement
• Published Literature
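To make the Named Entity Recognition use case concrete, here is a minimal sketch of a common post-processing step: grouping per-token tags into entity spans that can feed information extraction or de-identification. It assumes the model emits BIO tags (B- begins an entity, I- continues it, O is outside). The tag names PROBLEM and DRUG, the sample sentence, and the function name bio_to_spans are illustrative assumptions, not taken from the slides.

```python
def bio_to_spans(tokens, tags):
    """Group token-level BIO tags into (entity_type, text) spans."""
    spans = []
    current_type, current_tokens = None, []
    for token, tag in zip(tokens, tags):
        starts_new = tag.startswith("B-") or (
            tag.startswith("I-") and tag[2:] != current_type
        )
        if starts_new:
            # A B- tag (or a stray I- tag of a different type) opens a new entity.
            if current_tokens:
                spans.append((current_type, " ".join(current_tokens)))
            current_type, current_tokens = tag[2:], [token]
        elif tag.startswith("I-"):
            # Continuation of the entity that is currently open.
            current_tokens.append(token)
        else:
            # "O" closes any open entity.
            if current_tokens:
                spans.append((current_type, " ".join(current_tokens)))
            current_type, current_tokens = None, []
    if current_tokens:
        spans.append((current_type, " ".join(current_tokens)))
    return spans


# Hypothetical model output for one clinical sentence.
tokens = ["Patient", "denies", "chest", "pain", "after", "starting", "metformin", "."]
tags = ["O", "O", "B-PROBLEM", "I-PROBLEM", "O", "O", "B-DRUG", "O"]
print(bio_to_spans(tokens, tags))
# [('PROBLEM', 'chest pain'), ('DRUG', 'metformin')]
```

The extracted spans could then serve as features in downstream ML models or as nodes in a knowledge graph, as listed above.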
4
RACE TO CONVERSATIONAL AI
Exceeding Human Level Performance
GLUE Leaderboard
Timeline axis: 2017, 2018, 2019, Today
• Google (Transformer)
• Google (BERT)
• Facebook (RoBERTa)
• Alibaba (Enriched BERT base)
• Uber (Plato)
• Microsoft (MT-DNN)
• Baidu (ERNIE)
5
DOMAIN SPECIFIC BEATS GENERIC
BioBERT
• Pre-trained on top of BERT using PubMed data
• Beats BERT on biomedical tasks.
Clinical BERT(s)
• Pre-trained on top of BioBERT using clinical notes
• Beats BioBERT on clinical tasks.
6
Pre-Training vs. Fine-Tuning
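The slide itself is a figure, so the following is only a rough sketch of the distinction. BERT-style pre-training is self-supervised: training examples are built from unlabeled text by hiding tokens and asking the model to recover them (masked language modeling). Fine-tuning instead uses a small set of human-labeled examples for one task. The function name, the 15% default masking rate shown here, and the sample data are illustrative; BERT's full recipe also sometimes swaps in random tokens or leaves the chosen token unchanged, which is omitted for brevity.

```python
import random

MASK = "[MASK]"


def make_mlm_example(tokens, mask_prob=0.15, seed=0):
    """Build one masked-language-model example from unlabeled tokens.

    Returns (inputs, labels): labels hold the original token at masked
    positions and None everywhere else.
    """
    rng = random.Random(seed)
    inputs, labels = [], []
    for tok in tokens:
        if rng.random() < mask_prob:
            inputs.append(MASK)   # hide the token from the model
            labels.append(tok)    # the model must predict it
        else:
            inputs.append(tok)
            labels.append(None)   # no loss at unmasked positions
    return inputs, labels


# Pre-training: labels come from the text itself (no annotation needed).
unlabeled = "aspirin inhibits platelet aggregation in patients".split()
mlm_inputs, mlm_labels = make_mlm_example(unlabeled, mask_prob=0.3, seed=1)

# Fine-tuning: labels come from annotators for a specific task
# (a hypothetical message-triage example).
fine_tune_example = {"text": "I need to refill my prescription", "label": "refill_request"}
```

The same pre-trained weights can be fine-tuned separately for each downstream task, which is why pre-training is done once and fine-tuning many times.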
7
8
https://ngc.nvidia.com/catalog/model-scripts/nvidia:biobert_for_tensorflow
TRAIN USING NGC
Optimized, Scalable & Easy to Use
• Convenient scripts for pre-training & fine-tuning
• Optimized Docker images for TensorFlow
• Automatic Mixed Precision for up to 3x speedup
• Scale out for pre-training & fine-tuning
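The slides do not explain how mixed precision works. As a hedged illustration of one ingredient, the sketch below shows why mixed-precision training typically pairs half-precision arithmetic with loss scaling: very small gradient values underflow to zero in FP16 unless they are scaled up first. This is a standalone demonstration using Python's struct module, not NVIDIA's implementation; the gradient value and the loss scale of 1024 are arbitrary choices for illustration.

```python
import struct


def to_fp16(x):
    """Round a Python float to IEEE 754 half precision and back."""
    return struct.unpack("e", struct.pack("e", x))[0]


grad = 1e-8            # a tiny gradient value (illustrative)
loss_scale = 1024.0    # illustrative scale factor

# Stored directly in FP16, the gradient underflows to zero.
unscaled = to_fp16(grad)

# Scaling first keeps it representable in FP16 ...
scaled = to_fp16(grad * loss_scale)

# ... and dividing by the scale in higher precision recovers it.
recovered = scaled / loss_scale

print(unscaled)   # 0.0
print(recovered)  # close to 1e-8
```

Automatic Mixed Precision handles this kind of casting and scaling for the user, so the training script does not need to manage it by hand.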
9
TRAIN USING NGC
Optimized, Scalable & Easy to Use
For comparison, the BioBERT paper reported 10+ days (240+ hours) to train on a system with eight 32 GB V100 GPUs.
https://news.developer.nvidia.com/biobert-optimized/
NLP for Biomedical Applications
