SlideShare a Scribd company logo
1 of 12
*
    Nick Campbell
    Speech Communication Lab
    Trinity College Dublin, Ireland
*
    * TCD – Stokes Professor (Dublin)
    * CNGL – PI – Delivery & Interaction
    * ELRA – board member / VP – speech
    * ISCA – board member – workshops
    * IEEE – Sig Proc Soc - SLTC member
    * ATR/NiCT – research director(Japan)
    * Speech Prosody 2014 (Dublin) host

        * Speech scientist/researcher/corpus analyst
* AT&T Bell Labs
    * The ideas people – think ‘BIG’

* IBM UK Scientific Centre
    * The corpus people – ‘collect it all’

* ATR basic telecom research
    * The fundamentals - learn how to ‘infer’ from it


*
* we used to be considered BIG – speech data
  (and now multimedia) gobbled up memory
* I collected 1500 hours of everyday chat/daily
  conversations in 2000 – (@1GB per minute) -
  took 5-years to process!

* now Apple, Google, Ms, .. get that each minute
       (but the secret is in the metadata)

* we need accessible data & tools for everybody!

   *
* but we need to manage privacy issues first!




  *
* and we need a way to protect IP as well

* written publications have ISBN standard
* work is now underway (cf ELRA & COCOSDA) to
  institute ISLRN for Language Resources
* researchers need to get credit for corpora as
  well as for publishing research results
* The community needs a way to identify,
  acknowledge, attribute, and reference data



 *
* tools for processing speech & multimodal data

* htk, hts, R, etc . . .   not simple to use


* little consensus on what features to encode

* manual bootstrap – much too time-consuming!


*
* social interaction

* personal idiosyncracies

* group dynamics – multimodal data (TB/hr)

* issues of robustness / domain specificity /
 privacy / storage & archiving / redistribution


     *
context analytics:


* cultural and language-specific needs
* multimodal – multimedia – multilingual
* tools for ‘less-well-supported’ languages

* e.g., U-STAR consortium for speech research –
 sharing tools & data & knowledge for research



     *
* European Language Resources Association
* COCOSDA – int’l coordinating committee
* IEEE SLTC, ISCA SIGS, there are places to go

    * but are they ready for really BIG data?
               perhaps not yet . . .




                          *
* curricula prepare people

* what standards to rely on?
* what resources available?
* what features to extract?
* what tools to work with?
* what use to put it to?
* what info to hide?
* what to do next?

                               *
*

More Related Content

Viewers also liked

Relational Database to RDF (RDB2RDF)
Relational Database to RDF (RDB2RDF)Relational Database to RDF (RDB2RDF)
Relational Database to RDF (RDB2RDF)EUCLID project
 
Annotation Processor, trésor caché de la JVM
Annotation Processor, trésor caché de la JVMAnnotation Processor, trésor caché de la JVM
Annotation Processor, trésor caché de la JVMRaphaël Brugier
 
Querying Linked Data on Android
Querying Linked Data on AndroidQuerying Linked Data on Android
Querying Linked Data on AndroidEUCLID project
 
Scaling up Linked Data
Scaling up Linked DataScaling up Linked Data
Scaling up Linked DataEUCLID project
 
Interaction with Linked Data
Interaction with Linked DataInteraction with Linked Data
Interaction with Linked DataEUCLID project
 
Building Linked Data Applications
Building Linked Data ApplicationsBuilding Linked Data Applications
Building Linked Data ApplicationsEUCLID project
 
Usage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application ScenariosUsage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application ScenariosEUCLID project
 
Conférence sur les annotations Java par Olivier Croisier (Zenika) au Paris JUG
Conférence sur les annotations Java par Olivier Croisier (Zenika) au Paris JUGConférence sur les annotations Java par Olivier Croisier (Zenika) au Paris JUG
Conférence sur les annotations Java par Olivier Croisier (Zenika) au Paris JUGZenika
 
A Guide to SlideShare Analytics - Excerpts from Hubspot's Step by Step Guide ...
A Guide to SlideShare Analytics - Excerpts from Hubspot's Step by Step Guide ...A Guide to SlideShare Analytics - Excerpts from Hubspot's Step by Step Guide ...
A Guide to SlideShare Analytics - Excerpts from Hubspot's Step by Step Guide ...SlideShare
 

Viewers also liked (11)

Relational Database to RDF (RDB2RDF)
Relational Database to RDF (RDB2RDF)Relational Database to RDF (RDB2RDF)
Relational Database to RDF (RDB2RDF)
 
Comment manager des geeks - Devoxx 2015
Comment manager des geeks - Devoxx 2015Comment manager des geeks - Devoxx 2015
Comment manager des geeks - Devoxx 2015
 
Annotation Processor, trésor caché de la JVM
Annotation Processor, trésor caché de la JVMAnnotation Processor, trésor caché de la JVM
Annotation Processor, trésor caché de la JVM
 
Querying Linked Data on Android
Querying Linked Data on AndroidQuerying Linked Data on Android
Querying Linked Data on Android
 
Scaling up Linked Data
Scaling up Linked DataScaling up Linked Data
Scaling up Linked Data
 
Interaction with Linked Data
Interaction with Linked DataInteraction with Linked Data
Interaction with Linked Data
 
Querying Linked Data
Querying Linked DataQuerying Linked Data
Querying Linked Data
 
Building Linked Data Applications
Building Linked Data ApplicationsBuilding Linked Data Applications
Building Linked Data Applications
 
Usage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application ScenariosUsage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application Scenarios
 
Conférence sur les annotations Java par Olivier Croisier (Zenika) au Paris JUG
Conférence sur les annotations Java par Olivier Croisier (Zenika) au Paris JUGConférence sur les annotations Java par Olivier Croisier (Zenika) au Paris JUG
Conférence sur les annotations Java par Olivier Croisier (Zenika) au Paris JUG
 
A Guide to SlideShare Analytics - Excerpts from Hubspot's Step by Step Guide ...
A Guide to SlideShare Analytics - Excerpts from Hubspot's Step by Step Guide ...A Guide to SlideShare Analytics - Excerpts from Hubspot's Step by Step Guide ...
A Guide to SlideShare Analytics - Excerpts from Hubspot's Step by Step Guide ...
 

Similar to Nick Campbell speech scientist overview

GBIF BIFA mentoring, Day 5a Data management, July 2016
GBIF BIFA mentoring, Day 5a Data management, July 2016GBIF BIFA mentoring, Day 5a Data management, July 2016
GBIF BIFA mentoring, Day 5a Data management, July 2016Dag Endresen
 
IWST 2013: Intro
IWST 2013: IntroIWST 2013: Intro
IWST 2013: IntroESUG
 
Kathryn Cassidy - DRI Training Series: 4. Metadata and XML
Kathryn Cassidy - DRI Training Series: 4. Metadata and XMLKathryn Cassidy - DRI Training Series: 4. Metadata and XML
Kathryn Cassidy - DRI Training Series: 4. Metadata and XMLdri_ireland
 
Cloud Programming Models: eScience, Big Data, etc.
Cloud Programming Models: eScience, Big Data, etc.Cloud Programming Models: eScience, Big Data, etc.
Cloud Programming Models: eScience, Big Data, etc.Alexandru Iosup
 
DRI Introduction to Digital Preservation Training- Metadata and xml-Kathryn C...
DRI Introduction to Digital Preservation Training- Metadata and xml-Kathryn C...DRI Introduction to Digital Preservation Training- Metadata and xml-Kathryn C...
DRI Introduction to Digital Preservation Training- Metadata and xml-Kathryn C...dri_ireland
 
iUser2011 Keynote: The Personal Information Environment beyond the Personal C...
iUser2011 Keynote: The Personal Information Environment beyond the Personal C...iUser2011 Keynote: The Personal Information Environment beyond the Personal C...
iUser2011 Keynote: The Personal Information Environment beyond the Personal C...Alan Dix
 
dbGLOVE (presentation at Silicon Valley Personal Health Technology)
dbGLOVE (presentation at Silicon Valley Personal Health Technology)dbGLOVE (presentation at Silicon Valley Personal Health Technology)
dbGLOVE (presentation at Silicon Valley Personal Health Technology)QIRIS
 
Six Use Cases for Edinburgh DataShare
Six Use Cases for Edinburgh DataShareSix Use Cases for Edinburgh DataShare
Six Use Cases for Edinburgh DataShareRobin Rice
 
Using islandora to build digital collections - 2016.01.29 OLA 2016
Using islandora to build digital collections - 2016.01.29 OLA 2016Using islandora to build digital collections - 2016.01.29 OLA 2016
Using islandora to build digital collections - 2016.01.29 OLA 2016KellliBee
 
Information Extraction from Text, presented @ Deloitte
Information Extraction from Text, presented @ DeloitteInformation Extraction from Text, presented @ Deloitte
Information Extraction from Text, presented @ DeloitteDeep Kayal
 
Keynote - TUT W3C Web Technology Day: Linked Data for Science and Industry, 2...
Keynote - TUT W3C Web Technology Day: Linked Data for Science and Industry, 2...Keynote - TUT W3C Web Technology Day: Linked Data for Science and Industry, 2...
Keynote - TUT W3C Web Technology Day: Linked Data for Science and Industry, 2...Michael Hausenblas
 
What's the fuss about all this metadata?
What's the fuss about all this metadata?What's the fuss about all this metadata?
What's the fuss about all this metadata?Sara Sterkenburg
 
An information environment for neuroscientists
An information environment for neuroscientistsAn information environment for neuroscientists
An information environment for neuroscientistsDavid Wallom
 
Ensuring Continuing Access to Online Scholarly Resources
Ensuring Continuing Access to Online Scholarly ResourcesEnsuring Continuing Access to Online Scholarly Resources
Ensuring Continuing Access to Online Scholarly ResourcesEDINA, University of Edinburgh
 
Digital Cultural Heritage and the new EU Framework Programme
Digital Cultural Heritage and the new EU Framework ProgrammeDigital Cultural Heritage and the new EU Framework Programme
Digital Cultural Heritage and the new EU Framework Programmelocloud
 

Similar to Nick Campbell speech scientist overview (20)

GBIF BIFA mentoring, Day 5a Data management, July 2016
GBIF BIFA mentoring, Day 5a Data management, July 2016GBIF BIFA mentoring, Day 5a Data management, July 2016
GBIF BIFA mentoring, Day 5a Data management, July 2016
 
Born Digital Archives
Born Digital ArchivesBorn Digital Archives
Born Digital Archives
 
Importance of Database in Library
Importance of Database in LibraryImportance of Database in Library
Importance of Database in Library
 
IWST 2013: Intro
IWST 2013: IntroIWST 2013: Intro
IWST 2013: Intro
 
Kathryn Cassidy - DRI Training Series: 4. Metadata and XML
Kathryn Cassidy - DRI Training Series: 4. Metadata and XMLKathryn Cassidy - DRI Training Series: 4. Metadata and XML
Kathryn Cassidy - DRI Training Series: 4. Metadata and XML
 
Cloud Programming Models: eScience, Big Data, etc.
Cloud Programming Models: eScience, Big Data, etc.Cloud Programming Models: eScience, Big Data, etc.
Cloud Programming Models: eScience, Big Data, etc.
 
DRI Introduction to Digital Preservation Training- Metadata and xml-Kathryn C...
DRI Introduction to Digital Preservation Training- Metadata and xml-Kathryn C...DRI Introduction to Digital Preservation Training- Metadata and xml-Kathryn C...
DRI Introduction to Digital Preservation Training- Metadata and xml-Kathryn C...
 
iUser2011 Keynote: The Personal Information Environment beyond the Personal C...
iUser2011 Keynote: The Personal Information Environment beyond the Personal C...iUser2011 Keynote: The Personal Information Environment beyond the Personal C...
iUser2011 Keynote: The Personal Information Environment beyond the Personal C...
 
dbGLOVE (presentation at Silicon Valley Personal Health Technology)
dbGLOVE (presentation at Silicon Valley Personal Health Technology)dbGLOVE (presentation at Silicon Valley Personal Health Technology)
dbGLOVE (presentation at Silicon Valley Personal Health Technology)
 
Takeda 101214short-d
Takeda 101214short-dTakeda 101214short-d
Takeda 101214short-d
 
Six Use Cases for Edinburgh DataShare
Six Use Cases for Edinburgh DataShareSix Use Cases for Edinburgh DataShare
Six Use Cases for Edinburgh DataShare
 
Using islandora to build digital collections - 2016.01.29 OLA 2016
Using islandora to build digital collections - 2016.01.29 OLA 2016Using islandora to build digital collections - 2016.01.29 OLA 2016
Using islandora to build digital collections - 2016.01.29 OLA 2016
 
Digital Archive of Knowledge for Sharing and Re-using
Digital Archive of Knowledge for Sharing and Re-usingDigital Archive of Knowledge for Sharing and Re-using
Digital Archive of Knowledge for Sharing and Re-using
 
Challenges for Linked Data in Japan
Challenges for Linked Data in JapanChallenges for Linked Data in Japan
Challenges for Linked Data in Japan
 
Information Extraction from Text, presented @ Deloitte
Information Extraction from Text, presented @ DeloitteInformation Extraction from Text, presented @ Deloitte
Information Extraction from Text, presented @ Deloitte
 
Keynote - TUT W3C Web Technology Day: Linked Data for Science and Industry, 2...
Keynote - TUT W3C Web Technology Day: Linked Data for Science and Industry, 2...Keynote - TUT W3C Web Technology Day: Linked Data for Science and Industry, 2...
Keynote - TUT W3C Web Technology Day: Linked Data for Science and Industry, 2...
 
What's the fuss about all this metadata?
What's the fuss about all this metadata?What's the fuss about all this metadata?
What's the fuss about all this metadata?
 
An information environment for neuroscientists
An information environment for neuroscientistsAn information environment for neuroscientists
An information environment for neuroscientists
 
Ensuring Continuing Access to Online Scholarly Resources
Ensuring Continuing Access to Online Scholarly ResourcesEnsuring Continuing Access to Online Scholarly Resources
Ensuring Continuing Access to Online Scholarly Resources
 
Digital Cultural Heritage and the new EU Framework Programme
Digital Cultural Heritage and the new EU Framework ProgrammeDigital Cultural Heritage and the new EU Framework Programme
Digital Cultural Heritage and the new EU Framework Programme
 

Recently uploaded

[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 
Exploring ChatGPT Prompt Hacks To Maximally Optimise Your Queries
Exploring ChatGPT Prompt Hacks To Maximally Optimise Your QueriesExploring ChatGPT Prompt Hacks To Maximally Optimise Your Queries
Exploring ChatGPT Prompt Hacks To Maximally Optimise Your QueriesSanjay Willie
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
Fact vs. Fiction: Autodetecting Hallucinations in LLMs
Fact vs. Fiction: Autodetecting Hallucinations in LLMsFact vs. Fiction: Autodetecting Hallucinations in LLMs
Fact vs. Fiction: Autodetecting Hallucinations in LLMsZilliz
 

Recently uploaded (20)

[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
Exploring ChatGPT Prompt Hacks To Maximally Optimise Your Queries
Exploring ChatGPT Prompt Hacks To Maximally Optimise Your QueriesExploring ChatGPT Prompt Hacks To Maximally Optimise Your Queries
Exploring ChatGPT Prompt Hacks To Maximally Optimise Your Queries
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
Fact vs. Fiction: Autodetecting Hallucinations in LLMs
Fact vs. Fiction: Autodetecting Hallucinations in LLMsFact vs. Fiction: Autodetecting Hallucinations in LLMs
Fact vs. Fiction: Autodetecting Hallucinations in LLMs
 

Nick Campbell speech scientist overview

  • 1. * Nick Campbell Speech Communication Lab Trinity College Dublin, Ireland
  • 2. * * TCD – Stokes Professor (Dublin) * CNGL – PI – Delivery & Interaction * ELRA – board member / VP – speech * ISCA – board member – workshops * IEEE – Sig Proc Soc - SLTC member * ATR/NiCT – research director(Japan) * Speech Prosody 2014 (Dublin) host * Speech scientist/researcher/corpus analyst
  • 3. * AT&T Bell Labs * The ideas people – think ‘BIG’ * IBM UK Scientific Centre * The corpus people – ‘collect it all’ * ATR basic telecom research * The fundamentals - learn how to ‘infer’ from it *
  • 4. * we used to be considered BIG – speech data (and now multimedia) gobbled up memory * I collected 1500 hours of everyday chat/daily conversations in 2000 – (@1GB per minute) - took 5-years to process! * now Apple, Google, Ms, .. get that each minute (but the secret is in the metadata) * we need accessible data & tools for everybody! *
  • 5. * but we need to manage privacy issues first! *
  • 6. * and we need a way to protect IP as well * written publications have ISBN standard * work is now underway (cf ELRA & COCOSDA) to institute ISLRN for Language Resources * researchers need to get credit for corpora as well as for publishing research results * The community needs a way to identify, acknowledge, attribute, and reference data *
  • 7. * tools for processing speech & multimodal data * htk, hts, R, etc . . . not simple to use * little consensus on what features to encode * manual bootstrap – much too time-consuming! *
  • 8. * social interaction * personal idiosyncracies * group dynamics – multimodal data (TB/hr) * issues of robustness / domain specificity / privacy / storage & archiving / redistribution *
  • 9. context analytics: * cultural and language-specific needs * multimodal – multimedia – multilingual * tools for ‘less-well-supported’ languages * e.g., U-STAR consortium for speech research – sharing tools & data & knowledge for research *
  • 10. * European Language Resources Association * COCOSDA – int’l coordinating committee * IEEE SLTC, ISCA SIGS, there are places to go * but are they ready for really BIG data? perhaps not yet . . . *
  • 11. * curricula prepare people * what standards to rely on? * what resources available? * what features to extract? * what tools to work with? * what use to put it to? * what info to hide? * what to do next? *
  • 12. *