The document discusses natural language processing (NLP), which is a subfield of artificial intelligence that aims to allow computers to understand and interpret human language. It provides an introduction to NLP and its history, describes common areas of NLP research like text processing and machine translation, and discusses potential applications and the future of the field. The document is presented as a slideshow on NLP by an expert in the area.
1. Natur al Language Processing
Jaganadh G
Process expert (NLP, ir & ie)
r&d Division
365media inc.
Coimbatore, India
California , usa
Jaganadhg@365media.in
www.365media.com
04-06-2010 Govt. Eng. College
painav
2. outline
â˘Introduction
â˘History
â˘Areas in NLP
â˘Future of NLP
â˘References
04-06-2010 Govt. Eng. College
painav
3. Question ?
â˘Have you ever used any NLP products/ NLP
Powered tools ?
â˘
â˘
â˘
04-06-2010 Govt. Eng. College
painav
4. Natural Language Processing
â˘A sub-field of Artificial Intelligence (AI)
â˘An inter disciplinary subject
â˘Aim:
â˘To build intelligent computers that can interact with
human being like a human being !!
04-06-2010 Govt. Eng. College
painav
5. Natural Language ?
⢠Natural Language?
â˘Refers to the language spoken by people, e.g. English,
Japanese, Swahili, as opposed to artificial languages, like
C++, Java, etc.
04-06-2010 Govt. Eng. College
painav
6. Definition
Natural Language Processing is a theoretically motivated
range of computational techniques for analyzing and
representing naturally occurring texts/speech at one or
more levels of linguistic analysis for the purpose of
achieving human-like language processing for a range of
tasks or applications.
04-06-2010 Govt. Eng. College
painav
7. History
âSecond World War !!!
âStarted with Machine Translation Research
â Now:
âThe most promising technology solutions
âLabs --> Industry --> Layman
04-06-2010 Govt. Eng. College
painav
8. Why NLP
â˘Huge amounts of data
Internet = at least 20 billions pages
Text data â web sites, blog, tweets .......
Audio data â speech .......
â˘Applications for processing large amounts of texts require NLP
expertise
04-06-2010 Govt. Eng. College
painav
9. Why nlp?
News:
AN EARTHQUAKE struck Indonesia today - a strapping 7.7 magnitude earthquake that struck early today off the
northern coast of the island of Sumatra. It caused minor damage and there are no reports of any deaths, although
electricity was interrupted in several places.
Location : Indonesia
Magnitude: 7.7
Region: Sumatra (Northern Cost)
Deaths: Nil
Damage: Minor
Tweet
@nokia announces release of new PDA phones see is.gd/iuTuY
Who: Nokia
What: Product announcement
04-06-2010 Govt. Eng. College
painav
10. Is NLP really hard to achieve
04-06-2010 Govt. Eng. College
painav
11. MAJOR Areas of Research & Development
â˘Text Processing
â˘Morph Analyzer
â˘POS Tagging
â˘Parsing
â˘Machine Translation .........
â˘Speech Processing
â˘Text to Speech (TTS)
â˘Automatic Speech Recognition (ASR)
â˘Speech to Speech Translation
04-06-2010 Govt. Eng. College
painav
12. Text processing
â Processing raw text
â Morphological Analysis
â Running --> run + ing
â POS Tagging
â Ram/NNP goes/VB to/TO school/NNP ..
â Stemming
â running --> run
â Parsing
â Identifying sentence structure
â S --> NP + VP .Govt. Eng. College
04-06-2010
painav
13. Text processing
Machine Translation
Translating content in one natural language to another
natural language
Example : Translating and English Sentence to Malaylam
with the help of a software.
04-06-2010 Govt. Eng. College
painav
14. Speech processing
â˘Text to speech
Converting electronic text to digital speech
â˘Automatic Speech Recognition
Automatic transcription of spoken content to
electronic text
â˘Speech to speech translation
Translating spoken content from one language to
another in real time or offline.
04-06-2010 Govt. Eng. College
painav
15. MAJOR Areas of Research & Development
industrial Applications
⢠Search Engines
â˘Advanced Text Editors
â˘Commercial Machine Translation Systems
â˘Information Extraction
â˘Collaborative filtering
â˘Translation Memories
â˘Computational Advertising
â˘Fraud Detection
â˘Sentiment Analysis
â˘Opinion Mining ......... Govt. Eng. College
04-06-2010
painav
16. Some examples
Document classification
??
Sports
Document Arts
History
Science
??
04-06-2010 Govt. Eng. College
painav
17. Information extraction
Who did what ?
Document When ?
Where?
Barrack Obama
Person: Barrack Obama ->Who
elected as president
Position: President -> What
Of US
Event: elected -> What
04-06-2010 Govt. Eng. College
painav
18. Sentiment analysis
#2012 in very good !!??
bleh :-(
Toby Segram's Programming
Collective intelligence is a
nice book. It gives a detailed
and simple view on ......
04-06-2010 Govt. Eng. College
painav
19. Collaborative filtering
The art /technology to make recommendations based on
user behavior
04-06-2010 Govt. Eng. College
painav
23. NLP in other Domains
⢠Bio-Medical
â˘Forensic Science
â˘Advertisement
â˘Education
â˘Politics
â˘E-governance
â˘Business Development
â˘Marketing
â˘and where ever we use language !!!
04-06-2010 Govt. Eng. College
painav
24. Nlp in India
IIT Kanpur
IIT Kharagpur
IIT Delhi
IIIT Hydrabad
AU-KBC Chennai
C-DAC
Microsoft
Yahoo
AOL
365MEEDIA
Taazaa
Reuters India
.....
04-06-2010 Govt. Eng. College
painav
25. Discussion time
Questions ?
04-06-2010 Govt. Eng. College
painav
26. About 365media
Real time information services
Started in 1998 â with 10 staff at California
India operations started in 2005 @ coimbatore
Now 300 employees , 20 + clients
04-06-2010 Govt. Eng. College
painav
27. thanks
Jaganadh G
Email
Business -Jaganadhg@365media.in
Personal -jaganadhg@gmail.com
http://jaganadhg.freeflux.net/blog
04-06-2010 Govt. Eng. College
painav