Linking Stanford Typed Dependencies to Support Text Analytics

In this presentation, we describe our approach to making text dependencies more accessible for consumption and reuse in text analytics.

LINKING STANFORD TYPED
DEPENDENCIES TO SUPPORT
TEXT ANALYTICS
Fouad Zablith, Ibrahim H. Osman
American University of Beirut

Problem
• Text documents naturally include dependency relations
among textual elements
• Such dependencies enable readers to cognitively infer the
flow of thoughts and how the various elements are
semantically affected
• Automatically identifying textual dependencies has been
the focus of various approaches. However, we observe
that aggregating, accessing and reusing dependencies for
further processing is still a challenge

Question
• Our aim is to answer the following question: how can we
make text dependencies more accessible for consumption
and reuse in text analysis?
• For that we focus on the following requirements:
  • To have unique references to textual elements
  • To preserve dependency links across text sources
  • To store and serve the data for further consumption

Approach Overview
• Pipeline: Input Text → POS Tagger → Lexical Parser → RDF Generator → Triple Store → Text Analytics Apps
• Stages: Processing, Linking, Publishing/Reusing

RDF Model
• Nodes: Text/context, Sentence, Sentence text, Term, Label
• Links:
  • DCT:hasPart (Text/context to Sentence, Sentence to Term)
  • RDFS:hasDescription (Sentence to Sentence text)
  • RDFS:label (Term to Label)
  • STD:… (Term to Term)
• Part-of-speech classes, linked by RDFS:subClassOf: STD:JJ, STD:VB, STD:NN, STD:CD
• Dependency properties, linked by RDFS:subPropertyOf under STD:Dependent:
  • STD:auxiliary: STD:passiveAuxiliary, STD:copula
  • STD:modifier: STD:adjectivalModifier, STD:quantifierModifier
  • STD:… (all other dependency relations)
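
The sub-property hierarchy is what lets a query ask for "any modifier" without listing each relation. A minimal Python sketch of that idea follows. It assumes the parent/child arrangement matches the Stanford typed dependencies hierarchy (auxiliary and modifier under the root dependent relation). The dictionary and the helper names ancestors and is_a are hypothetical and only illustrate the lookup.

```python
# Hypothetical in-memory copy of the property hierarchy named on the slide.
# Keys are sub-properties; values are their direct parents (RDFS:subPropertyOf).
SUB_PROPERTY_OF = {
    "STD:auxiliary": "STD:Dependent",
    "STD:passiveAuxiliary": "STD:auxiliary",
    "STD:copula": "STD:auxiliary",
    "STD:modifier": "STD:Dependent",
    "STD:adjectivalModifier": "STD:modifier",
    "STD:quantifierModifier": "STD:modifier",
}


def ancestors(prop):
    """Return every property that `prop` specialises, nearest parent first."""
    chain = []
    while prop in SUB_PROPERTY_OF:
        prop = SUB_PROPERTY_OF[prop]
        chain.append(prop)
    return chain


def is_a(prop, candidate):
    """True if `prop` is `candidate` or one of its (transitive) sub-properties."""
    return prop == candidate or candidate in ancestors(prop)


print(ancestors("STD:adjectivalModifier"))
print(is_a("STD:copula", "STD:modifier"))
```

A triple store with RDFS inference would normally perform this closure itself; the sketch only shows what that inference amounts to.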

Example
• Sentence: NS:sentence/4c7aa81ba8fbcd3ad42996eb6bac18dc
  • RDFS:hasDescription: It is an efficient service
  • DCT:hasPart: each of the five terms below
• Terms (URI, RDFS:label, class via ISA):
  • NS:term/PRP/It/4c7aa81ba8fbcd3ad42996eb6bac18dc_1, It, STD:PRP
  • NS:term/VBZ/is/4c7aa81ba8fbcd3ad42996eb6bac18dc_2, is, STD:VBZ
  • NS:term/DT/an/4c7aa81ba8fbcd3ad42996eb6bac18dc_3, an, STD:DT
  • NS:term/JJ/efficient/4c7aa81ba8fbcd3ad42996eb6bac18dc_4, efficient, STD:JJ
  • NS:term/NN/service/4c7aa81ba8fbcd3ad42996eb6bac18dc_5, service, STD:NN
• Dependency links shown between terms: STD:nsubj, STD:det
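
The URI pattern in this example (a sentence id reused as a prefix for position-numbered terms) can be sketched in Python as below. Several things here are assumptions rather than the authors' implementation: the namespace is a placeholder, the 32-character id is assumed to be an MD5 digest of the sentence text (the slide does not say how it is computed, and the digest produced here is not claimed to match the one shown), the tagger and parser output is hard-coded instead of being produced by the Stanford tools, and dependency triples are assumed to point from governor to dependent.

```python
import hashlib

NS = "http://example.org/"  # placeholder namespace, not the authors' base URI


def sentence_uri(text):
    # Assumption: the sentence id is an MD5 hex digest of the UTF-8 text.
    digest = hashlib.md5(text.encode("utf-8")).hexdigest()
    return f"{NS}sentence/{digest}", digest


def term_uri(pos, word, digest, position):
    # Mirrors the slide's pattern: term/<POS>/<word>/<sentence id>_<position>
    return f"{NS}term/{pos}/{word}/{digest}_{position}"


def sentence_triples(text, tagged, dependencies):
    """Build (subject, predicate, object) tuples for one sentence.

    tagged: list of (word, pos) pairs in sentence order.
    dependencies: list of (relation, governor_position, dependent_position),
    with 1-based positions.
    """
    s_uri, digest = sentence_uri(text)
    terms = [term_uri(pos, word, digest, i)
             for i, (word, pos) in enumerate(tagged, start=1)]
    triples = [(s_uri, "RDFS:hasDescription", text)]
    for uri, (word, pos) in zip(terms, tagged):
        triples.append((s_uri, "DCT:hasPart", uri))
        triples.append((uri, "RDFS:label", word))
        triples.append((uri, "ISA", f"STD:{pos}"))  # "ISA" as labelled on the slide
    for relation, gov, dep in dependencies:
        # Assumed direction: governor term -> dependent term.
        triples.append((terms[gov - 1], f"STD:{relation}", terms[dep - 1]))
    return triples


# Hard-coded stand-in for tagger and parser output, limited to the two
# relations named on the slide.
tagged = [("It", "PRP"), ("is", "VBZ"), ("an", "DT"),
          ("efficient", "JJ"), ("service", "NN")]
deps = [("nsubj", 5, 1), ("det", 5, 3)]
triples = sentence_triples("It is an efficient service", tagged, deps)
for triple in triples:
    print(triple)
```

Because the id is derived from the text and each term carries its position, the same sentence always yields the same URIs, which is one way to meet the "unique references" requirement.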

Scenario
• Input (3,140 user comments on eGovernment services) → Processing → Output (174,862 triples)

Processing Dependencies through
SPARQL – Example 1
• What adjectives did users use to describe their
experience, ordered from most to least frequent?
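
The SPARQL query from this slide is not part of the extracted text. As a stand-in, the same question can be sketched in plain Python over (subject, predicate, object) tuples shaped like the model above. The toy triples, the term ids and the function name adjective_frequencies are hypothetical, and lower-casing labels before counting is an added assumption.

```python
from collections import Counter

# Hypothetical toy triples: (subject, predicate, object).
triples = [
    ("t1", "ISA", "STD:JJ"), ("t1", "RDFS:label", "efficient"),
    ("t2", "ISA", "STD:NN"), ("t2", "RDFS:label", "service"),
    ("t3", "ISA", "STD:JJ"), ("t3", "RDFS:label", "easy"),
    ("t4", "ISA", "STD:JJ"), ("t4", "RDFS:label", "Easy"),
]


def adjective_frequencies(triples):
    """Count labels of terms typed STD:JJ, most frequent first."""
    adjectives = {s for s, p, o in triples if p == "ISA" and o == "STD:JJ"}
    labels = (o.lower() for s, p, o in triples
              if p == "RDFS:label" and s in adjectives)
    return Counter(labels).most_common()


print(adjective_frequencies(triples))
```

In SPARQL this would correspond to selecting labels of terms typed as adjectives, grouping by label and ordering by count in descending order.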

Processing Dependencies through
SPARQL – Example 2
• What were the “things” that users found “easy”?
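
The query shown on this slide is likewise missing from the extracted text. The sketch below is one hedged way to express the question in Python. It assumes two dependency patterns: an adjectival modifier ("easy process", noun governs adjective) and a nominal subject of a predicate adjective ("the website is easy", adjective governs noun). The predicate names are taken from the slides; the data, the term ids and the function name things_described_as are hypothetical.

```python
# Hypothetical toy triples: (subject, predicate, object), governor -> dependent.
triples = [
    ("a1", "RDFS:label", "easy"), ("n1", "RDFS:label", "process"),
    ("n1", "STD:adjectivalModifier", "a1"),      # "easy process"
    ("a2", "RDFS:label", "easy"), ("n2", "RDFS:label", "website"),
    ("a2", "STD:nsubj", "n2"),                   # "website is easy"
    ("a3", "RDFS:label", "slow"), ("n3", "RDFS:label", "payment"),
    ("a3", "STD:nsubj", "n3"),                   # "payment is slow"
]


def things_described_as(triples, adjective):
    """Return labels of terms linked to `adjective` by either pattern."""
    labels = {s: o for s, p, o in triples if p == "RDFS:label"}
    found = []
    for s, p, o in triples:
        if p == "STD:adjectivalModifier" and labels.get(o, "").lower() == adjective:
            found.append(labels.get(s, s))
        elif p == "STD:nsubj" and labels.get(s, "").lower() == adjective:
            found.append(labels.get(o, o))
    return found


print(things_described_as(triples, "easy"))
```

Keeping both patterns in one function reflects the point of the slides: once dependencies are in a graph, a linguistic pattern becomes a reusable query.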

Processing Dependencies through
SPARQL – Example 3
• How is the term “Service” described by users in the
comments?
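
Again, the slide's own query is not in the extracted text. A minimal Python sketch of the question, starting from the noun rather than the adjective, is given below. It assumes that descriptions are adjectives attached either as adjectival modifiers or as predicates whose nominal subject is the noun. The data, the term ids and the function name descriptors_of are hypothetical.

```python
from collections import Counter

# Hypothetical toy triples: (subject, predicate, object), governor -> dependent.
triples = [
    ("n1", "RDFS:label", "Service"), ("a1", "RDFS:label", "efficient"),
    ("a1", "ISA", "STD:JJ"), ("n1", "STD:adjectivalModifier", "a1"),
    ("n2", "RDFS:label", "service"), ("a2", "RDFS:label", "slow"),
    ("a2", "ISA", "STD:JJ"), ("a2", "STD:nsubj", "n2"),
    ("n3", "RDFS:label", "service"), ("v1", "RDFS:label", "crashed"),
    ("v1", "ISA", "STD:VBD"), ("v1", "STD:nsubj", "n3"),
]


def descriptors_of(triples, noun):
    """Count adjectives that modify `noun` or take it as their subject."""
    labels = {s: o.lower() for s, p, o in triples if p == "RDFS:label"}
    adjectives = {s for s, p, o in triples if p == "ISA" and o == "STD:JJ"}
    counts = Counter()
    for s, p, o in triples:
        if p == "STD:adjectivalModifier" and labels.get(s) == noun and o in adjectives:
            counts[labels.get(o, o)] += 1
        elif p == "STD:nsubj" and labels.get(o) == noun and s in adjectives:
            counts[labels.get(s, s)] += 1
    return counts


print(descriptors_of(triples, "service"))
```

The verb "crashed" is skipped because its term is not typed STD:JJ; dropping that filter would widen the answer to anything predicated of the noun.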

So What?
• This graph-based manipulation of dependencies offers
potential benefits such as:
  • Aggregating and transforming distributed pieces of text into a
  coherent, query-enabled dependency layer
  • “Hardwiring” text dependency patterns at the query level and
  hooking them to further analytical tools and techniques (e.g.
  visualization)
  • Easily extending the text-based graph to capture further data
  entities, such as polarity dictionaries, and performing further
  analytics
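
To illustrate the last point, extending the graph with a polarity dictionary can be as simple as adding one more triple per matching term. The Python sketch below assumes a tiny made-up lexicon and a hypothetical predicate EX:polarity; neither comes from the slides.

```python
# Hypothetical polarity lexicon; a real one would come from a published
# sentiment dictionary.
POLARITY = {"efficient": 1, "easy": 1, "slow": -1}

# Hypothetical toy label triples: (subject, predicate, object).
triples = [
    ("t1", "RDFS:label", "efficient"),
    ("t2", "RDFS:label", "service"),
    ("t3", "RDFS:label", "Slow"),
    ("t4", "RDFS:label", "easy"),
]


def add_polarity(triples, lexicon):
    """Return the triples plus an EX:polarity triple for each known label."""
    extra = [(s, "EX:polarity", lexicon[o.lower()])
             for s, p, o in triples
             if p == "RDFS:label" and o.lower() in lexicon]
    return triples + extra


def net_polarity(triples):
    """Sum all polarity scores present in the graph."""
    return sum(o for s, p, o in triples if p == "EX:polarity")


enriched = add_polarity(triples, POLARITY)
print(len(enriched), net_polarity(enriched))
```

Since the new triples hang off existing term identifiers, they can be combined with the dependency patterns above, for example to score only the adjectives that describe a given noun.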

Future Directions
• At the level of the dependency RDF generator, the extractor
can be improved by providing filtering mechanisms that
can be controlled by the analyst
• We are building an online tool that would enable users to
upload a corpus and generate the corresponding
dependency RDF, to be downloaded or pushed to a
triplestore
• We are planning to focus next on exploiting this graph
representation to perform business analytics around
decision models (e.g. user satisfaction and performance
models)

Conclusions
• We presented our work on generating a linked
dependency layer on top of text documents
• We highlighted the preliminary value of this layer by
applying the linking process to 3,140 disparate user
comments
• We believe that this layer will open the path for improving
the consumption and reuse of text dependencies in the
context of text and business analytics

Thank you!
fouad.zablith@aub.edu.lb
http://fouad.zablith.org
@fzablith
