SlideShare a Scribd company logo
1 of 27
Download to read offline
Information Diffusion in Web Services Networks


Shahab Mokarizadeh , Royal Institute of Technology (KTH) , Sweden
               Peep Küngas, University of Tartu (UT) , Estonia
       Mihhail Matskin , Royal Institute of Technology (KTH) , Sweden
    Marco Crasso, Marcelo Campo, Alejandro Zunino , UNICEN University,
                                  Argentina

                        Contact: shahabm@kth.se
1
Outline

     Background of Information Flow Analysis
     Roadmap and Computational Model
       Web service Annotation
       Web service Categorization
     Experimental Results
     Discussion & Conclusion




2
Background – Information Diffusion
 Information Diffusion: the communication of knowledge over
  time among members of a social system
 It shows intrinsic properties of real-world phenomenon.
 Already studied in the context of: biosphere, microblogs,
publication citation, … where a network structure present.




3
Information Diffusion
         among Web service Domains
Observation: Services published in the Web form a conceptual
 ecology of knowledge where information is shared and flows
 along input and output parameters of service operations.

Case-study: How Web services in different commodities have
 been designed from information exchange perspective?
     Introducing value-add Web services
     Web service adoption spots




4
Roadmap

    1
        • Semantically annotation of Web services


    2
        • Assign Web services to respective categories


    3
        • Construct Web service network


    4
        • Compute information flow matrix

        • Matrix Analysis
    5

5
1-Web service Annotation
-Only semantic annotations of basic elements of input and output
parameters of Web service Operations
-SAWSDL annotation model




-We exploit our Semi-automated ontology learning method which
relies on lexico-syntactic patterns
    “Ontology Learning for Cost-Effective Large-Scale Semantic Annotation
    of Web Service Interfaces”. EKAW 2010:pp. 401-410
                                                       Image from : Web Services and
6                                                      Security,1/17/2006 ,Marco Cova
Tax and Customs Board service




        Output message content fragment
7
Business Registry service




       Input message content fragment
8
A Business Registry service




9     Output message content fragment
Registry of Economic Activities Service




           Output message content fragment
10
2-Web service Categorization
A category (a.k.a. commodity) describes a general kind of a service
that is provided, for example “B2B” , “Health”, “E-Commerce”, etc.
Each Web service could belong to multiple categories !
Standard Software Taxonomy e.g. UNSPSC: http://www.unspsc.org/
We use Classifier : "AWSC: An approach to Web Service classification
based on machine learning techniques“, Inteligencia Artificial, ISSN 1137-3601, vol.
12, no. 37, pp. 25-36, Asociación Española para la Inteligencia Artificial, Valencia, España.
2008.
                                             UNSPSC
             Instant messaging                    Calendar and scheduling
             Adventure games                      Mobile operator specific
             Internet directory services          Medical software
11           Music or sound editing               Video conferencing software
3-Web service Network Construction

1- Present annotated Web services as bipartite (2-mode) graph
2- Create Semantic Network (1-mode graph)
3- Create Weighted Category Network using Semantic network




12
Bipartite Web Service Network




13
Bipartite Web Service Network
              (categorized)




14
Network Transformation
       Semantic Network                              Category Network




Propagate the categories to semantic Ds, Dt : category nodes
nodes , Cu: semantic node ,          Label each category edge with weights:
qk: weight of node in category k
                    Q u  q1 ,..qk ., qn    u ,v ( Ds , Dt )  qu , s .qv,t
                  frequency of Cu in Ds
 15
      qs     n

              frequency of      Cu in Di
                                              W ( Ds , Dt )        
                                                                edge ( u ,v )
                                                                              u ,v   ( D s , Dt )
             i 1
4-Normalizing Weights (Z-score)
 Edge category weight W(Di,Dj) :                      Wi,j

 Sum of all weights of all links from category i:   W i *   W ( Di , D j )
                                                                  j


 Sum of all weights of all links to category j:     W* j   W ( Di , D j )
                                                                      i

 Sum of weights of all categories:                  W   W ( Di , D j )
                                                           i, j


 Expected weights from category i to category j :
                                                       Wi*  W* j
                                                              W
 Normalize category weights (Z-Score):

                                   Wi*  W* j        Wi*  W* j
              i , j  (Wi , j                 )
                                      W                    W
16
Matrix of Information flow
Matrix of information flow between pair of categories:
                      1,1      1, j     1,n 
                                         
                                                  
                    i ,1     i , j    i ,n 
                                                  
                                         
                       n ,1    n, j      n,n 
                                                  

A high proximity (Φ i j) between categories i and j reveals a strong
tendency for semantic concepts associated to category j to be resulted
from invocation of services which take semantic concepts associated to
category i.

17
5-Experimental Settings
 27000 public Web services (WSDLs) (collected 2005-2011)


 Semantic Annotation
   Lexico-syntactic based ontology learning
   Annotation accuracy: Precision= 31% , Recall= 19%


 Categorization
   AWSC Classifier
   Training dataset: 1500 WSDLs
   Categorization Accuracy: 91%


                                                            18
Excerpt of Identified Service Categories
          Category                            Category
1-Communications server         11-Network operation system
2-Instant messaging             12-Database management system
3-Adventure games               13-Analytical or scientific
4-Internet directory services   14-Portal server
5-Music or sound editing        15-Foreign language software
6-Calendar and scheduling       16-Procurement software
7-Mobile operator specific      17-Inventory management software
8-Medical software              18-Dictionary software
9-Video conferencing            19-Fax software
10-Map creation software        20-Object oriented database management
                                                                         19
Visualization of Matrix of Information Flow




20
Information Exchange Patterns - 1:
 Self-Referential Pattern:          A category mainly provides inputs
     for its own services and consumes mostly the information
     provided by itself (i.e. self contained).
      Appear in diagonal of matrix

      Categories: Financial Analysis Software, Web Platform Development
       Software, Map Creation Software, Video Conferencing Software and
       Accounting Software

      The API-s exposed by these Web services exploit frequently
       domain-specific concepts as input and output elements


21
Information Exchange Patterns - 2:
 Outside main diagonal:
-Foreign Language category , Presentation category
-Financial Analysis category , Enterprise Resource Planning category


 Least volume of information flow:
  -Video Conferencing software and Financial Analysis software




22
Threats to Validity
 The presented model heavily relies of accuracy of underlying
     semantic annotation and matching scheme !

 The examined Web services account only for small proportion
     of existing ones on the Web!

 The collection of Web services’ interface descriptions may also
     suffer from unintentional preference toward some specific
     categories.

 In the absence of timing factor our analysis is rather static
     analysis of information flow
23
Conclusion and Future Work
 The presented approach can discover information exchange
  patterns.
 In general our approach is applicable to any other kind of machine
  understandable APIs, not just WSDLs, !

Future work:
To examine how presence of service composition or mashups
 influences the information exchange pattern
Recommending value-add Web services based on identified
 information exchange patterns and Web service network
 properties

24
Thanks!




               Questions Please!



25
Partial Category Weight for Edge (Ds,Dt) :    u ,v ( Ds , Dt )  qu , s .qv,t

Augmented Category Weight for Edge (Ds, Dt): W ( Ds , Dt )        
                                                               edge ( u ,v )
                                                                             u ,v   ( D s , Dt )

26
Ontology Learning for                                           Information Elicitation

 Web service Annotation1                                             Term Extraction


                                                                   Syntactic Refinement



                                                                   Ontology Discovery
Ontology Learning Input:                                              Pattern-based
          - Message Part names of input/output                      Semantic Analysis
         parameters                                               Term Disambiguation
          - XML Schema leaf element names of
         complex types                                              Class and Relation
                                                                      Determination


                                                                   Ontology Organization

                                                                      Adding Relations
[1] ”Ontology Learning for Cost-Effective Large-scale Semantic
Annotation of XML Schemas and Web Service Interfaces". in Porc.
EKAW 2010, LNAI 6317,pp.401-410, 2010                                             Reference
27                                                                                Ontology

More Related Content

Viewers also liked

Startup Culture: Value Creation in the Academic Library
Startup Culture: Value Creation in the Academic LibraryStartup Culture: Value Creation in the Academic Library
Startup Culture: Value Creation in the Academic LibraryKevin Rundblad
 
Help Wanted - Orphan Advocates
Help Wanted - Orphan AdvocatesHelp Wanted - Orphan Advocates
Help Wanted - Orphan AdvocatesAndy Lehman
 
Domena Publiczna a misja GLAM
Domena Publiczna a misja GLAMDomena Publiczna a misja GLAM
Domena Publiczna a misja GLAMKOED
 
Ccr Short 08 (2)
Ccr Short 08 (2)Ccr Short 08 (2)
Ccr Short 08 (2)drewgrdina
 
MoPH CHECKS 14 THAIS RETURNING FROM MEXICO
MoPH CHECKS 14 THAIS RETURNING FROM MEXICOMoPH CHECKS 14 THAIS RETURNING FROM MEXICO
MoPH CHECKS 14 THAIS RETURNING FROM MEXICONSTDA THAILAND
 
MAN OF MANY ROLES SEES ENDLESS WORK AHEAD
MAN OF MANY ROLES SEES ENDLESS WORK AHEADMAN OF MANY ROLES SEES ENDLESS WORK AHEAD
MAN OF MANY ROLES SEES ENDLESS WORK AHEADNSTDA THAILAND
 
Probleemist Lahenduseni 120509
Probleemist Lahenduseni 120509Probleemist Lahenduseni 120509
Probleemist Lahenduseni 120509guestc3a1679
 
רמי לוי רוצה להיות גדול
רמי לוי רוצה להיות גדולרמי לוי רוצה להיות גדול
רמי לוי רוצה להיות גדולyossi koren
 
A kígyó bennünk lakik?
A kígyó bennünk lakik?A kígyó bennünk lakik?
A kígyó bennünk lakik?guest92337e
 

Viewers also liked (20)

Startup Culture: Value Creation in the Academic Library
Startup Culture: Value Creation in the Academic LibraryStartup Culture: Value Creation in the Academic Library
Startup Culture: Value Creation in the Academic Library
 
Help Wanted - Orphan Advocates
Help Wanted - Orphan AdvocatesHelp Wanted - Orphan Advocates
Help Wanted - Orphan Advocates
 
Domena Publiczna a misja GLAM
Domena Publiczna a misja GLAMDomena Publiczna a misja GLAM
Domena Publiczna a misja GLAM
 
My favorite philosophy
My favorite philosophyMy favorite philosophy
My favorite philosophy
 
Tatorii
TatoriiTatorii
Tatorii
 
Promi karikaturen
Promi karikaturenPromi karikaturen
Promi karikaturen
 
Ccr Short 08 (2)
Ccr Short 08 (2)Ccr Short 08 (2)
Ccr Short 08 (2)
 
Directmail 101118075946-phpapp02
Directmail 101118075946-phpapp02Directmail 101118075946-phpapp02
Directmail 101118075946-phpapp02
 
21 shevchenko-optimizatoin2010 доп услуга
21 shevchenko-optimizatoin2010 доп услуга21 shevchenko-optimizatoin2010 доп услуга
21 shevchenko-optimizatoin2010 доп услуга
 
MoPH CHECKS 14 THAIS RETURNING FROM MEXICO
MoPH CHECKS 14 THAIS RETURNING FROM MEXICOMoPH CHECKS 14 THAIS RETURNING FROM MEXICO
MoPH CHECKS 14 THAIS RETURNING FROM MEXICO
 
Astonishing beauty
Astonishing beautyAstonishing beauty
Astonishing beauty
 
43 kozlov-optimization2010 видеобаннер бегуна
43 kozlov-optimization2010 видеобаннер бегуна43 kozlov-optimization2010 видеобаннер бегуна
43 kozlov-optimization2010 видеобаннер бегуна
 
MAN OF MANY ROLES SEES ENDLESS WORK AHEAD
MAN OF MANY ROLES SEES ENDLESS WORK AHEADMAN OF MANY ROLES SEES ENDLESS WORK AHEAD
MAN OF MANY ROLES SEES ENDLESS WORK AHEAD
 
! 35 borisov-etarget2011
! 35 borisov-etarget2011! 35 borisov-etarget2011
! 35 borisov-etarget2011
 
420
420420
420
 
Probleemist Lahenduseni 120509
Probleemist Lahenduseni 120509Probleemist Lahenduseni 120509
Probleemist Lahenduseni 120509
 
Navidad 2010
Navidad 2010Navidad 2010
Navidad 2010
 
22apr s51-a-sergeev-110426092204-phpapp01
22apr s51-a-sergeev-110426092204-phpapp0122apr s51-a-sergeev-110426092204-phpapp01
22apr s51-a-sergeev-110426092204-phpapp01
 
רמי לוי רוצה להיות גדול
רמי לוי רוצה להיות גדולרמי לוי רוצה להיות גדול
רמי לוי רוצה להיות גדול
 
A kígyó bennünk lakik?
A kígyó bennünk lakik?A kígyó bennünk lakik?
A kígyó bennünk lakik?
 

Similar to Using semantic annotation of web services for analyzing

Recovering a Business Object Model from Web Applications
Recovering a Business Object Model from Web ApplicationsRecovering a Business Object Model from Web Applications
Recovering a Business Object Model from Web ApplicationsPorfirio Tramontana
 
Ask Me Any Rating: A Content-based Recommender System based on Recurrent Neur...
Ask Me Any Rating: A Content-based Recommender System based on Recurrent Neur...Ask Me Any Rating: A Content-based Recommender System based on Recurrent Neur...
Ask Me Any Rating: A Content-based Recommender System based on Recurrent Neur...Claudio Greco
 
Ask Me Any Rating: A Content-based Recommender System based on Recurrent Neur...
Ask Me Any Rating: A Content-based Recommender System based on Recurrent Neur...Ask Me Any Rating: A Content-based Recommender System based on Recurrent Neur...
Ask Me Any Rating: A Content-based Recommender System based on Recurrent Neur...Alessandro Suglia
 
Linked Open Data to support content based Recommender Systems
Linked Open Data to support content based Recommender SystemsLinked Open Data to support content based Recommender Systems
Linked Open Data to support content based Recommender SystemsVito Ostuni
 
Linked Open Data to Support Content-based Recommender Systems - I-SEMANTIC…
Linked Open Data to Support Content-based Recommender Systems - I-SEMANTIC…Linked Open Data to Support Content-based Recommender Systems - I-SEMANTIC…
Linked Open Data to Support Content-based Recommender Systems - I-SEMANTIC…Roku
 
Gephi short introduction
Gephi short introductionGephi short introduction
Gephi short introductionSébastien
 
Bridging Socially-Enhanced Virtual Communities
Bridging Socially-Enhanced Virtual CommunitiesBridging Socially-Enhanced Virtual Communities
Bridging Socially-Enhanced Virtual CommunitiesDaniel Schall
 
Filtering Inaccurate Entity Co-references on the Linked Open Data
Filtering Inaccurate Entity Co-references on the Linked Open DataFiltering Inaccurate Entity Co-references on the Linked Open Data
Filtering Inaccurate Entity Co-references on the Linked Open Dataebrahim_bagheri
 
Transferring Semantic Categories with Vertex Kernels: Recommendations with Se...
Transferring Semantic Categories with Vertex Kernels: Recommendations with Se...Transferring Semantic Categories with Vertex Kernels: Recommendations with Se...
Transferring Semantic Categories with Vertex Kernels: Recommendations with Se...Matthew Rowe
 
Context-aware Mobile Recommendation Services for Conference Participants
Context-aware Mobile Recommendation Services for Conference ParticipantsContext-aware Mobile Recommendation Services for Conference Participants
Context-aware Mobile Recommendation Services for Conference ParticipantsRalf Klamma
 
Knowledge Graphs and Milestone
Knowledge Graphs and MilestoneKnowledge Graphs and Milestone
Knowledge Graphs and MilestoneBarry Norton
 
Redes: de entes físicos a procesos software en entornos virtuales
Redes: de entes físicos a procesos software en entornos virtualesRedes: de entes físicos a procesos software en entornos virtuales
Redes: de entes físicos a procesos software en entornos virtualesFacultad de Informática UCM
 
Bayesian Network 을 활용한 예측 분석
Bayesian Network 을 활용한 예측 분석Bayesian Network 을 활용한 예측 분석
Bayesian Network 을 활용한 예측 분석datasciencekorea
 
Digital Twins, Virtual Devices, and Augmentations for Self-Organising Cyber-P...
Digital Twins, Virtual Devices, and Augmentations for Self-Organising Cyber-P...Digital Twins, Virtual Devices, and Augmentations for Self-Organising Cyber-P...
Digital Twins, Virtual Devices, and Augmentations for Self-Organising Cyber-P...Roberto Casadei
 
Network visualization: Fine-tuning layout techniques for different types of n...
Network visualization: Fine-tuning layout techniques for different types of n...Network visualization: Fine-tuning layout techniques for different types of n...
Network visualization: Fine-tuning layout techniques for different types of n...Nees Jan van Eck
 
Graph Theoretic Model for Community Wireless Networks
Graph Theoretic Model for Community Wireless NetworksGraph Theoretic Model for Community Wireless Networks
Graph Theoretic Model for Community Wireless NetworksABDELAAL
 
Reference Knowledge Models for Smart Application
Reference Knowledge Models for Smart ApplicationReference Knowledge Models for Smart Application
Reference Knowledge Models for Smart ApplicationMaxime Lefrançois
 
A preliminary approach to knowledge integrity risk assessment in Wikipedia p...
A preliminary approach to knowledge integrity  risk assessment in Wikipedia p...A preliminary approach to knowledge integrity  risk assessment in Wikipedia p...
A preliminary approach to knowledge integrity risk assessment in Wikipedia p...Pablo Aragón
 
SP1: Exploratory Network Analysis with Gephi
SP1: Exploratory Network Analysis with GephiSP1: Exploratory Network Analysis with Gephi
SP1: Exploratory Network Analysis with GephiJohn Breslin
 

Similar to Using semantic annotation of web services for analyzing (20)

Recovering a Business Object Model from Web Applications
Recovering a Business Object Model from Web ApplicationsRecovering a Business Object Model from Web Applications
Recovering a Business Object Model from Web Applications
 
Ask Me Any Rating: A Content-based Recommender System based on Recurrent Neur...
Ask Me Any Rating: A Content-based Recommender System based on Recurrent Neur...Ask Me Any Rating: A Content-based Recommender System based on Recurrent Neur...
Ask Me Any Rating: A Content-based Recommender System based on Recurrent Neur...
 
Ask Me Any Rating: A Content-based Recommender System based on Recurrent Neur...
Ask Me Any Rating: A Content-based Recommender System based on Recurrent Neur...Ask Me Any Rating: A Content-based Recommender System based on Recurrent Neur...
Ask Me Any Rating: A Content-based Recommender System based on Recurrent Neur...
 
Linked Open Data to support content based Recommender Systems
Linked Open Data to support content based Recommender SystemsLinked Open Data to support content based Recommender Systems
Linked Open Data to support content based Recommender Systems
 
Linked Open Data to Support Content-based Recommender Systems - I-SEMANTIC…
Linked Open Data to Support Content-based Recommender Systems - I-SEMANTIC…Linked Open Data to Support Content-based Recommender Systems - I-SEMANTIC…
Linked Open Data to Support Content-based Recommender Systems - I-SEMANTIC…
 
Gephi short introduction
Gephi short introductionGephi short introduction
Gephi short introduction
 
Bridging Socially-Enhanced Virtual Communities
Bridging Socially-Enhanced Virtual CommunitiesBridging Socially-Enhanced Virtual Communities
Bridging Socially-Enhanced Virtual Communities
 
Filtering Inaccurate Entity Co-references on the Linked Open Data
Filtering Inaccurate Entity Co-references on the Linked Open DataFiltering Inaccurate Entity Co-references on the Linked Open Data
Filtering Inaccurate Entity Co-references on the Linked Open Data
 
Transferring Semantic Categories with Vertex Kernels: Recommendations with Se...
Transferring Semantic Categories with Vertex Kernels: Recommendations with Se...Transferring Semantic Categories with Vertex Kernels: Recommendations with Se...
Transferring Semantic Categories with Vertex Kernels: Recommendations with Se...
 
Context-aware Mobile Recommendation Services for Conference Participants
Context-aware Mobile Recommendation Services for Conference ParticipantsContext-aware Mobile Recommendation Services for Conference Participants
Context-aware Mobile Recommendation Services for Conference Participants
 
Knowledge Graphs and Milestone
Knowledge Graphs and MilestoneKnowledge Graphs and Milestone
Knowledge Graphs and Milestone
 
Redes: de entes físicos a procesos software en entornos virtuales
Redes: de entes físicos a procesos software en entornos virtualesRedes: de entes físicos a procesos software en entornos virtuales
Redes: de entes físicos a procesos software en entornos virtuales
 
Bayesian Network 을 활용한 예측 분석
Bayesian Network 을 활용한 예측 분석Bayesian Network 을 활용한 예측 분석
Bayesian Network 을 활용한 예측 분석
 
Digital Twins, Virtual Devices, and Augmentations for Self-Organising Cyber-P...
Digital Twins, Virtual Devices, and Augmentations for Self-Organising Cyber-P...Digital Twins, Virtual Devices, and Augmentations for Self-Organising Cyber-P...
Digital Twins, Virtual Devices, and Augmentations for Self-Organising Cyber-P...
 
Network visualization: Fine-tuning layout techniques for different types of n...
Network visualization: Fine-tuning layout techniques for different types of n...Network visualization: Fine-tuning layout techniques for different types of n...
Network visualization: Fine-tuning layout techniques for different types of n...
 
Graph Theoretic Model for Community Wireless Networks
Graph Theoretic Model for Community Wireless NetworksGraph Theoretic Model for Community Wireless Networks
Graph Theoretic Model for Community Wireless Networks
 
OSI 7 Layer Model
OSI 7 Layer ModelOSI 7 Layer Model
OSI 7 Layer Model
 
Reference Knowledge Models for Smart Application
Reference Knowledge Models for Smart ApplicationReference Knowledge Models for Smart Application
Reference Knowledge Models for Smart Application
 
A preliminary approach to knowledge integrity risk assessment in Wikipedia p...
A preliminary approach to knowledge integrity  risk assessment in Wikipedia p...A preliminary approach to knowledge integrity  risk assessment in Wikipedia p...
A preliminary approach to knowledge integrity risk assessment in Wikipedia p...
 
SP1: Exploratory Network Analysis with Gephi
SP1: Exploratory Network Analysis with GephiSP1: Exploratory Network Analysis with Gephi
SP1: Exploratory Network Analysis with Gephi
 

Using semantic annotation of web services for analyzing

  • 1. Information Diffusion in Web Services Networks Shahab Mokarizadeh , Royal Institute of Technology (KTH) , Sweden Peep Küngas, University of Tartu (UT) , Estonia Mihhail Matskin , Royal Institute of Technology (KTH) , Sweden Marco Crasso, Marcelo Campo, Alejandro Zunino , UNICEN University, Argentina Contact: shahabm@kth.se 1
  • 2. Outline  Background of Information Flow Analysis  Roadmap and Computational Model  Web service Annotation  Web service Categorization  Experimental Results  Discussion & Conclusion 2
  • 3. Background – Information Diffusion  Information Diffusion: the communication of knowledge over time among members of a social system  It shows intrinsic properties of real-world phenomenon.  Already studied in the context of: biosphere, microblogs, publication citation, … where a network structure present. 3
  • 4. Information Diffusion among Web service Domains Observation: Services published in the Web form a conceptual ecology of knowledge where information is shared and flows along input and output parameters of service operations. Case-study: How Web services in different commodities have been designed from information exchange perspective?  Introducing value-add Web services  Web service adoption spots 4
  • 5. Roadmap 1 • Semantically annotation of Web services 2 • Assign Web services to respective categories 3 • Construct Web service network 4 • Compute information flow matrix • Matrix Analysis 5 5
  • 6. 1-Web service Annotation -Only semantic annotations of basic elements of input and output parameters of Web service Operations -SAWSDL annotation model -We exploit our Semi-automated ontology learning method which relies on lexico-syntactic patterns “Ontology Learning for Cost-Effective Large-Scale Semantic Annotation of Web Service Interfaces”. EKAW 2010:pp. 401-410 Image from : Web Services and 6 Security,1/17/2006 ,Marco Cova
  • 7. Tax and Customs Board service Output message content fragment 7
  • 8. Business Registry service Input message content fragment 8
  • 9. A Business Registry service 9 Output message content fragment
  • 10. Registry of Economic Activities Service Output message content fragment 10
  • 11. 2-Web service Categorization A category (a.k.a. commodity) describes a general kind of a service that is provided, for example “B2B” , “Health”, “E-Commerce”, etc. Each Web service could belong to multiple categories ! Standard Software Taxonomy e.g. UNSPSC: http://www.unspsc.org/ We use Classifier : "AWSC: An approach to Web Service classification based on machine learning techniques“, Inteligencia Artificial, ISSN 1137-3601, vol. 12, no. 37, pp. 25-36, Asociación Española para la Inteligencia Artificial, Valencia, España. 2008. UNSPSC Instant messaging Calendar and scheduling Adventure games Mobile operator specific Internet directory services Medical software 11 Music or sound editing Video conferencing software
  • 12. 3-Web service Network Construction 1- Present annotated Web services as bipartite (2-mode) graph 2- Create Semantic Network (1-mode graph) 3- Create Weighted Category Network using Semantic network 12
  • 13. Bipartite Web Service Network 13
  • 14. Bipartite Web Service Network (categorized) 14
  • 15. Network Transformation Semantic Network Category Network Propagate the categories to semantic Ds, Dt : category nodes nodes , Cu: semantic node , Label each category edge with weights: qk: weight of node in category k Q u  q1 ,..qk ., qn  u ,v ( Ds , Dt )  qu , s .qv,t frequency of Cu in Ds 15 qs  n  frequency of Cu in Di W ( Ds , Dt )   edge ( u ,v ) u ,v ( D s , Dt ) i 1
  • 16. 4-Normalizing Weights (Z-score)  Edge category weight W(Di,Dj) : Wi,j  Sum of all weights of all links from category i: W i *   W ( Di , D j ) j  Sum of all weights of all links to category j: W* j   W ( Di , D j ) i  Sum of weights of all categories: W   W ( Di , D j ) i, j  Expected weights from category i to category j : Wi*  W* j W  Normalize category weights (Z-Score): Wi*  W* j Wi*  W* j i , j  (Wi , j  ) W W 16
  • 17. Matrix of Information flow Matrix of information flow between pair of categories: 1,1  1, j  1,n             i ,1  i , j  i ,n            n ,1  n, j   n,n    A high proximity (Φ i j) between categories i and j reveals a strong tendency for semantic concepts associated to category j to be resulted from invocation of services which take semantic concepts associated to category i. 17
  • 18. 5-Experimental Settings  27000 public Web services (WSDLs) (collected 2005-2011)  Semantic Annotation  Lexico-syntactic based ontology learning  Annotation accuracy: Precision= 31% , Recall= 19%  Categorization  AWSC Classifier  Training dataset: 1500 WSDLs  Categorization Accuracy: 91% 18
  • 19. Excerpt of Identified Service Categories Category Category 1-Communications server 11-Network operation system 2-Instant messaging 12-Database management system 3-Adventure games 13-Analytical or scientific 4-Internet directory services 14-Portal server 5-Music or sound editing 15-Foreign language software 6-Calendar and scheduling 16-Procurement software 7-Mobile operator specific 17-Inventory management software 8-Medical software 18-Dictionary software 9-Video conferencing 19-Fax software 10-Map creation software 20-Object oriented database management 19
  • 20. Visualization of Matrix of Information Flow 20
  • 21. Information Exchange Patterns - 1:  Self-Referential Pattern: A category mainly provides inputs for its own services and consumes mostly the information provided by itself (i.e. self contained).  Appear in diagonal of matrix  Categories: Financial Analysis Software, Web Platform Development Software, Map Creation Software, Video Conferencing Software and Accounting Software  The API-s exposed by these Web services exploit frequently domain-specific concepts as input and output elements 21
  • 22. Information Exchange Patterns - 2:  Outside main diagonal: -Foreign Language category , Presentation category -Financial Analysis category , Enterprise Resource Planning category  Least volume of information flow: -Video Conferencing software and Financial Analysis software 22
  • 23. Threats to Validity  The presented model heavily relies of accuracy of underlying semantic annotation and matching scheme !  The examined Web services account only for small proportion of existing ones on the Web!  The collection of Web services’ interface descriptions may also suffer from unintentional preference toward some specific categories.  In the absence of timing factor our analysis is rather static analysis of information flow 23
  • 24. Conclusion and Future Work  The presented approach can discover information exchange patterns.  In general our approach is applicable to any other kind of machine understandable APIs, not just WSDLs, ! Future work: To examine how presence of service composition or mashups influences the information exchange pattern Recommending value-add Web services based on identified information exchange patterns and Web service network properties 24
  • 25. Thanks! Questions Please! 25
  • 26. Partial Category Weight for Edge (Ds,Dt) : u ,v ( Ds , Dt )  qu , s .qv,t Augmented Category Weight for Edge (Ds, Dt): W ( Ds , Dt )   edge ( u ,v ) u ,v ( D s , Dt ) 26
  • 27. Ontology Learning for Information Elicitation Web service Annotation1 Term Extraction Syntactic Refinement Ontology Discovery Ontology Learning Input: Pattern-based - Message Part names of input/output Semantic Analysis parameters Term Disambiguation - XML Schema leaf element names of complex types Class and Relation Determination Ontology Organization Adding Relations [1] ”Ontology Learning for Cost-Effective Large-scale Semantic Annotation of XML Schemas and Web Service Interfaces". in Porc. EKAW 2010, LNAI 6317,pp.401-410, 2010 Reference 27 Ontology