SlideShare a Scribd company logo
1 of 20
Download to read offline
Inconsistencies	
  of	
  Connec,on	
  for	
  
Heterogeneity	
  and	
  
a	
  New	
  Rela,on	
  Discovery	
  Method	
  that	
  
Solved	
  them	
Takafumi	
  NAKANISHI	
  ,	
  Kiyotaka	
  UCHIMOTO,	
  Yutaka	
  KIDAWARA	
  
	
Na,onal	
  Ins,tute	
  of	
  Informa,on	
  and	
  Communica,on	
  
Technology	
  (NICT),	
  Japan	
  
What’s	
  Big	
  Data?	
•  Speed	
  up?	
  Processing	
  a	
  lot	
  of	
  data?	
  
–  What	
  differences	
  are	
  there	
  between	
  VLDB	
  and	
  Big	
  
Data.	
  (Very	
  Large	
  Database)?	
  
•  Fragmental	
  data	
  exist	
  
–  Un,l	
  now,	
  scien,sts	
  work	
  such	
  data	
  for	
  simula,on.	
  
•  Heterogeneous	
  Database	
  Integra,on(Cross	
  
database	
  search)	
  
–  S,ll	
  Considering?
Purposes	
  of	
  this	
  presenta,on	
•  We	
  should	
  consider	
  the	
  paradigm	
  shiV	
  in	
  
computer	
  science.	
  
– From	
  the	
  closed	
  assump,on	
  to	
  the	
  opened	
  
assump,on	
  
– What	
  are	
  there	
  any	
  problems?	
  
•  Businesspeople	
  require	
  not	
  only	
  EDW	
  (Enterprise	
  Data	
  
Warehouse)	
  but	
  also	
  the	
  other	
  analysis	
  methods.	
  	
  
•  Discovering	
  rela,on	
  between	
  heterogeneous	
  concept,	
  
dataset,	
  etc.	
  
•  Three	
  Opened	
  Assump,on’s	
  Evils	
  
True	
  Problem	
  Defini,ons	
  of	
  Big	
  Data	
Rela,on	
  Discovery	
  in	
  
Heterogeneity	
  	
Big	
  
data	
Speeding	
  Up,	
  
Promo,on	
  of	
  
Streamlining,	
  and	
  
Increasing	
  Data	
  
Volume	
  
	
  for	
  Processing	
Schemaless	
  
Data	
  and	
  New	
  
Data	
  Processing	
  
Method	
Distributed	
  Parallel	
  
Processing,	
  High	
  
Performance	
  Compu,ng	
  
(HPC),	
  Network	
  Delay,	
  
etc.	
  	
Construc)on	
  of	
  Big	
  data	
  
environment	
  (Hardware,	
  
middleware	
  researches)	
Big	
  data	
  analy)cs	
  
(So=ware	
  researches)	
Closed	
  Assump,on	
  System	
  
à	
  Open	
  Assump,on	
  System
AI	
  Community	
 DB	
  Community	
a1	
a2	
b10	
b8	
a9	
a8	
a7	
a6	
a5	
a4	
a3	
b9	
b6	
b7	
b4	
b5	
b2	
b3	
b1	
Someone	
  adds	
  rela,onships	
  between	
  a3	
  and	
  b4	
Rela,onships	
  among	
  persons	
  in	
  communi,es	
  AI	
  and	
  DB.	
  ai,	
  bj	
  are	
  researchers.	
  When	
  someone	
  
adds	
  symmetric	
  and	
  transi,ve	
  rela,onships	
  between	
  a3	
  and	
  b4,	
  it	
  is	
  true	
  that	
  a1	
  is	
  related	
  to	
  
b5	
  because	
  a1	
  is	
  related	
  to	
  a3,	
  a3	
  is	
  related	
  to	
  b4,	
  and	
  b4	
  is	
  related	
  to	
  b5.
Office	
  Community	
Music	
  Community	
a1	
a2	
b10	
b8	
a9	
a8	
a7	
a6	
a5	
a4	
a3	
b9	
b6	
b7	
b4	
b5	
b2	
b3	
b1	
Someone	
  adds	
  
rela,onships	
  
between	
  a3	
  and	
  b4	
Rela,onships	
  among	
  persons	
  in	
  workplace	
  and	
  music	
  communi,es.	
  
ai	
  are	
  co-­‐workers,	
  and	
  bj	
  are	
  musicians.	
  When	
  someone	
  adds	
  symmetric	
  and	
  transi,ve	
  
rela,onships	
  between	
  a3	
  and	
  b4,	
  it	
  is	
  actually	
  not	
  true	
  that a1	
  is	
  related	
  to	
  b5.	
  In	
  graph	
  
structure,	
  it	
  is	
  true	
  that	
  a1	
  is	
  related	
  to	
  b5. However,	
  realis,cally,	
  a1	
  and	
  b5	
  do	
  not	
  share	
  
ground	
  without	
  other	
  defini,ons	
  or	
  analysis.
Difference	
  of	
  two	
  examples	
•  “AI Community” ∩	
  “DB Community” ≠ ∅.
à Closed Assumption
– Representation of relations in the previous methods
such as owl, RDF, etc.
•  “Office	
  Community” ∩	
  “Music	
  Community”	
  =	
  ∅.
àOpened Assumption
– unable representation of relations in the previous
method
Proof	
  of	
  inconsistency	
  of	
  order	
  rela,on	
  
between	
  two	
  certain	
  sets	
  [1/2]	
•  A = {a1, a2, … , an}, B = {b1, b2, …, bm}
•  A ∩ B = ∅.
•  Both sets A and B may define the order
relations differently.
•  prove that we cannot discover the relationship
between sets A and B or other relationships
when we get relationship f between a1 ∈ A
and b1 ∈ B. à b1=f(a1)
Proof	
  of	
  inconsistency	
  of	
  order	
  rela,on	
  
between	
  two	
  certain	
  sets	
  [2/2]	
•  We prove that it is satisfied when bi = f(ai) is not
true by induction.
–  b1 = f(a1) is true by the above condition when i = 1.
–  We assume that bk = f(ak) is true when i = k.
–  When i = k + 1, bk+1 = f(ak+1) is not true.
•  set A has an order relation. set B has another order
relation.
–  bk ≤ bk+1 may not be true, if ak ≤ ak+1 is true and vice
versa. Furthermore, both ak ≤ ak+1 and bk ≤ bk+1 may
not be true.
•  Although b1 = f(a1) is true, bi = f(ai) is not.
Proof	
  of	
  inconsistency	
  of	
  the	
  transi,ve	
  
rela,on	
  between	
  two	
  certain	
  sets[1/2]	
•  A = {a1, a2, … , an}, B = {b1, b2, …, bm}
•  A ∩ B = ∅.
•  Set B has order relation b1 ≤ b2 ≤ b3 ≤ b4…
– Transitive relation
– If b1 ≤ b2 and b2 ≤ b3 are true, b1 ≤ b3 is true
•  Set A has its own order relation.
Proof	
  of	
  inconsistency	
  of	
  the	
  transi,ve	
  
rela,on	
  between	
  two	
  certain	
  sets[2/2]	
•  Assume a1 = (1, 5), b1 =(2, 1), b2 = (3, 2), b3 = (4, 3).
•  We prove that a1 ≤ b3 is true when we get relation a1 ≤ b1.
•  To reveal the conclusion first, a1 ≤ b3 may not satisfy.
•  The relationship of a1 and b1 focuses on each first element.
•  Then a1 ≤ b1 is true.
•  The order relation of set B focuses on more values of each second
element.
•  Then b1 ≤ b2 ≤ b3, and if b1 ≤ b2 and b2 ≤ b3 is true, then b1 ≤ b3 is
true.
•  However, a1 ≤ b3 is not true in the order set of set B.
•  Like the relation of a1 and b1, an inconsistency occurs whose order
and transitive relations of set B are not guaranteed.
Inconsistencies	
  	
  
–	
  Three	
  Opened	
  Assump,on’s	
  Evils	
  	
•  Inconsistency	
  is	
  shown	
  whose	
  rela,on	
  does	
  not	
  
guarantee	
  the	
  future	
  
•  Inconsistency	
  where	
  any	
  transi,ve	
  rela,on	
  is	
  not	
  
true,	
  when	
  anyone	
  connects	
  links	
  for	
  
heterogeneous	
  fields	
  
•  Inconsistency	
  where	
  any	
  rela,on	
  in	
  
heterogeneous	
  fields	
  cannot	
  be	
  discovered	
  in	
  set	
  
theory
Misconcep,on	
  of	
  Future	
  Informa,on	
  
Systems	
•  A	
  user	
  Do	
  Not	
  want	
  to	
  retrieve	
  some	
  data,	
  need	
  
some	
  solu,ons	
  
–  A	
  system	
  solve	
  some	
  clues	
  for	
  a	
  user	
  from	
  data	
  by	
  
rela,vely	
  comparing	
  
–  It	
  is	
  important	
  to	
  rela,vely	
  compare	
  between	
  data.	
  
•  We	
  can	
  Not	
  write	
  anymore	
  rela,onships	
  
–  dynamical	
  changing	
  depending	
  on	
  user,	
  situa,on,	
  etc.	
  
–  when	
  data	
  are	
  changing,	
  rela,onships	
  are	
  changing	
  
•  We	
  cannot	
  create	
  indexes.	
  
•  We	
  cannot	
  discover	
  without	
  wri,ng	
  rela,onships	
  
–  However,	
  a	
  system	
  can	
  compare	
  on	
  the	
  basis.	
  
Functional Predicate	
Set Theory	
Coordinates System	
•  commutative property
•  associative property
•  distributive property
•  reflexive relation
•  antisymmetric relation
•  transitive relation	
•  axis adaptability evaluation
•  uniqueness evaluation
•  certainty evaluation
•  predicate satisfaction evaluation	
Incomplete	
  Mutual	
  Map	
  Transforma,on	
  
Framework	
  between	
  set	
  theory	
  and	
  the	
  
Cartesian	
  system	
  of	
  coordinates.	
Mutual	
  mapping	
  by	
  mathema,cal	
  rule,	
  formula,	
  etc.	
  
(Because	
  the	
  mathema,cal	
  rule	
  and	
  formula	
  are	
  closed	
  assump,on)
Overview	
  of	
  our	
  method	
Sampling	
  Data	
•  A	
  query	
  given	
  by	
  a	
  user	
•  Sampling	
  the	
  data	
  set	
  depend	
  on	
  a	
  
query	
Selec,on	
  of	
  Basis	
•  A	
  system	
  selects	
  some	
  basis	
  for	
  solu,on	
  of	
  query	
•  Order	
  rela,onships?,	
  con,nues	
  or	
  equal	
  interval	
  
Sampling?	
  	
  	
Mapping	
  	
  from	
  set	
  
theory	
  to	
  the	
  
Cartesian	
  system	
  of	
  
coordinates	
•  Mathema,cal	
  rule/formula	
  	
  
à	
  closed	
  assump,on	
•  Crea,on	
  transforma,on	
  opera,on	
  on	
  the	
  closed	
  
assump,on	
  manually.	
Discovery	
  of	
  
rela,onships	
  on	
  the	
  
the	
  Cartesian	
  
system	
  of	
  
coordinates	
•  Predefini,on	
  of	
  func,onal	
  predicates	
•  Sa,sfying	
  each	
  func,on	
  predicates	
	
  Re-­‐mapping	
  to	
  set	
  
theory	
•  	
  Representa,on	
  of	
  	
  predicate	
  in	
  predicate	
  func,ons	
•  	
  Representa,on	
  of	
  reasons	
  in	
  basis	
1	
2	
3	
4	
5
Example:	
  Crea,on	
  Func,onal	
  
Predicate	
  –	
  dependOn	
•  ”dependOn” means that set A relies on set X.
– The value of element ai of set A should only
change with the variation of the value of element xj
of set X.
•  ”dependOn” is represented in {A}(X), when set
A depends on set X.
Example	
  Dataset	
 	
 Jan.	
 Feb.	
 Mar.	
 Apr.	
 May.	
Jun.	
 Jul.	
 Aug.	
 Sep.	
 Oct.	
 Nov.	
 Dec.	
 Ave.	
2007	
 4.9	
 6.1	
 8.2	
 12.3	
 18.7	
 22.7	
 23.5	
 28	
 24.1	
 17.1	
 11	
 6.5	
 15.3	
2008	
 3.6	
 2.9	
 9	
 13.6	
 18	
 21.1	
 26.3	
 25.8	
 22.9	
 17.6	
 10.7	
 6.9	
 14.9	
2009	
 4.3	
 5.5	
 7.6	
 14.1	
 19.4	
 22.2	
 25.4	
 25.8	
 22	
 16.8	
 11.4	
 6.7	
 15.1	
2010	
 4.3	
 4.8	
 7.2	
 11.2	
 18.1	
 23.5	
 27	
 29	
 24.2	
 17.7	
 11.2	
 7.2	
 15.5	
2011	
 2.4	
 4.9	
 6.1	
 12.6	
 17.8	
 22.9	
 27.1	
 26.6	
 23.9	
 17.1	
 12.3	
 4.8	
 14.9	
 	
 cucumber	
 cabbage	
2007	
 1168	
 604	
2008	
 1226	
 594	
2009	
 1102	
 662	
2010	
 1231	
 739	
2011	
 1179	
 573	
MONTHLY	
  AVERAGE	
  TEMPERATURE	
  IN	
  GUMMA	
  PREFECTURE,	
  JAPAN	
ANNUAL	
  AVERAGE	
  PRICE	
  (Y	
  en)	
  OF	
  CUCUMBERS	
  (5kg)	
  AND	
  CABBAGE(10kg)
Result	
 	
  	
 Jan	
 Feb	
 Mar	
 Apr	
 May	
 Jun	
 Jul	
 Aug	
 Sep	
 Oct	
 Nov	
 Dec	
Cucumber	
AAE	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
UE	
 0.031 	
 0.394 	
 0.028 	
 0.345 	
 0.707 	
 0.002 	
 0.207 	
 0.188 	
 0.355 	
 0.924 	
 0.090 	
 0.043 	
CE	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
BV	
 -9.590 	
-27.269 	
 8.075 	
-27.022 	
-67.471 	
 2.254 	
 16.039 	
 16.006 	
 32.882 	
132.937 	
-25.899 	
 11.466 	
Cabbage	
AAE	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
UE	
 0.243 	
 0.024 	
 0.007 	
 0.199 	
 0.052 	
 0.255 	
 0.045 	
 0.330 	
 0.003 	
 0.114 	
 0.048 	
 0.436 	
CE	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
 1.000 	
BV	
 34.617 	
 8.705 	
 -5.190 	
-26.357 	
 23.588 	
 37.635 	
 9.576 	
 27.231 	
 3.696 	
 59.930 	
-24.346 	
 47.057 	
•  AAE: axis adaptability evaluation
•  UE: uniqueness evaluation
•  CE: certainty evaluation
•  BV: predicate satisfaction evaluation	
{Cucumber Price}(May temperature)	
Discovered	
  dependOn	
  Rela,ons	
{Cucumber Price}(Oct temperature)	
{Cabbage Price}(Dec temperature)
Conclusion	
•  Three	
  opened	
  assump,on	
  evils	
  
–  We	
  represented	
  the	
  inconsistencies	
  of	
  past	
  researches	
  that	
  
contributed	
  to	
  the	
  interconnec,on	
  of	
  such	
  heterogeneous	
  
fields	
  as	
  Linked	
  Data,	
  and	
  our	
  past	
  researches.	
  
•  Map	
  transforma,on	
  framework	
  from	
  set	
  theory	
  to	
  the	
  
Cartesian	
  system	
  of	
  coordinates	
  
–  defining	
  such	
  predicate	
  func,ons	
  as	
  disjoint, meet, overlap,
coveredBy, covers, equal, contain, inside, correlate, moreThan,
lessThan, alongWith, join, etc.
•  A	
  preliminary	
  evalua,on	
  of	
  predicate	
  func,on	
  
”dependOn”
Thank	
  you

More Related Content

Viewers also liked

Recent Samples of Cartoon Maps, Caricatures, T Shirts and Illustrations for Apps
Recent Samples of Cartoon Maps, Caricatures, T Shirts and Illustrations for AppsRecent Samples of Cartoon Maps, Caricatures, T Shirts and Illustrations for Apps
Recent Samples of Cartoon Maps, Caricatures, T Shirts and Illustrations for AppsDave Stephens
 
Formula hipo alergenik untuk pencegahan alergi
Formula hipo alergenik untuk pencegahan alergiFormula hipo alergenik untuk pencegahan alergi
Formula hipo alergenik untuk pencegahan alergiAriyanto Harsono
 
Genetics-2
Genetics-2Genetics-2
Genetics-2Ash G
 
Final renja-2013-sesuai-dpa-2013
Final renja-2013-sesuai-dpa-2013Final renja-2013-sesuai-dpa-2013
Final renja-2013-sesuai-dpa-2013Bhaba Remplex
 
Nöd i paradiset i ppt97
Nöd i paradiset i ppt97Nöd i paradiset i ppt97
Nöd i paradiset i ppt97Matilda Leksell
 
Wahid’s philosophy the examined & careful consideration of strategic plannin...
Wahid’s philosophy  the examined & careful consideration of strategic plannin...Wahid’s philosophy  the examined & careful consideration of strategic plannin...
Wahid’s philosophy the examined & careful consideration of strategic plannin...Mohammad Wahid Abdullah Khan
 
Achieve Closing the Expectations Gap 2014 Webinar Slides
Achieve Closing the Expectations Gap 2014 Webinar SlidesAchieve Closing the Expectations Gap 2014 Webinar Slides
Achieve Closing the Expectations Gap 2014 Webinar SlidesAchieve, Inc.
 
The ABC’S of The 2014 Kitchen, Bath & Renovation/Home Expressions Home & Gard...
The ABC’S of The 2014 Kitchen, Bath & Renovation/Home Expressions Home & Gard...The ABC’S of The 2014 Kitchen, Bath & Renovation/Home Expressions Home & Gard...
The ABC’S of The 2014 Kitchen, Bath & Renovation/Home Expressions Home & Gard...MBHomeBuilders
 

Viewers also liked (9)

Recent Samples of Cartoon Maps, Caricatures, T Shirts and Illustrations for Apps
Recent Samples of Cartoon Maps, Caricatures, T Shirts and Illustrations for AppsRecent Samples of Cartoon Maps, Caricatures, T Shirts and Illustrations for Apps
Recent Samples of Cartoon Maps, Caricatures, T Shirts and Illustrations for Apps
 
Formula hipo alergenik untuk pencegahan alergi
Formula hipo alergenik untuk pencegahan alergiFormula hipo alergenik untuk pencegahan alergi
Formula hipo alergenik untuk pencegahan alergi
 
Genetics-2
Genetics-2Genetics-2
Genetics-2
 
Final renja-2013-sesuai-dpa-2013
Final renja-2013-sesuai-dpa-2013Final renja-2013-sesuai-dpa-2013
Final renja-2013-sesuai-dpa-2013
 
Fatigue
FatigueFatigue
Fatigue
 
Nöd i paradiset i ppt97
Nöd i paradiset i ppt97Nöd i paradiset i ppt97
Nöd i paradiset i ppt97
 
Wahid’s philosophy the examined & careful consideration of strategic plannin...
Wahid’s philosophy  the examined & careful consideration of strategic plannin...Wahid’s philosophy  the examined & careful consideration of strategic plannin...
Wahid’s philosophy the examined & careful consideration of strategic plannin...
 
Achieve Closing the Expectations Gap 2014 Webinar Slides
Achieve Closing the Expectations Gap 2014 Webinar SlidesAchieve Closing the Expectations Gap 2014 Webinar Slides
Achieve Closing the Expectations Gap 2014 Webinar Slides
 
The ABC’S of The 2014 Kitchen, Bath & Renovation/Home Expressions Home & Gard...
The ABC’S of The 2014 Kitchen, Bath & Renovation/Home Expressions Home & Gard...The ABC’S of The 2014 Kitchen, Bath & Renovation/Home Expressions Home & Gard...
The ABC’S of The 2014 Kitchen, Bath & Renovation/Home Expressions Home & Gard...
 

Similar to Inconsistencies of Connection for Heterogeneity and a New Rela,on Discovery Method

Introduction to Probabilistic Latent Semantic Analysis
Introduction to Probabilistic Latent Semantic AnalysisIntroduction to Probabilistic Latent Semantic Analysis
Introduction to Probabilistic Latent Semantic AnalysisNYC Predictive Analytics
 
Dive into the Data
Dive into the DataDive into the Data
Dive into the Datadr_jp_ebejer
 
So sánh cấu trúc protein_Protein structure comparison
So sánh cấu trúc protein_Protein structure comparisonSo sánh cấu trúc protein_Protein structure comparison
So sánh cấu trúc protein_Protein structure comparisonbomxuan868
 
Structured equation model
Structured equation modelStructured equation model
Structured equation modelKing Abidi
 
How Does Math Matter in Data Science
How Does Math Matter in Data ScienceHow Does Math Matter in Data Science
How Does Math Matter in Data ScienceMutia Ulfi
 
Computing Contrast on Conceptual Spaces
Computing Contrast on Conceptual SpacesComputing Contrast on Conceptual Spaces
Computing Contrast on Conceptual SpacesGiovanni Sileno
 
Topic 5 (multiple regression)
Topic 5 (multiple regression)Topic 5 (multiple regression)
Topic 5 (multiple regression)Ryan Herzog
 
Rank Monotonicity in Centrality Measures (A report about Quality guarantees f...
Rank Monotonicity in Centrality Measures (A report about Quality guarantees f...Rank Monotonicity in Centrality Measures (A report about Quality guarantees f...
Rank Monotonicity in Centrality Measures (A report about Quality guarantees f...Mahdi Cherif
 
Lecture-2-Relational-Algebra-and-SQL-Advanced-DataBase-Theory-MS.pdf
Lecture-2-Relational-Algebra-and-SQL-Advanced-DataBase-Theory-MS.pdfLecture-2-Relational-Algebra-and-SQL-Advanced-DataBase-Theory-MS.pdf
Lecture-2-Relational-Algebra-and-SQL-Advanced-DataBase-Theory-MS.pdfssuserf86fba
 
PCA (Principal component analysis)
PCA (Principal component analysis)PCA (Principal component analysis)
PCA (Principal component analysis)Learnbay Datascience
 
Introducing: A Complete Algebra of Data
Introducing: A Complete Algebra of DataIntroducing: A Complete Algebra of Data
Introducing: A Complete Algebra of DataInside Analysis
 
Topic 5 (multiple regression)
Topic 5 (multiple regression)Topic 5 (multiple regression)
Topic 5 (multiple regression)Ryan Herzog
 

Similar to Inconsistencies of Connection for Heterogeneity and a New Rela,on Discovery Method (20)

Introduction to Probabilistic Latent Semantic Analysis
Introduction to Probabilistic Latent Semantic AnalysisIntroduction to Probabilistic Latent Semantic Analysis
Introduction to Probabilistic Latent Semantic Analysis
 
Lecture 4
Lecture 4Lecture 4
Lecture 4
 
Dive into the Data
Dive into the DataDive into the Data
Dive into the Data
 
RelateSheets_SSDBM2013_Poster
RelateSheets_SSDBM2013_PosterRelateSheets_SSDBM2013_Poster
RelateSheets_SSDBM2013_Poster
 
Covariance.pdf
Covariance.pdfCovariance.pdf
Covariance.pdf
 
So sánh cấu trúc protein_Protein structure comparison
So sánh cấu trúc protein_Protein structure comparisonSo sánh cấu trúc protein_Protein structure comparison
So sánh cấu trúc protein_Protein structure comparison
 
Structured equation model
Structured equation modelStructured equation model
Structured equation model
 
How Does Math Matter in Data Science
How Does Math Matter in Data ScienceHow Does Math Matter in Data Science
How Does Math Matter in Data Science
 
Computing Contrast on Conceptual Spaces
Computing Contrast on Conceptual SpacesComputing Contrast on Conceptual Spaces
Computing Contrast on Conceptual Spaces
 
Multiple linear regression
Multiple linear regressionMultiple linear regression
Multiple linear regression
 
Topic 5 (multiple regression)
Topic 5 (multiple regression)Topic 5 (multiple regression)
Topic 5 (multiple regression)
 
Rank Monotonicity in Centrality Measures (A report about Quality guarantees f...
Rank Monotonicity in Centrality Measures (A report about Quality guarantees f...Rank Monotonicity in Centrality Measures (A report about Quality guarantees f...
Rank Monotonicity in Centrality Measures (A report about Quality guarantees f...
 
Lecture1_jps (1).ppt
Lecture1_jps (1).pptLecture1_jps (1).ppt
Lecture1_jps (1).ppt
 
Lecture1_jps.ppt
Lecture1_jps.pptLecture1_jps.ppt
Lecture1_jps.ppt
 
Lecture-2-Relational-Algebra-and-SQL-Advanced-DataBase-Theory-MS.pdf
Lecture-2-Relational-Algebra-and-SQL-Advanced-DataBase-Theory-MS.pdfLecture-2-Relational-Algebra-and-SQL-Advanced-DataBase-Theory-MS.pdf
Lecture-2-Relational-Algebra-and-SQL-Advanced-DataBase-Theory-MS.pdf
 
PCA (Principal component analysis)
PCA (Principal component analysis)PCA (Principal component analysis)
PCA (Principal component analysis)
 
Introducing: A Complete Algebra of Data
Introducing: A Complete Algebra of DataIntroducing: A Complete Algebra of Data
Introducing: A Complete Algebra of Data
 
Data Mining Lecture_9.pptx
Data Mining Lecture_9.pptxData Mining Lecture_9.pptx
Data Mining Lecture_9.pptx
 
An introduction to R
An introduction to RAn introduction to R
An introduction to R
 
Topic 5 (multiple regression)
Topic 5 (multiple regression)Topic 5 (multiple regression)
Topic 5 (multiple regression)
 

Recently uploaded

My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 

Recently uploaded (20)

My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 

Inconsistencies of Connection for Heterogeneity and a New Rela,on Discovery Method

  • 1. Inconsistencies  of  Connec,on  for   Heterogeneity  and   a  New  Rela,on  Discovery  Method  that   Solved  them Takafumi  NAKANISHI  ,  Kiyotaka  UCHIMOTO,  Yutaka  KIDAWARA   Na,onal  Ins,tute  of  Informa,on  and  Communica,on   Technology  (NICT),  Japan  
  • 2. What’s  Big  Data? •  Speed  up?  Processing  a  lot  of  data?   –  What  differences  are  there  between  VLDB  and  Big   Data.  (Very  Large  Database)?   •  Fragmental  data  exist   –  Un,l  now,  scien,sts  work  such  data  for  simula,on.   •  Heterogeneous  Database  Integra,on(Cross   database  search)   –  S,ll  Considering?
  • 3. Purposes  of  this  presenta,on •  We  should  consider  the  paradigm  shiV  in   computer  science.   – From  the  closed  assump,on  to  the  opened   assump,on   – What  are  there  any  problems?   •  Businesspeople  require  not  only  EDW  (Enterprise  Data   Warehouse)  but  also  the  other  analysis  methods.     •  Discovering  rela,on  between  heterogeneous  concept,   dataset,  etc.   •  Three  Opened  Assump,on’s  Evils  
  • 4. True  Problem  Defini,ons  of  Big  Data Rela,on  Discovery  in   Heterogeneity   Big   data Speeding  Up,   Promo,on  of   Streamlining,  and   Increasing  Data   Volume    for  Processing Schemaless   Data  and  New   Data  Processing   Method Distributed  Parallel   Processing,  High   Performance  Compu,ng   (HPC),  Network  Delay,   etc.   Construc)on  of  Big  data   environment  (Hardware,   middleware  researches) Big  data  analy)cs   (So=ware  researches) Closed  Assump,on  System   à  Open  Assump,on  System
  • 5. AI  Community DB  Community a1 a2 b10 b8 a9 a8 a7 a6 a5 a4 a3 b9 b6 b7 b4 b5 b2 b3 b1 Someone  adds  rela,onships  between  a3  and  b4 Rela,onships  among  persons  in  communi,es  AI  and  DB.  ai,  bj  are  researchers.  When  someone   adds  symmetric  and  transi,ve  rela,onships  between  a3  and  b4,  it  is  true  that  a1  is  related  to   b5  because  a1  is  related  to  a3,  a3  is  related  to  b4,  and  b4  is  related  to  b5.
  • 6. Office  Community Music  Community a1 a2 b10 b8 a9 a8 a7 a6 a5 a4 a3 b9 b6 b7 b4 b5 b2 b3 b1 Someone  adds   rela,onships   between  a3  and  b4 Rela,onships  among  persons  in  workplace  and  music  communi,es.   ai  are  co-­‐workers,  and  bj  are  musicians.  When  someone  adds  symmetric  and  transi,ve   rela,onships  between  a3  and  b4,  it  is  actually  not  true  that a1  is  related  to  b5.  In  graph   structure,  it  is  true  that  a1  is  related  to  b5. However,  realis,cally,  a1  and  b5  do  not  share   ground  without  other  defini,ons  or  analysis.
  • 7. Difference  of  two  examples •  “AI Community” ∩  “DB Community” ≠ ∅. à Closed Assumption – Representation of relations in the previous methods such as owl, RDF, etc. •  “Office  Community” ∩  “Music  Community”  =  ∅. àOpened Assumption – unable representation of relations in the previous method
  • 8. Proof  of  inconsistency  of  order  rela,on   between  two  certain  sets  [1/2] •  A = {a1, a2, … , an}, B = {b1, b2, …, bm} •  A ∩ B = ∅. •  Both sets A and B may define the order relations differently. •  prove that we cannot discover the relationship between sets A and B or other relationships when we get relationship f between a1 ∈ A and b1 ∈ B. à b1=f(a1)
  • 9. Proof  of  inconsistency  of  order  rela,on   between  two  certain  sets  [2/2] •  We prove that it is satisfied when bi = f(ai) is not true by induction. –  b1 = f(a1) is true by the above condition when i = 1. –  We assume that bk = f(ak) is true when i = k. –  When i = k + 1, bk+1 = f(ak+1) is not true. •  set A has an order relation. set B has another order relation. –  bk ≤ bk+1 may not be true, if ak ≤ ak+1 is true and vice versa. Furthermore, both ak ≤ ak+1 and bk ≤ bk+1 may not be true. •  Although b1 = f(a1) is true, bi = f(ai) is not.
  • 10. Proof  of  inconsistency  of  the  transi,ve   rela,on  between  two  certain  sets[1/2] •  A = {a1, a2, … , an}, B = {b1, b2, …, bm} •  A ∩ B = ∅. •  Set B has order relation b1 ≤ b2 ≤ b3 ≤ b4… – Transitive relation – If b1 ≤ b2 and b2 ≤ b3 are true, b1 ≤ b3 is true •  Set A has its own order relation.
  • 11. Proof  of  inconsistency  of  the  transi,ve   rela,on  between  two  certain  sets[2/2] •  Assume a1 = (1, 5), b1 =(2, 1), b2 = (3, 2), b3 = (4, 3). •  We prove that a1 ≤ b3 is true when we get relation a1 ≤ b1. •  To reveal the conclusion first, a1 ≤ b3 may not satisfy. •  The relationship of a1 and b1 focuses on each first element. •  Then a1 ≤ b1 is true. •  The order relation of set B focuses on more values of each second element. •  Then b1 ≤ b2 ≤ b3, and if b1 ≤ b2 and b2 ≤ b3 is true, then b1 ≤ b3 is true. •  However, a1 ≤ b3 is not true in the order set of set B. •  Like the relation of a1 and b1, an inconsistency occurs whose order and transitive relations of set B are not guaranteed.
  • 12. Inconsistencies     –  Three  Opened  Assump,on’s  Evils   •  Inconsistency  is  shown  whose  rela,on  does  not   guarantee  the  future   •  Inconsistency  where  any  transi,ve  rela,on  is  not   true,  when  anyone  connects  links  for   heterogeneous  fields   •  Inconsistency  where  any  rela,on  in   heterogeneous  fields  cannot  be  discovered  in  set   theory
  • 13. Misconcep,on  of  Future  Informa,on   Systems •  A  user  Do  Not  want  to  retrieve  some  data,  need   some  solu,ons   –  A  system  solve  some  clues  for  a  user  from  data  by   rela,vely  comparing   –  It  is  important  to  rela,vely  compare  between  data.   •  We  can  Not  write  anymore  rela,onships   –  dynamical  changing  depending  on  user,  situa,on,  etc.   –  when  data  are  changing,  rela,onships  are  changing   •  We  cannot  create  indexes.   •  We  cannot  discover  without  wri,ng  rela,onships   –  However,  a  system  can  compare  on  the  basis.  
  • 14. Functional Predicate Set Theory Coordinates System •  commutative property •  associative property •  distributive property •  reflexive relation •  antisymmetric relation •  transitive relation •  axis adaptability evaluation •  uniqueness evaluation •  certainty evaluation •  predicate satisfaction evaluation Incomplete  Mutual  Map  Transforma,on   Framework  between  set  theory  and  the   Cartesian  system  of  coordinates. Mutual  mapping  by  mathema,cal  rule,  formula,  etc.   (Because  the  mathema,cal  rule  and  formula  are  closed  assump,on)
  • 15. Overview  of  our  method Sampling  Data •  A  query  given  by  a  user •  Sampling  the  data  set  depend  on  a   query Selec,on  of  Basis •  A  system  selects  some  basis  for  solu,on  of  query •  Order  rela,onships?,  con,nues  or  equal  interval   Sampling?     Mapping    from  set   theory  to  the   Cartesian  system  of   coordinates •  Mathema,cal  rule/formula     à  closed  assump,on •  Crea,on  transforma,on  opera,on  on  the  closed   assump,on  manually. Discovery  of   rela,onships  on  the   the  Cartesian   system  of   coordinates •  Predefini,on  of  func,onal  predicates •  Sa,sfying  each  func,on  predicates  Re-­‐mapping  to  set   theory •   Representa,on  of    predicate  in  predicate  func,ons •   Representa,on  of  reasons  in  basis 1 2 3 4 5
  • 16. Example:  Crea,on  Func,onal   Predicate  –  dependOn •  ”dependOn” means that set A relies on set X. – The value of element ai of set A should only change with the variation of the value of element xj of set X. •  ”dependOn” is represented in {A}(X), when set A depends on set X.
  • 17. Example  Dataset   Jan. Feb. Mar. Apr. May. Jun. Jul. Aug. Sep. Oct. Nov. Dec. Ave. 2007 4.9 6.1 8.2 12.3 18.7 22.7 23.5 28 24.1 17.1 11 6.5 15.3 2008 3.6 2.9 9 13.6 18 21.1 26.3 25.8 22.9 17.6 10.7 6.9 14.9 2009 4.3 5.5 7.6 14.1 19.4 22.2 25.4 25.8 22 16.8 11.4 6.7 15.1 2010 4.3 4.8 7.2 11.2 18.1 23.5 27 29 24.2 17.7 11.2 7.2 15.5 2011 2.4 4.9 6.1 12.6 17.8 22.9 27.1 26.6 23.9 17.1 12.3 4.8 14.9   cucumber cabbage 2007 1168 604 2008 1226 594 2009 1102 662 2010 1231 739 2011 1179 573 MONTHLY  AVERAGE  TEMPERATURE  IN  GUMMA  PREFECTURE,  JAPAN ANNUAL  AVERAGE  PRICE  (Y  en)  OF  CUCUMBERS  (5kg)  AND  CABBAGE(10kg)
  • 18. Result     Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec Cucumber AAE 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 UE 0.031 0.394 0.028 0.345 0.707 0.002 0.207 0.188 0.355 0.924 0.090 0.043 CE 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 BV -9.590 -27.269 8.075 -27.022 -67.471 2.254 16.039 16.006 32.882 132.937 -25.899 11.466 Cabbage AAE 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 UE 0.243 0.024 0.007 0.199 0.052 0.255 0.045 0.330 0.003 0.114 0.048 0.436 CE 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 BV 34.617 8.705 -5.190 -26.357 23.588 37.635 9.576 27.231 3.696 59.930 -24.346 47.057 •  AAE: axis adaptability evaluation •  UE: uniqueness evaluation •  CE: certainty evaluation •  BV: predicate satisfaction evaluation {Cucumber Price}(May temperature) Discovered  dependOn  Rela,ons {Cucumber Price}(Oct temperature) {Cabbage Price}(Dec temperature)
  • 19. Conclusion •  Three  opened  assump,on  evils   –  We  represented  the  inconsistencies  of  past  researches  that   contributed  to  the  interconnec,on  of  such  heterogeneous   fields  as  Linked  Data,  and  our  past  researches.   •  Map  transforma,on  framework  from  set  theory  to  the   Cartesian  system  of  coordinates   –  defining  such  predicate  func,ons  as  disjoint, meet, overlap, coveredBy, covers, equal, contain, inside, correlate, moreThan, lessThan, alongWith, join, etc. •  A  preliminary  evalua,on  of  predicate  func,on   ”dependOn”