SlideShare a Scribd company logo
1 of 32
Download to read offline
Some	
  slide	
  prompts	
  to	
  support	
  a	
  data	
  framing	
  inves3ga3on	
  around	
  corporate	
  data	
  –	
  
originally	
  prepared	
  for	
  the	
  OGP	
  Fes3val,	
  London,	
  October	
  2013.	
  
For	
  more	
  informa3on,	
  contact:	
  schoolOfData.org	
  

1	
  
These	
  notes	
  provide	
  a	
  worked	
  example	
  of	
  how	
  to	
  download	
  company	
  ownership	
  
rela3onship	
  data	
  from	
  OpenCorproates	
  (opencorporates.com)	
  using	
  the	
  cross-­‐
plaNorm	
  data	
  cleaning	
  tool	
  OpenRefine	
  (openrefine.org),	
  and	
  then	
  visualise	
  the	
  data	
  
using	
  the	
  cross-­‐plaNorm	
  Gephi	
  netwrok	
  visualisa3on	
  tool	
  (gephi.org).	
  

2	
  
OpenCorporates	
  is	
  a	
  private	
  company	
  that	
  has	
  set	
  itself	
  the	
  ambi3ous	
  task	
  of	
  building	
  
a	
  database	
  of	
  registered	
  company	
  informa3on	
  for	
  every	
  legal	
  corporate	
  en3ty	
  in	
  the	
  
world.	
  
One	
  of	
  the	
  views	
  OpenCorporates	
  offers	
  over	
  at	
  least	
  some	
  of	
  the	
  data	
  in	
  its	
  database	
  
shows	
  how	
  companies	
  are	
  connected	
  by	
  beneficial	
  ownership	
  or	
  shareholder	
  
rela3onships.	
  
Although	
  complex,	
  this	
  diagram	
  is	
  “human	
  readable”	
  –	
  the	
  data	
  is	
  presented	
  in	
  a	
  way	
  
that	
  is	
  intended	
  to	
  make	
  some	
  sort	
  of	
  meaningful	
  sense	
  to	
  us.	
  

3	
  
But	
  as	
  well	
  as	
  publishing	
  data	
  for	
  us	
  humans	
  to	
  read,	
  OpenCorporates	
  also	
  makes	
  
data	
  available	
  in	
  a	
  way	
  that	
  machines	
  can	
  read	
  	
  -­‐	
  machine	
  readable	
  data.	
  
You	
  may	
  have	
  heard	
  of	
  the	
  term	
  “API”	
  in	
  the	
  context	
  of	
  data	
  publishing	
  websites.	
  To	
  
all	
  intents	
  and	
  purposes,	
  an	
  API	
  is	
  an	
  interface	
  that	
  computers	
  can	
  use	
  to	
  get	
  
informa3on	
  out	
  of	
  websites	
  in	
  a	
  way	
  that	
  they,	
  and	
  the	
  databases	
  they	
  work	
  with,	
  can	
  
understand.	
  
The	
  data	
  is	
  published	
  in	
  a	
  format	
  known	
  as	
  JSON	
  –	
  Javascript	
  Object	
  Nota3on.	
  But	
  
you	
  don’t	
  really	
  need	
  to	
  know	
  much	
  more	
  than	
  that	
  –	
  just	
  that	
  it’s	
  called	
  JSON,	
  and	
  
tools	
  that	
  can	
  parse	
  and	
  work	
  with	
  JSON	
  can	
  parse	
  and	
  work	
  with	
  the	
  data	
  that	
  the	
  
OpenCorporates	
  API	
  publishes.	
  

4	
  
If	
  you	
  aren’t	
  a	
  programmer,	
  here’s	
  way	
  of	
  ge]ng	
  the	
  data	
  out	
  of	
  OpenCorporates	
  and	
  
into	
  a	
  tabular	
  form	
  you	
  may	
  be	
  more	
  comfortable	
  with,	
  and	
  which	
  we	
  can	
  use	
  to	
  
generate	
  a	
  network	
  diagram	
  to	
  display	
  in	
  a	
  tool	
  such	
  as	
  Gephi…	
  
You	
  can	
  download	
  the	
  OpenRefine	
  applica3on	
  from	
  openrefine.org.	
  When	
  you	
  run	
  it	
  
on	
  your	
  computer,	
  it	
  will	
  launch	
  an	
  applica3on	
  that	
  runs	
  inside	
  a	
  browser	
  tab	
  using	
  
your	
  default	
  web	
  browser.	
  

5	
  
We	
  can	
  get	
  company	
  ownership	
  (subsidiary	
  rela3ons,	
  major	
  shareholdings,	
  etc)	
  from	
  
OpenCorporates	
  by	
  hacking	
  the	
  web	
  address/URL	
  of	
  a	
  company	
  page	
  on	
  
OpenCorporates.	
  
From	
  a	
  company	
  page	
  on	
  OpenCorporates,	
  which	
  should	
  have	
  the	
  form:	
  
http://opencorporates.com/companies/JURISDICTION/
COMPANY_ID!
add	
  the	
  following	
  to	
  the	
  end	
  of	
  the	
  web	
  address/URL:	
  
/network.json?depth=2	
  
to	
  give	
  something	
  with	
  the	
  following	
  form:	
  
http://opencorporates.com/companies/JURISDICTION/
COMPANY_ID/network.json?depth=2!
(Note:	
  company	
  network	
  data	
  may	
  not	
  be	
  available	
  in	
  all	
  jurisdic3ons	
  or	
  for	
  all	
  
companies.)	
  

6	
  
In	
  OpenRefine,	
  select	
  the	
  op3on	
  to	
  Create	
  [a	
  new]	
  Project	
  using	
  the	
  web	
  address	
  –	
  or	
  
URL	
  –	
  to	
  the	
  JSON	
  data	
  page	
  that	
  reveals	
  the	
  data	
  rela3ng	
  to	
  the	
  corporate	
  ownership	
  
network	
  of	
  the	
  company	
  we	
  are	
  interested	
  in	
  on	
  OpenCorporates.	
  
Note	
  that	
  you	
  can	
  import	
  data	
  into	
  OpenRefine	
  from	
  several	
  web	
  addresses	
  all	
  in	
  one	
  
go,	
  though	
  the	
  data	
  returned	
  from	
  each	
  URL	
  should	
  have	
  the	
  same	
  format	
  or	
  
structure.	
  
Using	
  mul3ple	
  URLs	
  results	
  in	
  a	
  combined	
  data	
  set,	
  which	
  can	
  be	
  quite	
  handy.	
  

7	
  
Being	
  machine	
  readable,	
  the	
  data	
  makes	
  more	
  sense	
  to	
  OpenRefine	
  than	
  it	
  probably	
  
does	
  to	
  us!	
  	
  
Select	
  a	
  block	
  of	
  data	
  in	
  the	
  preview	
  view	
  that	
  is	
  typical	
  of	
  a	
  set	
  of	
  data	
  that	
  you	
  want	
  
to	
  map	
  into	
  a	
  single	
  row	
  in	
  a	
  “tradi3onal”	
  spreadsheet	
  like	
  view.	
  
Data	
  blocks	
  are	
  typically	
  contained	
  within	
  braces	
  (curly	
  brackets);	
  these	
  things	
  :	
  {	
  }	
  
Note	
  that	
  in	
  some	
  machine	
  readable	
  data,	
  some	
  data	
  blocks	
  may	
  be	
  contained	
  within	
  
other	
  data	
  blocks…	
  
Each	
  of	
  the	
  items	
  in	
  a	
  single	
  data	
  block	
  can	
  be	
  mapped	
  into	
  a	
  separate	
  cell	
  –	
  that	
  is,	
  a	
  
separate	
  column	
  –	
  in	
  a	
  single	
  row	
  of	
  data.	
  
So	
  each	
  data	
  block	
  is	
  a	
  row,	
  and	
  each	
  item	
  in	
  the	
  block	
  is	
  a	
  column….	
  OpenRefine	
  will	
  
give	
  you	
  a	
  preview	
  of	
  how	
  the	
  data	
  will	
  look	
  if	
  you	
  click	
  the	
  right	
  bumon!	
  

8	
  
You	
  can	
  preview	
  the	
  effect	
  of	
  making	
  par3cular	
  block	
  selec3ons	
  using	
  Update	
  
Preview.	
  
To	
  return	
  to	
  the	
  block	
  highlighter,	
  use	
  ‘Pick	
  Record	
  Nodes’.	
  
When	
  you	
  are	
  happy	
  with	
  your	
  selec3on,	
  you	
  are	
  ready	
  to	
  “Create	
  Project”.	
  

9	
  
Once	
  we’re	
  happy	
  with	
  the	
  data	
  preview,	
  we	
  can	
  import	
  the	
  data	
  into	
  a	
  more	
  familiar	
  
looking	
  layout.	
  
The	
  arrows	
  at	
  the	
  top	
  of	
  each	
  column	
  pop	
  up	
  menus	
  that	
  allow	
  us	
  to	
  run	
  a	
  wide	
  
variety	
  of	
  opera3ons	
  on	
  a	
  column.	
  
One	
  of	
  the	
  opera3ons	
  let’s	
  us	
  change	
  the	
  column	
  name,	
  so	
  I’m	
  going	
  to	
  rename	
  the	
  
child	
  company	
  and	
  parent	
  company	
  columns	
  to	
  what	
  Gephi	
  expects:	
  Source	
  and	
  
Target.	
  

10	
  
This	
  is	
  the	
  format	
  that	
  Gephi	
  wants	
  to	
  see	
  when	
  we	
  import	
  data	
  from	
  a	
  simple	
  two	
  
column,	
  comma	
  separated	
  variable	
  (CSV)	
  text	
  file.	
  
One	
  of	
  the	
  columns	
  needs	
  to	
  be	
  called	
  Source,	
  another	
  needs	
  to	
  be	
  called	
  Target.	
  
When	
  construc3ng	
  the	
  network	
  diagram,	
  Gephi	
  then	
  knows	
  to	
  draw	
  a	
  line	
  going	
  from	
  
each	
  Source	
  element	
  to	
  the	
  corresponding	
  Target.	
  

11	
  
The	
  OpenCorporates	
  network	
  data	
  in	
  tabulated	
  form.	
  The	
  default	
  column	
  names	
  are	
  
not	
  necessarily	
  as	
  human	
  readable	
  as	
  they	
  could	
  be!	
  
In	
  par3cular,	
  we	
  can	
  iden3fy	
  the	
  name	
  of	
  the	
  parent	
  company	
  and	
  the	
  child	
  company	
  
for	
  each	
  ownership	
  rela3on.	
  We	
  also	
  have	
  access	
  to	
  the	
  OpenCorporates	
  IDs	
  for	
  all	
  of	
  
those	
  companies.	
  The	
  type	
  of	
  rela3onship	
  between	
  the	
  companies	
  is	
  also	
  described.	
  
For	
  the	
  moment,	
  we	
  will	
  treat	
  them	
  all	
  equally.	
  
(If	
  you	
  want	
  to	
  view	
  just	
  those	
  company	
  connec3ons	
  that	
  relate	
  to	
  a	
  par3cular	
  type	
  of	
  
rela3on,	
  use	
  the	
  Facet	
  or	
  Text	
  Filter	
  tool	
  applied	
  to	
  the	
  appropriate	
  column.)	
  

12	
  
From	
  the	
  appropriate	
  column	
  menu,	
  select	
  “Edit	
  Column”	
  and	
  then	
  “Rename	
  this	
  
column”	
  to	
  change	
  the	
  column	
  name.	
  

13	
  
We	
  can	
  now	
  export	
  the	
  data	
  using	
  the	
  Custom	
  Tabular	
  Exporter.	
  
Deselect	
  all	
  the	
  columns	
  then	
  select	
  just	
  the	
  Source	
  and	
  Target	
  columns	
  –	
  we	
  will	
  only	
  
export	
  data	
  from	
  these	
  two	
  columns.	
  

14	
  
Preview	
  your	
  data	
  to	
  check	
  that	
  it	
  looks	
  like	
  the	
  sort	
  of	
  data	
  you	
  expect	
  to	
  export.	
  
From	
  the	
  Download	
  tab,	
  select	
  the	
  CSV	
  output	
  type	
  and	
  export	
  your	
  data	
  –	
  it	
  should	
  
be	
  saved	
  into	
  the	
  default	
  download	
  directory	
  used	
  by	
  your	
  browser,	
  with	
  a	
  file	
  name	
  
that	
  corresponds	
  to	
  the	
  OpenRefine	
  project	
  name.	
  
You	
  should	
  have	
  the	
  two	
  column	
  data	
  saved	
  to	
  your	
  computer	
  that	
  you	
  can	
  now	
  load	
  
in	
  to	
  Gephi.	
  

15	
  
Gephi	
  is	
  a	
  powerful	
  cross-­‐plaNorm	
  desktop	
  tool	
  for	
  visualising	
  data	
  that	
  describes	
  
networks,	
  such	
  as	
  social	
  networks	
  or	
  corporate	
  ownership	
  networks.	
  You	
  can	
  import	
  
data	
  into	
  Gephi	
  using	
  specialised	
  graph/network	
  representa3on	
  formats,	
  or	
  from	
  
simple	
  two	
  column	
  data	
  files	
  where	
  each	
  describes	
  a	
  simple	
  connec3on	
  between	
  two	
  
elements	
  (eg	
  thing1,	
  thing2	
  would	
  say	
  that	
  thing1	
  connects	
  to	
  thing2).	
  
You	
  can	
  download	
  the	
  Gephi	
  applica3on	
  from	
  gephi.org.	
  When	
  you	
  run	
  it	
  on	
  your	
  
computer,	
  it	
  will	
  launch	
  a	
  desktop	
  applica3on.	
  Note	
  that	
  Gephi	
  requires	
  Java	
  –	
  if	
  you	
  
are	
  on	
  a	
  Mac,	
  you	
  may	
  need	
  to	
  download	
  and	
  install	
  Java	
  yourself:	
  www.java.com	
  

16	
  
Launch	
  Gephi	
  (download	
  it	
  from	
  gephi.org	
  if	
  you	
  don’t	
  already	
  have	
  it	
  installed)	
  and	
  
select	
  Data	
  Laboratory.	
  
If	
  the	
  Data	
  Table	
  toolbar	
  is	
  empty,	
  go	
  to	
  the	
  applica3on’s	
  File	
  menu	
  and	
  select	
  ‘New	
  
Project’.	
  A	
  new	
  project	
  will	
  be	
  created	
  and	
  you	
  should	
  see	
  several	
  toolbar	
  op3ons	
  
appear	
  in	
  the	
  Data	
  Table.	
  

17	
  
Load	
  the	
  data	
  in	
  using	
  the	
  “Import	
  Spreadsheet”	
  tool	
  op3on.	
  Make	
  sure	
  that	
  you	
  
select	
  Edges	
  table	
  as	
  the	
  table	
  type.	
  
If	
  your	
  data	
  file	
  does	
  not	
  have	
  Source	
  and	
  Target	
  column	
  names,	
  an	
  error	
  will	
  occur	
  
and	
  you	
  will	
  not	
  be	
  able	
  to	
  import	
  the	
  data	
  file.	
  (In	
  such	
  a	
  case,	
  you	
  could	
  always	
  
open	
  the	
  file	
  in	
  a	
  text	
  editor,	
  change	
  the	
  column	
  names	
  in	
  the	
  file,	
  save	
  it,	
  and	
  try	
  
again.	
  Alterna3vely,	
  go	
  in	
  to	
  OpenRefine,	
  change	
  the	
  column	
  names	
  there,	
  and	
  re-­‐
export	
  the	
  custom	
  tabulated	
  data…)	
  

18	
  
The	
  final	
  stage	
  of	
  the	
  import	
  gives	
  some	
  addi3onal	
  informa3on	
  about	
  how	
  uploaded	
  
data	
  will	
  be	
  treated.	
  
Because	
  we	
  are	
  simply	
  loading	
  in	
  data	
  that	
  describes	
  how	
  one	
  company	
  (iden3fied	
  by	
  
its	
  name)	
  is	
  connected	
  to	
  another	
  company	
  (also	
  iden3fied	
  by	
  its	
  name),	
  we	
  need	
  to	
  
get	
  Gephi	
  to	
  automa3cally	
  create	
  a	
  node	
  each	
  3me	
  it	
  sees	
  a	
  new	
  company	
  (as	
  
iden3fied	
  by	
  its	
  company	
  name…).	
  

19	
  
When	
  the	
  data	
  is	
  imported,	
  we	
  can	
  preview	
  it,	
  either	
  by	
  looking	
  at	
  a	
  list	
  of	
  nodes	
  that	
  
have	
  been	
  created,	
  or	
  ‘edges’	
  –	
  that	
  is,	
  connec3ons	
  between	
  two	
  companies.	
  

20	
  
So	
  now	
  let’s	
  see	
  where	
  we	
  can	
  start	
  to	
  view	
  this	
  data	
  as	
  a	
  network	
  visualisa3on.	
  
Click	
  on	
  the	
  top	
  paleme	
  Overview	
  bumon	
  to	
  get	
  an	
  overview	
  of	
  the	
  network	
  in	
  visual	
  
form.	
  This	
  is	
  the	
  area	
  where	
  we	
  can	
  interac3vely	
  visualise	
  the	
  network.	
  

21	
  
The	
  default	
  Overview	
  layout	
  has	
  three	
  main	
  areas:	
  
-­‐ 	
  in	
  the	
  middle	
  is	
  the	
  canvas	
  where	
  we	
  can	
  see	
  the	
  current	
  layout	
  of	
  the	
  network;	
  
along	
  the	
  les	
  hand	
  side	
  of	
  the	
  central	
  panel	
  are	
  several	
  tools	
  for	
  opera3ng	
  on	
  the	
  
elements	
  shown	
  on	
  the	
  canvas;	
  along	
  the	
  bomom	
  of	
  the	
  central	
  panel	
  are	
  several	
  
tools	
  for	
  controlling	
  how	
  text	
  labels	
  are	
  displayed.	
  
-­‐ 	
  to	
  the	
  les	
  are	
  several	
  tools	
  for	
  manipula3ng	
  what	
  the	
  network	
  looks	
  like:	
  tools	
  for	
  
laying	
  out	
  the	
  network	
  (that	
  is,	
  posi3oning	
  the	
  nodes)	
  automa3cally,	
  as	
  well	
  as	
  
colouring	
  and	
  sizing	
  the	
  nodes;	
  
-­‐	
  to	
  the	
  right	
  are	
  several	
  tools	
  that	
  allow	
  us	
  to	
  analyse	
  and	
  process	
  the	
  graph	
  (that	
  is,	
  
the	
  mathema3cal	
  structure	
  that	
  defines	
  the	
  network);	
  for	
  example,	
  we	
  can	
  run	
  
various	
  sta3s3cs	
  on	
  the	
  network,	
  or	
  filter	
  the	
  nodes	
  that	
  are	
  displayed	
  according	
  to	
  
one	
  or	
  more	
  specified	
  criteria.	
  

22	
  
Let’s	
  start	
  by	
  laying	
  out	
  the	
  network.	
  There	
  are	
  several	
  layout	
  tools	
  provided	
  by	
  
default	
  (you	
  can	
  install	
  more	
  from	
  the	
  Tools-­‐>Plugins	
  	
  menu)	
  which	
  each	
  have	
  slightly	
  
different	
  behaviours	
  and	
  can	
  be	
  differently	
  effec3ve	
  at	
  laying	
  out	
  networks	
  with	
  
different	
  sorts	
  of	
  structure.	
  
A	
  couple	
  of	
  good	
  all-­‐round	
  layout	
  algorithms	
  are:	
  
-­‐ 	
  ForceAtlas2	
  
-­‐ 	
  Yifan	
  Hu.	
  
If	
  you	
  imagine	
  connected	
  nodes	
  held	
  together	
  by	
  springs,	
  you	
  can	
  thing	
  of	
  these	
  
layout	
  tools	
  as	
  trying	
  to	
  posi3on	
  the	
  nodes	
  so	
  that	
  the	
  springs	
  are	
  stretched	
  as	
  limle	
  
as	
  possible.	
  Sort	
  of.	
  

23	
  
At	
  the	
  moment,	
  we	
  don’t	
  know	
  what	
  each	
  node	
  represents.	
  By	
  default,	
  when	
  labels	
  
are	
  switched	
  on,	
  Gephi	
  looks	
  for	
  a	
  label	
  column	
  value	
  associated	
  with	
  a	
  node	
  and	
  
displays	
  that.	
  But	
  we	
  can	
  also	
  display	
  other	
  values.	
  In	
  this	
  case,	
  we	
  are	
  using	
  a	
  
company	
  name	
  as	
  the	
  node	
  ID,	
  so	
  we	
  can	
  select	
  id	
  as	
  the	
  element	
  to	
  display	
  when	
  we	
  
switch	
  labels	
  on.	
  Click	
  on	
  the	
  clipboard	
  icon	
  on	
  the	
  toolbar	
  at	
  the	
  bomom	
  of	
  the	
  
screen	
  to	
  raise	
  the	
  label	
  selector.	
  
To	
  actually	
  switch	
  labels	
  on,	
  click	
  on	
  the	
  lesmost/darket	
  T	
  bumon	
  on	
  the	
  toolbar	
  at	
  
the	
  bomom	
  of	
  the	
  screen.	
  
The	
  slider	
  on	
  the	
  right	
  controls	
  the	
  text	
  label	
  size.	
  

24	
  
We	
  can	
  also	
  change	
  the	
  size	
  of	
  labels	
  propor3onal	
  to	
  the	
  size	
  of	
  a	
  node	
  –	
  but	
  how	
  do	
  
we	
  size	
  nodes?	
  
Whilst	
  it	
  is	
  possible	
  to	
  load	
  in	
  data	
  that	
  describes	
  various	
  amributes	
  associated	
  with	
  
each	
  node	
  (for	
  example,	
  in	
  the	
  case	
  of	
  a	
  company	
  node	
  it	
  might	
  be	
  the	
  turnover	
  or	
  
profit	
  in	
  the	
  last	
  financial	
  year),	
  we	
  can	
  also	
  generate	
  informa3on	
  about	
  each	
  node	
  
based	
  on	
  various	
  network	
  proper3es.	
  
For	
  example,	
  the	
  degree	
  of	
  a	
  node	
  says	
  how	
  many	
  connec3ons	
  it	
  has	
  with	
  other	
  
nodes.	
  Where	
  connec3ons	
  are	
  ‘directed’	
  –	
  that	
  is,	
  represented	
  by	
  arrows	
  –	
  the	
  
number	
  of	
  arrows	
  that	
  leave	
  a	
  node	
  is	
  referred	
  to	
  as	
  the	
  out-­‐degree	
  of	
  the	
  node,	
  and	
  
the	
  number	
  of	
  arrows	
  that	
  come	
  into	
  a	
  node	
  as	
  the	
  in-­‐degree.	
  

25	
  
We	
  can	
  use	
  the	
  Average	
  Degree	
  sta3s3c	
  tool	
  to	
  calculate	
  the	
  degree,	
  in-­‐degree	
  and	
  
out-­‐degree	
  values	
  for	
  each	
  node.	
  
We	
  can	
  then	
  use	
  these	
  values	
  as	
  the	
  basis	
  for	
  sizing	
  the	
  nodes	
  in	
  the	
  network	
  
visualisa3on.	
  

26	
  
Here	
  we	
  have	
  sized	
  the	
  nodes	
  by	
  Degree.	
  The	
  min	
  and	
  max	
  size	
  parameters	
  can	
  be	
  set	
  
as	
  required	
  to	
  scale	
  the	
  size	
  of	
  the	
  nodes.	
  

27	
  
We	
  can	
  set	
  the	
  label	
  size	
  so	
  that	
  it	
  is	
  propor3onal	
  to	
  the	
  node	
  size	
  –	
  from	
  the	
  black/
dark	
  A	
  label	
  on	
  the	
  toolbar	
  at	
  the	
  bomom	
  of	
  the	
  screen,	
  select	
  the	
  [proporIonal	
  to]	
  
Node	
  Size	
  menu	
  op3on.	
  

28	
  
As	
  well	
  as	
  tools	
  for	
  genera3ng	
  grandscale	
  layouts,	
  there	
  are	
  also	
  layout	
  tools	
  for	
  
tweaking	
  a	
  par3cular	
  layout.	
  
The	
  Expansion	
  tool	
  just	
  stretches	
  (or	
  shrinks)	
  the	
  layout	
  in	
  the	
  x	
  and	
  y	
  direc3ons.	
  This	
  
can	
  be	
  good	
  for	
  just	
  pu]ng	
  a	
  bit	
  of	
  space	
  into	
  a	
  layout.	
  
The	
  Label	
  Adjust	
  tool	
  juggles	
  nodes	
  so	
  that	
  their	
  labels	
  don’t	
  overlap.	
  Note	
  that	
  this	
  
tool	
  may	
  move	
  some	
  nodes	
  quite	
  a	
  distance	
  compared	
  to	
  their	
  neighbours	
  and	
  so	
  
may	
  upset	
  any	
  meaningful	
  spa3al	
  rela3onships	
  obtained	
  using	
  the	
  other	
  layout	
  tools.	
  

29	
  
We	
  can	
  colour	
  and	
  size	
  nodes	
  according	
  to	
  a	
  wide	
  range	
  of	
  proper3es	
  obtained	
  from	
  
running	
  various	
  network	
  sta3s3cs.	
  
As	
  you	
  work	
  with	
  network	
  data	
  more	
  and	
  more,	
  you	
  start	
  to	
  get	
  a	
  feel	
  for	
  which	
  tools	
  
to	
  use	
  to	
  help	
  you	
  look	
  for	
  par3cular	
  pamerns,	
  structures	
  and	
  stories	
  within	
  the	
  data.	
  
But	
  that	
  is	
  a	
  tutorial	
  for	
  another	
  day…	
  

30	
  
We	
  can	
  use	
  various	
  tools	
  in	
  concert	
  to	
  tweak	
  the	
  layout	
  of	
  the	
  network.	
  
In	
  this	
  example,	
  I	
  have:	
  
-­‐ 	
  sized	
  the	
  nodes	
  by	
  degree;	
  
-­‐ 	
  set	
  the	
  label	
  sizes	
  propor3onal	
  to	
  the	
  Degree;	
  
-­‐ 	
  tweaked	
  the	
  scale	
  using	
  the	
  text-­‐size	
  slide;	
  
-­‐ 	
  used	
  the	
  Authority	
  value	
  (obtained	
  via	
  the	
  HITS	
  sta3s3c)	
  to	
  colour	
  the	
  nodes;	
  
-­‐ 	
  laid	
  out	
  the	
  network	
  using	
  a	
  ForceAtlas2	
  algorithm,	
  a	
  bit	
  of	
  Expansion	
  and	
  a	
  dash	
  of	
  
Label	
  Adjust.	
  

31	
  
If	
  you	
  want	
  to	
  know	
  more,	
  contact	
  us…	
  

32	
  

More Related Content

What's hot

Media and Information Literacy (MIL) 1. Introduction to Media and Informatio...
Media and Information Literacy (MIL)  1. Introduction to Media and Informatio...Media and Information Literacy (MIL)  1. Introduction to Media and Informatio...
Media and Information Literacy (MIL) 1. Introduction to Media and Informatio...Arniel Ping
 
TRENDS-NETWORKS-QUARTER-1-WEEK-4.docx
TRENDS-NETWORKS-QUARTER-1-WEEK-4.docxTRENDS-NETWORKS-QUARTER-1-WEEK-4.docx
TRENDS-NETWORKS-QUARTER-1-WEEK-4.docxRomelChristianZamora
 
Common productivity tools: Advanced Word Processing Skills, Advanced Spreadsh...
Common productivity tools: Advanced Word Processing Skills, Advanced Spreadsh...Common productivity tools: Advanced Word Processing Skills, Advanced Spreadsh...
Common productivity tools: Advanced Word Processing Skills, Advanced Spreadsh...Edgar Dumalaog Jr.
 
Media and Information Literacy 104
Media and Information Literacy 104Media and Information Literacy 104
Media and Information Literacy 104Marvin Broñoso
 
Intrographics and ict projects
Intrographics and ict projectsIntrographics and ict projects
Intrographics and ict projectsCharlotte Quemada
 
Em tech tg-acad-v5-112316
Em tech tg-acad-v5-112316Em tech tg-acad-v5-112316
Em tech tg-acad-v5-112316Live Angga
 
Chapter 1 Introduction to Media and Information Literacy
Chapter 1 Introduction to Media and Information LiteracyChapter 1 Introduction to Media and Information Literacy
Chapter 1 Introduction to Media and Information LiteracyJuvywen
 
Empowerment Technologies - Online Platforms as Tools for ICT Content Development
Empowerment Technologies - Online Platforms as Tools for ICT Content DevelopmentEmpowerment Technologies - Online Platforms as Tools for ICT Content Development
Empowerment Technologies - Online Platforms as Tools for ICT Content DevelopmentLany Lyn Magdaraog
 
Ict project-for-social-change
Ict project-for-social-changeIct project-for-social-change
Ict project-for-social-changeella_rexjune
 
Empowerment Technology by: Maria Elisa Pal and Rodel Reyes
Empowerment Technology  by: Maria Elisa Pal and Rodel ReyesEmpowerment Technology  by: Maria Elisa Pal and Rodel Reyes
Empowerment Technology by: Maria Elisa Pal and Rodel Reyesandregoron
 
Media and Information Literacy (MIL)- Visual Information and Media (Part 1)
Media and Information Literacy (MIL)- Visual Information and Media (Part 1)Media and Information Literacy (MIL)- Visual Information and Media (Part 1)
Media and Information Literacy (MIL)- Visual Information and Media (Part 1)Arniel Ping
 
Empowerment technologies byShyna Gane Daleon and Galasao
Empowerment technologies byShyna Gane  Daleon and Galasao Empowerment technologies byShyna Gane  Daleon and Galasao
Empowerment technologies byShyna Gane Daleon and Galasao janelle17
 
Study and Appreciation of Philippine Arts
Study and Appreciation of Philippine Arts Study and Appreciation of Philippine Arts
Study and Appreciation of Philippine Arts Jeanelei Carolino
 
Empowerment Technology: Integrating Materials (MS Word)
Empowerment Technology: Integrating Materials (MS Word)Empowerment Technology: Integrating Materials (MS Word)
Empowerment Technology: Integrating Materials (MS Word)Faux Xerxes
 
Repleksyong Papel: Methods at Media
Repleksyong Papel: Methods at MediaRepleksyong Papel: Methods at Media
Repleksyong Papel: Methods at MediaShairah Cometa
 

What's hot (20)

Module 4 EMPOWERMENT TECHNOLOGY
Module 4 EMPOWERMENT TECHNOLOGYModule 4 EMPOWERMENT TECHNOLOGY
Module 4 EMPOWERMENT TECHNOLOGY
 
Media and Information Literacy (MIL) 1. Introduction to Media and Informatio...
Media and Information Literacy (MIL)  1. Introduction to Media and Informatio...Media and Information Literacy (MIL)  1. Introduction to Media and Informatio...
Media and Information Literacy (MIL) 1. Introduction to Media and Informatio...
 
TRENDS-NETWORKS-QUARTER-1-WEEK-4.docx
TRENDS-NETWORKS-QUARTER-1-WEEK-4.docxTRENDS-NETWORKS-QUARTER-1-WEEK-4.docx
TRENDS-NETWORKS-QUARTER-1-WEEK-4.docx
 
Common productivity tools: Advanced Word Processing Skills, Advanced Spreadsh...
Common productivity tools: Advanced Word Processing Skills, Advanced Spreadsh...Common productivity tools: Advanced Word Processing Skills, Advanced Spreadsh...
Common productivity tools: Advanced Word Processing Skills, Advanced Spreadsh...
 
Media and Information Literacy 104
Media and Information Literacy 104Media and Information Literacy 104
Media and Information Literacy 104
 
Intrographics and ict projects
Intrographics and ict projectsIntrographics and ict projects
Intrographics and ict projects
 
Em tech tg-acad-v5-112316
Em tech tg-acad-v5-112316Em tech tg-acad-v5-112316
Em tech tg-acad-v5-112316
 
Chapter 1 Introduction to Media and Information Literacy
Chapter 1 Introduction to Media and Information LiteracyChapter 1 Introduction to Media and Information Literacy
Chapter 1 Introduction to Media and Information Literacy
 
Empowerment Technologies - Online Platforms as Tools for ICT Content Development
Empowerment Technologies - Online Platforms as Tools for ICT Content DevelopmentEmpowerment Technologies - Online Platforms as Tools for ICT Content Development
Empowerment Technologies - Online Platforms as Tools for ICT Content Development
 
Ict project-for-social-change
Ict project-for-social-changeIct project-for-social-change
Ict project-for-social-change
 
Empowerment Technology by: Maria Elisa Pal and Rodel Reyes
Empowerment Technology  by: Maria Elisa Pal and Rodel ReyesEmpowerment Technology  by: Maria Elisa Pal and Rodel Reyes
Empowerment Technology by: Maria Elisa Pal and Rodel Reyes
 
Media and Information Literacy (MIL)- Visual Information and Media (Part 1)
Media and Information Literacy (MIL)- Visual Information and Media (Part 1)Media and Information Literacy (MIL)- Visual Information and Media (Part 1)
Media and Information Literacy (MIL)- Visual Information and Media (Part 1)
 
Productivity tools
Productivity toolsProductivity tools
Productivity tools
 
EAPP- COT 2.finale.pptx
EAPP- COT 2.finale.pptxEAPP- COT 2.finale.pptx
EAPP- COT 2.finale.pptx
 
Empowerment technologies byShyna Gane Daleon and Galasao
Empowerment technologies byShyna Gane  Daleon and Galasao Empowerment technologies byShyna Gane  Daleon and Galasao
Empowerment technologies byShyna Gane Daleon and Galasao
 
Study and Appreciation of Philippine Arts
Study and Appreciation of Philippine Arts Study and Appreciation of Philippine Arts
Study and Appreciation of Philippine Arts
 
ICT as a Platform for Change
ICT as a Platform for Change ICT as a Platform for Change
ICT as a Platform for Change
 
Empowerment Technology: Integrating Materials (MS Word)
Empowerment Technology: Integrating Materials (MS Word)Empowerment Technology: Integrating Materials (MS Word)
Empowerment Technology: Integrating Materials (MS Word)
 
Manipulative Media
Manipulative MediaManipulative Media
Manipulative Media
 
Repleksyong Papel: Methods at Media
Repleksyong Papel: Methods at MediaRepleksyong Papel: Methods at Media
Repleksyong Papel: Methods at Media
 

Viewers also liked

School Of Data - mapping opencorporates networks using openrefine and Gephi
School Of Data - mapping opencorporates networks using openrefine and GephiSchool Of Data - mapping opencorporates networks using openrefine and Gephi
School Of Data - mapping opencorporates networks using openrefine and GephiTony Hirst
 
Mapping Corporate Networks With OpenCorporates
Mapping Corporate Networks With OpenCorporatesMapping Corporate Networks With OpenCorporates
Mapping Corporate Networks With OpenCorporatesTony Hirst
 
Scoda company networks2
Scoda company networks2Scoda company networks2
Scoda company networks2Tony Hirst
 
Scoda openrefine-directordata
Scoda openrefine-directordataScoda openrefine-directordata
Scoda openrefine-directordataTony Hirst
 
HumaNum - ANF - Fréjus 23-26 septembre 2013 - Visualiser les données de la re...
HumaNum - ANF - Fréjus 23-26 septembre 2013 - Visualiser les données de la re...HumaNum - ANF - Fréjus 23-26 septembre 2013 - Visualiser les données de la re...
HumaNum - ANF - Fréjus 23-26 septembre 2013 - Visualiser les données de la re...simon le bayon (phd)
 
Gephi Tutorial Visualization
Gephi Tutorial VisualizationGephi Tutorial Visualization
Gephi Tutorial VisualizationGephi Consortium
 

Viewers also liked (7)

School Of Data - mapping opencorporates networks using openrefine and Gephi
School Of Data - mapping opencorporates networks using openrefine and GephiSchool Of Data - mapping opencorporates networks using openrefine and Gephi
School Of Data - mapping opencorporates networks using openrefine and Gephi
 
Gephi Quick Start
Gephi Quick StartGephi Quick Start
Gephi Quick Start
 
Mapping Corporate Networks With OpenCorporates
Mapping Corporate Networks With OpenCorporatesMapping Corporate Networks With OpenCorporates
Mapping Corporate Networks With OpenCorporates
 
Scoda company networks2
Scoda company networks2Scoda company networks2
Scoda company networks2
 
Scoda openrefine-directordata
Scoda openrefine-directordataScoda openrefine-directordata
Scoda openrefine-directordata
 
HumaNum - ANF - Fréjus 23-26 septembre 2013 - Visualiser les données de la re...
HumaNum - ANF - Fréjus 23-26 septembre 2013 - Visualiser les données de la re...HumaNum - ANF - Fréjus 23-26 septembre 2013 - Visualiser les données de la re...
HumaNum - ANF - Fréjus 23-26 septembre 2013 - Visualiser les données de la re...
 
Gephi Tutorial Visualization
Gephi Tutorial VisualizationGephi Tutorial Visualization
Gephi Tutorial Visualization
 

Similar to School of Data - mapping company networks

Scoda project companygraph
Scoda project companygraphScoda project companygraph
Scoda project companygraphTony Hirst
 
Scraping Handout
Scraping HandoutScraping Handout
Scraping HandoutCindy Royal
 
Rits Brown Bag - Salesforce Lightning External Connection
Rits Brown Bag - Salesforce Lightning External ConnectionRits Brown Bag - Salesforce Lightning External Connection
Rits Brown Bag - Salesforce Lightning External ConnectionRight IT Services
 
Oracle to vb 6.0 connectivity
Oracle to vb 6.0 connectivityOracle to vb 6.0 connectivity
Oracle to vb 6.0 connectivityrohit vishwakarma
 
Osss manual-5-extract data
Osss manual-5-extract dataOsss manual-5-extract data
Osss manual-5-extract datacwarner7_11
 
Osgis 2010 notes
Osgis 2010 notesOsgis 2010 notes
Osgis 2010 notesJoanne Cook
 
Using OpenCalais in the context of linked data
Using OpenCalais in the context of linked dataUsing OpenCalais in the context of linked data
Using OpenCalais in the context of linked dataguest342da08
 
IBM Cognos tutorial - ABC LEARN
IBM Cognos tutorial - ABC LEARNIBM Cognos tutorial - ABC LEARN
IBM Cognos tutorial - ABC LEARNabclearnn
 
XPages - The Ties That Bind
XPages - The Ties That BindXPages - The Ties That Bind
XPages - The Ties That BindMichael McGarel
 
Final Project Write-up
Final Project Write-upFinal Project Write-up
Final Project Write-upshiyang feng
 
Ez platform meetup, madrid 21 marzo 2018 english
Ez platform meetup, madrid 21 marzo 2018   englishEz platform meetup, madrid 21 marzo 2018   english
Ez platform meetup, madrid 21 marzo 2018 englishcrevillo
 
HTML Flight Scraper
HTML Flight Scraper HTML Flight Scraper
HTML Flight Scraper Anthony Kilde
 
Using pandas library for data analysis in python
Using pandas library for data analysis in pythonUsing pandas library for data analysis in python
Using pandas library for data analysis in pythonBruce Jenks
 
SharePoint - Crunch the Numbers Together
SharePoint - Crunch the Numbers TogetherSharePoint - Crunch the Numbers Together
SharePoint - Crunch the Numbers TogetherDavid J Rosenthal
 
From_SPSS Modeler_to_KNIME_v4.7_ebook.pdf
From_SPSS Modeler_to_KNIME_v4.7_ebook.pdfFrom_SPSS Modeler_to_KNIME_v4.7_ebook.pdf
From_SPSS Modeler_to_KNIME_v4.7_ebook.pdfVeniAgustina1
 
Events Registration System Part 1
Events Registration System Part 1Events Registration System Part 1
Events Registration System Part 1Adolfo Nasol
 

Similar to School of Data - mapping company networks (20)

Scoda project companygraph
Scoda project companygraphScoda project companygraph
Scoda project companygraph
 
Scraping Handout
Scraping HandoutScraping Handout
Scraping Handout
 
Rits Brown Bag - Salesforce Lightning External Connection
Rits Brown Bag - Salesforce Lightning External ConnectionRits Brown Bag - Salesforce Lightning External Connection
Rits Brown Bag - Salesforce Lightning External Connection
 
Oracle to vb 6.0 connectivity
Oracle to vb 6.0 connectivityOracle to vb 6.0 connectivity
Oracle to vb 6.0 connectivity
 
Osss manual-5-extract data
Osss manual-5-extract dataOsss manual-5-extract data
Osss manual-5-extract data
 
Dwh faqs
Dwh faqsDwh faqs
Dwh faqs
 
Osgis 2010 notes
Osgis 2010 notesOsgis 2010 notes
Osgis 2010 notes
 
Using OpenCalais in the context of linked data
Using OpenCalais in the context of linked dataUsing OpenCalais in the context of linked data
Using OpenCalais in the context of linked data
 
IBM Cognos tutorial - ABC LEARN
IBM Cognos tutorial - ABC LEARNIBM Cognos tutorial - ABC LEARN
IBM Cognos tutorial - ABC LEARN
 
XPages - The Ties That Bind
XPages - The Ties That BindXPages - The Ties That Bind
XPages - The Ties That Bind
 
Final Project Write-up
Final Project Write-upFinal Project Write-up
Final Project Write-up
 
Semantic Web - Introduction
Semantic Web - IntroductionSemantic Web - Introduction
Semantic Web - Introduction
 
Ez platform meetup, madrid 21 marzo 2018 english
Ez platform meetup, madrid 21 marzo 2018   englishEz platform meetup, madrid 21 marzo 2018   english
Ez platform meetup, madrid 21 marzo 2018 english
 
HTML Flight Scraper
HTML Flight Scraper HTML Flight Scraper
HTML Flight Scraper
 
Using pandas library for data analysis in python
Using pandas library for data analysis in pythonUsing pandas library for data analysis in python
Using pandas library for data analysis in python
 
SharePoint - Crunch the Numbers Together
SharePoint - Crunch the Numbers TogetherSharePoint - Crunch the Numbers Together
SharePoint - Crunch the Numbers Together
 
From_SPSS Modeler_to_KNIME_v4.7_ebook.pdf
From_SPSS Modeler_to_KNIME_v4.7_ebook.pdfFrom_SPSS Modeler_to_KNIME_v4.7_ebook.pdf
From_SPSS Modeler_to_KNIME_v4.7_ebook.pdf
 
Salesforce
SalesforceSalesforce
Salesforce
 
Apps1
Apps1Apps1
Apps1
 
Events Registration System Part 1
Events Registration System Part 1Events Registration System Part 1
Events Registration System Part 1
 

More from Tony Hirst

15 in 20 research fiesta
15 in 20 research fiesta15 in 20 research fiesta
15 in 20 research fiestaTony Hirst
 
Jupyternotebooks ou.pptx
Jupyternotebooks ou.pptxJupyternotebooks ou.pptx
Jupyternotebooks ou.pptxTony Hirst
 
Virtual computing.pptx
Virtual computing.pptxVirtual computing.pptx
Virtual computing.pptxTony Hirst
 
ouseful-parlihacks
ouseful-parlihacksouseful-parlihacks
ouseful-parlihacksTony Hirst
 
Gors appropriate
Gors appropriateGors appropriate
Gors appropriateTony Hirst
 
Gors appropriate
Gors appropriateGors appropriate
Gors appropriateTony Hirst
 
Robotlab jupyter
Robotlab   jupyterRobotlab   jupyter
Robotlab jupyterTony Hirst
 
Fco open data in half day th-v2
Fco open data in half day  th-v2Fco open data in half day  th-v2
Fco open data in half day th-v2Tony Hirst
 
Notes on the Future - ILI2015 Workshop
Notes on the Future - ILI2015 WorkshopNotes on the Future - ILI2015 Workshop
Notes on the Future - ILI2015 WorkshopTony Hirst
 
Community Journalism Conf - hyperlocal data wire
Community Journalism Conf - hyperlocal data wireCommunity Journalism Conf - hyperlocal data wire
Community Journalism Conf - hyperlocal data wireTony Hirst
 
Residential school 2015_robotics_interest
Residential school 2015_robotics_interestResidential school 2015_robotics_interest
Residential school 2015_robotics_interestTony Hirst
 
Data Mining - Separating Fact From Fiction - NetIKX
Data Mining - Separating Fact From Fiction - NetIKXData Mining - Separating Fact From Fiction - NetIKX
Data Mining - Separating Fact From Fiction - NetIKXTony Hirst
 
A Quick Tour of OpenRefine
A Quick Tour of OpenRefineA Quick Tour of OpenRefine
A Quick Tour of OpenRefineTony Hirst
 
Conversations with data
Conversations with dataConversations with data
Conversations with dataTony Hirst
 
Data reuse OU workshop bingo
Data reuse OU workshop bingoData reuse OU workshop bingo
Data reuse OU workshop bingoTony Hirst
 
Inspiring content - You Don't Need Big Data to Tell Good Data Stories
Inspiring content - You Don't Need Big Data to Tell Good Data Stories Inspiring content - You Don't Need Big Data to Tell Good Data Stories
Inspiring content - You Don't Need Big Data to Tell Good Data Stories Tony Hirst
 
Lincoln jun14datajournalism
Lincoln jun14datajournalismLincoln jun14datajournalism
Lincoln jun14datajournalismTony Hirst
 

More from Tony Hirst (20)

15 in 20 research fiesta
15 in 20 research fiesta15 in 20 research fiesta
15 in 20 research fiesta
 
Dev8d jupyter
Dev8d jupyterDev8d jupyter
Dev8d jupyter
 
Ili 16 robot
Ili 16 robotIli 16 robot
Ili 16 robot
 
Jupyternotebooks ou.pptx
Jupyternotebooks ou.pptxJupyternotebooks ou.pptx
Jupyternotebooks ou.pptx
 
Virtual computing.pptx
Virtual computing.pptxVirtual computing.pptx
Virtual computing.pptx
 
ouseful-parlihacks
ouseful-parlihacksouseful-parlihacks
ouseful-parlihacks
 
Gors appropriate
Gors appropriateGors appropriate
Gors appropriate
 
Gors appropriate
Gors appropriateGors appropriate
Gors appropriate
 
Robotlab jupyter
Robotlab   jupyterRobotlab   jupyter
Robotlab jupyter
 
Fco open data in half day th-v2
Fco open data in half day  th-v2Fco open data in half day  th-v2
Fco open data in half day th-v2
 
Notes on the Future - ILI2015 Workshop
Notes on the Future - ILI2015 WorkshopNotes on the Future - ILI2015 Workshop
Notes on the Future - ILI2015 Workshop
 
Community Journalism Conf - hyperlocal data wire
Community Journalism Conf - hyperlocal data wireCommunity Journalism Conf - hyperlocal data wire
Community Journalism Conf - hyperlocal data wire
 
Residential school 2015_robotics_interest
Residential school 2015_robotics_interestResidential school 2015_robotics_interest
Residential school 2015_robotics_interest
 
Data Mining - Separating Fact From Fiction - NetIKX
Data Mining - Separating Fact From Fiction - NetIKXData Mining - Separating Fact From Fiction - NetIKX
Data Mining - Separating Fact From Fiction - NetIKX
 
Week4
Week4Week4
Week4
 
A Quick Tour of OpenRefine
A Quick Tour of OpenRefineA Quick Tour of OpenRefine
A Quick Tour of OpenRefine
 
Conversations with data
Conversations with dataConversations with data
Conversations with data
 
Data reuse OU workshop bingo
Data reuse OU workshop bingoData reuse OU workshop bingo
Data reuse OU workshop bingo
 
Inspiring content - You Don't Need Big Data to Tell Good Data Stories
Inspiring content - You Don't Need Big Data to Tell Good Data Stories Inspiring content - You Don't Need Big Data to Tell Good Data Stories
Inspiring content - You Don't Need Big Data to Tell Good Data Stories
 
Lincoln jun14datajournalism
Lincoln jun14datajournalismLincoln jun14datajournalism
Lincoln jun14datajournalism
 

Recently uploaded

Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observabilityitnewsafrica
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesManik S Magar
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...itnewsafrica
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 

Recently uploaded (20)

Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 

School of Data - mapping company networks

  • 1. Some  slide  prompts  to  support  a  data  framing  inves3ga3on  around  corporate  data  –   originally  prepared  for  the  OGP  Fes3val,  London,  October  2013.   For  more  informa3on,  contact:  schoolOfData.org   1  
  • 2. These  notes  provide  a  worked  example  of  how  to  download  company  ownership   rela3onship  data  from  OpenCorproates  (opencorporates.com)  using  the  cross-­‐ plaNorm  data  cleaning  tool  OpenRefine  (openrefine.org),  and  then  visualise  the  data   using  the  cross-­‐plaNorm  Gephi  netwrok  visualisa3on  tool  (gephi.org).   2  
  • 3. OpenCorporates  is  a  private  company  that  has  set  itself  the  ambi3ous  task  of  building   a  database  of  registered  company  informa3on  for  every  legal  corporate  en3ty  in  the   world.   One  of  the  views  OpenCorporates  offers  over  at  least  some  of  the  data  in  its  database   shows  how  companies  are  connected  by  beneficial  ownership  or  shareholder   rela3onships.   Although  complex,  this  diagram  is  “human  readable”  –  the  data  is  presented  in  a  way   that  is  intended  to  make  some  sort  of  meaningful  sense  to  us.   3  
  • 4. But  as  well  as  publishing  data  for  us  humans  to  read,  OpenCorporates  also  makes   data  available  in  a  way  that  machines  can  read    -­‐  machine  readable  data.   You  may  have  heard  of  the  term  “API”  in  the  context  of  data  publishing  websites.  To   all  intents  and  purposes,  an  API  is  an  interface  that  computers  can  use  to  get   informa3on  out  of  websites  in  a  way  that  they,  and  the  databases  they  work  with,  can   understand.   The  data  is  published  in  a  format  known  as  JSON  –  Javascript  Object  Nota3on.  But   you  don’t  really  need  to  know  much  more  than  that  –  just  that  it’s  called  JSON,  and   tools  that  can  parse  and  work  with  JSON  can  parse  and  work  with  the  data  that  the   OpenCorporates  API  publishes.   4  
  • 5. If  you  aren’t  a  programmer,  here’s  way  of  ge]ng  the  data  out  of  OpenCorporates  and   into  a  tabular  form  you  may  be  more  comfortable  with,  and  which  we  can  use  to   generate  a  network  diagram  to  display  in  a  tool  such  as  Gephi…   You  can  download  the  OpenRefine  applica3on  from  openrefine.org.  When  you  run  it   on  your  computer,  it  will  launch  an  applica3on  that  runs  inside  a  browser  tab  using   your  default  web  browser.   5  
  • 6. We  can  get  company  ownership  (subsidiary  rela3ons,  major  shareholdings,  etc)  from   OpenCorporates  by  hacking  the  web  address/URL  of  a  company  page  on   OpenCorporates.   From  a  company  page  on  OpenCorporates,  which  should  have  the  form:   http://opencorporates.com/companies/JURISDICTION/ COMPANY_ID! add  the  following  to  the  end  of  the  web  address/URL:   /network.json?depth=2   to  give  something  with  the  following  form:   http://opencorporates.com/companies/JURISDICTION/ COMPANY_ID/network.json?depth=2! (Note:  company  network  data  may  not  be  available  in  all  jurisdic3ons  or  for  all   companies.)   6  
  • 7. In  OpenRefine,  select  the  op3on  to  Create  [a  new]  Project  using  the  web  address  –  or   URL  –  to  the  JSON  data  page  that  reveals  the  data  rela3ng  to  the  corporate  ownership   network  of  the  company  we  are  interested  in  on  OpenCorporates.   Note  that  you  can  import  data  into  OpenRefine  from  several  web  addresses  all  in  one   go,  though  the  data  returned  from  each  URL  should  have  the  same  format  or   structure.   Using  mul3ple  URLs  results  in  a  combined  data  set,  which  can  be  quite  handy.   7  
  • 8. Being  machine  readable,  the  data  makes  more  sense  to  OpenRefine  than  it  probably   does  to  us!     Select  a  block  of  data  in  the  preview  view  that  is  typical  of  a  set  of  data  that  you  want   to  map  into  a  single  row  in  a  “tradi3onal”  spreadsheet  like  view.   Data  blocks  are  typically  contained  within  braces  (curly  brackets);  these  things  :  {  }   Note  that  in  some  machine  readable  data,  some  data  blocks  may  be  contained  within   other  data  blocks…   Each  of  the  items  in  a  single  data  block  can  be  mapped  into  a  separate  cell  –  that  is,  a   separate  column  –  in  a  single  row  of  data.   So  each  data  block  is  a  row,  and  each  item  in  the  block  is  a  column….  OpenRefine  will   give  you  a  preview  of  how  the  data  will  look  if  you  click  the  right  bumon!   8  
  • 9. You  can  preview  the  effect  of  making  par3cular  block  selec3ons  using  Update   Preview.   To  return  to  the  block  highlighter,  use  ‘Pick  Record  Nodes’.   When  you  are  happy  with  your  selec3on,  you  are  ready  to  “Create  Project”.   9  
  • 10. Once  we’re  happy  with  the  data  preview,  we  can  import  the  data  into  a  more  familiar   looking  layout.   The  arrows  at  the  top  of  each  column  pop  up  menus  that  allow  us  to  run  a  wide   variety  of  opera3ons  on  a  column.   One  of  the  opera3ons  let’s  us  change  the  column  name,  so  I’m  going  to  rename  the   child  company  and  parent  company  columns  to  what  Gephi  expects:  Source  and   Target.   10  
  • 11. This  is  the  format  that  Gephi  wants  to  see  when  we  import  data  from  a  simple  two   column,  comma  separated  variable  (CSV)  text  file.   One  of  the  columns  needs  to  be  called  Source,  another  needs  to  be  called  Target.   When  construc3ng  the  network  diagram,  Gephi  then  knows  to  draw  a  line  going  from   each  Source  element  to  the  corresponding  Target.   11  
  • 12. The  OpenCorporates  network  data  in  tabulated  form.  The  default  column  names  are   not  necessarily  as  human  readable  as  they  could  be!   In  par3cular,  we  can  iden3fy  the  name  of  the  parent  company  and  the  child  company   for  each  ownership  rela3on.  We  also  have  access  to  the  OpenCorporates  IDs  for  all  of   those  companies.  The  type  of  rela3onship  between  the  companies  is  also  described.   For  the  moment,  we  will  treat  them  all  equally.   (If  you  want  to  view  just  those  company  connec3ons  that  relate  to  a  par3cular  type  of   rela3on,  use  the  Facet  or  Text  Filter  tool  applied  to  the  appropriate  column.)   12  
  • 13. From  the  appropriate  column  menu,  select  “Edit  Column”  and  then  “Rename  this   column”  to  change  the  column  name.   13  
  • 14. We  can  now  export  the  data  using  the  Custom  Tabular  Exporter.   Deselect  all  the  columns  then  select  just  the  Source  and  Target  columns  –  we  will  only   export  data  from  these  two  columns.   14  
  • 15. Preview  your  data  to  check  that  it  looks  like  the  sort  of  data  you  expect  to  export.   From  the  Download  tab,  select  the  CSV  output  type  and  export  your  data  –  it  should   be  saved  into  the  default  download  directory  used  by  your  browser,  with  a  file  name   that  corresponds  to  the  OpenRefine  project  name.   You  should  have  the  two  column  data  saved  to  your  computer  that  you  can  now  load   in  to  Gephi.   15  
  • 16. Gephi  is  a  powerful  cross-­‐plaNorm  desktop  tool  for  visualising  data  that  describes   networks,  such  as  social  networks  or  corporate  ownership  networks.  You  can  import   data  into  Gephi  using  specialised  graph/network  representa3on  formats,  or  from   simple  two  column  data  files  where  each  describes  a  simple  connec3on  between  two   elements  (eg  thing1,  thing2  would  say  that  thing1  connects  to  thing2).   You  can  download  the  Gephi  applica3on  from  gephi.org.  When  you  run  it  on  your   computer,  it  will  launch  a  desktop  applica3on.  Note  that  Gephi  requires  Java  –  if  you   are  on  a  Mac,  you  may  need  to  download  and  install  Java  yourself:  www.java.com   16  
  • 17. Launch  Gephi  (download  it  from  gephi.org  if  you  don’t  already  have  it  installed)  and   select  Data  Laboratory.   If  the  Data  Table  toolbar  is  empty,  go  to  the  applica3on’s  File  menu  and  select  ‘New   Project’.  A  new  project  will  be  created  and  you  should  see  several  toolbar  op3ons   appear  in  the  Data  Table.   17  
  • 18. Load  the  data  in  using  the  “Import  Spreadsheet”  tool  op3on.  Make  sure  that  you   select  Edges  table  as  the  table  type.   If  your  data  file  does  not  have  Source  and  Target  column  names,  an  error  will  occur   and  you  will  not  be  able  to  import  the  data  file.  (In  such  a  case,  you  could  always   open  the  file  in  a  text  editor,  change  the  column  names  in  the  file,  save  it,  and  try   again.  Alterna3vely,  go  in  to  OpenRefine,  change  the  column  names  there,  and  re-­‐ export  the  custom  tabulated  data…)   18  
  • 19. The  final  stage  of  the  import  gives  some  addi3onal  informa3on  about  how  uploaded   data  will  be  treated.   Because  we  are  simply  loading  in  data  that  describes  how  one  company  (iden3fied  by   its  name)  is  connected  to  another  company  (also  iden3fied  by  its  name),  we  need  to   get  Gephi  to  automa3cally  create  a  node  each  3me  it  sees  a  new  company  (as   iden3fied  by  its  company  name…).   19  
  • 20. When  the  data  is  imported,  we  can  preview  it,  either  by  looking  at  a  list  of  nodes  that   have  been  created,  or  ‘edges’  –  that  is,  connec3ons  between  two  companies.   20  
  • 21. So  now  let’s  see  where  we  can  start  to  view  this  data  as  a  network  visualisa3on.   Click  on  the  top  paleme  Overview  bumon  to  get  an  overview  of  the  network  in  visual   form.  This  is  the  area  where  we  can  interac3vely  visualise  the  network.   21  
  • 22. The  default  Overview  layout  has  three  main  areas:   -­‐   in  the  middle  is  the  canvas  where  we  can  see  the  current  layout  of  the  network;   along  the  les  hand  side  of  the  central  panel  are  several  tools  for  opera3ng  on  the   elements  shown  on  the  canvas;  along  the  bomom  of  the  central  panel  are  several   tools  for  controlling  how  text  labels  are  displayed.   -­‐   to  the  les  are  several  tools  for  manipula3ng  what  the  network  looks  like:  tools  for   laying  out  the  network  (that  is,  posi3oning  the  nodes)  automa3cally,  as  well  as   colouring  and  sizing  the  nodes;   -­‐  to  the  right  are  several  tools  that  allow  us  to  analyse  and  process  the  graph  (that  is,   the  mathema3cal  structure  that  defines  the  network);  for  example,  we  can  run   various  sta3s3cs  on  the  network,  or  filter  the  nodes  that  are  displayed  according  to   one  or  more  specified  criteria.   22  
  • 23. Let’s  start  by  laying  out  the  network.  There  are  several  layout  tools  provided  by   default  (you  can  install  more  from  the  Tools-­‐>Plugins    menu)  which  each  have  slightly   different  behaviours  and  can  be  differently  effec3ve  at  laying  out  networks  with   different  sorts  of  structure.   A  couple  of  good  all-­‐round  layout  algorithms  are:   -­‐   ForceAtlas2   -­‐   Yifan  Hu.   If  you  imagine  connected  nodes  held  together  by  springs,  you  can  thing  of  these   layout  tools  as  trying  to  posi3on  the  nodes  so  that  the  springs  are  stretched  as  limle   as  possible.  Sort  of.   23  
  • 24. At  the  moment,  we  don’t  know  what  each  node  represents.  By  default,  when  labels   are  switched  on,  Gephi  looks  for  a  label  column  value  associated  with  a  node  and   displays  that.  But  we  can  also  display  other  values.  In  this  case,  we  are  using  a   company  name  as  the  node  ID,  so  we  can  select  id  as  the  element  to  display  when  we   switch  labels  on.  Click  on  the  clipboard  icon  on  the  toolbar  at  the  bomom  of  the   screen  to  raise  the  label  selector.   To  actually  switch  labels  on,  click  on  the  lesmost/darket  T  bumon  on  the  toolbar  at   the  bomom  of  the  screen.   The  slider  on  the  right  controls  the  text  label  size.   24  
  • 25. We  can  also  change  the  size  of  labels  propor3onal  to  the  size  of  a  node  –  but  how  do   we  size  nodes?   Whilst  it  is  possible  to  load  in  data  that  describes  various  amributes  associated  with   each  node  (for  example,  in  the  case  of  a  company  node  it  might  be  the  turnover  or   profit  in  the  last  financial  year),  we  can  also  generate  informa3on  about  each  node   based  on  various  network  proper3es.   For  example,  the  degree  of  a  node  says  how  many  connec3ons  it  has  with  other   nodes.  Where  connec3ons  are  ‘directed’  –  that  is,  represented  by  arrows  –  the   number  of  arrows  that  leave  a  node  is  referred  to  as  the  out-­‐degree  of  the  node,  and   the  number  of  arrows  that  come  into  a  node  as  the  in-­‐degree.   25  
  • 26. We  can  use  the  Average  Degree  sta3s3c  tool  to  calculate  the  degree,  in-­‐degree  and   out-­‐degree  values  for  each  node.   We  can  then  use  these  values  as  the  basis  for  sizing  the  nodes  in  the  network   visualisa3on.   26  
  • 27. Here  we  have  sized  the  nodes  by  Degree.  The  min  and  max  size  parameters  can  be  set   as  required  to  scale  the  size  of  the  nodes.   27  
  • 28. We  can  set  the  label  size  so  that  it  is  propor3onal  to  the  node  size  –  from  the  black/ dark  A  label  on  the  toolbar  at  the  bomom  of  the  screen,  select  the  [proporIonal  to]   Node  Size  menu  op3on.   28  
  • 29. As  well  as  tools  for  genera3ng  grandscale  layouts,  there  are  also  layout  tools  for   tweaking  a  par3cular  layout.   The  Expansion  tool  just  stretches  (or  shrinks)  the  layout  in  the  x  and  y  direc3ons.  This   can  be  good  for  just  pu]ng  a  bit  of  space  into  a  layout.   The  Label  Adjust  tool  juggles  nodes  so  that  their  labels  don’t  overlap.  Note  that  this   tool  may  move  some  nodes  quite  a  distance  compared  to  their  neighbours  and  so   may  upset  any  meaningful  spa3al  rela3onships  obtained  using  the  other  layout  tools.   29  
  • 30. We  can  colour  and  size  nodes  according  to  a  wide  range  of  proper3es  obtained  from   running  various  network  sta3s3cs.   As  you  work  with  network  data  more  and  more,  you  start  to  get  a  feel  for  which  tools   to  use  to  help  you  look  for  par3cular  pamerns,  structures  and  stories  within  the  data.   But  that  is  a  tutorial  for  another  day…   30  
  • 31. We  can  use  various  tools  in  concert  to  tweak  the  layout  of  the  network.   In  this  example,  I  have:   -­‐   sized  the  nodes  by  degree;   -­‐   set  the  label  sizes  propor3onal  to  the  Degree;   -­‐   tweaked  the  scale  using  the  text-­‐size  slide;   -­‐   used  the  Authority  value  (obtained  via  the  HITS  sta3s3c)  to  colour  the  nodes;   -­‐   laid  out  the  network  using  a  ForceAtlas2  algorithm,  a  bit  of  Expansion  and  a  dash  of   Label  Adjust.   31  
  • 32. If  you  want  to  know  more,  contact  us…   32