An introduction to Force11 and Beyond the PDF meetings presented to the WWW2013 meeting in Rio de Janeiro, Brazil on May 15, 2013. Presenters were: Ivan Herman, W3C; Sweitze Roffel, Elsevier; David De Roure, University of Oxford; and Todd Carpenter, NISO.
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
An Introduction to Force11 at WWW2013
1. (1)
Ivan Herman, W3C
Sweitze Roffel, Elsevier
David De Roure, University of Oxford
Todd Carpenter, NISO
Open Discussion, Everyone
2. (2)
A grass roots effort to accelerate the pace and
nature of scholarly communications and e-
scholarship through technology, education
and community
4. (4)
Anyone who has a stake in moving scholarly communication into the 21st century
(>350 members)
Publishers
Library and
Information
scientists
Policy
makers
Tool
builders
Funders
Science
Social
Science
Humanities
Scholars
23. (23)
Experts find one another’s result
They engage into private or public conversations,
discussions
They may lead to
new results
new, possibly common actions (that did happen in our case)
They get to know and possibly influence one
another’s view
Etc.
25. (25)
I found out about the paper on a social site
not through a formal bibliography
I could not get to the paper directly
though my Institute’s library has a subscription to Elsevier’s
Web site…
I was not there! I.e., I had no access
I had to know (because I am part of the community) that
there is a preprint server
26. (26)
I had access to PDF: like a paper printout, but on
the screen
no access to higher resolution images
no access to the underlying data, so that I could check
some of the statements
no access to the algorithm to really try it out (have you ever
tried to read a complex computing algorithm?)
no direct link to all the other papers via the references
• if I want to read them: for each paper the whole story starts all
over again!
27. (27)
My blog led to
additional insights, possibly to both of us
maybe some practical results
• submission of a specification to a standard body in this case
the paper certainly had an impact on me …
But… the whole series of communication, of
references, etc., go unnoticed on the authors’ official
impact factors
and that is almost the only thing that counts for career
advancement…
28. (28)
Web sites can offer you lively experiences on
images, interactive diagrams, video, audio
possibly illustrated algorithms running real time on demand
interactive control over remote program execution
Hyperlinks are the norm: getting from one page to
another is normal and expected
29. (29)
Storage is cheap: publishing data or images beyond
pure text is common place
Data mining is real: cross references, relationships,
etc., become possible if the underlying content is
“software friendly”
30. (30)
Experts communicate through emails, Twitter,
Google+, Facebook, etc.
possibly more knowledge and information flows through
these channels than through “official” scientific
communications
this flow is not measured for scientific career purposes
Pace of information exchange is higher, a publication
must be made available almost instantaneously
compare it to the long publication delays through official
channels
31. (31)
There is an information overload; people expect
technical help in managing it
Collaborative platforms come to the fore where
scientific discourse may happen through common
development and discussion
etc.
33. (33)
Web technology researchers have a special role
many of the issues are related to Web technologies
our community may provide some of the technological
underpinning of a new scholarly world
34. (34)
What is a publication on the Web?
documents?
pictures?
data (lots of them!)
algorithms, running code
all of the above bundled together!
What/how do you store, how to you refer to those?
How do you do meaningful search?
35. (35)
Better tools!!!
authoring, say, XHTML is still very hard
reviewing process through the Web is almost non-existent
• we need comments, annotation, revision history, provenance,…
Identity management
one can have URI-s for persons, events, publications…
…but, in practice, there are many for each, and no proper
bridges exists!
36. (36)
Proper metadata on publications, libraries, citations
currently: “one standard is good, more is better”
Right expressions, right management
industrial researchers have different requirements than
university researchers
how to combine openness with (necessary) protection?
Better “impact factor” measures
include influence through social sites
39. (39)
A
lot
of
these
systems
don’t
really
talk
to
each
other
that
well
LaTeX
is
pre
webtech
-‐
Metafont
is
incompatable
with
unicode
and
many
,
many
more…
40. (40)
• Mission: to create a comprehensive set of math fonts
that serve the scientific and engineering community
• Unicode based
• Applicable to both print + online
• Collaboration between AIP, ACS, AMS, IEEE, APS,
and Elsevier
• Status:
– STIX fonts 1.1.0 released as SIL Open Fonts Licence
– LaTeX beta released early 2013
– ~300 downloads/week
http://www.stixfonts.org/
41. (41)
• New,
open-‐source
technology
to
render
mathemaAcs
on
the
web
• Crisp
display
at
any
level
of
zoom
and
seamless
integraAon
into
HTML
• Work
with
both
MathML
and
LaTeX
• Supports
copy-‐paste
and
assisAve
technologies
• Elsevier
is
an
early
supporter
of
MathJax
Screenshot from the Article of the Future (http://
www.articleofthefuture.com/S0022314X08001856/)
showing high-resolution math display at any level of zoom http://www.mathjax.org/
42. (42)
Video: Alain Connes
explains his paper
Center
pane:
“TradiAonal”
full-‐text
view,
designed
for
opAmal
online
reading
experience
Right pane: Additional
content & tools. Shown
here: theorem browser
Left pane:
efficient
navigation &
browsing
43. (43)
Nice…..
But
mathML
is
sAll
not
real
content
mathemaAcs…
I
can’t
actually
caluculate
with
it
44. (44)
• Explore
figures
interacAvely
–
zoom,
rotate,
etc.
• Download
underlying
data
for
validaAon
&
re-‐use
http://www.elsevier.com/matlab
45. (45)
How does it work?
1. Click on the link next to
the paper
2. Takes you to the
associated code in the
cloud
• Collaboration with 3rd party that
stores the code page
• Supports R and , well matlab
• Links back and forth
• Currently live on Science Direct on
a number of journals
46. (46)
For
machine
consumable
code
separaAon
of
form
and
content
does
not
work.
In
arAcles
code
is
oUen
a
‘picture’
to
keep
lay
out
intact
in
XML/HTML/
PDF
So
we
need
to
get
the
actual
code
file
from
authors
–
keep
this
file
intact
through
the
whole
publicaAon
process
to
render
it
back
to
readers
;
but
within
the
XML
enriched
arAcle
at
the
correct
place
Currently
piloAng
this
new
process
on
the Journal of Web Semanitcs Guide For Authors; http://www.elsevier.com/about/
content-innovation/inline-supplementary-material-for-journal-articles
Please: with
your next JWS
article submit
your code also
as a .txt ISM file
47. (47)
Its
just
not
there…yet
It
would
be
nice
if
mathML
became
content
maths…
Imagine
the
services
one
could
build
with
machine
readable
mathemaAcs
Linking
theory
to
algorithms
to
forges
and
back…
Happy
to
help
48. (48)
Entities,
concepts and
relationships
Smart Content Applications
Better understanding through
analysis and visualization
• Question & Answer"
• Actionable Content & Alerts"
• Tag clouds
• Heatmaps"
• Animations"
Better discovery through
semantic search & navigation
• Faceted search & browse
• Ontology-driven navigation
• Task-specific results
• Personalized/localized results
• Link to evidenced-based content"
New knowledge through
aggregation and synthesis
• Topic pages
• Social network maps
• Geolocation maps
• Data integration and mashups"
• Text mining "
• Inference and Reasoning
Images
Text
Tables
Elsevier
Content
Elsevier
knowledge
organization
systems
Linked data from
partners and the Web
Partner
Content
48
49. (49)
CONTENT
Customers may build their own toothbrush:
Run extensive searches
and use locally loaded
content for text mining
purposes for their own
research.
Perform extensive mining operations on
subscribed content .
Structuring input text
Deriving patterns within the structured text
Evaluation and interpretation of the output.
Extract semantic entities
from Elsevier content for the
purpose of recognition and
classification of the relations
between them
Integrate results on a server used for the
customer’s own mining system for access
and use by its researchers through the
customer’s internal secure network.
Enabling developers who wish to
design and implement applications to
analyse our content, or test
applications as part of their research
within Elsevier content
1
50. (50)
Content Mining/ machine access
Content Accessibility
App, data and visualisation
integration in ‘actionable’ papers
Automated
discovery for those
with no time to read
Consistent high-quality formats
and stable identifiers
Digital preservation
Mobile devices/apps
21st Century Rights
Management
65. (65)
May 15, 2013 65
Source: Citations for SEER Databases
Source: Global Land Cover Facility
Source: International Polar Year
Source: ICPSR
Source: The Economist
70. (70)
Founded by CrossRef,
Thomson-Reuters,
Nature in 2009
Now 328 participant
organizations, 50 of
which have provided
sponsorship funding
Prototype technology
Launched in fall 2011
May 15, 2013 70
71. (71)
May 15, 2013 71
Network of institutions interested in developing
long-term preservation system for research
data and content
Federated approach to preservation
Ecosystem of repositories acting as nodes
contributing “dark content” to the network
Replicating nodes contain redundant, dark
copies of all deposits that can be brightened in
cases of catastrophic loss
Will launch in early 2013
72. (72)
May 15, 2013 72
Formed in 2009
Now 20 members
Partnership of data
centers to assign
persistent identifiers to
datasets
Initially using DOIs
Provide discovery
services related to
data
73. (73)
May 15, 2013 73
Altmetrics is a community of scientists,
publishers and service providers exploring new
ways of assessing scholarly impact in novel
ways
How do we apply usage data, downloads, social
media mentions, social graph, traditional
citations, page rank, linking, “saving” in citation
services, to both traditional and new media
Addressing questions like: what is the use of a
data set or a piece of software mean?
Capture a richer set of assessment measures
76. (76)
May 15, 2013 76
Todd Carpenter
Executive Director
tcarpenter@niso.org
National Information Standards Organization (NISO)
3600 Clipper Mill Road, Suite 302
Baltimore, MD 21211 USA
+1 (301) 654-2512
www.niso.org