URSSI - SGCI - PresQT: Research Software and Science Gateways: Addressing Sustainability, Usability and Reproducibility Challenges to Enhance Research �
Research software plays a major role in academia evident in the fast developing landscape of simulations and modeling and new evolving areas such as deep learning. Research software has been gaining increased attention in the last 5 years than ever before. Science gateways are a subgroup of research software addressing the needs for seamless end-to-end solutions so that scientists can focus on their research questions instead of becoming acquainted with details of the often complex underlying computing and data infrastructure. Providers of distributed computing infrastructures such as XSEDE report since 2013 that more users apply their resources for research via science gateways than via command line. Recognizing the importance of software and science gateways, the conceptualization of the US Research Software Sustainability Institute (URSSI) and the Science Gateways Community Institute (SGCI) that is funded since 2016, aim at addressing challenges to achieve sustainability of software and increase the usability of complex computational methods to accelerate research. A cornerstone of science is reproducibility and even though solutions such as science gateways support reproducibility of research, the current landscape needs further tools and concepts to address the reproducibility challenges. One of the goals of the project PresQT (Preservation Quality Tool) is to step into the gap to connect existing systems to achieve easier curation and preservation of data and software and thus, contribute essentially to reproducibility of research. The talk will go into detail for the research software and science gateways landscape in the US, challenges faced by projects and the actions and vision of URSSI, SGCI and PresQT.
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...
Similar to URSSI - SGCI - PresQT: Research Software and Science Gateways: Addressing Sustainability, Usability and Reproducibility Challenges to Enhance Research �
Similar to URSSI - SGCI - PresQT: Research Software and Science Gateways: Addressing Sustainability, Usability and Reproducibility Challenges to Enhance Research � (20)
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
URSSI - SGCI - PresQT: Research Software and Science Gateways: Addressing Sustainability, Usability and Reproducibility Challenges to Enhance Research �
20. ConceptualizaHon
• Workshops
• First workshop took place in April in Berkeley
• Second workshop took place in October in
Chicago
• So+ware credit workshop will take place in
January in Santa Barbara
• Incubator workshop will take place in February in
Maryland
• Survey with about 1200 answers – in analysis
• Ethnographic studies
• Mission and vision working group
22. Technology-Enhanced Research
22
• Increased complexity of
• today’s research quesHons
• hardware and so+ware
• skills required
• Greater need for openness
and reproducibility
• Science increasingly driving
policy quesHons
• Opportunity to integrate
research with teaching
• Be_er workforce
preparaHon
We need end-to-end
solu,ons that provide
broad access to
advanced resources
and
allow all to tackle
today’s challenging
science ques,ons
è Science Gateways
30. Science Gateway Survey 2014
30
What services
would be helpful? • sent out to 29,000 persons
• 4,957 responses from
across domains
• 52% from life, physical or
mathemaHcal sciences
• 32% from computer and
informaHon sciences or
engineering
• 45% develop data collecHons
• 44% develop data analysis
tools
Proposed Service % Interest
Evaluation, impact analysis, website analytics 72%
Adapting technologies 67%
Web/visual/graphic design 67%
Choosing technologies 66%
Usability Services 66%
Visualization 65%
Developing open-source software 64%
Support for education 64%
Community engagement mechanisms 62%
Keeping your project running 62%
Legal perspectives 61%
Managing data 60%
Computational resources 59%
Mobile technology 59%
Database structure, optimization, and query
expertise
59%
Data mining and analysis 58%
Cybersecurity consultation 57%
Website construction 57%
Software engineering process consultation 53%
Source code review and/or audit 51%
High-bandwidth networks 45%
Scientific instruments or data streams 44%
Management aspects of a project 38%
31. Science Gateway Survey 2014
31
34% 36%
20%
17%
31%
26%
42%
16%
30%
18%
45% 44%
14% 15%
0%
5%
10%
15%
20%
25%
30%
35%
40%
45%
50%
Usability
Consultant
Graphic
Designer
Community
Liaison/
Evangelist
Project
Manager
Professional
Software
Developer
Security
Expert
Quality
Assurance
and Testing
Expert
Wished we had this
Yes, we had this
Well-designed gateways require a variety of exper-se
35. Sustainability via On-Campus Teams
On-campus teams
It is a centralized team at your
insHtuHon –
irrespecHve whether you are part of
a university, a naHonal lab,
an organizaHon, a consorHum or
a company…
Local teams vs. distributed and
remote teams:
For local teams it is s-ll easier to
build more trust, to be more efficient
and to create a strong culture.
h_ps://www.codementor.io/blog/modern-engineering-teams-4ea9dp+zs
Even ants wish
they had an extra
pair of hands
when developing
science gateways!
•
•
•
•
•
•
•
•
•
Is your campus seeing an increasing number of research projects that
include web-based applications? Does each group have to hire developers
independently? This can be time consuming and inefficient.
You are not alone.
Contact us to request a free consultation, webi-
nar, or on-campus visit to start your path toward
sustainable gateway development.
Addressing Software
Sustainability on
Your Campus
THEREISASOLUTION
Synergy between projects
http://sciencegateways.org/campusgroups
help@sciencegateways.org
We can provide supplemental expertise
where you don’t have it.
We can provide support for your journey to
creating a campus-based group.
We can provide ongoing advice based on
campuses who have successfully created
their own groups.
Creating a central pool of expertise on your
campus offers many benefits including:
NOWISTHERIGHTTIME!
INTERESTED?CONTACTUS!
Science gateways are online, end-to-end solutions that
provide broad access to advanced resources. They provide
a community space for science and engineering research
and education, allowing all to tackle today's challenging
science questions.
Gateways are an increasingly common component of
funded activities by many agencies. Individual PIs find it
challenging to recruit and sustain teams that offer the
diversity of expertise necessary for developing gateways.
HOWTOSTART?
39. Bridging the Gap to Data Sharing
Researchers
“the local academic community struggles to effectively manage its assets which
manifested itself in a number of challenges, and as for researchers, they lacked
storage capacity and data curation processes, and the institution lacked standard
metadata and indexing technologies, as well as tools that would support the whole
research workflow” - Digital Asset Strategy Committee, DigitalND, 2011
Libraries
Typically, data curation happens retroactively, and as a result data is either not
captured at all or available metadata is incomplete.
Pressures from the Outside
“...digitally formatted scientific data resulting from unclassified research supported
wholly or in part should be stored and publicly accessible to search, retrieve, and
analyze.” - White House OSTP Public Access Memo, Feb. 2013
Image Credit Peter Alfred Hess (CC BY 2.0)
40. Current Lifecycle of Research Projects
Hesburgh Libraries
SelecHon/
development
of tools
Data
assembling/
creaHng
Reports
PreservaHon of
Data
Funding ends
New
project
Work-intensive and
too late in the lifecycle
41. Target Lifecycle of Research Projects
Hesburgh Libraries
SelecHon/
development
of tools
Data
assembling/
creaHng
Reports
PreservaHon of
Data
Funding ends
New
project
EASY STEP!!!
(ideally)
Assure quality
of data
Assure
quality
of data
48. Repository and Tool Agnostic Solutions
• Open design of tools and services using standards
• Integrate with workflows, tools, and virtual environments
• Priority Focus Areas
➔ Available for anyone to adopt what they need and build
upon it!
49. Open Design Document
• Open design of tools and services using standards
• Integrate with workflows, tools, and virtual environments
• Priority Focus Areas
➔ Available for anyone to adopt what they need and build
upon it!
50. Partners and Committed Collaborations
• Sheridan Libraries, John Hopkins University
• NDS
• UC San Diego Library
• HUBzero team, Purdue University
• Yale University Library
• Libraries at Amherst College, Fontbonne University, Tuskegee
University, ConfederaHon of Open Access Repositories (COAR)
• ReproZip, Jupyter, CERN, RDA groups
• Midwest Big Data Hub, Science Gateways Community
InsHtute, URSSI, Center for Open Science, Data CuraHon
Network, So+ware PreservaHon Network
51. Partners and Committed Collaborations
• Sheridan Libraries, John Hopkins University
• NDS
• UC San Diego Library
• HUBzero team, Purdue University
• Yale University Library
• Libraries at Amherst College, Fontbonne University, Tuskegee
University, ConfederaHon of Open Access Repositories (COAR)
• ReproZip, Jupyter, CERN, RDA groups
• Midwest Big Data Hub, Science Gateways Community
InsHtute, URSSI, Center for Open Science, Data CuraHon
Network, So+ware PreservaHon Network
JOIN US!