Implementing policy @ WSSSPE

•Download as PPT, PDF•

1 like•837 views

Daisie Huang

Panel subpresentation on Implementing Policy in Sustainable Software

Technology Business

Implementing Policy
WSSSPE Workshop 2013

Daisie Huang
Biodiversity Research Centre
University of British Columbia

Implementing Policy
•

Key issues:
•

As software matures, new problems emerge.

•

Sustainability issues should be addressed
throughout the life cycle.

•

How to implement sustainability when
resources are limited?

Implementing Policy
➡ API

Governance

➡ Software

Security

➡ Sustainability

Implementing Policy
➡ API

Governance

Developing Systems for API Governance
C Krintz, H Jayathilaka, S Dimopoulos, A Pucher, and R Wolski,
Department of Computer Science, UC Santa Barbara

API Governance
•

Scientific research
relies on access to
digital assets as well
as hardware.

•

APIs govern the
interactions between
these digital assets.
from phylotastic.org

API Governance
•

APIs need to be portable and consistent.
•

Semantic compatibility

•

Syntactic compatibility

Implementing Policy
➡ Software

Security

Toward a Research Software Security
Maturity Model
R Heiland, B Thomas, V Welch, C Jackson, Center for Trustworthy
Scientific Cyberinfrastructure, Indiana University

Software Security
•

A Security Maturity Model can formalize this
process:
•

Provides classification of software security
practices.

•

Provides a path for tightening security practices
as a package’s maturity level increases.

•

Emphasizes understandability over complexity.

Implementing Policy

➡ Sustainability

A User Perspective on Sustainable
Scientific Software
Brian Blanton and Chris Lenhardt, Renaissance Computing Institute

Sustainability
•

Tension between “getting it
done” enough to publish
scientific results and “getting
it right” for future users.

Sustainability
Co-funding

Best suited for large, collaborative projects

Sustainability
“Software carpentry”

Teach scientists to use software development best
practices.

Implementing Policy

➡ Sustainability

Software Engineering as Instrumentation
for the Long Tail of Scientific Software
Daisie Huang and Hilmar Lapp, UBC and NESCent

The Long Tail
The lifespan of scientific software can be
unexpectedly long.

The Long Tail
Lots of small programs implement different methods.

Facets of software design
•

API development

•

Security

•

User interface design

•

Test engineering

•

Deployment

Facets of software design
Phylogenetics/Genomics/Ecology/Mol
ecular Biology/Developmental Biology
•

API development

•

Security

•

User interface design

•

Test engineering

•

Deployment

Instrumentation
•

Software engineering as a resource

•

Analogous to DNA sequencing facilities

Instrumenting Software
Engineering
•

A scientific software engineering center can
provide these resources to many projects.

•

Governed by long-term vision that is not tied to
success or failure of any individual project.

•

Emphasis on executing good science by making
functional tools.

Conclusions
•

Many facets of software design not addressed in most
scientific software projects.

•

Possible solutions include:
•

•
•

large projects can hire developers with software
engineering expertise
providing scientists with software design guidance

A software engineering center can provide both
expertise and guidance to the long tail.

What's hot

Sgci data west 12-15-16

Nancy Wilkins-Diehr

1.4 Pre-requisits for using data in emergency services

Fraunhofer FOKUS

One Health, Traceability and Emerging Technologies - Mr. Thomas A. Burke, Food Traceability Scientist, Global Food Traceability Center, Institute of Food Technologists, from the 2018 NIAA Annual Conference, Livestock Traceability: Opportunities for Animal Agriculture, plus the Traceability and the Real World Interactive Workshop, April 10 - 12, Denver, CO, USA. More presentations at https://www.youtube.com/channel/UCeUDeS810OcOfuEYwj1oHKQ

Mr. Thomas A. Burke - One Health, Traceability and Emerging Technologies

John Blue

Presentation of at ChaossCon 2020 Europe https://chaoss.community/chaosscon-2020-eu/ Business and software ecosystem health can be considered as a key performance indicator of an ecosystem's potential to create opportunities for its members. In this talk, Johan will present the practical application of ecosystem health on JobTech Dev, a software ecosystem of job-matching actors in Sweden. The ecosystem is lead by the Swedish Public Employment Service and is underpinned by open data and open source software that is used and co-developed by the whole ecosystem. Elicited metrics cover the ecosystem's productivity in development and maintenance, robustness to withstand change and disruptions, and openness for new business applications, use cases, and external contributions. As the metrics have just been introduced at the Employment Service, Johan will share some initial lessons learned and discuss the roadmap of connecting the health metrics to impact metrics relating to the ecosystem's common vision of improved job-matching on the Swedish labor market.

ChaossCon 2020 - Application of Health metrics on a Cross-sector software eco...

Johan Linåker

ResuméSpring2016v2

Ghassan Makhoul

Panel members v2_datajournals_repositories_repofringe3aug2015

University of Edinburgh

Accelerometer data processing with GGIR - a success story in Research Software

Vincent van Hees

Customer Success Story: IEEE Xplore Inspires Innovation

IEEE Xplore Digital Library

Complex software-intensive systems are often described as systems of systems (SoS) due to their heterogeneous architectural elements. As SoS behavior is often only understandable during operation, runtime monitoring is needed to detect deviations from requirements. Today, while diverse monitoring approaches exist, most do not provide what is needed to monitor SoS, e.g., support for dynamically defining and deploying diverse checks across multiple systems. In this talk, I will describe our experiences of developing, applying, and evolving an approach for monitoring an SoS in the domain of industrial automation software, that is based on a domain-specific language (DSL). I will first describe our initial approach to dynamically define and check constraints in SoS at runtime, including a demo of our monitoring tool REMINDS, and then motivate and describe its evolution based on requirements elicited in an industry collaboration project. I will furthermore describe solutions we have developed to support the evolution of our approach, i.e., a code generation approach and a framework to automate testing the DSL after changes. We evaluated the expressiveness and scalability of our new DSL-based approach using an industrial SoS. At the end of the talk, I will also present general lessons we learned and give an overview of other projects in the area of software monitoring as well as other areas such as software product lines, that I am currently involved in.

Developing and Evolving a DSL-Based Approach for Runtime Monitoring of System...

Förderverein Technische Fakultät

Irving-TeraData: data and science driven big industry-nfdp13

DataDryad

Growing Software Systems

Marc

International Journal of Advanced Smart Sensor Network Systems ( IJASSN )

ijassn

WSSSPE: Building communities

Karen Cranston

Towards ecosystem for research and development of electrodermal activity appl...

Jari Jussila

Using information technology in medical professionalism

MTD Lakshan

What's hot (15)

Sgci data west 12-15-16

1.4 Pre-requisits for using data in emergency services

Mr. Thomas A. Burke - One Health, Traceability and Emerging Technologies

ChaossCon 2020 - Application of Health metrics on a Cross-sector software eco...

ResuméSpring2016v2

Panel members v2_datajournals_repositories_repofringe3aug2015

Accelerometer data processing with GGIR - a success story in Research Software

Customer Success Story: IEEE Xplore Inspires Innovation

Developing and Evolving a DSL-Based Approach for Runtime Monitoring of System...

Irving-TeraData: data and science driven big industry-nfdp13

Growing Software Systems

International Journal of Advanced Smart Sensor Network Systems ( IJASSN )

WSSSPE: Building communities

Towards ecosystem for research and development of electrodermal activity appl...

Using information technology in medical professionalism

Similar to Implementing policy @ WSSSPE

Secure DevOPS Implementation Guidance

Tej Luthra

Sustainability Training Workshop - Intro to the SSI

Software Sustainability Institute

Big Data Analytics of Software Ecosystem Health: Presentation during INFORTECH Scientific Day (23 May 2018) by Professor Tom Mens. The talk reports on ongoing research of the Software Engineering Lab of the University of Mons (UMONS) on health aspects of evolving software ecosystems. This research was conducted in collaboration with postdoctoral researchers Alexandre Decan and Eleni Constantinou, as well as the external partners of two ongoing research projects: SECOHealth (https://secohealth.github.io) and the Excellence of Science research project SECO-ASSIST (https://secoassist.github.io).

Software Ecosystems = Big Data

Tom Mens

01 fse software&sw-engineering

Mohesh Chandran

Pentest is yesterday, DevSecOps is tomorrow

Amien Harisen Rosyandino

Continuous Software Engineering - A tutorial

Breno de França

Software systems engineering PRINCIPLES

Ivano Malavolta

Cultivating Sustainable Software For Research

Neil Chue Hong

Software engineering process

KanchanPatil34

Considerations and challenges in building an end to-end microbiome workflow

Eagle Genomics

Scientific Software Challenges and Community Responses

Daniel S. Katz

RDA BoF on Sustainability - my experience with ISA tools

Susanna-Assunta Sansone

Doing Science Properly In The Digital Age - Rutgers Seminar

Neil Chue Hong

Software Security Assurance for DevOps

Black Duck by Synopsys

How do organizations build secure applications, given today's rapidly moving and evolving DevOps practices? Join Black Duck and our customer experts on best practices for application security in DevOps. You’ll learn: -New security challenges facing today’s popular DevOps and Continuous Integration (CI) practices, including managing custom code and open source risks with containers and traditional environments -Best practices for designing and incorporating an automated approach to application security into your existing development environment -Future development and application security challenges organizations will face and what they can do to prepare

Software Security Assurance for Devops

Jerika Phelps

Science gateways - also called virtual research environments or virtual labs - allow science and engineering communities to access shared data, software, computing services, instruments, and other resources specific to their disciplines and use them also in teaching environments. In the last decade mature complete science gateway frameworks have evolved such as HUBzero and Galaxy as well as Agave and Apache Airavata. Successful implementations have been adapted for several science gateways, for example, the technologies behind the science gateways CIPRES, which is used by over 20.000 users to date and serves the community in the area of large phylogenetic trees. Lessons learned from the last decade include that approaches should be technology agnostic, use standard web technologies or deliver a complete solution. Independent of the technology, the major driver for science gateways are the user communities and user engagement is key for successful science gateways. The US Science Gateways Community Institute (SGCI), opened in August 2016, provides free resources, services, experts, and ideas for creating and sustaining science gateways. It offers five areas of services to the science gateway developer and user communities: the Incubator, Extended Developer Support, the Scientific Software Collaborative, Community Engagement and Exchange, and Workforce Development. The talk will give an introduction to science gateways, examples for science gateways and an overview on the services offered by the SGCI to serve user communities and developers for creating successful science gateways.

SGCI - Science Gateways - Technology-Enhanced Research Under Consideration of...

Sandra Gesing

Sgci esip-7-20-18

Nancy Wilkins-Diehr

LEC 2asasasasasasasasasasasasasasasasa.pptx

GodFather51

MODULE 1 : Software Product and Process Introduction –FAQs About Software Engineering, Definition Of Software Engineering, Difference Between Software Engineering And Computer Science, Difference Between Software Engineering And System Engineering, Software Process, Software Process Models, The Waterfall Model, Incremental Process Models, Evolutionary Process Models Spiral Development, Prototyping, Component Based Software Engineering , The Unified Process, Attributes Of Good Software, Key Challenges Facing By Software Engineering, Verification – Validation, Computer Based System, Business Process Engineering,

MODULE 1 Software Product and Process_ SW ENGG 22CSE141.pdf

Jayanthi Kannan MK

The high-profile attacks and data-breaches of the last few years have shown us the importance of securing our software. While it is good that we are seeing more tools that can analyze systems for vulnerabilities, this does not help the programmer write secure code in the first place. To prevent security from becoming a bottleneck–and expensive security mistakes from becoming increasingly probable–we need to look to techniques that allow us to secure software by construction. This talk has two parts. First, I will present technical ideas from research, including my own, that help secure software by construction. Even though these are reasonable ideas, however, the gap between academia and industry often prevents these ideas from becoming realized in practice. Second, I will discuss what prevents longer-term security solutions from being commercialized, how we started the Cybersecurity Factory accelerator bridge the research/industry gap, and how we can work together to address the issues that remain. http://2016.phillyemergingtech.com/session/securing-software-by-construction/

Philly ETE 2016: Securing Software by Construction

jxyz

Similar to Implementing policy @ WSSSPE (20)

Secure DevOPS Implementation Guidance

Sustainability Training Workshop - Intro to the SSI

Software Ecosystems = Big Data

01 fse software&sw-engineering

Pentest is yesterday, DevSecOps is tomorrow

Continuous Software Engineering - A tutorial

Software systems engineering PRINCIPLES

Cultivating Sustainable Software For Research

Software engineering process

Considerations and challenges in building an end to-end microbiome workflow

Scientific Software Challenges and Community Responses

RDA BoF on Sustainability - my experience with ISA tools

Doing Science Properly In The Digital Age - Rutgers Seminar

Software Security Assurance for DevOps

Software Security Assurance for Devops

SGCI - Science Gateways - Technology-Enhanced Research Under Consideration of...

Sgci esip-7-20-18

LEC 2asasasasasasasasasasasasasasasasa.pptx

MODULE 1 Software Product and Process_ SW ENGG 22CSE141.pdf

Philly ETE 2016: Securing Software by Construction

Recently uploaded

The value of a flexible API Management solution for Open Banking Steve Melan, Manager for IT Innovation and Architecture - State's and Saving's Bank of Luxembourg Apidays New York 2024: The API Economy in the AI Era (April 30 & May 1, 2024) ------ Check out our conferences at https://www.apidays.global/ Do you want to sponsor or talk at one of our conferences? https://apidays.typeform.com/to/ILJeAaV8 Learn more on APIscene, the global media made by the community for the community: https://www.apiscene.io Explore the API ecosystem with the API Landscape: https://apilandscape.apiscene.io/

Apidays New York 2024 - The value of a flexible API Management solution for O...

apidays

Artificial Intelligence Chap.5 : Uncertainty

Khushali Kathiriya

Increase engagement and revenue with Muvi Live Paywall! In this presentation, we will explore the five key benefits of using Muvi Live Paywall to monetize your live streams. You'll learn how Muvi Live Paywall can help you: Monetize your live content easily: Set up pay-per-view access to your live streams and start generating revenue from your content. Increase audience engagement: Provide exclusive, premium content behind the paywall to keep your viewers engaged. Gain valuable viewer insights: Track viewer data and analytics to better understand your audience and tailor your content accordingly. Reduce content piracy: Muvi Live Paywall's security features help protect your content from unauthorized distribution. Streamline your workflow: The all-in-one platform simplifies the process of managing and monetizing your live streams. With Muvi Live Paywall, you can take control of your live stream monetization and create a sustainable business model for your content. Learn more about Muvi Live Paywall and start generating revenue from your live streams today!

Top 5 Benefits OF Using Muvi Live Paywall For Live Streams

Roshan Dwivedi

Partners Life - Insurer Innovation Award 2024

The Digital Insurer

2024: Domino Containers - The Next Step. News from the Domino Container commu...

Martijn de Jong

Boost Fertility New Invention Ups Success Rates.pdf

sudhanshuwaghmare1

As privacy and data protection regulations evolve rapidly, organizations operating in multiple jurisdictions face mounting challenges to ensure compliance and safeguard customer data. With state-specific privacy laws coming up in multiple states this year, it is essential to understand what their unique data protection regulations will require clearly. How will data privacy evolve in the US in 2024? How to stay compliant? Our panellists will guide you through the intricacies of these states' specific data privacy laws, clarifying complex legal frameworks and compliance requirements. This webinar will review: - The essential aspects of each state's privacy landscape and the latest updates - Common compliance challenges faced by organizations operating in multiple states and best practices to achieve regulatory adherence - Valuable insights into potential changes to existing regulations and prepare your organization for the evolving landscape

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments

TrustArc

Top 10 Most Downloaded Games on Play Store in 2024

SynarionITSolutions

Join our latest Connector Corner webinar to discover how UiPath Integration Service revolutionizes API-centric automation in a 'Quote to Cash' process—and how that automation empowers businesses to accelerate revenue generation. A comprehensive demo will explore connecting systems, GenAI, and people, through powerful pre-built connectors designed to speed process cycle times. Speakers: James Dickson, Senior Software Engineer Charlie Greenberg, Host, Product Marketing Manager

Connector Corner: Accelerate revenue generation using UiPath API-centric busi...

DianaGray10

🐬 The future of MySQL is Postgres 🐘

RTylerCroy

Axa Assurance Maroc - Insurer Innovation Award 2024

The Digital Insurer

Scaling API-first – The story of a global engineering organization Ian Reasor, Senior Computer Scientist - Adobe Radu Cotescu, Senior Computer Scientist - Adobe Apidays New York 2024: The API Economy in the AI Era (April 30 & May 1, 2024) ------ Check out our conferences at https://www.apidays.global/ Do you want to sponsor or talk at one of our conferences? https://apidays.typeform.com/to/ILJeAaV8 Learn more on APIscene, the global media made by the community for the community: https://www.apiscene.io Explore the API ecosystem with the API Landscape: https://apilandscape.apiscene.io/

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe

apidays

Real Time Object Detection Using Open CV

Khem

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...

Neo4j

Scaling API-first – The story of a global engineering organization

Radu Cotescu

Effective data discovery is crucial for maintaining compliance and mitigating risks in today's rapidly evolving privacy landscape. However, traditional manual approaches often struggle to keep pace with the growing volume and complexity of data. Join us for an insightful webinar where industry leaders from TrustArc and Privya will share their expertise on leveraging AI-powered solutions to revolutionize data discovery. You'll learn how to: - Effortlessly maintain a comprehensive, up-to-date data inventory - Harness code scanning insights to gain complete visibility into data flows leveraging the advantages of code scanning over DB scanning - Simplify compliance by leveraging Privya's integration with TrustArc - Implement proven strategies to mitigate third-party risks Our panel of experts will discuss real-world case studies and share practical strategies for overcoming common data discovery challenges. They'll also explore the latest trends and innovations in AI-driven data management, and how these technologies can help organizations stay ahead of the curve in an ever-changing privacy landscape.

TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery

TrustArc

The 7 Things I Know About Cyber Security After 25 Years | April 2024

Rafal Los

Three things you will take away from the session: • How to run an effective tenant-to-tenant migration • Best practices for before, during, and after migration • Tips for using migration as a springboard to prepare for Copilot in Microsoft 365 Main ideas: Migration Overview: The presentation covers the current reality of cross-tenant migrations, the triggers, phases, best practices, and benefits of a successful tenant migration Considerations: When considering a migration, it is important to consider the migration scope, performance, customization, flexibility, user-friendly interface, automation, monitoring, support, training, scalability, data integrity, data security, cost, and licensing structure Next Wave: The next wave of change includes the launch of Copilot, which requires businesses to be prepared for upcoming changes related to Copilot and the cloud, and to consolidate data and tighten governance ShareGate: ShareGate can help with pre-migration analysis, configurable migration tool, and automated, end-user driven collaborative governance

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff

sammart93

Imagine a world where information flows as swiftly as thought itself, making decision-making as fluid as the data driving it. Every moment is critical, and the right tools can significantly boost your organization’s performance. The power of real-time data automation through FME can turn this vision into reality. Aimed at professionals eager to leverage real-time data for enhanced decision-making and efficiency, this webinar will cover the essentials of real-time data and its significance. We’ll explore: FME’s role in real-time event processing, from data intake and analysis to transformation and reporting An overview of leveraging streams vs. automations FME’s impact across various industries highlighted by real-life case studies Live demonstrations on setting up FME workflows for real-time data Practical advice on getting started, best practices, and tips for effective implementation Join us to enhance your skills in real-time data automation with FME, and take your operational capabilities to the next level.

From Event to Action: Accelerate Your Decision Making with Real-Time Automation

Safe Software

Manulife - Insurer Innovation Award 2024

The Digital Insurer

Recently uploaded (20)

Apidays New York 2024 - The value of a flexible API Management solution for O...

Artificial Intelligence Chap.5 : Uncertainty

Top 5 Benefits OF Using Muvi Live Paywall For Live Streams

Partners Life - Insurer Innovation Award 2024

2024: Domino Containers - The Next Step. News from the Domino Container commu...

Boost Fertility New Invention Ups Success Rates.pdf

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments

Top 10 Most Downloaded Games on Play Store in 2024

Connector Corner: Accelerate revenue generation using UiPath API-centric busi...

🐬 The future of MySQL is Postgres 🐘

Axa Assurance Maroc - Insurer Innovation Award 2024

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe

Real Time Object Detection Using Open CV

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...

Scaling API-first – The story of a global engineering organization

TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery

The 7 Things I Know About Cyber Security After 25 Years | April 2024

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff

From Event to Action: Accelerate Your Decision Making with Real-Time Automation

Manulife - Insurer Innovation Award 2024

Implementing policy @ WSSSPE

1. Implementing Policy WSSSPE Workshop 2013 Daisie Huang Biodiversity Research Centre University of British Columbia

2. Implementing Policy • Key issues: • As software matures, new problems emerge. • Sustainability issues should be addressed throughout the life cycle. • How to implement sustainability when resources are limited?

3. Implementing Policy ➡ API Governance ➡ Software Security ➡ Sustainability

4. Implementing Policy ➡ API Governance Developing Systems for API Governance C Krintz, H Jayathilaka, S Dimopoulos, A Pucher, and R Wolski, Department of Computer Science, UC Santa Barbara

5. API Governance • Scientific research relies on access to digital assets as well as hardware. • APIs govern the interactions between these digital assets. from phylotastic.org

6. API Governance • APIs need to be portable and consistent. • Semantic compatibility • Syntactic compatibility

7. Implementing Policy ➡ API Governance ➡ Software Security ➡ Sustainability

8. Implementing Policy ➡ Software Security Toward a Research Software Security Maturity Model R Heiland, B Thomas, V Welch, C Jackson, Center for Trustworthy Scientific Cyberinfrastructure, Indiana University

9. Software Security

10. Software Security • A Security Maturity Model can formalize this process: • Provides classification of software security practices. • Provides a path for tightening security practices as a package’s maturity level increases. • Emphasizes understandability over complexity.

11. Implementing Policy ➡ API Governance ➡ Software Security ➡ Sustainability

12. Implementing Policy ➡ Sustainability A User Perspective on Sustainable Scientific Software Brian Blanton and Chris Lenhardt, Renaissance Computing Institute

13. Sustainability • Tension between “getting it done” enough to publish scientific results and “getting it right” for future users.

14. Sustainability Co-funding Best suited for large, collaborative projects

15. Sustainability “Software carpentry” Teach scientists to use software development best practices.

16. Implementing Policy ➡ Sustainability Software Engineering as Instrumentation for the Long Tail of Scientific Software Daisie Huang and Hilmar Lapp, UBC and NESCent

17. The Long Tail The lifespan of scientific software can be unexpectedly long.

18. The Long Tail Lots of small programs implement different methods.

19. Facets of software design • API development • Security • User interface design • Test engineering • Deployment

20. Facets of software design Phylogenetics/Genomics/Ecology/Mol ecular Biology/Developmental Biology • API development • Security • User interface design • Test engineering • Deployment

21. Instrumentation • Software engineering as a resource • Analogous to DNA sequencing facilities

22. Instrumenting Software Engineering • A scientific software engineering center can provide these resources to many projects. • Governed by long-term vision that is not tied to success or failure of any individual project. • Emphasis on executing good science by making functional tools.

23. Conclusions • Many facets of software design not addressed in most scientific software projects. • Possible solutions include: • • • large projects can hire developers with software engineering expertise providing scientists with software design guidance A software engineering center can provide both expertise and guidance to the long tail.

Editor's Notes

Hello, I’m Daisie Huang, and I’m an evolutionary biologist at the University of British Columbia and I’m also a software engineer. I’ll be discussing some matters of implementing policy in sustainable scientific software.
To create sustainable software, we need to look at some key issues. First, we need to acknowledge that as a software package matures, we will face new types of problems, and we need to plan for these throughout a package’s life cycle. Therefore, making scientific software sustainable means that we define policies and guidelines that the scientific community can follow and implement. But the reality of science today is that we have limited resources and rewards to encourage people to follow these policies and guidelines. Implementing policy often takes specialized expertise in software engineering.
In light of these issues, I’ll be discussing several papers that were contributed to the workshop. Some of these papers focus on specific facets of software design that are not often addressed in scientific software development, such as API governance and Software Security, and some of the papers discuss strategies to implement all of the different facets of sustainable software design in the framework of scientific software.
First, Krintz et al from The University of California at Santa Barbara’s Department of Computer Science discuss one of these important issues: developing systems for API governance.
The authors make the point that scientific research is moving away from local hardware environments towards cloud computing. Therefore, instead of focusing on access to hardware, we will need to focus on access to the digital assets: the code and the data. APIs—programming interfaces—are the main link governing interactions between these assets. Because APIs are the main interface between different digital assets, they have to be maintained in a sustainable way.
The authors focus on understanding the portability and consistency of APIs used to connect these data archives, because changes in APIs affect accessibility of data. They define two different types of compatibility, semantic compatibility and syntactic compatibility. They demonstrate an algorithmic method for categorizing a particular API port as “hard” or “easy,” at least for semantic compatibility.
Next, we’ll look at issues of software security.
Heiland et al from the Center for Trustworthy Scientific Infrastructure discussed issues related to implementing strong security measures for scientific software. They point out that cybersecurity is rarely addressed in scientific software design.
Security considerations for software vary depending on the maturity level of the software package. But when we initially develop scientific software, we generally don’t know what the final maturity level will be. Scientific software developers are probably not aware of best practices for cybersecurity. So the authors introduce the concept of Software Security Maturity Models, such as OpenSAMM and BSI-MM. These are used in industry to identify and define security vulnerabilities at different stages of the software life cycle.
They suggest that a similar Software Security Maturity Model can formalize this process: It provides classification of software security practices. It provides a path for tightening security practices as a package’s maturity level increases. It emphasizes understandability over complexity.
Finally, we’ll look at some papers that discuss implementing sustainability in scientific software.
Blanton and Lenhardt from the Renaissance Computing Institute discuss these issues from a user perspective.
The authors focus on a point that has been brought up many times in this context: There is a tension between writing code that is good enough just to “get it done,” i.e. to publish a paper about scientific results obtained using software, and “getting it right,” that is, developing software that is comprehensible to future users and reviewers. Just because the elevator panel works like this doesn’t mean it’s sustainable for the long run. We don’t have a way to validate that the software used in a paper is actually done right. The best way to get software designed correctly is to make sure best practices are considered from the start.
The authors highlight two models for sustainable software, at different extremes: One is what they call “co-funding”: In these projects, usually large, multi-year collaborations, there is equal emphasis on both the science and the software development. Both are planned into the project from inception. In the life sciences, the iPlant Collaborative, Galaxy Project, and Qiime are good examples of these sorts of large, well-designed projects.
At the other extreme, they discuss “software carpentry”: in this model, it’s assumed that the scientists themselves will write and maintain their code. Groups like Software Carpentry and ROpenSci assume that scientists won’t have access to dedicated software engineering, so they try to give them tools to use best practices in their own software development.
There might be a middle ground here: a way to get the engineering expertise that large co-funded projects have to individual scientist-developers. Hilmar Lapp of NESCent and I discuss one such possibility in our paper, Software Engineering as Instrumentation for the Long Tail of Scientific Software.
What do we mean when we refer to the “long tail” of scientific software? Think of the distribution of resources in scientific software. Most are focused on big projects with lots of community buy-in and funding. But a lot of scientific software exists away from this model. For example, scientific software can be used long after the original developer has moved on or the funding runs out. Look at MacClade: it was originally released in 1986 and last updated in 2005, but it was still cited over 400 times in 2013! The scientists who developed it have a newer package, Mesquite, that was meant to replace MacClade, but they haven’t had sufficient time or resources to maintain either package fully, let alone both of them.
Another dimension of the long tail can also be found in my particular research domain. In the field of phylogenetics, we have a lot of programs that implement different computational methods in slightly different ways. Here, Joe Felsenstein has listed some (but not anywhere near all) phylogenetics packages available online. Most of these programs are developed by academic scientists… They generally have limited training in software engineering Limited time or career incentive to improve software Limited funding
So, to summarize a bit: Making sustainable software means we have to pay attention to many facets of software design, like APIs, security, user experience, testing, etc. A single project that requires one full-time software engineer may actually require fractions of different kinds of engineers. But long-tail projects can’t even fund one FTE, let alone one that can address all these facets.
Then we have to consider that the users of scientific software are scientists, so the developers need to understand the users and the science. This is the idea of a “t-skilled” person: one who is both well-versed in a scientific domain and deeply experienced in one or more facets of software engineering. These people are pretty rare in the first place and difficult to retain in academia, because the academic career structure doesn’t incentivize this.
We should look at software engineering as an expensive resource, but one that needs to be accessible to scientists at all levels. Think of it as analogous to DNA sequencing: Sequencers used to be something that individual labs and institutions had to buy, maintain, and operate themselves, so only highly-funded operations had them and probably didn’t use them to their full capacity even when they had one. But now, core facilities provide the instrumentation and service to labs of any size. Anyone can pay a core facility to sequence their samples for them and provide quality control and bioinformatics advice as additional services.
We propose that software engineering can be “instrumented” in a similar way. Let’s create a nonprofit center for scientific software engineering. This center can hire these t-skilled personnel and provide access to them for projects at contracted cost. Because the center is focused on providing development services to scientific projects, it is not tied to the long-term success or failure of any individual project. It would emphasize the centrality of doing good science by making functional software tools as envisioned by scientists.
So, to conclude… Implementing policies to encourage sustainability in scientific software requires that many facets of good software design are addressed throughout the lifecycle of these projects. But most of them aren’t addressed in the status quo. We’ve highlighted some of these facets today and suggested some possible solutions. Large projects can afford to hire software engineers with the expertise to implement these facets correctly. Grassroots developer groups can provide guidance to scientists about best practices in software development. We think there is a place for a software engineering center that can provide both engineering expertise and guidance with a contract-driven instrumentation model to the scientific software in the long tail.

Implementing policy @ WSSSPE

Recommended

Recommended

More Related Content

What's hot

What's hot (15)

Similar to Implementing policy @ WSSSPE

Similar to Implementing policy @ WSSSPE (20)

Recently uploaded

Recently uploaded (20)

Implementing policy @ WSSSPE

Editor's Notes