SlideShare a Scribd company logo
1 of 29
Download to read offline
Archivematica
Technical Training:
Diagnostics Guide
Ashley Blewer @ Artefactual Systems, Inc.
ablewer@artefactual.com
Outline
● Technical introduction
● Microservices we use
● Software components
● Logging in
● Reading logs
● System monitoring
● Upgrading/Security
● Advanced: APIs
● Getting (free) help
High-level goals
Technical diagram
Supporting technology
● Python: programming language
● Django: web application framework
● Gearman: job scheduler
● MySQL: relational database
● Elasticsearch: search index
● Nginx: web server (can be apache)
● Gunicorn: interface between Python and Nginx
● git: version control system
● Ansible/Docker: deployment/configuration management
All on Linux
● Ubuntu 16.04 or 18.04
● CentOS 7 or Red Hat
Format Policy Registry
● Tools we use to perform preservation actions
● Rules we use to determine when to use the Tools
● Commands are applied to files based on the Rules
Tools
● Identification: FIDO and Siegfried
● Characterization: FITS, FFprobe, MediaInfo, ExifTool
● Event Detail: echo
● Extraction: 7zip, SleuthKit
● Normalization: FFmpeg, ImageMagick
● Transcription: Tesseract
● Validation: MediaConch
● Verification: File exists? Filesize is >0?
Production tools
Automation Tools https://github.com/artefactual/automation-tools/
Fixity https://github.com/artefactual/fixity
Deployment tools
Deploy-pub https://github.com/artefactual/deploy-pub
Ansible roles
https://github.com/artefactual-labs/ansible-archivematica-src
Am.git https://github.com/artefactual-labs/am
DevOps Tools
Ops-helpers https://github.com/artefactual-labs/ops-helpers
Dev-tools https://github.com/artefactual/archivematica-devtools
Technical stack
● Lots of tools = lots of potential points of failure
● Archivematica strives to relay as much information as
possible to the user -- especially about what the tools are
doing and what they are producing
Components
● Dashboard: for the user
● MCPClient: does the work
● MCPServer: manages the work
● Storage Service: manages storage
Logging in
● Logging in (ssh)
● Moving files (scp)
● What’s running (ps -sf | grep py)
● How much space? (du)
● How much free space? (df -h)
● Load average time? (top)
● Read end of logs (tail)
● Read logs (less)
Logging in
ssh username@url
Moving files
Download a file to your computer
scp
your_username@remotehost.url:your-file.txt
/your/local/directory
Send a file to your machine
scp path/to/your-file.txt
your_username@remotehost.url:/some/remote/di
rectory
What’s running?
ps -ef | grep py
These services should all be running:
● Dashboard (apache)
● Database (mysql)
● Elasticsearch (elastic)
● Storage Service (uwsgi or nginx)
● FITS
● Server (MCP) -- Should show MCP server and MCP client
What’s running?
ps -ef | grep py
Also, these dependent services should all be running:
● MySQL
● Elasticsearch
● Gearman
● Nginx
● Nailgun
● Clamav
du
To get the file size of each subdirectory of the directory you
are in, you can run this command:
du -h --max-depth=1
This command can take a long time if you have very large
mounted drives.
See amount of space on machine
Check free space on disk
df -h
● Up to 3x of free space required for processing
● cron job can auto-clear deleted/rejected files
Load averages
top
Restarting services
service archivematica-dashboard restart
service archivematica-mcp-client.service restart
service archivematica-mcp-server.service restart
service archivematica-storage-service restart
service gearmand restart
Reading logs
less /var/log/archivematica/dashboard/dashboard.log
less /var/log/archivematica/dashboard/dashboard.debug.log
less /var/log/archivematica/MCPClient/MCPClient.log
less /var/log/archivematica/MCPClient/MCPClient.debug.log
less /var/log/archivematica/MCPServer/MCPServer.log
less /var/log/archivematica/MCPServer/MCPServer.debug.log
less /var/log/archivematica/storage-service/storage-service.log
less /var/log/archivematica/storage-service/storage-service.debug.log
Finding errors
grep -rn ERROR
/var/log/archivematica/
System monitoring
Zappix/Grafana
Upgrading
● Need to decide on a new release whether you want it or not, how much
time to put aside.
● Tradeoff to not upgrading is not keeping pace with community and having
a harder time getting support from community for an older version.
● Good idea to test the upgrade- make a backup of your production
environment and test upgrade there. If that is not possible, plan for
downtime.
○ If you want to be able to do this, you might want to explore
virtualization of your Archivematica environment so you can run a
development (testing) environment in addition to the production
environment.
Security upgrades
● Make sure that Ubuntu is set-up to do Unattended Upgrades, which will
apply security patches (like equivalent of Windows updates).
● Sometimes these upgrades require the system to be restarted- you might
need to plan for 30 minutes of downtime (not in the middle of processing,
make sure your current Transfer/AIPs are done).
APIs
https://wiki.archivematica.org/Storage_Service_API
https://wiki.archivematica.org/Archivematica_API
Getting Help
● Participating in the community forum
○ Archivematica
https://groups.google.com/forum/#!forum/archivematica
● Documentation
○ Main docs https://www.archivematica.org/en/
○ Wiki https://wiki.archivematica.org/Main_Page
● Github issues
○ Main repo https://github.com/archivematica/Issues/issues
See also
This presentation in document form
● For tech-savvy preservationists:
https://docs.google.com/document/d/1GybyH7X_gpZ7wpYVo5d9__LeG
NuXYCky0oairJGJAmo/edit#heading=h.y1nyq0vlcvsl
● For Archivematica-unfamiliar systems administrators:
https://docs.google.com/document/d/1NDzGHBGuPFa7GTHCMEl3D2n
vvdZRxG2FpdsGAYoG31I/edit#
exit
:q
q
Quit
.quit
q
:q!
Ashley Blewer @ Artefactual Systems, Inc.
ablewer@artefactual.com

More Related Content

What's hot

Infrastructure & System Monitoring using Prometheus
Infrastructure & System Monitoring using PrometheusInfrastructure & System Monitoring using Prometheus
Infrastructure & System Monitoring using PrometheusMarco Pas
 
OSDC 2018 - Distributed monitoring
OSDC 2018 - Distributed monitoringOSDC 2018 - Distributed monitoring
OSDC 2018 - Distributed monitoringGianluca Arbezzano
 
Optimizing S3 Write-heavy Spark workloads
Optimizing S3 Write-heavy Spark workloadsOptimizing S3 Write-heavy Spark workloads
Optimizing S3 Write-heavy Spark workloadsdatamantra
 
All of the thing about Postman
All of the thing about PostmanAll of the thing about Postman
All of the thing about PostmanAlihossein shahabi
 
Flink Forward Berlin 2017: Andreas Kunft - Efficiently executing R Dataframes...
Flink Forward Berlin 2017: Andreas Kunft - Efficiently executing R Dataframes...Flink Forward Berlin 2017: Andreas Kunft - Efficiently executing R Dataframes...
Flink Forward Berlin 2017: Andreas Kunft - Efficiently executing R Dataframes...Flink Forward
 
Q4.11: Getting Started in LAVA
Q4.11: Getting Started in LAVAQ4.11: Getting Started in LAVA
Q4.11: Getting Started in LAVALinaro
 
Virtual Flink Forward 2020: Autoscaling Flink at Netflix - Timothy Farkas
Virtual Flink Forward 2020: Autoscaling Flink at Netflix - Timothy FarkasVirtual Flink Forward 2020: Autoscaling Flink at Netflix - Timothy Farkas
Virtual Flink Forward 2020: Autoscaling Flink at Netflix - Timothy FarkasFlink Forward
 
Prometheus loves Grafana
Prometheus loves GrafanaPrometheus loves Grafana
Prometheus loves GrafanaTobias Schmidt
 
Monitoring using Prometheus and Grafana
Monitoring using Prometheus and GrafanaMonitoring using Prometheus and Grafana
Monitoring using Prometheus and GrafanaArvind Kumar G.S
 
Python Streaming Pipelines with Beam on Flink
Python Streaming Pipelines with Beam on FlinkPython Streaming Pipelines with Beam on Flink
Python Streaming Pipelines with Beam on FlinkAljoscha Krettek
 
Pulsar connector on flink 1.14
Pulsar connector on flink 1.14Pulsar connector on flink 1.14
Pulsar connector on flink 1.14宇帆 盛
 
LCE13: LAVA Multi-Node Testing
LCE13: LAVA Multi-Node TestingLCE13: LAVA Multi-Node Testing
LCE13: LAVA Multi-Node TestingLinaro
 
Apache Airflow | What Is An Operator
Apache Airflow | What Is An OperatorApache Airflow | What Is An Operator
Apache Airflow | What Is An OperatorMarc Lamberti
 
Flink Forward Berlin 2017: Piotr Wawrzyniak - Extending Apache Flink stream p...
Flink Forward Berlin 2017: Piotr Wawrzyniak - Extending Apache Flink stream p...Flink Forward Berlin 2017: Piotr Wawrzyniak - Extending Apache Flink stream p...
Flink Forward Berlin 2017: Piotr Wawrzyniak - Extending Apache Flink stream p...Flink Forward
 
LCE13: Test and Validation Summit: Evolution of Testing in Linaro (I)
LCE13: Test and Validation Summit: Evolution of Testing in Linaro (I)LCE13: Test and Validation Summit: Evolution of Testing in Linaro (I)
LCE13: Test and Validation Summit: Evolution of Testing in Linaro (I)Linaro
 
Airflow Best Practises & Roadmap to Airflow 2.0
Airflow Best Practises & Roadmap to Airflow 2.0Airflow Best Practises & Roadmap to Airflow 2.0
Airflow Best Practises & Roadmap to Airflow 2.0Kaxil Naik
 
Coprocessors - Uses, Abuses, Solutions - presented at HBaseCon East 2016
Coprocessors - Uses, Abuses, Solutions - presented at HBaseCon East 2016Coprocessors - Uses, Abuses, Solutions - presented at HBaseCon East 2016
Coprocessors - Uses, Abuses, Solutions - presented at HBaseCon East 2016Esther Kundin
 
What's coming in Airflow 2.0? - NYC Apache Airflow Meetup
What's coming in Airflow 2.0? - NYC Apache Airflow MeetupWhat's coming in Airflow 2.0? - NYC Apache Airflow Meetup
What's coming in Airflow 2.0? - NYC Apache Airflow MeetupKaxil Naik
 

What's hot (20)

Infrastructure & System Monitoring using Prometheus
Infrastructure & System Monitoring using PrometheusInfrastructure & System Monitoring using Prometheus
Infrastructure & System Monitoring using Prometheus
 
OSDC 2018 - Distributed monitoring
OSDC 2018 - Distributed monitoringOSDC 2018 - Distributed monitoring
OSDC 2018 - Distributed monitoring
 
Optimizing S3 Write-heavy Spark workloads
Optimizing S3 Write-heavy Spark workloadsOptimizing S3 Write-heavy Spark workloads
Optimizing S3 Write-heavy Spark workloads
 
All of the thing about Postman
All of the thing about PostmanAll of the thing about Postman
All of the thing about Postman
 
Airflow 101
Airflow 101Airflow 101
Airflow 101
 
Flink Forward Berlin 2017: Andreas Kunft - Efficiently executing R Dataframes...
Flink Forward Berlin 2017: Andreas Kunft - Efficiently executing R Dataframes...Flink Forward Berlin 2017: Andreas Kunft - Efficiently executing R Dataframes...
Flink Forward Berlin 2017: Andreas Kunft - Efficiently executing R Dataframes...
 
Q4.11: Getting Started in LAVA
Q4.11: Getting Started in LAVAQ4.11: Getting Started in LAVA
Q4.11: Getting Started in LAVA
 
Virtual Flink Forward 2020: Autoscaling Flink at Netflix - Timothy Farkas
Virtual Flink Forward 2020: Autoscaling Flink at Netflix - Timothy FarkasVirtual Flink Forward 2020: Autoscaling Flink at Netflix - Timothy Farkas
Virtual Flink Forward 2020: Autoscaling Flink at Netflix - Timothy Farkas
 
Prometheus loves Grafana
Prometheus loves GrafanaPrometheus loves Grafana
Prometheus loves Grafana
 
Monitoring using Prometheus and Grafana
Monitoring using Prometheus and GrafanaMonitoring using Prometheus and Grafana
Monitoring using Prometheus and Grafana
 
Python Streaming Pipelines with Beam on Flink
Python Streaming Pipelines with Beam on FlinkPython Streaming Pipelines with Beam on Flink
Python Streaming Pipelines with Beam on Flink
 
Pulsar connector on flink 1.14
Pulsar connector on flink 1.14Pulsar connector on flink 1.14
Pulsar connector on flink 1.14
 
LCE13: LAVA Multi-Node Testing
LCE13: LAVA Multi-Node TestingLCE13: LAVA Multi-Node Testing
LCE13: LAVA Multi-Node Testing
 
Apache Airflow | What Is An Operator
Apache Airflow | What Is An OperatorApache Airflow | What Is An Operator
Apache Airflow | What Is An Operator
 
Flink Forward Berlin 2017: Piotr Wawrzyniak - Extending Apache Flink stream p...
Flink Forward Berlin 2017: Piotr Wawrzyniak - Extending Apache Flink stream p...Flink Forward Berlin 2017: Piotr Wawrzyniak - Extending Apache Flink stream p...
Flink Forward Berlin 2017: Piotr Wawrzyniak - Extending Apache Flink stream p...
 
Airflow and supervisor
Airflow and supervisorAirflow and supervisor
Airflow and supervisor
 
LCE13: Test and Validation Summit: Evolution of Testing in Linaro (I)
LCE13: Test and Validation Summit: Evolution of Testing in Linaro (I)LCE13: Test and Validation Summit: Evolution of Testing in Linaro (I)
LCE13: Test and Validation Summit: Evolution of Testing in Linaro (I)
 
Airflow Best Practises & Roadmap to Airflow 2.0
Airflow Best Practises & Roadmap to Airflow 2.0Airflow Best Practises & Roadmap to Airflow 2.0
Airflow Best Practises & Roadmap to Airflow 2.0
 
Coprocessors - Uses, Abuses, Solutions - presented at HBaseCon East 2016
Coprocessors - Uses, Abuses, Solutions - presented at HBaseCon East 2016Coprocessors - Uses, Abuses, Solutions - presented at HBaseCon East 2016
Coprocessors - Uses, Abuses, Solutions - presented at HBaseCon East 2016
 
What's coming in Airflow 2.0? - NYC Apache Airflow Meetup
What's coming in Airflow 2.0? - NYC Apache Airflow MeetupWhat's coming in Airflow 2.0? - NYC Apache Airflow Meetup
What's coming in Airflow 2.0? - NYC Apache Airflow Meetup
 

Similar to Archivematica Technical Training Diagnostics Guide (September 2018)

Linux Server Deep Dives (DrupalCon Amsterdam)
Linux Server Deep Dives (DrupalCon Amsterdam)Linux Server Deep Dives (DrupalCon Amsterdam)
Linux Server Deep Dives (DrupalCon Amsterdam)Amin Astaneh
 
linux monitoring and performance tunning
linux monitoring and performance tunning linux monitoring and performance tunning
linux monitoring and performance tunning iman darabi
 
Django deployment and rpm+yum
Django deployment and rpm+yumDjango deployment and rpm+yum
Django deployment and rpm+yumWalter Liu
 
How To Get The Most Out Of Your Hibernate, JBoss EAP 7 Application (Ståle Ped...
How To Get The Most Out Of Your Hibernate, JBoss EAP 7 Application (Ståle Ped...How To Get The Most Out Of Your Hibernate, JBoss EAP 7 Application (Ståle Ped...
How To Get The Most Out Of Your Hibernate, JBoss EAP 7 Application (Ståle Ped...Red Hat Developers
 
Turbo charge your logs
Turbo charge your logsTurbo charge your logs
Turbo charge your logsJeremy Cook
 
NGINX Installation and Tuning
NGINX Installation and TuningNGINX Installation and Tuning
NGINX Installation and TuningNGINX, Inc.
 
A3Sec Advanced Deployment System
A3Sec Advanced Deployment SystemA3Sec Advanced Deployment System
A3Sec Advanced Deployment Systema3sec
 
Turbo charge your logs
Turbo charge your logsTurbo charge your logs
Turbo charge your logsJeremy Cook
 
My "Perfect" Toolchain Setup for Grails Projects
My "Perfect" Toolchain Setup for Grails ProjectsMy "Perfect" Toolchain Setup for Grails Projects
My "Perfect" Toolchain Setup for Grails ProjectsGR8Conf
 
Nagios Conference 2014 - Eric Mislivec - Getting Started With Nagios Core
Nagios Conference 2014 - Eric Mislivec - Getting Started With Nagios CoreNagios Conference 2014 - Eric Mislivec - Getting Started With Nagios Core
Nagios Conference 2014 - Eric Mislivec - Getting Started With Nagios CoreNagios
 
Prometheus - Intro, CNCF, TSDB,PromQL,Grafana
Prometheus - Intro, CNCF, TSDB,PromQL,GrafanaPrometheus - Intro, CNCF, TSDB,PromQL,Grafana
Prometheus - Intro, CNCF, TSDB,PromQL,GrafanaSridhar Kumar N
 
Pivotal Greenplum Cloud Marketplaces - Greenplum Summit 2019
Pivotal Greenplum Cloud Marketplaces - Greenplum Summit 2019Pivotal Greenplum Cloud Marketplaces - Greenplum Summit 2019
Pivotal Greenplum Cloud Marketplaces - Greenplum Summit 2019VMware Tanzu
 
Nagios Conference 2011 - Daniel Wittenberg - Scaling Nagios At A Giant Insur...
Nagios Conference 2011 - Daniel Wittenberg -  Scaling Nagios At A Giant Insur...Nagios Conference 2011 - Daniel Wittenberg -  Scaling Nagios At A Giant Insur...
Nagios Conference 2011 - Daniel Wittenberg - Scaling Nagios At A Giant Insur...Nagios
 
Nagios Conference 2014 - Andy Brist - Nagios XI Failover and HA Solutions
Nagios Conference 2014 - Andy Brist - Nagios XI Failover and HA SolutionsNagios Conference 2014 - Andy Brist - Nagios XI Failover and HA Solutions
Nagios Conference 2014 - Andy Brist - Nagios XI Failover and HA SolutionsNagios
 

Similar to Archivematica Technical Training Diagnostics Guide (September 2018) (20)

Linux Server Deep Dives (DrupalCon Amsterdam)
Linux Server Deep Dives (DrupalCon Amsterdam)Linux Server Deep Dives (DrupalCon Amsterdam)
Linux Server Deep Dives (DrupalCon Amsterdam)
 
linux monitoring and performance tunning
linux monitoring and performance tunning linux monitoring and performance tunning
linux monitoring and performance tunning
 
Optimizing Linux Servers
Optimizing Linux ServersOptimizing Linux Servers
Optimizing Linux Servers
 
Django deployment and rpm+yum
Django deployment and rpm+yumDjango deployment and rpm+yum
Django deployment and rpm+yum
 
How To Get The Most Out Of Your Hibernate, JBoss EAP 7 Application (Ståle Ped...
How To Get The Most Out Of Your Hibernate, JBoss EAP 7 Application (Ståle Ped...How To Get The Most Out Of Your Hibernate, JBoss EAP 7 Application (Ståle Ped...
How To Get The Most Out Of Your Hibernate, JBoss EAP 7 Application (Ståle Ped...
 
Turbo charge your logs
Turbo charge your logsTurbo charge your logs
Turbo charge your logs
 
NGINX Installation and Tuning
NGINX Installation and TuningNGINX Installation and Tuning
NGINX Installation and Tuning
 
A3Sec Advanced Deployment System
A3Sec Advanced Deployment SystemA3Sec Advanced Deployment System
A3Sec Advanced Deployment System
 
Turbo charge your logs
Turbo charge your logsTurbo charge your logs
Turbo charge your logs
 
Handout: 'Open Source Tools & Resources'
Handout: 'Open Source Tools & Resources'Handout: 'Open Source Tools & Resources'
Handout: 'Open Source Tools & Resources'
 
My "Perfect" Toolchain Setup for Grails Projects
My "Perfect" Toolchain Setup for Grails ProjectsMy "Perfect" Toolchain Setup for Grails Projects
My "Perfect" Toolchain Setup for Grails Projects
 
Nagios Conference 2014 - Eric Mislivec - Getting Started With Nagios Core
Nagios Conference 2014 - Eric Mislivec - Getting Started With Nagios CoreNagios Conference 2014 - Eric Mislivec - Getting Started With Nagios Core
Nagios Conference 2014 - Eric Mislivec - Getting Started With Nagios Core
 
Syslog.ppt
Syslog.pptSyslog.ppt
Syslog.ppt
 
Prometheus - Intro, CNCF, TSDB,PromQL,Grafana
Prometheus - Intro, CNCF, TSDB,PromQL,GrafanaPrometheus - Intro, CNCF, TSDB,PromQL,Grafana
Prometheus - Intro, CNCF, TSDB,PromQL,Grafana
 
An Introduction To Linux
An Introduction To LinuxAn Introduction To Linux
An Introduction To Linux
 
Pivotal Greenplum Cloud Marketplaces - Greenplum Summit 2019
Pivotal Greenplum Cloud Marketplaces - Greenplum Summit 2019Pivotal Greenplum Cloud Marketplaces - Greenplum Summit 2019
Pivotal Greenplum Cloud Marketplaces - Greenplum Summit 2019
 
Operating System.pdf
Operating System.pdfOperating System.pdf
Operating System.pdf
 
Nagios Conference 2011 - Daniel Wittenberg - Scaling Nagios At A Giant Insur...
Nagios Conference 2011 - Daniel Wittenberg -  Scaling Nagios At A Giant Insur...Nagios Conference 2011 - Daniel Wittenberg -  Scaling Nagios At A Giant Insur...
Nagios Conference 2011 - Daniel Wittenberg - Scaling Nagios At A Giant Insur...
 
Nagios Conference 2014 - Andy Brist - Nagios XI Failover and HA Solutions
Nagios Conference 2014 - Andy Brist - Nagios XI Failover and HA SolutionsNagios Conference 2014 - Andy Brist - Nagios XI Failover and HA Solutions
Nagios Conference 2014 - Andy Brist - Nagios XI Failover and HA Solutions
 
The Accidental DBA
The Accidental DBAThe Accidental DBA
The Accidental DBA
 

More from Artefactual Systems - Archivematica

Archivematica Community Profile: University of Texas, San Antonio by Julianna...
Archivematica Community Profile: University of Texas, San Antonio by Julianna...Archivematica Community Profile: University of Texas, San Antonio by Julianna...
Archivematica Community Profile: University of Texas, San Antonio by Julianna...Artefactual Systems - Archivematica
 
Archivematica Community Profile: University of Houston by Bethany Scott
Archivematica Community Profile: University of Houston by Bethany ScottArchivematica Community Profile: University of Houston by Bethany Scott
Archivematica Community Profile: University of Houston by Bethany ScottArtefactual Systems - Archivematica
 
Do Something Now: Why Perfect is the Enemy of Good (Enough) in Digital Preser...
Do Something Now: Why Perfect is the Enemy of Good (Enough) in Digital Preser...Do Something Now: Why Perfect is the Enemy of Good (Enough) in Digital Preser...
Do Something Now: Why Perfect is the Enemy of Good (Enough) in Digital Preser...Artefactual Systems - Archivematica
 
Avoiding the 927 Problem: Standards, Digital Preservation, and Communities of...
Avoiding the 927 Problem: Standards, Digital Preservation, and Communities of...Avoiding the 927 Problem: Standards, Digital Preservation, and Communities of...
Avoiding the 927 Problem: Standards, Digital Preservation, and Communities of...Artefactual Systems - Archivematica
 
Getting Started with AtoM and Archivematica for Digital Preservation and Access
Getting Started with AtoM and Archivematica for Digital Preservation and AccessGetting Started with AtoM and Archivematica for Digital Preservation and Access
Getting Started with AtoM and Archivematica for Digital Preservation and AccessArtefactual Systems - Archivematica
 
Archivematica integration handshaking towards comprehensive digital preserva...
Archivematica integration  handshaking towards comprehensive digital preserva...Archivematica integration  handshaking towards comprehensive digital preserva...
Archivematica integration handshaking towards comprehensive digital preserva...Artefactual Systems - Archivematica
 

More from Artefactual Systems - Archivematica (20)

Acts of maintenance
Acts of maintenanceActs of maintenance
Acts of maintenance
 
Archivematica Community Profile: University of Texas, San Antonio by Julianna...
Archivematica Community Profile: University of Texas, San Antonio by Julianna...Archivematica Community Profile: University of Texas, San Antonio by Julianna...
Archivematica Community Profile: University of Texas, San Antonio by Julianna...
 
Archivematica Community Profile: University of Houston by Bethany Scott
Archivematica Community Profile: University of Houston by Bethany ScottArchivematica Community Profile: University of Houston by Bethany Scott
Archivematica Community Profile: University of Houston by Bethany Scott
 
Archives canada digital preservation service (acdps)
Archives canada digital preservation service (acdps)Archives canada digital preservation service (acdps)
Archives canada digital preservation service (acdps)
 
Digital Preservation with Archivematica: An Introduction
Digital Preservation with Archivematica: An IntroductionDigital Preservation with Archivematica: An Introduction
Digital Preservation with Archivematica: An Introduction
 
Do Something Now: Why Perfect is the Enemy of Good (Enough) in Digital Preser...
Do Something Now: Why Perfect is the Enemy of Good (Enough) in Digital Preser...Do Something Now: Why Perfect is the Enemy of Good (Enough) in Digital Preser...
Do Something Now: Why Perfect is the Enemy of Good (Enough) in Digital Preser...
 
Workshop slides - Introduction to AtoM and Archivematica
Workshop slides - Introduction to AtoM and ArchivematicaWorkshop slides - Introduction to AtoM and Archivematica
Workshop slides - Introduction to AtoM and Archivematica
 
Archivematica and the digital archival chain of custody
Archivematica and the digital archival chain of custodyArchivematica and the digital archival chain of custody
Archivematica and the digital archival chain of custody
 
Adding MediaConch to Archivematica for mkv/ffv1 checking
Adding MediaConch to Archivematica for mkv/ffv1 checkingAdding MediaConch to Archivematica for mkv/ffv1 checking
Adding MediaConch to Archivematica for mkv/ffv1 checking
 
Digital Preservation with Archivematica
Digital Preservation with ArchivematicaDigital Preservation with Archivematica
Digital Preservation with Archivematica
 
Avoiding the 927 Problem: Standards, Digital Preservation, and Communities of...
Avoiding the 927 Problem: Standards, Digital Preservation, and Communities of...Avoiding the 927 Problem: Standards, Digital Preservation, and Communities of...
Avoiding the 927 Problem: Standards, Digital Preservation, and Communities of...
 
Introduction to Archivematica
Introduction to ArchivematicaIntroduction to Archivematica
Introduction to Archivematica
 
PREMIS in METS in Archivematica
PREMIS in METS in ArchivematicaPREMIS in METS in Archivematica
PREMIS in METS in Archivematica
 
Archivematica Community Update - SAA 2016
Archivematica Community Update - SAA 2016Archivematica Community Update - SAA 2016
Archivematica Community Update - SAA 2016
 
Your Digital Preservation Cookbook
Your Digital Preservation CookbookYour Digital Preservation Cookbook
Your Digital Preservation Cookbook
 
Archivematica presentation to SJSU iSchool Colloquia series
Archivematica presentation to SJSU iSchool Colloquia seriesArchivematica presentation to SJSU iSchool Colloquia series
Archivematica presentation to SJSU iSchool Colloquia series
 
Getting Started with AtoM and Archivematica for Digital Preservation and Access
Getting Started with AtoM and Archivematica for Digital Preservation and AccessGetting Started with AtoM and Archivematica for Digital Preservation and Access
Getting Started with AtoM and Archivematica for Digital Preservation and Access
 
Archivematica integration handshaking towards comprehensive digital preserva...
Archivematica integration  handshaking towards comprehensive digital preserva...Archivematica integration  handshaking towards comprehensive digital preserva...
Archivematica integration handshaking towards comprehensive digital preserva...
 
Hierarchical DIP upload from Archivematica to AtoM
Hierarchical DIP upload from Archivematica to AtoMHierarchical DIP upload from Archivematica to AtoM
Hierarchical DIP upload from Archivematica to AtoM
 
Report: Archivematica hosting in the cloud
Report: Archivematica hosting in the cloudReport: Archivematica hosting in the cloud
Report: Archivematica hosting in the cloud
 

Recently uploaded

Cloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEECloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEEVICTOR MAESTRE RAMIREZ
 
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...OnePlan Solutions
 
Sending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdfSending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdf31events.com
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaHanief Utama
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWave PLM
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmSujith Sukumaran
 
cpct NetworkING BASICS AND NETWORK TOOL.ppt
cpct NetworkING BASICS AND NETWORK TOOL.pptcpct NetworkING BASICS AND NETWORK TOOL.ppt
cpct NetworkING BASICS AND NETWORK TOOL.pptrcbcrtm
 
What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...Technogeeks
 
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Matt Ray
 
Odoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 EnterpriseOdoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 Enterprisepreethippts
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxTier1 app
 
SpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at RuntimeSpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at Runtimeandrehoraa
 
MYjobs Presentation Django-based project
MYjobs Presentation Django-based projectMYjobs Presentation Django-based project
MYjobs Presentation Django-based projectAnoyGreter
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Cizo Technology Services
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Velvetech LLC
 
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdfExploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdfkalichargn70th171
 
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024StefanoLambiase
 

Recently uploaded (20)

Cloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEECloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEE
 
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
 
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort ServiceHot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
 
Sending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdfSending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdf
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief Utama
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need It
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalm
 
cpct NetworkING BASICS AND NETWORK TOOL.ppt
cpct NetworkING BASICS AND NETWORK TOOL.pptcpct NetworkING BASICS AND NETWORK TOOL.ppt
cpct NetworkING BASICS AND NETWORK TOOL.ppt
 
What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...
 
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
 
Odoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 EnterpriseOdoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 Enterprise
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
 
SpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at RuntimeSpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at Runtime
 
MYjobs Presentation Django-based project
MYjobs Presentation Django-based projectMYjobs Presentation Django-based project
MYjobs Presentation Django-based project
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
 
Odoo Development Company in India | Devintelle Consulting Service
Odoo Development Company in India | Devintelle Consulting ServiceOdoo Development Company in India | Devintelle Consulting Service
Odoo Development Company in India | Devintelle Consulting Service
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...
 
Advantages of Odoo ERP 17 for Your Business
Advantages of Odoo ERP 17 for Your BusinessAdvantages of Odoo ERP 17 for Your Business
Advantages of Odoo ERP 17 for Your Business
 
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdfExploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
 
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
 

Archivematica Technical Training Diagnostics Guide (September 2018)

  • 1. Archivematica Technical Training: Diagnostics Guide Ashley Blewer @ Artefactual Systems, Inc. ablewer@artefactual.com
  • 2. Outline ● Technical introduction ● Microservices we use ● Software components ● Logging in ● Reading logs ● System monitoring ● Upgrading/Security ● Advanced: APIs ● Getting (free) help
  • 5. Supporting technology ● Python: programming language ● Django: web application framework ● Gearman: job scheduler ● MySQL: relational database ● Elasticsearch: search index ● Nginx: web server (can be apache) ● Gunicorn: interface between Python and Nginx ● git: version control system ● Ansible/Docker: deployment/configuration management
  • 6. All on Linux ● Ubuntu 16.04 or 18.04 ● CentOS 7 or Red Hat
  • 7. Format Policy Registry ● Tools we use to perform preservation actions ● Rules we use to determine when to use the Tools ● Commands are applied to files based on the Rules
  • 8. Tools ● Identification: FIDO and Siegfried ● Characterization: FITS, FFprobe, MediaInfo, ExifTool ● Event Detail: echo ● Extraction: 7zip, SleuthKit ● Normalization: FFmpeg, ImageMagick ● Transcription: Tesseract ● Validation: MediaConch ● Verification: File exists? Filesize is >0?
  • 9. Production tools Automation Tools https://github.com/artefactual/automation-tools/ Fixity https://github.com/artefactual/fixity Deployment tools Deploy-pub https://github.com/artefactual/deploy-pub Ansible roles https://github.com/artefactual-labs/ansible-archivematica-src Am.git https://github.com/artefactual-labs/am DevOps Tools Ops-helpers https://github.com/artefactual-labs/ops-helpers Dev-tools https://github.com/artefactual/archivematica-devtools
  • 10. Technical stack ● Lots of tools = lots of potential points of failure ● Archivematica strives to relay as much information as possible to the user -- especially about what the tools are doing and what they are producing
  • 11. Components ● Dashboard: for the user ● MCPClient: does the work ● MCPServer: manages the work ● Storage Service: manages storage
  • 12. Logging in ● Logging in (ssh) ● Moving files (scp) ● What’s running (ps -sf | grep py) ● How much space? (du) ● How much free space? (df -h) ● Load average time? (top) ● Read end of logs (tail) ● Read logs (less)
  • 14. Moving files Download a file to your computer scp your_username@remotehost.url:your-file.txt /your/local/directory Send a file to your machine scp path/to/your-file.txt your_username@remotehost.url:/some/remote/di rectory
  • 15. What’s running? ps -ef | grep py These services should all be running: ● Dashboard (apache) ● Database (mysql) ● Elasticsearch (elastic) ● Storage Service (uwsgi or nginx) ● FITS ● Server (MCP) -- Should show MCP server and MCP client
  • 16. What’s running? ps -ef | grep py Also, these dependent services should all be running: ● MySQL ● Elasticsearch ● Gearman ● Nginx ● Nailgun ● Clamav
  • 17. du To get the file size of each subdirectory of the directory you are in, you can run this command: du -h --max-depth=1 This command can take a long time if you have very large mounted drives. See amount of space on machine
  • 18. Check free space on disk df -h ● Up to 3x of free space required for processing ● cron job can auto-clear deleted/rejected files
  • 20. Restarting services service archivematica-dashboard restart service archivematica-mcp-client.service restart service archivematica-mcp-server.service restart service archivematica-storage-service restart service gearmand restart
  • 21. Reading logs less /var/log/archivematica/dashboard/dashboard.log less /var/log/archivematica/dashboard/dashboard.debug.log less /var/log/archivematica/MCPClient/MCPClient.log less /var/log/archivematica/MCPClient/MCPClient.debug.log less /var/log/archivematica/MCPServer/MCPServer.log less /var/log/archivematica/MCPServer/MCPServer.debug.log less /var/log/archivematica/storage-service/storage-service.log less /var/log/archivematica/storage-service/storage-service.debug.log
  • 22. Finding errors grep -rn ERROR /var/log/archivematica/
  • 24. Upgrading ● Need to decide on a new release whether you want it or not, how much time to put aside. ● Tradeoff to not upgrading is not keeping pace with community and having a harder time getting support from community for an older version. ● Good idea to test the upgrade- make a backup of your production environment and test upgrade there. If that is not possible, plan for downtime. ○ If you want to be able to do this, you might want to explore virtualization of your Archivematica environment so you can run a development (testing) environment in addition to the production environment.
  • 25. Security upgrades ● Make sure that Ubuntu is set-up to do Unattended Upgrades, which will apply security patches (like equivalent of Windows updates). ● Sometimes these upgrades require the system to be restarted- you might need to plan for 30 minutes of downtime (not in the middle of processing, make sure your current Transfer/AIPs are done).
  • 27. Getting Help ● Participating in the community forum ○ Archivematica https://groups.google.com/forum/#!forum/archivematica ● Documentation ○ Main docs https://www.archivematica.org/en/ ○ Wiki https://wiki.archivematica.org/Main_Page ● Github issues ○ Main repo https://github.com/archivematica/Issues/issues
  • 28. See also This presentation in document form ● For tech-savvy preservationists: https://docs.google.com/document/d/1GybyH7X_gpZ7wpYVo5d9__LeG NuXYCky0oairJGJAmo/edit#heading=h.y1nyq0vlcvsl ● For Archivematica-unfamiliar systems administrators: https://docs.google.com/document/d/1NDzGHBGuPFa7GTHCMEl3D2n vvdZRxG2FpdsGAYoG31I/edit#
  • 29. exit :q q Quit .quit q :q! Ashley Blewer @ Artefactual Systems, Inc. ablewer@artefactual.com