SlideShare a Scribd company logo
1 of 13
Download to read offline
Revolutionising the world of research

with Amazon Web Services




Dan Harvey
Systems Architect
Mendeley helps researchers work smarter
Mendeley makes science more
 collaborative and transparent




         Sync between computers
            and peers around the world
1 million researchers...

      Uploaded130 million citations
           40 million unique



                   17 million papers
                   16 TB of data
Document Storage
• Backed on to S3

• Previews for 16TB of PDFs?
PDF Previews

                Process Queue




                                Load PDFs


                   Elastic
                  Beanstalk
                                            S3



Render to PDF
                                Store PNG
                                                      Serve
                                                 via Cloud Front
System Overview


                 S3
                          RD
          EM                   S
             R
                 EC
                      2
                          EB
                               S

          VPC
Article Search
• Based on Solr (open source search)
• 40GB index
• Variable usage



      H-p
    Requests


                 Day	
  of	
  the	
  month
Solr Layout
            Solr
           Master               EB
                                     S


                                           Inside VPC

                                          Outside VPC


 Solr
            Solr         Solr
Slave
           Slave        Slave




           Elastic
                                         Search
        Load Balancer
                                         Queries
• Machine learning on Hadoop

• Personalised article recommendations
• Collaborative filtering based

• Running on Elastic Map Reduce
Summary
• Not all or nothing



• Focus on your problem
       not “Undifferentiated heavy lifting”
                                  - Werner Vogels



• Learn the building blocks AWS provide
Enjoy what you’ve seen?
      We’re hiring!

  Senior Java Engineers


      chat to me after
      or e-mail/tweet
Questions?



      @DanHarvey
dan.harvey@mendeley.com

More Related Content

What's hot

What's hot (9)

beSharp a serverless approach to big data on aws
beSharp a serverless approach to big data on awsbeSharp a serverless approach to big data on aws
beSharp a serverless approach to big data on aws
 
Getting Started with Visual Studio Tools for AI
Getting Started with Visual Studio Tools for AIGetting Started with Visual Studio Tools for AI
Getting Started with Visual Studio Tools for AI
 
Using AML Python SDK
Using AML Python SDKUsing AML Python SDK
Using AML Python SDK
 
Redshift
RedshiftRedshift
Redshift
 
If you doing file uploads with rails you're gonna have a bad time
If you doing file uploads with rails you're gonna have a bad timeIf you doing file uploads with rails you're gonna have a bad time
If you doing file uploads with rails you're gonna have a bad time
 
A Deeper Dive into Apache MXNet - March 2017 AWS Online Tech Talks
A Deeper Dive into Apache MXNet - March 2017 AWS Online Tech TalksA Deeper Dive into Apache MXNet - March 2017 AWS Online Tech Talks
A Deeper Dive into Apache MXNet - March 2017 AWS Online Tech Talks
 
Defluffing Cloud Computing
Defluffing Cloud ComputingDefluffing Cloud Computing
Defluffing Cloud Computing
 
Lessons Learnt from Guanyu
Lessons Learnt from GuanyuLessons Learnt from Guanyu
Lessons Learnt from Guanyu
 
Running Fast, Interactive Queries on Petabyte Datasets using Presto - AWS Jul...
Running Fast, Interactive Queries on Petabyte Datasets using Presto - AWS Jul...Running Fast, Interactive Queries on Petabyte Datasets using Presto - AWS Jul...
Running Fast, Interactive Queries on Petabyte Datasets using Presto - AWS Jul...
 

Viewers also liked

Tech Tools for Reference: Enhancing the Research Experience in the Health Sci...
Tech Tools for Reference: Enhancing the Research Experience in the Health Sci...Tech Tools for Reference: Enhancing the Research Experience in the Health Sci...
Tech Tools for Reference: Enhancing the Research Experience in the Health Sci...
Christine Tobias
 
Structured writing using ms word
Structured writing using ms wordStructured writing using ms word
Structured writing using ms word
Wouter Verkerken
 
Thesis writing and article management using Mendeley ( slide Bengkel Penulisa...
Thesis writing and article management using Mendeley ( slide Bengkel Penulisa...Thesis writing and article management using Mendeley ( slide Bengkel Penulisa...
Thesis writing and article management using Mendeley ( slide Bengkel Penulisa...
Annie Johanna
 
How to write a scientific article?
How to write a scientific article?How to write a scientific article?
How to write a scientific article?
Annette Gerritsen
 
Parts of a Research Paper
Parts of a Research PaperParts of a Research Paper
Parts of a Research Paper
Draizelle Sexon
 

Viewers also liked (17)

Tweets and Mendeley readers: Two different types of article level metrics
Tweets and Mendeley readers: Two different types of article level metricsTweets and Mendeley readers: Two different types of article level metrics
Tweets and Mendeley readers: Two different types of article level metrics
 
Tech Tools for Reference: Enhancing the Research Experience in the Health Sci...
Tech Tools for Reference: Enhancing the Research Experience in the Health Sci...Tech Tools for Reference: Enhancing the Research Experience in the Health Sci...
Tech Tools for Reference: Enhancing the Research Experience in the Health Sci...
 
Implementing Structured Writing and Content Management Globally
Implementing Structured Writing and Content Management GloballyImplementing Structured Writing and Content Management Globally
Implementing Structured Writing and Content Management Globally
 
Mendeley and MS Word
Mendeley and MS WordMendeley and MS Word
Mendeley and MS Word
 
Mendeley for Librarians
Mendeley for LibrariansMendeley for Librarians
Mendeley for Librarians
 
Basic Introduction to Mendeley Reference Manager
Basic Introduction to Mendeley Reference ManagerBasic Introduction to Mendeley Reference Manager
Basic Introduction to Mendeley Reference Manager
 
Structured writing using ms word
Structured writing using ms wordStructured writing using ms word
Structured writing using ms word
 
Thesis writing and article management using Mendeley ( slide Bengkel Penulisa...
Thesis writing and article management using Mendeley ( slide Bengkel Penulisa...Thesis writing and article management using Mendeley ( slide Bengkel Penulisa...
Thesis writing and article management using Mendeley ( slide Bengkel Penulisa...
 
Mendeley Workshop Presentation
Mendeley Workshop PresentationMendeley Workshop Presentation
Mendeley Workshop Presentation
 
Scientific writing process
Scientific writing processScientific writing process
Scientific writing process
 
How to write a scientific article?
How to write a scientific article?How to write a scientific article?
How to write a scientific article?
 
Dental drugs prescription
Dental drugs prescriptionDental drugs prescription
Dental drugs prescription
 
Scientific writing pro : Office word & Mendeley (dani r firman)
Scientific writing pro : Office word & Mendeley (dani r firman)Scientific writing pro : Office word & Mendeley (dani r firman)
Scientific writing pro : Office word & Mendeley (dani r firman)
 
How to Write a Thesis
How to Write a ThesisHow to Write a Thesis
How to Write a Thesis
 
Structured writing - What's it Good For?
Structured writing - What's it Good For?Structured writing - What's it Good For?
Structured writing - What's it Good For?
 
Introduction to-mendeley presentation-2014
Introduction to-mendeley presentation-2014Introduction to-mendeley presentation-2014
Introduction to-mendeley presentation-2014
 
Parts of a Research Paper
Parts of a Research PaperParts of a Research Paper
Parts of a Research Paper
 

Similar to AWS at Mendeley (London, September 27th 2011)

Microsoft Openness Mongo DB
Microsoft Openness Mongo DBMicrosoft Openness Mongo DB
Microsoft Openness Mongo DB
Heriyadi Janwar
 
Tldr solr-courseload
Tldr solr-courseloadTldr solr-courseload
Tldr solr-courseload
mattdeboard
 
Mark ramm To relate or not to relate
Mark ramm   To relate or not to relateMark ramm   To relate or not to relate
Mark ramm To relate or not to relate
StarTech Conference
 
An Introduction to Big Data, NoSQL and MongoDB
An Introduction to Big Data, NoSQL and MongoDBAn Introduction to Big Data, NoSQL and MongoDB
An Introduction to Big Data, NoSQL and MongoDB
William LaForest
 
AWS Summit Berlin 2012 Talk on Web Data Commons
AWS Summit Berlin 2012 Talk on Web Data CommonsAWS Summit Berlin 2012 Talk on Web Data Commons
AWS Summit Berlin 2012 Talk on Web Data Commons
Hannes Mühleisen
 
AWS Customer Presentation: Freie Univerisitat - Berlin Summit 2012
AWS Customer Presentation: Freie Univerisitat - Berlin Summit 2012AWS Customer Presentation: Freie Univerisitat - Berlin Summit 2012
AWS Customer Presentation: Freie Univerisitat - Berlin Summit 2012
Amazon Web Services
 
Common MongoDB Use Cases
Common MongoDB Use CasesCommon MongoDB Use Cases
Common MongoDB Use Cases
DATAVERSITY
 

Similar to AWS at Mendeley (London, September 27th 2011) (20)

Microsoft Openness Mongo DB
Microsoft Openness Mongo DBMicrosoft Openness Mongo DB
Microsoft Openness Mongo DB
 
Tldr solr-courseload
Tldr solr-courseloadTldr solr-courseload
Tldr solr-courseload
 
Mark ramm To relate or not to relate
Mark ramm   To relate or not to relateMark ramm   To relate or not to relate
Mark ramm To relate or not to relate
 
MED201 Media Ingest and Storage Solutions with AWS - AWS re: Invent 2012
MED201 Media Ingest and Storage Solutions with AWS - AWS re: Invent 2012MED201 Media Ingest and Storage Solutions with AWS - AWS re: Invent 2012
MED201 Media Ingest and Storage Solutions with AWS - AWS re: Invent 2012
 
No Sql
No SqlNo Sql
No Sql
 
NOSQL, CouchDB, and the Cloud
NOSQL, CouchDB, and the CloudNOSQL, CouchDB, and the Cloud
NOSQL, CouchDB, and the Cloud
 
Anti-social Databases
Anti-social DatabasesAnti-social Databases
Anti-social Databases
 
Jake Mannix, Lead Data Engineer, Lucidworks at MLconf SEA - 5/20/16
Jake Mannix, Lead Data Engineer, Lucidworks at MLconf SEA - 5/20/16Jake Mannix, Lead Data Engineer, Lucidworks at MLconf SEA - 5/20/16
Jake Mannix, Lead Data Engineer, Lucidworks at MLconf SEA - 5/20/16
 
Zenko & MetalK8s @ Dublin Docker Meetup, June 2018
Zenko & MetalK8s @ Dublin Docker Meetup, June 2018Zenko & MetalK8s @ Dublin Docker Meetup, June 2018
Zenko & MetalK8s @ Dublin Docker Meetup, June 2018
 
Databases in the Cloud
Databases in the CloudDatabases in the Cloud
Databases in the Cloud
 
An Introduction to Big Data, NoSQL and MongoDB
An Introduction to Big Data, NoSQL and MongoDBAn Introduction to Big Data, NoSQL and MongoDB
An Introduction to Big Data, NoSQL and MongoDB
 
2011 05-12 nosql-fritidsresor
2011 05-12 nosql-fritidsresor2011 05-12 nosql-fritidsresor
2011 05-12 nosql-fritidsresor
 
Global Netflix - HPTS Workshop - Scaling Cassandra benchmark to over 1M write...
Global Netflix - HPTS Workshop - Scaling Cassandra benchmark to over 1M write...Global Netflix - HPTS Workshop - Scaling Cassandra benchmark to over 1M write...
Global Netflix - HPTS Workshop - Scaling Cassandra benchmark to over 1M write...
 
Netflix on Cloud - combined slides for Dev and Ops
Netflix on Cloud - combined slides for Dev and OpsNetflix on Cloud - combined slides for Dev and Ops
Netflix on Cloud - combined slides for Dev and Ops
 
Infopark AG - AWS Customer Presentation
Infopark AG - AWS Customer PresentationInfopark AG - AWS Customer Presentation
Infopark AG - AWS Customer Presentation
 
AWS Summit Berlin 2012 Talk on Web Data Commons
AWS Summit Berlin 2012 Talk on Web Data CommonsAWS Summit Berlin 2012 Talk on Web Data Commons
AWS Summit Berlin 2012 Talk on Web Data Commons
 
AWS Customer Presentation: Freie Univerisitat - Berlin Summit 2012
AWS Customer Presentation: Freie Univerisitat - Berlin Summit 2012AWS Customer Presentation: Freie Univerisitat - Berlin Summit 2012
AWS Customer Presentation: Freie Univerisitat - Berlin Summit 2012
 
Common MongoDB Use Cases
Common MongoDB Use CasesCommon MongoDB Use Cases
Common MongoDB Use Cases
 
Apache Cassandra training. Overview and Basics
Apache Cassandra training. Overview and BasicsApache Cassandra training. Overview and Basics
Apache Cassandra training. Overview and Basics
 
Architecture by Accident
Architecture by AccidentArchitecture by Accident
Architecture by Accident
 

More from Dan Harvey (6)

Change data capture with MongoDB and Kafka.
Change data capture with MongoDB and Kafka.Change data capture with MongoDB and Kafka.
Change data capture with MongoDB and Kafka.
 
Data Processing in the Work of NoSQL? An Introduction to Hadoop
Data Processing in the Work of NoSQL? An Introduction to HadoopData Processing in the Work of NoSQL? An Introduction to Hadoop
Data Processing in the Work of NoSQL? An Introduction to Hadoop
 
An Introduction to Hadoop
An Introduction to HadoopAn Introduction to Hadoop
An Introduction to Hadoop
 
Overview of Hadoop in 2010 and what's coming up in 2011
Overview of Hadoop in 2010 and what's coming up in 2011Overview of Hadoop in 2010 and what's coming up in 2011
Overview of Hadoop in 2010 and what's coming up in 2011
 
Project Voldemort: Big data loading
Project Voldemort: Big data loadingProject Voldemort: Big data loading
Project Voldemort: Big data loading
 
HBase at Mendeley
HBase at MendeleyHBase at Mendeley
HBase at Mendeley
 

Recently uploaded

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Recently uploaded (20)

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 

AWS at Mendeley (London, September 27th 2011)

Editor's Notes

  1. Introduce myself\nAsk who knows about Mendeley\nMendeley are a start-up aiming to improve the world of research\n
  2. One of our two core products, Mendeley Desktop\nResearch can put research papers they read, annotate their thoughts and ideas\nUse them as citations inside Word, Openoffice, Latex\n
  3. Sync to Mendeley Web, where you can share notes and collaborate with other\nAlso allows us to produce statistics and trends on research speeding up the usual delay researchers get with feed back to their publications.\n
  4. A few numbers... we’ve got over 1 million research using the platform. They’ve uploaded over 130 million documents, which we’ve found around 40 million are unique. Of these 40 million we’ve got 17 million pdf articles users have uploaded, giving around 16TH of documents in total.\n\nSo quite a scaling and growth challenge!\n
  5. \n
  6. \n
  7. \n
  8. \n
  9. \n
  10. \n
  11. Hello how’s this looking...\n
  12. \n
  13. \n