MongoDB use cases and setup
involving Elasticsearch
MongoDB Meetup @hikeapp Gurgaon
Bharvi Dixit
@d_bharvi
13th February 2015
Agenda
• About Me and Orkash.
• Why we chose MongoDB.
• Our use cases and setup of MongoDB.
• Better Than Apple: MongoDB-Elasticsearch.
• Elasticsearch: An Overview.
• The most common issues.
• Mongo University: Learn from the masters.
About Me
• Software Engineer @Orkash.
• Organizer and Speaker @Delhi Elasticsearch Meetup.
• Loves Java, Data, Elasticsearch, MongoDB, Eclipse.
• Interested in all things scale, search, security & DevOps.
• Working with NoSQL databases for more than a year.
• Social Media and News Media Intelligence. (Complex schemas & query designs)
About Orkash
• Founded in 2007 by Ashish Sonal.
• An R&D-driven company that provides a Big Data Automated Intelligence Platform with a focus on the following areas:
– Counter-terrorism, Security intelligence and Risk management.
– Political Consulting And Homeland Security.
– Decision Support Systems.
– Market/Brand intelligence.
• We create the FOUR pillars of Automated intelligence:
– Information Extraction and Monitoring.
– Semantic and Link Analysis.
– Geo-Spatial Analysis.
– Data Mining & Forensics.
Everything starts with a problem..!!
• Data Driven Decisions
• Logfiles for scaling up/down
• Warehouse withdrawal triggers orders
• History for fraud detection
• Internet of Things and Smart Cities.
... data explosion
Everything starts with a problem..!!
Better decisions == more data
And NoSQL adds more problems
Data
Big Data
BIG DATA
Big Data Problem goes on..
• I need BIG DATA.
• I need to analyze this data.
• I need to enrich this big data & make it even bigger.
• I need fast searching.
• I need real-time analytics.
• Oh wait.. I need relational queries on this big data to get more insights..
Why we chose MongoDB
• It does the impossible. (Can incorporate any kind of data)
• Document model.
• Distributed computing.
• Awesome sharding and replication.
• Scales big (horizontally) on commodity hardware.
• Powerful analytics with the aggregation framework.
• High persistence and read-write performance.
• Awesome security features.
• OS-managed memory.
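A minimal sketch of what the aggregation framework bullet refers to, assuming a hypothetical collection of articles with `source` and `lang` fields (neither name comes from the talk). The `pipeline` list is what a driver's `aggregate()` call would receive; the local function only mimics its result so the example runs without a server.

```python
from collections import Counter

# Hypothetical sample documents; field names are assumptions for illustration.
articles = [
    {"source": "twitter", "lang": "en"},
    {"source": "news", "lang": "en"},
    {"source": "twitter", "lang": "hi"},
    {"source": "twitter", "lang": "en"},
]

# Aggregation pipeline: keep English documents, count them per source,
# and sort sources by descending count.
pipeline = [
    {"$match": {"lang": "en"}},
    {"$group": {"_id": "$source", "count": {"$sum": 1}}},
    {"$sort": {"count": -1}},
]

def run_locally(docs):
    """Pure-Python equivalent of the pipeline above, for illustration only."""
    matched = [d for d in docs if d["lang"] == "en"]
    counts = Counter(d["source"] for d in matched)
    return [{"_id": source, "count": n} for source, n in counts.most_common()]

result = run_locally(articles)
print(result)
```

With a real deployment the same `pipeline` would be sent to the server instead of being evaluated in Python.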
Our use cases and setup of MongoDB.
• A primary data store for collecting and storing humongous amounts of unstructured/semi-structured text.
• Building GIS applications for government and security agencies using geospatial features.
• Data analytics.
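As a rough illustration of the geospatial use case, the sketch below builds a `2dsphere` index specification and a `$near` filter as plain Python structures. The field name `location`, the coordinates (approximately Gurgaon), and the 5 km radius are assumptions, not details from the talk.

```python
import json

# Index specification for GeoJSON points; "location" is a hypothetical field.
index_spec = [("location", "2dsphere")]

def near_query(lon, lat, max_meters):
    """Build a $near filter for documents within max_meters of (lon, lat).

    GeoJSON coordinates are ordered longitude first, then latitude.
    """
    return {
        "location": {
            "$near": {
                "$geometry": {"type": "Point", "coordinates": [lon, lat]},
                "$maxDistance": max_meters,
            }
        }
    }

# Approximate coordinates, used only as sample input.
query = near_query(77.0266, 28.4595, 5000)
print(json.dumps(query, indent=2))
```

With a driver such as PyMongo, `index_spec` would go to `create_index` and `query` to `find` on the collection.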
Our use cases and setup of MongoDB.
Our current production setup has 14 nodes:
Node type              # of nodes   Hardware specification (each)
Data nodes             5            20 GB RAM, 8-core CPU
Mongos (VMs)           4            4 GB RAM, 4-core CPU
Arbiter nodes (VMs)    2            1 GB RAM, 1-core CPU
Config servers (VMs)   3            4 GB RAM, 2-core CPU
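A quick arithmetic check of the setup listed above: the per-type counts add up to the stated 14 nodes, and the same figures give the aggregate RAM and core counts. The tuple layout is just a convenience for this sketch.

```python
# (node type, number of nodes, RAM in GB per node, CPU cores per node)
cluster = [
    ("data", 5, 20, 8),
    ("mongos", 4, 4, 4),
    ("arbiter", 2, 1, 1),
    ("config", 3, 4, 2),
]

total_nodes = sum(count for _, count, _, _ in cluster)
total_ram_gb = sum(count * ram for _, count, ram, _ in cluster)
total_cores = sum(count * cores for _, count, _, cores in cluster)

print(total_nodes, total_ram_gb, total_cores)
```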
Better Than Apple: MongoDB-Elasticsearch
• One of the greatest combinations this era has seen.
• Continuous improvements.
• Each fills in the other's missing features.
• Both have very similar concepts and data types.
• Both keep the cloud in mind.
• Driven by the open-source community, knowledge sharing, and close collaboration with users.
Better Than Apple: MongoDB-Elasticsearch
Sources: Twitter
Elasticsearch Overview
What is Elasticsearch:
• “you know, for search”
• Schema-free, REST- and JSON-based distributed full-text search engine & document store.
• Written in Java & built on top of Lucene.
• Highly reliable, scalable, fault tolerant.
• Supports distributed indexing, replication, and load-balanced querying.
• Powerful geospatial queries.
• Latest release: 1.4.2
Wait..!! Schema-free?? The real gotcha.. Mongo-ES breakup :(
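One plausible reading of this gotcha (the slide does not spell it out): MongoDB will store a field with different types across documents, while Elasticsearch fixes a field's mapping from the first value it sees, so later documents with a conflicting type can fail to index. The sketch below is a simplified local check for such fields before syncing; it is not Elasticsearch's actual mapping logic, and the sample documents are made up.

```python
def json_type(value):
    """Rough type label for a JSON value (simplified; bool is checked before int)."""
    if isinstance(value, bool):
        return "boolean"
    if isinstance(value, int):
        return "long"
    if isinstance(value, float):
        return "double"
    if isinstance(value, str):
        return "string"
    if isinstance(value, dict):
        return "object"
    if isinstance(value, list):
        return "array"
    return "null"

def find_type_conflicts(docs):
    """Return {field: set of types} for top-level fields seen with more than one type."""
    first_seen = {}
    conflicts = {}
    for doc in docs:
        for field, value in doc.items():
            current = json_type(value)
            first = first_seen.setdefault(field, current)
            if current != first:
                conflicts.setdefault(field, {first}).add(current)
    return conflicts

# Hypothetical documents: "followers" is a number in some and a string in another.
docs = [
    {"user": "a", "followers": 120},
    {"user": "b", "followers": "1.2k"},
    {"user": "c", "followers": 95},
]
conflicts = find_type_conflicts(docs)
print(conflicts)
```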
Elasticsearch Overview
What does it add to Lucene:
• REST service: JSON APIs over HTTP.
• High availability & performance: clustering & replication.
• A powerful query DSL.
• Interoperation with non-Java/JVM languages.
• More and more resilience.
• Multitenancy.
• And the best one: it lets you maintain relationships among documents.
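The last bullet presumably refers to features such as parent/child documents. A hedged sketch in the Elasticsearch 1.x style current at the time of the talk: a child type declares its parent through `_parent` in the mapping, and a `has_child` query returns parents whose children match. The type names `author` and `post` and the field `text` are hypothetical, and later Elasticsearch versions changed this mechanism.

```python
import json

# Mapping for a hypothetical "post" child type whose parent type is "author".
child_mapping = {"post": {"_parent": {"type": "author"}}}

# Query body: find authors that have at least one post mentioning "mongodb".
has_child_query = {
    "query": {
        "has_child": {
            "type": "post",
            "query": {"match": {"text": "mongodb"}},
        }
    }
}

print(json.dumps(child_mapping))
print(json.dumps(has_child_query, indent=2))
```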
The Elasticsearch Open Source Model
Understanding Elasticsearch structure with respect to MongoDB
The most common issues..
1. Distributed computing comes with two problems:
Node failures and network bottlenecks.
Node failures can be handled by MongoDB very easily, but
network bottlenecks/partitions won’t let you sleep at night
because of replica set failovers and rollbacks.
Separate networks for reads and writes.
2. Assuring a business continuity plan
mongodump is not a good fit for backing up large datasets.
3. Data modeling
4. Keeping a close eye on connections
5. Importing embedded documents from CSV
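For the last issue, one common workaround is to encode embedded fields as dotted column names and rebuild the nested documents before inserting them. The sketch below assumes that convention; the column names and the sample row are invented, and values stay strings because CSV carries no type information.

```python
import csv
import io

def unflatten(row):
    """Turn {"geo.lat": "28.61"} style columns into nested documents."""
    doc = {}
    for dotted, value in row.items():
        parts = dotted.split(".")
        target = doc
        # Walk (and create) intermediate sub-documents for each dotted prefix.
        for part in parts[:-1]:
            target = target.setdefault(part, {})
        target[parts[-1]] = value
    return doc

# Hypothetical CSV input with an embedded "geo" document.
csv_text = "name,geo.lat,geo.lon\nDelhi,28.61,77.21\n"
docs = [unflatten(row) for row in csv.DictReader(io.StringIO(csv_text))]
print(docs)
```

Numeric fields would still need an explicit conversion step before the documents are stored.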
Mongo University: Learn from the masters..!!
Thank You for Listening
bharvidixit@yahoo.com
https://twitter.com/d_bharvi
http://www.meetup.com/Delhi-Elasticsearch-Meetup/
http://www.slideshare.net/bharvidixit/
