SlideShare a Scribd company logo
1 of 18
Download to read offline
Modern Database Systems
CS-E5040
Aristides Gionis
Michael Mathioudakis
T.A.: Orestis Kostakis
Spring 2016
what is a Database Management System (DBMS)?
what is a database?
a collection of data
what is a database management system?
... a.k.a. ‘database system’
software to store, access, administer a database
not just a collection of files
provides mechanism to query the data
transfers data between main memory and secondary storage (disk)
enables concurrent access, offers guarantees for data consistency
provides crash recovery mechanisms
provides security and access control
2
why use a DBMS?
discuss
3
why use a DBMS?
separate logical from physical data organization
efficient data access
guarantee data integrity and security
reduce application development time
data administration
4
why study database systems?
discuss
5
why study database systems?
to manage data efficiently
6
consider the following task…
data records that contain information about products viewed or purchased from an online store
task for each pair of Games products, count the number of customers that have purchased both
7
Product Category Customer Date Price Action other...
Portal 2 Games Michael M. 12/01/2015 10€ Purchase
...
FLWR Plant Food Garden Aris G. 19/02/2015 32€ View
Chase the Rabbit Games Michael M. 23/04/2015 1€ View
Portal 2 Games Orestis K. 13/05/2015 10€ Purchase
...
> what challenges does case B pose compared to case A?
hint limited main memory, disk access, distributed setting
case A
10,000 records (0.5MB per record, 5GB total disk space)
10GB of main memory
case B
10,000,000 records (~5TB total disk space) stored across 100 nodes (50GB per node),
10GB of main memory per node
the main message
to manage data efficiently
minimize expensive operations
e.g., disk access
parallelize computation
8
why study database systems?
to manage data efficiently ...
... from different roles
•  develop database systems that match application requirements
•  use database systems efficiently
o  … knowing how a DBMS
•  stores data,
•  processes queries,
•  and accesses data
o  … allows us to
ü  organize data appropriately
ü  design efficient algorithms to process the data
•  combine existing database systems to match requirements
•  large variety of data and applications
•  “one size fits none” - Michael Stonebraker
9
the relational database system
10
database
(data stored on disk)
DBMS
query optimization & execution
relational operators
files and access methods
buffer (memory) management
disk space management
query interface
database
design
application
database user
introductory course
relational dbms
our course
relational dbms
our course
non-relational dbms
previously on ‘database systems’...
§  relational data model
•  relation, attribute, tuple, schema, domain, keys
§  relational algebra
•  projection, selection,
•  cartesian product, natural joins, theta joins, outer joins
•  renaming, constraints
§  structured query language (SQL)
§  schema design
•  functional dependencies, normalization
§  applications
•  embedded sql, drivers
11
modern database systems
beyond the typical relational DBMS setting...
different data models
semi-structured data, unstructured text, graphs
operations at massive scale
big data platforms & map-reduce paradigm,
hadoop and spark, cloud computing
tailored performance
key-value stores, column-stores,
in-memory databases, streaming systems
12
about this course
familiarize ourselves with modern database systems
principles and practice
database models: data, queries, and computation
algorithms - simple queries (e.g., joins) to complex algorithms
experience with real technologies
emphasis is on understanding of core issues…
essentially: the cost of algorithms for different database models and settings
you can use what you learn here to:
select a database system that fits the demands of your application...
… based on supported data model, functionality, optimizations, scalability
design your database to fit the needs of your application
… e.g., by building appropriate index structures
write fast algorithms to process your data…
… and estimate their running time
adapt your knowledge to the database system you’ll be using 5 years from now
13
syllabus
14
part 1: relational database systems (Jan 22 & Jan 29 - Michael)
topics relational model (SQL), indexing (b+ trees, hash tables), join algorithms, query optimization
technology MySQL
part 2: semi-structured data (Feb 5 - Aris)
topics semi-structured data abstraction, representation, search, indexing and pipeline aggregation
technology MongoDB
part 3: unstructured text and information retrieval (Feb 12 & Feb 26 - Aris)
topics querying text data, inverted indexes, compression, ranking and evaluation, rank aggregation
technology Lucene
part 4a: big data platforms - mapreduce (Mar 4 & Mar 11 - Michael)
topics mapreduce paradigm, algorithms in mapreduce
technology Hadoop
part 4b: big data platforms - graph databases (Mar 18 & Apr 1 - Aris)
topics the pregel paradigm, algorithms on pregel (pagerank, centrality)
technology hadoop giraph, spark graphx
logistics
15
instructors Aristides Gionis, Michael Mathioudakis
teaching assistant Orestis Kostakis
lectures Friday 10-12, Room T3
weeks 2 - 6, 8 - 11, 13
office hours by appointment starting on Monday January 25th
curriculum
•  slides and course notes
•  no single textbook, but slides will provide references for further study
announcements •  follow course website on mycourses.aalto.fi
when you send email •  aristides.gionis / michael.mathioudakis / orestis.kostakis @ aalto.fi
•  subject: [ModernDB] your topic
programming
assignments
•  we’ll provide instances of VirtualBox, ready for use
•  you can use campus labs or own laptop
•  access to CSC, if needed
workload & grading scheme
●  3 assignments + exam
●  25% each
●  need to have at least 50% on each
○  i.e., cannot skip some of the course
●  assignments:
○  pen & paper
■  based on slides + references
○  programming
■  real-world tools, e.g., MySQL, MongoDB, Spark
■  will provide tutorials
16
that’s all for now!
questions?
next week
relational model and SQL
indexing
access cost analysis
what to do until then
(if you want)
SQL
17
Credits
for some of these slides, we used material from
“Database Systems: The Complete Book”, by Garcia-Mollina, Ullman, Widom
“Database Management Systems”, by Ramakrishnan and Gehrke
18

More Related Content

What's hot

Introduction to databases
Introduction to databasesIntroduction to databases
Introduction to databasesBryan Corpuz
 
Week 1 Before the Advent of Database Systems & Fundamental Concepts
Week 1 Before the Advent of Database Systems & Fundamental ConceptsWeek 1 Before the Advent of Database Systems & Fundamental Concepts
Week 1 Before the Advent of Database Systems & Fundamental Conceptsoudesign
 
Week 1 Lab Directions
Week 1 Lab DirectionsWeek 1 Lab Directions
Week 1 Lab Directionsoudesign
 
data resource management
 data resource management data resource management
data resource managementsoodsurbhi123
 
Database system concepts
Database system conceptsDatabase system concepts
Database system conceptsKumar
 
ความรู้เบื้องต้นฐานข้อมูล 1
ความรู้เบื้องต้นฐานข้อมูล 1ความรู้เบื้องต้นฐานข้อมูล 1
ความรู้เบื้องต้นฐานข้อมูล 1Witoon Thammatuch-aree
 
Chapter 6 Database SC025 2017/2018
Chapter 6 Database SC025 2017/2018Chapter 6 Database SC025 2017/2018
Chapter 6 Database SC025 2017/2018Fizaril Amzari Omar
 
Database Concepts and Components
Database Concepts and ComponentsDatabase Concepts and Components
Database Concepts and ComponentsRIAH ENCARNACION
 
Introduction to DBMS(For College Seminars)
Introduction to DBMS(For College Seminars)Introduction to DBMS(For College Seminars)
Introduction to DBMS(For College Seminars)Naman Joshi
 
Basic Concept of Database
Basic Concept of DatabaseBasic Concept of Database
Basic Concept of DatabaseMarlon Jamera
 
database management system (DBMS)
database management system (DBMS)database management system (DBMS)
database management system (DBMS)BigyanDhital1
 
ADVANCE DATABASE MANAGEMENT SYSTEM CONCEPTS & ARCHITECTURE by vikas jagtap
ADVANCE DATABASE MANAGEMENT SYSTEM CONCEPTS & ARCHITECTURE by vikas jagtapADVANCE DATABASE MANAGEMENT SYSTEM CONCEPTS & ARCHITECTURE by vikas jagtap
ADVANCE DATABASE MANAGEMENT SYSTEM CONCEPTS & ARCHITECTURE by vikas jagtapVikas Jagtap
 
Big data analytics: Technology's bleeding edge
Big data analytics: Technology's bleeding edgeBig data analytics: Technology's bleeding edge
Big data analytics: Technology's bleeding edgeBhavya Gulati
 

What's hot (20)

Introduction to databases
Introduction to databasesIntroduction to databases
Introduction to databases
 
Week 1 Before the Advent of Database Systems & Fundamental Concepts
Week 1 Before the Advent of Database Systems & Fundamental ConceptsWeek 1 Before the Advent of Database Systems & Fundamental Concepts
Week 1 Before the Advent of Database Systems & Fundamental Concepts
 
Lecture1 data structure(introduction)
Lecture1 data structure(introduction)Lecture1 data structure(introduction)
Lecture1 data structure(introduction)
 
Week 1 Lab Directions
Week 1 Lab DirectionsWeek 1 Lab Directions
Week 1 Lab Directions
 
data resource management
 data resource management data resource management
data resource management
 
Database system concepts
Database system conceptsDatabase system concepts
Database system concepts
 
ความรู้เบื้องต้นฐานข้อมูล 1
ความรู้เบื้องต้นฐานข้อมูล 1ความรู้เบื้องต้นฐานข้อมูล 1
ความรู้เบื้องต้นฐานข้อมูล 1
 
Chapter 6 Database SC025 2017/2018
Chapter 6 Database SC025 2017/2018Chapter 6 Database SC025 2017/2018
Chapter 6 Database SC025 2017/2018
 
Database Concepts and Components
Database Concepts and ComponentsDatabase Concepts and Components
Database Concepts and Components
 
Introduction to DBMS(For College Seminars)
Introduction to DBMS(For College Seminars)Introduction to DBMS(For College Seminars)
Introduction to DBMS(For College Seminars)
 
Database an introduction
Database an introductionDatabase an introduction
Database an introduction
 
11 Database Concepts
11 Database Concepts11 Database Concepts
11 Database Concepts
 
Basic Concept of Database
Basic Concept of DatabaseBasic Concept of Database
Basic Concept of Database
 
database management system (DBMS)
database management system (DBMS)database management system (DBMS)
database management system (DBMS)
 
ADVANCE DATABASE MANAGEMENT SYSTEM CONCEPTS & ARCHITECTURE by vikas jagtap
ADVANCE DATABASE MANAGEMENT SYSTEM CONCEPTS & ARCHITECTURE by vikas jagtapADVANCE DATABASE MANAGEMENT SYSTEM CONCEPTS & ARCHITECTURE by vikas jagtap
ADVANCE DATABASE MANAGEMENT SYSTEM CONCEPTS & ARCHITECTURE by vikas jagtap
 
Database Part 2
Database Part 2Database Part 2
Database Part 2
 
Unit 01 dbms
Unit 01 dbmsUnit 01 dbms
Unit 01 dbms
 
Database concepts
Database conceptsDatabase concepts
Database concepts
 
Big data analytics: Technology's bleeding edge
Big data analytics: Technology's bleeding edgeBig data analytics: Technology's bleeding edge
Big data analytics: Technology's bleeding edge
 
Database Part 1
Database Part 1Database Part 1
Database Part 1
 

Similar to Modern Database Systems - Lecture 00

Big Data with Not Only SQL
Big Data with Not Only SQLBig Data with Not Only SQL
Big Data with Not Only SQLPhilippe Julio
 
Big Data Analytics with Hadoop
Big Data Analytics with HadoopBig Data Analytics with Hadoop
Big Data Analytics with HadoopPhilippe Julio
 
Dwdmunit1 a
Dwdmunit1 aDwdmunit1 a
Dwdmunit1 abhagathk
 
Cloudera Breakfast Series, Analytics Part 1: Use All Your Data
Cloudera Breakfast Series, Analytics Part 1: Use All Your DataCloudera Breakfast Series, Analytics Part 1: Use All Your Data
Cloudera Breakfast Series, Analytics Part 1: Use All Your DataCloudera, Inc.
 
Lecture_1_Intro.pdf
Lecture_1_Intro.pdfLecture_1_Intro.pdf
Lecture_1_Intro.pdfpaijitk
 
Minimizing the Complexities of Machine Learning with Data Virtualization
Minimizing the Complexities of Machine Learning with Data VirtualizationMinimizing the Complexities of Machine Learning with Data Virtualization
Minimizing the Complexities of Machine Learning with Data VirtualizationDenodo
 
Data Science & Big Data - Theory.pdf
Data Science & Big Data - Theory.pdfData Science & Big Data - Theory.pdf
Data Science & Big Data - Theory.pdfRAKESHG79
 
CH005
CH005CH005
CH005JUC
 
HEDW-2020-Using-Data-Virtualization-to-Break-Down-Data-Silos.pptx
HEDW-2020-Using-Data-Virtualization-to-Break-Down-Data-Silos.pptxHEDW-2020-Using-Data-Virtualization-to-Break-Down-Data-Silos.pptx
HEDW-2020-Using-Data-Virtualization-to-Break-Down-Data-Silos.pptxssuser0d9ec0
 
Ch 2-introduction to dbms
Ch 2-introduction to dbmsCh 2-introduction to dbms
Ch 2-introduction to dbmsRupali Rana
 
Laudon_MIS13_ch06.ppt
Laudon_MIS13_ch06.pptLaudon_MIS13_ch06.ppt
Laudon_MIS13_ch06.pptanisur_rehman
 
Databases and Information Management (1).ppt
Databases and Information Management (1).pptDatabases and Information Management (1).ppt
Databases and Information Management (1).pptAlaaShaqfa2
 
Big data unit 2
Big data unit 2Big data unit 2
Big data unit 2RojaT4
 

Similar to Modern Database Systems - Lecture 00 (20)

Big Data with Not Only SQL
Big Data with Not Only SQLBig Data with Not Only SQL
Big Data with Not Only SQL
 
DBMS introduction
DBMS introductionDBMS introduction
DBMS introduction
 
Big Data Analytics with Hadoop
Big Data Analytics with HadoopBig Data Analytics with Hadoop
Big Data Analytics with Hadoop
 
Database part1-
Database part1-Database part1-
Database part1-
 
Dwdmunit1 a
Dwdmunit1 aDwdmunit1 a
Dwdmunit1 a
 
Cloudera Breakfast Series, Analytics Part 1: Use All Your Data
Cloudera Breakfast Series, Analytics Part 1: Use All Your DataCloudera Breakfast Series, Analytics Part 1: Use All Your Data
Cloudera Breakfast Series, Analytics Part 1: Use All Your Data
 
Lecture_1_Intro.pdf
Lecture_1_Intro.pdfLecture_1_Intro.pdf
Lecture_1_Intro.pdf
 
Minimizing the Complexities of Machine Learning with Data Virtualization
Minimizing the Complexities of Machine Learning with Data VirtualizationMinimizing the Complexities of Machine Learning with Data Virtualization
Minimizing the Complexities of Machine Learning with Data Virtualization
 
Data Science & Big Data - Theory.pdf
Data Science & Big Data - Theory.pdfData Science & Big Data - Theory.pdf
Data Science & Big Data - Theory.pdf
 
CH005
CH005CH005
CH005
 
Introduction to Databases by Dr. Kamal Gulati
Introduction to Databases by Dr. Kamal GulatiIntroduction to Databases by Dr. Kamal Gulati
Introduction to Databases by Dr. Kamal Gulati
 
Database management system
Database management systemDatabase management system
Database management system
 
HEDW-2020-Using-Data-Virtualization-to-Break-Down-Data-Silos.pptx
HEDW-2020-Using-Data-Virtualization-to-Break-Down-Data-Silos.pptxHEDW-2020-Using-Data-Virtualization-to-Break-Down-Data-Silos.pptx
HEDW-2020-Using-Data-Virtualization-to-Break-Down-Data-Silos.pptx
 
Ch 2-introduction to dbms
Ch 2-introduction to dbmsCh 2-introduction to dbms
Ch 2-introduction to dbms
 
Laudon_MIS13_ch06.ppt
Laudon_MIS13_ch06.pptLaudon_MIS13_ch06.ppt
Laudon_MIS13_ch06.ppt
 
Databases and Information Management (1).ppt
Databases and Information Management (1).pptDatabases and Information Management (1).ppt
Databases and Information Management (1).ppt
 
Big data unit 2
Big data unit 2Big data unit 2
Big data unit 2
 
Big data.ppt
Big data.pptBig data.ppt
Big data.ppt
 
Lecture1
Lecture1Lecture1
Lecture1
 
Foundations of business intelligence databases and information management
Foundations of business intelligence databases and information managementFoundations of business intelligence databases and information management
Foundations of business intelligence databases and information management
 

More from Michael Mathioudakis

Measuring polarization on social media
Measuring polarization on social mediaMeasuring polarization on social media
Measuring polarization on social mediaMichael Mathioudakis
 
Lecture 07 - CS-5040 - modern database systems
Lecture 07 -  CS-5040 - modern database systemsLecture 07 -  CS-5040 - modern database systems
Lecture 07 - CS-5040 - modern database systemsMichael Mathioudakis
 
Lecture 06 - CS-5040 - modern database systems
Lecture 06  - CS-5040 - modern database systemsLecture 06  - CS-5040 - modern database systems
Lecture 06 - CS-5040 - modern database systemsMichael Mathioudakis
 
Modern Database Systems - Lecture 02
Modern Database Systems - Lecture 02Modern Database Systems - Lecture 02
Modern Database Systems - Lecture 02Michael Mathioudakis
 
Mining the Social Web - Lecture 3 - T61.6020
Mining the Social Web - Lecture 3 - T61.6020Mining the Social Web - Lecture 3 - T61.6020
Mining the Social Web - Lecture 3 - T61.6020Michael Mathioudakis
 
Mining the Social Web - Lecture 2 - T61.6020
Mining the Social Web - Lecture 2 - T61.6020Mining the Social Web - Lecture 2 - T61.6020
Mining the Social Web - Lecture 2 - T61.6020Michael Mathioudakis
 
Mining the Social Web - Lecture 1 - T61.6020 lecture-01-slides
Mining the Social Web - Lecture 1 - T61.6020 lecture-01-slidesMining the Social Web - Lecture 1 - T61.6020 lecture-01-slides
Mining the Social Web - Lecture 1 - T61.6020 lecture-01-slidesMichael Mathioudakis
 
Bump Hunting in the Dark - ICDE15 presentation
Bump Hunting in the Dark - ICDE15 presentationBump Hunting in the Dark - ICDE15 presentation
Bump Hunting in the Dark - ICDE15 presentationMichael Mathioudakis
 

More from Michael Mathioudakis (9)

Measuring polarization on social media
Measuring polarization on social mediaMeasuring polarization on social media
Measuring polarization on social media
 
Lecture 07 - CS-5040 - modern database systems
Lecture 07 -  CS-5040 - modern database systemsLecture 07 -  CS-5040 - modern database systems
Lecture 07 - CS-5040 - modern database systems
 
Lecture 06 - CS-5040 - modern database systems
Lecture 06  - CS-5040 - modern database systemsLecture 06  - CS-5040 - modern database systems
Lecture 06 - CS-5040 - modern database systems
 
Modern Database Systems - Lecture 02
Modern Database Systems - Lecture 02Modern Database Systems - Lecture 02
Modern Database Systems - Lecture 02
 
Mining the Social Web - Lecture 3 - T61.6020
Mining the Social Web - Lecture 3 - T61.6020Mining the Social Web - Lecture 3 - T61.6020
Mining the Social Web - Lecture 3 - T61.6020
 
Mining the Social Web - Lecture 2 - T61.6020
Mining the Social Web - Lecture 2 - T61.6020Mining the Social Web - Lecture 2 - T61.6020
Mining the Social Web - Lecture 2 - T61.6020
 
Mining the Social Web - Lecture 1 - T61.6020 lecture-01-slides
Mining the Social Web - Lecture 1 - T61.6020 lecture-01-slidesMining the Social Web - Lecture 1 - T61.6020 lecture-01-slides
Mining the Social Web - Lecture 1 - T61.6020 lecture-01-slides
 
Absorbing Random Walk Centrality
Absorbing Random Walk CentralityAbsorbing Random Walk Centrality
Absorbing Random Walk Centrality
 
Bump Hunting in the Dark - ICDE15 presentation
Bump Hunting in the Dark - ICDE15 presentationBump Hunting in the Dark - ICDE15 presentation
Bump Hunting in the Dark - ICDE15 presentation
 

Recently uploaded

Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docxPoojaSen20
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17Celine George
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Association for Project Management
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfSherif Taha
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfNirmal Dwivedi
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.pptRamjanShidvankar
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxcallscotland1987
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsMebane Rash
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxheathfieldcps1
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentationcamerronhm
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.christianmathematics
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Third Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptxThird Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptxAmita Gupta
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Jisc
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...Poonam Aher Patil
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...Nguyen Thanh Tu Collection
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxAmanpreet Kaur
 

Recently uploaded (20)

Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptx
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Third Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptxThird Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptx
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 

Modern Database Systems - Lecture 00

  • 1. Modern Database Systems CS-E5040 Aristides Gionis Michael Mathioudakis T.A.: Orestis Kostakis Spring 2016
  • 2. what is a Database Management System (DBMS)? what is a database? a collection of data what is a database management system? ... a.k.a. ‘database system’ software to store, access, administer a database not just a collection of files provides mechanism to query the data transfers data between main memory and secondary storage (disk) enables concurrent access, offers guarantees for data consistency provides crash recovery mechanisms provides security and access control 2
  • 3. why use a DBMS? discuss 3
  • 4. why use a DBMS? separate logical from physical data organization efficient data access guarantee data integrity and security reduce application development time data administration 4
  • 5. why study database systems? discuss 5
  • 6. why study database systems? to manage data efficiently 6
  • 7. consider the following task… data records that contain information about products viewed or purchased from an online store task for each pair of Games products, count the number of customers that have purchased both 7 Product Category Customer Date Price Action other... Portal 2 Games Michael M. 12/01/2015 10€ Purchase ... FLWR Plant Food Garden Aris G. 19/02/2015 32€ View Chase the Rabbit Games Michael M. 23/04/2015 1€ View Portal 2 Games Orestis K. 13/05/2015 10€ Purchase ... > what challenges does case B pose compared to case A? hint limited main memory, disk access, distributed setting case A 10,000 records (0.5MB per record, 5GB total disk space) 10GB of main memory case B 10,000,000 records (~5TB total disk space) stored across 100 nodes (50GB per node), 10GB of main memory per node
  • 8. the main message to manage data efficiently minimize expensive operations e.g., disk access parallelize computation 8
  • 9. why study database systems? to manage data efficiently ... ... from different roles •  develop database systems that match application requirements •  use database systems efficiently o  … knowing how a DBMS •  stores data, •  processes queries, •  and accesses data o  … allows us to ü  organize data appropriately ü  design efficient algorithms to process the data •  combine existing database systems to match requirements •  large variety of data and applications •  “one size fits none” - Michael Stonebraker 9
  • 10. the relational database system 10 database (data stored on disk) DBMS query optimization & execution relational operators files and access methods buffer (memory) management disk space management query interface database design application database user introductory course relational dbms our course relational dbms our course non-relational dbms
  • 11. previously on ‘database systems’... §  relational data model •  relation, attribute, tuple, schema, domain, keys §  relational algebra •  projection, selection, •  cartesian product, natural joins, theta joins, outer joins •  renaming, constraints §  structured query language (SQL) §  schema design •  functional dependencies, normalization §  applications •  embedded sql, drivers 11
  • 12. modern database systems beyond the typical relational DBMS setting... different data models semi-structured data, unstructured text, graphs operations at massive scale big data platforms & map-reduce paradigm, hadoop and spark, cloud computing tailored performance key-value stores, column-stores, in-memory databases, streaming systems 12
  • 13. about this course familiarize ourselves with modern database systems principles and practice database models: data, queries, and computation algorithms - simple queries (e.g., joins) to complex algorithms experience with real technologies emphasis is on understanding of core issues… essentially: the cost of algorithms for different database models and settings you can use what you learn here to: select a database system that fits the demands of your application... … based on supported data model, functionality, optimizations, scalability design your database to fit the needs of your application … e.g., by building appropriate index structures write fast algorithms to process your data… … and estimate their running time adapt your knowledge to the database system you’ll be using 5 years from now 13
  • 14. syllabus 14 part 1: relational database systems (Jan 22 & Jan 29 - Michael) topics relational model (SQL), indexing (b+ trees, hash tables), join algorithms, query optimization technology MySQL part 2: semi-structured data (Feb 5 - Aris) topics semi-structured data abstraction, representation, search, indexing and pipeline aggregation technology MongoDB part 3: unstructured text and information retrieval (Feb 12 & Feb 26 - Aris) topics querying text data, inverted indexes, compression, ranking and evaluation, rank aggregation technology Lucene part 4a: big data platforms - mapreduce (Mar 4 & Mar 11 - Michael) topics mapreduce paradigm, algorithms in mapreduce technology Hadoop part 4b: big data platforms - graph databases (Mar 18 & Apr 1 - Aris) topics the pregel paradigm, algorithms on pregel (pagerank, centrality) technology hadoop giraph, spark graphx
  • 15. logistics 15 instructors Aristides Gionis, Michael Mathioudakis teaching assistant Orestis Kostakis lectures Friday 10-12, Room T3 weeks 2 - 6, 8 - 11, 13 office hours by appointment starting on Monday January 25th curriculum •  slides and course notes •  no single textbook, but slides will provide references for further study announcements •  follow course website on mycourses.aalto.fi when you send email •  aristides.gionis / michael.mathioudakis / orestis.kostakis @ aalto.fi •  subject: [ModernDB] your topic programming assignments •  we’ll provide instances of VirtualBox, ready for use •  you can use campus labs or own laptop •  access to CSC, if needed
  • 16. workload & grading scheme ●  3 assignments + exam ●  25% each ●  need to have at least 50% on each ○  i.e., cannot skip some of the course ●  assignments: ○  pen & paper ■  based on slides + references ○  programming ■  real-world tools, e.g., MySQL, MongoDB, Spark ■  will provide tutorials 16
  • 17. that’s all for now! questions? next week relational model and SQL indexing access cost analysis what to do until then (if you want) SQL 17
  • 18. Credits for some of these slides, we used material from “Database Systems: The Complete Book”, by Garcia-Mollina, Ullman, Widom “Database Management Systems”, by Ramakrishnan and Gehrke 18