SlideShare a Scribd company logo
1 of 24
Dr. Abdul Basit Siddiqui
Assistant Professor
FURC
(Lecture Slides Week # 2)
Why a Data Warehouse (DWH)?
Data recording and storage is growing:
Almost every industry has huge amount of operational data.
Careful use/analysis of historic information may result in
excellent prediction for the future:
Knowledge worker wants to turn available data into useful
information.
This information is used by them to support strategic decision
making.
Gives total view of the organization:
It is a platform for consolidated historical data for analysis.
It stores data of good quality so that knowledge worker can make
correct decisions.
Intelligent decision-support is required for decision-
making.
Data Warehouse & Mining- Spring 201404/19/15 2
Why a Data Warehouse? (Contd.)
From business perspective:
It is latest marketing weapon.
Helps to keep customers by learning more about
their needs.
Valuable tool in today’s competitive fast evolving
world.
Data Warehouse & Mining- Spring 201404/19/15 3
Reason-I: Why a Data Warehouse (DWH)?
Data sets are growing:
How Much Data is that?
1 MB 220
or 106
bytes Small novel 3½ Disk.
1 GB 230
or 109
bytes
Paper reams that could fill the back of a
pickup van.
1 TB 240
or 1012
bytes
50,000 trees chopped and converted into
paper and printed.
2 PB 1 PB = 250
or 1015
bytes Academic research libraries across USA.
5 EB 1 EB = 260
or 1018
bytes
All words ever spoken by the Human
Beings.
Data Warehouse & Mining- Spring 201404/19/15 4
Reason-I: Why a Data Warehouse (DWH)?
Size of Data Sets are going up.
Cost of Data Storage is coming down.
The amount of data average business collects and stores is
doubling every year.
Total hardware and software cost to store and manage 1 MB of
data:
 1990: $ 15
 2002: ¢ 15 (down 100 times)
 2010: < ¢ 1 (down 150 times)
A few examples:
 Wall Mart: 24+ TB
 Finance Telecom: 100+ TB
 CERN: Upto 20 PB by 2006
 Stanford Linear Accelerator Center (SLAC): 500 TB
 Telenor, Ufone, Mobilink, Warid, Zong ???
Data Warehouse & Mining- Spring 201404/19/15 5
Caution!
A Warehouse of Data
is NOT a
Data Warehouse.
Data Warehouse & Mining- Spring 201404/19/15 6
Caution!
Size
is NOT
Everything.
Data Warehouse & Mining- Spring 201404/19/15 7
Reason-2: Why a Data Warehouse (DWH)?
DBMS Approach
 List of all items that were sold last
month?
 List of all makeup items
purchased by Sassi?
 The total sales of the last month
grouped by branch?
 How many sales transactions
occurred during the month of
January?
Intelligent Enterprise
 Which items sell together? Which
items to stock?
 Where and how to place the
items? What discounts to offer?
 How best to target customers to
increase sales at a branch?
 Which customers are most likely
to respond to my next
promotional campaign, and why?
Data Warehouse & Mining- Spring 2014
 Businesses demand Intelligence (BI).
 Complex questions from integrated data.
 “Intelligent Enterprise”
04/19/15 8
Reason-3: Why a Data Warehouse (DWH)?
Businesses want much more …
What happened?
Why it happened?
What will happen?
What is happening?
What do you want to happen?
Data Warehouse & Mining- Spring 201404/19/15 9
What is a Data Warehouse?
A complete repository of historical
corporate data extracted from
transaction systems that is
available for ad-hoc access by
knowledge workers.
Data Warehouse & Mining- Spring 201404/19/15 10
What is a Data Warehouse?
Transaction System:
Management Information System (MIS)
Could be typed sheets (NOT transaction system)
Ad-Hoc Access:
Does not have a certain access pattern
Queries not known in advance
Difficult to write SQL in advance
Knowledge Workers:
Typically NOT IT literate (Executives, Analysts, Managers)
NOT clerical workers
Decision makers
Data Warehouse & Mining- Spring 201404/19/15 11
What is a Data Warehouse?
Inmons’s Definition:
A Data Warehouse is:
 Subject-oriented
 Integrated
 Time-variant
 Nonvolatile
Collection of data in support of management’s
decision making process.
Data Warehouse & Mining- Spring 201404/19/15 12
Another View of a DWH
Data Warehouse & Mining- Spring 2014
Subject
Oriented
Integrated
Time Variant
Non Volatile
04/19/15 13
Subject-oriented
Data Warehouse is organized around subjects such as sales,
product, customer.
It focuses on modeling and analysis of data for decision makers.
Excludes data not useful in decision support process.
Data Warehouse & Mining- Spring 201404/19/15 14
Integration
Data Warehouse is constructed by integrating multiple
heterogeneous sources.
Data Preprocessing are applied to ensure consistency.
Data Warehouse & Mining- Spring 2014
RDBMS
Legacy
System
Data
Warehouse
Flat File Data Processing
Data Transformation
04/19/15 15
Time-variant
Provides information from historical perspective e.g.
past 5-10 years.
Every key structure contains either implicitly or
explicitly an element of time.
Data Warehouse & Mining- Spring 201404/19/15 16
Nonvolatile
Data once recorded cannot be updated.
Data Warehouse requires two operations in data
accessing
Initial loading of data
Access of data
Data Warehouse & Mining- Spring 2014
load
access
04/19/15 17
Summary: What is a Data Warehouse?
It is a blend of many technologies, the basic
concept being:
Take all data from different operational systems
If necessary, add relevant data from industry
Transform all data and bring into a uniform format
Integrate all data as a single entity
Store data in a format supporting easy access for
decision support
Create performance enhancing indices
Implement performance enhancement joins
Run ad-hoc queries with slow selectivity
Data Warehouse & Mining- Spring 201404/19/15 18
Benefits of Data Warehouse
High returns on investment.
Substantial competitive advantage.
Increased productivity of corporate decision-makers.
Fast reporting for decision making process.
Reduced reporting load on transactional systems.
Making institutional data more user-friendly and
accessible for knowledge workers.
Integrated data from different source systems.
Enabled ‘point-in-time’ analysis and trending over time.
Helps in identifying and resolving data integrity issues,
either in the warehouse itself or in the source systems
that collect the data.
Data Warehouse & Mining- Spring 201404/19/15 19
Data Warehouse: How is it Different?
1. Decision making is Ad-Hoc
Data Warehouse & Mining- Spring 201404/19/15 20
Data Warehouse: How is it Different?
2. Different patterns of hardware utilization
Data Warehouse & Mining- Spring 2014
Bus Service vs. Train
04/19/15 21
Data Warehouse: How is it Different?
3. Combines operational and historic data
 Don’t do data entry into a DWH. OLTP or ERP are the
source systems.
 OLTP systems don’t keep history, cannot get balance
statement more than a year old.
 DWH keep historical data, even of bygone customers.
Why?
 In the context of bank, want to know why the customer
left?
 What are the events that led to his/her leaving? Why?
 Customer retention
Data Warehouse & Mining- Spring 201404/19/15 22
Data Warehouse: How is it Different?
How much history?
 Depends on:
 Industry
 Cost of storing historical data
 Economic value of historical data
 Industry and history
 Telecom calls are much much more as compared to bank
transactions
 18 months
 Retailers interested in analyzing yearly seasonal patterns
 65 weeks, why?
 Insurance companies want to do actuary analysis, use the
historical data in order to predict risk
 7 years
Hence NOT a complete repository of data.
Data Warehouse & Mining- Spring 201404/19/15 23
Data Warehouse: How is it Different?
How much history?
Economic value of data vs. storage cost
Data Warehouse a complete repository of data?
Data Warehouse & Mining- Spring 201404/19/15 24

More Related Content

What's hot

1.4 data warehouse
1.4 data warehouse1.4 data warehouse
1.4 data warehouseKrish_ver2
 
Data warehouse implementation design for a Retail business
Data warehouse implementation design for a Retail businessData warehouse implementation design for a Retail business
Data warehouse implementation design for a Retail businessArsalan Qadri
 
Data Warehousing and Mining
Data Warehousing and MiningData Warehousing and Mining
Data Warehousing and Miningethantelaviv
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data WarehousingEyad Manna
 
Ch1 data-warehousing
Ch1 data-warehousingCh1 data-warehousing
Ch1 data-warehousingAhmad Shlool
 
Ch1 data-warehousing
Ch1 data-warehousingCh1 data-warehousing
Ch1 data-warehousingAhmad Shlool
 
Data warehousing
Data warehousingData warehousing
Data warehousingVarun Jain
 
Data warehouse and olap technology
Data warehouse and olap technologyData warehouse and olap technology
Data warehouse and olap technologyDataminingTools Inc
 
Big Data, analytics and 4th generation data warehousing by Martyn Jones at Bi...
Big Data, analytics and 4th generation data warehousing by Martyn Jones at Bi...Big Data, analytics and 4th generation data warehousing by Martyn Jones at Bi...
Big Data, analytics and 4th generation data warehousing by Martyn Jones at Bi...Big Data Spain
 
Dw Concepts
Dw ConceptsDw Concepts
Dw Conceptsdataware
 
Dataware housing
Dataware housingDataware housing
Dataware housingwork
 
Basics of Microsoft Business Intelligence and Data Integration Techniques
Basics of Microsoft Business Intelligence and Data Integration TechniquesBasics of Microsoft Business Intelligence and Data Integration Techniques
Basics of Microsoft Business Intelligence and Data Integration TechniquesValmik Potbhare
 
Traditional Data-warehousing / BI overview
Traditional Data-warehousing / BI overviewTraditional Data-warehousing / BI overview
Traditional Data-warehousing / BI overviewNagaraj Yerram
 

What's hot (20)

1.4 data warehouse
1.4 data warehouse1.4 data warehouse
1.4 data warehouse
 
Column Oriented Databases
Column Oriented DatabasesColumn Oriented Databases
Column Oriented Databases
 
Retail Data Warehouse
Retail Data WarehouseRetail Data Warehouse
Retail Data Warehouse
 
Data warehouse implementation design for a Retail business
Data warehouse implementation design for a Retail businessData warehouse implementation design for a Retail business
Data warehouse implementation design for a Retail business
 
Data Warehousing and Mining
Data Warehousing and MiningData Warehousing and Mining
Data Warehousing and Mining
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
 
Ch1 data-warehousing
Ch1 data-warehousingCh1 data-warehousing
Ch1 data-warehousing
 
Ch1 data-warehousing
Ch1 data-warehousingCh1 data-warehousing
Ch1 data-warehousing
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
DATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MININGDATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MINING
 
Data warehouse and olap technology
Data warehouse and olap technologyData warehouse and olap technology
Data warehouse and olap technology
 
Big Data, analytics and 4th generation data warehousing by Martyn Jones at Bi...
Big Data, analytics and 4th generation data warehousing by Martyn Jones at Bi...Big Data, analytics and 4th generation data warehousing by Martyn Jones at Bi...
Big Data, analytics and 4th generation data warehousing by Martyn Jones at Bi...
 
Dw Concepts
Dw ConceptsDw Concepts
Dw Concepts
 
Dataware housing
Dataware housingDataware housing
Dataware housing
 
Basics of Microsoft Business Intelligence and Data Integration Techniques
Basics of Microsoft Business Intelligence and Data Integration TechniquesBasics of Microsoft Business Intelligence and Data Integration Techniques
Basics of Microsoft Business Intelligence and Data Integration Techniques
 
Traditional Data-warehousing / BI overview
Traditional Data-warehousing / BI overviewTraditional Data-warehousing / BI overview
Traditional Data-warehousing / BI overview
 
ITReady DW Day2
ITReady DW Day2ITReady DW Day2
ITReady DW Day2
 
Unit4
Unit4Unit4
Unit4
 
2dw
2dw2dw
2dw
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
 

Similar to Dwh lecture slides-week3&4

krithi-talk-impact.ppt
krithi-talk-impact.pptkrithi-talk-impact.ppt
krithi-talk-impact.pptKRISHNARAJ207
 
Data Mining Concepts and Techniques
Data Mining Concepts and TechniquesData Mining Concepts and Techniques
Data Mining Concepts and TechniquesPratik Tambekar
 
UNIT - 1 : Part 1: Data Warehousing and Data Mining
UNIT - 1 : Part 1: Data Warehousing and Data MiningUNIT - 1 : Part 1: Data Warehousing and Data Mining
UNIT - 1 : Part 1: Data Warehousing and Data MiningNandakumar P
 
Data warehouse
Data warehouseData warehouse
Data warehouseMR Z
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousingwork
 
Data warehousev2.1
Data warehousev2.1Data warehousev2.1
Data warehousev2.1Tuan Luong
 
presentationofism-complete-1-100227093028-phpapp01.pptx
presentationofism-complete-1-100227093028-phpapp01.pptxpresentationofism-complete-1-100227093028-phpapp01.pptx
presentationofism-complete-1-100227093028-phpapp01.pptxvipush1
 
Data-Ed Webinar: Data Warehouse Strategies
Data-Ed Webinar: Data Warehouse StrategiesData-Ed Webinar: Data Warehouse Strategies
Data-Ed Webinar: Data Warehouse StrategiesDATAVERSITY
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSINGKing Julian
 
DWDM Unit 1 (1).pptx
DWDM Unit 1 (1).pptxDWDM Unit 1 (1).pptx
DWDM Unit 1 (1).pptxSalehaMariyam
 
Lecture 01.ppt
Lecture 01.pptLecture 01.ppt
Lecture 01.pptHFLEX
 
Data warehousing interview questions
Data warehousing interview questionsData warehousing interview questions
Data warehousing interview questionsSatyam Jaiswal
 

Similar to Dwh lecture slides-week3&4 (20)

Data warehousing
Data warehousingData warehousing
Data warehousing
 
DWM
DWMDWM
DWM
 
krithi-talk-impact.ppt
krithi-talk-impact.pptkrithi-talk-impact.ppt
krithi-talk-impact.ppt
 
krithi-talk-impact.ppt
krithi-talk-impact.pptkrithi-talk-impact.ppt
krithi-talk-impact.ppt
 
Data Mining Concepts and Techniques
Data Mining Concepts and TechniquesData Mining Concepts and Techniques
Data Mining Concepts and Techniques
 
UNIT - 1 : Part 1: Data Warehousing and Data Mining
UNIT - 1 : Part 1: Data Warehousing and Data MiningUNIT - 1 : Part 1: Data Warehousing and Data Mining
UNIT - 1 : Part 1: Data Warehousing and Data Mining
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
IT Ready - DW: 1st Day
IT Ready - DW: 1st Day IT Ready - DW: 1st Day
IT Ready - DW: 1st Day
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousing
 
Data warehousev2.1
Data warehousev2.1Data warehousev2.1
Data warehousev2.1
 
presentationofism-complete-1-100227093028-phpapp01.pptx
presentationofism-complete-1-100227093028-phpapp01.pptxpresentationofism-complete-1-100227093028-phpapp01.pptx
presentationofism-complete-1-100227093028-phpapp01.pptx
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Data-Ed Webinar: Data Warehouse Strategies
Data-Ed Webinar: Data Warehouse StrategiesData-Ed Webinar: Data Warehouse Strategies
Data-Ed Webinar: Data Warehouse Strategies
 
Data warehousing and Data mining
Data warehousing and Data mining Data warehousing and Data mining
Data warehousing and Data mining
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
 
Datawarehouse
DatawarehouseDatawarehouse
Datawarehouse
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
DWDM Unit 1 (1).pptx
DWDM Unit 1 (1).pptxDWDM Unit 1 (1).pptx
DWDM Unit 1 (1).pptx
 
Lecture 01.ppt
Lecture 01.pptLecture 01.ppt
Lecture 01.ppt
 
Data warehousing interview questions
Data warehousing interview questionsData warehousing interview questions
Data warehousing interview questions
 

More from Shani729

Python tutorialfeb152012
Python tutorialfeb152012Python tutorialfeb152012
Python tutorialfeb152012Shani729
 
Python tutorial
Python tutorialPython tutorial
Python tutorialShani729
 
Interaction design _beyond_human_computer_interaction
Interaction design _beyond_human_computer_interactionInteraction design _beyond_human_computer_interaction
Interaction design _beyond_human_computer_interactionShani729
 
Fm lecturer 13(final)
Fm lecturer 13(final)Fm lecturer 13(final)
Fm lecturer 13(final)Shani729
 
Lecture slides week14-15
Lecture slides week14-15Lecture slides week14-15
Lecture slides week14-15Shani729
 
Frequent itemset mining using pattern growth method
Frequent itemset mining using pattern growth methodFrequent itemset mining using pattern growth method
Frequent itemset mining using pattern growth methodShani729
 
Dwh lecture slides-week15
Dwh lecture slides-week15Dwh lecture slides-week15
Dwh lecture slides-week15Shani729
 
Dwh lecture slides-week10
Dwh lecture slides-week10Dwh lecture slides-week10
Dwh lecture slides-week10Shani729
 
Dwh lecture slidesweek7&8
Dwh lecture slidesweek7&8Dwh lecture slidesweek7&8
Dwh lecture slidesweek7&8Shani729
 
Dwh lecture slides-week5&6
Dwh lecture slides-week5&6Dwh lecture slides-week5&6
Dwh lecture slides-week5&6Shani729
 
Dwh lecture slides-week1
Dwh lecture slides-week1Dwh lecture slides-week1
Dwh lecture slides-week1Shani729
 
Dwh lecture slides-week 13
Dwh lecture slides-week 13Dwh lecture slides-week 13
Dwh lecture slides-week 13Shani729
 
Data warehousing and mining furc
Data warehousing and mining furcData warehousing and mining furc
Data warehousing and mining furcShani729
 
Lecture 40
Lecture 40Lecture 40
Lecture 40Shani729
 
Lecture 39
Lecture 39Lecture 39
Lecture 39Shani729
 
Lecture 38
Lecture 38Lecture 38
Lecture 38Shani729
 
Lecture 37
Lecture 37Lecture 37
Lecture 37Shani729
 
Lecture 35
Lecture 35Lecture 35
Lecture 35Shani729
 
Lecture 36
Lecture 36Lecture 36
Lecture 36Shani729
 
Lecture 34
Lecture 34Lecture 34
Lecture 34Shani729
 

More from Shani729 (20)

Python tutorialfeb152012
Python tutorialfeb152012Python tutorialfeb152012
Python tutorialfeb152012
 
Python tutorial
Python tutorialPython tutorial
Python tutorial
 
Interaction design _beyond_human_computer_interaction
Interaction design _beyond_human_computer_interactionInteraction design _beyond_human_computer_interaction
Interaction design _beyond_human_computer_interaction
 
Fm lecturer 13(final)
Fm lecturer 13(final)Fm lecturer 13(final)
Fm lecturer 13(final)
 
Lecture slides week14-15
Lecture slides week14-15Lecture slides week14-15
Lecture slides week14-15
 
Frequent itemset mining using pattern growth method
Frequent itemset mining using pattern growth methodFrequent itemset mining using pattern growth method
Frequent itemset mining using pattern growth method
 
Dwh lecture slides-week15
Dwh lecture slides-week15Dwh lecture slides-week15
Dwh lecture slides-week15
 
Dwh lecture slides-week10
Dwh lecture slides-week10Dwh lecture slides-week10
Dwh lecture slides-week10
 
Dwh lecture slidesweek7&8
Dwh lecture slidesweek7&8Dwh lecture slidesweek7&8
Dwh lecture slidesweek7&8
 
Dwh lecture slides-week5&6
Dwh lecture slides-week5&6Dwh lecture slides-week5&6
Dwh lecture slides-week5&6
 
Dwh lecture slides-week1
Dwh lecture slides-week1Dwh lecture slides-week1
Dwh lecture slides-week1
 
Dwh lecture slides-week 13
Dwh lecture slides-week 13Dwh lecture slides-week 13
Dwh lecture slides-week 13
 
Data warehousing and mining furc
Data warehousing and mining furcData warehousing and mining furc
Data warehousing and mining furc
 
Lecture 40
Lecture 40Lecture 40
Lecture 40
 
Lecture 39
Lecture 39Lecture 39
Lecture 39
 
Lecture 38
Lecture 38Lecture 38
Lecture 38
 
Lecture 37
Lecture 37Lecture 37
Lecture 37
 
Lecture 35
Lecture 35Lecture 35
Lecture 35
 
Lecture 36
Lecture 36Lecture 36
Lecture 36
 
Lecture 34
Lecture 34Lecture 34
Lecture 34
 

Recently uploaded

University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdfKamal Acharya
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Bookingdharasingh5698
 
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VDineshKumar4165
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile
 
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...Arindam Chakraborty, Ph.D., P.E. (CA, TX)
 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...SUHANI PANDEY
 
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank  Design by Working Stress - IS Method.pdfIntze Overhead Water Tank  Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank Design by Working Stress - IS Method.pdfSuman Jyoti
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdfankushspencer015
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...Call Girls in Nagpur High Profile
 
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlysanyuktamishra911
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . pptDineshKumar4165
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfKamal Acharya
 
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTbhaskargani46
 
Unit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdfUnit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdfRagavanV2
 

Recently uploaded (20)

University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdf
 
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
 
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
 
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank  Design by Working Stress - IS Method.pdfIntze Overhead Water Tank  Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
 
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
 
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
 
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPT
 
NFPA 5000 2024 standard .
NFPA 5000 2024 standard                                  .NFPA 5000 2024 standard                                  .
NFPA 5000 2024 standard .
 
Unit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdfUnit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdf
 

Dwh lecture slides-week3&4

  • 1. Dr. Abdul Basit Siddiqui Assistant Professor FURC (Lecture Slides Week # 2)
  • 2. Why a Data Warehouse (DWH)? Data recording and storage is growing: Almost every industry has huge amount of operational data. Careful use/analysis of historic information may result in excellent prediction for the future: Knowledge worker wants to turn available data into useful information. This information is used by them to support strategic decision making. Gives total view of the organization: It is a platform for consolidated historical data for analysis. It stores data of good quality so that knowledge worker can make correct decisions. Intelligent decision-support is required for decision- making. Data Warehouse & Mining- Spring 201404/19/15 2
  • 3. Why a Data Warehouse? (Contd.) From business perspective: It is latest marketing weapon. Helps to keep customers by learning more about their needs. Valuable tool in today’s competitive fast evolving world. Data Warehouse & Mining- Spring 201404/19/15 3
  • 4. Reason-I: Why a Data Warehouse (DWH)? Data sets are growing: How Much Data is that? 1 MB 220 or 106 bytes Small novel 3½ Disk. 1 GB 230 or 109 bytes Paper reams that could fill the back of a pickup van. 1 TB 240 or 1012 bytes 50,000 trees chopped and converted into paper and printed. 2 PB 1 PB = 250 or 1015 bytes Academic research libraries across USA. 5 EB 1 EB = 260 or 1018 bytes All words ever spoken by the Human Beings. Data Warehouse & Mining- Spring 201404/19/15 4
  • 5. Reason-I: Why a Data Warehouse (DWH)? Size of Data Sets are going up. Cost of Data Storage is coming down. The amount of data average business collects and stores is doubling every year. Total hardware and software cost to store and manage 1 MB of data:  1990: $ 15  2002: ¢ 15 (down 100 times)  2010: < ¢ 1 (down 150 times) A few examples:  Wall Mart: 24+ TB  Finance Telecom: 100+ TB  CERN: Upto 20 PB by 2006  Stanford Linear Accelerator Center (SLAC): 500 TB  Telenor, Ufone, Mobilink, Warid, Zong ??? Data Warehouse & Mining- Spring 201404/19/15 5
  • 6. Caution! A Warehouse of Data is NOT a Data Warehouse. Data Warehouse & Mining- Spring 201404/19/15 6
  • 7. Caution! Size is NOT Everything. Data Warehouse & Mining- Spring 201404/19/15 7
  • 8. Reason-2: Why a Data Warehouse (DWH)? DBMS Approach  List of all items that were sold last month?  List of all makeup items purchased by Sassi?  The total sales of the last month grouped by branch?  How many sales transactions occurred during the month of January? Intelligent Enterprise  Which items sell together? Which items to stock?  Where and how to place the items? What discounts to offer?  How best to target customers to increase sales at a branch?  Which customers are most likely to respond to my next promotional campaign, and why? Data Warehouse & Mining- Spring 2014  Businesses demand Intelligence (BI).  Complex questions from integrated data.  “Intelligent Enterprise” 04/19/15 8
  • 9. Reason-3: Why a Data Warehouse (DWH)? Businesses want much more … What happened? Why it happened? What will happen? What is happening? What do you want to happen? Data Warehouse & Mining- Spring 201404/19/15 9
  • 10. What is a Data Warehouse? A complete repository of historical corporate data extracted from transaction systems that is available for ad-hoc access by knowledge workers. Data Warehouse & Mining- Spring 201404/19/15 10
  • 11. What is a Data Warehouse? Transaction System: Management Information System (MIS) Could be typed sheets (NOT transaction system) Ad-Hoc Access: Does not have a certain access pattern Queries not known in advance Difficult to write SQL in advance Knowledge Workers: Typically NOT IT literate (Executives, Analysts, Managers) NOT clerical workers Decision makers Data Warehouse & Mining- Spring 201404/19/15 11
  • 12. What is a Data Warehouse? Inmons’s Definition: A Data Warehouse is:  Subject-oriented  Integrated  Time-variant  Nonvolatile Collection of data in support of management’s decision making process. Data Warehouse & Mining- Spring 201404/19/15 12
  • 13. Another View of a DWH Data Warehouse & Mining- Spring 2014 Subject Oriented Integrated Time Variant Non Volatile 04/19/15 13
  • 14. Subject-oriented Data Warehouse is organized around subjects such as sales, product, customer. It focuses on modeling and analysis of data for decision makers. Excludes data not useful in decision support process. Data Warehouse & Mining- Spring 201404/19/15 14
  • 15. Integration Data Warehouse is constructed by integrating multiple heterogeneous sources. Data Preprocessing are applied to ensure consistency. Data Warehouse & Mining- Spring 2014 RDBMS Legacy System Data Warehouse Flat File Data Processing Data Transformation 04/19/15 15
  • 16. Time-variant Provides information from historical perspective e.g. past 5-10 years. Every key structure contains either implicitly or explicitly an element of time. Data Warehouse & Mining- Spring 201404/19/15 16
  • 17. Nonvolatile Data once recorded cannot be updated. Data Warehouse requires two operations in data accessing Initial loading of data Access of data Data Warehouse & Mining- Spring 2014 load access 04/19/15 17
  • 18. Summary: What is a Data Warehouse? It is a blend of many technologies, the basic concept being: Take all data from different operational systems If necessary, add relevant data from industry Transform all data and bring into a uniform format Integrate all data as a single entity Store data in a format supporting easy access for decision support Create performance enhancing indices Implement performance enhancement joins Run ad-hoc queries with slow selectivity Data Warehouse & Mining- Spring 201404/19/15 18
  • 19. Benefits of Data Warehouse High returns on investment. Substantial competitive advantage. Increased productivity of corporate decision-makers. Fast reporting for decision making process. Reduced reporting load on transactional systems. Making institutional data more user-friendly and accessible for knowledge workers. Integrated data from different source systems. Enabled ‘point-in-time’ analysis and trending over time. Helps in identifying and resolving data integrity issues, either in the warehouse itself or in the source systems that collect the data. Data Warehouse & Mining- Spring 201404/19/15 19
  • 20. Data Warehouse: How is it Different? 1. Decision making is Ad-Hoc Data Warehouse & Mining- Spring 201404/19/15 20
  • 21. Data Warehouse: How is it Different? 2. Different patterns of hardware utilization Data Warehouse & Mining- Spring 2014 Bus Service vs. Train 04/19/15 21
  • 22. Data Warehouse: How is it Different? 3. Combines operational and historic data  Don’t do data entry into a DWH. OLTP or ERP are the source systems.  OLTP systems don’t keep history, cannot get balance statement more than a year old.  DWH keep historical data, even of bygone customers. Why?  In the context of bank, want to know why the customer left?  What are the events that led to his/her leaving? Why?  Customer retention Data Warehouse & Mining- Spring 201404/19/15 22
  • 23. Data Warehouse: How is it Different? How much history?  Depends on:  Industry  Cost of storing historical data  Economic value of historical data  Industry and history  Telecom calls are much much more as compared to bank transactions  18 months  Retailers interested in analyzing yearly seasonal patterns  65 weeks, why?  Insurance companies want to do actuary analysis, use the historical data in order to predict risk  7 years Hence NOT a complete repository of data. Data Warehouse & Mining- Spring 201404/19/15 23
  • 24. Data Warehouse: How is it Different? How much history? Economic value of data vs. storage cost Data Warehouse a complete repository of data? Data Warehouse & Mining- Spring 201404/19/15 24