SlideShare a Scribd company logo
1 of 11
Data Mining
Data mining (the analysis step of the "Knowledge Discovery in
Databases" process, or KDD),an interdisciplinary subfield
of computer science, is the computational process of discovering
patterns in large data sets involving methods at the intersection
of artificial intelligence, machine learning, statistics, and database
systems
What is Data Mining?
 Data Mining, also known as Knowledge-
Discovery in Databases (KDD), is the process
of automatically searching large volumes of
data for patterns.
 Data Mining applies many older
computational techniques from statistics,
machine learning and pattern recognition
Data mining consists of five
major elements:
 Extract, transform, and load transaction data
onto the data warehouse system.
 Store and manage the data in a
multidimensional database system.
 Provide data access to business analysts and
information technology professionals.
 Analyze the data by application software.
 Present the data in a useful format, such as a
graph or table.
Data Mining Goal
 The ultimate goal of data mining is
prediction - and predictive data mining
is the most common type of data mining
and one that has the most direct
business applications.
3 Steps Data Mining Process
 Stage 1: Exploration. This stage usually starts
with data preparation which may involve cleaning
data, data transformations, selecting subsets of
records
 Stage 2: Model building and validation. This
stage involves considering various models and
choosing the best one based on their predictive
performance
 Stage 3: Deployment. That final stage involves
using the model selected as best in the previous
stage and applying it to new data in order to generate
predictions or estimates of the expected outcome
Some of the tools used for
data mining are:
 Artificial neural networks - Non-linear predictive models that
learn through training and resemble biological neural networks
in structure.
 Decision trees - Tree-shaped structures that represent sets of
decisions. These decisions generate rules for the classification
of a dataset.
 Rule induction - The extraction of useful if-then rules from data
based on statistical significance.
 Genetic algorithms - Optimization techniques based on the
concepts of genetic combination, mutation, and natural
selection.
 Nearest neighbor - A classification technique that classifies
each record based on the records most similar to it in an
historical database.
Reasons for the growing
popularity of Data Mining
 Growing Data Volume
 Limitations of Human Analysis
 Low Cost of Machine Learning
ADVANTAGES OF DATA
MINING
 Marking/Retailing: Data mining can
aid direct marketers by providing them
with useful and accurate trends about
their customers’ purchasing behavior.
 Banking/Crediting: Data mining can
assist financial institutions in areas such
as credit reporting and loan
information.    
ADVANTAGES OF DATA
MINING Cont…
 Law enforcement: Data mining can aid law
enforcers in identifying criminal suspects as
well as apprehending these criminals by
examining trends in location, crime type,
habit, and other patterns of behaviors.
 Researchers: Data mining can assist
researchers by speeding up their data
analyzing process; thus, allowing them more
time to work on other projects.   
DISADVANTAGES OF
DATA MINING
 Privacy Issues: For example,
according to Washing Post, in 1998,
CVS had sold their patient’s
prescription purchases to a different
company
 American Express also sold their
customers’ credit card purchases to
another company.
DISADVANTAGES OF
DATA MINING Cont…
 Security issues: Although companies have a lot of
personal information about us available online, they
do not have sufficient security systems in place to
protect that information. 
 Misuse of information: Some of the company will
answer your phone based on your purchase history.
If you have spent a lot of money or buying
a lot of product from one company, your call will be
answered really soon. So you should not think that
your call is really being answer in the order in which it
was receive.

More Related Content

What's hot

Data Mining: Future Trends and Applications
Data Mining: Future Trends and ApplicationsData Mining: Future Trends and Applications
Data Mining: Future Trends and ApplicationsIJMER
 
Data Mining: What is Data Mining?
Data Mining: What is Data Mining?Data Mining: What is Data Mining?
Data Mining: What is Data Mining?Seerat Malik
 
Data mining services
Data mining servicesData mining services
Data mining servicesRashmiS08
 
Data Mining & Applications
Data Mining & ApplicationsData Mining & Applications
Data Mining & ApplicationsFazle Rabbi Ador
 
Introduction to Data Mining
Introduction to Data Mining Introduction to Data Mining
Introduction to Data Mining Sushil Kulkarni
 
Importance of Data Mining
Importance of Data MiningImportance of Data Mining
Importance of Data MiningScottperrone
 
Chapter 08 Data Mining Techniques
Chapter 08 Data Mining Techniques Chapter 08 Data Mining Techniques
Chapter 08 Data Mining Techniques Houw Liong The
 
What is Data mining? Data mining Presentation
What is Data mining? Data mining Presentation What is Data mining? Data mining Presentation
What is Data mining? Data mining Presentation Pralhad Rijal
 
data mining and data warehousing
data mining and data warehousingdata mining and data warehousing
data mining and data warehousingSunny Gandhi
 
Data Mining Concepts
Data Mining ConceptsData Mining Concepts
Data Mining ConceptsDung Nguyen
 
Data mining seminar report
Data mining seminar reportData mining seminar report
Data mining seminar reportmayurik19
 
Data mining concepts and work
Data mining concepts and workData mining concepts and work
Data mining concepts and workAmr Abd El Latief
 
Introduction to data mining technique
Introduction to data mining techniqueIntroduction to data mining technique
Introduction to data mining techniquePawneshwar Datt Rai
 

What's hot (19)

Data Mining: Future Trends and Applications
Data Mining: Future Trends and ApplicationsData Mining: Future Trends and Applications
Data Mining: Future Trends and Applications
 
Data Mining: What is Data Mining?
Data Mining: What is Data Mining?Data Mining: What is Data Mining?
Data Mining: What is Data Mining?
 
Data mining and its applications!
Data mining and its applications!Data mining and its applications!
Data mining and its applications!
 
Data mining services
Data mining servicesData mining services
Data mining services
 
Data mining
Data miningData mining
Data mining
 
Data Mining & Applications
Data Mining & ApplicationsData Mining & Applications
Data Mining & Applications
 
Data mining
Data miningData mining
Data mining
 
Introduction to Data Mining
Introduction to Data Mining Introduction to Data Mining
Introduction to Data Mining
 
Importance of Data Mining
Importance of Data MiningImportance of Data Mining
Importance of Data Mining
 
Chapter 08 Data Mining Techniques
Chapter 08 Data Mining Techniques Chapter 08 Data Mining Techniques
Chapter 08 Data Mining Techniques
 
What is Data mining? Data mining Presentation
What is Data mining? Data mining Presentation What is Data mining? Data mining Presentation
What is Data mining? Data mining Presentation
 
data mining and data warehousing
data mining and data warehousingdata mining and data warehousing
data mining and data warehousing
 
Data Mining Concepts
Data Mining ConceptsData Mining Concepts
Data Mining Concepts
 
Data mining seminar report
Data mining seminar reportData mining seminar report
Data mining seminar report
 
Data mining concepts and work
Data mining concepts and workData mining concepts and work
Data mining concepts and work
 
Data Mining
Data MiningData Mining
Data Mining
 
Introduction to Data Mining
Introduction to Data MiningIntroduction to Data Mining
Introduction to Data Mining
 
Introduction to data mining technique
Introduction to data mining techniqueIntroduction to data mining technique
Introduction to data mining technique
 
Data mining
Data miningData mining
Data mining
 

Viewers also liked

Data mining (lecture 1 & 2) conecpts and techniques
Data mining (lecture 1 & 2) conecpts and techniquesData mining (lecture 1 & 2) conecpts and techniques
Data mining (lecture 1 & 2) conecpts and techniquesSaif Ullah
 
Ch12.ed wk9businessintelligenceanddecisionsupportsystem
Ch12.ed wk9businessintelligenceanddecisionsupportsystemCh12.ed wk9businessintelligenceanddecisionsupportsystem
Ch12.ed wk9businessintelligenceanddecisionsupportsystemNorhisham Mohamad Nordin
 
Summarization Techniques in Association Rule Data Mining For Risk Assessment ...
Summarization Techniques in Association Rule Data Mining For Risk Assessment ...Summarization Techniques in Association Rule Data Mining For Risk Assessment ...
Summarization Techniques in Association Rule Data Mining For Risk Assessment ...IJTET Journal
 
Crm unit iv (technological tools for crm)
Crm unit iv (technological tools for crm)Crm unit iv (technological tools for crm)
Crm unit iv (technological tools for crm)Revisiting Strategy
 
What is Data Mining - Olu Campbell
What is Data Mining - Olu CampbellWhat is Data Mining - Olu Campbell
What is Data Mining - Olu CampbellOlu Campbell
 
Chapter 24
Chapter 24Chapter 24
Chapter 24bodo-con
 
An Introduction to Data Mining
An Introduction to Data MiningAn Introduction to Data Mining
An Introduction to Data Miningbutest
 
Concept description characterization and comparison
Concept description characterization and comparisonConcept description characterization and comparison
Concept description characterization and comparisonric_biet
 
Data mining techniques for malware detection.pptx
Data mining techniques for malware detection.pptxData mining techniques for malware detection.pptx
Data mining techniques for malware detection.pptxAditya Deshmukh
 
Research in data mining
Research in data miningResearch in data mining
Research in data miningHouw Liong The
 

Viewers also liked (16)

Data mining (lecture 1 & 2) conecpts and techniques
Data mining (lecture 1 & 2) conecpts and techniquesData mining (lecture 1 & 2) conecpts and techniques
Data mining (lecture 1 & 2) conecpts and techniques
 
DATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MININGDATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MINING
 
Data mining
Data miningData mining
Data mining
 
Ymag56 hr
Ymag56 hrYmag56 hr
Ymag56 hr
 
Ch12.ed wk9businessintelligenceanddecisionsupportsystem
Ch12.ed wk9businessintelligenceanddecisionsupportsystemCh12.ed wk9businessintelligenceanddecisionsupportsystem
Ch12.ed wk9businessintelligenceanddecisionsupportsystem
 
Summarization Techniques in Association Rule Data Mining For Risk Assessment ...
Summarization Techniques in Association Rule Data Mining For Risk Assessment ...Summarization Techniques in Association Rule Data Mining For Risk Assessment ...
Summarization Techniques in Association Rule Data Mining For Risk Assessment ...
 
Crm unit iv (technological tools for crm)
Crm unit iv (technological tools for crm)Crm unit iv (technological tools for crm)
Crm unit iv (technological tools for crm)
 
What is Data Mining - Olu Campbell
What is Data Mining - Olu CampbellWhat is Data Mining - Olu Campbell
What is Data Mining - Olu Campbell
 
Chapter 24
Chapter 24Chapter 24
Chapter 24
 
An Introduction to Data Mining
An Introduction to Data MiningAn Introduction to Data Mining
An Introduction to Data Mining
 
Concept description characterization and comparison
Concept description characterization and comparisonConcept description characterization and comparison
Concept description characterization and comparison
 
Data mining techniques for malware detection.pptx
Data mining techniques for malware detection.pptxData mining techniques for malware detection.pptx
Data mining techniques for malware detection.pptx
 
Multimedia db system
Multimedia db systemMultimedia db system
Multimedia db system
 
Data mining applications
Data mining applicationsData mining applications
Data mining applications
 
Research in data mining
Research in data miningResearch in data mining
Research in data mining
 
Data mining
Data miningData mining
Data mining
 

Similar to Data mining by_ashok

Similar to Data mining by_ashok (20)

notes_dmdw_chap1.docx
notes_dmdw_chap1.docxnotes_dmdw_chap1.docx
notes_dmdw_chap1.docx
 
datamining.ppt
datamining.pptdatamining.ppt
datamining.ppt
 
datamining.ppt
datamining.pptdatamining.ppt
datamining.ppt
 
datamining management slyabbus and ppt.pptx
datamining management slyabbus and ppt.pptxdatamining management slyabbus and ppt.pptx
datamining management slyabbus and ppt.pptx
 
datamining.ppt
datamining.pptdatamining.ppt
datamining.ppt
 
A Practical Approach To Data Mining Presentation
A Practical Approach To Data Mining PresentationA Practical Approach To Data Mining Presentation
A Practical Approach To Data Mining Presentation
 
Data mining
Data miningData mining
Data mining
 
Data mining
Data mining Data mining
Data mining
 
dataminingppt-170616163835.pdf jejwwkwnwnn
dataminingppt-170616163835.pdf jejwwkwnwnndataminingppt-170616163835.pdf jejwwkwnwnn
dataminingppt-170616163835.pdf jejwwkwnwnn
 
Unit 4 Advanced Data Analytics
Unit 4 Advanced Data AnalyticsUnit 4 Advanced Data Analytics
Unit 4 Advanced Data Analytics
 
Exploring Data Wealth: Data Mining Insights
Exploring Data Wealth: Data Mining InsightsExploring Data Wealth: Data Mining Insights
Exploring Data Wealth: Data Mining Insights
 
Datamining
DataminingDatamining
Datamining
 
Datamining
DataminingDatamining
Datamining
 
Data mining
Data miningData mining
Data mining
 
ETHICAL ISSUES WITH CUSTOMER DATA COLLECTION
ETHICAL ISSUES WITH CUSTOMER DATA COLLECTIONETHICAL ISSUES WITH CUSTOMER DATA COLLECTION
ETHICAL ISSUES WITH CUSTOMER DATA COLLECTION
 
Data Mining Presentation for College Harsh.pptx
Data Mining Presentation for College Harsh.pptxData Mining Presentation for College Harsh.pptx
Data Mining Presentation for College Harsh.pptx
 
Data mining and privacy preserving in data mining
Data mining and privacy preserving in data miningData mining and privacy preserving in data mining
Data mining and privacy preserving in data mining
 
Data mining 1 - Introduction (cheat sheet - printable)
Data mining 1 - Introduction (cheat sheet - printable)Data mining 1 - Introduction (cheat sheet - printable)
Data mining 1 - Introduction (cheat sheet - printable)
 
Cis 500 assignment 4
Cis 500 assignment 4Cis 500 assignment 4
Cis 500 assignment 4
 
Data mining
Data miningData mining
Data mining
 

Data mining by_ashok

  • 1. Data Mining Data mining (the analysis step of the "Knowledge Discovery in Databases" process, or KDD),an interdisciplinary subfield of computer science, is the computational process of discovering patterns in large data sets involving methods at the intersection of artificial intelligence, machine learning, statistics, and database systems
  • 2. What is Data Mining?  Data Mining, also known as Knowledge- Discovery in Databases (KDD), is the process of automatically searching large volumes of data for patterns.  Data Mining applies many older computational techniques from statistics, machine learning and pattern recognition
  • 3. Data mining consists of five major elements:  Extract, transform, and load transaction data onto the data warehouse system.  Store and manage the data in a multidimensional database system.  Provide data access to business analysts and information technology professionals.  Analyze the data by application software.  Present the data in a useful format, such as a graph or table.
  • 4. Data Mining Goal  The ultimate goal of data mining is prediction - and predictive data mining is the most common type of data mining and one that has the most direct business applications.
  • 5. 3 Steps Data Mining Process  Stage 1: Exploration. This stage usually starts with data preparation which may involve cleaning data, data transformations, selecting subsets of records  Stage 2: Model building and validation. This stage involves considering various models and choosing the best one based on their predictive performance  Stage 3: Deployment. That final stage involves using the model selected as best in the previous stage and applying it to new data in order to generate predictions or estimates of the expected outcome
  • 6. Some of the tools used for data mining are:  Artificial neural networks - Non-linear predictive models that learn through training and resemble biological neural networks in structure.  Decision trees - Tree-shaped structures that represent sets of decisions. These decisions generate rules for the classification of a dataset.  Rule induction - The extraction of useful if-then rules from data based on statistical significance.  Genetic algorithms - Optimization techniques based on the concepts of genetic combination, mutation, and natural selection.  Nearest neighbor - A classification technique that classifies each record based on the records most similar to it in an historical database.
  • 7. Reasons for the growing popularity of Data Mining  Growing Data Volume  Limitations of Human Analysis  Low Cost of Machine Learning
  • 8. ADVANTAGES OF DATA MINING  Marking/Retailing: Data mining can aid direct marketers by providing them with useful and accurate trends about their customers’ purchasing behavior.  Banking/Crediting: Data mining can assist financial institutions in areas such as credit reporting and loan information.    
  • 9. ADVANTAGES OF DATA MINING Cont…  Law enforcement: Data mining can aid law enforcers in identifying criminal suspects as well as apprehending these criminals by examining trends in location, crime type, habit, and other patterns of behaviors.  Researchers: Data mining can assist researchers by speeding up their data analyzing process; thus, allowing them more time to work on other projects.   
  • 10. DISADVANTAGES OF DATA MINING  Privacy Issues: For example, according to Washing Post, in 1998, CVS had sold their patient’s prescription purchases to a different company  American Express also sold their customers’ credit card purchases to another company.
  • 11. DISADVANTAGES OF DATA MINING Cont…  Security issues: Although companies have a lot of personal information about us available online, they do not have sufficient security systems in place to protect that information.   Misuse of information: Some of the company will answer your phone based on your purchase history. If you have spent a lot of money or buying a lot of product from one company, your call will be answered really soon. So you should not think that your call is really being answer in the order in which it was receive.