SlideShare a Scribd company logo
1 of 29
BAYESIAN
STATISTICS
PRESENTED TO,
Mrs. URMILA MISHRA
PRESENTED BY,
SAGAR KAMBLE
INTRODUCTION
DATA MINING
A process used by companies to turn raw data into useful
information.
SIGNAL
The WHO has defined ‘signal’ as ‘‘Reported information on a
possible causal relationship between an adverse event and a drug,
the relationship being unknown or incompletely documented.’’
In a perfect signalling system:
a) All signals should be found
b) “False” signals should be ignored
c) Signals should be found early.
Signal detection and analysis
Associations
Signals
Signal
document
Drug safety
data
Combinations
Quantitative
threshold
Clinical
evaluation
Data mining is refereed to as the computer-
assisted procedures, starting from processing of
dataset by data “cleaning” and culminating into
the application of statistical techniques, often
known as data mining algorithms (DMAs).
DMAs can be classified in
Frequentist
Bayesian approach.
BAYESIAN STATISTICS
Bayesian theorem first proposed by English clergyman and mathematician
Thomas Bayes.
Bayes theorem is a branch of logic applied to decision making and
inferential statistics that deals with probability inference and is used to
predict the future events based upon the knowledge of previous events.
Health technology assessment Spigelhalter et al define Bayesian statistics
as:
‘the explicit quantitative use of external evidence in the design,
monitoring,
analysis, interpretation, and reporting of a health technology
assessment.’
oBayesian statistics uses ‘probability’ to
express subjective ‘degree of belief’ in a
specific outcome of interest.
oBased on prior belief, and constantly updated
on addition of new data, Bayesian statistics
provides a formal statistical framework for
updating probabilities and incorporating prior
(even assumed) knowledge.
Benefits of using Bayesian statistics when data mining
a) External information based on for example the goals of the data
mining project can be incorporated into the statistical model.
b) The use of a conservative prior allows a dampening at low
counter
values, stopping inappropriate fluctuations at low counter values
and
thus stopping unwanted spurious associations or patterns.
c) Flexibility of choice of prior, so an alternative attitude to external
information can be taken.
d) Formal approach for looking at how results differ with different
priors, including sensitivity testing of results.
e) Repeated testing is fundamental to the approach, rather than an
additional issue to be considered in the analysis.
f) Method can readily handle missing data, can make calculations
when
no data, and simply have more uncertainty (wider credibility
intervals) around precision estimate.
g) Credibility intervals easier to interpret than conventional
confidence
intervals.
h) A probability distribution with the addition of data leads to a new
probability distribution, thus emphasising the immediacy of any
conclusion, as magnitude and certainty of the change is obvious.
i) Tractable to calculate estimate of parameter even without data
(this
is the prior distribution), some classical estimates have problems
before data is collected.
Applications
1) Clinical trial Analysis
2) Causality assessment
 Supervised pattern recognition
 Classification tasks
 Cluster analysis
 Neural networks
3) Testing of new drugs
4) Astrophysics
5) Lawsuits and public policy decision making
Neural Networks
The term 'artificial neural network' or simply 'neural network’ is used to
describe a wide range of different computational architectures.
These architectures are used for a diverse range of tasks, including
Prediction
Classification of data sets
And data mining.
Neural networks are good for problems where many complex,
undefined constraints exist with possibly varying relationships between
them.
Two main tracks of neural network based research have been pursued:
1. To further develop the neural network so it more and more effectively
mimics the behaviour of the human brain
2. The further development of original neural network based techniques
in whatever way necessary to optimise their use in specific data
analysis tasks.
A typical neural network is made up of many
simple interconnected units ("nodes"). Such units
can be either
Input nodes (representing input variables)
Output nodes (representing network output), or
In some cases hidden nodes.
Communication channels ("connections" or
"weights"), which carry numerical data, link the
nodes to each other, so that each node only
operates on local data and on inputs received via
connections from other nodes.
Neural Network models include:
Kohonen
Back propagation
Recurrent back propagation
Adaptive resonance theory (ART) networks
Radial basis function (rbf) networks
Combination of ART and rbf
Probabilistic neural networks and other types.
There are different types of neural network
architecture: as data can flow from input to output
nodes in different ways.
Feed forward models are where data flows directly
from input to output.
In contrast recurrent networks model information
about past inputs which is fed back in alongside
current data via recurrent or feedback connections
to provide an answer.
•This can be done by feeding the outputs directly
back into the input layer, or it could also be done
by feedback from the hidden layers to the input
layer. Computation continues until the network
stabilizes, when the output values can be read.
Neural networks are most commonly used in the following
types of task:
a) Classification tasks, where cases need to be classified
together into predefined classes, the definitions of the classes
of which are not precise, or secondly into classes which are
created from the data itself. The aim is to distinguish between
items; finding whether a
specified pattern or types of cases exist in the data set.
b) Modelling outcome prediction, where the probability of
specific
outcomes for specific cases are predicted based on the input
data for that case, after learning from a test set of data.
c) Data mining/clustering where one searches for
unexpected patterns of variables or variable values in the
data. This may be either supervised pattern recognition
when one searches the data to see whether it contains a
specific defined pattern, or unsupervised pattern
recognition where previously unsuspected patterns are
searched for in the data, that is relating similar cases or
patterns in the data.
d) Time series forecasting prediction of future outcomes
BAYESIAN CONFIDENCE PROPAGATION
NEURAL NETWORK
Initial testing of the BCPNN on the WHO database
began in 1995.
BCPNN is a semi-automated knowledge-detection
technique that helps to find new drug-ADR associations
likely to be potential signals by quantitative data filtration
in Vigibase.
The BCPNN was used as a hypothesis generating tool
in the field of pharmacovigilance.
IMPORTANCE OF BCPNN
BCPNN methodology can manage large databases
It is robust in handling very incomplete data
To find patterns of information within datasets with
complex dependencies
It can identify the drug-ADR combinations occurring
with high frequency as against the background
frequency, unexpected patterns and variations in
patterns overtime
It can be useful in
• detecting new syndromes
• Age group affected
• Population groups at high risk
• Dose relations
It also provides information about variations in ADR
patterns among different countries
It is highly objective, picks up the associations early
and also indicates the strength of associations based
on IC value.
INFORMATION COMPONENT(IC)
The measure of disproportionality calculated by BCPNN.
The IC value indicates the strength of quantitative dependency between a
drug and ADR.
•A positive value is indicative of higher occurrence of drug-ADR
association than expected.
•Negative value indicates that the association is observed less frequently
than expected statistically
•Zero indicates that there is no quantitative dependency between drug
and ADR.
The IC value is based on:
The total number of case reports with drug
The total number of case reports with
adverse reaction
The number of reports with the specific
drug-ADR combination
The total number of reports in the database
No. Reports Target
AE(j)
Other AE Total
Target
Drug(i)
a b nTD
Other Drug C d nOD
Total nTA nOA n(ij)
2-dimensional projection of an
SRS database.
Both use ratio nij / Eij where
nij = no. of reports mentioning both drug i & event
j
Eij = expected no. of reports of drug i & event j
Both report features of posterior dist’n of
‘information component’
ICij = log2 nij / Eij
IC=log2 a(a+b+c+d)/(a+c)(a+b)
No. Reports Nephropathy Other AE Total
Target Drug X 20 100 120
Other Drug Y 100 980 1080
Total 120 1080 1200
For Eg
BCPNN approach advantages
Bayesian approaches provide a natural method of
combining new data with this previous knowledge defined
from external information.
Has no problem with repeated trials which can make
calculation of appropriate confidence intervals
Bayesian precision estimates are easy to Comprehend
The use of a prior distribution gives flexibility
Bayesian methods handle missing data intuitively
REFERENCES
Bate, a. (2003). The use of a bayesian confidence propagation neural network in
pharmacovigilance.
fda_ind_larryg. (n.d.).
Grieve, A. (n.d.). Applied Bayesian Approaches in Safety and Pharmacovigilance Professor
Andy Grieve Department of Public Health Sciences King ‟ s College , London Safety is an
issue in all phases.
Hauben, M., Madigan, D., Gerrits, C. M., Walsh, L., & Puijenbroek, E. P. Van. (2005). The
role of data mining in pharmacovigilance rig y o t h s f e hl y b u lic a ib h t s u b s i n i t n i n o
i t ite d rig y p o l l i d Lt t i b d, 929–948.
Liu, M., Matheny, M., Hu, Y., & Xu, H. (2012). Data mining methodologies for
pharmacovigilance. ACM SIGKDD Explorations Newsletter, 14(1), 35–42.
doi:10.1145/2408736.2408742
Madigan, D., Ryan, P., Simpson, S., & Zorych, I. (2011). Bayesian methods in
pharmacovigilance. Bayesian Statistics. Retrieved from
http://www.stat.columbia.edu/~madigan/PAPERS/DrugSafety2.pdf
Paper, C. W. (n.d.). Enhanced Signal Detection and Management Enables More Effective
Pharmacovigilance.
Poluzzi, E., Raschi, E., Piccinni, C., & De Ponti, F. (2012). Data mining techniques in
pharmacovigilance: analysis of the publicly accessible FDA adverse event reporting system
(AERS). Data Mining Applications in Engineering and Medicine. Croatia: InTech, 267–301.
doi:10.5772/50095
presentation_Haijun5-15-2009-19-4-50. (n.d.).
QUESTIONS
Bayesian statistics

More Related Content

What's hot

Applying Hidden Markov Models to Bioinformatics
Applying Hidden Markov Models to BioinformaticsApplying Hidden Markov Models to Bioinformatics
Applying Hidden Markov Models to Bioinformatics
butest
 

What's hot (20)

Principal Component Analysis (PCA) and LDA PPT Slides
Principal Component Analysis (PCA) and LDA PPT SlidesPrincipal Component Analysis (PCA) and LDA PPT Slides
Principal Component Analysis (PCA) and LDA PPT Slides
 
Principal Component Analysis
Principal Component AnalysisPrincipal Component Analysis
Principal Component Analysis
 
Mixed models
Mixed modelsMixed models
Mixed models
 
Applying Hidden Markov Models to Bioinformatics
Applying Hidden Markov Models to BioinformaticsApplying Hidden Markov Models to Bioinformatics
Applying Hidden Markov Models to Bioinformatics
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
 
FASTA
FASTAFASTA
FASTA
 
Belief Networks & Bayesian Classification
Belief Networks & Bayesian ClassificationBelief Networks & Bayesian Classification
Belief Networks & Bayesian Classification
 
Regression
RegressionRegression
Regression
 
Multiple regression
Multiple regressionMultiple regression
Multiple regression
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Simple Regression
Simple RegressionSimple Regression
Simple Regression
 
Cluster analysis
Cluster analysisCluster analysis
Cluster analysis
 
Bayesian intro
Bayesian introBayesian intro
Bayesian intro
 
Pca analysis
Pca analysisPca analysis
Pca analysis
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
 
Regression
RegressionRegression
Regression
 
PCA (Principal component analysis)
PCA (Principal component analysis)PCA (Principal component analysis)
PCA (Principal component analysis)
 
Bayesian Networks - A Brief Introduction
Bayesian Networks - A Brief IntroductionBayesian Networks - A Brief Introduction
Bayesian Networks - A Brief Introduction
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
 
The jackknife and bootstrap
The jackknife and bootstrapThe jackknife and bootstrap
The jackknife and bootstrap
 

Viewers also liked

Cost-Aware Virtual Machine Placement across Distributed Data Centers using Ba...
Cost-Aware Virtual Machine Placement acrossDistributed Data Centers using Ba...Cost-Aware Virtual Machine Placement acrossDistributed Data Centers using Ba...
Cost-Aware Virtual Machine Placement across Distributed Data Centers using Ba...
Soodeh Farokhi
 
construction risk factor analysis: BBN Network
construction risk factor analysis: BBN Networkconstruction risk factor analysis: BBN Network
construction risk factor analysis: BBN Network
Shaswati Mohapatra
 
Building Process Improvement Business Cases Using Bayesian Belief Networks an...
Building Process Improvement Business Cases Using Bayesian Belief Networks an...Building Process Improvement Business Cases Using Bayesian Belief Networks an...
Building Process Improvement Business Cases Using Bayesian Belief Networks an...
Ben Linders
 

Viewers also liked (10)

Dynamic Bayesian modeling for risk prediction in credit operations (SCAI2015)
Dynamic Bayesian modeling for risk prediction in credit operations (SCAI2015)Dynamic Bayesian modeling for risk prediction in credit operations (SCAI2015)
Dynamic Bayesian modeling for risk prediction in credit operations (SCAI2015)
 
Module 5 Bayesian belief network modelling
Module 5 Bayesian belief network modellingModule 5 Bayesian belief network modelling
Module 5 Bayesian belief network modelling
 
Cost-Aware Virtual Machine Placement across Distributed Data Centers using Ba...
Cost-Aware Virtual Machine Placement acrossDistributed Data Centers using Ba...Cost-Aware Virtual Machine Placement acrossDistributed Data Centers using Ba...
Cost-Aware Virtual Machine Placement across Distributed Data Centers using Ba...
 
Probabilistic modeling in deep learning
Probabilistic modeling in deep learningProbabilistic modeling in deep learning
Probabilistic modeling in deep learning
 
Bayes Belief Network
Bayes Belief NetworkBayes Belief Network
Bayes Belief Network
 
construction risk factor analysis: BBN Network
construction risk factor analysis: BBN Networkconstruction risk factor analysis: BBN Network
construction risk factor analysis: BBN Network
 
Bayesian Network 을 활용한 예측 분석
Bayesian Network 을 활용한 예측 분석Bayesian Network 을 활용한 예측 분석
Bayesian Network 을 활용한 예측 분석
 
The Perfume Project - Part 1
The Perfume Project - Part 1The Perfume Project - Part 1
The Perfume Project - Part 1
 
Building Process Improvement Business Cases Using Bayesian Belief Networks an...
Building Process Improvement Business Cases Using Bayesian Belief Networks an...Building Process Improvement Business Cases Using Bayesian Belief Networks an...
Building Process Improvement Business Cases Using Bayesian Belief Networks an...
 
Controlling Project Performance using PDM - PSQT2005 - Ben Linders
Controlling Project Performance using PDM - PSQT2005 - Ben LindersControlling Project Performance using PDM - PSQT2005 - Ben Linders
Controlling Project Performance using PDM - PSQT2005 - Ben Linders
 

Similar to Bayesian statistics

Prediction of Neurological Disorder using Classification Approach
Prediction of Neurological Disorder using Classification ApproachPrediction of Neurological Disorder using Classification Approach
Prediction of Neurological Disorder using Classification Approach
BRNSSPublicationHubI
 
Prognosis of Cardiac Disease using Data Mining Techniques A Comprehensive Survey
Prognosis of Cardiac Disease using Data Mining Techniques A Comprehensive SurveyPrognosis of Cardiac Disease using Data Mining Techniques A Comprehensive Survey
Prognosis of Cardiac Disease using Data Mining Techniques A Comprehensive Survey
ijtsrd
 
Paper id 212014112
Paper id 212014112Paper id 212014112
Paper id 212014112
IJRAT
 
OPTIMIZED PREDICTION IN MEDICAL DIAGNOSIS USING DNA SEQUENCES AND STRUCTURE I...
OPTIMIZED PREDICTION IN MEDICAL DIAGNOSIS USING DNA SEQUENCES AND STRUCTURE I...OPTIMIZED PREDICTION IN MEDICAL DIAGNOSIS USING DNA SEQUENCES AND STRUCTURE I...
OPTIMIZED PREDICTION IN MEDICAL DIAGNOSIS USING DNA SEQUENCES AND STRUCTURE I...
IAEME Publication
 
Ijarcet vol-2-issue-4-1393-1397
Ijarcet vol-2-issue-4-1393-1397Ijarcet vol-2-issue-4-1393-1397
Ijarcet vol-2-issue-4-1393-1397
Editor IJARCET
 

Similar to Bayesian statistics (20)

Application of Hybrid Genetic Algorithm Using Artificial Neural Network in Da...
Application of Hybrid Genetic Algorithm Using Artificial Neural Network in Da...Application of Hybrid Genetic Algorithm Using Artificial Neural Network in Da...
Application of Hybrid Genetic Algorithm Using Artificial Neural Network in Da...
 
C0344023028
C0344023028C0344023028
C0344023028
 
IRJET- Disease Prediction using Machine Learning
IRJET-  Disease Prediction using Machine LearningIRJET-  Disease Prediction using Machine Learning
IRJET- Disease Prediction using Machine Learning
 
HEALTH PREDICTION ANALYSIS USING DATA MINING
HEALTH PREDICTION ANALYSIS USING DATA  MININGHEALTH PREDICTION ANALYSIS USING DATA  MINING
HEALTH PREDICTION ANALYSIS USING DATA MINING
 
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSISAN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
 
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSISAN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
 
Byasian nural network BCPNN
Byasian nural network BCPNNByasian nural network BCPNN
Byasian nural network BCPNN
 
ARTIFICIAL NEURAL NETWORKING in Dentistry
ARTIFICIAL NEURAL NETWORKING in DentistryARTIFICIAL NEURAL NETWORKING in Dentistry
ARTIFICIAL NEURAL NETWORKING in Dentistry
 
An Efficient Approach for Outlier Detection in Wireless Sensor Network
An Efficient Approach for Outlier Detection in Wireless Sensor NetworkAn Efficient Approach for Outlier Detection in Wireless Sensor Network
An Efficient Approach for Outlier Detection in Wireless Sensor Network
 
Prediction of Neurological Disorder using Classification Approach
Prediction of Neurological Disorder using Classification ApproachPrediction of Neurological Disorder using Classification Approach
Prediction of Neurological Disorder using Classification Approach
 
Prognosis of Cardiac Disease using Data Mining Techniques A Comprehensive Survey
Prognosis of Cardiac Disease using Data Mining Techniques A Comprehensive SurveyPrognosis of Cardiac Disease using Data Mining Techniques A Comprehensive Survey
Prognosis of Cardiac Disease using Data Mining Techniques A Comprehensive Survey
 
Intelligent Heart Attack Prediction System Using Big Data
Intelligent Heart Attack Prediction System Using Big DataIntelligent Heart Attack Prediction System Using Big Data
Intelligent Heart Attack Prediction System Using Big Data
 
Paper id 212014112
Paper id 212014112Paper id 212014112
Paper id 212014112
 
OPTIMIZED PREDICTION IN MEDICAL DIAGNOSIS USING DNA SEQUENCES AND STRUCTURE I...
OPTIMIZED PREDICTION IN MEDICAL DIAGNOSIS USING DNA SEQUENCES AND STRUCTURE I...OPTIMIZED PREDICTION IN MEDICAL DIAGNOSIS USING DNA SEQUENCES AND STRUCTURE I...
OPTIMIZED PREDICTION IN MEDICAL DIAGNOSIS USING DNA SEQUENCES AND STRUCTURE I...
 
Ijarcet vol-2-issue-4-1393-1397
Ijarcet vol-2-issue-4-1393-1397Ijarcet vol-2-issue-4-1393-1397
Ijarcet vol-2-issue-4-1393-1397
 
Intelligent data analysis for medicinal diagnosis
Intelligent data analysis for medicinal diagnosisIntelligent data analysis for medicinal diagnosis
Intelligent data analysis for medicinal diagnosis
 
G124549
G124549G124549
G124549
 
PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...
PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...
PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...
 
Classification of Breast Cancer Diseases using Data Mining Techniques
Classification of Breast Cancer Diseases using Data Mining TechniquesClassification of Breast Cancer Diseases using Data Mining Techniques
Classification of Breast Cancer Diseases using Data Mining Techniques
 
Neural networks, naïve bayes and decision tree machine learning
Neural networks, naïve bayes and decision tree machine learningNeural networks, naïve bayes and decision tree machine learning
Neural networks, naïve bayes and decision tree machine learning
 

Recently uploaded

Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls * UPA...
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls  * UPA...Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls  * UPA...
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls * UPA...
mahaiklolahd
 

Recently uploaded (20)

8980367676 Call Girls In Ahmedabad Escort Service Available 24×7 In Ahmedabad
8980367676 Call Girls In Ahmedabad Escort Service Available 24×7 In Ahmedabad8980367676 Call Girls In Ahmedabad Escort Service Available 24×7 In Ahmedabad
8980367676 Call Girls In Ahmedabad Escort Service Available 24×7 In Ahmedabad
 
Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...
Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...
Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...
 
Top Rated Bangalore Call Girls Richmond Circle ⟟ 9332606886 ⟟ Call Me For Ge...
Top Rated Bangalore Call Girls Richmond Circle ⟟  9332606886 ⟟ Call Me For Ge...Top Rated Bangalore Call Girls Richmond Circle ⟟  9332606886 ⟟ Call Me For Ge...
Top Rated Bangalore Call Girls Richmond Circle ⟟ 9332606886 ⟟ Call Me For Ge...
 
Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...
Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...
Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...
 
Call Girls Visakhapatnam Just Call 8250077686 Top Class Call Girl Service Ava...
Call Girls Visakhapatnam Just Call 8250077686 Top Class Call Girl Service Ava...Call Girls Visakhapatnam Just Call 8250077686 Top Class Call Girl Service Ava...
Call Girls Visakhapatnam Just Call 8250077686 Top Class Call Girl Service Ava...
 
Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...
Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...
Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...
 
Jogeshwari ! Call Girls Service Mumbai - 450+ Call Girl Cash Payment 90042684...
Jogeshwari ! Call Girls Service Mumbai - 450+ Call Girl Cash Payment 90042684...Jogeshwari ! Call Girls Service Mumbai - 450+ Call Girl Cash Payment 90042684...
Jogeshwari ! Call Girls Service Mumbai - 450+ Call Girl Cash Payment 90042684...
 
💕SONAM KUMAR💕Premium Call Girls Jaipur ↘️9257276172 ↙️One Night Stand With Lo...
💕SONAM KUMAR💕Premium Call Girls Jaipur ↘️9257276172 ↙️One Night Stand With Lo...💕SONAM KUMAR💕Premium Call Girls Jaipur ↘️9257276172 ↙️One Night Stand With Lo...
💕SONAM KUMAR💕Premium Call Girls Jaipur ↘️9257276172 ↙️One Night Stand With Lo...
 
Call Girls Vadodara Just Call 8617370543 Top Class Call Girl Service Available
Call Girls Vadodara Just Call 8617370543 Top Class Call Girl Service AvailableCall Girls Vadodara Just Call 8617370543 Top Class Call Girl Service Available
Call Girls Vadodara Just Call 8617370543 Top Class Call Girl Service Available
 
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls * UPA...
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls  * UPA...Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls  * UPA...
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls * UPA...
 
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
 
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
 
Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426
Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426
Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426
 
Top Rated Hyderabad Call Girls Erragadda ⟟ 9332606886 ⟟ Call Me For Genuine ...
Top Rated  Hyderabad Call Girls Erragadda ⟟ 9332606886 ⟟ Call Me For Genuine ...Top Rated  Hyderabad Call Girls Erragadda ⟟ 9332606886 ⟟ Call Me For Genuine ...
Top Rated Hyderabad Call Girls Erragadda ⟟ 9332606886 ⟟ Call Me For Genuine ...
 
Russian Call Girls Service Jaipur {8445551418} ❤️PALLAVI VIP Jaipur Call Gir...
Russian Call Girls Service  Jaipur {8445551418} ❤️PALLAVI VIP Jaipur Call Gir...Russian Call Girls Service  Jaipur {8445551418} ❤️PALLAVI VIP Jaipur Call Gir...
Russian Call Girls Service Jaipur {8445551418} ❤️PALLAVI VIP Jaipur Call Gir...
 
Top Rated Bangalore Call Girls Ramamurthy Nagar ⟟ 9332606886 ⟟ Call Me For G...
Top Rated Bangalore Call Girls Ramamurthy Nagar ⟟  9332606886 ⟟ Call Me For G...Top Rated Bangalore Call Girls Ramamurthy Nagar ⟟  9332606886 ⟟ Call Me For G...
Top Rated Bangalore Call Girls Ramamurthy Nagar ⟟ 9332606886 ⟟ Call Me For G...
 
Premium Call Girls In Jaipur {8445551418} ❤️VVIP SEEMA Call Girl in Jaipur Ra...
Premium Call Girls In Jaipur {8445551418} ❤️VVIP SEEMA Call Girl in Jaipur Ra...Premium Call Girls In Jaipur {8445551418} ❤️VVIP SEEMA Call Girl in Jaipur Ra...
Premium Call Girls In Jaipur {8445551418} ❤️VVIP SEEMA Call Girl in Jaipur Ra...
 
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 9332606886 𖠋 Will You Mis...
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 9332606886 𖠋 Will You Mis...The Most Attractive Hyderabad Call Girls Kothapet 𖠋 9332606886 𖠋 Will You Mis...
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 9332606886 𖠋 Will You Mis...
 
Most Beautiful Call Girl in Bangalore Contact on Whatsapp
Most Beautiful Call Girl in Bangalore Contact on WhatsappMost Beautiful Call Girl in Bangalore Contact on Whatsapp
Most Beautiful Call Girl in Bangalore Contact on Whatsapp
 
Call Girls Ahmedabad Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Ahmedabad Just Call 9630942363 Top Class Call Girl Service AvailableCall Girls Ahmedabad Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Ahmedabad Just Call 9630942363 Top Class Call Girl Service Available
 

Bayesian statistics

  • 1. BAYESIAN STATISTICS PRESENTED TO, Mrs. URMILA MISHRA PRESENTED BY, SAGAR KAMBLE
  • 2. INTRODUCTION DATA MINING A process used by companies to turn raw data into useful information. SIGNAL The WHO has defined ‘signal’ as ‘‘Reported information on a possible causal relationship between an adverse event and a drug, the relationship being unknown or incompletely documented.’’ In a perfect signalling system: a) All signals should be found b) “False” signals should be ignored c) Signals should be found early.
  • 3. Signal detection and analysis Associations Signals Signal document Drug safety data Combinations Quantitative threshold Clinical evaluation
  • 4. Data mining is refereed to as the computer- assisted procedures, starting from processing of dataset by data “cleaning” and culminating into the application of statistical techniques, often known as data mining algorithms (DMAs). DMAs can be classified in Frequentist Bayesian approach.
  • 5.
  • 6. BAYESIAN STATISTICS Bayesian theorem first proposed by English clergyman and mathematician Thomas Bayes. Bayes theorem is a branch of logic applied to decision making and inferential statistics that deals with probability inference and is used to predict the future events based upon the knowledge of previous events. Health technology assessment Spigelhalter et al define Bayesian statistics as: ‘the explicit quantitative use of external evidence in the design, monitoring, analysis, interpretation, and reporting of a health technology assessment.’
  • 7. oBayesian statistics uses ‘probability’ to express subjective ‘degree of belief’ in a specific outcome of interest. oBased on prior belief, and constantly updated on addition of new data, Bayesian statistics provides a formal statistical framework for updating probabilities and incorporating prior (even assumed) knowledge.
  • 8. Benefits of using Bayesian statistics when data mining a) External information based on for example the goals of the data mining project can be incorporated into the statistical model. b) The use of a conservative prior allows a dampening at low counter values, stopping inappropriate fluctuations at low counter values and thus stopping unwanted spurious associations or patterns. c) Flexibility of choice of prior, so an alternative attitude to external information can be taken. d) Formal approach for looking at how results differ with different priors, including sensitivity testing of results. e) Repeated testing is fundamental to the approach, rather than an additional issue to be considered in the analysis.
  • 9. f) Method can readily handle missing data, can make calculations when no data, and simply have more uncertainty (wider credibility intervals) around precision estimate. g) Credibility intervals easier to interpret than conventional confidence intervals. h) A probability distribution with the addition of data leads to a new probability distribution, thus emphasising the immediacy of any conclusion, as magnitude and certainty of the change is obvious. i) Tractable to calculate estimate of parameter even without data (this is the prior distribution), some classical estimates have problems before data is collected.
  • 10. Applications 1) Clinical trial Analysis 2) Causality assessment  Supervised pattern recognition  Classification tasks  Cluster analysis  Neural networks 3) Testing of new drugs 4) Astrophysics 5) Lawsuits and public policy decision making
  • 11. Neural Networks The term 'artificial neural network' or simply 'neural network’ is used to describe a wide range of different computational architectures. These architectures are used for a diverse range of tasks, including Prediction Classification of data sets And data mining. Neural networks are good for problems where many complex, undefined constraints exist with possibly varying relationships between them. Two main tracks of neural network based research have been pursued: 1. To further develop the neural network so it more and more effectively mimics the behaviour of the human brain 2. The further development of original neural network based techniques in whatever way necessary to optimise their use in specific data analysis tasks.
  • 12. A typical neural network is made up of many simple interconnected units ("nodes"). Such units can be either Input nodes (representing input variables) Output nodes (representing network output), or In some cases hidden nodes. Communication channels ("connections" or "weights"), which carry numerical data, link the nodes to each other, so that each node only operates on local data and on inputs received via connections from other nodes.
  • 13. Neural Network models include: Kohonen Back propagation Recurrent back propagation Adaptive resonance theory (ART) networks Radial basis function (rbf) networks Combination of ART and rbf Probabilistic neural networks and other types.
  • 14. There are different types of neural network architecture: as data can flow from input to output nodes in different ways. Feed forward models are where data flows directly from input to output. In contrast recurrent networks model information about past inputs which is fed back in alongside current data via recurrent or feedback connections to provide an answer. •This can be done by feeding the outputs directly back into the input layer, or it could also be done by feedback from the hidden layers to the input layer. Computation continues until the network stabilizes, when the output values can be read.
  • 15. Neural networks are most commonly used in the following types of task: a) Classification tasks, where cases need to be classified together into predefined classes, the definitions of the classes of which are not precise, or secondly into classes which are created from the data itself. The aim is to distinguish between items; finding whether a specified pattern or types of cases exist in the data set. b) Modelling outcome prediction, where the probability of specific outcomes for specific cases are predicted based on the input data for that case, after learning from a test set of data.
  • 16. c) Data mining/clustering where one searches for unexpected patterns of variables or variable values in the data. This may be either supervised pattern recognition when one searches the data to see whether it contains a specific defined pattern, or unsupervised pattern recognition where previously unsuspected patterns are searched for in the data, that is relating similar cases or patterns in the data. d) Time series forecasting prediction of future outcomes
  • 17. BAYESIAN CONFIDENCE PROPAGATION NEURAL NETWORK Initial testing of the BCPNN on the WHO database began in 1995. BCPNN is a semi-automated knowledge-detection technique that helps to find new drug-ADR associations likely to be potential signals by quantitative data filtration in Vigibase. The BCPNN was used as a hypothesis generating tool in the field of pharmacovigilance.
  • 18. IMPORTANCE OF BCPNN BCPNN methodology can manage large databases It is robust in handling very incomplete data To find patterns of information within datasets with complex dependencies It can identify the drug-ADR combinations occurring with high frequency as against the background frequency, unexpected patterns and variations in patterns overtime
  • 19. It can be useful in • detecting new syndromes • Age group affected • Population groups at high risk • Dose relations It also provides information about variations in ADR patterns among different countries It is highly objective, picks up the associations early and also indicates the strength of associations based on IC value.
  • 20. INFORMATION COMPONENT(IC) The measure of disproportionality calculated by BCPNN. The IC value indicates the strength of quantitative dependency between a drug and ADR. •A positive value is indicative of higher occurrence of drug-ADR association than expected. •Negative value indicates that the association is observed less frequently than expected statistically •Zero indicates that there is no quantitative dependency between drug and ADR.
  • 21. The IC value is based on: The total number of case reports with drug The total number of case reports with adverse reaction The number of reports with the specific drug-ADR combination The total number of reports in the database
  • 22. No. Reports Target AE(j) Other AE Total Target Drug(i) a b nTD Other Drug C d nOD Total nTA nOA n(ij) 2-dimensional projection of an SRS database.
  • 23. Both use ratio nij / Eij where nij = no. of reports mentioning both drug i & event j Eij = expected no. of reports of drug i & event j Both report features of posterior dist’n of ‘information component’ ICij = log2 nij / Eij IC=log2 a(a+b+c+d)/(a+c)(a+b)
  • 24. No. Reports Nephropathy Other AE Total Target Drug X 20 100 120 Other Drug Y 100 980 1080 Total 120 1080 1200 For Eg
  • 25. BCPNN approach advantages Bayesian approaches provide a natural method of combining new data with this previous knowledge defined from external information. Has no problem with repeated trials which can make calculation of appropriate confidence intervals Bayesian precision estimates are easy to Comprehend The use of a prior distribution gives flexibility Bayesian methods handle missing data intuitively
  • 26.
  • 27. REFERENCES Bate, a. (2003). The use of a bayesian confidence propagation neural network in pharmacovigilance. fda_ind_larryg. (n.d.). Grieve, A. (n.d.). Applied Bayesian Approaches in Safety and Pharmacovigilance Professor Andy Grieve Department of Public Health Sciences King ‟ s College , London Safety is an issue in all phases. Hauben, M., Madigan, D., Gerrits, C. M., Walsh, L., & Puijenbroek, E. P. Van. (2005). The role of data mining in pharmacovigilance rig y o t h s f e hl y b u lic a ib h t s u b s i n i t n i n o i t ite d rig y p o l l i d Lt t i b d, 929–948. Liu, M., Matheny, M., Hu, Y., & Xu, H. (2012). Data mining methodologies for pharmacovigilance. ACM SIGKDD Explorations Newsletter, 14(1), 35–42. doi:10.1145/2408736.2408742 Madigan, D., Ryan, P., Simpson, S., & Zorych, I. (2011). Bayesian methods in pharmacovigilance. Bayesian Statistics. Retrieved from http://www.stat.columbia.edu/~madigan/PAPERS/DrugSafety2.pdf Paper, C. W. (n.d.). Enhanced Signal Detection and Management Enables More Effective Pharmacovigilance. Poluzzi, E., Raschi, E., Piccinni, C., & De Ponti, F. (2012). Data mining techniques in pharmacovigilance: analysis of the publicly accessible FDA adverse event reporting system (AERS). Data Mining Applications in Engineering and Medicine. Croatia: InTech, 267–301. doi:10.5772/50095 presentation_Haijun5-15-2009-19-4-50. (n.d.).