SlideShare a Scribd company logo
1 of 23
Download to read offline
Data	Preparation	and	
Descriptive	Statistics	in	
SystemML
1
Outline
• Data	pre-processing	and	transformation
• Training/Testing/Cross	Validation
• Descriptive	statistics
I. Univariate	statistics
II. Bivariate	statistics
III. Stratified	statistics
2
Input	Data	Format
3
Input	data		
§ Rows:	data	points	(aka	records)
§ Columns:	features	(aka	variables,	attributes)	
Feature	types:
§ Scale (aka	continuous),	 e.g.,	‘Height’,	‘Weight’,	 ‘Salary’,	‘Temperature’
§ Categorical (aka	discrete)
§ Nominal – no	natural	ranking,		e.g.,	‘Gender’,	‘Region’,	‘Hair	color’
§ Ordinal – natural	ranking,	e.g.,	‘Level	of	Satisfaction’	
Example:	
The	house	data	set
Data	Pre-Processing
Tabular	input	data	needs	to	be	transformed	into	a	matrix	– transform()	built-in	function
Categorical	features	need	special	treatment:
§ Recoding:	mapping	distinct	categories	into	consecutive	numbers	starting	from	1
§ Dummycoding (aka	one-hot-encoding,	 one-of-K	encoding)
Example:	
recoding dummycoding
4
Zipcode
96334
95123
95141
96334
Zipcode
1
2
3
1
direction
east
west
north
south
dir_east dir_west dir_north dir_south
1 0 0 0
0 1 0 0
0 0 1 0
0 0 0 1
transform() Built-in	Function
transform() built-in	function	 supports:
§ Omitting	missing	values
§ Missing	value	imputation by	global_mean (scale	features),	global_mode (categorical	
features),	or constant (scale/categorical	features)
§ Binning (equi-width)
§ Scaling (scale	features):	mean-subtraction,	z-score
§ Recoding
§ Dummycoding
5
Transform	Specification
§ Transformations	operate	on	individual	columns
§ All	required	transformations	specified	in	a	JSON	file
§ Property	na.strings in	the	mtd file	specifies	missing	values
Example:
data.spec.json data.csv.mtd
6
{
"data_type": "frame",
"format": "csv",
"sep": ",",
"header": true,
"na.strings": [ "NA", "" ]
}
{
“ids": true
, "omit": [ 1, 4, 5, 6, 7, 8, 9 ]
, "impute":
[ { “id": 2, "method": "constant",
"value": "south" }
,{ “id": 3, "method":
"global_mean" }
]
,"recode": [ 1, 2, 4, 5, 6, 7 ]
,"bin":
[ { “id": 8, "method": "equi-
width", "numbins": 3 } ]
,"dummycode": [ 2, 5, 6, 7, 8, 3 ]
}
Combinations	of	Transformations
7
Signature	of	transform()
§ Invocation	1:
§ Resulting	metadata:	#	distinct	values	in	categorical	columns,	 list	of	distinct	values	with	their	
recoded	IDs,	number	of	bins,	bin	width,	etc.	
§ An	existing	transformation	can	be	applied	to	new	data	using	the	metadata	generated	in	an	
earlier	invocation
§ Invocation	2:
8
output = transform (target = input,
spec = specification,
transformPath = "/path/to/metadata“);
output = transform (target = input,
transformPath = "/path/to/new_metadata“
applyTransformPath = "/path/to/metadata“);
Outline
• Data	pre-processing	and	transformation
• Training/Testing/Cross	Validation
• Descriptive	statistics
I. Univariate	statistics
II. Bivariate	statistics
III. Stratified	statistics
9
Training/Testing
§ Pre-processing	training	and	testing	data	sets
§ Splitting	data	points	and	labels	– splitXY.dml and	splitXY-dummy.dml (hands-on)
§ Sampling	data	points	– sample.dml (hands-on)
§ Cross	Validation	– cv-linreg.dml (hands-on)
10
Pre-Processing	Training	and	
Testing	Data
Training	phase	
Testing	phase
11
Train = read ("/user/ml/trainset.csv");
Spec = read("/user/ml/tf.spec.json“, data_type = "scalar",
value_type = "String");
trainD = transform (target = Train,
transformSpec = Spec,
transformPath = "/user/ml/train_tf_metadata");
# Build a predictive model using trainD
...
Test = read ("/user/ml/testset.csv");
testD = transform (target = Test,
transformPath = "/user/ml/test_tf_metadata",
applyTransformPath = "/user/ml/train_tf_metdata");
# Test the model using testD
...
Cross	Validation
K-fold	Cross	Validation:
1. Shuffle	the	data	points	
2. Divide	the	data	points	into	𝑘 folds	of	(roughly)	
the	same	size
3. For	𝑖 = 1, … , 𝑘:	
• Train	each	model	on	all	the	data	points	that		
do	not	belong	to	fold	𝑖
• Test	each	model	on	all	the	examples	in	fold	𝑖
and	compute	the	test	error
4. Select	the	model	with	the	minimum	average	test	
over	all	𝑘 folds
5. (Train	the	winning	model	on	all	the	data	points)	
12
Testing Training
Example:	𝑘 = 5
Outline
• Data	pre-processing	and	transformation
• Training/Testing/Cross	Validation
• Descriptive	statistics
I. Univariate	statistics
II. Bivariate	statistics
III. Stratified	statistics
13
Univariate	Statistics
14
Row Name of	Statistic Scale Category
1 Minimum +
2 Maximum +
3 Range +
4 Mean +
5 Variance +
6 Standard	deviation +
7 Standard error	of	mean +
8 Coefficient	of	variation +
9 Skewness +
10 Kurtosis +
11 Standard	error	of	skewness +
12 Standard	error	of	Kurtosis +
13 Median +
14 Intequartilemean +
15 Number	of	categories +
16 Mode +
17 Number	of	modes +
Central	tendency	measures
Dispersion	measures
Shape	measures
Categorical	measures
Bivariate	Statistics
Quantitative	association	between	pairs	of	features
I. Scale-vs-Scale	statistics
§ Pearson’s	correlation	coefficient	
II. Nominal-vs-Nominal	statistics
§ Pearson’s	𝜒)
§ Cramér's 𝑉
III. Nominal-vs-Scale	statistics
§ Eta	statistic
§ 𝐹 statistic
IV. Ordinal-vs-Ordinal	statistics
§ Spearman’s	rank	correlation	coefficient
15
Scale-vs-Scale	Statistics	
Pearson’s	correlation	coefficient
§ A	measure	of	linear	dependence	between	scale	features
§ 𝜌)
measures	accuracy	of	𝑥)	~	𝑥0
16
𝜌	 =
123(56,57)
9:69:7
,								𝜌	 ∈ [−1,+1]
1 − 𝜌)
=
∑ 𝑥A,) − 𝑥BA,)
)C
AD0
∑ 𝑥A,) − 𝑥̅A,)
)C
AD0
Residual	Sum	of	Squares	(RSS)
Total	Sum	of	Squares	(TSS)
Nominal-vs-Nominal	Statistics
Pearson’s	𝜒)
§ A	measure	how	much	frequencies	of	value	pairs	of	two	categorical	features	deviate	from	
statistical	independence
§ Under	independence	assumption Pearson’s	𝜒)
distributed	approximately	𝜒)
𝑑 with
𝑑 = (𝑘0 − 1)(𝑘) − 1) degrees	of	freedom
§ 𝑃-value:
§ 𝑃 → 0 (rapidly)	as	features’	dependence	increases,	sensitive	to	𝑛
§ Only	measures	the	presence	of	dependence	not the	strength	of	dependence
17
𝜒)
=	 K
𝑂M,N − 𝐸M,N
)
𝐸M,NM,N
𝑥0 with 𝑘0 distinct categories
𝑥) with 𝑘) distinct categories
𝑂M ,N = #(𝑎, 𝑏) observed	frequencies
𝐸M,N =
#M	#N
C
expected frequencies for all
pairs (𝑎, 𝑏)
𝑃 = Pr 𝜌 ≥ Pearson[
s	𝜒)
	𝜌	~𝜒)
(𝑑)	distribution
Nominal-vs-Nominal	Statistics
Cramér's	𝑉
§ A	measure	for	the	strength	of	association	between	two	categorical	features
§ Under	independence	assumption	𝑉 distributed	approximately	𝜒)
𝑑 with	
𝑑 = (𝑘0 − 1)(𝑘) − 1) degrees	of	freedom
§ 𝑃-value:
§ 𝑃 → 1 (slowly)	as	features’	dependence	increases,	sensitive	to	𝑛
18
𝑉 =
Pearson[s	𝜒)
𝜒aM5
)
𝜒aM5
)
= 𝑛.min	{ 𝑘0 − 1, 𝑘) − 1}
𝑃 = Pr 𝜌 ≥ Cramér[
s	𝑉	 	𝜌	~𝜒)
(𝑑)	distribution
Nominal-vs-Scale	Statistics
Eta	statistic
§ A	measure	for	the	strength	of	association	between	a	categorical	feature	and	a	scale	
feature
§ 𝜂)
measures	accuracy	of	𝑦	~	𝑥 similar	to	𝑅)
statistic	of	linear	regression
19
𝜂)
= 1 −
∑ 𝑦A − 𝑦B[𝑥A] )C
AD0
∑ 𝑦A − 𝑦k )C
AD0
RSS
TSS
𝑥 categorical
𝑦 scale
𝑦B[𝑥A]:	average	of	𝑦A among	all	records	with	
𝑥A = 𝑥
Nominal-vs-Scale	Statistics
𝐹 statistic
§ A	measure	for	the	strength	of	association	between	a	categorical	feature	and	a	scale	
feature
§ Assumptions	(𝑥 categorical, 𝑦 scale):
§ 𝑦	~	𝑁𝑜𝑟𝑚𝑎𝑙 𝜇, 𝜎)
- same	variance	for	all	𝑥
§ 𝑥 has	small	value	domain	with	large	frequency	counts, 𝑥A non-random
§ All	records	are	iid
§ Under	independence	assumption	𝐹 distributed	approximately	𝐹(𝑘 − 1, 𝑛 − 𝑘)
20
𝐹 =
∑ 𝑓𝑟𝑒𝑞 𝑥 𝑦B 𝑥 − 𝑦k )/(𝑘 − 1)5
∑ 𝑦A − 𝑦B 𝑥A
)/(𝑛 − 𝑘)C
AD0
=
𝜂)(𝑛 − 𝑘)
1 − 𝜂)(𝑘 − 1)
ESS:	Explained	Sum	of	Squares
RSS
Degrees	of	freedom
Degrees	of	freedom
Ordinal-vs-Ordinal	Statistics
Spearman’s	rank	correlation	coefficient
§ A	measure	for	the	strength	of	association	between	two	ordinal	features
§ Pearson’s	correlation	efficient	applied	to	feature	with	values	replaced	by	their	ranks
Example:
21
8x
3)
11z
8{
5|
20
𝑥′
8
3
11
8
5
2
𝑥
4.5
2
6
4.5
3
1
𝑟
𝜌	 =
123	(•6,•7)
	9‚69‚7
𝜌	 ∈ [−1, +1]
Stratified	Statistic
Bivariate	statistics	measures	association	between	pairs	of	features	in	presence	of	a	
confounding	categorical	feature
Why	stratification?
22
Month Oct Nov Dec Oct-Dec
Customers	(Millions) 0.6 1.4 1.4 0.6 3.0 1.0 5.0 3.0
Promotions	(0	or	1) 0 1 0 1 0 1 0 1
Avg sales	per	1000 0.4 0.5 0.9 1.0 2.5 2.6 1.8 1.3
A	trend	in	each	group	is	reversed	and	
amplified	if	groups	combined
Stratified	Statistics
Measure	of	associations:	correlation,	slope,	𝑃-values,	etc.
Assumptions:
• Values	of	confounding	feature	𝑠 group	the	records	into	strata,	within	each	strata	all	
bivariate	pairs	assumed	free	of	confounding
• For	each	bivariate	pair	(𝑥, 𝑦),	𝑦 must	be	numerical	and	𝑦	distributed	normally	given	𝑥
• A	linear	regression	model	for	𝑦 (𝑖:	stratum	id)
• 𝜎)
same	across	all	strata
Computed	statistics:
• 𝑥̅A,		𝜎„5…
,		𝑦kA, 𝜎B†…
• For	𝑥	~ strata,	y	~ strata,	y	~	𝑥 NO	strata,	and	y	~	𝑥 AND	strata
• 𝑅)
, slopes,	std.	error	of	slopes,	𝑃- values
23
𝑦A,ˆ = 𝛼A + 𝛽𝑥A,ˆ + 𝜀A,ˆ 𝜀A,ˆ	~	𝑁𝑜𝑟𝑚𝑎𝑙(0, 𝜎)
)

More Related Content

Viewers also liked

The Hazards of Methamphetamine in Homes
The Hazards of Methamphetamine in HomesThe Hazards of Methamphetamine in Homes
The Hazards of Methamphetamine in Homesaerolitegroup
 
Considerações Brasscom aos padrões de auditoria – GT Auditoria
 Considerações Brasscom aos padrões de auditoria – GT Auditoria Considerações Brasscom aos padrões de auditoria – GT Auditoria
Considerações Brasscom aos padrões de auditoria – GT AuditoriaBrasscom
 
Viralité de contenu vs viralité de mécanique
Viralité de contenu vs viralité de mécaniqueViralité de contenu vs viralité de mécanique
Viralité de contenu vs viralité de mécaniqueErwan Le Nagard
 
SALC v1 (1)
SALC v1 (1)SALC v1 (1)
SALC v1 (1)Joe Yang
 
The Acropolis Hill (3d representations)
The Acropolis Hill (3d representations)The Acropolis Hill (3d representations)
The Acropolis Hill (3d representations)Eleni Georgakopoulou
 
Практика Торговицьке лісництво
Практика Торговицьке лісництвоПрактика Торговицьке лісництво
Практика Торговицьке лісництвоartischenkonatalia
 
Память
ПамятьПамять
Памятьschool135
 
Analisis del-codigo-de-etica-pnp
Analisis del-codigo-de-etica-pnpAnalisis del-codigo-de-etica-pnp
Analisis del-codigo-de-etica-pnpLucia Méndez
 
FEDOROV, A. FILM CRITICISM. МOSCOW: ICO “INFORMATION FOR ALL”. 2015. 382 P.
FEDOROV, A.  FILM CRITICISM.  МOSCOW: ICO “INFORMATION FOR ALL”. 2015.  382 P.FEDOROV, A.  FILM CRITICISM.  МOSCOW: ICO “INFORMATION FOR ALL”. 2015.  382 P.
FEDOROV, A. FILM CRITICISM. МOSCOW: ICO “INFORMATION FOR ALL”. 2015. 382 P.Alexander Fedorov
 

Viewers also liked (10)

The Hazards of Methamphetamine in Homes
The Hazards of Methamphetamine in HomesThe Hazards of Methamphetamine in Homes
The Hazards of Methamphetamine in Homes
 
Considerações Brasscom aos padrões de auditoria – GT Auditoria
 Considerações Brasscom aos padrões de auditoria – GT Auditoria Considerações Brasscom aos padrões de auditoria – GT Auditoria
Considerações Brasscom aos padrões de auditoria – GT Auditoria
 
Elmar Theune: Climate-Smart Dairy Webinar
Elmar Theune: Climate-Smart Dairy WebinarElmar Theune: Climate-Smart Dairy Webinar
Elmar Theune: Climate-Smart Dairy Webinar
 
Viralité de contenu vs viralité de mécanique
Viralité de contenu vs viralité de mécaniqueViralité de contenu vs viralité de mécanique
Viralité de contenu vs viralité de mécanique
 
SALC v1 (1)
SALC v1 (1)SALC v1 (1)
SALC v1 (1)
 
The Acropolis Hill (3d representations)
The Acropolis Hill (3d representations)The Acropolis Hill (3d representations)
The Acropolis Hill (3d representations)
 
Практика Торговицьке лісництво
Практика Торговицьке лісництвоПрактика Торговицьке лісництво
Практика Торговицьке лісництво
 
Память
ПамятьПамять
Память
 
Analisis del-codigo-de-etica-pnp
Analisis del-codigo-de-etica-pnpAnalisis del-codigo-de-etica-pnp
Analisis del-codigo-de-etica-pnp
 
FEDOROV, A. FILM CRITICISM. МOSCOW: ICO “INFORMATION FOR ALL”. 2015. 382 P.
FEDOROV, A.  FILM CRITICISM.  МOSCOW: ICO “INFORMATION FOR ALL”. 2015.  382 P.FEDOROV, A.  FILM CRITICISM.  МOSCOW: ICO “INFORMATION FOR ALL”. 2015.  382 P.
FEDOROV, A. FILM CRITICISM. МOSCOW: ICO “INFORMATION FOR ALL”. 2015. 382 P.
 

Similar to Data preparation, training and validation using SystemML by Faraz Makari Manshadi

Data preprocessing in Data Mining
Data preprocessing  in Data MiningData preprocessing  in Data Mining
Data preprocessing in Data MiningSamad Baseer Khan
 
Data types and Attributes1 (1).pptx
Data types and Attributes1 (1).pptxData types and Attributes1 (1).pptx
Data types and Attributes1 (1).pptxRupaRaj6
 
Data mining Basics and complete description
Data mining Basics and complete description Data mining Basics and complete description
Data mining Basics and complete description Sulman Ahmed
 
Data preprocessing PPT
Data preprocessing PPTData preprocessing PPT
Data preprocessing PPTANUSUYA T K
 
Data preprocessing
Data preprocessingData preprocessing
Data preprocessingTony Nguyen
 
Data preprocessing
Data preprocessingData preprocessing
Data preprocessingHarry Potter
 
Data preprocessing 2
Data preprocessing 2Data preprocessing 2
Data preprocessing 2extraganesh
 
1.6.data preprocessing
1.6.data preprocessing1.6.data preprocessing
1.6.data preprocessingKrish_ver2
 
Data Mining DataLecture Notes for Chapter 2Introduc
Data Mining DataLecture Notes for Chapter 2IntroducData Mining DataLecture Notes for Chapter 2Introduc
Data Mining DataLecture Notes for Chapter 2IntroducOllieShoresna
 
Data pre processing
Data pre processingData pre processing
Data pre processingjunnubabu
 
Data preprocessing
Data preprocessingData preprocessing
Data preprocessingextraganesh
 
1. chapter i(pasw)
1. chapter i(pasw)1. chapter i(pasw)
1. chapter i(pasw)Chhom Karath
 
Lect 2 getting to know your data
Lect 2 getting to know your dataLect 2 getting to know your data
Lect 2 getting to know your datahktripathy
 

Similar to Data preparation, training and validation using SystemML by Faraz Makari Manshadi (20)

Machine Learning with R
Machine Learning with RMachine Learning with R
Machine Learning with R
 
Data preprocessing in Data Mining
Data preprocessing  in Data MiningData preprocessing  in Data Mining
Data preprocessing in Data Mining
 
Data types and Attributes1 (1).pptx
Data types and Attributes1 (1).pptxData types and Attributes1 (1).pptx
Data types and Attributes1 (1).pptx
 
Data mining Basics and complete description
Data mining Basics and complete description Data mining Basics and complete description
Data mining Basics and complete description
 
Data preprocessing PPT
Data preprocessing PPTData preprocessing PPT
Data preprocessing PPT
 
Data For Datamining
Data For DataminingData For Datamining
Data For Datamining
 
Data For Datamining
Data For DataminingData For Datamining
Data For Datamining
 
02 data
02 data02 data
02 data
 
Data preprocessing
Data preprocessingData preprocessing
Data preprocessing
 
Data preprocessing
Data preprocessingData preprocessing
Data preprocessing
 
Data preprocessing 2
Data preprocessing 2Data preprocessing 2
Data preprocessing 2
 
1.6.data preprocessing
1.6.data preprocessing1.6.data preprocessing
1.6.data preprocessing
 
Data Mining DataLecture Notes for Chapter 2Introduc
Data Mining DataLecture Notes for Chapter 2IntroducData Mining DataLecture Notes for Chapter 2Introduc
Data Mining DataLecture Notes for Chapter 2Introduc
 
Data pre processing
Data pre processingData pre processing
Data pre processing
 
Data preprocessing
Data preprocessingData preprocessing
Data preprocessing
 
1. chapter i(pasw)
1. chapter i(pasw)1. chapter i(pasw)
1. chapter i(pasw)
 
naive bayes example.pdf
naive bayes example.pdfnaive bayes example.pdf
naive bayes example.pdf
 
naive bayes example.pdf
naive bayes example.pdfnaive bayes example.pdf
naive bayes example.pdf
 
R for Statistical Computing
R for Statistical ComputingR for Statistical Computing
R for Statistical Computing
 
Lect 2 getting to know your data
Lect 2 getting to know your dataLect 2 getting to know your data
Lect 2 getting to know your data
 

More from Arvind Surve

Apache SystemML Optimizer and Runtime techniques by Arvind Surve and Matthias...
Apache SystemML Optimizer and Runtime techniques by Arvind Surve and Matthias...Apache SystemML Optimizer and Runtime techniques by Arvind Surve and Matthias...
Apache SystemML Optimizer and Runtime techniques by Arvind Surve and Matthias...Arvind Surve
 
Apache SystemML Optimizer and Runtime techniques by Matthias Boehm
Apache SystemML Optimizer and Runtime techniques by Matthias BoehmApache SystemML Optimizer and Runtime techniques by Matthias Boehm
Apache SystemML Optimizer and Runtime techniques by Matthias BoehmArvind Surve
 
Apache SystemML Architecture by Niketan Panesar
Apache SystemML Architecture by Niketan PanesarApache SystemML Architecture by Niketan Panesar
Apache SystemML Architecture by Niketan PanesarArvind Surve
 
Clustering and Factorization using Apache SystemML by Prithviraj Sen
Clustering and Factorization using Apache SystemML by  Prithviraj SenClustering and Factorization using Apache SystemML by  Prithviraj Sen
Clustering and Factorization using Apache SystemML by Prithviraj SenArvind Surve
 
Clustering and Factorization using Apache SystemML by Alexandre V Evfimievski
Clustering and Factorization using Apache SystemML by  Alexandre V EvfimievskiClustering and Factorization using Apache SystemML by  Alexandre V Evfimievski
Clustering and Factorization using Apache SystemML by Alexandre V EvfimievskiArvind Surve
 
Classification using Apache SystemML by Prithviraj Sen
Classification using Apache SystemML by Prithviraj SenClassification using Apache SystemML by Prithviraj Sen
Classification using Apache SystemML by Prithviraj SenArvind Surve
 
Regression using Apache SystemML by Alexandre V Evfimievski
Regression using Apache SystemML by Alexandre V EvfimievskiRegression using Apache SystemML by Alexandre V Evfimievski
Regression using Apache SystemML by Alexandre V EvfimievskiArvind Surve
 
DML Syntax and Invocation process
DML Syntax and Invocation processDML Syntax and Invocation process
DML Syntax and Invocation processArvind Surve
 
Overview of Apache SystemML by Berthold Reinwald and Nakul Jindal
Overview of Apache SystemML by Berthold Reinwald and Nakul JindalOverview of Apache SystemML by Berthold Reinwald and Nakul Jindal
Overview of Apache SystemML by Berthold Reinwald and Nakul JindalArvind Surve
 
Apache SystemML 2016 Summer class primer by Berthold Reinwald
Apache SystemML 2016 Summer class primer by Berthold ReinwaldApache SystemML 2016 Summer class primer by Berthold Reinwald
Apache SystemML 2016 Summer class primer by Berthold ReinwaldArvind Surve
 
Apache SystemML Optimizer and Runtime techniques by Arvind Surve and Matthias...
Apache SystemML Optimizer and Runtime techniques by Arvind Surve and Matthias...Apache SystemML Optimizer and Runtime techniques by Arvind Surve and Matthias...
Apache SystemML Optimizer and Runtime techniques by Arvind Surve and Matthias...Arvind Surve
 
Apache SystemML Optimizer and Runtime techniques by Matthias Boehm
Apache SystemML Optimizer and Runtime techniques by Matthias BoehmApache SystemML Optimizer and Runtime techniques by Matthias Boehm
Apache SystemML Optimizer and Runtime techniques by Matthias BoehmArvind Surve
 
Apache SystemML Architecture by Niketan Panesar
Apache SystemML Architecture by Niketan PanesarApache SystemML Architecture by Niketan Panesar
Apache SystemML Architecture by Niketan PanesarArvind Surve
 
Clustering and Factorization using Apache SystemML by Prithviraj Sen
Clustering and Factorization using Apache SystemML by  Prithviraj SenClustering and Factorization using Apache SystemML by  Prithviraj Sen
Clustering and Factorization using Apache SystemML by Prithviraj SenArvind Surve
 
Clustering and Factorization using Apache SystemML by Alexandre V Evfimievski
Clustering and Factorization using Apache SystemML by  Alexandre V EvfimievskiClustering and Factorization using Apache SystemML by  Alexandre V Evfimievski
Clustering and Factorization using Apache SystemML by Alexandre V EvfimievskiArvind Surve
 
Classification using Apache SystemML by Prithviraj Sen
Classification using Apache SystemML by Prithviraj SenClassification using Apache SystemML by Prithviraj Sen
Classification using Apache SystemML by Prithviraj SenArvind Surve
 
Regression using Apache SystemML by Alexandre V Evfimievski
Regression using Apache SystemML by Alexandre V EvfimievskiRegression using Apache SystemML by Alexandre V Evfimievski
Regression using Apache SystemML by Alexandre V EvfimievskiArvind Surve
 
Data preparation, training and validation using SystemML by Faraz Makari Mans...
Data preparation, training and validation using SystemML by Faraz Makari Mans...Data preparation, training and validation using SystemML by Faraz Makari Mans...
Data preparation, training and validation using SystemML by Faraz Makari Mans...Arvind Surve
 
S1 DML Syntax and Invocation
S1 DML Syntax and InvocationS1 DML Syntax and Invocation
S1 DML Syntax and InvocationArvind Surve
 
Overview of Apache SystemML by Berthold Reinwald and Nakul Jindal
Overview of Apache SystemML by Berthold Reinwald and Nakul JindalOverview of Apache SystemML by Berthold Reinwald and Nakul Jindal
Overview of Apache SystemML by Berthold Reinwald and Nakul JindalArvind Surve
 

More from Arvind Surve (20)

Apache SystemML Optimizer and Runtime techniques by Arvind Surve and Matthias...
Apache SystemML Optimizer and Runtime techniques by Arvind Surve and Matthias...Apache SystemML Optimizer and Runtime techniques by Arvind Surve and Matthias...
Apache SystemML Optimizer and Runtime techniques by Arvind Surve and Matthias...
 
Apache SystemML Optimizer and Runtime techniques by Matthias Boehm
Apache SystemML Optimizer and Runtime techniques by Matthias BoehmApache SystemML Optimizer and Runtime techniques by Matthias Boehm
Apache SystemML Optimizer and Runtime techniques by Matthias Boehm
 
Apache SystemML Architecture by Niketan Panesar
Apache SystemML Architecture by Niketan PanesarApache SystemML Architecture by Niketan Panesar
Apache SystemML Architecture by Niketan Panesar
 
Clustering and Factorization using Apache SystemML by Prithviraj Sen
Clustering and Factorization using Apache SystemML by  Prithviraj SenClustering and Factorization using Apache SystemML by  Prithviraj Sen
Clustering and Factorization using Apache SystemML by Prithviraj Sen
 
Clustering and Factorization using Apache SystemML by Alexandre V Evfimievski
Clustering and Factorization using Apache SystemML by  Alexandre V EvfimievskiClustering and Factorization using Apache SystemML by  Alexandre V Evfimievski
Clustering and Factorization using Apache SystemML by Alexandre V Evfimievski
 
Classification using Apache SystemML by Prithviraj Sen
Classification using Apache SystemML by Prithviraj SenClassification using Apache SystemML by Prithviraj Sen
Classification using Apache SystemML by Prithviraj Sen
 
Regression using Apache SystemML by Alexandre V Evfimievski
Regression using Apache SystemML by Alexandre V EvfimievskiRegression using Apache SystemML by Alexandre V Evfimievski
Regression using Apache SystemML by Alexandre V Evfimievski
 
DML Syntax and Invocation process
DML Syntax and Invocation processDML Syntax and Invocation process
DML Syntax and Invocation process
 
Overview of Apache SystemML by Berthold Reinwald and Nakul Jindal
Overview of Apache SystemML by Berthold Reinwald and Nakul JindalOverview of Apache SystemML by Berthold Reinwald and Nakul Jindal
Overview of Apache SystemML by Berthold Reinwald and Nakul Jindal
 
Apache SystemML 2016 Summer class primer by Berthold Reinwald
Apache SystemML 2016 Summer class primer by Berthold ReinwaldApache SystemML 2016 Summer class primer by Berthold Reinwald
Apache SystemML 2016 Summer class primer by Berthold Reinwald
 
Apache SystemML Optimizer and Runtime techniques by Arvind Surve and Matthias...
Apache SystemML Optimizer and Runtime techniques by Arvind Surve and Matthias...Apache SystemML Optimizer and Runtime techniques by Arvind Surve and Matthias...
Apache SystemML Optimizer and Runtime techniques by Arvind Surve and Matthias...
 
Apache SystemML Optimizer and Runtime techniques by Matthias Boehm
Apache SystemML Optimizer and Runtime techniques by Matthias BoehmApache SystemML Optimizer and Runtime techniques by Matthias Boehm
Apache SystemML Optimizer and Runtime techniques by Matthias Boehm
 
Apache SystemML Architecture by Niketan Panesar
Apache SystemML Architecture by Niketan PanesarApache SystemML Architecture by Niketan Panesar
Apache SystemML Architecture by Niketan Panesar
 
Clustering and Factorization using Apache SystemML by Prithviraj Sen
Clustering and Factorization using Apache SystemML by  Prithviraj SenClustering and Factorization using Apache SystemML by  Prithviraj Sen
Clustering and Factorization using Apache SystemML by Prithviraj Sen
 
Clustering and Factorization using Apache SystemML by Alexandre V Evfimievski
Clustering and Factorization using Apache SystemML by  Alexandre V EvfimievskiClustering and Factorization using Apache SystemML by  Alexandre V Evfimievski
Clustering and Factorization using Apache SystemML by Alexandre V Evfimievski
 
Classification using Apache SystemML by Prithviraj Sen
Classification using Apache SystemML by Prithviraj SenClassification using Apache SystemML by Prithviraj Sen
Classification using Apache SystemML by Prithviraj Sen
 
Regression using Apache SystemML by Alexandre V Evfimievski
Regression using Apache SystemML by Alexandre V EvfimievskiRegression using Apache SystemML by Alexandre V Evfimievski
Regression using Apache SystemML by Alexandre V Evfimievski
 
Data preparation, training and validation using SystemML by Faraz Makari Mans...
Data preparation, training and validation using SystemML by Faraz Makari Mans...Data preparation, training and validation using SystemML by Faraz Makari Mans...
Data preparation, training and validation using SystemML by Faraz Makari Mans...
 
S1 DML Syntax and Invocation
S1 DML Syntax and InvocationS1 DML Syntax and Invocation
S1 DML Syntax and Invocation
 
Overview of Apache SystemML by Berthold Reinwald and Nakul Jindal
Overview of Apache SystemML by Berthold Reinwald and Nakul JindalOverview of Apache SystemML by Berthold Reinwald and Nakul Jindal
Overview of Apache SystemML by Berthold Reinwald and Nakul Jindal
 

Recently uploaded

1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 

Recently uploaded (20)

1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 

Data preparation, training and validation using SystemML by Faraz Makari Manshadi