SlideShare a Scribd company logo
1 of 50
© 2014 IBM Corporation
Bluemix Hadoop
Beginner’s Guide -- Part I
Joseph Chang
Senior IT Specialist
IBM Cloud
Document number
• Ambari
• HDFS Explore
• WebHDFS API
• Connect with R Console
• Machine Learning (lm, k-means)
© 2014 IBM Corporation
Reference:
https://www.ng.bluemix.net/docs/services/AnalyticsforHadoop/index.html#analyt
icsforhadoop_data
2
Take me to Bluemix
Click Here
© 2014 IBM Corporation
Are you the target reader?
3
Have you heard
about Bluemix?
Do you know
Hadoop?
Do you know R
language?
Are you interested
in have the three
things working
together?
Yes
Yes
Yes
Yes
Continue to the next
page.
Learn about Bluemix and sign-up.
http://www.bluemix.net
Learn about Hadoop
https://hadoop.apache.org/
Learn about R.
https://www.r-project.org/
No
No
No
No
Bye
© 2014 IBM Corporation
The following 2 Bluemix services are used in this tutorial :
4
Assume you already have
Bluemix id. If you don’t , go
to http://ww.bluemix.net to
get one.
© 2014 IBM Corporation
Create Hadoop Service in Bluemix
5
Please create a java
runtime and add a hadoop
service by yourself.
© 2014 IBM Corporation
Create Hadoop Service in Bluemix
6
You can get the AmbariUrl,
WebhdfsUrl, id, password
… etc. from “Show
Credentials”.
© 2014 IBM Corporation
Ambari Hadoop Management
7
© 2014 IBM Corporation
Monitoring Hadoop with Ambari
8
eg. https://bi-hadoop-prod-2016.services.dal.bluemix.net:8081
https://bi-hadoop-prod-<Cluster ID>.services.dal.bluemix.net:8081
Launch the Ambari
Dashboard with this
URL.
© 2014 IBM Corporation
Ambari – View the detail information of each services
9
Note: Spark
service is
available in this
environment.
© 2014 IBM Corporation
Ambari – Hosts
10
The server nodes
in this Hadoop
Cluster.
© 2014 IBM Corporation
Ambari – Cluster Stack Version
11
The Big R Service will be
used in this tutoral.
© 2014 IBM Corporation
HDFS Explore
12
© 2014 IBM Corporation
HDFS Explore
13
eg. https://bi-hadoop-prod-2016.services.dal.bluemix.net:8443/gateway/default/hdfs/explorer.html
https://bi-hadoop-prod-<Cluster
ID>.services.dal.bluemix.net:8443/gateway/default/hdfs/explorer.ht
ml
Launch the HDFS
Explore with this
URL.
View the files on
the Hadoop File
System. It’s ready
only.
© 2014 IBM Corporation
HDFS – Healthy
14
© 2014 IBM Corporation
WEBHDFS REST API
15
© 2014 IBM Corporation
Upload Data with curl + webhdfs rest api
16
curl -i -L -k -s --user biblumix:<your_biblumix_password> --max-time 45
-X PUT
https://bi-hadoop-prod-<your_cluster_number>.services.dal.bluemix.net:8443/
gateway/default/webhdfs/v1/user/biblumix/<path_to_file/file_name>?op=CREATE
curl -i -L -k -s --user biblumix:<your_biblumix_password> --max-time 45
-X PUT -T <file_name.txt> <Location URL from step 1 response message>
If you can’t run “curl”
in your command
line, google it and
download it.
Use WEBHDFS API
to upload file
The current
CREATE api have a
defect cause the
uploaded file
size=0. The 2 steps
approach is a
workaround.
© 2014 IBM Corporation
Upload Data with curl + webhdfs rest api (Screen capture)
17
Step 1 Create temp redirect Step 2 Upload file from local disk
The location in step
1 response
message will be
used in step 2.
You should
get response
code 307 in
step 1
You should
get response
code 201 in
step 2
© 2014 IBM Corporation
Upload Data with curl + webhdfs rest api (Result)
18
The file has
been
uploaded.
Note the size
should not be
0.
© 2014 IBM Corporation
More webhdfs rest api
19
curl -i -k -s --user biblumix:your_biblumix_password --max-time 45
https://bi-hadoop-prod-your_cluster_number.services.dal.bluemix.net:8443/
gateway/default/webhdfs/v1/user?op=LISTSTATUS
curl -i -s --user biblumix:passwordhttps://hostname:8443/gateway/default/oozie/v1/jobs?jobtype=wf
curl -i -s --user biblumix:password -X POST -H "Content-Type: application/xml" -d @oozie-mrjob-
config.xml https://hostname:8443/gateway/default/oozie/v1/jobs?action=start
curl -i -s --user biblumix:your_biblumix_password --max-time 45 -X DELETE
https://bi-hadoop-prod-your_cluster_number.services.dal.bluemix.net:8443/
gateway/default/webhdfs/v1/user/biblumix/path_to_file?op=DELETE
curl -i -k -s --user biblumix:your_biblumix_password --max-time 45 -X PUT
https://bi-hadoop-prod-your_cluster_number.services.dal.bluemix.net:8443/
gateway/default/webhdfs/v1/user/biblumix/path_to_directory?op=MKDIRS
© 2014 IBM Corporation
Install R Console & Big R
20
© 2014 IBM Corporation
Download Drivers for Big R
21
https://hub.jazz.net/project/kulkarni/a4h/overview#https://hub.jazz.
net/git/kulkarni%252Fa4h/list/master/client-libs
Extract the file to  /temp

The big R library
can be download
from this url.
© 2014 IBM Corporation
Install R Console
22
Download R Language
https://cran.r-project.org/
Launch R console: Launch R in Terminal:
If you don’t have R
console in your
PC/NB . Download it
from this URL.
You can use
either R Console
or terminal.
Type R in
command line
to launch R.
© 2014 IBM Corporation
Install Big R
 > install.packages('rJava')
 --- Please select a CRAN mirror for use in this session ---
 HTTPS CRAN mirror
 1: 0-Cloud [https] 2: Austria [https]
 3: Chile [https] 4: China (Beijing 4) [https]
 5: China (Hefei) [https] 6: Colombia (Cali) [https]
 7: France (Lyon 2) [https] 8: Germany (Münster) [https]
 9: Iceland [https] 10: Russia (Moscow) [https]
 11: Spain (A Coruña) [https] 12: Switzerland [https]
 13: UK (Bristol) [https] 14: UK (Cambridge) [https]
 15: USA (CA 1) [https] 16: USA (KS) [https]
 17: USA (MI 1) [https] 18: USA (TN) [https]
 19: USA (TX) [https] 20: USA (WA) [https]
 21: (HTTP mirrors)
 Selection: 1
 trying URL 'https://cran.rstudio.com/bin/macosx/mavericks/contrib/3.2/rJava_0.9-7.tgz'
 Content type 'application/x-gzip' length 604271 bytes (590 KB)
 ==================================================
 downloaded 590 KB
 The downloaded binary packages are in
 /var/folders/g0/jgl74nkx0h97dgpywqv2prrc0000gn/T//Rtmp3ggvb8/downloaded_packages
23
Warning message:
In doTryCatch(return(expr), name, parentenv, handler) :
unable to load shared object '/Library/Frameworks/R.framework/Resources/modules//R_X11.so':
dlopen(/Library/Frameworks/R.framework/Resources/modules//R_X11.so, 6): Library not loaded:
/opt/X11/lib/libSM.6.dylib
Referenced from: /Library/Frameworks/R.framework/Resources/modules//R_X11.so
Reason: image not found
>
Before install big
R package. We
need install
rJava.
© 2014 IBM Corporation
Install Big R
24
> install.packages('base64enc')
trying URL 'https://cran.rstudio.com/bin/macosx/mavericks/contrib/3.2/base64enc_0.1-3.tgz'
Content type 'application/x-gzip' length 26679 bytes (26 KB)
==================================================
downloaded 26 KB
The downloaded binary packages are in
/var/folders/g0/jgl74nkx0h97dgpywqv2prrc0000gn/T//Rtmp3ggvb8/downloaded_packages
> install.packages('data.table')
also installing the dependencies ‘stringi’, ‘magrittr’, ‘plyr’, ‘stringr’, ‘Rcpp’, ‘chron’, ‘reshape2’
trying URL 'https://cran.rstudio.com/bin/macosx/mavericks/contrib/3.2/stringi_0.5-5.tgz'
Content type 'application/x-gzip' length 12685069 bytes (12.1 MB)
==================================================
downloaded 12.1 MB
….
trying URL 'https://cran.rstudio.com/bin/macosx/mavericks/contrib/3.2/data.table_1.9.4.tgz'
Content type 'application/x-gzip' length 1266610 bytes (1.2 MB)
==================================================
downloaded 1.2 MB
The downloaded binary packages are in
/var/folders/g0/jgl74nkx0h97dgpywqv2prrc0000gn/T//Rtmp3ggvb8/downloaded_packages
>
Before install big R
package. We need
install base64enc
and data.table
© 2014 IBM Corporation
Install Big R
25
> install.packages(pkg="/temp/bigr_3.18.tar.gz", type="source", repos=NULL)
* installing *source* package ‘bigr’ ...
** R
** inst
** preparing package for lazy loading
Attaching...Creating a generic function for ‘toString’ from package ‘base’ in package ‘bigr’
Creating a generic function for ‘nchar’ from package ‘base’ in package ‘bigr’
Creating a generic function for ‘coef’ from package ‘stats’ in package ‘bigr’
** help
*** installing help indices
** building package indices
** testing if installed package can be loaded
* DONE (bigr)
>
Now you can
install Big R
library.
Make sure the
library path is
correct.
© 2014 IBM Corporation
Install Big R (2 issues in Bluemix doc)
26
If you copy the
command in
Bluemix doc , you
may got this error.
(Aug.2015)
It should be
“packages”
The Bluemix
instruction dosen’t
mention the 3
libraries need to
be installed first.
© 2014 IBM Corporation
Machine Learning
-- Liner Regression
-- K-means
27
http://www-
01.ibm.com/support/knowledgecenter/SSPT3X_4.0.0/com.ibm.swg.im.infosphere.biginsights.bigr.doc
/doc/intro.html?cp=SSPT3X_4.0.0%2F9-1
Reference:
Recommendation:
Learn more about
big R from the
URL.
© 2014 IBM Corporation
Machine Learning – Big R Example #1-1
28
############################
# 1.1 Connect to Bluemix Hadoop
#############################
# In order to try out any example, first run the following steps to upload
# the aforementioned dataset to a BigInsights cluster.
library(bigr)
bigr.connect(host="bi-hadoop-prod-2016.services.dal.bluemix.net",user="biblumix",
password="w9@4f0~HnXLD",ssl=TRUE,
trustStorePath="/Library/Java/Home/lib/security/cacerts",
trustStorePassword="changeit",keyManager="SunX509")
is.bigr.connected()
Replace it with
your own cluster id
Replace it with your
own password
Replace it with the
Java Home path in
your Environment.
© 2014 IBM Corporation
Machine Learning – Big R Example #1-2
29
#################
# 1.2 Data loading
#################
airfile <- system.file("extdata", "airline.zip", package="bigr”)
airfile <- unzip(airfile, exdir = tempdir())
airR <- read.csv(airfile, stringsAsFactors=F)
# Upload the data to the BigInsights server. This may take 15-20 seconds
air <- as.bigr.frame(airR)
air <- bigr.persist(air, dataSource="DEL", dataPath="/user/bigr/examples/airline_demo.csv”,
header=T, delimiter=",", useMapReduce=F)
The file uses “,”
as DELimiter
© 2014 IBM Corporation
Big R Example #1 (Screen capture)
30
You can check if
the file is
successfully upload
by HDFS explore.
You should get
“TRUE” if
successfully connect
to bluemix hadoop.
© 2014 IBM Corporation
About the airline.csv sample data
31
The airline.zip sample can be found in your R installation directory.
© 2014 IBM Corporation
Machine Learning – Big R Example #2
32
###########################
# 2. Accessing data on HDFS
###########################
# Once uploaded, one merely needs to instantiate a big.frame object,
# commonly referenced as "air" in the examples, to access the dataset via
# the Big R API.
air <- bigr.frame(dataPath = "/user/bigr/examples/airline_demo.csv",
dataSource = "DEL",
delimiter=",", header = T,
coltypes = ifelse(1:29 %in% c(9,11,17,18,23),
"character", "integer"),
useMapReduce = F)
There are 29 columns
in the
airline_dmeo.csv file.
Column
9,11,17,18,23 are
character. Remaining
columns are integer.
© 2014 IBM Corporation
Big R Example #2 (Screen capture)
33
© 2014 IBM Corporation
Machine Learning – Big R Example #3-1
34
#################################################################
# 3. Machine Learning example: building a Linear Regression model
#################################################################
# Remove files from previous executions (if any)
invisible(bigr.rmfs("/user/bigr/examples/airline.sample.* /user/bigr/examples/lm.airline*"))
# Project some relevant columns for modeling / statistical analysis
airlineFiltered <- air[, c("Month", "DayofMonth", "DayOfWeek", "CRSDepTime",
"Distance", "ArrDelay")]
# Create a bigr.matrix from the data
airlineMatrix <- bigr.transform(airlineFiltered,
outData="/user/bigr/examples/airline.sample.matrix",
transformPath="/user/bigr/examples/airline.sample.transform")
The 6 variables are
choose for this model.
© 2014 IBM Corporation
Machine Learning – Big R Example #3-2
35
#################################################################
# 3. Machine Learning example: building a Linear Regression model
#################################################################
# Split the data into 70% for training and 30% for testing
samples <- bigr.sample(airlineMatrix, perc=c(0.7, 0.3))
train <- samples[[1]]
test <- samples[[2]]
# Create a linear regression model
lm <- bigr.lm(ArrDelay ~ ., data=train, directory="/user/bigr/examples/lm.airline")
# Get the coefficients of the regression
coef(lm)
We will use "Month",
"DayofMonth",
"DayOfWeek",
"CRSDepTime”,
"Distance” to predict
ArrDelay
© 2014 IBM Corporation
Big R Example #3 (Screen Capture)
36
© 2014 IBM Corporation
Big R Example #3-2 (Result)
37
Y : ArrDelay
X1: Month
X2: DayofMonth
X3: DayOfweek
X4: CRSDepTIme
X5: Distance
Y = -0.174423*X1 -0.01547941*X2-0.03378236*X3
+0.006222544*X4+0.0003556919*X5
The Arrival Delay prediction model is :
© 2014 IBM Corporation
Machine Learning – Big R Example #3-3
38
#################################################################
# 3. Machine Learning example: building a Linear Regression model
#################################################################
# Calculate predictions for the testing set
pred <- predict(lm, test, "/user/bigr/examples/lm.airline.preds")
© 2014 IBM Corporation
Big R Example #3 (Screen Capture)
39
Predicted arrival
delay time for
test data.
© 2014 IBM Corporation
Big R Example #3 (output)
40
View the preds
files generate on
the hdfs.
© 2014 IBM Corporation
Machine Learning – Big R Example #4
41
##################################################################
# 4. Machine Learning example: building a k-means clustering model
##################################################################
# Remove files from previous executions (if any)
invisible(bigr.rmfs("/user/bigr/examples/iris.* /user/bigr/examples/km*"))
# Load the Iris dataset to HDFS
irisbf <- as.bigr.frame(iris[, -5])
# Convert the Iris dataset into a bigr.matrix object
irisBM <- bigr.transform(bf = irisbf, outData = "/user/bigr/examples/iris.mtx",
transformPath = "/user/bigr/examples/iris.transform")
# Create a k-means model with 10 clusters
km <- bigr.kmeans(irisBM, centers=10, directory="/user/bigr/examples/km", writeY=T)
# Use the existing model to cluster a different dataset
p <- predict(km, irisBM, "/user/bigr/examples/km.preds")
Iris is the built-in
sample data set in
R Language
© 2014 IBM Corporation
About the sample data -- IRIS
42
© 2014 IBM Corporation
Big R Example #4 (Screen Capture)
43
The 10 clusters
of IRIS by k-
means.
© 2014 IBM Corporation
Big R Example #4 (Screen capture)
44
Identify each
sample data with
the model.
© 2014 IBM Corporation
Thank you
45
Take me to Bluemix
Click Here
© 2014 IBM Corporation
Appendix 1: Hadoop Cloud Demo
with IBM Bluemix
46
I found this great video in
Youtube.
You can learn more about
Bluemix Hadoop in this
video.
© 2014 IBM Corporation
Big Data Hadoop Cloud Demo – IBM Bluemix
https://www.youtube.com/watch?v=FUDOsBDAahE
47
© 2014 IBM Corporation
Appendix 2: Define Hadoop Cluster by yourself
48
If you want your application
run faster. You may choose
this charged service which
running on bare metal
servers with multiple
nodes.
© 2014 IBM Corporation
BigInsights for Hadoop Cluster Topology
49
© 2014 IBM Corporation
BigInsights for Hadoop Cluster Topology
50

More Related Content

What's hot

Reusable, composable, battle-tested Terraform modules
Reusable, composable, battle-tested Terraform modulesReusable, composable, battle-tested Terraform modules
Reusable, composable, battle-tested Terraform modulesYevgeniy Brikman
 
PuppetConf 2016: Best Practices for Puppet in the Cloud – Randall Hunt, Amazo...
PuppetConf 2016: Best Practices for Puppet in the Cloud – Randall Hunt, Amazo...PuppetConf 2016: Best Practices for Puppet in the Cloud – Randall Hunt, Amazo...
PuppetConf 2016: Best Practices for Puppet in the Cloud – Randall Hunt, Amazo...Puppet
 
Kubernetes best practices
Kubernetes best practicesKubernetes best practices
Kubernetes best practicesBill Liu
 
How to test infrastructure code: automated testing for Terraform, Kubernetes,...
How to test infrastructure code: automated testing for Terraform, Kubernetes,...How to test infrastructure code: automated testing for Terraform, Kubernetes,...
How to test infrastructure code: automated testing for Terraform, Kubernetes,...Yevgeniy Brikman
 
Istio By Example (extended version)
Istio By Example (extended version)Istio By Example (extended version)
Istio By Example (extended version)Josef Adersberger
 
Measure and Increase Developer Productivity with Help of Serverless at JCON 2...
Measure and Increase Developer Productivity with Help of Serverless at JCON 2...Measure and Increase Developer Productivity with Help of Serverless at JCON 2...
Measure and Increase Developer Productivity with Help of Serverless at JCON 2...Vadym Kazulkin
 
Infrastructure as code
Infrastructure as codeInfrastructure as code
Infrastructure as codeAxel Quack
 
All Things Open 2019 weave-services-istio
All Things Open 2019 weave-services-istioAll Things Open 2019 weave-services-istio
All Things Open 2019 weave-services-istioLin Sun
 
Containerize Legacy .NET Framework Web Apps for Cloud Migration - ENT201 - Ch...
Containerize Legacy .NET Framework Web Apps for Cloud Migration - ENT201 - Ch...Containerize Legacy .NET Framework Web Apps for Cloud Migration - ENT201 - Ch...
Containerize Legacy .NET Framework Web Apps for Cloud Migration - ENT201 - Ch...Amazon Web Services
 
Terraform - The Road to Self-Service
Terraform - The Road to Self-ServiceTerraform - The Road to Self-Service
Terraform - The Road to Self-ServiceRyan Boyce
 
White Paper: Scaling Servers and Storage for Film Assets
White Paper: Scaling Servers and Storage for Film AssetsWhite Paper: Scaling Servers and Storage for Film Assets
White Paper: Scaling Servers and Storage for Film AssetsPerforce
 
Advanced Container Security - AWS Summit Sydney 2018
Advanced Container Security - AWS Summit Sydney 2018Advanced Container Security - AWS Summit Sydney 2018
Advanced Container Security - AWS Summit Sydney 2018Amazon Web Services
 
OpenStack Ironic Project Summary, February 2014
OpenStack Ironic Project Summary, February 2014OpenStack Ironic Project Summary, February 2014
OpenStack Ironic Project Summary, February 2014Devananda Van Der Veen
 
[OpenStack Day in Korea] OpenStack Provisioning in 30 minutes
[OpenStack Day in Korea] OpenStack Provisioning in 30 minutes[OpenStack Day in Korea] OpenStack Provisioning in 30 minutes
[OpenStack Day in Korea] OpenStack Provisioning in 30 minutesSungjin Kang
 
Docker meetup - PaaS interoperability
Docker meetup - PaaS interoperabilityDocker meetup - PaaS interoperability
Docker meetup - PaaS interoperabilityLudovic Piot
 
Docker for Ruby Developers
Docker for Ruby DevelopersDocker for Ruby Developers
Docker for Ruby DevelopersAptible
 
Deploy Prometheus - Grafana and EFK stack on Kubic k8s Clusters
Deploy Prometheus - Grafana and EFK stack on Kubic k8s ClustersDeploy Prometheus - Grafana and EFK stack on Kubic k8s Clusters
Deploy Prometheus - Grafana and EFK stack on Kubic k8s ClustersSyah Dwi Prihatmoko
 
Transforming Infrastructure into Code - Importing existing cloud resources u...
Transforming Infrastructure into Code  - Importing existing cloud resources u...Transforming Infrastructure into Code  - Importing existing cloud resources u...
Transforming Infrastructure into Code - Importing existing cloud resources u...Shih Oon Liong
 

What's hot (20)

Reusable, composable, battle-tested Terraform modules
Reusable, composable, battle-tested Terraform modulesReusable, composable, battle-tested Terraform modules
Reusable, composable, battle-tested Terraform modules
 
PuppetConf 2016: Best Practices for Puppet in the Cloud – Randall Hunt, Amazo...
PuppetConf 2016: Best Practices for Puppet in the Cloud – Randall Hunt, Amazo...PuppetConf 2016: Best Practices for Puppet in the Cloud – Randall Hunt, Amazo...
PuppetConf 2016: Best Practices for Puppet in the Cloud – Randall Hunt, Amazo...
 
Terraform
TerraformTerraform
Terraform
 
Kubernetes best practices
Kubernetes best practicesKubernetes best practices
Kubernetes best practices
 
How to test infrastructure code: automated testing for Terraform, Kubernetes,...
How to test infrastructure code: automated testing for Terraform, Kubernetes,...How to test infrastructure code: automated testing for Terraform, Kubernetes,...
How to test infrastructure code: automated testing for Terraform, Kubernetes,...
 
Istio By Example (extended version)
Istio By Example (extended version)Istio By Example (extended version)
Istio By Example (extended version)
 
Measure and Increase Developer Productivity with Help of Serverless at JCON 2...
Measure and Increase Developer Productivity with Help of Serverless at JCON 2...Measure and Increase Developer Productivity with Help of Serverless at JCON 2...
Measure and Increase Developer Productivity with Help of Serverless at JCON 2...
 
Infrastructure as code
Infrastructure as codeInfrastructure as code
Infrastructure as code
 
All Things Open 2019 weave-services-istio
All Things Open 2019 weave-services-istioAll Things Open 2019 weave-services-istio
All Things Open 2019 weave-services-istio
 
Containerize Legacy .NET Framework Web Apps for Cloud Migration - ENT201 - Ch...
Containerize Legacy .NET Framework Web Apps for Cloud Migration - ENT201 - Ch...Containerize Legacy .NET Framework Web Apps for Cloud Migration - ENT201 - Ch...
Containerize Legacy .NET Framework Web Apps for Cloud Migration - ENT201 - Ch...
 
Terraform - The Road to Self-Service
Terraform - The Road to Self-ServiceTerraform - The Road to Self-Service
Terraform - The Road to Self-Service
 
White Paper: Scaling Servers and Storage for Film Assets
White Paper: Scaling Servers and Storage for Film AssetsWhite Paper: Scaling Servers and Storage for Film Assets
White Paper: Scaling Servers and Storage for Film Assets
 
Advanced Container Security - AWS Summit Sydney 2018
Advanced Container Security - AWS Summit Sydney 2018Advanced Container Security - AWS Summit Sydney 2018
Advanced Container Security - AWS Summit Sydney 2018
 
OpenStack Ironic Project Summary, February 2014
OpenStack Ironic Project Summary, February 2014OpenStack Ironic Project Summary, February 2014
OpenStack Ironic Project Summary, February 2014
 
[OpenStack Day in Korea] OpenStack Provisioning in 30 minutes
[OpenStack Day in Korea] OpenStack Provisioning in 30 minutes[OpenStack Day in Korea] OpenStack Provisioning in 30 minutes
[OpenStack Day in Korea] OpenStack Provisioning in 30 minutes
 
Manual 5
Manual 5Manual 5
Manual 5
 
Docker meetup - PaaS interoperability
Docker meetup - PaaS interoperabilityDocker meetup - PaaS interoperability
Docker meetup - PaaS interoperability
 
Docker for Ruby Developers
Docker for Ruby DevelopersDocker for Ruby Developers
Docker for Ruby Developers
 
Deploy Prometheus - Grafana and EFK stack on Kubic k8s Clusters
Deploy Prometheus - Grafana and EFK stack on Kubic k8s ClustersDeploy Prometheus - Grafana and EFK stack on Kubic k8s Clusters
Deploy Prometheus - Grafana and EFK stack on Kubic k8s Clusters
 
Transforming Infrastructure into Code - Importing existing cloud resources u...
Transforming Infrastructure into Code  - Importing existing cloud resources u...Transforming Infrastructure into Code  - Importing existing cloud resources u...
Transforming Infrastructure into Code - Importing existing cloud resources u...
 

Viewers also liked

Using java to access bluemix object storage v2
Using java to access bluemix object storage v2Using java to access bluemix object storage v2
Using java to access bluemix object storage v2Joseph Chang
 
From docker hub to bluemix catalog
From docker hub to bluemix catalogFrom docker hub to bluemix catalog
From docker hub to bluemix catalogJoseph Chang
 
Investment advisor with bluemix trade off analysis
Investment advisor with bluemix trade off analysisInvestment advisor with bluemix trade off analysis
Investment advisor with bluemix trade off analysisJoseph Chang
 
Open whisk quick start guide
Open whisk quick start guideOpen whisk quick start guide
Open whisk quick start guideJoseph Chang
 
dashDB データロード方法ガイド
dashDB データロード方法ガイドdashDB データロード方法ガイド
dashDB データロード方法ガイドIBM Analytics Japan
 
Bluemix Node-Red Part II
Bluemix Node-Red Part IIBluemix Node-Red Part II
Bluemix Node-Red Part IIJoseph Chang
 
Bluemix node red-part iii
Bluemix node red-part iiiBluemix node red-part iii
Bluemix node red-part iiiJoseph Chang
 
Bluemix Node-Red Part I
Bluemix Node-Red Part IBluemix Node-Red Part I
Bluemix Node-Red Part IJoseph Chang
 
Create to blumix docker container with rest api
Create to blumix docker container with rest apiCreate to blumix docker container with rest api
Create to blumix docker container with rest apiJoseph Chang
 
IBM版Hadoop - BigInsights/Big SQL (2013/07/26 CLUB DB2発表資料)
IBM版Hadoop - BigInsights/Big SQL (2013/07/26 CLUB DB2発表資料)IBM版Hadoop - BigInsights/Big SQL (2013/07/26 CLUB DB2発表資料)
IBM版Hadoop - BigInsights/Big SQL (2013/07/26 CLUB DB2発表資料)Akira Shimosako
 
Project Humix overview
Project Humix overviewProject Humix overview
Project Humix overviewJeffrey Liu
 

Viewers also liked (13)

Using java to access bluemix object storage v2
Using java to access bluemix object storage v2Using java to access bluemix object storage v2
Using java to access bluemix object storage v2
 
From docker hub to bluemix catalog
From docker hub to bluemix catalogFrom docker hub to bluemix catalog
From docker hub to bluemix catalog
 
Investment advisor with bluemix trade off analysis
Investment advisor with bluemix trade off analysisInvestment advisor with bluemix trade off analysis
Investment advisor with bluemix trade off analysis
 
Open whisk quick start guide
Open whisk quick start guideOpen whisk quick start guide
Open whisk quick start guide
 
Retrive&amp;rank
Retrive&amp;rankRetrive&amp;rank
Retrive&amp;rank
 
dashDBへの接続方法
dashDBへの接続方法 dashDBへの接続方法
dashDBへの接続方法
 
dashDB データロード方法ガイド
dashDB データロード方法ガイドdashDB データロード方法ガイド
dashDB データロード方法ガイド
 
Bluemix Node-Red Part II
Bluemix Node-Red Part IIBluemix Node-Red Part II
Bluemix Node-Red Part II
 
Bluemix node red-part iii
Bluemix node red-part iiiBluemix node red-part iii
Bluemix node red-part iii
 
Bluemix Node-Red Part I
Bluemix Node-Red Part IBluemix Node-Red Part I
Bluemix Node-Red Part I
 
Create to blumix docker container with rest api
Create to blumix docker container with rest apiCreate to blumix docker container with rest api
Create to blumix docker container with rest api
 
IBM版Hadoop - BigInsights/Big SQL (2013/07/26 CLUB DB2発表資料)
IBM版Hadoop - BigInsights/Big SQL (2013/07/26 CLUB DB2発表資料)IBM版Hadoop - BigInsights/Big SQL (2013/07/26 CLUB DB2発表資料)
IBM版Hadoop - BigInsights/Big SQL (2013/07/26 CLUB DB2発表資料)
 
Project Humix overview
Project Humix overviewProject Humix overview
Project Humix overview
 

Similar to Bluemix hadoop beginners Guide part I

Scaffolding for Serverless: lightning talk for AWS Arlington Meetup
Scaffolding for Serverless: lightning talk for AWS Arlington MeetupScaffolding for Serverless: lightning talk for AWS Arlington Meetup
Scaffolding for Serverless: lightning talk for AWS Arlington MeetupChris Shenton
 
DCEU 18: Developing with Docker Containers
DCEU 18: Developing with Docker ContainersDCEU 18: Developing with Docker Containers
DCEU 18: Developing with Docker ContainersDocker, Inc.
 
Automate Your Automation | DrupalCon Vienna
Automate Your Automation | DrupalCon ViennaAutomate Your Automation | DrupalCon Vienna
Automate Your Automation | DrupalCon ViennaPantheon
 
CloudOps CloudStack Days, Austin April 2015
CloudOps CloudStack Days, Austin April 2015CloudOps CloudStack Days, Austin April 2015
CloudOps CloudStack Days, Austin April 2015CloudOps2005
 
How to set up an ASP.NET 5 Continuous Delivery Pipeline using IBM Bluemix Dev...
How to set up an ASP.NET 5 Continuous Delivery Pipeline using IBM Bluemix Dev...How to set up an ASP.NET 5 Continuous Delivery Pipeline using IBM Bluemix Dev...
How to set up an ASP.NET 5 Continuous Delivery Pipeline using IBM Bluemix Dev...Richard Johansson
 
Using Docker For Development
Using Docker For DevelopmentUsing Docker For Development
Using Docker For DevelopmentLaura Frank Tacho
 
Scaleable PHP Applications in Kubernetes
Scaleable PHP Applications in KubernetesScaleable PHP Applications in Kubernetes
Scaleable PHP Applications in KubernetesRobert Lemke
 
Digital Forensics and Incident Response in The Cloud Part 3
Digital Forensics and Incident Response in The Cloud Part 3Digital Forensics and Incident Response in The Cloud Part 3
Digital Forensics and Incident Response in The Cloud Part 3Velocidex Enterprises
 
Developer Nirvana with IBM Bluemix™
Developer Nirvana with IBM Bluemix™Developer Nirvana with IBM Bluemix™
Developer Nirvana with IBM Bluemix™IBM
 
Deploying R for Production - SRUG
Deploying R for Production - SRUGDeploying R for Production - SRUG
Deploying R for Production - SRUGHolger Hellebro
 
CloudLand 2023: Rock, Paper, Scissors Cloud Competition - Go vs. Java
CloudLand 2023: Rock, Paper, Scissors Cloud Competition - Go vs. JavaCloudLand 2023: Rock, Paper, Scissors Cloud Competition - Go vs. Java
CloudLand 2023: Rock, Paper, Scissors Cloud Competition - Go vs. JavaJan Stamer
 
Devry gsp 215 week 7 i lab networking and a tiny web server new
Devry gsp 215 week 7 i lab networking and a tiny web server newDevry gsp 215 week 7 i lab networking and a tiny web server new
Devry gsp 215 week 7 i lab networking and a tiny web server newwilliamethan912
 
GCP-Professional-Cloud-Developer-Exam-v22.2.1_139-taqwlj.pdf
GCP-Professional-Cloud-Developer-Exam-v22.2.1_139-taqwlj.pdfGCP-Professional-Cloud-Developer-Exam-v22.2.1_139-taqwlj.pdf
GCP-Professional-Cloud-Developer-Exam-v22.2.1_139-taqwlj.pdfssuserc36624
 
What's New in Docker - February 2017
What's New in Docker - February 2017What's New in Docker - February 2017
What's New in Docker - February 2017Patrick Chanezon
 
Swift Cloud Workshop - Swift Microservices
Swift Cloud Workshop - Swift MicroservicesSwift Cloud Workshop - Swift Microservices
Swift Cloud Workshop - Swift MicroservicesChris Bailey
 
Hands on Docker - Launch your own LEMP or LAMP stack - SunshinePHP
Hands on Docker - Launch your own LEMP or LAMP stack - SunshinePHPHands on Docker - Launch your own LEMP or LAMP stack - SunshinePHP
Hands on Docker - Launch your own LEMP or LAMP stack - SunshinePHPDana Luther
 
Dependencies Managers in C/C++. Using stdcpp 2014
Dependencies Managers in C/C++. Using stdcpp 2014Dependencies Managers in C/C++. Using stdcpp 2014
Dependencies Managers in C/C++. Using stdcpp 2014biicode
 
GE Predix 新手入门 赵锴 物联网_IoT
GE Predix 新手入门 赵锴 物联网_IoTGE Predix 新手入门 赵锴 物联网_IoT
GE Predix 新手入门 赵锴 物联网_IoTKai Zhao
 

Similar to Bluemix hadoop beginners Guide part I (20)

Scaffolding for Serverless: lightning talk for AWS Arlington Meetup
Scaffolding for Serverless: lightning talk for AWS Arlington MeetupScaffolding for Serverless: lightning talk for AWS Arlington Meetup
Scaffolding for Serverless: lightning talk for AWS Arlington Meetup
 
DCEU 18: Developing with Docker Containers
DCEU 18: Developing with Docker ContainersDCEU 18: Developing with Docker Containers
DCEU 18: Developing with Docker Containers
 
Automate Your Automation | DrupalCon Vienna
Automate Your Automation | DrupalCon ViennaAutomate Your Automation | DrupalCon Vienna
Automate Your Automation | DrupalCon Vienna
 
CloudOps CloudStack Days, Austin April 2015
CloudOps CloudStack Days, Austin April 2015CloudOps CloudStack Days, Austin April 2015
CloudOps CloudStack Days, Austin April 2015
 
How to set up an ASP.NET 5 Continuous Delivery Pipeline using IBM Bluemix Dev...
How to set up an ASP.NET 5 Continuous Delivery Pipeline using IBM Bluemix Dev...How to set up an ASP.NET 5 Continuous Delivery Pipeline using IBM Bluemix Dev...
How to set up an ASP.NET 5 Continuous Delivery Pipeline using IBM Bluemix Dev...
 
Using Docker For Development
Using Docker For DevelopmentUsing Docker For Development
Using Docker For Development
 
One-Man Ops
One-Man OpsOne-Man Ops
One-Man Ops
 
Scaleable PHP Applications in Kubernetes
Scaleable PHP Applications in KubernetesScaleable PHP Applications in Kubernetes
Scaleable PHP Applications in Kubernetes
 
Digital Forensics and Incident Response in The Cloud Part 3
Digital Forensics and Incident Response in The Cloud Part 3Digital Forensics and Incident Response in The Cloud Part 3
Digital Forensics and Incident Response in The Cloud Part 3
 
Developer Nirvana with IBM Bluemix™
Developer Nirvana with IBM Bluemix™Developer Nirvana with IBM Bluemix™
Developer Nirvana with IBM Bluemix™
 
Optimizing Your CI Pipelines
Optimizing Your CI PipelinesOptimizing Your CI Pipelines
Optimizing Your CI Pipelines
 
Deploying R for Production - SRUG
Deploying R for Production - SRUGDeploying R for Production - SRUG
Deploying R for Production - SRUG
 
CloudLand 2023: Rock, Paper, Scissors Cloud Competition - Go vs. Java
CloudLand 2023: Rock, Paper, Scissors Cloud Competition - Go vs. JavaCloudLand 2023: Rock, Paper, Scissors Cloud Competition - Go vs. Java
CloudLand 2023: Rock, Paper, Scissors Cloud Competition - Go vs. Java
 
Devry gsp 215 week 7 i lab networking and a tiny web server new
Devry gsp 215 week 7 i lab networking and a tiny web server newDevry gsp 215 week 7 i lab networking and a tiny web server new
Devry gsp 215 week 7 i lab networking and a tiny web server new
 
GCP-Professional-Cloud-Developer-Exam-v22.2.1_139-taqwlj.pdf
GCP-Professional-Cloud-Developer-Exam-v22.2.1_139-taqwlj.pdfGCP-Professional-Cloud-Developer-Exam-v22.2.1_139-taqwlj.pdf
GCP-Professional-Cloud-Developer-Exam-v22.2.1_139-taqwlj.pdf
 
What's New in Docker - February 2017
What's New in Docker - February 2017What's New in Docker - February 2017
What's New in Docker - February 2017
 
Swift Cloud Workshop - Swift Microservices
Swift Cloud Workshop - Swift MicroservicesSwift Cloud Workshop - Swift Microservices
Swift Cloud Workshop - Swift Microservices
 
Hands on Docker - Launch your own LEMP or LAMP stack - SunshinePHP
Hands on Docker - Launch your own LEMP or LAMP stack - SunshinePHPHands on Docker - Launch your own LEMP or LAMP stack - SunshinePHP
Hands on Docker - Launch your own LEMP or LAMP stack - SunshinePHP
 
Dependencies Managers in C/C++. Using stdcpp 2014
Dependencies Managers in C/C++. Using stdcpp 2014Dependencies Managers in C/C++. Using stdcpp 2014
Dependencies Managers in C/C++. Using stdcpp 2014
 
GE Predix 新手入门 赵锴 物联网_IoT
GE Predix 新手入门 赵锴 物联网_IoTGE Predix 新手入门 赵锴 物联网_IoT
GE Predix 新手入门 赵锴 物联网_IoT
 

More from Joseph Chang

Create docker image with bluemix dev ops
Create docker image with bluemix dev opsCreate docker image with bluemix dev ops
Create docker image with bluemix dev opsJoseph Chang
 
Connect to blumix docker container with putty
Connect to blumix docker container with puttyConnect to blumix docker container with putty
Connect to blumix docker container with puttyJoseph Chang
 
Run windows vm on bluemix
Run windows vm on bluemixRun windows vm on bluemix
Run windows vm on bluemixJoseph Chang
 
Connect to blumix vm with vnc
Connect to blumix vm with vncConnect to blumix vm with vnc
Connect to blumix vm with vncJoseph Chang
 
Connect to blumix vm with putty
Connect to blumix vm with puttyConnect to blumix vm with putty
Connect to blumix vm with puttyJoseph Chang
 
Bluemix iot with intel galileo
Bluemix iot with intel galileoBluemix iot with intel galileo
Bluemix iot with intel galileoJoseph Chang
 
IBM Bluemix introduction
IBM Bluemix introductionIBM Bluemix introduction
IBM Bluemix introductionJoseph Chang
 

More from Joseph Chang (9)

Create docker image with bluemix dev ops
Create docker image with bluemix dev opsCreate docker image with bluemix dev ops
Create docker image with bluemix dev ops
 
Connect to blumix docker container with putty
Connect to blumix docker container with puttyConnect to blumix docker container with putty
Connect to blumix docker container with putty
 
Run windows vm on bluemix
Run windows vm on bluemixRun windows vm on bluemix
Run windows vm on bluemix
 
Connect to blumix vm with vnc
Connect to blumix vm with vncConnect to blumix vm with vnc
Connect to blumix vm with vnc
 
Connect to blumix vm with putty
Connect to blumix vm with puttyConnect to blumix vm with putty
Connect to blumix vm with putty
 
Bluemix iot demo
Bluemix iot demoBluemix iot demo
Bluemix iot demo
 
Bluemix u steam
Bluemix u steamBluemix u steam
Bluemix u steam
 
Bluemix iot with intel galileo
Bluemix iot with intel galileoBluemix iot with intel galileo
Bluemix iot with intel galileo
 
IBM Bluemix introduction
IBM Bluemix introductionIBM Bluemix introduction
IBM Bluemix introduction
 

Recently uploaded

Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 

Recently uploaded (20)

Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 

Bluemix hadoop beginners Guide part I

  • 1. © 2014 IBM Corporation Bluemix Hadoop Beginner’s Guide -- Part I Joseph Chang Senior IT Specialist IBM Cloud Document number • Ambari • HDFS Explore • WebHDFS API • Connect with R Console • Machine Learning (lm, k-means)
  • 2. © 2014 IBM Corporation Reference: https://www.ng.bluemix.net/docs/services/AnalyticsforHadoop/index.html#analyt icsforhadoop_data 2 Take me to Bluemix Click Here
  • 3. © 2014 IBM Corporation Are you the target reader? 3 Have you heard about Bluemix? Do you know Hadoop? Do you know R language? Are you interested in have the three things working together? Yes Yes Yes Yes Continue to the next page. Learn about Bluemix and sign-up. http://www.bluemix.net Learn about Hadoop https://hadoop.apache.org/ Learn about R. https://www.r-project.org/ No No No No Bye
  • 4. © 2014 IBM Corporation The following 2 Bluemix services are used in this tutorial : 4 Assume you already have Bluemix id. If you don’t , go to http://ww.bluemix.net to get one.
  • 5. © 2014 IBM Corporation Create Hadoop Service in Bluemix 5 Please create a java runtime and add a hadoop service by yourself.
  • 6. © 2014 IBM Corporation Create Hadoop Service in Bluemix 6 You can get the AmbariUrl, WebhdfsUrl, id, password … etc. from “Show Credentials”.
  • 7. © 2014 IBM Corporation Ambari Hadoop Management 7
  • 8. © 2014 IBM Corporation Monitoring Hadoop with Ambari 8 eg. https://bi-hadoop-prod-2016.services.dal.bluemix.net:8081 https://bi-hadoop-prod-<Cluster ID>.services.dal.bluemix.net:8081 Launch the Ambari Dashboard with this URL.
  • 9. © 2014 IBM Corporation Ambari – View the detail information of each services 9 Note: Spark service is available in this environment.
  • 10. © 2014 IBM Corporation Ambari – Hosts 10 The server nodes in this Hadoop Cluster.
  • 11. © 2014 IBM Corporation Ambari – Cluster Stack Version 11 The Big R Service will be used in this tutoral.
  • 12. © 2014 IBM Corporation HDFS Explore 12
  • 13. © 2014 IBM Corporation HDFS Explore 13 eg. https://bi-hadoop-prod-2016.services.dal.bluemix.net:8443/gateway/default/hdfs/explorer.html https://bi-hadoop-prod-<Cluster ID>.services.dal.bluemix.net:8443/gateway/default/hdfs/explorer.ht ml Launch the HDFS Explore with this URL. View the files on the Hadoop File System. It’s ready only.
  • 14. © 2014 IBM Corporation HDFS – Healthy 14
  • 15. © 2014 IBM Corporation WEBHDFS REST API 15
  • 16. © 2014 IBM Corporation Upload Data with curl + webhdfs rest api 16 curl -i -L -k -s --user biblumix:<your_biblumix_password> --max-time 45 -X PUT https://bi-hadoop-prod-<your_cluster_number>.services.dal.bluemix.net:8443/ gateway/default/webhdfs/v1/user/biblumix/<path_to_file/file_name>?op=CREATE curl -i -L -k -s --user biblumix:<your_biblumix_password> --max-time 45 -X PUT -T <file_name.txt> <Location URL from step 1 response message> If you can’t run “curl” in your command line, google it and download it. Use WEBHDFS API to upload file The current CREATE api have a defect cause the uploaded file size=0. The 2 steps approach is a workaround.
  • 17. © 2014 IBM Corporation Upload Data with curl + webhdfs rest api (Screen capture) 17 Step 1 Create temp redirect Step 2 Upload file from local disk The location in step 1 response message will be used in step 2. You should get response code 307 in step 1 You should get response code 201 in step 2
  • 18. © 2014 IBM Corporation Upload Data with curl + webhdfs rest api (Result) 18 The file has been uploaded. Note the size should not be 0.
  • 19. © 2014 IBM Corporation More webhdfs rest api 19 curl -i -k -s --user biblumix:your_biblumix_password --max-time 45 https://bi-hadoop-prod-your_cluster_number.services.dal.bluemix.net:8443/ gateway/default/webhdfs/v1/user?op=LISTSTATUS curl -i -s --user biblumix:passwordhttps://hostname:8443/gateway/default/oozie/v1/jobs?jobtype=wf curl -i -s --user biblumix:password -X POST -H "Content-Type: application/xml" -d @oozie-mrjob- config.xml https://hostname:8443/gateway/default/oozie/v1/jobs?action=start curl -i -s --user biblumix:your_biblumix_password --max-time 45 -X DELETE https://bi-hadoop-prod-your_cluster_number.services.dal.bluemix.net:8443/ gateway/default/webhdfs/v1/user/biblumix/path_to_file?op=DELETE curl -i -k -s --user biblumix:your_biblumix_password --max-time 45 -X PUT https://bi-hadoop-prod-your_cluster_number.services.dal.bluemix.net:8443/ gateway/default/webhdfs/v1/user/biblumix/path_to_directory?op=MKDIRS
  • 20. © 2014 IBM Corporation Install R Console & Big R 20
  • 21. © 2014 IBM Corporation Download Drivers for Big R 21 https://hub.jazz.net/project/kulkarni/a4h/overview#https://hub.jazz. net/git/kulkarni%252Fa4h/list/master/client-libs Extract the file to  /temp  The big R library can be download from this url.
  • 22. © 2014 IBM Corporation Install R Console 22 Download R Language https://cran.r-project.org/ Launch R console: Launch R in Terminal: If you don’t have R console in your PC/NB . Download it from this URL. You can use either R Console or terminal. Type R in command line to launch R.
  • 23. © 2014 IBM Corporation Install Big R  > install.packages('rJava')  --- Please select a CRAN mirror for use in this session ---  HTTPS CRAN mirror  1: 0-Cloud [https] 2: Austria [https]  3: Chile [https] 4: China (Beijing 4) [https]  5: China (Hefei) [https] 6: Colombia (Cali) [https]  7: France (Lyon 2) [https] 8: Germany (Münster) [https]  9: Iceland [https] 10: Russia (Moscow) [https]  11: Spain (A Coruña) [https] 12: Switzerland [https]  13: UK (Bristol) [https] 14: UK (Cambridge) [https]  15: USA (CA 1) [https] 16: USA (KS) [https]  17: USA (MI 1) [https] 18: USA (TN) [https]  19: USA (TX) [https] 20: USA (WA) [https]  21: (HTTP mirrors)  Selection: 1  trying URL 'https://cran.rstudio.com/bin/macosx/mavericks/contrib/3.2/rJava_0.9-7.tgz'  Content type 'application/x-gzip' length 604271 bytes (590 KB)  ==================================================  downloaded 590 KB  The downloaded binary packages are in  /var/folders/g0/jgl74nkx0h97dgpywqv2prrc0000gn/T//Rtmp3ggvb8/downloaded_packages 23 Warning message: In doTryCatch(return(expr), name, parentenv, handler) : unable to load shared object '/Library/Frameworks/R.framework/Resources/modules//R_X11.so': dlopen(/Library/Frameworks/R.framework/Resources/modules//R_X11.so, 6): Library not loaded: /opt/X11/lib/libSM.6.dylib Referenced from: /Library/Frameworks/R.framework/Resources/modules//R_X11.so Reason: image not found > Before install big R package. We need install rJava.
  • 24. © 2014 IBM Corporation Install Big R 24 > install.packages('base64enc') trying URL 'https://cran.rstudio.com/bin/macosx/mavericks/contrib/3.2/base64enc_0.1-3.tgz' Content type 'application/x-gzip' length 26679 bytes (26 KB) ================================================== downloaded 26 KB The downloaded binary packages are in /var/folders/g0/jgl74nkx0h97dgpywqv2prrc0000gn/T//Rtmp3ggvb8/downloaded_packages > install.packages('data.table') also installing the dependencies ‘stringi’, ‘magrittr’, ‘plyr’, ‘stringr’, ‘Rcpp’, ‘chron’, ‘reshape2’ trying URL 'https://cran.rstudio.com/bin/macosx/mavericks/contrib/3.2/stringi_0.5-5.tgz' Content type 'application/x-gzip' length 12685069 bytes (12.1 MB) ================================================== downloaded 12.1 MB …. trying URL 'https://cran.rstudio.com/bin/macosx/mavericks/contrib/3.2/data.table_1.9.4.tgz' Content type 'application/x-gzip' length 1266610 bytes (1.2 MB) ================================================== downloaded 1.2 MB The downloaded binary packages are in /var/folders/g0/jgl74nkx0h97dgpywqv2prrc0000gn/T//Rtmp3ggvb8/downloaded_packages > Before install big R package. We need install base64enc and data.table
  • 25. © 2014 IBM Corporation Install Big R 25 > install.packages(pkg="/temp/bigr_3.18.tar.gz", type="source", repos=NULL) * installing *source* package ‘bigr’ ... ** R ** inst ** preparing package for lazy loading Attaching...Creating a generic function for ‘toString’ from package ‘base’ in package ‘bigr’ Creating a generic function for ‘nchar’ from package ‘base’ in package ‘bigr’ Creating a generic function for ‘coef’ from package ‘stats’ in package ‘bigr’ ** help *** installing help indices ** building package indices ** testing if installed package can be loaded * DONE (bigr) > Now you can install Big R library. Make sure the library path is correct.
  • 26. © 2014 IBM Corporation Install Big R (2 issues in Bluemix doc) 26 If you copy the command in Bluemix doc , you may got this error. (Aug.2015) It should be “packages” The Bluemix instruction dosen’t mention the 3 libraries need to be installed first.
  • 27. © 2014 IBM Corporation Machine Learning -- Liner Regression -- K-means 27 http://www- 01.ibm.com/support/knowledgecenter/SSPT3X_4.0.0/com.ibm.swg.im.infosphere.biginsights.bigr.doc /doc/intro.html?cp=SSPT3X_4.0.0%2F9-1 Reference: Recommendation: Learn more about big R from the URL.
  • 28. © 2014 IBM Corporation Machine Learning – Big R Example #1-1 28 ############################ # 1.1 Connect to Bluemix Hadoop ############################# # In order to try out any example, first run the following steps to upload # the aforementioned dataset to a BigInsights cluster. library(bigr) bigr.connect(host="bi-hadoop-prod-2016.services.dal.bluemix.net",user="biblumix", password="w9@4f0~HnXLD",ssl=TRUE, trustStorePath="/Library/Java/Home/lib/security/cacerts", trustStorePassword="changeit",keyManager="SunX509") is.bigr.connected() Replace it with your own cluster id Replace it with your own password Replace it with the Java Home path in your Environment.
  • 29. © 2014 IBM Corporation Machine Learning – Big R Example #1-2 29 ################# # 1.2 Data loading ################# airfile <- system.file("extdata", "airline.zip", package="bigr”) airfile <- unzip(airfile, exdir = tempdir()) airR <- read.csv(airfile, stringsAsFactors=F) # Upload the data to the BigInsights server. This may take 15-20 seconds air <- as.bigr.frame(airR) air <- bigr.persist(air, dataSource="DEL", dataPath="/user/bigr/examples/airline_demo.csv”, header=T, delimiter=",", useMapReduce=F) The file uses “,” as DELimiter
  • 30. © 2014 IBM Corporation Big R Example #1 (Screen capture) 30 You can check if the file is successfully upload by HDFS explore. You should get “TRUE” if successfully connect to bluemix hadoop.
  • 31. © 2014 IBM Corporation About the airline.csv sample data 31 The airline.zip sample can be found in your R installation directory.
  • 32. © 2014 IBM Corporation Machine Learning – Big R Example #2 32 ########################### # 2. Accessing data on HDFS ########################### # Once uploaded, one merely needs to instantiate a big.frame object, # commonly referenced as "air" in the examples, to access the dataset via # the Big R API. air <- bigr.frame(dataPath = "/user/bigr/examples/airline_demo.csv", dataSource = "DEL", delimiter=",", header = T, coltypes = ifelse(1:29 %in% c(9,11,17,18,23), "character", "integer"), useMapReduce = F) There are 29 columns in the airline_dmeo.csv file. Column 9,11,17,18,23 are character. Remaining columns are integer.
  • 33. © 2014 IBM Corporation Big R Example #2 (Screen capture) 33
  • 34. © 2014 IBM Corporation Machine Learning – Big R Example #3-1 34 ################################################################# # 3. Machine Learning example: building a Linear Regression model ################################################################# # Remove files from previous executions (if any) invisible(bigr.rmfs("/user/bigr/examples/airline.sample.* /user/bigr/examples/lm.airline*")) # Project some relevant columns for modeling / statistical analysis airlineFiltered <- air[, c("Month", "DayofMonth", "DayOfWeek", "CRSDepTime", "Distance", "ArrDelay")] # Create a bigr.matrix from the data airlineMatrix <- bigr.transform(airlineFiltered, outData="/user/bigr/examples/airline.sample.matrix", transformPath="/user/bigr/examples/airline.sample.transform") The 6 variables are choose for this model.
  • 35. © 2014 IBM Corporation Machine Learning – Big R Example #3-2 35 ################################################################# # 3. Machine Learning example: building a Linear Regression model ################################################################# # Split the data into 70% for training and 30% for testing samples <- bigr.sample(airlineMatrix, perc=c(0.7, 0.3)) train <- samples[[1]] test <- samples[[2]] # Create a linear regression model lm <- bigr.lm(ArrDelay ~ ., data=train, directory="/user/bigr/examples/lm.airline") # Get the coefficients of the regression coef(lm) We will use "Month", "DayofMonth", "DayOfWeek", "CRSDepTime”, "Distance” to predict ArrDelay
  • 36. © 2014 IBM Corporation Big R Example #3 (Screen Capture) 36
  • 37. © 2014 IBM Corporation Big R Example #3-2 (Result) 37 Y : ArrDelay X1: Month X2: DayofMonth X3: DayOfweek X4: CRSDepTIme X5: Distance Y = -0.174423*X1 -0.01547941*X2-0.03378236*X3 +0.006222544*X4+0.0003556919*X5 The Arrival Delay prediction model is :
  • 38. © 2014 IBM Corporation Machine Learning – Big R Example #3-3 38 ################################################################# # 3. Machine Learning example: building a Linear Regression model ################################################################# # Calculate predictions for the testing set pred <- predict(lm, test, "/user/bigr/examples/lm.airline.preds")
  • 39. © 2014 IBM Corporation Big R Example #3 (Screen Capture) 39 Predicted arrival delay time for test data.
  • 40. © 2014 IBM Corporation Big R Example #3 (output) 40 View the preds files generate on the hdfs.
  • 41. © 2014 IBM Corporation Machine Learning – Big R Example #4 41 ################################################################## # 4. Machine Learning example: building a k-means clustering model ################################################################## # Remove files from previous executions (if any) invisible(bigr.rmfs("/user/bigr/examples/iris.* /user/bigr/examples/km*")) # Load the Iris dataset to HDFS irisbf <- as.bigr.frame(iris[, -5]) # Convert the Iris dataset into a bigr.matrix object irisBM <- bigr.transform(bf = irisbf, outData = "/user/bigr/examples/iris.mtx", transformPath = "/user/bigr/examples/iris.transform") # Create a k-means model with 10 clusters km <- bigr.kmeans(irisBM, centers=10, directory="/user/bigr/examples/km", writeY=T) # Use the existing model to cluster a different dataset p <- predict(km, irisBM, "/user/bigr/examples/km.preds") Iris is the built-in sample data set in R Language
  • 42. © 2014 IBM Corporation About the sample data -- IRIS 42
  • 43. © 2014 IBM Corporation Big R Example #4 (Screen Capture) 43 The 10 clusters of IRIS by k- means.
  • 44. © 2014 IBM Corporation Big R Example #4 (Screen capture) 44 Identify each sample data with the model.
  • 45. © 2014 IBM Corporation Thank you 45 Take me to Bluemix Click Here
  • 46. © 2014 IBM Corporation Appendix 1: Hadoop Cloud Demo with IBM Bluemix 46 I found this great video in Youtube. You can learn more about Bluemix Hadoop in this video.
  • 47. © 2014 IBM Corporation Big Data Hadoop Cloud Demo – IBM Bluemix https://www.youtube.com/watch?v=FUDOsBDAahE 47
  • 48. © 2014 IBM Corporation Appendix 2: Define Hadoop Cluster by yourself 48 If you want your application run faster. You may choose this charged service which running on bare metal servers with multiple nodes.
  • 49. © 2014 IBM Corporation BigInsights for Hadoop Cluster Topology 49
  • 50. © 2014 IBM Corporation BigInsights for Hadoop Cluster Topology 50