Learn about the HBase browser in Hue, the UI for Apache Hadoop.
Presented by Abraham Elmahrek at Hadoop Israel www.meetup.com/HadoopIsrael/events/161701092/
Find out everything you need about Hue at http://gethue.com
2. WHAT IS HUE?
WEB INTERFACE FOR MAKING
HADOOP EASIER TO USE
Suite of apps for each Hadoop component,
like Hive, Pig, Impala, Oozie, Solr, Sqoop2,
HBase...
9. TALKS AROUND THE WORLD
Meetups and events in NYC,
Paris, LA, Tokyo, SF,
Stockholm, Vienna, San Jose,
Singapore…
Coming up in London, West Coast
RETREATS
Nov 13 Koh Chang, Thailand
May 14 Curaçao, Netherlands
Antilles
10. FAST PACE
LAST 30 DAYS
41 issues created and 38
resolved.
Core team + Community
11. NEW APPS IN 6 MONTHS
PIG
JOB BROWSER
JOB DESIGNER
OOZIE
HIVE
IMPALA
METASTORE BROWSER
SEARCH
HBASE BROWSER
SQOOP
ZOOKEEPER
USER ADMIN
DB QUERY
SPARK
HOME
...
FILE BROWSER
13. HISTORY
HUE 1
Desktop-like in a browser,
did its job but pretty slow,
memory leaks and not very
IE friendly but definitely
advanced for its time
(2009-2010).
21. FLEXIBLE SEARCH BAR
PROBLEM
Need high-level tool for
complex data fetching &
analysis
SOLUTION
Simple custom query
language
Supports HBase filter
language
Supports selection & Copy +
Paste, gracefully degrades in
IE
Autocomplete Help Menu
Searchbar Syntax Breakdown:
Row Key
Scan Length
Prefix Scan
Column/Family Filters
Thrift Filterstring
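The five parts named in the syntax breakdown (row key, scan length, prefix scan, column/family filters, Thrift filter string) can be illustrated with a small parser. This is only a sketch: the concrete grammar used here (`*` for a prefix scan, `+N` for scan length, `[...]` for columns, `{...}` for the filter string) and the names `RowQuery` and `parse_query` are assumptions made for illustration, not Hue's actual implementation.

```python
import re
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class RowQuery:
    """Parsed form of one search-bar query (hypothetical structure)."""
    row_key: str
    prefix: bool = False
    scan_length: int = 1
    columns: List[str] = field(default_factory=list)
    filterstring: Optional[str] = None


# Assumed grammar: key[*] [+N] [[col, fam:, ...]] [{FilterString}]
_QUERY = re.compile(
    r"""^\s*
        (?P<key>[^\s*\[{+]+)
        (?P<prefix>\*)?
        (?:\s*\+(?P<length>\d+))?
        (?:\s*\[(?P<cols>[^\]]*)\])?
        (?:\s*\{(?P<filter>.*)\})?
        \s*$""",
    re.VERBOSE,
)


def parse_query(text: str) -> RowQuery:
    match = _QUERY.match(text)
    if match is None:
        raise ValueError("unrecognised query: %r" % text)
    # Split the bracketed column list on commas, dropping empty entries.
    cols = [c.strip() for c in (match.group("cols") or "").split(",") if c.strip()]
    return RowQuery(
        row_key=match.group("key"),
        prefix=match.group("prefix") is not None,
        scan_length=int(match.group("length") or 1),
        columns=cols,
        filterstring=match.group("filter"),
    )
```

For example, `parse_query("user_1* +5 [info:name, stats:]")` would describe a prefix scan of five rows restricted to one column and one column family. A real grammar would also need to handle multiple comma-separated row keys and escaping, which this sketch leaves out.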
22. DIVING DEEPER
DOWN LOW
Thrift with custom
connection pooling with
socket multiplexing to
improve thrift calls. Router
to streamline front end
requests.
UP TOP
KnockoutJS, JQuery, DOM
caching, etc. for a dynamic
front end.
# Low-level Thrift client (arguments elided on the slide)
client = thrift_util.get_client(TCLIService.Client,
                                query_server['server_host'],
                                query_server['server_port'],
                                …)
client.mutateRow(…)
client.scannerOpenWithScan(…)
client.scannerGetList(…)

# Higher-level API built on top of the client
api = HbaseApi()
api.putColumn(…)
api.deleteColumn(…)
api.bulkUpload(…)
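The connection pooling mentioned on this slide can be sketched generically. The `ClientPool` class and its `factory` argument are hypothetical names introduced here; the factory stands in for whatever opens a Thrift connection. This is not Hue's pooling code and it does not cover socket multiplexing.

```python
import queue
from contextlib import contextmanager


class ClientPool:
    """Fixed-size pool that hands out pre-built clients and takes them back."""

    def __init__(self, factory, size=4):
        # Build all clients up front so later requests never pay connection cost.
        self._idle = queue.Queue(maxsize=size)
        for _ in range(size):
            self._idle.put(factory())

    @contextmanager
    def client(self, timeout=None):
        # Blocks until a client is free; raises queue.Empty if timeout expires.
        conn = self._idle.get(timeout=timeout)
        try:
            yield conn
        finally:
            # Always return the client, even if the caller raised.
            self._idle.put(conn)
```

A caller would wrap each request in `with pool.client() as c:` so that the same small set of connections is reused across front-end requests. A production pool would also need to detect and replace broken connections.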
23. SCALE
PROBLEM
Performance on Millions of
Columns
SOLUTION
Lazy loading & Truncation
using Thrift FilterString
Stream raw data and cache it
in memory; DOM elements are
generated dynamically
b64encode binary data to
preserve during ASCII dump
Detect schema on preview
reading byte headers
Capped, Lazy Loaded &
Bound to DOM via MVVM
Pattern
IT DOES.
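Two of the ideas on this slide, base64-encoding binary values so they survive an ASCII dump and loading columns lazily through a Thrift filter string, can be sketched as follows. The function names are hypothetical. `ColumnPaginationFilter(limit, offset)` is part of the HBase filter language, but whether Hue uses this particular filter for its lazy loading is an assumption.

```python
import base64
import json


def encode_cell(value: bytes, max_bytes: int = 64) -> dict:
    """Truncate a raw cell value and make it safe for an ASCII/JSON dump."""
    return {
        "b64": base64.b64encode(value[:max_bytes]).decode("ascii"),
        "truncated": len(value) > max_bytes,
    }


def decode_cell(cell: dict) -> bytes:
    """Recover the (possibly truncated) bytes from an encoded cell."""
    return base64.b64decode(cell["b64"])


def column_page_filter(limit: int, offset: int) -> str:
    """Build a filter string that asks for one page of columns."""
    return f"ColumnPaginationFilter({limit}, {offset})"


def lazy_pages(total_columns: int, page_size: int):
    """Yield one filter string per page; nothing is built until requested."""
    for offset in range(0, total_columns, page_size):
        yield column_page_filter(page_size, offset)


if __name__ == "__main__":
    cell = encode_cell(bytes([0, 255, 128, 10]))
    print(json.dumps(cell))           # binary value is now plain ASCII
    print(next(lazy_pages(1_000_000, 100)))
```

Because `lazy_pages` is a generator, a row with millions of columns only costs one filter string per page the user actually scrolls to.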
30. ROADMAP
DECEMBER 2013
Hue 3.5 OSS release
http://gethue.com/hue-3-5-and-its-redesign-are-out/
Q1 2014
CDH 5 with Hue 3.5+
http://gethue.com/hadoop-tutorial-new-impala-and-hive-editors/
Q2 2014
CDH 5.1 with Hue 3.6: CHARTS, Yarn, Spark, SEARCH create index, Load data, Dashboard, UX ... your idea?
AFTER
Inter-app integration (e.g. schedule daily Hive query in one click), versioning,
export/import. Oozie revamp, dashboard, reporting… Come talk to us about your
use cases and what you would like to see next in Hue!
34. GET HUE
CLOUDERA’S CDH: Stable and highly tested releases, perfectly integrated with the Hadoop ecosystem and automagically configured by Cloudera Manager.
TARBALL: Try the latest and greatest in advance, but you’ll have to configure everything on your own.
CLOUDERA’S DEMO VM: Get to play with Hue and various Hadoop components in 5 minutes. It’s a self-contained CDH environment, ready to use.
HORTONWORKS*: In HDP there’s an old forked version of Hue 2.3.
MAPR*: Newer version than HDP, close to the original 2.5 minus apps like HBase, Impala, Sqoop, Search.
HP CLOUD*: The newest addition, ships Hue 3.0 through the GreenButton products.
BIGTOP
EMBEDDED/DEMO IN IND. COMPANIES
* YOUR MILEAGE MAY VARY.
35. WHAT ARE YOUR USE CASES?
WHICH COMPONENTS DO YOU USE?
WHAT WOULD YOU LIKE TO SEE IN HUE?
INTERESTED IN CONTRIBUTING?
WANNA SAY HELLO?
DO YOU WANT A TAILOR-MADE TEAM RETREAT?
TEAM@GETHUE.COM