論文紹介:Semantic segmentation using Vision Transformers: A survey
Log Analysis System And its designs in LINE Corp. 2014 early
1. Log Analysis Systems
And its designs
In LINE Corp. 2014 Early
2014/02/20 (Thu)
@tagomoris (TAGOMORI Satoshi)
LINE Corp.
LINE Developer Meetup in Fukuoka #1
14年2月20日木曜日
6. See also:
「OSSで支えられるライブドアの巨大ログ集計」 (2012 Summer)
http://www.slideshare.net/tagomoris/oss-nhntech
「Log analysis system with Hadoop in livedoor 2013 Winter」(2013 early)
http://www.slideshare.net/tagomoris/log-analysis-with-hadoop-in-livedoor-2013
「Batch and Stream processing with SQL」 (2013 Fall)
http://www.slideshare.net/tagomoris/batch-and-stream-processing-with-sql
14年2月20日木曜日
12. Who uses it?
Internet Messaging Service
Public Web Service
Game
Private Web Service (for closed person-to-persons)
Internal Web Service (administrator only)
Data Analytics Service
14年2月20日木曜日
13. Who uses it?
Internet Messaging Service
Public Web Service
Game
Private Web Service (for closed person-to-persons)
Internal Web Service (administrator only)
Data Analytics Service
14年2月20日木曜日
14. Data analytics players
PROGRAMMER
Raw Log Formats
Application Logs
Data Sizes
Data Semantics
SERVICE DIRECTOR
SALES
Whatever Metrics They Want
Storages
Hadoop Cluster
Visualization Tools
ADMINISTRATOR
........
BOARD MEMBER
14年2月20日木曜日
15. Data analytics players
PROGRAMMER
Raw Log Formats
Application Logs
Data Sizes
Data Semantics
SERVICE DIRECTOR
SALES
WE NEED THE QUERY LANGUAGE
Whatever Metrics They Want
WHAT THEY ALL CAN
RUN AND UNDERSTAND!!!!!!!!!!
Storages
Hadoop Cluster
Visualization Tools
ADMINISTRATOR
........
BOARD MEMBER
14年2月20日木曜日
25. Batches and Streams
Hadoop is for batches
High performance batch is important
HDFS has good performance
Stream log writing and calcurations
are also VERY VERY IMPORTANT
Hybrid System:
Stream processing + Batch
14年2月20日木曜日
31. Fluentd
Log collector
Apache-like configuration
Pluggable Input/Output/Buffer on public plugin
repository (rubygems.org)
Ruby 1.9 or later
Collect, and Store
collect: fluent-agent-lite (perl)
store: fluent-plugin-webhdfs
14年2月20日木曜日
40. Norikra Queries: (2)
{“name”:”tagomoris”,
“age”:34, “address”:”Tokyo”,
“corp”:”LINE”, “current”:”Fukuoka”}
SELECT age, COUNT(*) as cnt
FROM events.win:time_batch(5 mins)
WHERE current=”Fukuoka” GROUP BY age
every 5 mins
{”age”:34,”cnt”:3}, {“age”:33,”cnt”:1}, ...
14年2月20日木曜日
43. Presto
Open sourced by Facebook at 2013/11/07
MPP Engine: Massive Parallel Processing Engine
like Google BigQuery(Dremel), Cloudera Impala
short latency queries (It’s not main usage of Hive)
SQL
HTTP JSON API
Java 7 !
14年2月20日木曜日
44. Shib v0.3.0: presto support
HiveServer
User
(browser)
THRIFT
HiveServer2
Shib
Analysis
Batches
HTTP JSON API
THRIFT
HTTP JSON API
Presto
Service
Admin Tools
14年2月20日木曜日
45. Non-monolithic architecture
Many subsystems for many purposes
Add/Update/Replace per subsystems
High interoperability by RPC-based connections
Gateway can hide backend implementations
14年2月20日木曜日
46. WHAT TO DO
IS
NOT WHAT WE WANT TO
BUT
WHAT WE ARE WANTED TO.
14年2月20日木曜日