Spark Security
Yifeng Jiang
Introduction to Spark Security
1.
Apache Spark: Enterprise Security for Production Deployments
蒋 逸峰 (しょう いつほう / Yifeng Jiang)
Solutions Engineer, Hortonworks
@uprush
December 21, 2016
2.
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
What are the security requirements?
• Spark users should be authenticated
• Integrate with corporate LDAP/AD
• Allow access only to authorized users
• Audit all access
• Protect data both in motion and at rest
• Make security easy to manage
• …
3.
Interacting with Spark
[Diagram: Zeppelin, Spark Shell, Spark Thrift Server, and a REST server, each with its own driver, in front of executors running in Spark on YARN]
4.
Context: Spark Deployment Modes
• Spark on YARN
  – Spark driver (SparkContext) in the YARN AM (yarn-cluster)
  – Spark driver (SparkContext) in the local client (yarn-client)
    • Spark Shell and Spark Thrift Server run in yarn-client only
[Diagram: YARN-Client (Spark Driver in the Client; App Master and Executor in the cluster) versus YARN-Cluster (Spark Driver inside the App Master; Executor in the cluster)]
5.
Spark on YARN
[Diagram: John Doe runs spark-submit against the Hadoop cluster; steps numbered 1 to 4 connect the Spark AM, YARN RM, Node Manager, Executor, and HDFS]
6.
DEMO: A DATA LAKE WITHOUT SECURITY
7.
Spark Security: Four Pillars
• Authentication
• Authorization
• Audit
• Encryption
Spark leverages Kerberos on YARN
8.
Authenticate users with Kerberos/AD
[Diagram: John Doe, the KDC, and a Hadoop cluster containing the Spark AM, the NameNode (NN), and executors; seven numbered steps, with these labels:]
• kinit
• Get service ticket (ST) for Spark
• Use Spark ST, submit Spark job
• Spark gets NameNode (NN) service ticket
• YARN launches Spark executors using John Doe's identity
• Executor reads from HDFS using John Doe's delegation token
9.
Spark Kerberos Example
kinit -kt /etc/security/keytabs/johndoe.keytab johndoe@EXAMPLE.COM
spark-submit --class org.apache.spark.examples.SparkPi \
    --master yarn-cluster \
    --num-executors 3 --driver-memory 512m \
    --executor-memory 512m --executor-cores 1 \
    /usr/hdp/current/spark-client/lib/spark-examples*.jar 10
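The two commands on this slide can also be assembled programmatically, for example by a job launcher. The following is a minimal sketch, not part of the original deck: the helper names `build_kinit_command` and `build_spark_submit_command` are hypothetical, and the script only builds and prints the argument lists rather than running them. Note that `--master yarn-cluster` is the Spark 1.x spelling used on the slide; Spark 2.0 and later deprecate it in favor of `--master yarn --deploy-mode cluster`.

```python
def build_kinit_command(keytab, principal):
    """Argument list for a keytab-based kinit, which obtains the Kerberos TGT."""
    return ["kinit", "-kt", keytab, principal]


def build_spark_submit_command(app_jar, main_class, app_args=(),
                               master="yarn-cluster", num_executors=3,
                               driver_memory="512m", executor_memory="512m",
                               executor_cores=1):
    """Argument list mirroring the spark-submit invocation on the slide."""
    return [
        "spark-submit",
        "--class", main_class,
        "--master", master,
        "--num-executors", str(num_executors),
        "--driver-memory", driver_memory,
        "--executor-memory", executor_memory,
        "--executor-cores", str(executor_cores),
        app_jar,
        *[str(arg) for arg in app_args],
    ]


# Values taken from the slide; the jar glob is expanded by the shell at run time.
kinit_cmd = build_kinit_command(
    "/etc/security/keytabs/johndoe.keytab", "johndoe@EXAMPLE.COM")
submit_cmd = build_spark_submit_command(
    "/usr/hdp/current/spark-client/lib/spark-examples*.jar",
    "org.apache.spark.examples.SparkPi",
    app_args=[10],
)

# Print the commands instead of executing them.
print(" ".join(kinit_cmd))
print(" ".join(submit_cmd))
```

The ticket must be obtained before the job is submitted, so a launcher built on this sketch would run the kinit command first and the spark-submit command second.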
10.
Allow only authorized users access to Spark jobs
[Diagram: John Doe, the KDC, Ranger, and a YARN cluster (nodes A, B, C) on top of HDFS, with these labels:]
• Client gets service ticket for Spark
• Use Spark ST, submit Spark job
• Get NameNode (NN) service ticket
• Executors read from HDFS
• Ranger: Can John launch this job? Can John read this file?
11.
SparkSQL: Fine-grained security
12.
SparkSQL Security: Current Status
• SparkSQL: only coarse-grained access control today
[Diagram: a JDBC client connects to the Spark Thrift Server (driver), which runs the DAG in YARN containers as the hive user; the containers read from HDFS under /apps/hive/warehouse/…, with table metadata from the Hive Metastore]
13.
SparkSQL Security
• Spark Thrift Server (STS) and Spark executors run as the hive user and can read all data
  – No authorization support in STS
  – No Ranger integration support
  – Anyone who can authenticate to STS can read ALL data
• No identity propagation on the second hop (STS to executors): STS has no equivalent of doAs in HiveServer2 (HS2)
14.
How Hive Security Works
[Diagram: ODBC/JDBC clients, LDAP, HiveServer2, Ranger, the KDC, and YARN & HDFS (nodes A, B, C)]
1. Original request with user ID/password
2. HS2 authenticates the user/password against LDAP
3. Ranger authorization (AuthZ)
4. Hive gets NameNode (NN) service ticket
5. Hive creates the MR/Tez job using the NN ST as proxy user
Other labels: Use Hive ST, submit query; Client gets query result; Ranger syncs users/groups from LDAP
15.
DEMO: HIVE & SPARKSQL AUTHORIZATION
16.
Key Features: Spark Column Security with LLAP
• Fine-grained column-level access control for SparkSQL.
• Fully dynamic policies per user. Doesn't require views.
• Use standard Ranger policies and tools to control access and masking policies.
Flow:
1. SparkSQL gets data locations, known as "splits", from HiveServer2 and plans the query.
2. HiveServer2 authorizes access using Ranger. Per-user policies like row filtering are applied.
3. Spark gets a modified query plan based on the dynamic security policy.
4. Spark reads data from LLAP. Filtering and masking are guaranteed by the LLAP server.
[Diagram: Spark Client; HiveServer2 (authorization); Hive Metastore (data locations, view definitions); LLAP (data read, filter pushdown); Ranger Server (dynamic policies); arrows numbered 1 to 4]
17.
Example: Per-User Row Filtering by Region in SparkSQL

Original query (issued by both users):
SELECT * from CUSTOMERS WHERE total_spend > 10000

Query rewrites based on dynamic Ranger policies:
• Spark User 1 (West Region): SELECT * from CUSTOMERS WHERE total_spend > 10000 AND region = "west"
• Spark User 2 (East Region): SELECT * from CUSTOMERS WHERE total_spend > 10000 AND region = "east"

LLAP data access:
User ID  Region  Total Spend
1        East    5,131
2        East    27,828
3        West    55,493
4        West    7,193
5        East    18,193

Fine-grained security for SparkSQL:
http://bit.ly/2bLghGz
http://bit.ly/2bTX7Pm
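The effect of the per-user rewrite can be illustrated with a small, self-contained sketch over the five sample rows from this slide. This is only a simulation in plain Python, not how Ranger or LLAP implement it: the policy table `ROW_FILTER_POLICY`, the user names, and the helpers `rewrite_query` and `run_query` are assumptions made for illustration.

```python
# Sample rows from the slide.
CUSTOMERS = [
    {"user_id": 1, "region": "East", "total_spend": 5131},
    {"user_id": 2, "region": "East", "total_spend": 27828},
    {"user_id": 3, "region": "West", "total_spend": 55493},
    {"user_id": 4, "region": "West", "total_spend": 7193},
    {"user_id": 5, "region": "East", "total_spend": 18193},
]

# Hypothetical policy table: each user may only see rows from one region.
ROW_FILTER_POLICY = {
    "spark_user_1": "West",
    "spark_user_2": "East",
}

ORIGINAL_QUERY = "SELECT * from CUSTOMERS WHERE total_spend > 10000"


def rewrite_query(query, user):
    """Append the user's region predicate, as in the rewrites on the slide."""
    region = ROW_FILTER_POLICY[user]
    return f'{query} AND region = "{region.lower()}"'


def run_query(user, min_spend):
    """Evaluate the rewritten query against the in-memory sample rows."""
    region = ROW_FILTER_POLICY[user]
    return [
        row for row in CUSTOMERS
        if row["total_spend"] > min_spend and row["region"] == region
    ]


for user in ("spark_user_1", "spark_user_2"):
    print(user, "->", rewrite_query(ORIGINAL_QUERY, user))
    print("  visible user IDs:", [r["user_id"] for r in run_query(user, 10000)])
```

Both users submit the same query text, yet the West user sees only customer 3 and the East user sees customers 2 and 5, because the region predicate is added on their behalf rather than by them.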
18.
Dynamic Masking and Row Level Filtering

Source data:
Country  National ID  CC No             Name         DOB        MRN         Policy ID
US       232323233    4539067047629850  John Doe     9/12/1969  8233054331  nj23j424
US       333287465    5391304868205600  Jane Doe     9/13/1969  3736885376  cadsd984
Japan    T30007873    4532488639863821  Ben Jackson  73/1975    876392473A  KK-287365

After Ranger policy enforcement:

Users from US customer support groups see row-filtered data for US persons, with CC and SSN masked and MRN nullified:
Country  National ID  CC No                MRN   Name
US       xxxxx3233    4539 xxxx xxxx xxxx  null  John Doe
US       xxxxx7465    5391 xxxx xxxx xxxx  null  Jane Doe

Japan Health Policy Admins view the relevant columns unmasked but are restricted by row filtering policies to data for Japan persons only:
Country  National ID  Name      MRN
Japan    232323233    John Doe  8233054331
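A minimal sketch of how a combined row filter and column masking policy could produce the US customer support view shown on this slide. It is an in-memory illustration only, not Ranger's implementation or API: the policy dictionary layout and the helpers `show_last_4`, `show_first_4_card`, `nullify`, and `apply_policy` are hypothetical and were chosen to reproduce the masked values from the slide.

```python
# Source rows from the slide (DOB and Policy ID omitted for brevity).
ROWS = [
    {"country": "US", "national_id": "232323233",
     "cc_no": "4539067047629850", "name": "John Doe", "mrn": "8233054331"},
    {"country": "US", "national_id": "333287465",
     "cc_no": "5391304868205600", "name": "Jane Doe", "mrn": "3736885376"},
    {"country": "Japan", "national_id": "T30007873",
     "cc_no": "4532488639863821", "name": "Ben Jackson", "mrn": "876392473A"},
]


def show_last_4(value):
    """Replace everything except the last four characters with 'x'."""
    return "x" * (len(value) - 4) + value[-4:]


def show_first_4_card(value):
    """Keep the first four digits of a 16-digit card number, mask the rest."""
    return value[:4] + " xxxx xxxx xxxx"


def nullify(value):
    """Hide the value entirely."""
    return None


# Hypothetical policy for the US customer support group.
US_SUPPORT_POLICY = {
    "row_filter": lambda row: row["country"] == "US",
    "columns": ["country", "national_id", "cc_no", "mrn", "name"],
    "masks": {
        "national_id": show_last_4,
        "cc_no": show_first_4_card,
        "mrn": nullify,
    },
}


def apply_policy(rows, policy):
    """Drop rows the policy filters out, then mask the visible columns."""
    result = []
    for row in rows:
        if not policy["row_filter"](row):
            continue
        visible = {}
        for column in policy["columns"]:
            mask = policy["masks"].get(column)
            visible[column] = mask(row[column]) if mask else row[column]
        result.append(visible)
    return result


us_view = apply_policy(ROWS, US_SUPPORT_POLICY)
for row in us_view:
    print(row)
```

Applying the row filter before the masks means a masked column never needs to be evaluated for rows the user cannot see at all.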
19.
THANK YOU
@uprush