아리랑 위성영상 AI 객체 검출 경진대회 1등 수상자 솔루션

•Download as PPTX, PDF•

1 like•1,148 views

아리랑 위성영상 AI 객체 검출 경진대회 1등 php팀의 우승자 코드 설명 자료입니다. 자세한 자료는 https://dacon.io/competitions/official/235644/talkboard/ 에서 확인 가능합니다.

Technology

https://dacon.io
아리랑 위성영상 AI 객체 검출
경진대회
Nov 24 2020
Kyung Tae Kim (MTC Ai LAB & Seoul National University)

Index
1
2
3
STEP 1
STEP 2
STEP 3
https://dacon.i
o
2
Exploratory Data Analysis
Deep Learning Model
Conclusion & QA
Exploratory Data
Analysis
Model
Conclusion & Q
A
• Backbone
• Head + Neck
• Post Processing

https://dacon.io 3
Exploratory Data
Analysis1. Rotation Augmentation
Fig.1 origin rotated Fig.3 rotatedFig.2 rotated
Since there are many orientation variations in aerial images
implement the online rotation augmentation. (e.g. 180, 270)

https://dacon.io 4
Exploratory Data
Analysis2. Mosaic Data Augmentation
Fig.1 origin image Fig.2 combines 4 images
This allows for the model to learn how to identify objects at a smaller scale than normal.
It also encourages the model to localize different types of images in different portions of the frame.

https://dacon.io 5
2. Model
Fig.1 ROI Transform Fig.2 ROI Transform
Fig.3 Spatial Transformer Network Fig.4 Dynamic Routing Algorithm

https://dacon.io 6
2. Deformable Convolution Networks
Fig.1 Training DCN Module Fig.2 DCN sampling locations (from the learned offset)
1. Enabling effective modeling of spatial transformation in convolution neural network
2. No additional supervision for learning spatial transformation
3. Significant accuracy improvements on sophisticated vision tasks

https://dacon.io 7
2. Model
Fig.1 FPN
Fig.2 Expansion Pyramid Network

https://dacon.io 10
3. Attention Expansion Pyramid Network

https://dacon.io 11
3. Conclusion
1. Mean Average Precision
2. Align Deep Feature & Activation MAP
3. Explainable Neural Networks
4. Convolutional vs Transformers

https://dacon.io 12
3. Result
Need good system 1 functionality to make a system efficient

https://dacon.io 13
4. Reference
1. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
2. Fully Convolutional Networks for Semantic Segmentation
3. Focal Loss for Dense Object Detection
4. Deformable Convolutional Networks
5. Learning RoI Transformer for Detecting Oriented Objects in Aerial Images
6. YOLOv4: Optimal Speed and Accuracy of Object Detection
7. Arbitrary-Oriented Scene Text Detection via Rotation Proposals

What's hot

글쓰는 개발자 모임, 글또Seongyun Byeon

Spark & Zeppelin을 활용한 한국어 텍스트 분류Taejun Kim

XXE: How to become a JediYaroslav Babin

Data Engineering 101DaeMyung Kang

반복적인 코드 작업 자동화, Codebone으로 손쉽게Sungju Jin

Aula 02 - Curso GRATUITO EAD de Desenvolvimento Seguro de Software com Alcyon...Alcyon Ferreira de Souza Junior, MSc

개발자를 위한 (블로그) 글쓰기 introSeongyun Byeon

[2018 데이터야놀자] 웹크롤링 좀 더 잘하기wangwon Lee

Building Advanced XSS VectorsRodolfo Assis (Brute)

Caffeine+Nicotine 尼古丁+咖啡不瞌睡的Ppt制作秘诀蔡学镛heypig

제 17회 보아즈(BOAZ) 빅데이터 컨퍼런스 - [중고책나라] : 실시간 데이터를 이용한 Elasticsearch 클러스터 최적화BOAZ Bigdata

Spark + S3 + R3를 이용한 데이터 분석 시스템 만들기AWSKRUG - AWS한국사용자모임

인프런 - 스타트업 인프랩 시작 사례Hyung Lee

OWASP AppSecCali 2015 - Marshalling PicklesChristopher Frohoff

What should a hacker know about WebDav?Mikhail Egorov

로그 기깔나게 잘 디자인하는 법Jeongsang Baek

Pwning the Enterprise With PowerShellBeau Bullock

[261] 실시간 추천엔진 머신한대에 구겨넣기NAVER D2

Attacking DrupalGreg Foss

[134]병리 AI Product 개발을 위한 데이터 관리 및 좌충우돌 삽질기NAVER D2

What's hot (20)

글쓰는 개발자 모임, 글또

Spark & Zeppelin을 활용한 한국어 텍스트 분류

XXE: How to become a Jedi

Data Engineering 101

반복적인 코드 작업 자동화, Codebone으로 손쉽게

Aula 02 - Curso GRATUITO EAD de Desenvolvimento Seguro de Software com Alcyon...

개발자를 위한 (블로그) 글쓰기 intro

[2018 데이터야놀자] 웹크롤링 좀 더 잘하기

Building Advanced XSS Vectors

Caffeine+Nicotine 尼古丁+咖啡不瞌睡的Ppt制作秘诀蔡学镛

제 17회 보아즈(BOAZ) 빅데이터 컨퍼런스 - [중고책나라] : 실시간 데이터를 이용한 Elasticsearch 클러스터 최적화

Spark + S3 + R3를 이용한 데이터 분석 시스템 만들기

인프런 - 스타트업 인프랩 시작 사례

OWASP AppSecCali 2015 - Marshalling Pickles

What should a hacker know about WebDav?

로그 기깔나게 잘 디자인하는 법

Pwning the Enterprise With PowerShell

[261] 실시간 추천엔진 머신한대에 구겨넣기

Attacking Drupal

[134]병리 AI Product 개발을 위한 데이터 관리 및 좌충우돌 삽질기

Similar to 아리랑 위성영상 AI 객체 검출 경진대회 1등 수상자 솔루션

ISARC_2019_Window-warping: A time-series data augmentation of IMU data for co...Khandakar Rashid (Linkon)

Whiteboard image reconstruction using matlabeSAT Publishing House

Data Citation Made EasyUniversity of California Curation Center

Unsupervised Computer Vision: The Current State of the ArtTJ Torres

Connecting the dots mbse process dec02 2015loydbakerjr

Tracing of an Object in Video through Mean Shift Protocolijtsrd

The Frontier of Deep Learning in 2020 and BeyondNUS-ISS

USING IMAGE CLASSIFICATION TO INCENTIVIZE RECYCLINGIRJET Journal

2014 IEEE JAVA DATA MINING PROJECT Mining weakly labeled web facial images fo...IEEEFINALYEARSTUDENTPROJECT

2014 IEEE JAVA DATA MINING PROJECT Mining weakly labeled web facial images fo...IEEEMEMTECHSTUDENTSPROJECTS

IEEE 2014 JAVA DATA MINING PROJECTS Mining weakly labeled web facial images f...IEEEFINALYEARSTUDENTPROJECTS

Transfer Learning Model for Image Segmentation by Integrating U-NetPlusPlus a...YutaSuzuki27

Bayesian Autoencoders (BAE) & Honest Thoughts on research Bang Xiang Yong

OLAP – Creating Cubes with SQL Server Analysis ServicesPeter Gfader

2016 asprs track: spatial analysis at the continental scale: a practical app...GIS in the Rockies

VTU 7TH SEM CSE DATA WAREHOUSING AND DATA MINING SOLVED PAPERS OF DEC2013 JUN...vtunotesbysree

Machine Learning Approach to Geometry Prediction in Cold Spray Additive Manuf...Daiki Ikeuchi

Smart environment for industry 4.0JawadSajid2

Pervasive Usability for Web 2.0HAUSEN HSIEH

EarthCube DDMA AGUTanu Malik

Similar to 아리랑 위성영상 AI 객체 검출 경진대회 1등 수상자 솔루션 (20)

ISARC_2019_Window-warping: A time-series data augmentation of IMU data for co...

Whiteboard image reconstruction using matlab

Data Citation Made Easy

Unsupervised Computer Vision: The Current State of the Art

Connecting the dots mbse process dec02 2015

Tracing of an Object in Video through Mean Shift Protocol

The Frontier of Deep Learning in 2020 and Beyond

USING IMAGE CLASSIFICATION TO INCENTIVIZE RECYCLING

2014 IEEE JAVA DATA MINING PROJECT Mining weakly labeled web facial images fo...

IEEE 2014 JAVA DATA MINING PROJECTS Mining weakly labeled web facial images f...

Transfer Learning Model for Image Segmentation by Integrating U-NetPlusPlus a...

Bayesian Autoencoders (BAE) & Honest Thoughts on research

OLAP – Creating Cubes with SQL Server Analysis Services

2016 asprs track: spatial analysis at the continental scale: a practical app...

VTU 7TH SEM CSE DATA WAREHOUSING AND DATA MINING SOLVED PAPERS OF DEC2013 JUN...

Machine Learning Approach to Geometry Prediction in Cold Spray Additive Manuf...

Smart environment for industry 4.0

Pervasive Usability for Web 2.0

EarthCube DDMA AGU

Recently uploaded

A Beginners Guide to Building a RAG App Using Open Source MilvusZilliz

Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo

GenAI Risks & Security Meetup 01052024.pdflior mazor

DBX First Quarter 2024 Investor PresentationDropbox

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung

Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer

MS Copilot expands with MS Graph connectorsNanddeep Nachan

Apidays New York 2024 - The value of a flexible API Management solution for O...apidays

Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays

Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays

A Year of the Servo Reboot: Where Are We Now?Igalia

Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun

Ransomware_Q4_2023. The report. [EN].pdfOverkill Security

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@

Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbuapidays

AXA XL - Insurer Innovation Award Americas 2024The Digital Insurer

Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2

Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra

Recently uploaded (20)

A Beginners Guide to Building a RAG App Using Open Source Milvus

Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...

GenAI Risks & Security Meetup 01052024.pdf

DBX First Quarter 2024 Investor Presentation

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...

Axa Assurance Maroc - Insurer Innovation Award 2024

MS Copilot expands with MS Graph connectors

Apidays New York 2024 - The value of a flexible API Management solution for O...

Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...

Strategies for Landing an Oracle DBA Job as a Fresher

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...

A Year of the Servo Reboot: Where Are We Now?

Powerful Google developer tools for immediate impact! (2023-24 C)

Ransomware_Q4_2023. The report. [EN].pdf

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...

Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu

AXA XL - Insurer Innovation Award Americas 2024

Exploring the Future Potential of AI-Enabled Smartphone Processors

Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving

아리랑 위성영상 AI 객체 검출 경진대회 1등 수상자 솔루션

1. https://dacon.io 아리랑 위성영상 AI 객체 검출 경진대회 Nov 24 2020 Kyung Tae Kim (MTC Ai LAB & Seoul National University)

2. Index 1 2 3 STEP 1 STEP 2 STEP 3 https://dacon.i o 2 Exploratory Data Analysis Deep Learning Model Conclusion & QA Exploratory Data Analysis Model Conclusion & Q A • Backbone • Head + Neck • Post Processing

3. https://dacon.io 3 Exploratory Data Analysis1. Rotation Augmentation Fig.1 origin rotated Fig.3 rotatedFig.2 rotated Since there are many orientation variations in aerial images implement the online rotation augmentation. (e.g. 180, 270)

4. https://dacon.io 4 Exploratory Data Analysis2. Mosaic Data Augmentation Fig.1 origin image Fig.2 combines 4 images This allows for the model to learn how to identify objects at a smaller scale than normal. It also encourages the model to localize different types of images in different portions of the frame.

5. https://dacon.io 5 2. Model Fig.1 ROI Transform Fig.2 ROI Transform Fig.3 Spatial Transformer Network Fig.4 Dynamic Routing Algorithm

6. https://dacon.io 6 2. Deformable Convolution Networks Fig.1 Training DCN Module Fig.2 DCN sampling locations (from the learned offset) 1. Enabling effective modeling of spatial transformation in convolution neural network 2. No additional supervision for learning spatial transformation 3. Significant accuracy improvements on sophisticated vision tasks

7. https://dacon.io 7 2. Model Fig.1 FPN Fig.2 Expansion Pyramid Network

8. https://dacon.io 8 3. Result

9. https://dacon.io 9 3. Result

10. https://dacon.io 10 3. Attention Expansion Pyramid Network

11. https://dacon.io 11 3. Conclusion 1. Mean Average Precision 2. Align Deep Feature & Activation MAP 3. Explainable Neural Networks 4. Convolutional vs Transformers

12. https://dacon.io 12 3. Result Need good system 1 functionality to make a system efficient

13. https://dacon.io 13 4. Reference 1. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks 2. Fully Convolutional Networks for Semantic Segmentation 3. Focal Loss for Dense Object Detection 4. Deformable Convolutional Networks 5. Learning RoI Transformer for Detecting Oriented Objects in Aerial Images 6. YOLOv4: Optimal Speed and Accuracy of Object Detection 7. Arbitrary-Oriented Scene Text Detection via Rotation Proposals

14. THANK YOU THANK YOU https://dacon.io 14

Editor's Notes

Convolution Offset or Weight를 사용하여 geometric transform에서의 약점을 극복 하려고함 (e.g 기본적으로 convolution filter에 space sampling을 적용하여 공간값을 확장 시키고, 확장된 공간의 convolution weight를 학습하여 해결을 도모함) 다양한 해상도, 수평, 다중의 방향성등의 이미지 영역을 구분해야 하는데 고정된 convolution filter로 학습을 할 경우 효율적으로 localize를 하기 어려움 (e.g. RPN [0.5, 1, 1.5] X [64, 128, 256, 512]) 그렇기 때문에 localize에 포함된 이미지에 맞춰서 offset을 학습하는 구조 (특히 convolution layer3-5)을 사용할 경우 가장 좋은 성능을 보임
1. Neck Module 부분에서 기존의 Feature Pyramid Network를 사용하여 실험 했을 경우 학습데이터의 특성상 Pyramid Module 3, Pyramid Module 4의 context(activation map) 정보의 소실이 발생 하게 됨 2. 약점을 해결하기 위해 global attention sample Module을 추가함 GA Moduel은 High ReSolution Extraction Feature(P4)와 Low Level Extraction Feature(P3)를 단순히 Activation Function으로 연산하는 것이 아니라 P3 Feature를 3 x 3 convolution 연산을 하고, P4 Feature 를 Global Pooling을 수행한뒤에 3. 1 x 1 convolution 연산을 진행 한뒤 P3 module에 P4 module의 global pooling된 값을 곱한뒤 P4의 weight를 곱하면 기존의 UP Sampling을 할경우 손실 되었던 P3 Module의 정보를 극대화 시킴

아리랑 위성영상 AI 객체 검출 경진대회 1등 수상자 솔루션

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to 아리랑 위성영상 AI 객체 검출 경진대회 1등 수상자 솔루션

Similar to 아리랑 위성영상 AI 객체 검출 경진대회 1등 수상자 솔루션 (20)

More from DACON AI 데이콘

More from DACON AI 데이콘 (20)

Recently uploaded

Recently uploaded (20)

아리랑 위성영상 AI 객체 검출 경진대회 1등 수상자 솔루션

Editor's Notes