SlideShare a Scribd company logo
1 of 42
Download to read offline
Video AI for Media and
Entertainment Industry
Albert Y. C. Chen, Ph.D.
Vice President, R&D
Viscovery
Albert Y. C. Chen, Ph.D.
陳彥呈博⼠士
• Experience
2017-present: Vice President of R&D @ Viscovery
2016-2017: Chief Scientist @ Viscovery
2015: Principal Scientist @ Nervve Technologies
2013-2014 Computer Vision Scientist @ Tandent
2011-2012 @ GE Global Research
• Education
Ph.D. in Computer Science, SUNY-Buffalo
M.S. in Computer Science, NTNU
B.S. in Computer Science, NTHU
Viscovery = Video Discovery
Optical Character
Recognition
Offline
Recognition
2013
2014
Product Recognition
2015
Video Content related
Advertisements
2017
Wearable Devices
Video Content Discovery &
Interaction
2016
Leading provider of Video AI analytic products
Current AI does not “solve it all”
appl.
layer
tech
layer
infra
layer
solution
platform
libraries
modules
data
machine computing power
data accumulation via open API
AI/DNN library AI/DNN library
gen purpose
platforms
gen purpose
platforms
app-specific
platforms
app-specific
platforms
app app app app app
HW
co.
VerticalAIStartups
agri. manu. med. fin. retail trans.
E.g., 1: Google, Amazon, FB, 2: IBM, 3: Walmart, 5: NVidia
Vertical AI
Solving industry-specific problems by combining
AI and Subject Matter Expertise.
• Full Stack Products
• Subject Matter Expertise
• Proprietary Data
• AI delivers core value
(Bradford Cross, 2017/06/14)
Vertical AI Example
Media & Entertainment
Industry’s challenge
• Internet Era: Make content free, maximize traffic,
ad revenue waiting at the end of the rainbow?
• It worked for nearly 20 years, with Google and
Facebook being the only beneficiary; they control
75% of digital ad revenue, 99% of future growth.
• Is this business model still working? Does it work
for others? The latest unicorns from Silicon Valley
are suggesting otherwise.
Content Farms, maximizing traffic,
killing the Internet along the way.
NY Time saying no. WSJ and
many others are following.
Source: https://www.nytimes.com/projects/2020-report/
People are willing to pay for
good content
The curveball: App Stores
and News Syndicators!
• News Republic (acquired for 57M use, Aug 2016)
• 12.5 million daily active users
• 60k USD annual revenue
• 今⽇日頭條 (toutiao.com)
• 80 million daily active users.
• 1B USD annual revenue.
Pay source, or pay platform?
• Platform:
• More focus, less distraction: news focus on
content instead of customer service, software
development, etc.
• Potential Problem:
• Facebook and Google control 75% of all traffic
and 99% of expected future growth?
Netflix
• Netflix spends $250m USD yearly on
personalization and content recommendation.
• 104m subscribers worldwide; 52m in US (75%
market penetration, #1 in US, Youtube #2 at
53%)
• Netflix subscribers watch 19 days per month, for
28H/month (#2, less than Dish’s 47 H/month)
Netflix annual revenue
(2002—2016)
https://www.statista.com/statistics/272545/annual-revenue-of-netflix/
Netflix net income
(2000—2016)
https://www.statista.com/statistics/272561/netflix-net-income/
People are willing to pay, for
good content, good service.
The evolution of methods for
monetizing text/video content
Struggling
Traditional
Media
Free Content
Ad Revenue
Subscription
Revenue
2000 2005 2010
Do nothing?
Sitting Duck.
Improve
Ad Revenue?
Ad Tech
now
Video
Content-related
ads
Own platform?
shared
platform, licensed
content?
tailored
recommendations
(improve UX & stickiness)
(user & video
content related
recommendations)
Video Data
Mining
If we already have such precise
indexing of video content
Jay Chao
singing A
dancing B
wearing C
with items D
in front of E
at time F?• We will disrupt:
• advertisement
• e-commerce
• online video platform ecosystem
• screenwriting, film producuction and film editing..
Video content-related
advertisements
Previous moment: dining scene Insert Food Deliver Service ad Next Moment: dining scene
饿了了吗?快点饿了了么!
Food Delivery Service Ad:
Previous moment: dining scene Insert KFC ad Next second: dining scene
炸鸡红包快
来抢!
Restaurant Ad:
Video content-related
advertisements
Previous moment: driving scene Insert Automobile ad Next moment: driving scene
Automobile Ad:
Consumer Electronics Ad:
Video content-related
interactive shopping
Video content-related
Recommendations
Video-content insights
(for producers, writers, editors)
Viscovery’s video insight publication on “Ode to Joy 2”
Mining Video Content with
Computer Vision
• 85% of data are unstructured, e.g., videos.
• Previously, videos need manual tagging before its
content can be indexed and further utilized.
• Computer Vision is the AI subfield that focuses on
recognizing and understanding visual content.
What algorithms do we need?
Face Motion
Image
scene Text Audio Object
Semantics
Where are we now?
• Face
• Object
• Scene
• Logos
• Text
• Audio
• Motion
• Semantics
Where are new now?
Face Recognition
• 1 to 1: 99%+
• 1 to 100: 90%
• 1 to 10,000:
50%-70%.
• 1 to 1M: 30%.
LFW dataset, common FN↑, FP↓
Where are we now?
Image Scene Classification
• MIT Places 365
dataset.
• top-5 accuracy
rates >85%.
Where are we now?
Object Detection & Classification
• ImageNet Large Scale Visual
Recognition Challenge (ILSVRC)
• 1000+ classes, 1.2M images.
0
0.125
0.25
0.375
0.5
11 12 13 14 11 12 13 14
classification
error
classification
+localization error
Putting things together is not
trivial and often very messy.
Classical Workflow:
1. Data collection
2. Feature Extraction
3. Dimension Reduction
4. Classifier (re)Design
5. Classifier Verification
6. Deploy
Modern Brute-force workflow
1. Data collection
2. Throw everything into a Deep Neural Network
3. Mommy, why doesn’t it work ???
Classical Problem #1:
Curse of Dimensionality
坐
ze
sit
って
앉다
sentarse
• Number of Variables vs Number of Samples
Q. Who would make such naive mistakes?
A. Many “newbies” repeatedly do so.
Example 1-1:
illegal parking detection
legal parking samples x100 illegal parking samples x100
Let’s train a 150-layer Res-Net!!!
What could possibly go wrong?
Example 1-1:
illegal parking detection
• Data: try cleaner data
• Feature: fine-tune with pre-trained model; don’t
train from scratch
• Classifier overfitting: beware of statistical
coincidences,
Example 1-2: Smart Photo
Album with Google Cloud Vision
Example 1-2: Smart Photo
Album with Google Cloud Vision
No effective distance measure for thousands,
if not millions of dimensions (tags); would be
approximately zero most of the time.
Classical Problem #2:
Overfitting Data
• Make sure your deep learning algorithm is
learning better features for data, not overfitting
the data with complex classifiers.
Luckily, we’re in AI startup boom!
(BCG AI Report, 2016/10)
appl.
layer
tech
layer
infra
layer
solution
platform
libraries
modules
data
machine computing power
data accumulation via open API
AI/DNN library AI/DNN library
gen purpose
platforms
gen purpose
platforms
app-specific
platforms
app-specific
platforms
app app app app app
HW
co.
VerticalAIStartups
agri. manu. med. fin. retail trans.
E.g., 1: Google, Amazon, FB, 2: IBM, 3: Walmart, 5: NVidia
Vertical AI Startups
Solving industry-specific problems by combining
AI and Subject Matter Expertise.
• Full Stack Products
• Subject Matter Expertise
• Proprietary Data
• AI delivers core value
(Bradford Cross, 2017/06/14)
Examples of Vertical AI
beating General Purpose AI
TOP 5 TAGS COMPARISON
TAG
AD PLACEMENT
VALUE
TAG
AD PLACEMENT
VALUE
Person Low
Coulee Nazha
(actress)
High
Anime Low Sean Sun (actor) High
Screenshot Low Back of smartphone High
Cartoon Low Female Medium
Adult Medium Young Medium
“FIRST LOVE” DRAMA SERIES SCENE
Competitive Analysis
Baidu vs. Viscovery
TOP 5 TAGS COMPARISON
TAG (Man’s Face)
AD PLACEMENT
VALUE
TAG
AD PLACEMENT
VALUE
Age: 32 Medium Necklace High
Asian Medium Baseball cap High
Male Medium Bracelet High
Not smiling Low (inaccurate) Ziwen Wang High
Examples of Vertical AI
beating General Purpose AI
Use AI to turn unstructured
video data into a gold mine!
60 mins0 mins
服饰 汽⻋车
代⾔言⼈人
聚会
⼿手机
居家
z
CTR: 0.2%
60 mins0 mins
旅游 活⼒力力 汽⻋车
⼯工作 聊天
z
60 mins0 mins
学习
using only physical tags
for recommendation
CTR: 0.9%
CTR: 2.0%
z
z
Smartphone Ad physical plus abstract
and emotional tags
physical, abstract and
emotional tags plus feedback
客厅
欢乐客厅
聊天⼯工作⼿手机 代⾔言⼈人 欢乐旅游
Thank you!
albert@viscovery.com

More Related Content

What's hot

실리콘밸리의 한국인 2015 - 권기태 발표
실리콘밸리의 한국인 2015 - 권기태 발표실리콘밸리의 한국인 2015 - 권기태 발표
실리콘밸리의 한국인 2015 - 권기태 발표StartupAlliance
 
Bringing AI to Business Intelligence
Bringing AI to Business IntelligenceBringing AI to Business Intelligence
Bringing AI to Business IntelligenceSi Krishan
 
FROM BI TO APPLIED AI
FROM BI TO APPLIED AIFROM BI TO APPLIED AI
FROM BI TO APPLIED AILior Sidi
 
Anthony Nystrom - Intridea - Date Science in the NOW, it takes an Army of tools
Anthony Nystrom - Intridea - Date Science in the NOW, it takes an Army of toolsAnthony Nystrom - Intridea - Date Science in the NOW, it takes an Army of tools
Anthony Nystrom - Intridea - Date Science in the NOW, it takes an Army of toolsAnthony Nyström
 

What's hot (6)

실리콘밸리의 한국인 2015 - 권기태 발표
실리콘밸리의 한국인 2015 - 권기태 발표실리콘밸리의 한국인 2015 - 권기태 발표
실리콘밸리의 한국인 2015 - 권기태 발표
 
Image analytics - A Primer
Image analytics - A PrimerImage analytics - A Primer
Image analytics - A Primer
 
Promise notes
Promise notesPromise notes
Promise notes
 
Bringing AI to Business Intelligence
Bringing AI to Business IntelligenceBringing AI to Business Intelligence
Bringing AI to Business Intelligence
 
FROM BI TO APPLIED AI
FROM BI TO APPLIED AIFROM BI TO APPLIED AI
FROM BI TO APPLIED AI
 
Anthony Nystrom - Intridea - Date Science in the NOW, it takes an Army of tools
Anthony Nystrom - Intridea - Date Science in the NOW, it takes an Army of toolsAnthony Nystrom - Intridea - Date Science in the NOW, it takes an Army of tools
Anthony Nystrom - Intridea - Date Science in the NOW, it takes an Army of tools
 

Similar to Video AI for Media and Entertainment Industry

Technology and AI sharing - From 2016 to Y2017 and Beyond
Technology and AI sharing - From 2016 to Y2017 and BeyondTechnology and AI sharing - From 2016 to Y2017 and Beyond
Technology and AI sharing - From 2016 to Y2017 and BeyondJames Huang
 
World Publishing Expo 2012 - HTML5 Cross-Platform publishing : Dream or Reality?
World Publishing Expo 2012 - HTML5 Cross-Platform publishing : Dream or Reality?World Publishing Expo 2012 - HTML5 Cross-Platform publishing : Dream or Reality?
World Publishing Expo 2012 - HTML5 Cross-Platform publishing : Dream or Reality?Twipe Mobile Solutions
 
Platform for Big Data Analytics and Visual Analytics: CSIRO use cases. Februa...
Platform for Big Data Analytics and Visual Analytics: CSIRO use cases. Februa...Platform for Big Data Analytics and Visual Analytics: CSIRO use cases. Februa...
Platform for Big Data Analytics and Visual Analytics: CSIRO use cases. Februa...Tomasz Bednarz
 
Bridging the Gap: Analyzing Data in and Below the Cloud
Bridging the Gap: Analyzing Data in and Below the CloudBridging the Gap: Analyzing Data in and Below the Cloud
Bridging the Gap: Analyzing Data in and Below the CloudInside Analysis
 
Digital Transformation: Why Public Sector Customers are Moving to the Cloud
Digital Transformation: Why Public Sector Customers are Moving to the CloudDigital Transformation: Why Public Sector Customers are Moving to the Cloud
Digital Transformation: Why Public Sector Customers are Moving to the CloudAmazon Web Services
 
The Power of Big Data - Transformation Day Public Sector London 2017
The Power of Big Data - Transformation Day Public Sector London 2017The Power of Big Data - Transformation Day Public Sector London 2017
The Power of Big Data - Transformation Day Public Sector London 2017Amazon Web Services
 
Agile data science
Agile data scienceAgile data science
Agile data scienceJoel Horwitz
 
Using AI on a Large Scale at Doccle (presented by Bram Lerouge, CEO @Doccle)
Using AI on a Large Scale at Doccle (presented by Bram Lerouge, CEO @Doccle)Using AI on a Large Scale at Doccle (presented by Bram Lerouge, CEO @Doccle)
Using AI on a Large Scale at Doccle (presented by Bram Lerouge, CEO @Doccle)Patrick Van Renterghem
 
AI in the Enterprise: Past, Present & Future - StampedeCon AI Summit 2017
AI in the Enterprise: Past,  Present &  Future - StampedeCon AI Summit 2017AI in the Enterprise: Past,  Present &  Future - StampedeCon AI Summit 2017
AI in the Enterprise: Past, Present & Future - StampedeCon AI Summit 2017StampedeCon
 
New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...
New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...
New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...Neo4j
 
Using Algorithmia to leverage AI and Machine Learning APIs
Using Algorithmia to leverage AI and Machine Learning APIsUsing Algorithmia to leverage AI and Machine Learning APIs
Using Algorithmia to leverage AI and Machine Learning APIsRakuten Group, Inc.
 
Whats Next for Machine Learning
Whats Next for Machine LearningWhats Next for Machine Learning
Whats Next for Machine LearningOgilvy Consulting
 
TIBCO Advanced Analytics Meetup (TAAM) - June 2015
TIBCO Advanced Analytics Meetup (TAAM) - June 2015TIBCO Advanced Analytics Meetup (TAAM) - June 2015
TIBCO Advanced Analytics Meetup (TAAM) - June 2015Bipin Singh
 
Data Science at Speed. At Scale.
Data Science at Speed. At Scale.Data Science at Speed. At Scale.
Data Science at Speed. At Scale.DataWorks Summit
 
Gartner: Top 10 Technology Trends 2015
Gartner: Top 10 Technology Trends 2015Gartner: Top 10 Technology Trends 2015
Gartner: Top 10 Technology Trends 2015Den Reymer
 
Webinar - Getting Started with mLearning
Webinar - Getting Started with mLearningWebinar - Getting Started with mLearning
Webinar - Getting Started with mLearningRaptivity
 
Intro to Artificial Intelligence w/ Target's Director of PM
 Intro to Artificial Intelligence w/ Target's Director of PM Intro to Artificial Intelligence w/ Target's Director of PM
Intro to Artificial Intelligence w/ Target's Director of PMProduct School
 

Similar to Video AI for Media and Entertainment Industry (20)

Technology and AI sharing - From 2016 to Y2017 and Beyond
Technology and AI sharing - From 2016 to Y2017 and BeyondTechnology and AI sharing - From 2016 to Y2017 and Beyond
Technology and AI sharing - From 2016 to Y2017 and Beyond
 
World Publishing Expo 2012 - HTML5 Cross-Platform publishing : Dream or Reality?
World Publishing Expo 2012 - HTML5 Cross-Platform publishing : Dream or Reality?World Publishing Expo 2012 - HTML5 Cross-Platform publishing : Dream or Reality?
World Publishing Expo 2012 - HTML5 Cross-Platform publishing : Dream or Reality?
 
Building Bots Using IBM Watson
Building Bots Using IBM WatsonBuilding Bots Using IBM Watson
Building Bots Using IBM Watson
 
Platform for Big Data Analytics and Visual Analytics: CSIRO use cases. Februa...
Platform for Big Data Analytics and Visual Analytics: CSIRO use cases. Februa...Platform for Big Data Analytics and Visual Analytics: CSIRO use cases. Februa...
Platform for Big Data Analytics and Visual Analytics: CSIRO use cases. Februa...
 
Bridging the Gap: Analyzing Data in and Below the Cloud
Bridging the Gap: Analyzing Data in and Below the CloudBridging the Gap: Analyzing Data in and Below the Cloud
Bridging the Gap: Analyzing Data in and Below the Cloud
 
Digital Transformation
Digital TransformationDigital Transformation
Digital Transformation
 
Digital Transformation: Why Public Sector Customers are Moving to the Cloud
Digital Transformation: Why Public Sector Customers are Moving to the CloudDigital Transformation: Why Public Sector Customers are Moving to the Cloud
Digital Transformation: Why Public Sector Customers are Moving to the Cloud
 
The Power of Big Data - Transformation Day Public Sector London 2017
The Power of Big Data - Transformation Day Public Sector London 2017The Power of Big Data - Transformation Day Public Sector London 2017
The Power of Big Data - Transformation Day Public Sector London 2017
 
Agile data science
Agile data scienceAgile data science
Agile data science
 
Using AI on a Large Scale at Doccle (presented by Bram Lerouge, CEO @Doccle)
Using AI on a Large Scale at Doccle (presented by Bram Lerouge, CEO @Doccle)Using AI on a Large Scale at Doccle (presented by Bram Lerouge, CEO @Doccle)
Using AI on a Large Scale at Doccle (presented by Bram Lerouge, CEO @Doccle)
 
AI in the Enterprise: Past, Present & Future - StampedeCon AI Summit 2017
AI in the Enterprise: Past,  Present &  Future - StampedeCon AI Summit 2017AI in the Enterprise: Past,  Present &  Future - StampedeCon AI Summit 2017
AI in the Enterprise: Past, Present & Future - StampedeCon AI Summit 2017
 
New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...
New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...
New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...
 
Using Algorithmia to leverage AI and Machine Learning APIs
Using Algorithmia to leverage AI and Machine Learning APIsUsing Algorithmia to leverage AI and Machine Learning APIs
Using Algorithmia to leverage AI and Machine Learning APIs
 
Whats Next for Machine Learning
Whats Next for Machine LearningWhats Next for Machine Learning
Whats Next for Machine Learning
 
TIBCO Advanced Analytics Meetup (TAAM) - June 2015
TIBCO Advanced Analytics Meetup (TAAM) - June 2015TIBCO Advanced Analytics Meetup (TAAM) - June 2015
TIBCO Advanced Analytics Meetup (TAAM) - June 2015
 
Data Science at Speed. At Scale.
Data Science at Speed. At Scale.Data Science at Speed. At Scale.
Data Science at Speed. At Scale.
 
Gartner: Top 10 Technology Trends 2015
Gartner: Top 10 Technology Trends 2015Gartner: Top 10 Technology Trends 2015
Gartner: Top 10 Technology Trends 2015
 
Webinar - Getting Started with mLearning
Webinar - Getting Started with mLearningWebinar - Getting Started with mLearning
Webinar - Getting Started with mLearning
 
Intro to Artificial Intelligence w/ Target's Director of PM
 Intro to Artificial Intelligence w/ Target's Director of PM Intro to Artificial Intelligence w/ Target's Director of PM
Intro to Artificial Intelligence w/ Target's Director of PM
 
Jelvix portfolio
Jelvix portfolioJelvix portfolio
Jelvix portfolio
 

More from Albert Y. C. Chen

Building ML models for smart retail
Building ML models for smart retailBuilding ML models for smart retail
Building ML models for smart retailAlbert Y. C. Chen
 
為何VC不投資我的AI新創?
為何VC不投資我的AI新創?為何VC不投資我的AI新創?
為何VC不投資我的AI新創?Albert Y. C. Chen
 
數據特性 vs AI產品設計與實作
數據特性 vs AI產品設計與實作數據特性 vs AI產品設計與實作
數據特性 vs AI產品設計與實作Albert Y. C. Chen
 
Machine Learning Foundations for Professional Managers
Machine Learning Foundations for Professional ManagersMachine Learning Foundations for Professional Managers
Machine Learning Foundations for Professional ManagersAlbert Y. C. Chen
 
Prototyping and Product Development for Startups
Prototyping and Product Development for StartupsPrototyping and Product Development for Startups
Prototyping and Product Development for StartupsAlbert Y. C. Chen
 
AI創新創業的商業模式與專案風險管理
AI創新創業的商業模式與專案風險管理AI創新創業的商業模式與專案風險管理
AI創新創業的商業模式與專案風險管理Albert Y. C. Chen
 
Machine Learning Foundations for Professional Managers
Machine Learning Foundations for Professional ManagersMachine Learning Foundations for Professional Managers
Machine Learning Foundations for Professional ManagersAlbert Y. C. Chen
 
用AI創造大商機:媒體、廣告、電商、零售業的視覺辨識應用
用AI創造大商機:媒體、廣告、電商、零售業的視覺辨識應用用AI創造大商機:媒體、廣告、電商、零售業的視覺辨識應用
用AI創造大商機:媒體、廣告、電商、零售業的視覺辨識應用Albert Y. C. Chen
 
Machine Learning Foundations
Machine Learning FoundationsMachine Learning Foundations
Machine Learning FoundationsAlbert Y. C. Chen
 
AI智慧服務推動經驗分享
AI智慧服務推動經驗分享AI智慧服務推動經驗分享
AI智慧服務推動經驗分享Albert Y. C. Chen
 
媒體、影視產業、AI新創
媒體、影視產業、AI新創媒體、影視產業、AI新創
媒體、影視產業、AI新創Albert Y. C. Chen
 
Practical computer vision-- A problem-driven approach towards learning CV/ML/DL
Practical computer vision-- A problem-driven approach towards learning CV/ML/DLPractical computer vision-- A problem-driven approach towards learning CV/ML/DL
Practical computer vision-- A problem-driven approach towards learning CV/ML/DLAlbert Y. C. Chen
 
Improving Spatiotemporal Stability for Object Detection and Classification
Improving Spatiotemporal Stability for Object Detection and ClassificationImproving Spatiotemporal Stability for Object Detection and Classification
Improving Spatiotemporal Stability for Object Detection and ClassificationAlbert Y. C. Chen
 
人工智慧下的AOI變革浪潮:影像辨識技術的突破與新契機
人工智慧下的AOI變革浪潮:影像辨識技術的突破與新契機人工智慧下的AOI變革浪潮:影像辨識技術的突破與新契機
人工智慧下的AOI變革浪潮:影像辨識技術的突破與新契機Albert Y. C. Chen
 
擁抱人工智慧帶來的劇烈產業改變 @ Mix Taiwan
擁抱人工智慧帶來的劇烈產業改變 @ Mix Taiwan擁抱人工智慧帶來的劇烈產業改變 @ Mix Taiwan
擁抱人工智慧帶來的劇烈產業改變 @ Mix TaiwanAlbert Y. C. Chen
 

More from Albert Y. C. Chen (16)

Building ML models for smart retail
Building ML models for smart retailBuilding ML models for smart retail
Building ML models for smart retail
 
為何VC不投資我的AI新創?
為何VC不投資我的AI新創?為何VC不投資我的AI新創?
為何VC不投資我的AI新創?
 
數據特性 vs AI產品設計與實作
數據特性 vs AI產品設計與實作數據特性 vs AI產品設計與實作
數據特性 vs AI產品設計與實作
 
Machine Learning Foundations for Professional Managers
Machine Learning Foundations for Professional ManagersMachine Learning Foundations for Professional Managers
Machine Learning Foundations for Professional Managers
 
Prototyping and Product Development for Startups
Prototyping and Product Development for StartupsPrototyping and Product Development for Startups
Prototyping and Product Development for Startups
 
AI創新創業的商業模式與專案風險管理
AI創新創業的商業模式與專案風險管理AI創新創業的商業模式與專案風險管理
AI創新創業的商業模式與專案風險管理
 
Machine Learning Foundations for Professional Managers
Machine Learning Foundations for Professional ManagersMachine Learning Foundations for Professional Managers
Machine Learning Foundations for Professional Managers
 
用AI創造大商機:媒體、廣告、電商、零售業的視覺辨識應用
用AI創造大商機:媒體、廣告、電商、零售業的視覺辨識應用用AI創造大商機:媒體、廣告、電商、零售業的視覺辨識應用
用AI創造大商機:媒體、廣告、電商、零售業的視覺辨識應用
 
Machine Learning Foundations
Machine Learning FoundationsMachine Learning Foundations
Machine Learning Foundations
 
AI智慧服務推動經驗分享
AI智慧服務推動經驗分享AI智慧服務推動經驗分享
AI智慧服務推動經驗分享
 
影音大數據商機挖掘
影音大數據商機挖掘影音大數據商機挖掘
影音大數據商機挖掘
 
媒體、影視產業、AI新創
媒體、影視產業、AI新創媒體、影視產業、AI新創
媒體、影視產業、AI新創
 
Practical computer vision-- A problem-driven approach towards learning CV/ML/DL
Practical computer vision-- A problem-driven approach towards learning CV/ML/DLPractical computer vision-- A problem-driven approach towards learning CV/ML/DL
Practical computer vision-- A problem-driven approach towards learning CV/ML/DL
 
Improving Spatiotemporal Stability for Object Detection and Classification
Improving Spatiotemporal Stability for Object Detection and ClassificationImproving Spatiotemporal Stability for Object Detection and Classification
Improving Spatiotemporal Stability for Object Detection and Classification
 
人工智慧下的AOI變革浪潮:影像辨識技術的突破與新契機
人工智慧下的AOI變革浪潮:影像辨識技術的突破與新契機人工智慧下的AOI變革浪潮:影像辨識技術的突破與新契機
人工智慧下的AOI變革浪潮:影像辨識技術的突破與新契機
 
擁抱人工智慧帶來的劇烈產業改變 @ Mix Taiwan
擁抱人工智慧帶來的劇烈產業改變 @ Mix Taiwan擁抱人工智慧帶來的劇烈產業改變 @ Mix Taiwan
擁抱人工智慧帶來的劇烈產業改變 @ Mix Taiwan
 

Recently uploaded

Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 

Recently uploaded (20)

Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 

Video AI for Media and Entertainment Industry

  • 1. Video AI for Media and Entertainment Industry Albert Y. C. Chen, Ph.D. Vice President, R&D Viscovery
  • 2. Albert Y. C. Chen, Ph.D. 陳彥呈博⼠士 • Experience 2017-present: Vice President of R&D @ Viscovery 2016-2017: Chief Scientist @ Viscovery 2015: Principal Scientist @ Nervve Technologies 2013-2014 Computer Vision Scientist @ Tandent 2011-2012 @ GE Global Research • Education Ph.D. in Computer Science, SUNY-Buffalo M.S. in Computer Science, NTNU B.S. in Computer Science, NTHU
  • 3. Viscovery = Video Discovery Optical Character Recognition Offline Recognition 2013 2014 Product Recognition 2015 Video Content related Advertisements 2017 Wearable Devices Video Content Discovery & Interaction 2016 Leading provider of Video AI analytic products
  • 4. Current AI does not “solve it all” appl. layer tech layer infra layer solution platform libraries modules data machine computing power data accumulation via open API AI/DNN library AI/DNN library gen purpose platforms gen purpose platforms app-specific platforms app-specific platforms app app app app app HW co. VerticalAIStartups agri. manu. med. fin. retail trans. E.g., 1: Google, Amazon, FB, 2: IBM, 3: Walmart, 5: NVidia
  • 5. Vertical AI Solving industry-specific problems by combining AI and Subject Matter Expertise. • Full Stack Products • Subject Matter Expertise • Proprietary Data • AI delivers core value (Bradford Cross, 2017/06/14)
  • 7. Media & Entertainment Industry’s challenge • Internet Era: Make content free, maximize traffic, ad revenue waiting at the end of the rainbow? • It worked for nearly 20 years, with Google and Facebook being the only beneficiary; they control 75% of digital ad revenue, 99% of future growth. • Is this business model still working? Does it work for others? The latest unicorns from Silicon Valley are suggesting otherwise.
  • 8. Content Farms, maximizing traffic, killing the Internet along the way.
  • 9. NY Time saying no. WSJ and many others are following. Source: https://www.nytimes.com/projects/2020-report/
  • 10. People are willing to pay for good content
  • 11. The curveball: App Stores and News Syndicators! • News Republic (acquired for 57M use, Aug 2016) • 12.5 million daily active users • 60k USD annual revenue • 今⽇日頭條 (toutiao.com) • 80 million daily active users. • 1B USD annual revenue.
  • 12. Pay source, or pay platform? • Platform: • More focus, less distraction: news focus on content instead of customer service, software development, etc. • Potential Problem: • Facebook and Google control 75% of all traffic and 99% of expected future growth?
  • 13. Netflix • Netflix spends $250m USD yearly on personalization and content recommendation. • 104m subscribers worldwide; 52m in US (75% market penetration, #1 in US, Youtube #2 at 53%) • Netflix subscribers watch 19 days per month, for 28H/month (#2, less than Dish’s 47 H/month)
  • 16. People are willing to pay, for good content, good service.
  • 17. The evolution of methods for monetizing text/video content Struggling Traditional Media Free Content Ad Revenue Subscription Revenue 2000 2005 2010 Do nothing? Sitting Duck. Improve Ad Revenue? Ad Tech now Video Content-related ads Own platform? shared platform, licensed content? tailored recommendations (improve UX & stickiness) (user & video content related recommendations) Video Data Mining
  • 18. If we already have such precise indexing of video content Jay Chao singing A dancing B wearing C with items D in front of E at time F?• We will disrupt: • advertisement • e-commerce • online video platform ecosystem • screenwriting, film producuction and film editing..
  • 19. Video content-related advertisements Previous moment: dining scene Insert Food Deliver Service ad Next Moment: dining scene 饿了了吗?快点饿了了么! Food Delivery Service Ad: Previous moment: dining scene Insert KFC ad Next second: dining scene 炸鸡红包快 来抢! Restaurant Ad:
  • 20. Video content-related advertisements Previous moment: driving scene Insert Automobile ad Next moment: driving scene Automobile Ad: Consumer Electronics Ad:
  • 23. Video-content insights (for producers, writers, editors) Viscovery’s video insight publication on “Ode to Joy 2”
  • 24. Mining Video Content with Computer Vision • 85% of data are unstructured, e.g., videos. • Previously, videos need manual tagging before its content can be indexed and further utilized. • Computer Vision is the AI subfield that focuses on recognizing and understanding visual content.
  • 25. What algorithms do we need? Face Motion Image scene Text Audio Object Semantics
  • 26. Where are we now? • Face • Object • Scene • Logos • Text • Audio • Motion • Semantics
  • 27. Where are new now? Face Recognition • 1 to 1: 99%+ • 1 to 100: 90% • 1 to 10,000: 50%-70%. • 1 to 1M: 30%. LFW dataset, common FN↑, FP↓
  • 28. Where are we now? Image Scene Classification • MIT Places 365 dataset. • top-5 accuracy rates >85%.
  • 29. Where are we now? Object Detection & Classification • ImageNet Large Scale Visual Recognition Challenge (ILSVRC) • 1000+ classes, 1.2M images. 0 0.125 0.25 0.375 0.5 11 12 13 14 11 12 13 14 classification error classification +localization error
  • 30. Putting things together is not trivial and often very messy. Classical Workflow: 1. Data collection 2. Feature Extraction 3. Dimension Reduction 4. Classifier (re)Design 5. Classifier Verification 6. Deploy Modern Brute-force workflow 1. Data collection 2. Throw everything into a Deep Neural Network 3. Mommy, why doesn’t it work ???
  • 31. Classical Problem #1: Curse of Dimensionality 坐 ze sit って 앉다 sentarse • Number of Variables vs Number of Samples Q. Who would make such naive mistakes? A. Many “newbies” repeatedly do so.
  • 32. Example 1-1: illegal parking detection legal parking samples x100 illegal parking samples x100 Let’s train a 150-layer Res-Net!!! What could possibly go wrong?
  • 33. Example 1-1: illegal parking detection • Data: try cleaner data • Feature: fine-tune with pre-trained model; don’t train from scratch • Classifier overfitting: beware of statistical coincidences,
  • 34. Example 1-2: Smart Photo Album with Google Cloud Vision
  • 35. Example 1-2: Smart Photo Album with Google Cloud Vision No effective distance measure for thousands, if not millions of dimensions (tags); would be approximately zero most of the time.
  • 36. Classical Problem #2: Overfitting Data • Make sure your deep learning algorithm is learning better features for data, not overfitting the data with complex classifiers.
  • 37. Luckily, we’re in AI startup boom! (BCG AI Report, 2016/10) appl. layer tech layer infra layer solution platform libraries modules data machine computing power data accumulation via open API AI/DNN library AI/DNN library gen purpose platforms gen purpose platforms app-specific platforms app-specific platforms app app app app app HW co. VerticalAIStartups agri. manu. med. fin. retail trans. E.g., 1: Google, Amazon, FB, 2: IBM, 3: Walmart, 5: NVidia
  • 38. Vertical AI Startups Solving industry-specific problems by combining AI and Subject Matter Expertise. • Full Stack Products • Subject Matter Expertise • Proprietary Data • AI delivers core value (Bradford Cross, 2017/06/14)
  • 39. Examples of Vertical AI beating General Purpose AI
  • 40. TOP 5 TAGS COMPARISON TAG AD PLACEMENT VALUE TAG AD PLACEMENT VALUE Person Low Coulee Nazha (actress) High Anime Low Sean Sun (actor) High Screenshot Low Back of smartphone High Cartoon Low Female Medium Adult Medium Young Medium “FIRST LOVE” DRAMA SERIES SCENE Competitive Analysis Baidu vs. Viscovery TOP 5 TAGS COMPARISON TAG (Man’s Face) AD PLACEMENT VALUE TAG AD PLACEMENT VALUE Age: 32 Medium Necklace High Asian Medium Baseball cap High Male Medium Bracelet High Not smiling Low (inaccurate) Ziwen Wang High Examples of Vertical AI beating General Purpose AI
  • 41. Use AI to turn unstructured video data into a gold mine! 60 mins0 mins 服饰 汽⻋车 代⾔言⼈人 聚会 ⼿手机 居家 z CTR: 0.2% 60 mins0 mins 旅游 活⼒力力 汽⻋车 ⼯工作 聊天 z 60 mins0 mins 学习 using only physical tags for recommendation CTR: 0.9% CTR: 2.0% z z Smartphone Ad physical plus abstract and emotional tags physical, abstract and emotional tags plus feedback 客厅 欢乐客厅 聊天⼯工作⼿手机 代⾔言⼈人 欢乐旅游