MED402: Building a Scalable Video / Digital Asset
Management (DAM) Platform in the Cloud
Michael Limcaco – Enterprise Solutions Architect (AWS)
Jonathan Rivers – Director, Technical Operations (PBS)
November 15, 2013

© 2013 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified, or distributed in whole or in part without the express consent of Amazon.com, Inc.
Agenda
• The big picture
• Architecture
• Build-out exercise
• Customer case study (PBS)
• Observations and summary
Big Picture: Enterprise Media Architecture
[Diagram labels: Camera (HD-SDI); Live Stream (RTMP, MPEG-TS); Media Files; Physical Media; Integrated Workflow; Transcoders (store output profile and file); Content Management, Discovery & Delivery]
Big Picture: Digital Asset Management (DAM)
[Diagram labels: Camera (HD-SDI); Live Stream (RTMP, MPEG-TS); Media Files; Physical Media; Integrated Workflow; DAM; Transcoders (store output profile and file); Content Management, Discovery & Delivery]
Workflow Management
Ingest

Processing

Storage

Discovery &
Delivery
Key DAM Requirements
• Ingest
• Metadata extraction
• Create renditions
• Build the catalog
• Enable rich search
• Manage storage lifecycle
• Provide efficient delivery of media assets
Why Scalable?
• Increasing volume, variety, velocity
– Collectors, cameras, sensors and sources
• Ex: UGC, raw source, Mezzanine, B-roll, creative collateral
• Final content

– Formats and standards
• Transport, containers, codecs, metadata
• SD, HD, 4K … 8K

– Devices and user expectations

• Opportunities through cloud enablement
– Media platform as a service
– Multitenancy
What about Search? Ugh …
• Core elements
– Project, keyword, asset name, tags, date/time capture, timecode range,
subject, format, size

• Extended structured search
– Dublin Core, XMP, MPEG-7, IPTC, EXIF, FCXML, SMPTE, MISB

• Unstructured search
– Comments, notes, transcript, closed captioning
Enough Theory …
Let’s Build a DAM in the Cloud!
(Demo)

The User Experience
(Notional Reference Client)
Architecture
[Diagram labels: Delivery Cache; DAM Storage & Archive (S3 Buckets for Renditions, Metadata Sidecar Files); Mailbox Event Handler; DAM Web Service; DAM Web Interface (AWS Elastic Beanstalk); Rendition Processing (EC2 Workers, Auto Scaling Group); Mailbox; Metadata Processing (EC2 Workers, Auto Scaling Group); Catalog (DynamoDB); Amazon CloudSearch]
Tools Available to Us
Need            Description                                   AWS Service
Ingest          Integrate with existing file-based workflows  Amazon S3
Metadata        Process inline and sidecar files              EC2 / Elastic Beanstalk
Renditions      Autogenerate thumbnails and proxies           Amazon Elastic Transcoder
Catalog part 1  Administrative entities, simple retrieval     Amazon DynamoDB
Catalog part 2  Field and free-form search                    Amazon CloudSearch
Storage         Nearline, online, offline infinite storage    Amazon S3, Amazon Glacier
Delivery        Global caching and streaming footprint        Amazon CloudFront
Catalog: A Word on Why DynamoDB
[Diagram labels: Container-A, Container-B, Container-C (Header, Layer-1, Layer-2); NoSQL Data Model with columns Core Elem1, Core Elem2, Elem from A, Elem from B; rows Name_A, Name_B, Name_C with attributes such as Size and Some_Field]
Catalog: A Word on Why CloudSearch
• Video and text
– Header fields with textual descriptions, synopsis, comments
– Tracks with speech to text, closed caption data
– Links to scripts

• Video and structured elements
– XMP dynamic media
– Sidecar files

• A managed search engine dedicated to these kinds of problems
– Case folding, stemming, stopword removal, synonyms
– Also accent normalization, UTF-8 normalization, etc.
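To make the text-processing terms above concrete, here is a minimal hand-rolled sketch of case folding, accent normalization, and stopword removal. It is an illustration only, not how CloudSearch implements these steps; the function name and the tiny stopword list are assumptions, and stemming is left out.

```javascript
// Illustration only: not CloudSearch's implementation.
// The stopword list is a small, hypothetical sample.
const STOPWORDS = new Set(['a', 'an', 'and', 'of', 'the', 'to']);

function normalizeTerms(text) {
  return text
    .normalize('NFD')                 // split accented letters into base letter + combining mark
    .replace(/[\u0300-\u036f]/g, '')  // drop the combining marks (accent normalization)
    .toLowerCase()                    // case folding
    .split(/[^a-z0-9]+/)              // tokenize on anything that is not a letter or digit
    .filter((t) => t.length > 0 && !STOPWORDS.has(t)); // stopword removal
}
```

With a managed search service these steps are configured rather than coded, which is the point of the slide.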
Other Goodies
• Back-end services
– AWS CLI
– Open source decode utilities
• ExifTool
• MediaInfo

– ETL support
• Talend (representative)

• Front-end services
– Node.js + AWS Node SDK
[Diagram labels: CloudFront Download Distribution (Media Content); Amazon S3 Storage for Source, Renditions, Metadata Sidecar Files; EC2 Crawler; Amazon SNS Topic; Amazon SQS Queue (Rendition Jobs); EC2 ASG Rendition Workers; Elastic Transcoder (Proxy / Thumbnail Generation); Amazon SQS Queue (Metadata Processing Jobs); EC2 ASG Metadata Workers; DAM Catalog (Amazon DynamoDB); Amazon CloudSearch; DAM Web Service (AWS Elastic Beanstalk)]
(Dual Screen)

Walkthrough
Setup
• Amazon Simple Storage Service (S3) buckets
ready to go
– External staging locations
– Internal working locations

• Amazon Simple Notification Service (SNS) +
Amazon Simple Queue Service (SQS) wired up
• Catalog data models established
– Amazon DynamoDB table “catalog” created
– Amazon CloudSearch search domain “catalog” created
1. Ingest, Crawl, Notify
a. End user initiates data copy
b. EC2 worker scans Amazon S3 staging bucket
c. EC2 worker copies or moves content
d. EC2 worker broadcasts “NEW DATA” event (SNS)
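The crawl-and-notify steps above can be sketched as a pure planning function: given the keys found in the staging bucket, decide what to copy and what to announce. The bucket names, the `source/` prefix, and the JSON shape of the event are all assumptions for illustration; the actual S3 copy and SNS publish calls are left out.

```javascript
// Hypothetical bucket names and event shape; none of these come from the slides.
const STAGING_BUCKET = 'dam-staging';
const WORKING_BUCKET = 'dam-working';

// Given object keys found in the staging bucket, plan one copy and one
// "NEW DATA" notification per media file.
function planIngest(stagedKeys) {
  return stagedKeys
    .filter((key) => !key.endsWith('/')) // skip folder placeholder keys
    .map((key) => ({
      copy: {
        from: { bucket: STAGING_BUCKET, key },
        to: { bucket: WORKING_BUCKET, key: 'source/' + key },
      },
      // Body the crawler would publish to the SNS topic.
      message: JSON.stringify({
        event: 'NEW DATA',
        bucket: WORKING_BUCKET,
        key: 'source/' + key,
      }),
    }));
}
```

Keeping the plan separate from the AWS calls makes the crawler easy to test without touching real buckets.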
2. Metadata Extraction
a. EC2 worker polls inbox (SQS)
b. EC2 worker pulls down media asset from
Amazon S3
c. EC2 worker parses media files
d. EC2 worker pumps metadata through ETL flow
to prepare for catalog insertion
e. EC2 worker inserts into catalog (Amazon
DynamoDB)
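Step (a) above can be sketched as follows. When an SQS queue is subscribed to an SNS topic without raw message delivery, the queue message body is a JSON envelope whose `Message` field holds the published string. The payload shape (`event`, `bucket`, `key`) and the function name are assumptions for illustration.

```javascript
// Unwrap an SQS message body that carries an SNS notification.
// Without raw message delivery, SNS wraps the published payload in a JSON
// envelope whose "Message" field holds the original string.
function parseNewDataJob(sqsBody) {
  const envelope = JSON.parse(sqsBody);
  const payload = JSON.parse(envelope.Message); // hypothetical payload shape
  if (payload.event !== 'NEW DATA') {
    return null; // not an ingest event; the worker would skip it
  }
  return { bucket: payload.bucket, key: payload.key };
}
```

The worker would then fetch the object named by `bucket` and `key` from Amazon S3 and hand it to the parser.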
Preparing for Amazon DynamoDB Insert

{
"COMPLETE_NAME" :
{ "S" : "01_01_SoccerF_05_A.mp4" },
"FORMAT" :
{ "S" : "MPEG-4" },
"CODEC_ID" :
{ "S" : "mp42" }
}
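A record like the one above could be produced from flat parser output with a small helper. This is a sketch, not the Talend flow shown in the session: the input field names (`Complete name`, `Codec ID`) assume MediaInfo-style labels, and the function name is hypothetical.

```javascript
// Convert a flat metadata record into DynamoDB attribute-value JSON.
// Strings map to { S: ... } and numbers to { N: "<string>" }, which is how the
// DynamoDB low-level API encodes them. Empty values are skipped as a precaution.
function toDynamoItem(metadata) {
  const item = {};
  for (const [name, value] of Object.entries(metadata)) {
    if (value === null || value === undefined || value === '') continue;
    // "Complete name" becomes "COMPLETE_NAME"
    const attr = name.trim().toUpperCase().replace(/[^A-Z0-9]+/g, '_');
    item[attr] = typeof value === 'number'
      ? { N: String(value) }
      : { S: String(value) };
  }
  return item;
}
```

Because DynamoDB items need no fixed schema beyond the key, fields that exist only for some container formats can be added without a table change.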
Model It and Deploy to EC2! (Talend)
3. Catalog Processing
a. Store metadata record in Amazon DynamoDB
b. Reflect searchable subset to Amazon
CloudSearch
c. Go crazy (HTTP GET)
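Step (b) above, reflecting a searchable subset, might look like the sketch below. It builds one "add" operation for a document batch from a DynamoDB item. The `type`/`id`/`version`/`lang`/`fields` layout reflects my understanding of the 2011-02-01 batch format and should be checked against the service documentation; the field list and id scheme are assumptions.

```javascript
// Hypothetical list of catalog attributes worth exposing to search.
const SEARCHABLE = ['COMPLETE_NAME', 'FORMAT', 'CODEC_ID'];

// Build one "add" operation for a document batch from a DynamoDB item.
function toSearchDocument(item, version) {
  const fields = {};
  for (const name of SEARCHABLE) {
    const attr = item[name];
    if (attr && attr.S !== undefined) {
      fields[name.toLowerCase()] = attr.S; // lowercase field names, as in the query slide
    }
  }
  // Assumed id scheme: lowercase file name, other characters replaced by "_".
  const id = item.COMPLETE_NAME.S.toLowerCase().replace(/[^a-z0-9]+/g, '_');
  return { type: 'add', id, version, lang: 'en', fields };
}
```

Only the subset goes to search; the full record stays in DynamoDB and is fetched by key once a search hit identifies the asset.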
Querying the Catalog (Amazon CloudSearch)
• http://cloudsearch.demo.aws.com/2011-02-01/search?bq=complete_name : …<field=value>
• In Node.js

var optionsget = {
  host : 'cloudsearch.demo.aws.com', // here only the domain name
  port : 80,
  // Double quotes outside so the single-quoted search term stays inside the string
  path : "/2011-02-01/search?bq=complete_name:'-STRAWBERRY'" +
         '&return-fields=complete_name,text_relevance,codec_id_info,' +
         'duration,file_size,encoded_date',
  method : 'GET'
};
Customer Case Study (PBS)
Merlin: PBS CMS/DAM
• Code name Merlin
• Structured metadata
• 200+ web object records daily
– 29,046 web objects
• 150+ video objects daily
– 91,436 videos
• Users from over 150 stations and 30 national producers
– Frontline
– Downton Abbey
– PBS NewsHour
What’s It Do?
• Large multitenant system
– 1200 registered users

• 250 million streams per month
• 20 million unique viewers
• 8 PB of video delivered monthly
Getting Data In
• 33 ingestible web feeds
– Content editors
– Web page listings

• Batch video ingest API
– Video content editors
– External workflow integration

• Manually entered videos
– Video content editors from all 50 states
– Number of user accounts
System Overview
[Diagram labels: User Input; Ingest API; RSS; DAM (Merlin); Workflow Service; Amazon SWF; Content API; Search Util; Amazon CloudSearch; Amazon RDS (two instances); Amazon S3; CDN]
Basic Workflow
• Object registered with Merlin
• Images registered and processed with ITS
– Stored in CDN fronted Amazon S3 bucket

• Videos registered with VTS
– Jobs sent to Zencoder for processing
– Video stored in CDN fronted Amazon S3 bucket

• Objects ready for clients
– Objects rendered for consumption in Amazon S3
– Objects registered with APIs
– Objects discoverable
Making It Discoverable
• Search util service
• Runs every hour
– Re-indexes last several hours each time

• Polls APIs
– Content API
– Modified time

• Updates Amazon CloudSearch index
– 2 primary indexes
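The hourly run with an overlapping look-back window can be sketched as a filter on modified time. The object shape (`id`, `modified`) and the `lookbackHours` setting are assumptions; the slides only say "last several hours".

```javascript
const HOUR_MS = 60 * 60 * 1000;

// Return the objects modified inside the look-back window ending at `now`.
// `lookbackHours` is a hypothetical setting; the slides only say "several hours".
function selectForReindex(objects, now, lookbackHours) {
  const cutoff = now.getTime() - lookbackHours * HOUR_MS;
  return objects.filter((o) => new Date(o.modified).getTime() >= cutoff);
}
```

Re-indexing a window longer than the run interval means a failed or late run is covered by the next one, at the cost of re-sending some unchanged documents.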
Search Considerations
• Hidden objects
• Rights management
• Partitioned search
– Local station search
– Results by geo
– Restrict results for international customers

• Unify and normalize existing APIs
– Flatten data model

• Users looking for programs
– Specific searches
– Suitable for structured data
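Partitioned search and hidden objects can both be handled by adding filter clauses to the user's query. The sketch below builds a boolean query string in the style of the `bq` parameter; the field names (`title`, `station`, `hidden`) and the backslash escaping are assumptions, not PBS's actual schema.

```javascript
// Quote a term for a boolean query; escaping with a backslash is an assumption.
function quote(term) {
  return "'" + term.replace(/\\/g, '\\\\').replace(/'/g, "\\'") + "'";
}

// Combine the user's terms with filters for the station partition and hidden objects.
// Field names (title, station, hidden) are hypothetical.
function buildPartitionedQuery(userQuery, station) {
  const clauses = [
    'title:' + quote(userQuery),
    'station:' + quote(station),
    'hidden:0',
  ];
  return '(and ' + clauses.join(' ') + ')';
}
```

Applying the filters on the server side keeps hidden or out-of-region objects out of results regardless of which client issues the search.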
Challenges
• No native time field
– Convert dates to integers
– Epoch time

• Versioning of documents
– Epoch for versioning

• Exposing two versions of most fields
– Text searchable
– Facets (copy of text version)
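Two of the workarounds above, epoch integers for dates and a facet copy of each text field, can be sketched together. The `_facet` suffix, the record shape, and the function names are assumptions for illustration.

```javascript
// Convert an ISO 8601 date string to whole seconds since the Unix epoch.
function toEpochSeconds(isoDate) {
  const ms = Date.parse(isoDate);
  if (Number.isNaN(ms)) {
    throw new Error('Unparseable date: ' + isoDate);
  }
  return Math.floor(ms / 1000);
}

// Build search fields: every text field gets a "<name>_facet" copy
// (hypothetical naming), and the date is stored as an integer.
function toSearchFields(record) {
  const fields = { encoded_date: toEpochSeconds(record.encodedDate) };
  for (const [name, value] of Object.entries(record.text)) {
    fields[name] = value;            // text-searchable version
    fields[name + '_facet'] = value; // facet version (copy of the text)
  }
  return fields;
}
```

The same epoch value can double as the document version, since a later update always carries a larger number.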
Search Consumers (PBS.org)

Site Search
Search Consumers (Video Portal)

Site Search

Programs A-Z
Xbox / OTT
Summary
• Build an enterprise-scale DAM platform now
– Managed storage and archive (Amazon S3, Amazon Glacier)
– Managed database for catalog processing (Amazon DynamoDB, Amazon
Relational Database Service [RDS])
– Managed search (CloudSearch)

• Application development accelerators
– Elastic Beanstalk harness (web, API, and worker roles)
– Reduced effort with the AWS CLI

• (Almost) fire and forget
AWS Marketplace Can Help
• AWS online software store
– Customers can find, research, buy software
– Simple pricing, aligns with EC2 usage model
– 1-click launch in minutes
– Marketplace billing integrated into your AWS account
– 1,000+ products across 24 categories

• Digital asset management related options include:
– WebDAM – centralize, store, manage and distribute collateral
– Digital asset management cloud – web-based open source DAM
– Widen – manage and distribute digital media and brand assets with user roles and permissions
– Adobe Experience Manager – unified asset management including mobile

Learn more at: http://aws.amazon.com/marketplace
“DAM!”
Please give us your feedback on this
presentation
MED-402 Building a Scalable Video /
DAM Solution in the Cloud

As a thank you, we will select prize
winners daily for completed surveys!

AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Building a Scalable Digital Asset Management Platform in the Cloud (MED402) | AWS re:Invent 2013

  • 1. MED402: Building a Scalable Video / Digital Asset Management (DAM) Platform in the Cloud
      Michael Limcaco – Enterprise Solutions Architect (AWS)
      Jonathan Rivers – Director, Technical Operations (PBS)
      November 15, 2013
      © 2013 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified, or distributed in whole or in part without the express consent of Amazon.com, Inc.
  • 2. Agenda
      • The big picture
      • Architecture
      • Build-out exercise
      • Customer case study (PBS)
      • Observations and summary
  • 3. Big Picture: Enterprise Media Architecture
      [Diagram labels: Live Stream (RTMP, MPEG-TS); Media Files; Physical Media; Camera (HD-SDI); Transcoders (store output profile and file); Integrated Workflow; Content Management, Discovery & Delivery]
  • 4. Big Picture: Digital Asset Management (DAM)
      [Same diagram as slide 3, with a DAM component added]
  • 7. Key DAM Requirements
      • Ingest
      • Metadata extraction
      • Create renditions
      • Build the catalog
      • Enable rich search
      • Manage storage lifecycle
      • Provide efficient delivery of media assets
  • 10. Why Scalable?
      • Increasing volume, variety, velocity
          – Collectors, cameras, sensors and sources
              • Ex: UGC, raw source, mezzanine, B-roll, creative collateral
              • Final content
          – Formats and standards
              • Transport, containers, codecs, metadata
              • SD, HD, 4K … 8K
          – Devices and user expectations
      • Opportunities through cloud enablement
          – Media platform as a service
          – Multitenancy
  • 11. What about Search? Ugh …
      • Core elements
          – Project, keyword, asset name, tags, date/time capture, timecode range, subject, format, size
      • Extended structured search
          – Dublin Core, XMP, MPEG-7, IPTC, EXIF, FCXML, SMPTE, MISB
      • Unstructured search
          – Comments, notes, transcript, closed captioning
  • 12. Enough Theory … Let’s Build a DAM in the Cloud!
  • 15–21. [Architecture diagram, built up over seven slides. Labels: Delivery Cache; DAM Storage & Archive (S3 buckets for renditions and metadata sidecar files); Mailbox; Event Handler; DAM Web Service and DAM Web Interface (AWS Beanstalk); Rendition Processing (EC2 workers, Auto Scaling group); DynamoDB Catalog; Mailbox; Metadata Processing (EC2 workers, Auto Scaling group); Amazon CloudSearch]
  • 22. Tools Available to Us
      Need           | Description                                 | AWS Service
      Ingest         | Integrate with existing file-based workflows | Amazon S3
      Metadata       | Process inline and sidecar files            | EC2 / Elastic Beanstalk
      Renditions     | Autogenerate thumbnails and proxies         | Amazon Elastic Transcoder
      Catalog part 1 | Administrative entities, simple retrieval   | Amazon DynamoDB
      Catalog part 2 | Field and free-form search                  | Amazon CloudSearch
      Storage        | Nearline, online, offline infinite storage  | Amazon S3, Amazon Glacier
      Delivery       | Global caching and streaming footprint      | Amazon CloudFront
  • 23. Catalog: A Word on Why DynamoDB
      [Diagram: NoSQL data model. Labels: Container-A, Container-B, Container-C, each with a Header and layers (Layer-1, Layer-2); catalog items Name_A, Name_B, Name_C with core elements (Core Elem1, Core Elem2, Size) and container-specific elements (Elem from A, Elem from B, Some_Field)]
  • 24. Catalog: A Word on Why CloudSearch
      • Video and text
          – Header fields with textual descriptions, synopsis, comments
          – Tracks with speech to text, closed caption data
          – Links to scripts
      • Video and structured elements
          – XMP dynamic media
          – Sidecar files
      • A managed search engine dedicated to these kinds of problems
          – Case folding, stemming, stopword removal, synonyms
          – Also accent normalization, UTF-8 normalization, etc.
  • 25. Other Goodies
      • Back-end services
          – AWS CLI
          – Open source decode utilities
              • ExifTool
              • MediaInfo
          – ETL support
              • Talend (representative)
      • Front-end services
          – Node.js + AWS Node SDK
  • 27. [Architecture diagram mapped to AWS services. Labels: CloudFront Download Distribution (Media Content); Amazon S3 Storage (for source, renditions, metadata sidecar files); EC2 Crawler; Amazon SNS Topic; Amazon SQS Queue (Rendition Jobs); Rendition Workers (EC2 ASG); Elastic Transcoder (Proxy / Thumbnail Generation); Amazon SQS Queue (Metadata Processing Jobs); Metadata Workers (EC2 ASG); DAM Catalog (Amazon DynamoDB); Amazon CloudSearch; DAM Web Service (AWS Elastic Beanstalk)]
  • 29. Setup
      • Amazon Simple Storage Service (S3) buckets ready to go
          – External staging locations
          – Internal working locations
      • Amazon Simple Notification Service (SNS) + Amazon Simple Queue Service (SQS) wired up
      • Catalog data models established
          – Amazon DynamoDB table “catalog” created
          – Amazon CloudSearch search domain “catalog” created
  • 36. 1. Ingest, Crawl, Notify
      a. End user initiates data copy
      b. EC2 worker scans Amazon S3 staging bucket
      c. EC2 worker copies or moves content
      d. EC2 worker broadcasts “NEW DATA” event (SNS)
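Steps b through d can be sketched in Node.js as plain functions that make no AWS calls. The bucket names, the list of media extensions, the `source/` key prefix, and the shape of the event message are all hypothetical; the talk does not specify them. A real crawler would list the staging bucket and publish the message to the SNS topic through the AWS SDK.

```javascript
// Hypothetical names: the deck does not give bucket names or an event format.
const STAGING_BUCKET = "dam-staging";
const WORKING_BUCKET = "dam-working";

// Step b: pick the media files out of a listing of staging-bucket keys.
function findNewMedia(keys) {
  const mediaExtensions = [".mp4", ".mov", ".mxf"];
  return keys.filter((key) =>
    mediaExtensions.some((ext) => key.toLowerCase().endsWith(ext))
  );
}

// Step c: describe the copy from staging to the internal working location.
function planCopy(key) {
  return {
    source: { bucket: STAGING_BUCKET, key: key },
    target: { bucket: WORKING_BUCKET, key: "source/" + key },
  };
}

// Step d: build the "NEW DATA" message that would be published to SNS.
function buildNewDataEvent(copy) {
  return JSON.stringify({
    event: "NEW DATA",
    bucket: copy.target.bucket,
    key: copy.target.key,
  });
}

const listing = ["01_01_SoccerF_05_A.mp4", "notes.txt", "B-roll/clip02.MOV"];
const events = findNewMedia(listing).map(planCopy).map(buildNewDataEvent);
console.log(events);
```

Keeping the message small (bucket and key only) lets the rendition and metadata queues that subscribe to the topic each fetch the asset themselves.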
  • 37. 2. Metadata Extraction
      a. EC2 worker polls inbox (SQS)
      b. EC2 worker pulls down media asset from Amazon S3
      c. EC2 worker parses media files
      d. EC2 worker pumps metadata through ETL flow to prepare for catalog insertion
      e. EC2 worker inserts into catalog (Amazon DynamoDB)
  • 41. Preparing for Amazon DynamoDB Insert { "COMPLETE_NAME" : { "S" : "01_01_SoccerF_05_A.mp4" }, "FORMAT" : { "S" : "MPEG-4" }, "CODEC_ID" : { "S" : "mp42" } }
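A minimal Node.js sketch of how a worker might turn parsed metadata into the attribute-value form shown on this slide. The input labels ("Complete name", "Codec ID", "File size") are assumed to resemble a MediaInfo-style report, and `toDynamoItem` is a hypothetical helper, not something from the talk. The deck used Talend for this ETL step; this only illustrates the target shape.

```javascript
// Convert a flat metadata object into DynamoDB attribute-value notation.
// Strings become { S: "..." }; numbers become { N: "..." } (numbers are sent as strings).
function toDynamoItem(metadata) {
  const item = {};
  for (const [label, value] of Object.entries(metadata)) {
    // Skip empty values, which carry no catalog information.
    if (value === undefined || value === null || value === "") continue;
    // "Complete name" -> "COMPLETE_NAME", matching the slide's attribute names.
    const attribute = label.toUpperCase().replace(/[^A-Z0-9]+/g, "_");
    item[attribute] =
      typeof value === "number" ? { N: String(value) } : { S: String(value) };
  }
  return item;
}

// Hypothetical parser output for the asset named on the slide.
const parsed = {
  "Complete name": "01_01_SoccerF_05_A.mp4",
  "Format": "MPEG-4",
  "Codec ID": "mp42",
  "File size": 1048576,
  "Title": "",
};
const catalogItem = toDynamoItem(parsed);
console.log(JSON.stringify(catalogItem, null, 2));
```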
  • 42. Model It and Deploy to EC2! (Talend)
  • 43. 3. Catalog Processing
      a. Store metadata record in Amazon DynamoDB
      b. Reflect searchable subset to Amazon CloudSearch
      c. Go crazy (HTTP GET)
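Step b can be sketched as a projection from the DynamoDB item to a search document. The choice of searchable fields, the document id scheme, and the `toSearchDocument` helper are assumptions. The output shape (`type`, `id`, `version`, `lang`, `fields`) follows my understanding of the 2011-02-01 CloudSearch document batch format and should be checked against the service documentation before use.

```javascript
// Hypothetical choice of which catalog attributes are exposed to search.
const SEARCHABLE = ["COMPLETE_NAME", "FORMAT", "CODEC_ID"];

// Project a DynamoDB item (attribute-value notation) onto a search document.
function toSearchDocument(item, version) {
  const fields = {};
  for (const name of SEARCHABLE) {
    const attribute = item[name];
    if (attribute && attribute.S !== undefined) {
      fields[name.toLowerCase()] = attribute.S; // search field names in lowercase
    }
  }
  // Assumed id scheme: lowercase file name with other characters collapsed to "_".
  const id = item.COMPLETE_NAME.S.toLowerCase().replace(/[^a-z0-9]+/g, "_");
  return { type: "add", id: id, version: version, lang: "en", fields: fields };
}

const storedItem = {
  COMPLETE_NAME: { S: "01_01_SoccerF_05_A.mp4" },
  FORMAT: { S: "MPEG-4" },
  CODEC_ID: { S: "mp42" },
  FILE_SIZE: { N: "1048576" }, // stays in DynamoDB only
};
const batch = [toSearchDocument(storedItem, 1)];
console.log(JSON.stringify(batch, null, 2));
```

DynamoDB keeps the full record while the search domain holds only the fields users query, so the search index stays small and can be rebuilt from the catalog.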
  • 46. Querying the Catalog (Amazon CloudSearch)
      • http://cloudsearch.demo.aws.com/2011-02-01/search?bq=complete_name : …<field=value>
      • In Node.js:
          var optionsget = {
            host : 'cloudsearch.demo.aws.com', // here only the domain name
            port : 80,
            path : "/2011-02-01/search?bq=complete_name:'-STRAWBERRY'&return-fields=complete_name,text_relevance,codec_id_info,duration,file_size,encoded_date",
            method : 'GET'
          };
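A hedged variant of the request options above that URL-encodes the query instead of embedding it in a string literal. `buildSearchOptions` is a hypothetical helper; the host, field name, and search term are taken from the slide, and the list of return fields is shortened for illustration. The resulting object could be passed to Node's `http.request`, which this sketch does not call.

```javascript
// Build GET options for a 2011-02-01 style search endpoint.
function buildSearchOptions(host, field, term, returnFields) {
  // Boolean query of the form field:'term', as on the slide.
  const bq = field + ":'" + term + "'";
  const path =
    "/2011-02-01/search?bq=" + encodeURIComponent(bq) +
    "&return-fields=" + encodeURIComponent(returnFields.join(","));
  return { host: host, port: 80, path: path, method: "GET" };
}

const searchOptions = buildSearchOptions(
  "cloudsearch.demo.aws.com",
  "complete_name",
  "-STRAWBERRY",
  ["complete_name", "codec_id_info", "duration"]
);
console.log(searchOptions.path);
```

Encoding matters once search terms come from users: spaces, ampersands, or colons in a term would otherwise change the meaning of the URL.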
  • 48. Merlin: PBS CMS/DAM
      • Code name Merlin
      • Structured metadata
      • 200+ web object records daily
          – 29,046 web objects
      • 150+ video objects daily
          – 91,436 videos
      • Users from over 150 stations and 30 national producers
          – Frontline
          – Downton Abbey
          – PBS NewsHour
  • 49. What’s It Do?
      • Large multitenant system
          – 1,200 registered users
      • 250 million streams per month
      • 20 million unique viewers
      • 8 PB of video delivered monthly
  • 50. Getting Data In
      • 33 ingestible web feeds
          – Content editors
          – Web page listings
      • Batch video ingest API
          – Video content editors
          – External workflow integration
      • Manually entered videos
          – Video content editors from all 50 states
          – Number of user accounts
  • 51. System Overview
      [Diagram labels: User Input; Ingest API; RSS; DAM (Merlin); Workflow Service (Amazon SWF); Search Util; Amazon CloudSearch; Content API; Amazon RDS (two instances); Amazon S3; CDN]
  • 52. Basic Workflow
      • Object registered with Merlin
      • Images registered and processed with ITS
          – Stored in CDN-fronted Amazon S3 bucket
      • Videos registered with VTS
          – Jobs sent to Zencoder for processing
          – Video stored in CDN-fronted Amazon S3 bucket
      • Objects ready for clients
          – Objects rendered for consumption in Amazon S3
          – Objects registered with APIs
          – Objects discoverable
  • 53. Making It Discoverable
      • Search util service
      • Runs every hour
          – Re-indexes last several hours each time
      • Polls APIs
          – Content API
          – Modified time
      • Updates Amazon CloudSearch index
          – 2 primary indexes
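The hourly job with an overlapping look-back window can be sketched as a filter on modified time. The object shape, the `modified` field name, and the four-hour window are assumptions; the slide says only "last several hours".

```javascript
const HOUR_MS = 60 * 60 * 1000;

// Select objects modified within the look-back window ending at nowMs.
// Re-indexing several hours on every hourly run means one missed run does not lose updates.
function selectForReindex(objects, nowMs, lookbackHours) {
  const cutoff = nowMs - lookbackHours * HOUR_MS;
  return objects.filter((object) => Date.parse(object.modified) >= cutoff);
}

// Hypothetical Content API results.
const contentObjects = [
  { id: "a", modified: "2013-11-15T11:30:00Z" },
  { id: "b", modified: "2013-11-15T08:00:00Z" },
  { id: "c", modified: "2013-11-15T05:00:00Z" },
];
const runTime = Date.parse("2013-11-15T12:00:00Z");
const toReindex = selectForReindex(contentObjects, runTime, 4);
console.log(toReindex.map((object) => object.id));
```

Because each run overlaps the previous ones, the same object may be submitted more than once, which is harmless as long as later document versions replace earlier ones.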
  • 54. Search Considerations
      • Hidden objects
      • Rights management
      • Partitioned search
          – Local station search
          – Results by geo
          – Restrict results for international customers
      • Unify and normalize existing APIs
          – Flatten data model
      • Users looking for programs
          – Specific searches
          – Suitable for structured data
  • 55. Challenges
      • No native time field
          – Convert dates to integers
          – Epoch time
      • Versioning of documents
          – Epoch for versioning
      • Exposing two versions of most fields
          – Text searchable
          – Facets (copy of text version)
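The three workarounds on this slide can be sketched together. The record shape and the `_facet` suffix are hypothetical naming choices; the slide states only that dates become epoch integers, that the epoch doubles as the document version, and that most fields are exposed twice.

```javascript
// Convert an ISO 8601 timestamp to whole seconds since the Unix epoch.
function toEpochSeconds(isoDate) {
  return Math.floor(Date.parse(isoDate) / 1000);
}

// Prepare one record for indexing:
//  - the modified time is stored as an integer (no native time field)
//  - the same epoch value is the document version, so newer edits win
//  - each text field gets a facet copy alongside the searchable version
function toIndexDocument(record) {
  const epoch = toEpochSeconds(record.modified);
  const fields = { modified: epoch };
  for (const [name, value] of Object.entries(record.text)) {
    fields[name] = value;            // text-searchable version
    fields[name + "_facet"] = value; // facet version (copy of the text)
  }
  return { id: record.id, version: epoch, fields: fields };
}

// Hypothetical record.
const indexDoc = toIndexDocument({
  id: "video_1",
  modified: "2013-11-15T00:00:00Z",
  text: { program: "Frontline" },
});
console.log(indexDoc);
```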
  • 57. Search Consumers (Video Portal)
      • Site Search
      • Programs A-Z
  • 60. Summary
      • Build an enterprise-scale DAM platform now
          – Managed storage and archive (Amazon S3, Amazon Glacier)
          – Managed database for catalog processing (Amazon DynamoDB, Amazon Relational Database Service [RDS])
          – Managed search (CloudSearch)
      • Application development accelerators
          – Elastic Beanstalk harness (web, API, and worker roles)
          – Reduced effort with the AWS CLI
      • (Almost) fire and forget
  • 61. AWS Marketplace Can Help
      • AWS online software store
          – Customers can find, research, and buy software
          – Simple pricing, aligns with EC2 usage model
          – 1-click launch in minutes
          – Marketplace billing integrated into your AWS account
          – 1,000+ products across 24 categories
      • Digital asset management related options include:
          – WebDAM: centralize, store, manage and distribute collateral
          – Digital asset management cloud: web-based open source DAM
          – Widen: manage and distribute digital media and brand assets with user roles and permissions
          – Adobe Experience Manager: unified asset management including mobile
      Learn more at: http://aws.amazon.com/marketplace