Book Cover Recognition

Shao-Chuan Wang

Book Cover Recognition
SHAO-CHUAN (SHAWN) WANG
COMS E6737 Biometrics Final Project
sw2644@columbia.edu
http://shaochuan.info/

Book Cover Recognition System
Goal:
Input: A medium-resolution image (640 x 480), taken with a web camera or cell phone, that contains a book.
Output: The book title/ID.
Diagram: Book Cover Recognition System → “Learning from Data”

Baseline Model: Bag-of-words
Low-level feature extraction (Dense SIFT) (CVPR09).
Visual word learning (a.k.a. codebook learning) via vector quantization.
Spatial pooling of local features.
Linear SVM classification (LIBLINEAR package).
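The encoding and pooling steps of the baseline can be sketched roughly as follows. This is a minimal illustration, not the project's code: it assumes hard assignment of each descriptor to its nearest codeword and uses max pooling (the pooling function listed on the experiments slide). The function names and the toy 2-D data are hypothetical; real SIFT descriptors are 128-dimensional.

```python
def nearest_codeword(descriptor, codebook):
    """Return the index of the codeword closest to `descriptor` (squared Euclidean)."""
    best_index, best_dist = 0, float("inf")
    for index, word in enumerate(codebook):
        dist = sum((d - w) ** 2 for d, w in zip(descriptor, word))
        if dist < best_dist:
            best_index, best_dist = index, dist
    return best_index


def encode_and_max_pool(descriptors, codebook):
    """Hard-assign each descriptor to a codeword, then max-pool the one-hot codes.

    With one-hot codes, max pooling marks which visual words occur at least once.
    """
    pooled = [0.0] * len(codebook)
    for descriptor in descriptors:
        pooled[nearest_codeword(descriptor, codebook)] = 1.0
    return pooled


# Toy data (assumption): a 3-word codebook and three local descriptors.
codebook = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
descriptors = [(0.1, 0.1), (0.9, 0.2), (0.8, -0.1)]
image_feature = encode_and_max_pool(descriptors, codebook)
print(image_feature)  # [1.0, 1.0, 0.0]
```

The pooled vector is what a linear classifier would consume; in the slides that role is played by a linear SVM from LIBLINEAR.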

A Glance at the Dataset (1/2)
9 books, 30 images per book: 15 for training and 15 for testing.

A Glance at the Dataset (2/2)

Baseline: Experiments
System parameters:
15 training images per book; test on the rest.
Images resized to 300 pixels on the long side.
Visual codebook size: K = 225.
Spatial pyramid levels: 0, 1, 2.
Spatial pooling function: max.
L2-regularized L1-loss linear SVM.
5-fold cross-validation.
Results:
Nearly perfect recognition: 99.259% (134 of 135 test images; 9 books x 15 test images each).
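The arithmetic behind these parameters can be checked with a short sketch. It assumes the common spatial pyramid layout in which level l divides the image into 2^l x 2^l cells and the pooled K-dimensional vectors of all cells are concatenated; the slides do not spell this out, so the resulting feature length is an assumption. The helper names are hypothetical.

```python
def resize_to_long_side(width, height, long_side=300):
    """Scale (width, height) so that the longer side equals `long_side`."""
    scale = long_side / max(width, height)
    return round(width * scale), round(height * scale)


def pyramid_feature_length(codebook_size, levels):
    """Length of the concatenated feature, assuming 4**level cells at each level."""
    return codebook_size * sum(4 ** level for level in levels)


new_size = resize_to_long_side(640, 480)                  # (300, 225)
feature_length = pyramid_feature_length(225, [0, 1, 2])   # 225 * (1 + 4 + 16) = 4725
test_images = 9 * 15                                      # 9 books x 15 test images
accuracy_percent = round(100 * 134 / test_images, 3)      # 99.259
```

Under these assumptions each image is represented by a 4725-dimensional vector, and the reported 99.259% corresponds to a single misclassified test image.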

More Challenging Test Images
More clutter, more occlusion, more realistic.

More Challenging: Experiments
Results: Poor: 36.24% (MAP)
[Figure: confusion matrix]
So, can we “detect” and “crop” books automatically?
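For readers unfamiliar with the confusion matrix shown on this slide, here is a minimal sketch of how one is typically built from true and predicted labels. The labels below are made-up toy values, not the project's results, and the sketch does not attempt to reproduce the MAP figure, which the slides do not define.

```python
def confusion_matrix(true_labels, predicted_labels, num_classes):
    """matrix[t][p] counts test images of true class t predicted as class p."""
    matrix = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(true_labels, predicted_labels):
        matrix[t][p] += 1
    return matrix


def per_class_accuracy(matrix):
    """Fraction of each true class that was predicted correctly (diagonal / row sum)."""
    return [row[i] / sum(row) if sum(row) else 0.0 for i, row in enumerate(matrix)]


# Toy labels for three hypothetical books (0, 1, 2).
true_labels = [0, 0, 1, 1, 2, 2]
predicted_labels = [0, 1, 1, 1, 0, 2]
matrix = confusion_matrix(true_labels, predicted_labels, num_classes=3)
print(matrix)                      # [[1, 1, 0], [0, 2, 0], [1, 0, 1]]
print(per_class_accuracy(matrix))  # [0.5, 1.0, 0.5]
```

Off-diagonal entries show which books are mistaken for which, which is the kind of information a confusion matrix adds beyond a single accuracy number.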

Hough Transform
Assumptions:
Books are rectangular, with some contrast between the books and the clutter.
Not many “false” strong edges in the clutter.
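The voting idea behind the Hough transform for straight lines can be sketched as below. This is only an illustration under stated assumptions: edge points are taken as already detected, lines are parameterized as rho = x cos(theta) + y sin(theta), and the function name and toy input are hypothetical. The slides do not describe how detected lines are turned into a crop rectangle, so that step is not shown.

```python
import math


def hough_strongest_line(edge_points, width, height, num_angles=180):
    """Vote in (rho, theta) space and return (rho, theta, votes) of the top bin."""
    max_rho = math.ceil(math.hypot(width, height))
    # Rows cover rho in [-max_rho, max_rho]; columns cover theta in [0, pi).
    accumulator = [[0] * num_angles for _ in range(2 * max_rho + 1)]
    for x, y in edge_points:
        for angle_index in range(num_angles):
            theta = math.pi * angle_index / num_angles
            rho = round(x * math.cos(theta) + y * math.sin(theta))
            accumulator[rho + max_rho][angle_index] += 1

    best_rho, best_theta, best_votes = 0, 0.0, 0
    for rho_index, row in enumerate(accumulator):
        for angle_index, votes in enumerate(row):
            if votes > best_votes:
                best_votes = votes
                best_rho = rho_index - max_rho
                best_theta = math.pi * angle_index / num_angles
    return best_rho, best_theta, best_votes


# Toy input (assumption): a vertical edge at x = 5 in a 100 x 100 image.
edge_points = [(5, y) for y in range(100)]
rho, theta, votes = hough_strongest_line(edge_points, width=100, height=100)
print(rho, theta, votes)  # 5 0.0 100
```

A long straight book edge collects many votes in a single bin, which is why the method relies on the assumptions above: strong edges in the clutter would produce competing peaks.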

Auto-cropping results (1/2)
Easier cases:

Auto-cropping results (2/2)
Really difficult cases:

More Challenging: Experiments
Results:
Without cropping: 36.24% (MAP)
With auto-cropping: 58.60% (MAP)
