Seminar報告_20150520

•Download as PPTX, PDF•

2 likes•1,234 views

Po-Jen Lai

2015.05.20的seminar報告，就當作提早把碩論要講的內容大致整理一次。

Technology

3D Pose Estimation for
Transparent Objects
Presenter: 賴柏任
Advisor:羅仁權教授
05.20.2015

Motivation
• Transparent objects are everywhere
• If we know he pose, we can grasp it!
2

Problems
3
Color of
transparent
object changes
Hard to locate
transparent
objects
Edge of
transparent objects
are blur
Hard to estimate
pose of
transparent objects

Effective cure
4
Color of
transparent
object changes
Edge of
transparent
objects are blur

Kinect v.s. Color changes
• Transparent objects produce NaN in
depth map
5
Ref: I. Lysenkov and V. Rabaud, "Pose estimation of rigid transparent objects in
transparent clutter," in Robotics and Automation (ICRA), 2013 IEEE International
Conference on, 2013, pp. 162-169.

Graphcut v.s. Blur edge
• Given foreground & background clue
6
Ref: C. Rother, V. Kolmogorov, and A. Blake, "Grabcut: Interactive foreground
extraction using iterated graph cuts," ACM Transactions on Graphics (TOG), vol.
23, pp. 309-314, 2004.

Graphcut v.s. Blur edge
• Generate the prob. distribution
7
Ref: C. Rother, V. Kolmogorov, and A. Blake, "Grabcut: Interactive foreground
extraction using iterated graph cuts," ACM Transactions on Graphics (TOG), vol.
23, pp. 309-314, 2004.

Graphcut v.s. Blur edge
• Use distance to compensate
8
Ref: C. Rother, V. Kolmogorov, and A. Blake, "Grabcut: Interactive foreground
extraction using iterated graph cuts," ACM Transactions on Graphics (TOG), vol.
23, pp. 309-314, 2004.

Graphcut v.s. Blur edge
• OpenCV implementation
9

A coarse pipeline
10
Detect NaN
area in
depth map
Feed the
area to
Graphcut
Segment
the edge

How to determine pose?
• Model-based matching
• Rotate in x & y axis and store the edge
11
Z-axis Y-axis
The problem becomes a 2D-
2D matching problem

Where is the model?
Wrap your
object with
paper
Use Kinect
Fusion to
construct the
model
Store the model
13

What if there are some other NaN
objects?
• Some non-transparent objects also
produce NaN in depth map
14

What if there are some other NaN
objects?
• Use characteristics of transparent object
to rule out non-transparent objects
15
Transparent
objects produce
highlights
Color of transparent
object is similar to
peripheral area

What if there are some other NaN
objects?
• Transparent objects produce highlights
16
Ref: K. McHenry, J. Ponce, and D. Forsyth, "Finding glass," in Computer Vision
and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference
on, 2005, pp. 973-979.

What if there are some other NaN
objects?
• Transparent objects produce highlights
17
Ref: K. McHenry, J. Ponce, and D. Forsyth, "Finding glass," in Computer Vision
and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference
on, 2005, pp. 973-979.
Threshold the image from 0-255
Compute the perimeter in each image
Compute the threshold by line fitting (from
255 to 0)

What if there are some other NaN
objects?
• Color of transparent object is similar to
peripheral area
18

What if there are some other NaN
objects?
• Color of transparent object is similar to
peripheral area
19
Hue histogram

Some results
• Total retrieved candidates are over 200
22
Method Recall Precision
Only NaN 86.11% 38.24%
Characteristics 86.11% 93.93%
Recall = (2/2)*100% =100%
Precision=(2/5)*100% =40%

Some other problems
• How to let robot grasp?
• Is there any choice other from Kinect?
23

How to let robot grasp?
• Teach and Play
24
Grasp
points

Is there any choice other from
Kinect?
• Extract the visual word of transparent
objects
25

Is there any choice other from
Kinect?
26
Ref: M. Fritz, G. Bradski, S. Karayev, T. Darrell, and M. J. Black, "An additive latent
feature model for transparent object recognition," in Advances in Neural
Information Processing Systems, 2009, pp. 558-566.

Is there any choice other from
Kinect?
• The result can be the input of Graphcut
27
Ref: M. Fritz, G. Bradski, S. Karayev, T. Darrell, and M. J. Black, "An additive latent
feature model for transparent object recognition," in Advances in Neural
Information Processing Systems, 2009, pp. 558-566.

Similar to Seminar報告_20150520

Point Cloud Processing: Estimating Normal Vectors and Curvature Indicators us...

Pirouz Nourian

NIPS2009: Understand Visual Scenes - Part 2

zukun

Introduction to 3D Computer Vision and Differentiable Rendering

Preferred Networks

Visual geometry with deep learning

NAVER Engineering

Video Analysis with Convolutional Neural Networks (Master Computer Vision Bar...

Universitat Politècnica de Catalunya

Slides by Amaia Salvador at the UPC Computer Vision Reading Group. Source document on GDocs with clickable links: https://docs.google.com/presentation/d/1jDTyKTNfZBfMl8OHANZJaYxsXTqGCHMVeMeBe5o1EL0/edit?usp=sharing Based on the original work: Ren, Shaoqing, Kaiming He, Ross Girshick, and Jian Sun. "Faster R-CNN: Towards real-time object detection with region proposal networks." In Advances in Neural Information Processing Systems, pp. 91-99. 2015.

Faster R-CNN: Towards real-time object detection with region proposal network...

Universitat Politècnica de Catalunya

Deep Video Object Tracking - Xavier Giro - UPC Barcelona 2019

Universitat Politècnica de Catalunya

Many learning tasks can be summarized as learning a mapping from a structured input to a structured output, such as machine translation, image captioning, image style transfer, and image dehazing. Such mappings are usually learned on paired training data, where an input sample and its corresponding output are both provided. Collecting paired training data often involves expensive human annotation, and the scale of paired training data is therefore often limited. As a result, the generalization ability of models trained on paired data is also limited. One way to mitigate this issue is learning with unpaired data, which is far less expensive to collect. Taking machine translation as an example, the unpaired training data can be collected separately from newspapers in the source language and target language without any annotation. The challenge of unpaired learning turns into how to align the unpaired data. With carefully designed objectives, unpaired learning has achieved remarkable progress on several tasks. This talk will cover the data collection and training methods of several unpaired learning tasks to illustrate the power of learning with unpaired data.

Learning with Unpaired Data

Goergen Institute for Data Science

AR/SLAM and IoT

Rakuten Group, Inc.

lecture_16_jiajun.pdf

Kuan-Tsae Huang

The oral presentation of the paper titled "Crowd Density Estimation Method using Multiple Feature Categories and Multiple Regression Models". This paper was accepted for publication and oral presentation in the 12th IEEE International Conference on Computer Engineering and Systems (ICCES 2017) held from 19 to 20 December 2017 in Cairo, Egypt. The paper proposed a new method to estimate the number of people within crowded scenes using regression analysis. The two challenges in crowd density estimation using regression analysis are perspective distortion and non-linearity. This paper solves the perspective distortion using perspective normalization which is the best way to deal with that problem based on recent works. The second challenge is solved by creating a new combination of features collected from multiple already existing categories including segmented region, texture, edge, and keypoints. This paper created a feature vector of length 164. Five regression models are used which are GPR, RF, RPF, LASSO, and KNN. Based on the experimental results, our proposed method gives better results than previous works. ---------------------------------- أحمد فوزي جاد Ahmed Fawzy Gad قسم تكنولوجيا المعلومات Information Technology (IT) Department كلية الحاسبات والمعلومات Faculty of Computers and Information (FCI) جامعة المنوفية, مصر Menoufia University, Egypt Teaching Assistant/Demonstrator ahmed.fawzy@ci.menofia.edu.eg --------------------------------- Find me on: Blog (Arabic) https://aiage-ar.blogspot.com.eg/ (English) https://aiage.blogspot.com.eg/ YouTube https://www.youtube.com/AhmedGadFCIT Google Plus https://plus.google.com/u/0/+AhmedGadIT SlideShare https://www.slideshare.net/AhmedGadFCIT LinkedIn https://www.linkedin.com/in/ahmedfgad reddit https://www.reddit.com/user/AhmedGadFCIT ResearchGate https://www.researchgate.net/profile/Ahmed_Gad13 Academia https://menofia.academia.edu/Gad Google Scholar https://scholar.google.com.eg/citations?user=r07tjocAAAAJ&hl=en Mendelay https://www.mendeley.com/profiles/ahmed-gad12 ORCID https://orcid.org/0000-0003-1978-8574 StackOverFlow http://stackoverflow.com/users/5426539/ahmed-gad Twitter https://twitter.com/ahmedfgad Facebook https://www.facebook.com/ahmed.f.gadd Pinterest https://www.pinterest.com/ahmedfgad

ICCES 2017 - Crowd Density Estimation Method using Regression Analysis

Ahmed Gad

Deep learning for object detection

Wenjing Chen

Visual Transformers

Kwanghee Choi

Phani Dathar, Ph.D., Data Science Solution Architect, Neo4j Relationships are highly predictive of behavior. Graph technology abstracts connections in our data so businesses can apply relationships and network structures to make better predictions. Hear about the journey from graph analytics and machine learning to graph-enhanced AI. We’ll also cover how enterprises are using graph data science in areas such as fraud, targeted marketing, healthcare, and recommendations.

Government GraphSummit: Leveraging Graphs for AI and ML

Neo4j

Deep learning technologies are at the core of the current revolution in artificial intelligence for multimedia data analysis. The convergence of big annotated data and affordable GPU hardware has allowed the training of neural networks for data analysis tasks which had been addressed until now with hand-crafted features. Architectures such as convolutional neural networks, recurrent neural networks and Q-nets for reinforcement learning have shaped a brand new scenario in signal processing. This course will cover the basic principles and applications of deep learning to computer vision problems, such as image classification, object detection or text captioning.

Deep Learning for Computer Vision (3/4): Video Analytics @ laSalle 2016

Universitat Politècnica de Catalunya

This is the 3rd part of the tutorial on commonsense knowledge (CSK) at ACM WSDM 2021 by Simon Razniewski, Niket Tandon and Aparna Varde. It focuses on evaluation of the acquired knowledge, both intrinsic & extrinsic, as well as highlights, outlook with a brief perspective on COVID and open issues for further research. Abstract: Commonsense knowledge is a foundational cornerstone of artificial intelligence applications. Whereas information extraction and knowledge base construction for instance-oriented assertions, such as Brad Pitt’s birth date, or Angelina Jolie’s movie awards, has received much attention, commonsense knowledge on general concepts (politicians, bicycles, printers) and activities (eating pizza, fixing printers) has only been tackled recently. In this tutorial we present state-of-the-art methodologies towards the compilation and consolidation of such commonsense knowledge (CSK). We cover text-extraction-based, multi-modal and Transformer-based techniques, with special focus on the issues of web search and ranking, as of relevance to the WSDM community.

Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3

Dr. Aparna Varde

Interpretability of Convolutional Neural Networks - Xavier Giro - UPC Barcelo...

Universitat Politècnica de Catalunya

Transformer in Vision

Sangmin Woo

Practical computer vision-- A problem-driven approach towards learning CV/ML/DL

Albert Y. C. Chen

最近の研究情勢についていくために - Deep Learningを中心に -

Hiroshi Fukui

Similar to Seminar報告_20150520 (20)

Point Cloud Processing: Estimating Normal Vectors and Curvature Indicators us...

NIPS2009: Understand Visual Scenes - Part 2

Introduction to 3D Computer Vision and Differentiable Rendering

Visual geometry with deep learning

Video Analysis with Convolutional Neural Networks (Master Computer Vision Bar...

Faster R-CNN: Towards real-time object detection with region proposal network...

Deep Video Object Tracking - Xavier Giro - UPC Barcelona 2019

Learning with Unpaired Data

AR/SLAM and IoT

lecture_16_jiajun.pdf

ICCES 2017 - Crowd Density Estimation Method using Regression Analysis

Deep learning for object detection

Visual Transformers

Government GraphSummit: Leveraging Graphs for AI and ML

Deep Learning for Computer Vision (3/4): Video Analytics @ laSalle 2016

Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3

Interpretability of Convolutional Neural Networks - Xavier Giro - UPC Barcelo...

Transformer in Vision

Practical computer vision-- A problem-driven approach towards learning CV/ML/DL

最近の研究情勢についていくために - Deep Learningを中心に -

Recently uploaded

Automating Google Workspace (GWS) & more with Apps Script

wesley chun

Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving

Edi Saputra

Discord is a free app offering voice, video, and text chat functionalities, primarily catering to the gaming community. It serves as a hub for users to create and join servers tailored to their interests. Discord’s ecosystem comprises servers, each functioning as a distinct online community with its own channels dedicated to specific topics or activities. Users can engage in text-based discussions, voice calls, or video chats within these channels. Understanding Discord Servers Discord servers are virtual spaces where users congregate to interact, share content, and build communities. Servers may revolve around gaming, hobbies, interests, or fandoms, providing a platform for like-minded individuals to connect. Communication Features Discord offers a range of communication tools, including text channels for messaging, voice channels for real-time audio conversations, and video channels for face-to-face interactions. These features facilitate seamless communication and collaboration. What Does NSFW Mean? The acronym NSFW stands for “Not Safe For Work,” indicating content that may be inappropriate for professional or public settings. NSFW Content NSFW content encompasses material that is sexually explicit, violent, or otherwise graphic in nature. It often includes nudity, profanity, or depictions of sensitive topics.

Understanding Discord NSFW Servers A Guide for Responsible Users.pdf

UK Journal

Partners Life - Insurer Innovation Award 2024

The Digital Insurer

MySQL Webinar, presented on the 25th of April, 2024. Summary: MySQL solutions enable the deployment of diverse Database Architectures tailored to specific needs, including High Availability, Disaster Recovery, and Read Scale-Out. With MySQL Shell's AdminAPI, administrators can seamlessly set up, manage, and monitor these solutions, ensuring efficiency and ease of use in their administration. MySQL Router, on the other hand, provides transparent routing from the application traffic to the backend servers in the architectures, requiring minimal configuration. Completely built in-house and supported by Oracle, these solutions have been adopted by enterprises of all sizes for their business-critical applications. In this presentation, we'll delve into various database architecture solutions to help you choose the right one based on your business requirements. Focusing on technical details and the latest features to maximize the potential of these solutions.

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...

Miguel Araújo

Effective data discovery is crucial for maintaining compliance and mitigating risks in today's rapidly evolving privacy landscape. However, traditional manual approaches often struggle to keep pace with the growing volume and complexity of data. Join us for an insightful webinar where industry leaders from TrustArc and Privya will share their expertise on leveraging AI-powered solutions to revolutionize data discovery. You'll learn how to: - Effortlessly maintain a comprehensive, up-to-date data inventory - Harness code scanning insights to gain complete visibility into data flows leveraging the advantages of code scanning over DB scanning - Simplify compliance by leveraging Privya's integration with TrustArc - Implement proven strategies to mitigate third-party risks Our panel of experts will discuss real-world case studies and share practical strategies for overcoming common data discovery challenges. They'll also explore the latest trends and innovations in AI-driven data management, and how these technologies can help organizations stay ahead of the curve in an ever-changing privacy landscape.

TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery

TrustArc

AWS Community Day CPH - Three problems of Terraform

Andrey Devyatkin

Axa Assurance Maroc - Insurer Innovation Award 2024

The Digital Insurer

Three things you will take away from the session: • How to run an effective tenant-to-tenant migration • Best practices for before, during, and after migration • Tips for using migration as a springboard to prepare for Copilot in Microsoft 365 Main ideas: Migration Overview: The presentation covers the current reality of cross-tenant migrations, the triggers, phases, best practices, and benefits of a successful tenant migration Considerations: When considering a migration, it is important to consider the migration scope, performance, customization, flexibility, user-friendly interface, automation, monitoring, support, training, scalability, data integrity, data security, cost, and licensing structure Next Wave: The next wave of change includes the launch of Copilot, which requires businesses to be prepared for upcoming changes related to Copilot and the cloud, and to consolidate data and tighten governance ShareGate: ShareGate can help with pre-migration analysis, configurable migration tool, and automated, end-user driven collaborative governance

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff

sammart93

Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood

Juan lago vázquez

Following the popularity of "Cloud Revolution: Exploring the New Wave of Serverless Spatial Data," we're thrilled to announce this much-anticipated encore webinar. In this sequel, we'll dive deeper into the Cloud-Native realm by uncovering practical applications and FME support for these new formats, including COGs, COPC, FlatGeoBuf, GeoParquet, STAC, and ZARR. Building on the foundation laid by industry leaders Michelle Roby of Radiant Earth and Chris Holmes of Planet in the first webinar, this second part offers an in-depth look at the real-world application and behind-the-scenes dynamics of these cutting-edge formats. We will spotlight specific use-cases and workflows, showcasing their efficiency and relevance in practical scenarios. Discover the vast possibilities each format holds, highlighted through detailed discussions and demonstrations. Our expert speakers will dissect the key aspects and provide critical takeaways for effective use, ensuring attendees leave with a thorough understanding of how to apply these formats in their own projects. Elevate your understanding of how FME supports these cutting-edge technologies, enhancing your ability to manage, share, and analyze spatial data. Whether you're building on knowledge from our initial session or are new to the serverless spatial data landscape, this webinar is your gateway to mastering cloud-native formats in your workflows.

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME

Safe Software

GenAI Risks & Security Meetup 01052024.pdf

lior mazor

How to Troubleshoot Apps for the Modern Connected Worker

ThousandEyes

Boost Fertility New Invention Ups Success Rates.pdf

sudhanshuwaghmare1

Strategies for Landing an Oracle DBA Job as a Fresher

Remote DBA Services

Tata AIG General Insurance Company - Insurer Innovation Award 2024

The Digital Insurer

A Principled Technologies deployment guide Conclusion Deploying VMware Cloud Foundation 5.1 on next gen Dell PowerEdge servers brings together critical virtualization capabilities and high-performing hardware infrastructure. Relying on our hands-on experience, this deployment guide offers a comprehensive roadmap that can guide your organization through the seamless integration of advanced VMware cloud solutions with the performance and reliability of Dell PowerEdge servers. In addition to the deployment efficiency, the Cloud Foundation 5.1 and PowerEdge solution delivered strong performance while running a MySQL database workload. By leveraging VMware Cloud Foundation 5.1 and PowerEdge servers, you could help your organization embrace cloud computing with confidence, potentially unlocking a new level of agility, scalability, and efficiency in your data center operations.

Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...

Principled Technologies

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024

The Digital Insurer

Scaling API-first – The story of a global engineering organization

Radu Cotescu

In this session, we will delve into strategic approaches for optimizing knowledge management within Microsoft 365, amidst the evolving landscape of Copilot. From leveraging automatic metadata classification and permission governance with SharePoint Premium, to unlocking Viva Engage for the cultivation of knowledge and communities, you will gain actionable insights to bolster your organization's knowledge-sharing initiatives. In this session, we will also explore how to facilitate solutions to enable your employees to find answers and expertise within Microsoft 365. You will leave equipped with practical techniques and a deeper understanding of how there is more to effective knowledge management than just enabling Copilot, but building actual solutions to prepare the knowledge that Copilot and your employees can use.

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...

Drew Madelung

Recently uploaded (20)

Automating Google Workspace (GWS) & more with Apps Script

Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving

Understanding Discord NSFW Servers A Guide for Responsible Users.pdf

Partners Life - Insurer Innovation Award 2024

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...

TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery

AWS Community Day CPH - Three problems of Terraform

Axa Assurance Maroc - Insurer Innovation Award 2024

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff

Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME

GenAI Risks & Security Meetup 01052024.pdf

How to Troubleshoot Apps for the Modern Connected Worker

Boost Fertility New Invention Ups Success Rates.pdf

Strategies for Landing an Oracle DBA Job as a Fresher

Tata AIG General Insurance Company - Insurer Innovation Award 2024

Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024

Scaling API-first – The story of a global engineering organization

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...

Seminar報告_20150520

1. 3D Pose Estimation for Transparent Objects Presenter: 賴柏任 Advisor:羅仁權教授 05.20.2015

2. Motivation • Transparent objects are everywhere • If we know he pose, we can grasp it! 2

3. Problems 3 Color of transparent object changes Hard to locate transparent objects Edge of transparent objects are blur Hard to estimate pose of transparent objects

4. Effective cure 4 Color of transparent object changes Edge of transparent objects are blur

5. Kinect v.s. Color changes • Transparent objects produce NaN in depth map 5 Ref: I. Lysenkov and V. Rabaud, "Pose estimation of rigid transparent objects in transparent clutter," in Robotics and Automation (ICRA), 2013 IEEE International Conference on, 2013, pp. 162-169.

6. Graphcut v.s. Blur edge • Given foreground & background clue 6 Ref: C. Rother, V. Kolmogorov, and A. Blake, "Grabcut: Interactive foreground extraction using iterated graph cuts," ACM Transactions on Graphics (TOG), vol. 23, pp. 309-314, 2004.

7. Graphcut v.s. Blur edge • Generate the prob. distribution 7 Ref: C. Rother, V. Kolmogorov, and A. Blake, "Grabcut: Interactive foreground extraction using iterated graph cuts," ACM Transactions on Graphics (TOG), vol. 23, pp. 309-314, 2004.

8. Graphcut v.s. Blur edge • Use distance to compensate 8 Ref: C. Rother, V. Kolmogorov, and A. Blake, "Grabcut: Interactive foreground extraction using iterated graph cuts," ACM Transactions on Graphics (TOG), vol. 23, pp. 309-314, 2004.

9. Graphcut v.s. Blur edge • OpenCV implementation 9

10. A coarse pipeline 10 Detect NaN area in depth map Feed the area to Graphcut Segment the edge

11. How to determine pose? • Model-based matching • Rotate in x & y axis and store the edge 11 Z-axis Y-axis The problem becomes a 2D- 2D matching problem

12. Where is the model? • Kinect Fusion 12

13. Where is the model? Wrap your object with paper Use Kinect Fusion to construct the model Store the model 13

14. What if there are some other NaN objects? • Some non-transparent objects also produce NaN in depth map 14

15. What if there are some other NaN objects? • Use characteristics of transparent object to rule out non-transparent objects 15 Transparent objects produce highlights Color of transparent object is similar to peripheral area

16. What if there are some other NaN objects? • Transparent objects produce highlights 16 Ref: K. McHenry, J. Ponce, and D. Forsyth, "Finding glass," in Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on, 2005, pp. 973-979.

17. What if there are some other NaN objects? • Transparent objects produce highlights 17 Ref: K. McHenry, J. Ponce, and D. Forsyth, "Finding glass," in Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on, 2005, pp. 973-979. Threshold the image from 0-255 Compute the perimeter in each image Compute the threshold by line fitting (from 255 to 0)

18. What if there are some other NaN objects? • Color of transparent object is similar to peripheral area 18

19. What if there are some other NaN objects? • Color of transparent object is similar to peripheral area 19 Hue histogram

20. A fine pipeline 20

21. Some results • Pose Matching 21

22. Some results • Total retrieved candidates are over 200 22 Method Recall Precision Only NaN 86.11% 38.24% Characteristics 86.11% 93.93% Recall = (2/2)*100% =100% Precision=(2/5)*100% =40%

23. Some other problems • How to let robot grasp? • Is there any choice other from Kinect? 23

24. How to let robot grasp? • Teach and Play 24 Grasp points

25. Is there any choice other from Kinect? • Extract the visual word of transparent objects 25

26. Is there any choice other from Kinect? 26 Ref: M. Fritz, G. Bradski, S. Karayev, T. Darrell, and M. J. Black, "An additive latent feature model for transparent object recognition," in Advances in Neural Information Processing Systems, 2009, pp. 558-566.

27. Is there any choice other from Kinect? • The result can be the input of Graphcut 27 Ref: M. Fritz, G. Bradski, S. Karayev, T. Darrell, and M. J. Black, "An additive latent feature model for transparent object recognition," in Advances in Neural Information Processing Systems, 2009, pp. 558-566.

28. 28 Thank you!

Seminar報告_20150520

Recommended

Recommended

More Related Content

Similar to Seminar報告_20150520

Similar to Seminar報告_20150520 (20)

Recently uploaded

Recently uploaded (20)

Seminar報告_20150520