This document discusses productionizing deep learning from the ground up. It begins with an overview of deep learning and neural networks, explaining that deep learning performs pattern recognition on unlabeled and unstructured data using deep neural networks with three or more layers. It then discusses challenges such as the computational intensity of deep learning models and the need for special hardware like GPUs. It also covers software engineering concerns in scaling deep learning to production, such as data pipelines, maintenance of GPU clusters, and the different kinds of parallelism (model and data).
3. Overview
● What is Deep Learning?
● Why is it hard?
● Problems to think about
● Conclusions
4. What is Deep Learning?
Pattern recognition on unlabeled & unstructured data.
5. What is Deep Learning?
● Deep Neural Networks >= 3 Layers
● For media/unstructured data
● Automatic Feature Engineering
● Benefits From Complex Architectures
● Computationally Intensive
● Accelerated by Special Hardware (GPUs)
7. Deep Networks >= 3 Layers
● Backpropagation-era, old-school ANNs stopped at 3 layers (input, one hidden, output); "deep" means going beyond that
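As an illustrative sketch of the ">= 3 layers" point, here is a minimal three-layer network (input, one hidden layer, softmax output) in NumPy; the layer sizes and random weights are toy values, not from the slides:

```python
import numpy as np

def relu(x):
    return np.maximum(0.0, x)

rng = np.random.default_rng(0)
W1 = rng.normal(size=(4, 8))   # input (4 features) -> hidden (8 units)
W2 = rng.normal(size=(8, 3))   # hidden -> output (3 classes)

def forward(x):
    h = relu(x @ W1)                   # hidden layer with nonlinearity
    logits = h @ W2                    # output layer
    e = np.exp(logits - logits.max())  # numerically stable softmax
    return e / e.sum()

probs = forward(np.array([0.5, -1.0, 2.0, 0.1]))
```

Adding more hidden layers between W1 and W2 is what turns this "old school" shape into a deep network.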
8. Deep Networks
● Neural Networks themselves as hidden layers
● Different types of layers can be interchanged/stacked
● Multiple layer types, each with its own hyperparameters and loss functions
14. Other kinds
● Memory Networks
● Deep Reinforcement Learning
● Adversarial Architectures
● New recursive ConvNet variant to come in 2016?
● Over 9,000 layers? (22 is already pretty common)
18. Benefits from Complex Architectures
Google’s result combined:
● LSTMs (learning captions)
● Word Embeddings
● Convolutional features from images (aligned to be the same size as the embeddings)
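The alignment above can be sketched as a linear projection that maps CNN features into the word-embedding space, so a caption model can consume image and word vectors interchangeably. The dimensions (2048, 300) and the projection matrix here are illustrative assumptions, not details from the Google result:

```python
import numpy as np

rng = np.random.default_rng(1)
conv_features = rng.normal(size=(2048,))  # hypothetical CNN feature vector
embed_dim = 300                           # hypothetical word-embedding size

# A learned projection matrix (random here) mapping the image
# features into the same space as the word embeddings.
W_proj = rng.normal(size=(2048, embed_dim)) * 0.01
image_vec = conv_features @ W_proj        # now embedding-sized
```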
19. Computationally Intensive
● One iteration of ImageNet (1,000-label dataset with over 1 million examples) takes 7 hours on GPUs
● Project Adam
● Google Brain
21. Software Engineering Concerns
● Pipelines to deal with messy data, not canned problems... (Real life is not Kaggle, people.)
● Scale/Maintenance (Clusters of GPUs aren’t done well today.)
● Different kinds of parallelism (model and data)
22. Model vs Data Parallelism
● Model parallelism shards the model itself across servers (HPC style)
● Data parallelism splits each mini-batch across model replicas
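A minimal sketch of synchronous data parallelism, assuming a toy linear model: each "worker" holds a full copy of the parameters, computes gradients on its own mini-batch shard, and the gradients are averaged before the update. (Model parallelism, by contrast, would split the parameters themselves across machines.)

```python
import numpy as np

rng = np.random.default_rng(2)
w = np.zeros(3)                            # shared model (linear regression)
X = rng.normal(size=(8, 3))
y = X @ np.array([1.0, -2.0, 0.5])

def grad(w, Xb, yb):
    # Gradient of mean squared error on one shard.
    return 2 * Xb.T @ (Xb @ w - yb) / len(yb)

shards = np.array_split(np.arange(8), 4)   # 4 workers, 2 examples each
grads = [grad(w, X[s], y[s]) for s in shards]
avg = np.mean(grads, axis=0)               # aggregate (parameter-server role)
w -= 0.1 * avg                             # synchronous update
```

With equal shard sizes, the averaged gradient equals the full-batch gradient, which is why this scheme is a drop-in way to scale mini-batch training.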
23. Vectorizing unstructured data
● Data is stored in different databases
● Different kinds of files (raw)
● Deep Learning works well on mixed signal types
25. Production Stacks today
● Hadoop/Spark not enough
● GPUs not friendly to the average programmer
● Cluster management of GPUs as a resource not typically done
● Many frameworks don’t work well in a distributed env (getting better, though)
26. Problems With Neural Nets
● Loss functions
● Scaling data
● Mixing different neural nets
● Hyperparameter tuning
28. Scaling Data
● Zero mean and unit variance
● Zero to 1
● Other forms of preprocessing relative to the distribution of the data
● Preprocessing can also be columnwise (e.g., for categorical features)
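The two scaling schemes above, applied columnwise as the slide suggests, can be sketched in NumPy (toy data):

```python
import numpy as np

X = np.array([[1.0, 200.0],
              [2.0, 400.0],
              [3.0, 600.0]])

# Zero mean and unit variance, per column.
standardized = (X - X.mean(axis=0)) / X.std(axis=0)

# Zero-to-one (min-max) scaling, also per column.
minmax = (X - X.min(axis=0)) / (X.max(axis=0) - X.min(axis=0))
```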
29. Mixing and Matching Neural Networks
● Video: ConvNet + Recurrent
● Convolutional RBMs?
● Convolutional -> Subsampling -> Fully Connected
● DBNs: different hidden and visible units for each layer
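The Convolutional -> Subsampling -> Fully Connected stack can be sketched in plain NumPy; the image, kernel, and layer sizes below are toy assumptions:

```python
import numpy as np

rng = np.random.default_rng(3)
img = rng.normal(size=(8, 8))        # toy single-channel image
kernel = rng.normal(size=(3, 3))     # one convolutional filter

# Valid convolution (cross-correlation, as most DL libraries do it).
conv = np.array([[np.sum(img[i:i+3, j:j+3] * kernel)
                  for j in range(6)] for i in range(6)])

# 2x2 max-pool subsampling.
pooled = conv.reshape(3, 2, 3, 2).max(axis=(1, 3))

# Fully connected layer on the flattened pooled features.
W_fc = rng.normal(size=(9, 4))
out = pooled.ravel() @ W_fc
```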
31. Hyperparameter Tuning (2)
● Grid search for neural nets (Don’t do it!)
● Bayesian (Getting better. There are at least priors here.)
● Gradient-based approaches (Your hyperparameters are a neural net, so there are neural nets optimizing your neural nets...)
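One common alternative to the grid search the slide warns against is random search, which covers each individual hyperparameter dimension better for the same trial budget. A minimal sketch, assuming a toy validation-loss function (the hyperparameter names and ranges are illustrative, not from the slides):

```python
import random

def val_loss(lr, dropout):
    # Stand-in for an expensive training + validation run.
    return (lr - 0.01) ** 2 + (dropout - 0.3) ** 2

random.seed(0)
# Sample hyperparameters instead of walking a fixed grid;
# the learning rate is sampled on a log scale.
trials = [(10 ** random.uniform(-4, -1), random.uniform(0.0, 0.8))
          for _ in range(20)]
best = min(trials, key=lambda t: val_loss(*t))
```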