Web data extraction
by Deskover Software
www.deskover.com
Overview
A lot of data is available online these days but you don't usually have
access to a web API to extract it.
Finance and stock info, real estate data, product catalogs, search
engine results, social network feeds, all come in various formats.
This is where UiPath comes to the rescue. With its powerful (yet easy
to use) workflow-based technology, it can automatically
recognize the repeating patterns inside structured and tabular data.
With no programming skills required, you can easily save web data in
your favorite format (Excel, CSV, text) or even deliver it directly to a
database or web service.
This tutorial
• We are going to explore the web data extraction feature in the UiPath
product by extracting catalog data from an online retail store.
• First we'll do some web automation to get to the catalog web page.
• Then we'll use the "Extract Data" wizard to automatically recognize the
repetitive patterns in the data structure.
• Optionally you can add Excel activities to save the data in XLS
format.
• Run the workflow to actually get the data.
Prerequisite step - web automation
The data you want to extract is usually not on the first page of the website.
You'll have to perform some login, search, and navigation steps first.
Fortunately, all these steps are automatically generated by the UiPath web
recorder.
All your actions have been recorded and
web activities were added to the workflow.
Now it's time to start the Extract Web Data
wizard.
Extract web data wizard
Follow the self-explanatory steps in order
for the wizard to automatically recognize
patterns in your data structure.
You have to select two similar objects for
each piece of data you're interested in (like
product name and price).
For each field you can choose the name
and the information to be extracted (like
text or url).
You can extract as many related fields as
you like.
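The idea of deriving a pattern from two similar objects can be sketched in a few lines of Python. This is only an illustration, not UiPath's actual algorithm: it assumes each selected element is described by a slash-separated path with positional indices, and the function names `generalize` and `matches` are hypothetical.

```python
def generalize(path_a: str, path_b: str) -> str:
    """Build a pattern matching both paths; differing indices become '[*]'."""
    steps_a = path_a.split("/")
    steps_b = path_b.split("/")
    if len(steps_a) != len(steps_b):
        raise ValueError("the two elements sit at different depths")
    pattern = []
    for a, b in zip(steps_a, steps_b):
        if a == b:
            pattern.append(a)
            continue
        tag_a, tag_b = a.split("[")[0], b.split("[")[0]
        if tag_a != tag_b:
            raise ValueError("the two elements do not share a structure")
        # Same tag, different position: this is the repeating part.
        pattern.append(tag_a + "[*]")
    return "/".join(pattern)


def matches(pattern: str, path: str) -> bool:
    """Check whether a concrete element path fits the generalized pattern."""
    p_steps, steps = pattern.split("/"), path.split("/")
    if len(p_steps) != len(steps):
        return False
    for p, s in zip(p_steps, steps):
        if p.endswith("[*]"):
            if s.split("[")[0] != p[:-3]:
                return False
        elif p != s:
            return False
    return True


# Two product names picked by the user (hypothetical paths).
first = "html/body/ul/li[1]/span[1]"
second = "html/body/ul/li[2]/span[1]"
name_pattern = generalize(first, second)
```

Here the two samples differ only in the list item index, so the pattern covers every item in the list while still pointing at the first `span` of each one.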
At each step the fields will be highlighted inside the web
page so you can check if the wizard correctly detected the
data pattern.
One more check to see if everything is OK and you're almost done!
Save the extracted data
By default the Extract Web Data wizard adds a CSV activity to save
the data. You can choose an Excel activity instead and specify
a .xls file location.
The connection between the output of the Extract Data activity and
the Excel activity is created automatically. The output is a DataTable,
which can also be passed to database or web service workflow
activities.
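For readers curious what the save step amounts to outside the designer, here is a minimal Python sketch that writes a small table to CSV text using the standard `csv` module. It is not UiPath code: the helper `rows_to_csv`, the column names, and the sample rows are all made up for illustration.

```python
import csv
import io


def rows_to_csv(columns, rows):
    """Serialize a header plus data rows to CSV text."""
    buffer = io.StringIO()
    writer = csv.writer(buffer)
    writer.writerow(columns)   # header row, one entry per extracted field
    writer.writerows(rows)     # one row per extracted item
    return buffer.getvalue()


# Hypothetical extraction result: product name and price.
sample_columns = ["Name", "Price"]
sample_rows = [
    ["Desk lamp", "19.99"],
    ["Office chair, black", "89.00"],
]
csv_text = rows_to_csv(sample_columns, sample_rows)
```

Note that the writer quotes the second product name because it contains a comma, which is one reason to use a CSV library rather than joining strings by hand.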
Run the workflow
You can launch the workflow from the UiPath designer or
invoke it from any .NET language (like C# or VB.NET).
At runtime, a browser is opened and all recorded web automation
steps are performed. The web data is extracted and an Excel
workbook is created.
If the data spans multiple pages, you can specify a
repeat step so that all the pages are scraped.
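The repeat step boils down to a loop that follows the "next page" link until there is none. The Python sketch below shows that loop under stated assumptions: `scrape_all_pages` and `fetch_page` are hypothetical names, and the site is simulated with an in-memory dictionary instead of a real browser.

```python
def scrape_all_pages(fetch_page, first_page):
    """Collect rows from every page, following next-page links until they end."""
    rows = []
    seen = set()               # guards against a site that links back to itself
    page_id = first_page
    while page_id is not None and page_id not in seen:
        seen.add(page_id)
        page_rows, next_page = fetch_page(page_id)
        rows.extend(page_rows)
        page_id = next_page
    return rows


# Simulated catalog: page id -> (rows on that page, id of the next page or None).
FAKE_SITE = {
    1: ([("Desk lamp", "19.99")], 2),
    2: ([("Office chair", "89.00")], 3),
    3: ([("Bookshelf", "45.50")], None),
}

all_rows = scrape_all_pages(FAKE_SITE.__getitem__, 1)
```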
Thanks!
Thank you for your time.
Please feel free to download the UiPath trial from:
http://www.uipath.com/
