www.synerzip.com
Building an Augmented IT Recruiter
Cloud Native Containerized Micro-service
Powered by NLP and ML
About the Speaker
Vinayak Joglekar
Founder & CTO, Synerzip
- Hires & Mentors Agile Software Development
Teams
- Over 3 decades of experience in Software Product
Development
- Hands-on practitioner of Agile and Lean techniques.
- Speaker at the 2008 Agile Conference in Toronto.
- Hands-on experience in QA automation, DevOps,
UX design and CD
- Blogs about trends in software development
LinkedIn Profile:
https://www.linkedin.com/in/vinayak-joglekar-b95329/
Confidential
Problem statement
• Recruiters in software companies who hire
professionals with 3 to 5 years of experience in
popular technologies like JavaScript are
inundated with resumes. Build an app that
would magically parse and rank hundreds of
resumes in a jiffy.
• Build an engaging UX so that the recruiters
would return time and again to use the app.
The app should empower the recruiter by
snugly augmenting their routine tasks.
Hook Model to build an engaging UX
• External trigger: the recruiter
receives a job requirement
• Action: about 20 to 50 freshly
sourced resumes are submitted
• Variable reward: download an
Excel tracker with all the
resumes ranked in order of
suitability
• Investment: repeat use of the
application gives more accurate
results.
Concept courtesy: Nir Eyal
Smooth experience - Action
• Point to a folder
containing all the
resumes received
• Parser extracts
important information
like contact details,
education, technical
expertise, relevant
project experience etc.
in less than 1 min.
Execution Challenges- Gate Server
• Gate server is single threaded. How do we build
a multi-user web application on top of it?
• Gate server crashes after parsing a few
hundred resumes. Ops need to restart it to
bring up the service
• A rogue resume can take very long and
eventually bring down the Gate server
• Each resume takes a few seconds – parallel
processing needed to speed up parsing.
Challenge- Gate is single threaded
[Diagram labels: GATE, Ontology, User 1, User 2, … User n, Web Server, Singleton, Queue]
Challenge-Parsing is inherently slow
[Diagram labels: Input Queue; Preprocessing (DOCX to HTML); Gate Document Parser; timeouts of 60 seconds and 90 seconds; Error Message; Dead Letter Queue; Output Queue; multiple pods, each marked P and G, spread across Instance 1 and Instance 2]
Solution
• Create a singleton Gate server that works on
multiple requests serially by using RabbitMQ
• Create multiple instances of this server by
putting each one in a Docker container.
• Use AWS to host and K8s to orchestrate
• Circuit breaker for consistent performance
• A container is killed if it times out. The document being
processed is put on the dead letter queue
• Containers restart after servicing a fixed
number of requests
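The queueing rules in the bullets above can be sketched as a small in-memory simulation. This is only an illustration: Python deques stand in for the RabbitMQ queues, a `Worker` object stands in for a container, and parse times are simulated numbers rather than real Gate calls. The names `run_worker`, `Worker` and `MAX_REQUESTS_PER_WORKER`, and the value 3, are assumptions; the 90-second limit is borrowed from the timeout shown on the diagram slide.

```python
from collections import deque
from dataclasses import dataclass

PARSE_TIMEOUT_SECONDS = 90    # taken from the diagram; which stage it applies to is assumed
MAX_REQUESTS_PER_WORKER = 3   # hypothetical; the slides only say "a fixed number"


@dataclass
class Worker:
    """Stands in for one container wrapping a single-threaded parser."""
    served: int = 0
    restarts: int = 0

    def restart(self):
        # A restart gives the next request a fresh parser process.
        self.served = 0
        self.restarts += 1


def run_worker(input_queue, simulated_parse_seconds):
    """Process documents one at a time, as a singleton parser would."""
    output_queue, dead_letter_queue = deque(), deque()
    worker = Worker()
    while input_queue:
        doc = input_queue.popleft()
        if simulated_parse_seconds[doc] > PARSE_TIMEOUT_SECONDS:
            # Timed out: park the document and replace the worker.
            dead_letter_queue.append(doc)
            worker.restart()
            continue
        output_queue.append(doc)
        worker.served += 1
        if worker.served >= MAX_REQUESTS_PER_WORKER:
            # Planned restart before the parser degrades.
            worker.restart()
    return output_queue, dead_letter_queue, worker


docs = deque(["a.docx", "b.docx", "rogue.docx", "c.docx", "d.docx", "e.docx"])
costs = {"a.docx": 4, "b.docx": 6, "rogue.docx": 500,
         "c.docx": 5, "d.docx": 3, "e.docx": 7}
done, dead, worker = run_worker(docs, costs)
print(list(done), list(dead), worker.restarts)
```

In this toy run the rogue document ends up on the dead letter queue without blocking the resumes queued behind it, which is the behaviour the circuit breaker bullet is after.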
Rewarding experience
• Quickly rank the
resumes in order of
suitability. Suitability score = sum of
the weighted scores of
various criteria like
education, technical
expertise, relevant
experience, proximity,
notice period, expected
compensation etc.
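A minimal sketch of the weighted-sum ranking described above. The weights, the criterion names and the per-criterion scores in the range 0 to 1 are made up for illustration and do not come from the talk; a criterion missing from a resume is assumed to score 0.

```python
# Hypothetical weights; the talk does not give actual values.
WEIGHTS = {
    "education": 0.2,
    "technical_expertise": 0.4,
    "relevant_experience": 0.3,
    "notice_period": 0.1,
}


def suitability(scores, weights=WEIGHTS):
    """Sum of weight * score over all criteria; missing criteria count as 0."""
    return sum(weights[c] * scores.get(c, 0.0) for c in weights)


def rank(candidates):
    """Return candidate names ordered from most to least suitable."""
    return sorted(candidates, key=lambda name: suitability(candidates[name]),
                  reverse=True)


candidates = {
    "A": {"education": 0.5, "technical_expertise": 1.0, "relevant_experience": 0.5},
    "B": {"education": 1.0, "technical_expertise": 0.5, "notice_period": 1.0},
    "C": {"technical_expertise": 1.0, "relevant_experience": 1.0, "notice_period": 1.0},
}
for name in rank(candidates):
    print(name, round(suitability(candidates[name]), 2))
```

Because the ranking depends entirely on `WEIGHTS`, changing a single weight can reorder the list.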
Challenge- Missing Information
[Diagram: typical resume layout. Name; Contact Details; Objective (Target Designation/Role); Overview (Experience); Skill Set (List of Technologies); Education (Institute, Degree, Branch, Year, repeated per entry); Experience (Company, Designation, From, To, repeated per entry); Project 1 and Project 2 (Client, Project, Description, From, To, URL, Responsibilities, Technologies); Awards and Certifications; Sports and Hobbies; Footnote (Address)]
All the terms were getting correctly
annotated, but they were not getting
properly grouped under the correct
heading.
Solution
• We had 3 annotators manually annotate more
than 1000 resumes
• We modeled this as an n-way classification
problem with each heading as a class and the
terms inside the headings and their relative
location in the resume as the features
• We achieved 98% accuracy and 95% recall
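To make the feature design concrete, here is a toy n-way classifier. The slides do not say which algorithm was used, so a small naive Bayes model with add-one smoothing is swapped in purely for illustration. The helper `features`, the four-bucket position encoding and the six training examples are all assumptions.

```python
import math
from collections import Counter, defaultdict


def features(terms, relative_position):
    """Terms in a block plus a coarse bucket for where the block sits
    in the resume (0.0 = top, 1.0 = bottom)."""
    bucket = f"pos_{min(int(relative_position * 4), 3)}"
    return [t.lower() for t in terms] + [bucket]


class NaiveBayes:
    def fit(self, examples):
        self.class_counts = Counter()
        self.feature_counts = defaultdict(Counter)
        self.vocab = set()
        for feats, label in examples:
            self.class_counts[label] += 1
            self.feature_counts[label].update(feats)
            self.vocab.update(feats)
        return self

    def predict(self, feats):
        total = sum(self.class_counts.values())

        def log_prob(label):
            counts = self.feature_counts[label]
            denom = sum(counts.values()) + len(self.vocab)  # add-one smoothing
            prior = math.log(self.class_counts[label] / total)
            return prior + sum(math.log((counts[f] + 1) / denom) for f in feats)

        return max(self.class_counts, key=log_prob)


# Made-up training blocks: (features, heading).
training = [
    (features(["B.E.", "University", "2012"], 0.30), "Education"),
    (features(["M.Tech", "Institute", "2014"], 0.35), "Education"),
    (features(["Company", "Engineer", "2015"], 0.50), "Experience"),
    (features(["Company", "Lead", "2017"], 0.55), "Experience"),
    (features(["Client", "Responsibilities", "Technologies"], 0.80), "Project"),
    (features(["Client", "URL", "Technologies"], 0.90), "Project"),
]
model = NaiveBayes().fit(training)
print(model.predict(features(["Institute", "Degree", "2014"], 0.30)))
print(model.predict(features(["Client", "Technologies"], 0.85)))
```

The position bucket lets the model use where a block sits on the page even when some of its terms were never seen in training.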
Challenge- weightages???
• Ranking was largely dependent on the
weightages assigned to suitability criteria
like education, relevant experience, notice
period, compensation, technical skills
needed etc.
• The importance assigned to these factors
was dependent on the seniority of the
position, company and specific project
needs
Solution
We collected historical information about
resumes that were shortlisted for interview in a
company for specific projects and modeled it
as a logistic regression problem, with the vector
of weightages being theta in the sigmoid
function $h_\theta(x) = 1 / (1 + e^{-\theta^T x})$
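A minimal sketch of learning the weightages from shortlisting history, assuming plain batch gradient descent on the logistic loss. The feature columns (a constant bias term, an education score, a relevant-experience score) and the six historical rows are invented for illustration, and the learning rate and epoch count are arbitrary choices.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def predict(theta, x):
    """Estimated probability that a resume with features x gets shortlisted."""
    return sigmoid(sum(t * xi for t, xi in zip(theta, x)))


def fit(X, y, learning_rate=0.5, epochs=5000):
    """Batch gradient descent on the average logistic loss."""
    theta = [0.0] * len(X[0])
    for _ in range(epochs):
        grad = [0.0] * len(theta)
        for x, label in zip(X, y):
            error = predict(theta, x) - label
            for j, xj in enumerate(x):
                grad[j] += error * xj
        theta = [t - learning_rate * g / len(X) for t, g in zip(theta, grad)]
    return theta


# Hypothetical history: [bias, education, relevant_experience] -> shortlisted?
X = [
    [1.0, 0.9, 0.2],
    [1.0, 0.9, 0.9],
    [1.0, 0.3, 0.1],
    [1.0, 0.3, 0.8],
    [1.0, 0.6, 0.7],
    [1.0, 0.6, 0.3],
]
y = [0, 1, 0, 1, 1, 0]

theta = fit(X, y)
print([round(t, 2) for t in theta])
print([round(predict(theta, x), 2) for x in X])
```

In this made-up history only relevant experience separates shortlisted from rejected resumes, so the learned weight for that column ends up larger than the one for education.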
Challenge
• Resumes will have new words: technologies
and technical terms that are
unrecognizable.
• Candidates will learn new skills that do not
exist today (big data analytics, cloud
computing and mobile programming didn’t
exist 5 years back)
• Recruiters will feel powerless and bored with
using the application if they can’t teach it to
work smarter. They want to achieve mastery.
Solution
• Created a “training set” from manually
annotated resumes. More resumes
processed = bigger training set = smarter
parsing of new resumes.
• Offset locations of the training set are
modeled as features and annotations are
modeled as their values
• CNN using TensorFlow to automatically
annotate resumes -> user empowerment!
Challenge
• The most suitable candidate as per the suitability
score and quiz score doesn’t always get
selected. Sometimes No. 2 or No. 3 is found to
be better than No. 1.
• Suitability scores are calculated using
weightages assigned to various attributes.
These weightages are based on a “hunch”
Smooth experience - Action
As frequency of use and number of
users increase:
• More terms get added to the
ontology and fewer
terms need manual
annotation
• Accuracy and recall in parsing
headings improve
• Weightages used in
computing the suitability score
become more accurate
Un-annotated terms reduce
As more
words get added to
the ontology, more
than 95% of the
words are found in
the ontology. The
drop in
unrecognized
terms is
exponential.
[Figure: number of unrecognized words (y-axis) versus number of resumes parsed (x-axis)]
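The feedback loop behind this curve can be sketched in a few lines, assuming the ontology is simply a set of known terms and that every unrecognized term is annotated by hand and added before the next resume is parsed. The function name `parse_batch` and the sample resumes are hypothetical.

```python
def parse_batch(resumes, ontology):
    """Count unrecognized terms per resume, growing the ontology as we go."""
    unrecognized_counts = []
    for terms in resumes:
        unknown = {t.lower() for t in terms} - ontology
        unrecognized_counts.append(len(unknown))
        # Stand-in for the recruiter's manual annotation step.
        ontology |= unknown
    return unrecognized_counts


ontology = {"javascript", "html"}
resumes = [
    ["JavaScript", "React", "Node.js"],
    ["React", "Node.js", "Docker"],
    ["JavaScript", "Docker", "React"],
]
counts = parse_batch(resumes, ontology)
print(counts)
```

Each resume contributes its new terms once, so later resumes that reuse the same vocabulary need less manual annotation.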
Accuracy and Recall improve
As more
resumes are
parsed, with
corrections done
manually wherever
required, they get
added to the
training set and the
recall and accuracy
improve.
[Figure: accuracy and recall % (y-axis) versus number of resumes parsed (x-axis), one curve each for accuracy and recall]
Conclusion- high tech for an engaging UX
• Machine learning models become smarter
with continued use, which keeps the users
invested in the application. Past history of
usage is the investment in this case.
• Cloud native containerized micro-services
provide an opportunity to build magically
fast, consistent and reliable responses
THANK YOU!

Agile india2018 exp_report
