SlideShare a Scribd company logo
1 of 47
Download to read offline
TLS0070 Introduction to
Legal Technology
Lecture 5 Applications I:
Information retrieval, knowledge
management, e-discovery
University of Turku Law School 2015-02-10
Anna Ronkainen @ronkaine
anna.ronkainen@onomatics.com
First a little digression (guess why...)
Google Flu Trends
-  predicting the timing and strength of
influenza epidemics based on the relative
frequency of certain keywords in searches
-  values for the model in black (dotted lines
95% confidence intervals for predicted
values), actual CDC influenza figures in red
But...
Performance after the initial period
Lessons worth learning (also for legal
applications)
-  transparency and replicability
-  use big data for understanding the unknown
-  study the algorithm
-  it’s not just about the size of the data
(from Lazer et al 2014)
Applications (general)
Application lectures overview
Applications I (this week):
-  information retrieval
-  e-discovery (e-disclosure)
-  knowledge management
Applications II (next week, 1st half):
-  case management
-  online dispute resolution
-  access to justice solutions
Applications III (next week, 2nd half):
-  decision support
-  prediction
-  automation
-  self-service
Legal tech applications not covered
here
-  general-purpose applications (like Office®/
office software)
-  legislative drafting applications
-  docket management (and other applications
for use within the judiciary)
-  courtroom visualization (etc.) software
-  ... and probably a ton of other things I don’t
even know existed
Information retrieval
Information retrieval (IR)
-  the granddaddy of legal tech applications
-  the only form of legal tech available in all
(industrial) countries at least in some form
-  making different types of static legal content
available for human consumption
-  statute law (+ commentaries)
-  case law
-  doctrine: journal articles and books
Information retrieval users
-  types of users:
-  lawyers in general
-  subgroups of lawyers (e.g. IP lawyers)
-  legal/admin support staff (e.g. tax
administrators, paralegals, informaticians)
-  other non-law professionals
-  ordinary citizens
-  different users have different needs in terms of
-  type and quantity of content required
-  terminology used
-  user interface in general
First-generation information retrieval
-  take whatever text you have (on paper) and
put it into a database
-  full-text search (exact match or wildcards)
-  structured search (in whatever fields are
available)
-  Boolean search with AND, OR, NOT
-  some metadata enhancements like keywords
(typically same as on paper)
Present-day Boolean search example:
TMview
Further developments
-  hypertext (links)
-  better search capabilities with language
technology (try searching for “back” as a
noun)
-  relevancy ranking
-  recommendations for further reading
-  morebetter metadata
An example: WestlawNext
-  natural-language and Boolean search
-  relevancy ranking of sources of law, using
(among others) a network of links between
cases
-  (commercial break, text version:
http://info.legalsolutions.thomsonreuters.com/pdf/wln2/L-355700_v2.pdf)
On the horizon
-  natural-language query interfaces and
advanced text understanding (think Watson/
Siri)
-  merging relevancy ranking with predictive
legal analytics (like a certain trademark
platform)
-  even more polarization between biggest
markets (esp. US) and others (e.g. Finland,
let alone developing countries)
Knowledge management
Knowledge management
-  taking (and improving upon!) the knowledge
(explicit and tacit!) of an organization and
putting it into optimal use
-  by no means just tech: creating and developing
processes within the organization is equally
important
-  can take different forms:
-  internal: e.g. making work product (memos,
contracts etc.) electronically searchable
-  external: creating digital legal content for use
by law firm customers
Knowledge management advantages
-  higher efficiency -> better service
-  higher quality (better dissemination of
expertise)
-  makes life easier for lawyers (increased
productivity, reduced stress)
-  keeps knowledge in the firm even if individuals
leave
-  helps with the training of new lawyers
-  necessary for good risk management
(after Kay 2003)
One knowledge management example:
contract management
-  the default solution that’s still used by many
(most?) companies: paper + binders
-  low overhead; manageable with low volumes
-  doesn’t scale (cope with large volumes) well,
e.g. finding information becomes difficult
-  particularly kludgy when documents needed
externally (due diligence, anyone?)
-  error-prone and fragile
-  still need to manage templates somewhere
(lack of central storage leads to inconsistencies)
Low-tech electronic contract
management
-  establish a central organization-wide repository for
signed contracts and official templates
-  doesn’t need proprietary software, any LAN or
cloud based (private) file sharing solution works
-  electronically searchable, at least if word processing
documents and scans are kept together
-  works well (enough) if there are good processes
(e.g. regarding file naming and organization of files)
and they are (always!) consistently adhered to
-  ...which this solution obviously cannot enforce
-  no built-in workflow management
Dedicated contract lifecycle
management (CLM) solutions
-  hundreds of providers, including two from Finland (that I
know of: M-Files and Sopima)
-  functionalities of varying sophistication for different stages
in the contract lifecycle:
-  contract and clause template libraries
-  platform and history for internal review
-  platform and history for negotiations and external
review
-  electronic signing / import of scanned definitive paper
originals
-  archiving, retrieval etc.
-  workflow management, managing access privileges etc.
Exhibit A: Sopima
Exhibit B: M-Files
https://www.youtube.com/watch?v=0b0xSVHOFIg
Electronic signing
-  real electronic signing not widespread
(outside Estonia, anyway), to a great deal
due to a lack of standards internationally
(and esp. for identifying legal persons)
-  pseudo-electronic signing (images manually
written signatures stored electronically) now
quite widespread, dedicated solutions and
support in CLM systems also available
-  the latter raises some obvious questions
about probative value
Heck, even Apple does it:
In summary: Levels of contract
management adoption
(via Juntunen 2013)
Another knowledge management
example: Fondia’s Virtual Lawyer
Fondia’s Virtual Lawyer
-  a collection of ~1700 short documents made
by Fondia staff describing the legal aspects
of particular situations
-  for external use (self-help by Fondia clients
etc.), AFAIK also used internally in an
enhanced version
-  not for total novices
-  available at virtuallawyer.fi for free,
registration required, document template
library additionally available for a fee
Electronic discovery
(disclosure)
Discovery in electronically stored
information (e-discovery)
-  emerged out of nowhere a dozen years ago
-  now a multi-billion-dollar industry (mostly US),
hundreds of providers
-  roots in more general-purpose language tech
(outside the AI & law community)
-  Enron corpus, Sedona Conference, TREC, DESI
-  storage requirements for e-mail etc. introduced
(US) by amendments to Federal Rules of Civil
Procedure in 2006
...and now* it’s already this much
widespread (in the US, anyway):
*: actually this book is from 2009
Zubulake v. UBS Warburg
-  employment law case in District Court for
Southern NY, heard 2003–2005
-  led to four groundbreaking rulings which set
the basic standards for e-discovery (before
2006 FRCP revisions), widely referred to as
Zubulake I, III, IV, V
Zubulake I and III
-  what data is considered accessible ESI
-  yes: online data/hard disks, optical disks, offline magnetic tapes
-  no: backup tapes, damaged/deleted/... data
-  no -> yes if considerable evidentiary value can be demonstrated, for
which a 7-factor test was introduced:
-  The extent to which the request is specifically tailored to discover
relevant information;
-  The availability of such information from other sources;
-  The total cost of production, compared to the amount in
controversy;
-  The total cost of production, compared to the resources available to
each party;
-  The relative ability of each party to control costs and its incentive to
do so;
-  The importance of the issues at stake in the litigation; and
-  The relative benefits to the parties of obtaining the information.
Zubulake IV
-  some backups no longer available
-  relevant emails (created after the start of the
proceedings) had been deleted
-  defendant had a duty to preserve evidence
(since relevant for ongoing/future litigation)
-  plaintiff got access to the information
-  however, plaintiff couldn’t show adverse
interference (at this stage) and was ordered
to pay the costs
Zubulake V
-  upon the plaintiff’s motion, the court
concluded that the defendant (and defence
counsel) had failed to safeguard and produce
evidence in an adequate manner
-  defendant sanctioned and ordered to pay
plaintiff’s costs for producing evidence
(witness re-examination etc.) necessary due
to plaintiff’s late (or non-)production of
relevant evidence
Outcome
-  active interference (intentional destruction
or hiding of evidence) ruled by the judge
-  jury found in favour of the plaintiff,
compensatory and punitive damages
-  reimbursement of even more costs to the
plaintiff (generally a lot more unusual in US)
E-discovery workflow
-  establish an ESI retention policy, stick to it when
creating and storing data
-  identify relevant ESI, create authentic snapshot and
collect it for further processing
-  process and filter ESI (e.g. removal of duplicates)
-  review and analyze ESI for privileged information
-  produce ESI after filtering out irrelevant, duplicated
or privileged materials
-  possibly clawback if too much produced in error
-  present at trial (if it ever goes that far)
First-generation e-discovery
-  based on lists of specific search terms (or
phrases) proposed by the plaintiff and
approved or modified by the judge
-  a bit sketchy, not even real consensus about
whether keywords cover all inflections?
-  no longer considered acceptable by many of
the most influential US judges for this field
Predictive coding
-  based on coding a (very) small subset of the
relevant document mass as responsive or not
(should/n’t be released)
-  then using that as the teaching set for a
machine learning algorithm
-  performance comparable to (or better than)
human reviewers at a fraction of the cost
E-discovery output
-  native (original) formats (e.g.: .docx)
-  usually better for the plaintiff: electronically
searchable
-  native file formats for proprietary software not
necessarily openable without that software
-  “petrified” formats (tiff, pdf)
-  often better for the defendant: almost the same
as handing out the data on paper
-  general-purpose tools enough for viewing
-  easier to redact
What’s the status with e-discovery
-  very widespread in the US (because it’s the
law!)
-  gaining popularity in the rest of Anglophonia
(because common law; tech readily available for
English)
-  some providers also support major European
and Asian languages (mostly for international
companies operating in the US)
-  rest of the world: is there even a word for this?
(then again: discovery in the common-law sense
doesn’t exist in most civil-law countries (incl.
Finland) in general)
No concrete examples
-  (because, frankly, I understand neither the field
nor the legal issue well enough)
-  but e-discovery in itself is an interesting
example of legal tech for many reasons
-  first real big data application for law
-  came out of nowhere in the early 2000s
-  now a multi-billion-dollar industry (US)
-  many startups, some notable exits (e.g.
Cataphora’s e-discovery ops to EY)
-  also continuously new funding rounds (even
$100M+) to more and more companies
Questions?

More Related Content

What's hot

Clio logikcull- leveraging e discovery date in legal practice
Clio logikcull- leveraging e discovery date in legal practiceClio logikcull- leveraging e discovery date in legal practice
Clio logikcull- leveraging e discovery date in legal practice
Clio - Cloud-Based Legal Technology
 

What's hot (20)

What is legal technology?
What is legal technology?What is legal technology?
What is legal technology?
 
Introduction to Legal Technology, lecture 4 (2015)
Introduction to Legal Technology, lecture 4 (2015)Introduction to Legal Technology, lecture 4 (2015)
Introduction to Legal Technology, lecture 4 (2015)
 
TrademarkNow (and its research background)
TrademarkNow (and its research background)TrademarkNow (and its research background)
TrademarkNow (and its research background)
 
How to do things with AI & law research
How to do things with AI & law researchHow to do things with AI & law research
How to do things with AI & law research
 
Finnish Legal Tech Forum launch presentation
Finnish Legal Tech Forum launch presentationFinnish Legal Tech Forum launch presentation
Finnish Legal Tech Forum launch presentation
 
From Research to Innovative Legal Tech Products
From Research to Innovative Legal Tech ProductsFrom Research to Innovative Legal Tech Products
From Research to Innovative Legal Tech Products
 
Complete Guide to Technology for Lawyers and Law Firms - Legodesk
Complete Guide to Technology for Lawyers and Law Firms - LegodeskComplete Guide to Technology for Lawyers and Law Firms - Legodesk
Complete Guide to Technology for Lawyers and Law Firms - Legodesk
 
Helsinki Legal Tech Meetup introduction
Helsinki Legal Tech Meetup introductionHelsinki Legal Tech Meetup introduction
Helsinki Legal Tech Meetup introduction
 
AI in legal practice – the research perspective
AI in legal practice – the research perspectiveAI in legal practice – the research perspective
AI in legal practice – the research perspective
 
Legal Case Management Software For Lawyers and Law Firms - Legodesk
Legal Case Management Software For Lawyers and Law Firms - LegodeskLegal Case Management Software For Lawyers and Law Firms - Legodesk
Legal Case Management Software For Lawyers and Law Firms - Legodesk
 
eDiscovery
eDiscoveryeDiscovery
eDiscovery
 
Helsinki Legal Tech Meetup: TrademarkNow demo/presentation
Helsinki Legal Tech Meetup: TrademarkNow demo/presentationHelsinki Legal Tech Meetup: TrademarkNow demo/presentation
Helsinki Legal Tech Meetup: TrademarkNow demo/presentation
 
Ethical machines: data mining and fairness – the optimistic view
Ethical machines: data mining and fairness – the optimistic viewEthical machines: data mining and fairness – the optimistic view
Ethical machines: data mining and fairness – the optimistic view
 
Modeling meaning and knowledge: legal knowledge
Modeling meaning and knowledge: legal knowledgeModeling meaning and knowledge: legal knowledge
Modeling meaning and knowledge: legal knowledge
 
Clio logikcull- leveraging e discovery date in legal practice
Clio logikcull- leveraging e discovery date in legal practiceClio logikcull- leveraging e discovery date in legal practice
Clio logikcull- leveraging e discovery date in legal practice
 
Interoperability: How legislation and running code should be connected, Erlen...
Interoperability: How legislation and running code should be connected, Erlen...Interoperability: How legislation and running code should be connected, Erlen...
Interoperability: How legislation and running code should be connected, Erlen...
 
Copy Right issue in computer software and hardware and IP
Copy Right issue in computer software and hardware and IPCopy Right issue in computer software and hardware and IP
Copy Right issue in computer software and hardware and IP
 
Ai and applications in the legal domain studium generale maastricht 20191101
Ai and applications in the legal domain studium generale maastricht 20191101Ai and applications in the legal domain studium generale maastricht 20191101
Ai and applications in the legal domain studium generale maastricht 20191101
 
HBS seminar 3/26/14: Dark Markets, Bad Patents, No Data
HBS seminar 3/26/14: Dark Markets, Bad Patents, No DataHBS seminar 3/26/14: Dark Markets, Bad Patents, No Data
HBS seminar 3/26/14: Dark Markets, Bad Patents, No Data
 
Commercializing legal AI research: lessons learned
Commercializing legal AI research: lessons learnedCommercializing legal AI research: lessons learned
Commercializing legal AI research: lessons learned
 

Viewers also liked

Bommarito Presentation for University of Houston Computational Law Conference
Bommarito Presentation for University of Houston Computational Law ConferenceBommarito Presentation for University of Houston Computational Law Conference
Bommarito Presentation for University of Houston Computational Law Conference
mjbommar
 
Information security management
Information security managementInformation security management
Information security management
UMaine
 
Introduction to information technology lecture 1
Introduction to information technology lecture 1Introduction to information technology lecture 1
Introduction to information technology lecture 1
adpafit
 

Viewers also liked (13)

Creating products that lawyers love (sic!) – design in legal technology
Creating products that lawyers love (sic!) – design in legal technologyCreating products that lawyers love (sic!) – design in legal technology
Creating products that lawyers love (sic!) – design in legal technology
 
Introduction to legal design: Product & project management
Introduction to legal design: Product & project managementIntroduction to legal design: Product & project management
Introduction to legal design: Product & project management
 
Information technology and law and trai
Information technology and law and traiInformation technology and law and trai
Information technology and law and trai
 
Bommarito Presentation for University of Houston Computational Law Conference
Bommarito Presentation for University of Houston Computational Law ConferenceBommarito Presentation for University of Houston Computational Law Conference
Bommarito Presentation for University of Houston Computational Law Conference
 
Tietokone korvaa juristin – vai korvaako?
Tietokone korvaa juristin – vai korvaako?Tietokone korvaa juristin – vai korvaako?
Tietokone korvaa juristin – vai korvaako?
 
Electronic Surveillance of Communications 100225
Electronic Surveillance of Communications 100225Electronic Surveillance of Communications 100225
Electronic Surveillance of Communications 100225
 
Law relating to information technology
Law relating to information technologyLaw relating to information technology
Law relating to information technology
 
Information security management
Information security managementInformation security management
Information security management
 
Introduction to information technology lecture 1
Introduction to information technology lecture 1Introduction to information technology lecture 1
Introduction to information technology lecture 1
 
Internet and cyberspace
Internet and cyberspaceInternet and cyberspace
Internet and cyberspace
 
Data Representation in Computers
Data Representation in ComputersData Representation in Computers
Data Representation in Computers
 
Introduction to information technology lecture 1
Introduction to information technology   lecture 1Introduction to information technology   lecture 1
Introduction to information technology lecture 1
 
Introduction to Cyber Law
Introduction to Cyber LawIntroduction to Cyber Law
Introduction to Cyber Law
 

Similar to Introduction to Legal Technology, lecture 5 (2015)

computer forensics, involves the preservation, identification, extraction, an...
computer forensics, involves the preservation, identification, extraction, an...computer forensics, involves the preservation, identification, extraction, an...
computer forensics, involves the preservation, identification, extraction, an...
pable2
 
Work towards a quantitative model of risk in patent litigation
Work towards a quantitative model of risk in patent litigationWork towards a quantitative model of risk in patent litigation
Work towards a quantitative model of risk in patent litigation
Kripa (कृपा) Rajshekhar
 
eDiscovery A-Z - June 2011
eDiscovery A-Z - June 2011eDiscovery A-Z - June 2011
eDiscovery A-Z - June 2011
eamonnsfl
 
AZ to eDiscovery
AZ to eDiscoveryAZ to eDiscovery
AZ to eDiscovery
eamonnsfl
 
Total Evidence White Paper
Total Evidence White PaperTotal Evidence White Paper
Total Evidence White Paper
Kevin Featherly
 
Vigiles Overview June 2010
Vigiles Overview June 2010Vigiles Overview June 2010
Vigiles Overview June 2010
Graeme McGowan
 
This project is due by the end of the residency weekend. Purpose.docx
This project is due by the end of the residency weekend. Purpose.docxThis project is due by the end of the residency weekend. Purpose.docx
This project is due by the end of the residency weekend. Purpose.docx
rowthechang
 
Ontology Tutorial: Semantic Technology for Intelligence, Defense and Security
Ontology Tutorial: Semantic Technology for Intelligence, Defense and SecurityOntology Tutorial: Semantic Technology for Intelligence, Defense and Security
Ontology Tutorial: Semantic Technology for Intelligence, Defense and Security
Barry Smith
 

Similar to Introduction to Legal Technology, lecture 5 (2015) (20)

computer forensics, involves the preservation, identification, extraction, an...
computer forensics, involves the preservation, identification, extraction, an...computer forensics, involves the preservation, identification, extraction, an...
computer forensics, involves the preservation, identification, extraction, an...
 
FLOSS vs proprietary software - what is best for business?
FLOSS vs proprietary software - what is best for business?FLOSS vs proprietary software - what is best for business?
FLOSS vs proprietary software - what is best for business?
 
Computer forencis
Computer forencisComputer forencis
Computer forencis
 
Work towards a quantitative model of risk in patent litigation
Work towards a quantitative model of risk in patent litigationWork towards a quantitative model of risk in patent litigation
Work towards a quantitative model of risk in patent litigation
 
Chapter1
Chapter1Chapter1
Chapter1
 
Module No. 1 – Information Processing
Module No. 1 – Information ProcessingModule No. 1 – Information Processing
Module No. 1 – Information Processing
 
Audio Discovery by Albert Kassis
 Audio  Discovery by Albert Kassis  Audio  Discovery by Albert Kassis
Audio Discovery by Albert Kassis
 
A Practical Guide to Capturing, Organizing, and Securing Your Documents
A Practical Guide to Capturing, Organizing, and Securing Your DocumentsA Practical Guide to Capturing, Organizing, and Securing Your Documents
A Practical Guide to Capturing, Organizing, and Securing Your Documents
 
Stefano Menegazzi Alessandro Rossi Amalia de Götzen Andy Peruccon - ZOOOM Pro...
Stefano Menegazzi Alessandro Rossi Amalia de Götzen Andy Peruccon - ZOOOM Pro...Stefano Menegazzi Alessandro Rossi Amalia de Götzen Andy Peruccon - ZOOOM Pro...
Stefano Menegazzi Alessandro Rossi Amalia de Götzen Andy Peruccon - ZOOOM Pro...
 
eDiscovery A-Z - June 2011
eDiscovery A-Z - June 2011eDiscovery A-Z - June 2011
eDiscovery A-Z - June 2011
 
AZ to eDiscovery
AZ to eDiscoveryAZ to eDiscovery
AZ to eDiscovery
 
Establishing conclusive proof in Forensic Data Analytics
Establishing conclusive proof in Forensic Data AnalyticsEstablishing conclusive proof in Forensic Data Analytics
Establishing conclusive proof in Forensic Data Analytics
 
Total Evidence White Paper
Total Evidence White PaperTotal Evidence White Paper
Total Evidence White Paper
 
Introduction to Legal Technology, lecture 9 (2018)
Introduction to Legal Technology, lecture 9 (2018)Introduction to Legal Technology, lecture 9 (2018)
Introduction to Legal Technology, lecture 9 (2018)
 
Metadata: Digital Humanties
Metadata: Digital HumantiesMetadata: Digital Humanties
Metadata: Digital Humanties
 
Vigiles Overview June 2010
Vigiles Overview June 2010Vigiles Overview June 2010
Vigiles Overview June 2010
 
Forensic Lab Development
Forensic Lab DevelopmentForensic Lab Development
Forensic Lab Development
 
NovaGenesis: Overview and Security Aspects
NovaGenesis: Overview and Security AspectsNovaGenesis: Overview and Security Aspects
NovaGenesis: Overview and Security Aspects
 
This project is due by the end of the residency weekend. Purpose.docx
This project is due by the end of the residency weekend. Purpose.docxThis project is due by the end of the residency weekend. Purpose.docx
This project is due by the end of the residency weekend. Purpose.docx
 
Ontology Tutorial: Semantic Technology for Intelligence, Defense and Security
Ontology Tutorial: Semantic Technology for Intelligence, Defense and SecurityOntology Tutorial: Semantic Technology for Intelligence, Defense and Security
Ontology Tutorial: Semantic Technology for Intelligence, Defense and Security
 

More from Anna Ronkainen

More from Anna Ronkainen (6)

Tavaramerkki. NYT! IPR-palveluihin uusi ulottuvuus
Tavaramerkki. NYT! IPR-palveluihin uusi ulottuvuusTavaramerkki. NYT! IPR-palveluihin uusi ulottuvuus
Tavaramerkki. NYT! IPR-palveluihin uusi ulottuvuus
 
Tulevaisuus on jo täällä
Tulevaisuus on jo täälläTulevaisuus on jo täällä
Tulevaisuus on jo täällä
 
Product management – what makes or breaks a startup
Product management – what makes or breaks a startupProduct management – what makes or breaks a startup
Product management – what makes or breaks a startup
 
Product management (at Boost Turku Startup Journey 2015)
Product management (at Boost Turku Startup Journey 2015)Product management (at Boost Turku Startup Journey 2015)
Product management (at Boost Turku Startup Journey 2015)
 
Technology can transform the law – but only if done right (ReInvent Law Londo...
Technology can transform the law – but only if done right (ReInvent Law Londo...Technology can transform the law – but only if done right (ReInvent Law Londo...
Technology can transform the law – but only if done right (ReInvent Law Londo...
 
Intelligent Trademark Analysis: Experiments in Large-Scale Evaluation of Real...
Intelligent Trademark Analysis: Experiments in Large-Scale Evaluation of Real...Intelligent Trademark Analysis: Experiments in Large-Scale Evaluation of Real...
Intelligent Trademark Analysis: Experiments in Large-Scale Evaluation of Real...
 

Recently uploaded

一比一原版(Essex毕业证书)埃塞克斯大学毕业证学位证书
一比一原版(Essex毕业证书)埃塞克斯大学毕业证学位证书一比一原版(Essex毕业证书)埃塞克斯大学毕业证学位证书
一比一原版(Essex毕业证书)埃塞克斯大学毕业证学位证书
F La
 
一比一原版(TheAuckland毕业证书)新西兰奥克兰大学毕业证如何办理
一比一原版(TheAuckland毕业证书)新西兰奥克兰大学毕业证如何办理一比一原版(TheAuckland毕业证书)新西兰奥克兰大学毕业证如何办理
一比一原版(TheAuckland毕业证书)新西兰奥克兰大学毕业证如何办理
F La
 
一比一原版(CQU毕业证书)中央昆士兰大学毕业证如何办理
一比一原版(CQU毕业证书)中央昆士兰大学毕业证如何办理一比一原版(CQU毕业证书)中央昆士兰大学毕业证如何办理
一比一原版(CQU毕业证书)中央昆士兰大学毕业证如何办理
Airst S
 
一比一原版(IC毕业证书)帝国理工学院毕业证如何办理
一比一原版(IC毕业证书)帝国理工学院毕业证如何办理一比一原版(IC毕业证书)帝国理工学院毕业证如何办理
一比一原版(IC毕业证书)帝国理工学院毕业证如何办理
Fir La
 
一比一原版(KPU毕业证书)加拿大昆特兰理工大学毕业证如何办理
一比一原版(KPU毕业证书)加拿大昆特兰理工大学毕业证如何办理一比一原版(KPU毕业证书)加拿大昆特兰理工大学毕业证如何办理
一比一原版(KPU毕业证书)加拿大昆特兰理工大学毕业证如何办理
e9733fc35af6
 
一比一原版(Warwick毕业证书)华威大学毕业证如何办理
一比一原版(Warwick毕业证书)华威大学毕业证如何办理一比一原版(Warwick毕业证书)华威大学毕业证如何办理
一比一原版(Warwick毕业证书)华威大学毕业证如何办理
Fir La
 
一比一原版(Waterloo毕业证书)加拿大滑铁卢大学毕业证如何办理
一比一原版(Waterloo毕业证书)加拿大滑铁卢大学毕业证如何办理一比一原版(Waterloo毕业证书)加拿大滑铁卢大学毕业证如何办理
一比一原版(Waterloo毕业证书)加拿大滑铁卢大学毕业证如何办理
e9733fc35af6
 
一比一原版伦敦南岸大学毕业证如何办理
一比一原版伦敦南岸大学毕业证如何办理一比一原版伦敦南岸大学毕业证如何办理
一比一原版伦敦南岸大学毕业证如何办理
Airst S
 
一比一原版(JCU毕业证书)詹姆斯库克大学毕业证如何办理
一比一原版(JCU毕业证书)詹姆斯库克大学毕业证如何办理一比一原版(JCU毕业证书)詹姆斯库克大学毕业证如何办理
一比一原版(JCU毕业证书)詹姆斯库克大学毕业证如何办理
Airst S
 
一比一原版(Carleton毕业证书)加拿大卡尔顿大学毕业证如何办理
一比一原版(Carleton毕业证书)加拿大卡尔顿大学毕业证如何办理一比一原版(Carleton毕业证书)加拿大卡尔顿大学毕业证如何办理
一比一原版(Carleton毕业证书)加拿大卡尔顿大学毕业证如何办理
e9733fc35af6
 

Recently uploaded (20)

ARTICLE 370 PDF about the indian constitution.
ARTICLE 370 PDF about the  indian constitution.ARTICLE 370 PDF about the  indian constitution.
ARTICLE 370 PDF about the indian constitution.
 
一比一原版(Essex毕业证书)埃塞克斯大学毕业证学位证书
一比一原版(Essex毕业证书)埃塞克斯大学毕业证学位证书一比一原版(Essex毕业证书)埃塞克斯大学毕业证学位证书
一比一原版(Essex毕业证书)埃塞克斯大学毕业证学位证书
 
Reason Behind the Success of Law Firms in India
Reason Behind the Success of Law Firms in IndiaReason Behind the Success of Law Firms in India
Reason Behind the Success of Law Firms in India
 
一比一原版(TheAuckland毕业证书)新西兰奥克兰大学毕业证如何办理
一比一原版(TheAuckland毕业证书)新西兰奥克兰大学毕业证如何办理一比一原版(TheAuckland毕业证书)新西兰奥克兰大学毕业证如何办理
一比一原版(TheAuckland毕业证书)新西兰奥克兰大学毕业证如何办理
 
The doctrine of harmonious construction under Interpretation of statute
The doctrine of harmonious construction under Interpretation of statuteThe doctrine of harmonious construction under Interpretation of statute
The doctrine of harmonious construction under Interpretation of statute
 
Elective Course on Forensic Science in Law
Elective Course on Forensic Science  in LawElective Course on Forensic Science  in Law
Elective Course on Forensic Science in Law
 
一比一原版(CQU毕业证书)中央昆士兰大学毕业证如何办理
一比一原版(CQU毕业证书)中央昆士兰大学毕业证如何办理一比一原版(CQU毕业证书)中央昆士兰大学毕业证如何办理
一比一原版(CQU毕业证书)中央昆士兰大学毕业证如何办理
 
一比一原版(IC毕业证书)帝国理工学院毕业证如何办理
一比一原版(IC毕业证书)帝国理工学院毕业证如何办理一比一原版(IC毕业证书)帝国理工学院毕业证如何办理
一比一原版(IC毕业证书)帝国理工学院毕业证如何办理
 
Hely-Hutchinson v. Brayhead Ltd .pdf
Hely-Hutchinson v. Brayhead Ltd         .pdfHely-Hutchinson v. Brayhead Ltd         .pdf
Hely-Hutchinson v. Brayhead Ltd .pdf
 
一比一原版(KPU毕业证书)加拿大昆特兰理工大学毕业证如何办理
一比一原版(KPU毕业证书)加拿大昆特兰理工大学毕业证如何办理一比一原版(KPU毕业证书)加拿大昆特兰理工大学毕业证如何办理
一比一原版(KPU毕业证书)加拿大昆特兰理工大学毕业证如何办理
 
How do cyber crime lawyers in Mumbai collaborate with law enforcement agencie...
How do cyber crime lawyers in Mumbai collaborate with law enforcement agencie...How do cyber crime lawyers in Mumbai collaborate with law enforcement agencie...
How do cyber crime lawyers in Mumbai collaborate with law enforcement agencie...
 
一比一原版(Warwick毕业证书)华威大学毕业证如何办理
一比一原版(Warwick毕业证书)华威大学毕业证如何办理一比一原版(Warwick毕业证书)华威大学毕业证如何办理
一比一原版(Warwick毕业证书)华威大学毕业证如何办理
 
Cyber Laws : National and International Perspective.
Cyber Laws : National and International Perspective.Cyber Laws : National and International Perspective.
Cyber Laws : National and International Perspective.
 
一比一原版(Waterloo毕业证书)加拿大滑铁卢大学毕业证如何办理
一比一原版(Waterloo毕业证书)加拿大滑铁卢大学毕业证如何办理一比一原版(Waterloo毕业证书)加拿大滑铁卢大学毕业证如何办理
一比一原版(Waterloo毕业证书)加拿大滑铁卢大学毕业证如何办理
 
Human Rights_FilippoLuciani diritti umani.pptx
Human Rights_FilippoLuciani diritti umani.pptxHuman Rights_FilippoLuciani diritti umani.pptx
Human Rights_FilippoLuciani diritti umani.pptx
 
一比一原版伦敦南岸大学毕业证如何办理
一比一原版伦敦南岸大学毕业证如何办理一比一原版伦敦南岸大学毕业证如何办理
一比一原版伦敦南岸大学毕业证如何办理
 
Relationship Between International Law and Municipal Law MIR.pdf
Relationship Between International Law and Municipal Law MIR.pdfRelationship Between International Law and Municipal Law MIR.pdf
Relationship Between International Law and Municipal Law MIR.pdf
 
一比一原版(JCU毕业证书)詹姆斯库克大学毕业证如何办理
一比一原版(JCU毕业证书)詹姆斯库克大学毕业证如何办理一比一原版(JCU毕业证书)詹姆斯库克大学毕业证如何办理
一比一原版(JCU毕业证书)詹姆斯库克大学毕业证如何办理
 
Analysis of R V Kelkar's Criminal Procedure Code ppt- chapter 1 .pptx
Analysis of R V Kelkar's Criminal Procedure Code ppt- chapter 1 .pptxAnalysis of R V Kelkar's Criminal Procedure Code ppt- chapter 1 .pptx
Analysis of R V Kelkar's Criminal Procedure Code ppt- chapter 1 .pptx
 
一比一原版(Carleton毕业证书)加拿大卡尔顿大学毕业证如何办理
一比一原版(Carleton毕业证书)加拿大卡尔顿大学毕业证如何办理一比一原版(Carleton毕业证书)加拿大卡尔顿大学毕业证如何办理
一比一原版(Carleton毕业证书)加拿大卡尔顿大学毕业证如何办理
 

Introduction to Legal Technology, lecture 5 (2015)

  • 1. TLS0070 Introduction to Legal Technology Lecture 5 Applications I: Information retrieval, knowledge management, e-discovery University of Turku Law School 2015-02-10 Anna Ronkainen @ronkaine anna.ronkainen@onomatics.com
  • 2. First a little digression (guess why...)
  • 3. Google Flu Trends -  predicting the timing and strength of influenza epidemics based on the relative frequency of certain keywords in searches -  values for the model in black (dotted lines 95% confidence intervals for predicted values), actual CDC influenza figures in red
  • 4.
  • 6. Performance after the initial period
  • 7. Lessons worth learning (also for legal applications) -  transparency and replicability -  use big data for understanding the unknown -  study the algorithm -  it’s not just about the size of the data (from Lazer et al 2014)
  • 9. Application lectures overview Applications I (this week): -  information retrieval -  e-discovery (e-disclosure) -  knowledge management Applications II (next week, 1st half): -  case management -  online dispute resolution -  access to justice solutions Applications III (next week, 2nd half): -  decision support -  prediction -  automation -  self-service
  • 10. Legal tech applications not covered here -  general-purpose applications (like Office®/ office software) -  legislative drafting applications -  docket management (and other applications for use within the judiciary) -  courtroom visualization (etc.) software -  ... and probably a ton of other things I don’t even know existed
  • 12. Information retrieval (IR) -  the granddaddy of legal tech applications -  the only form of legal tech available in all (industrial) countries at least in some form -  making different types of static legal content available for human consumption -  statute law (+ commentaries) -  case law -  doctrine: journal articles and books
  • 13. Information retrieval users -  types of users: -  lawyers in general -  subgroups of lawyers (e.g. IP lawyers) -  legal/admin support staff (e.g. tax administrators, paralegals, informaticians) -  other non-law professionals -  ordinary citizens -  different users have different needs in terms of -  type and quantity of content required -  terminology used -  user interface in general
  • 14. First-generation information retrieval -  take whatever text you have (on paper) and put it into a database -  full-text search (exact match or wildcards) -  structured search (in whatever fields are available) -  Boolean search with AND, OR, NOT -  some metadata enhancements like keywords (typically same as on paper)
  • 15. Present-day Boolean search example: TMview
  • 16. Further developments -  hypertext (links) -  better search capabilities with language technology (try searching for “back” as a noun) -  relevancy ranking -  recommendations for further reading -  morebetter metadata
  • 17. An example: WestlawNext -  natural-language and Boolean search -  relevancy ranking of sources of law, using (among others) a network of links between cases -  (commercial break, text version: http://info.legalsolutions.thomsonreuters.com/pdf/wln2/L-355700_v2.pdf)
  • 18. On the horizon -  natural-language query interfaces and advanced text understanding (think Watson/ Siri) -  merging relevancy ranking with predictive legal analytics (like a certain trademark platform) -  even more polarization between biggest markets (esp. US) and others (e.g. Finland, let alone developing countries)
  • 20. Knowledge management -  taking (and improving upon!) the knowledge (explicit and tacit!) of an organization and putting it into optimal use -  by no means just tech: creating and developing processes within the organization is equally important -  can take different forms: -  internal: e.g. making work product (memos, contracts etc.) electronically searchable -  external: creating digital legal content for use by law firm customers
  • 21. Knowledge management advantages -  higher efficiency -> better service -  higher quality (better dissemination of expertise) -  makes life easier for lawyers (increased productivity, reduced stress) -  keeps knowledge in the firm even if individuals leave -  helps with the training of new lawyers -  necessary for good risk management (after Kay 2003)
  • 22. One knowledge management example: contract management -  the default solution that’s still used by many (most?) companies: paper + binders -  low overhead; manageable with low volumes -  doesn’t scale (cope with large volumes) well, e.g. finding information becomes difficult -  particularly kludgy when documents needed externally (due diligence, anyone?) -  error-prone and fragile -  still need to manage templates somewhere (lack of central storage leads to inconsistencies)
  • 23. Low-tech electronic contract management -  establish a central organization-wide repository for signed contracts and official templates -  doesn’t need proprietary software, any LAN or cloud based (private) file sharing solution works -  electronically searchable, at least if word processing documents and scans are kept together -  works well (enough) if there are good processes (e.g. regarding file naming and organization of files) and they are (always!) consistently adhered to -  ...which this solution obviously cannot enforce -  no built-in workflow management
  • 24. Dedicated contract lifecycle management (CLM) solutions -  hundreds of providers, including two from Finland (that I know of: M-Files and Sopima) -  functionalities of varying sophistication for different stages in the contract lifecycle: -  contract and clause template libraries -  platform and history for internal review -  platform and history for negotiations and external review -  electronic signing / import of scanned definitive paper originals -  archiving, retrieval etc. -  workflow management, managing access privileges etc.
  • 27. Electronic signing -  real electronic signing not widespread (outside Estonia, anyway), to a great deal due to a lack of standards internationally (and esp. for identifying legal persons) -  pseudo-electronic signing (images manually written signatures stored electronically) now quite widespread, dedicated solutions and support in CLM systems also available -  the latter raises some obvious questions about probative value
  • 28. Heck, even Apple does it:
  • 29. In summary: Levels of contract management adoption (via Juntunen 2013)
  • 30. Another knowledge management example: Fondia’s Virtual Lawyer
  • 31.
  • 32. Fondia’s Virtual Lawyer -  a collection of ~1700 short documents made by Fondia staff describing the legal aspects of particular situations -  for external use (self-help by Fondia clients etc.), AFAIK also used internally in an enhanced version -  not for total novices -  available at virtuallawyer.fi for free, registration required, document template library additionally available for a fee
  • 34. Discovery in electronically stored information (e-discovery) -  emerged out of nowhere a dozen years ago -  now a multi-billion-dollar industry (mostly US), hundreds of providers -  roots in more general-purpose language tech (outside the AI & law community) -  Enron corpus, Sedona Conference, TREC, DESI -  storage requirements for e-mail etc. introduced (US) by amendments to Federal Rules of Civil Procedure in 2006
  • 35. ...and now* it’s already this much widespread (in the US, anyway): *: actually this book is from 2009
  • 36. Zubulake v. UBS Warburg -  employment law case in District Court for Southern NY, heard 2003–2005 -  led to four groundbreaking rulings which set the basic standards for e-discovery (before 2006 FRCP revisions), widely referred to as Zubulake I, III, IV, V
  • 37. Zubulake I and III -  what data is considered accessible ESI -  yes: online data/hard disks, optical disks, offline magnetic tapes -  no: backup tapes, damaged/deleted/... data -  no -> yes if considerable evidentiary value can be demonstrated, for which a 7-factor test was introduced: -  The extent to which the request is specifically tailored to discover relevant information; -  The availability of such information from other sources; -  The total cost of production, compared to the amount in controversy; -  The total cost of production, compared to the resources available to each party; -  The relative ability of each party to control costs and its incentive to do so; -  The importance of the issues at stake in the litigation; and -  The relative benefits to the parties of obtaining the information.
  • 38. Zubulake IV -  some backups no longer available -  relevant emails (created after the start of the proceedings) had been deleted -  defendant had a duty to preserve evidence (since relevant for ongoing/future litigation) -  plaintiff got access to the information -  however, plaintiff couldn’t show adverse interference (at this stage) and was ordered to pay the costs
  • 39. Zubulake V -  upon the plaintiff’s motion, the court concluded that the defendant (and defence counsel) had failed to safeguard and produce evidence in an adequate manner -  defendant sanctioned and ordered to pay plaintiff’s costs for producing evidence (witness re-examination etc.) necessary due to plaintiff’s late (or non-)production of relevant evidence
  • 40. Outcome -  active interference (intentional destruction or hiding of evidence) ruled by the judge -  jury found in favour of the plaintiff, compensatory and punitive damages -  reimbursement of even more costs to the plaintiff (generally a lot more unusual in US)
  • 41. E-discovery workflow -  establish an ESI retention policy, stick to it when creating and storing data -  identify relevant ESI, create authentic snapshot and collect it for further processing -  process and filter ESI (e.g. removal of duplicates) -  review and analyze ESI for privileged information -  produce ESI after filtering out irrelevant, duplicated or privileged materials -  possibly clawback if too much produced in error -  present at trial (if it ever goes that far)
  • 42. First-generation e-discovery -  based on lists of specific search terms (or phrases) proposed by the plaintiff and approved or modified by the judge -  a bit sketchy, not even real consensus about whether keywords cover all inflections? -  no longer considered acceptable by many of the most influential US judges for this field
  • 43. Predictive coding -  based on coding a (very) small subset of the relevant document mass as responsive or not (should/n’t be released) -  then using that as the teaching set for a machine learning algorithm -  performance comparable to (or better than) human reviewers at a fraction of the cost
  • 44. E-discovery output -  native (original) formats (e.g.: .docx) -  usually better for the plaintiff: electronically searchable -  native file formats for proprietary software not necessarily openable without that software -  “petrified” formats (tiff, pdf) -  often better for the defendant: almost the same as handing out the data on paper -  general-purpose tools enough for viewing -  easier to redact
  • 45. What’s the status with e-discovery -  very widespread in the US (because it’s the law!) -  gaining popularity in the rest of Anglophonia (because common law; tech readily available for English) -  some providers also support major European and Asian languages (mostly for international companies operating in the US) -  rest of the world: is there even a word for this? (then again: discovery in the common-law sense doesn’t exist in most civil-law countries (incl. Finland) in general)
  • 46. No concrete examples -  (because, frankly, I understand neither the field nor the legal issue well enough) -  but e-discovery in itself is an interesting example of legal tech for many reasons -  first real big data application for law -  came out of nowhere in the early 2000s -  now a multi-billion-dollar industry (US) -  many startups, some notable exits (e.g. Cataphora’s e-discovery ops to EY) -  also continuously new funding rounds (even $100M+) to more and more companies