SlideShare a Scribd company logo
1 of 15
Download to read offline
mn@dbcls.jp
                 @32nm




11   1   24
•
              •
              •


11   1   24
•
              • & &
              • Okamoto, Taro   Taro Okamoto




11   1   24
•                 Freebase



              •                        OCR     PDF
                                  html table
                  Excel   CSV

              •
11   1   24
•                      Server-client

              • in-memory
              •                          Java

              •         GUI   HTML

              •                           JSON, Jython


11   1   24
• Importing
              • Filtering / faceting
              • Editing cells, columns, rows
              • Exporting
              • History

11   1   24
TSV, CSV, ...
                                                    TSV
Excel
                                                    CSV
XML, RDF/XML
                                                    Excel
JSON
                                                    HTML table
Google Spreadsheets
                                                     Templating
         .zip, .tar.gz, .tgz, tar.bz2, .gz, .bz2       YAML
                                                   MediaWiki Table

11   1   24
• "Taro Okamoto"
              • value.split(" ").reverse().join(", ")
              • "Okamoto, Taro"


11   1   24
• Editing
              • GREL      JavaScript

              •               Java




11   1   24
•             JSON

          • Undo/Redo
          •


11   1   24
•


              • Freebase, ...
              • Reconciliation Service API
               • a RESTful JSON API
                          https://code.google.com/p/google-refine/wiki/Reconciliation
                     https://code.google.com/p/google-refine/wiki/ReconcilableDataSources
11   1   24
http://code.google.com/p/simile-butterfly/


              • Stats extension
              • RDF extension


11   1   24
•                     RDF                 GUI

              •
              • RDF/XML, Turtle
              •
          http://lab.linkeddata.deri.ie/2010/grefine-rdf-extension/
11   1   24
http://lab.linkeddata.deri.ie/2010/grefine-rdf-extension/#example


              •
                  http://www.youtube.com/watch?v=_I0mLFDXlUk




11   1   24
• http://lab.linkeddata.deri.ie/2010/grefine-rdf-extension/
     • https://code.google.com/p/google-refine/
     • https://code.google.com/p/google-refine/wiki/FAQ
     • https://code.google.com/p/google-refine/wiki/
              DocumentationForUsers



11   1   24

More Related Content

Viewers also liked

SPARQL Timelinerの使い方
SPARQL Timelinerの使い方SPARQL Timelinerの使い方
SPARQL Timelinerの使い方uedayou
 
Linked data the next 5 years - From Hype to Action
Linked data the next 5 years - From Hype to ActionLinked data the next 5 years - From Hype to Action
Linked data the next 5 years - From Hype to ActionAndreas Blumauer
 
SPARQLを利用した逆マッシュアップ-プログラミングを必要としないアプリ作成方法-
SPARQLを利用した逆マッシュアップ-プログラミングを必要としないアプリ作成方法-SPARQLを利用した逆マッシュアップ-プログラミングを必要としないアプリ作成方法-
SPARQLを利用した逆マッシュアップ-プログラミングを必要としないアプリ作成方法-uedayou
 
Chainerの使い方と 自然言語処理への応用
Chainerの使い方と自然言語処理への応用Chainerの使い方と自然言語処理への応用
Chainerの使い方と 自然言語処理への応用Yuya Unno
 
自然言語処理のためのDeep Learning
自然言語処理のためのDeep Learning自然言語処理のためのDeep Learning
自然言語処理のためのDeep LearningYuta Kikuchi
 

Viewers also liked (8)

SPARQL Timelinerの使い方
SPARQL Timelinerの使い方SPARQL Timelinerの使い方
SPARQL Timelinerの使い方
 
LODを閲覧する/作成する
LODを閲覧する/作成するLODを閲覧する/作成する
LODを閲覧する/作成する
 
Linked data the next 5 years - From Hype to Action
Linked data the next 5 years - From Hype to ActionLinked data the next 5 years - From Hype to Action
Linked data the next 5 years - From Hype to Action
 
SPARQLを利用した逆マッシュアップ-プログラミングを必要としないアプリ作成方法-
SPARQLを利用した逆マッシュアップ-プログラミングを必要としないアプリ作成方法-SPARQLを利用した逆マッシュアップ-プログラミングを必要としないアプリ作成方法-
SPARQLを利用した逆マッシュアップ-プログラミングを必要としないアプリ作成方法-
 
第7回 Linked Data 勉強会 @yayamamo
第7回 Linked Data 勉強会 @yayamamo第7回 Linked Data 勉強会 @yayamamo
第7回 Linked Data 勉強会 @yayamamo
 
Chainerの使い方と 自然言語処理への応用
Chainerの使い方と自然言語処理への応用Chainerの使い方と自然言語処理への応用
Chainerの使い方と 自然言語処理への応用
 
自然言語処理のためのDeep Learning
自然言語処理のためのDeep Learning自然言語処理のためのDeep Learning
自然言語処理のためのDeep Learning
 
深層学習による自然言語処理の研究動向
深層学習による自然言語処理の研究動向深層学習による自然言語処理の研究動向
深層学習による自然言語処理の研究動向
 

More from Mitsuteru Nakao

創薬に必要なデータ統合の現在と未来
創薬に必要なデータ統合の現在と未来創薬に必要なデータ統合の現在と未来
創薬に必要なデータ統合の現在と未来Mitsuteru Nakao
 
遺伝子検査してみた
遺伝子検査してみた遺伝子検査してみた
遺伝子検査してみたMitsuteru Nakao
 
データ統合とサイバーインフラストラクチャ
データ統合とサイバーインフラストラクチャデータ統合とサイバーインフラストラクチャ
データ統合とサイバーインフラストラクチャMitsuteru Nakao
 
Galaxy Developer Conference 2010 レポート
Galaxy Developer Conference 2010 レポートGalaxy Developer Conference 2010 レポート
Galaxy Developer Conference 2010 レポートMitsuteru Nakao
 
データとツール ライフサイエンス統合データベースセンターのシナリオ
データとツール ライフサイエンス統合データベースセンターのシナリオデータとツール ライフサイエンス統合データベースセンターのシナリオ
データとツール ライフサイエンス統合データベースセンターのシナリオMitsuteru Nakao
 
ライフサイエンス統合データベースの課題:権利と法律、技術
ライフサイエンス統合データベースの課題:権利と法律、技術ライフサイエンス統合データベースの課題:権利と法律、技術
ライフサイエンス統合データベースの課題:権利と法律、技術Mitsuteru Nakao
 
Das workshop 2008 Report
Das workshop 2008 ReportDas workshop 2008 Report
Das workshop 2008 ReportMitsuteru Nakao
 

More from Mitsuteru Nakao (8)

創薬に必要なデータ統合の現在と未来
創薬に必要なデータ統合の現在と未来創薬に必要なデータ統合の現在と未来
創薬に必要なデータ統合の現在と未来
 
遺伝子検査してみた
遺伝子検査してみた遺伝子検査してみた
遺伝子検査してみた
 
データ統合とサイバーインフラストラクチャ
データ統合とサイバーインフラストラクチャデータ統合とサイバーインフラストラクチャ
データ統合とサイバーインフラストラクチャ
 
Nakao a4pp
Nakao a4ppNakao a4pp
Nakao a4pp
 
Galaxy Developer Conference 2010 レポート
Galaxy Developer Conference 2010 レポートGalaxy Developer Conference 2010 レポート
Galaxy Developer Conference 2010 レポート
 
データとツール ライフサイエンス統合データベースセンターのシナリオ
データとツール ライフサイエンス統合データベースセンターのシナリオデータとツール ライフサイエンス統合データベースセンターのシナリオ
データとツール ライフサイエンス統合データベースセンターのシナリオ
 
ライフサイエンス統合データベースの課題:権利と法律、技術
ライフサイエンス統合データベースの課題:権利と法律、技術ライフサイエンス統合データベースの課題:権利と法律、技術
ライフサイエンス統合データベースの課題:権利と法律、技術
 
Das workshop 2008 Report
Das workshop 2008 ReportDas workshop 2008 Report
Das workshop 2008 Report
 

Recently uploaded

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 

Recently uploaded (20)

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 

10分くらいでわかるGoogle RefineのRDF拡張

  • 1. mn@dbcls.jp @32nm 11 1 24
  • 2. • • 11 1 24
  • 3. • & & • Okamoto, Taro Taro Okamoto 11 1 24
  • 4. Freebase • OCR PDF html table Excel CSV • 11 1 24
  • 5. Server-client • in-memory • Java • GUI HTML • JSON, Jython 11 1 24
  • 6. • Importing • Filtering / faceting • Editing cells, columns, rows • Exporting • History 11 1 24
  • 7. TSV, CSV, ... TSV Excel CSV XML, RDF/XML Excel JSON HTML table Google Spreadsheets Templating .zip, .tar.gz, .tgz, tar.bz2, .gz, .bz2 YAML MediaWiki Table 11 1 24
  • 8. • "Taro Okamoto" • value.split(" ").reverse().join(", ") • "Okamoto, Taro" 11 1 24
  • 9. • Editing • GREL JavaScript • Java 11 1 24
  • 10. JSON • Undo/Redo • 11 1 24
  • 11. • Freebase, ... • Reconciliation Service API • a RESTful JSON API https://code.google.com/p/google-refine/wiki/Reconciliation https://code.google.com/p/google-refine/wiki/ReconcilableDataSources 11 1 24
  • 12. http://code.google.com/p/simile-butterfly/ • Stats extension • RDF extension 11 1 24
  • 13. RDF GUI • • RDF/XML, Turtle • http://lab.linkeddata.deri.ie/2010/grefine-rdf-extension/ 11 1 24
  • 14. http://lab.linkeddata.deri.ie/2010/grefine-rdf-extension/#example • http://www.youtube.com/watch?v=_I0mLFDXlUk 11 1 24
  • 15. • http://lab.linkeddata.deri.ie/2010/grefine-rdf-extension/ • https://code.google.com/p/google-refine/ • https://code.google.com/p/google-refine/wiki/FAQ • https://code.google.com/p/google-refine/wiki/ DocumentationForUsers 11 1 24