SlideShare a Scribd company logo
1 of 13
Download to read offline
NIPS-2010
                                       @



           • b-bit Minwise Hashing for Estimating Three-
                Way Similarities. P. Li et al.

                •
           • Functional Geometry Alignment and
                Localization of Brain Areas. Langs et al.

                •
2011   2   14
b-bit Minwise Hashing for
        Estimating Three-Way
               Similarities

                • Minwise Hashing (MinHash)   ?

                • b-bit Minwise Hasing    ?




2011   2   14
Motivation
       •
            •              ,
            •
            •                          Web
            •
            •
       •                       2   (               )
       •               Minwise Hasing (MinHash) [Broder 1997]
                sign random projections (simhash) Hamming
                Distance LSH
2011   2   14
Minwise Hashing
       •                 Jaccard
                                                            |A ∩ B|
                                                  J(A, B) =
                                                            |A ∪ B|
       •
       •        Random parmutation (or Hash        ) π(x)
       •             A             π(x)        Pr[min(π(A)) = min(π(B))]
                Pr[min(π(A)) = min(π(B))] = J(A, B)

       •             A = {1, 3, 5, 7}, B = {3, 4, 5}
                       ⇒ A ∩ B = {3, 5}, A ∪ B = {1, 3, 4, 5, 7}
           •      min(h(A)) = min(h(B))                     {1,3,4,5,7}
                                     3    5
                 •   Jaccard

2011   2   14
•




           •
                                Hash                       bit


                •   Altavista                             40bit Fetterly
                    WWW03          64bit

           •                           Hash   1 or 2bit

           •
2011   2   14
•                 2
                    •
                    •   Jaccard       (0.5   )




2011   2   14
• b-Bit Minwise Hashing for Estimating
                    Three-Way Similarities NIPS2010

                • b-Bit Minwise Hashing   3
                    Jaccard
                                    |A ∩ B ∩ C|
                       J(A, B, C) =
                                    |A ∪ B ∪ C|

                •
2011   2   14
Functional Geometry Alignment
             and Localization of Brain Areas
                                 Registration based on anatomical data   Registration based on the function




                                  brain 1    registration    brain 2       brain 1   embedding           re


                                 Figure 1: Standard anatomical registration and the proposed fun
mical data     Registration basedtional geometry geometry matches the diffusion maps of fMRI
                                  on the functional alignment




                            Integrating functional features into the registration process prom
brain 2        brain 1 embedding proposed methods match the centers of activated cortica
                            cently       registration       embedding brain 2
                            correspondences of cortical surfaces [18]. The fMRI signals at t
                            vector, and registration is performed by maximizing the inter-su
mical2 registration and the proposed functional geometry alignment. Func- warp to
  2011  14
                            points, while at the same time regularizing the surface
Motivation
       •
       •                    fMRI


       •
            •
                                                  ?


                      Above-threshold region in       Above-threshold region in
2011   2   14         source subject                  target subject
• fMRI
       • Voxel                               (           Kernel)


       • Diffusion Maps
       •              Voxel

                a. Maps of two subjects




                                  s0             Ψ0          Ψ1        s1
                                Subject 1        Map 1       Map 2   Subject 2



2011   2   14   b. Aligning the point sets
Diffusion Maps
       •    Coifman and Lafon. Applied and Comp. Harmonic Analysis. 2006
       •                    PCA      Isomap
       •    Spectral Clustering

                                  •
                                  •         i,j     t                  i   Markov
                                          chain random walk        t


                                      •    Normalized Graph Laplacian
                                  •
                                          Diffusion Distance
                                  •       Diffusion Distance
                                                     N   (N    )

2011   2   14
a. Maps of two subjects




                             s0                          Ψ0                        Ψ1             s1
                           Subject 1                     Map 1                    Map 2         Subject 2



           b. Aligning the point sets

                                                                 xk
                                                                  0
                                                                          xl
                                                                           1

                                                                                                                          A.



                                                                                                                    FGA
                                                                                                            0.2
                                                                                                              0.2



                                                                      ?
       Figure 2: Maps of two subjects in the process of registration: (a) Left and right: the0.15  axial and
                                                                                               0.15
       sagittal views of the points in the two brains. The two central columns show plots of the first
       three dimensions of the embedding in the functional geometry after coarse rotational alignment. (b)
       During alignment, a maps is represented as a Gaussian mixture model. The colors in both plots
       indicate clusters which are only region in visualization. Above-threshold region in
                          Above-threshold used for                                              0.1
                                                                                                  0.1


2011   2    14                          source subject                         target subject
2011   2   14

More Related Content

More from sesejun

RNAseqによる変動遺伝子抽出の統計: A Review
RNAseqによる変動遺伝子抽出の統計: A ReviewRNAseqによる変動遺伝子抽出の統計: A Review
RNAseqによる変動遺伝子抽出の統計: A Reviewsesejun
 
バイオインフォマティクスによる遺伝子発現解析
バイオインフォマティクスによる遺伝子発現解析バイオインフォマティクスによる遺伝子発現解析
バイオインフォマティクスによる遺伝子発現解析sesejun
 
次世代シーケンサが求める機械学習
次世代シーケンサが求める機械学習次世代シーケンサが求める機械学習
次世代シーケンサが求める機械学習sesejun
 
20110602labseminar pub
20110602labseminar pub20110602labseminar pub
20110602labseminar pubsesejun
 
20110524zurichngs 2nd pub
20110524zurichngs 2nd pub20110524zurichngs 2nd pub
20110524zurichngs 2nd pubsesejun
 
20110524zurichngs 1st pub
20110524zurichngs 1st pub20110524zurichngs 1st pub
20110524zurichngs 1st pubsesejun
 
Datamining 9th association_rule.key
Datamining 9th association_rule.keyDatamining 9th association_rule.key
Datamining 9th association_rule.keysesejun
 
Datamining 8th hclustering
Datamining 8th hclusteringDatamining 8th hclustering
Datamining 8th hclusteringsesejun
 
Datamining r 4th
Datamining r 4thDatamining r 4th
Datamining r 4thsesejun
 
Datamining r 3rd
Datamining r 3rdDatamining r 3rd
Datamining r 3rdsesejun
 
Datamining r 2nd
Datamining r 2ndDatamining r 2nd
Datamining r 2ndsesejun
 
Datamining r 1st
Datamining r 1stDatamining r 1st
Datamining r 1stsesejun
 
Datamining 6th svm
Datamining 6th svmDatamining 6th svm
Datamining 6th svmsesejun
 
Datamining 5th knn
Datamining 5th knnDatamining 5th knn
Datamining 5th knnsesejun
 
Datamining 4th adaboost
Datamining 4th adaboostDatamining 4th adaboost
Datamining 4th adaboostsesejun
 
Datamining 3rd naivebayes
Datamining 3rd naivebayesDatamining 3rd naivebayes
Datamining 3rd naivebayessesejun
 
Datamining 2nd decisiontree
Datamining 2nd decisiontreeDatamining 2nd decisiontree
Datamining 2nd decisiontreesesejun
 
Datamining 7th kmeans
Datamining 7th kmeansDatamining 7th kmeans
Datamining 7th kmeanssesejun
 
100401 Bioinfoinfra
100401 Bioinfoinfra100401 Bioinfoinfra
100401 Bioinfoinfrasesejun
 
Datamining 8th Hclustering
Datamining 8th HclusteringDatamining 8th Hclustering
Datamining 8th Hclusteringsesejun
 

More from sesejun (20)

RNAseqによる変動遺伝子抽出の統計: A Review
RNAseqによる変動遺伝子抽出の統計: A ReviewRNAseqによる変動遺伝子抽出の統計: A Review
RNAseqによる変動遺伝子抽出の統計: A Review
 
バイオインフォマティクスによる遺伝子発現解析
バイオインフォマティクスによる遺伝子発現解析バイオインフォマティクスによる遺伝子発現解析
バイオインフォマティクスによる遺伝子発現解析
 
次世代シーケンサが求める機械学習
次世代シーケンサが求める機械学習次世代シーケンサが求める機械学習
次世代シーケンサが求める機械学習
 
20110602labseminar pub
20110602labseminar pub20110602labseminar pub
20110602labseminar pub
 
20110524zurichngs 2nd pub
20110524zurichngs 2nd pub20110524zurichngs 2nd pub
20110524zurichngs 2nd pub
 
20110524zurichngs 1st pub
20110524zurichngs 1st pub20110524zurichngs 1st pub
20110524zurichngs 1st pub
 
Datamining 9th association_rule.key
Datamining 9th association_rule.keyDatamining 9th association_rule.key
Datamining 9th association_rule.key
 
Datamining 8th hclustering
Datamining 8th hclusteringDatamining 8th hclustering
Datamining 8th hclustering
 
Datamining r 4th
Datamining r 4thDatamining r 4th
Datamining r 4th
 
Datamining r 3rd
Datamining r 3rdDatamining r 3rd
Datamining r 3rd
 
Datamining r 2nd
Datamining r 2ndDatamining r 2nd
Datamining r 2nd
 
Datamining r 1st
Datamining r 1stDatamining r 1st
Datamining r 1st
 
Datamining 6th svm
Datamining 6th svmDatamining 6th svm
Datamining 6th svm
 
Datamining 5th knn
Datamining 5th knnDatamining 5th knn
Datamining 5th knn
 
Datamining 4th adaboost
Datamining 4th adaboostDatamining 4th adaboost
Datamining 4th adaboost
 
Datamining 3rd naivebayes
Datamining 3rd naivebayesDatamining 3rd naivebayes
Datamining 3rd naivebayes
 
Datamining 2nd decisiontree
Datamining 2nd decisiontreeDatamining 2nd decisiontree
Datamining 2nd decisiontree
 
Datamining 7th kmeans
Datamining 7th kmeansDatamining 7th kmeans
Datamining 7th kmeans
 
100401 Bioinfoinfra
100401 Bioinfoinfra100401 Bioinfoinfra
100401 Bioinfoinfra
 
Datamining 8th Hclustering
Datamining 8th HclusteringDatamining 8th Hclustering
Datamining 8th Hclustering
 

Recently uploaded

Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024SynarionITSolutions
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 

Recently uploaded (20)

Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 

20110214nips2010 read

  • 1. NIPS-2010 @ • b-bit Minwise Hashing for Estimating Three- Way Similarities. P. Li et al. • • Functional Geometry Alignment and Localization of Brain Areas. Langs et al. • 2011 2 14
  • 2. b-bit Minwise Hashing for Estimating Three-Way Similarities • Minwise Hashing (MinHash) ? • b-bit Minwise Hasing ? 2011 2 14
  • 3. Motivation • • , • • Web • • • 2 ( ) • Minwise Hasing (MinHash) [Broder 1997] sign random projections (simhash) Hamming Distance LSH 2011 2 14
  • 4. Minwise Hashing • Jaccard |A ∩ B| J(A, B) = |A ∪ B| • • Random parmutation (or Hash ) π(x) • A π(x) Pr[min(π(A)) = min(π(B))] Pr[min(π(A)) = min(π(B))] = J(A, B) • A = {1, 3, 5, 7}, B = {3, 4, 5} ⇒ A ∩ B = {3, 5}, A ∪ B = {1, 3, 4, 5, 7} • min(h(A)) = min(h(B)) {1,3,4,5,7} 3 5 • Jaccard 2011 2 14
  • 5. • Hash bit • Altavista 40bit Fetterly WWW03 64bit • Hash 1 or 2bit • 2011 2 14
  • 6. 2 • • Jaccard (0.5 ) 2011 2 14
  • 7. • b-Bit Minwise Hashing for Estimating Three-Way Similarities NIPS2010 • b-Bit Minwise Hashing 3 Jaccard |A ∩ B ∩ C| J(A, B, C) = |A ∪ B ∪ C| • 2011 2 14
  • 8. Functional Geometry Alignment and Localization of Brain Areas Registration based on anatomical data Registration based on the function brain 1 registration brain 2 brain 1 embedding re Figure 1: Standard anatomical registration and the proposed fun mical data Registration basedtional geometry geometry matches the diffusion maps of fMRI on the functional alignment Integrating functional features into the registration process prom brain 2 brain 1 embedding proposed methods match the centers of activated cortica cently registration embedding brain 2 correspondences of cortical surfaces [18]. The fMRI signals at t vector, and registration is performed by maximizing the inter-su mical2 registration and the proposed functional geometry alignment. Func- warp to 2011 14 points, while at the same time regularizing the surface
  • 9. Motivation • • fMRI • • ? Above-threshold region in Above-threshold region in 2011 2 14 source subject target subject
  • 10. • fMRI • Voxel ( Kernel) • Diffusion Maps • Voxel a. Maps of two subjects s0 Ψ0 Ψ1 s1 Subject 1 Map 1 Map 2 Subject 2 2011 2 14 b. Aligning the point sets
  • 11. Diffusion Maps • Coifman and Lafon. Applied and Comp. Harmonic Analysis. 2006 • PCA Isomap • Spectral Clustering • • i,j t i Markov chain random walk t • Normalized Graph Laplacian • Diffusion Distance • Diffusion Distance N (N ) 2011 2 14
  • 12. a. Maps of two subjects s0 Ψ0 Ψ1 s1 Subject 1 Map 1 Map 2 Subject 2 b. Aligning the point sets xk 0 xl 1 A. FGA 0.2 0.2 ? Figure 2: Maps of two subjects in the process of registration: (a) Left and right: the0.15 axial and 0.15 sagittal views of the points in the two brains. The two central columns show plots of the first three dimensions of the embedding in the functional geometry after coarse rotational alignment. (b) During alignment, a maps is represented as a Gaussian mixture model. The colors in both plots indicate clusters which are only region in visualization. Above-threshold region in Above-threshold used for 0.1 0.1 2011 2 14 source subject target subject
  • 13. 2011 2 14