SlideShare a Scribd company logo
1 of 84
Download to read offline
Disambiguating Explicit and
Implicit Geographic References
in Natural Language
Jason Baldridge
@jasonbaldridge
Computational Linguistics Lab
Department of Linguistics
UT Austin
MLConf Seattle May 20, 2016
© 2016 Jason M Baldridge MLConf, May 2016
What does “barbecue” mean?
2
© 2016 Jason M Baldridge MLConf, May 2016
What does “barbecue” mean? Barbecue’
2
© 2016 Jason M Baldridge MLConf, May 2016
What does “barbecue” mean? Barbecue’
2
© 2016 Jason M Baldridge MLConf, May 2016
What does “barbecue” mean? Barbecue’
2
© 2016 Jason M Baldridge MLConf, May 2016
What does “barbecue” mean? Barbecue’
2
© 2016 Jason M Baldridge MLConf, May 2016
What does “barbecue” mean? Barbecue’
2
© 2016 Jason M Baldridge MLConf, May 2016
What does “barbecue” mean? Barbecue’
2
© 2016 Jason M Baldridge MLConf, May 2016
What I thought semantics was before 2005
3
From: John Enrico and Jason Baldridge. 2011. Possessor Raising, Demonstrative Raising, Quantifier
Float and Number Float in Haida. International Journal of American Linguistics. 77(2):185-218
© 2016 Jason M Baldridge MLConf, May 2016
Updated perspective a la Ray Mooney (UT Austin CS)
4
http://www.cs.utexas.edu/users/ml/slides/chen-icml08.ppt
© 2016 Jason M Baldridge MLConf, May 2016
http://www.lib.utexas.edu/books/travel/index.htmlTravel at the Turn of the 20th Century
5
© 2016 Jason M Baldridge MLConf, May 2016
Motivation: Google Lit Trips [http://www.googlelittrips.com/]
6
Grapes of Wrath in Google Earth
Text
http://www.googlelittrips.com/GoogleLit/9-12/Entries/2006/11/1_The_Grapes_of_Wrath_by_John_Steinbeck.html
© 2016 Jason M Baldridge MLConf, May 2016
Look, Mom, no hands! (Err, um... no metadata.)
7
© 2016 Jason M Baldridge MLConf, May 2016
Look, Mom, no hands! (Err, um... no metadata.)
7
Topics with a clear, circumscribed
geographic focus emerge!
© 2016 Jason M Baldridge MLConf, May 2016
Metadata is now plentiful
8
© 2016 Jason M Baldridge MLConf, May 2016
01:55:55 RT @USER_dc5e5498: Drop and give me 50....
05:09:29 I said u got a swisher from redmond!? He said nah kirkland!
Lol..ooooooooOkay!
05:57:35 Lmao!:) havin a good ol time after work! Unexpected! #goodtimes
06:00:09 RT @USER_d5d93fec: #letsbereal .. No seriously, #letsbereal>>lol.
Don't start.
06:00:37 On my way to get @USER_60939380 yeee! She want some of this
strawberry! Sexy!
...
47°31’41’’ N 122°11’52’’ W
9
Geotagged Twitter
© 2016 Jason M Baldridge MLConf, May 2016
01:55:55 RT @USER_dc5e5498: Drop and give me 50....
05:09:29 I said u got a swisher from redmond!? He said nah kirkland!
Lol..ooooooooOkay!
05:57:35 Lmao!:) havin a good ol time after work! Unexpected! #goodtimes
06:00:09 RT @USER_d5d93fec: #letsbereal .. No seriously, #letsbereal>>lol.
Don't start.
06:00:37 On my way to get @USER_60939380 yeee! She want some of this
strawberry! Sexy!
...
47°31’41’’ N 122°11’52’’ W
9
Geotagged Twitter
© 2016 Jason M Baldridge MLConf, May 2016
Geotagged Wikipedia
10
30° 17′ N 97° 44′ W
© 2016 Jason M Baldridge MLConf, May 2016
Where’s a word on Earth? (according to Wikipedia)
© 2016 Jason M Baldridge MLConf, May 2016
Where’s a word on Earth? (according to Wikipedia)
mountain
© 2016 Jason M Baldridge MLConf, May 2016
Document geolocation: where is this person?
12
© 2016 Jason M Baldridge MLConf, May 201613
Amsterdam, Zaandam,Amstelveen, Diemen, Landsmeer ...
Frankfurt, Frechen, Hürth, Brühl,Wesseling, ...
Language modeling approach
Wing & Baldridge 2011: Simple supervised document geolocation with geodesic grids.
© 2016 Jason M Baldridge MLConf, May 2016
Locations of Twitter users are not uniformly distributed!
14
(Small) GeoUT (Twitter) plotted
on Google Earth, one pin per user.
Density of (all)
documents in GeoUT
over the USA
(390 million tweets)
© 2016 Jason M Baldridge MLConf, May 2016
k-d tree for geotagged Wikipedia, looking at N. America
15
Roller, Speriosu, Rallapalli,Wing & Baldridge 2014:
Supervised Text-based Geolocation Using Language Models on an Adaptive Grid.
© 2016 Jason M Baldridge MLConf, May 2016
k-d tree for geotagged Wikipedia, looking at N. America
15
Roller, Speriosu, Rallapalli,Wing & Baldridge 2014:
Supervised Text-based Geolocation Using Language Models on an Adaptive Grid.
[Serdyukov, Murdock, & van Zwol 2009; Cheng, Caverlee, & Lee 2010;Wing & Baldridge 2011]
Automatic document geolocation
[Serdyukov, Murdock, & van Zwol 2009; Cheng, Caverlee, & Lee 2010;Wing & Baldridge 2011]
Automatic document geolocation
© 2016 Jason M Baldridge MLConf, May 2016
Hierarchical geo-location with logistic regression
17
Wing & Baldridge 2014: Hierarchical Discriminative Classification for Text-Based Geolocation.
© 2016 Jason M Baldridge MLConf May 2016
Hierarchical logistic regression
18
© 2016 Jason M Baldridge MLConf May 2016
Hierarchical logistic regression
19
© 2016 Jason M Baldridge MLConf May 2016
Hierarchical logistic regression
20
© 2016 Jason M Baldridge MLConf, May 2016
Performance: hierarchical logistic regression with kd-tree grid
21
Flickr (entire world)
Half of documents geotagged within 18 km of truth
Percent of documents within 166km (100 miles): 66%
Twitter (World)
Half of users geotagged within 510 km of truth
Percent of documents within 166km (100 miles): 31%
Twitter (USA)
Half of users geotagged within 192 km of truth
Percent of documents within 166km (100 miles): 48%
© 2016 Jason M Baldridge MLConf, May 2016
Hierarchical logistic regression beats flat naive Bayes
22
Naive Bayes Hierarchical LR
Twitter USA 36.2 48.0
Twitter World 28.7 31.3
Flickr 58.5 66
English Wikipedia 84.5 88.9
German Wikipedia 89.3 90.2
Portuguese Wikipedia 77.1 89.5
Accuracy @ 161 km, kd-tree grid
© 2016 Jason M Baldridge MLConf, May 2016
Logistic regression weights good features heavily
23
© 2016 Jason M Baldridge MLConf May 2016
Automated Lit Trips
24
© 2016 Jason M Baldridge MLConf, May 2016
Toponym (place name) resolution
25
They visit Portland every year.
© 2016 Jason M Baldridge MLConf, May 2016
Toponym (place name) resolution
25
They visit Portland every year.
?
?
?
?
?
?
?
?
?
?
?
?
?
?
?
?
?
Which Portland? (Also: Canada,Australia, Ireland...)
© 2016 Jason M Baldridge MLConf, May 2016
Toponym resolution in context
26
Although Elisha Newman made the first land entry in the township of Portland (June,
1833), he did not become a settler until three years later, by which time a few settlers had
located in the town. From Mr. Newman's story, it appears that early in 1833, he was
visiting friends in Ann Arbor, and during an evening conversation discussed with others
the subject of unlocated lands lying west of Ann Arbor. One of the company (Joseph
Wood) remarked that he had been out with the party sent to survey Ionia and other
counties, and that the surveyors were struck by the valuable water-power at the mouth of
the Looking Glass River, saying there would surely be a village there some day.
Mr. Newman was at once taken with the idea of locating lands at the mouth of the Looking
Glass. Following up his impulse, he made ready to start at once, and, accompanied by
James Newman and Joseph Wood, went out to the Looking Glass on a tour of inspection.
Being satisfied with the location, he returned Eastward with his companions, and at White
Pigeon made his land entry.
Newman did not return for a permanent settlement until the spring of 1836, and
meanwhile, in November, 1833, Philo Bogue bought a piece of land on section 28, in the
bend of the Grand River, where he proposed to set up a trading post. Unaided he rolled up
a log cabin near where the Detroit, Lansing, and Northern depot was located, and when he
brought the house into decent shape went over to Hunt's at Lyons for his family, whom he
had left there against such time as he should have affairs prepared for their comfort.
© 2016 Jason M Baldridge MLConf, May 2016
Spatial minimality
27
Although Elisha Newman made the first land entry in the township of Portland (June, 1833), he did not become a settler until three years later, by
which time a few settlers had located in the town. From Mr. Newman's story, it appears that early in 1833, he was visiting friends in Ann Arbor,
and during an evening conversation discussed with others the subject of unlocated lands lying west of Ann Arbor. One of the company (Joseph
Wood) remarked that he had been out with the party sent to survey Ionia and other counties, and that the surveyors were struck by the valuable
water-power at the mouth of the Looking Glass River, saying there would surely be a village there some day.
Mr. Newman was at once taken with the idea of locating lands at the mouth of the Looking Glass. Following up his impulse, he made ready to start
at once, and, accompanied by James Newman and Joseph Wood, went out to the Looking Glass on a tour of inspection. Being satisfied with the
location, he returned Eastward with his companions, and at White Pigeon made his land entry.
Newman did not return for a permanent settlement until the spring of 1836, and meanwhile, in November, 1833, Philo Bogue bought a piece of
land on section 28, in the bend of the Grand River, where he proposed to set up a trading post. Unaided he rolled up a log cabin near where the
Detroit, Lansing, and Northern depot was located, and when he brought the house into decent shape went over to Hunt's at Lyons for his family,
whom he had left there against such time as he should have affairs prepared for their comfort.
© 2016 Jason M Baldridge MLConf, May 2016
GeoNames
4048392 Portland Mills Portland Mills 39.7781 -87.00918 P PPL US IN 133 0 223 218 America/Indiana/Indianapolis 2010-02-15
4084605 Portland Portland 32.15459 -87.1686 P PPL US AL 047 0 30 41 America/Chicago 2006-01-15
4127143 Portland Portland Portlend,Портленд33.2379 -91.51151 P PPL US AR 003 430 38 39 America/Chicago 2011-05-14
4169227 Portland Portland 30.51242 -86.19578 P PPL US FL 131 0 8 14 America/Chicago 2006-01-15
4217115 Portland Portland 34.05732 -85.03634 P PPL US GA 233 0 229 228 America/New_York 2010-09-05
4277586 Portland Portland 37.0778 -97.31227 P PPL US KS 191 0 362 364 America/Chicago 2006-01-15
4305000 Portland Portland 37.12062 -85.44608 P PPL US KY 001 0 220 223 America/Chicago 2006-01-15
4305001 Portland Portland 38.26924 -85.8108 P PPL US KY 111 0 135 138 America/Kentucky/Louisville 2006-01-15
4305002 Portland Portland 38.74812 -84.44772 P PPL US KY 191 0 265 266 America/New_York 2006-01-15
404289 Portland Portland Portlend,Портленд38.71088 -91.71767 P PPL US MO 027 0 170 172 America/Chicago 2010-01-29
4521811 Portland Portland Portlend,Портленд39.00341 -81.77124 P PPL US OH 105 0 187 188 America/New_York 2010-01-29
4650946 Portland Portland Portlend,Портленд36.58171 -86.51638 P PPL US TN 165 11480 244 245 America/Chicago 2011-05-14
4720131 Portland Portland Portlend,Портленд27.87725 -97.32388 P PPL US TX 409 15099 13 11 America/Chicago 2011-05-14
4841001 Portland Portland Portlend,Портленд41.57288 -72.64065 P PPL US CT 007 5862 24 27 America/New_York 2011-05-14
4871855 Portland Portland 43.12858 -93.12354 P PPL US IA 033 35 327 330 America/Chicago 2011-05-14
4906524 Portland Portland 41.66253 -89.98012 P PPL US IL 195 0 190 190 America/Chicago 2006-01-15
5006314 Portland Portland Portlend,Портленд42.8692 -84.90305 P PPL US MI 067 3883 221 223 America/Detroit 2011-05-14
5746545 Portland Portland 45.52345 -122.67621 P PPLA2 US OR 051 583776 12 15 America/Los_Angeles 2011-05-14
Spatial minimality
27
Although Elisha Newman made the first land entry in the township of Portland (June, 1833), he did not become a settler until three years later, by
which time a few settlers had located in the town. From Mr. Newman's story, it appears that early in 1833, he was visiting friends in Ann Arbor,
and during an evening conversation discussed with others the subject of unlocated lands lying west of Ann Arbor. One of the company (Joseph
Wood) remarked that he had been out with the party sent to survey Ionia and other counties, and that the surveyors were struck by the valuable
water-power at the mouth of the Looking Glass River, saying there would surely be a village there some day.
Mr. Newman was at once taken with the idea of locating lands at the mouth of the Looking Glass. Following up his impulse, he made ready to start
at once, and, accompanied by James Newman and Joseph Wood, went out to the Looking Glass on a tour of inspection. Being satisfied with the
location, he returned Eastward with his companions, and at White Pigeon made his land entry.
Newman did not return for a permanent settlement until the spring of 1836, and meanwhile, in November, 1833, Philo Bogue bought a piece of
land on section 28, in the bend of the Grand River, where he proposed to set up a trading post. Unaided he rolled up a log cabin near where the
Detroit, Lansing, and Northern depot was located, and when he brought the house into decent shape went over to Hunt's at Lyons for his family,
whom he had left there against such time as he should have affairs prepared for their comfort.
© 2016 Jason M Baldridge MLConf, May 2016
GeoNames
4048392 Portland Mills Portland Mills 39.7781 -87.00918 P PPL US IN 133 0 223 218 America/Indiana/Indianapolis 2010-02-15
4084605 Portland Portland 32.15459 -87.1686 P PPL US AL 047 0 30 41 America/Chicago 2006-01-15
4127143 Portland Portland Portlend,Портленд33.2379 -91.51151 P PPL US AR 003 430 38 39 America/Chicago 2011-05-14
4169227 Portland Portland 30.51242 -86.19578 P PPL US FL 131 0 8 14 America/Chicago 2006-01-15
4217115 Portland Portland 34.05732 -85.03634 P PPL US GA 233 0 229 228 America/New_York 2010-09-05
4277586 Portland Portland 37.0778 -97.31227 P PPL US KS 191 0 362 364 America/Chicago 2006-01-15
4305000 Portland Portland 37.12062 -85.44608 P PPL US KY 001 0 220 223 America/Chicago 2006-01-15
4305001 Portland Portland 38.26924 -85.8108 P PPL US KY 111 0 135 138 America/Kentucky/Louisville 2006-01-15
4305002 Portland Portland 38.74812 -84.44772 P PPL US KY 191 0 265 266 America/New_York 2006-01-15
404289 Portland Portland Portlend,Портленд38.71088 -91.71767 P PPL US MO 027 0 170 172 America/Chicago 2010-01-29
4521811 Portland Portland Portlend,Портленд39.00341 -81.77124 P PPL US OH 105 0 187 188 America/New_York 2010-01-29
4650946 Portland Portland Portlend,Портленд36.58171 -86.51638 P PPL US TN 165 11480 244 245 America/Chicago 2011-05-14
4720131 Portland Portland Portlend,Портленд27.87725 -97.32388 P PPL US TX 409 15099 13 11 America/Chicago 2011-05-14
4841001 Portland Portland Portlend,Портленд41.57288 -72.64065 P PPL US CT 007 5862 24 27 America/New_York 2011-05-14
4871855 Portland Portland 43.12858 -93.12354 P PPL US IA 033 35 327 330 America/Chicago 2011-05-14
4906524 Portland Portland 41.66253 -89.98012 P PPL US IL 195 0 190 190 America/Chicago 2006-01-15
5006314 Portland Portland Portlend,Портленд42.8692 -84.90305 P PPL US MI 067 3883 221 223 America/Detroit 2011-05-14
5746545 Portland Portland 45.52345 -122.67621 P PPLA2 US OR 051 583776 12 15 America/Los_Angeles 2011-05-14
Spatial minimality
27
Ann Arbor
Detroit
Ionia
Lyons
Portland
White Pigeon
1
>7
>4
>15
>17
1
# LocationsToponym
Although Elisha Newman made the first land entry in the township of Portland (June, 1833), he did not become a settler until three years later, by
which time a few settlers had located in the town. From Mr. Newman's story, it appears that early in 1833, he was visiting friends in Ann Arbor,
and during an evening conversation discussed with others the subject of unlocated lands lying west of Ann Arbor. One of the company (Joseph
Wood) remarked that he had been out with the party sent to survey Ionia and other counties, and that the surveyors were struck by the valuable
water-power at the mouth of the Looking Glass River, saying there would surely be a village there some day.
Mr. Newman was at once taken with the idea of locating lands at the mouth of the Looking Glass. Following up his impulse, he made ready to start
at once, and, accompanied by James Newman and Joseph Wood, went out to the Looking Glass on a tour of inspection. Being satisfied with the
location, he returned Eastward with his companions, and at White Pigeon made his land entry.
Newman did not return for a permanent settlement until the spring of 1836, and meanwhile, in November, 1833, Philo Bogue bought a piece of
land on section 28, in the bend of the Grand River, where he proposed to set up a trading post. Unaided he rolled up a log cabin near where the
Detroit, Lansing, and Northern depot was located, and when he brought the house into decent shape went over to Hunt's at Lyons for his family,
whom he had left there against such time as he should have affairs prepared for their comfort.
© 2016 Jason M Baldridge MLConf, May 2016
GeoNames
4048392 Portland Mills Portland Mills 39.7781 -87.00918 P PPL US IN 133 0 223 218 America/Indiana/Indianapolis 2010-02-15
4084605 Portland Portland 32.15459 -87.1686 P PPL US AL 047 0 30 41 America/Chicago 2006-01-15
4127143 Portland Portland Portlend,Портленд33.2379 -91.51151 P PPL US AR 003 430 38 39 America/Chicago 2011-05-14
4169227 Portland Portland 30.51242 -86.19578 P PPL US FL 131 0 8 14 America/Chicago 2006-01-15
4217115 Portland Portland 34.05732 -85.03634 P PPL US GA 233 0 229 228 America/New_York 2010-09-05
4277586 Portland Portland 37.0778 -97.31227 P PPL US KS 191 0 362 364 America/Chicago 2006-01-15
4305000 Portland Portland 37.12062 -85.44608 P PPL US KY 001 0 220 223 America/Chicago 2006-01-15
4305001 Portland Portland 38.26924 -85.8108 P PPL US KY 111 0 135 138 America/Kentucky/Louisville 2006-01-15
4305002 Portland Portland 38.74812 -84.44772 P PPL US KY 191 0 265 266 America/New_York 2006-01-15
404289 Portland Portland Portlend,Портленд38.71088 -91.71767 P PPL US MO 027 0 170 172 America/Chicago 2010-01-29
4521811 Portland Portland Portlend,Портленд39.00341 -81.77124 P PPL US OH 105 0 187 188 America/New_York 2010-01-29
4650946 Portland Portland Portlend,Портленд36.58171 -86.51638 P PPL US TN 165 11480 244 245 America/Chicago 2011-05-14
4720131 Portland Portland Portlend,Портленд27.87725 -97.32388 P PPL US TX 409 15099 13 11 America/Chicago 2011-05-14
4841001 Portland Portland Portlend,Портленд41.57288 -72.64065 P PPL US CT 007 5862 24 27 America/New_York 2011-05-14
4871855 Portland Portland 43.12858 -93.12354 P PPL US IA 033 35 327 330 America/Chicago 2011-05-14
4906524 Portland Portland 41.66253 -89.98012 P PPL US IL 195 0 190 190 America/Chicago 2006-01-15
5006314 Portland Portland Portlend,Портленд42.8692 -84.90305 P PPL US MI 067 3883 221 223 America/Detroit 2011-05-14
5746545 Portland Portland 45.52345 -122.67621 P PPLA2 US OR 051 583776 12 15 America/Los_Angeles 2011-05-14
Spatial minimality
27
Portland
LyonsIonia
White Pigeon
Ann Arbor
Detroit
Ionia
Lyons
Portland
White Pigeon
1
>7
>4
>15
>17
1
# LocationsToponym
Although Elisha Newman made the first land entry in the township of Portland (June, 1833), he did not become a settler until three years later, by
which time a few settlers had located in the town. From Mr. Newman's story, it appears that early in 1833, he was visiting friends in Ann Arbor,
and during an evening conversation discussed with others the subject of unlocated lands lying west of Ann Arbor. One of the company (Joseph
Wood) remarked that he had been out with the party sent to survey Ionia and other counties, and that the surveyors were struck by the valuable
water-power at the mouth of the Looking Glass River, saying there would surely be a village there some day.
Mr. Newman was at once taken with the idea of locating lands at the mouth of the Looking Glass. Following up his impulse, he made ready to start
at once, and, accompanied by James Newman and Joseph Wood, went out to the Looking Glass on a tour of inspection. Being satisfied with the
location, he returned Eastward with his companions, and at White Pigeon made his land entry.
Newman did not return for a permanent settlement until the spring of 1836, and meanwhile, in November, 1833, Philo Bogue bought a piece of
land on section 28, in the bend of the Grand River, where he proposed to set up a trading post. Unaided he rolled up a log cabin near where the
Detroit, Lansing, and Northern depot was located, and when he brought the house into decent shape went over to Hunt's at Lyons for his family,
whom he had left there against such time as he should have affairs prepared for their comfort.
© 2016 Jason M Baldridge MLConf, May 2016
Spatial minimality often fails
28
I moved from Encinitas, CA, a nice beach town in North San Diego County to Asheville, NC.
By far, Ashville is more hip, especially West Asheville. Asheville has a lot in common with
Portland. Austin, I've never been to so I cannot comment. But what makes a place cool and
hip, in my opinion are that give a area "punch". There are a lot of ingredients. One is geography.
Add a college or university (and all that they bring- and draw), good restaurants, a good music
scene, a progressive attitude and tolerance. Hmmm. I'm sure there are many more to ponder. But
that's my start. Oh, lots of bars!
From: http://www.city-data.com/forum/austin/1694181-what-makes-city-like-austin-portland-3.html
City-data.com incorrectly marks “West” and “Portland”
as the cities in Texas -- presumably because of their
textual and spatial proximity to “Austin”.
© 2016 Jason M Baldridge MLConf, May 2016
Spatial minimality often fails
28
I moved from Encinitas, CA, a nice beach town in North San Diego County to Asheville, NC.
By far, Ashville is more hip, especially West Asheville. Asheville has a lot in common with
Portland. Austin, I've never been to so I cannot comment. But what makes a place cool and
hip, in my opinion are that give a area "punch". There are a lot of ingredients. One is geography.
Add a college or university (and all that they bring- and draw), good restaurants, a good music
scene, a progressive attitude and tolerance. Hmmm. I'm sure there are many more to ponder. But
that's my start. Oh, lots of bars!
From: http://www.city-data.com/forum/austin/1694181-what-makes-city-like-austin-portland-3.html
City-data.com incorrectly marks “West” and “Portland”
as the cities in Texas -- presumably because of their
textual and spatial proximity to “Austin”.
But: it is clear from the text that Portland, Oregon and
Austin,Texas are the referents, though their states are
never mentioned and are far from the other locations!
I moved from Encinitas, CA, a nice beach town in North San Diego County to Asheville, NC.
By far, Ashville is more hip, especially West Asheville. Asheville has a lot in common with
Portland. Austin, I've never been to so I cannot comment. But what makes a place cool and
hip, in my opinion are that give a area "punch". There are a lot of ingredients. One is geography.
Add a college or university (and all that they bring- and draw), good restaurants, a good music
scene, a progressive attitude and tolerance. Hmmm. I'm sure there are many more to ponder. But
that's my start. Oh, lots of bars!
© 2016 Jason M Baldridge MLConf, May 2016
Toponym classifiers
29
Strategy: build a textual classifier per toponym by
obtaining indirectly labeled examples from Wikipedia.
© 2016 Jason M Baldridge MLConf, May 2016
Toponym classifiers
29
Strategy: build a textual classifier per toponym by
obtaining indirectly labeled examples from Wikipedia.
© 2016 Jason M Baldridge MLConf, May 2016
Toponym classifiers
29
Strategy: build a textual classifier per toponym by
obtaining indirectly labeled examples from Wikipedia.
© 2016 Jason M Baldridge MLConf, May 2016
Toponym classifiers
29
Strategy: build a textual classifier per toponym by
obtaining indirectly labeled examples from Wikipedia.
© 2016 Jason M Baldridge MLConf, May 2016
Toponym classifiers
29
Strategy: build a textual classifier per toponym by
obtaining indirectly labeled examples from Wikipedia.
© 2016 Jason M Baldridge MLConf, May 2016
Toponym classifiers
29
Strategy: build a textual classifier per toponym by
obtaining indirectly labeled examples from Wikipedia.
© 2016 Jason M Baldridge MLConf, May 2016
Toponym classifiers
29
Strategy: build a textual classifier per toponym by
obtaining indirectly labeled examples from Wikipedia.
P(Portland-OR|music) > P(Portland-ME|music)
P(Portland-OR|wharf ) < P(Portland-ME|wharf )
© 2016 Jason M Baldridge MLConf May 2016
Geographic word profiles learned from Wikipedia grid LMs
30
© 2016 Jason M Baldridge MLConf May 2016
TopoCluster: gazetteer-free toponym resolution
31
© 2016 Jason M Baldridge MLConf, May 2016
Results: disambiguating toponyms
32
TR-CoNLL LGL CWar WoTR
Population 91 63 62 64
SPIDER
(spatial minimality)
65 68 67 69
WISTR
(Wiki supervised)
89 64 73 63
SPIDER
+WISTR
87 78 87 69
TopoCluster 92 75 93 71
© 2016 Jason M Baldridge MLConf May 2016
New corpus: War of the Rebellion
33
Historical document collection.
Both toponym and document lat/lon.
10,380 annotated toponyms, including
lat/lon and regions.
Will be freely available.
Coming soon!
Contact me for details.
© 2016 Jason M Baldridge MLConf, May 2016
Grounding, more generally
34
Grounding often involves connecting text to
knowledge sources and other modalities such as
demographics, time, image, and video.
© 2016 Jason M Baldridge MLConf, May 2016
Grounding, more generally
34
Grounding often involves connecting text to
knowledge sources and other modalities such as
demographics, time, image, and video.
This can help us create models for deeper aspects of
language, such as syntactic structure and logical form.
He says, she says http://www.tweetolife.com/gender/
© 2016 Jason M Baldridge MLConf, May 2016
Temporality of words, by hour http://www.tweetolife.com/hour/
36
© 2016 Jason M Baldridge MLConf, May 2016
Temporality of words, by hour http://www.tweetolife.com/hour/
36
© 2016 Jason M Baldridge MLConf, May 2016
Temporality of expressions, by day: http://www.google.com/trends
37
© 2016 Jason M Baldridge MLConf, May 2016
Temporality of expressions, by day: http://www.google.com/trends
37
© 2016 Jason M Baldridge MLConf, May 2016
Temporality of expressions, by year: http://ngrams.googlelabs.com/
38
slave
trenches aircraft
war
© 2016 Jason M Baldridge MLConf, May 2016
Temporal resolution [Kumar, Lease, and Baldridge 2011]
39
2000BC
0AD
2000AD
4000BC
© 2016 Jason M Baldridge MLConf, May 2016
Temporal resolution [Kumar, Lease, and Baldridge 2011]
39
2000BC
0AD
2000AD
4000BC
© 2016 Jason M Baldridge MLConf, May 2016
Temporal resolution [Kumar, Lease, and Baldridge 2011]
39
2000BC
0AD
2000AD
4000BC
© 2016 Jason M Baldridge MLConf, May 2016
Temporal resolution [Kumar, Lease, and Baldridge 2011]
39
2000BC
0AD
2000AD
4000BC
© 2016 Jason M Baldridge MLConf, May 2016
Temporal resolution [Kumar, Lease, and Baldridge 2011]
39
2000BC
0AD
2000AD
4000BC
© 2016 Jason M Baldridge MLConf, May 2016
Temporal resolution [Kumar, Lease, and Baldridge 2011]
39
2000BC
0AD
2000AD
4000BC
© 2016 Jason M Baldridge MLConf, May 2016
More modalities: videos [Motwani & Mooney, 2012]
40
© 2016 Jason M Baldridge MLConf, May 2016
Beyond word co-occurences for vector-space models
41
bear boat car cow hadoop snow water wrench
3 234 42 4 1 2 325 0
beach
© 2016 Jason M Baldridge MLConf, May 2016
Beyond word co-occurences for vector-space models
41
bear boat car cow hadoop snow water wrench
3 234 42 4 1 2 325 0
beach
© 2016 Jason M Baldridge MLConf, May 2016
Beyond word co-occurences for vector-space models
41
bear boat car cow hadoop snow water wrench
3 234 42 4 1 2 325 0
beach
© 2016 Jason M Baldridge MLConf, May 2016
Beyond word co-occurences for vector-space models
41
bear boat car cow hadoop snow water wrench
3 234 42 4 1 2 325 0
beach
© 2016 Jason M Baldridge MLConf, May 2016
Beyond word co-occurences for vector-space models
41
bear boat car cow hadoop snow water wrench
3 234 42 4 1 2 325 0
beach
© 2016 Jason M Baldridge MLConf, May 2016
Beyond word co-occurences for vector-space models
41
bear boat car cow hadoop snow water wrench
3 234 42 4 1 2 325 0
beach
© 2016 Jason M Baldridge MLConf, May 2016
Beyond word co-occurences for vector-space models
41
bear boat car cow hadoop snow water wrench
3 234 42 4 1 2 325 0
beach
© 2016 Jason M Baldridge MLConf, May 2016
Beyond word co-occurences for vector-space models
41
bear boat car cow hadoop snow water wrench
3 234 42 4 1 2 325 0
beach
© 2016 Jason M Baldridge MLConf, May 2016
Beyond word co-occurences for vector-space models
41
bear boat car cow hadoop snow water wrench
3 234 42 4 1 2 325 0
beach
© 2016 Jason M Baldridge MLConf, May 2016
Combining distributional models with logics
42
Erk (2013):“Towards a semantics for distributional representations.”
Garrette et al (2012):“A formal approach to linking logical form and vector-space lexical semantics”
Beltagy et al (2013):“Montague Meets Markov: Deep Semantics with Probabilistic Logical Form”
© 2016 Jason M Baldridge MLConf, May 2016
Multi-component structured vector-space models
43
beachchildren
visit
the children visit the beach
Agent Patient
This research was sponsored by:
Grant from the
Morris Memorial Trust Fund
- Walt Whitman, A Song of the Rolling Earth (in Leaves of Grass)
Final note:Whitman had it right many years ago!
Thanks!
Code and Data
- Textgrounder: https://github.com/utcompling/textgrounder
Publications
- Benjamin Wing and Jason Baldridge. 2011. Simple supervised document geolocation
with geodesic grids. In Proceedings of ACL HLT 2011.
- Stephen Roller, Mike Speriosu, Sarat Rallapalli, Benjamin Wing and Jason Baldridge. 2012.
Supervised Text-based Geolocation Using Language Models on an Adaptive Grid. EMNLP
2012. Jeju, Korea.
- Benjamin Wing and Jason Baldridge 2014. Hierarchical Discriminative Classification for
Text-Based Geolocation. EMNLP 2014.
Document geolocation
Code and data
- Fieldspring: https://github.com/utcompling/fieldspring
- TopoCluster: https://github.com/grantdelozier/TopoCluster
Publications
- Mike Speriosu and Jason Baldridge. 2013.Text-Driven Toponym Resolution using Indirect
Supervision.ACL 2013.
- Grant DeLozier, Jason Baldridge, and Loretta London. 2015. Gazetteer-Free Toponym
Resolution Using Geographic Word Profiles.AAAI 2015.
Toponym resolution

More Related Content

What's hot

Guesstalt - College General Quiz - Prelims (With Answers) - Tie Grab 2019
Guesstalt - College General Quiz - Prelims (With Answers) - Tie Grab 2019Guesstalt - College General Quiz - Prelims (With Answers) - Tie Grab 2019
Guesstalt - College General Quiz - Prelims (With Answers) - Tie Grab 2019SJC Quizzers
 
27 June 2012 1HBCT Weekly News Update
27 June 2012 1HBCT Weekly News Update27 June 2012 1HBCT Weekly News Update
27 June 2012 1HBCT Weekly News UpdateNoel Waterman
 
Foster LeRoy Gordon & Loe Erickson
Foster LeRoy Gordon & Loe EricksonFoster LeRoy Gordon & Loe Erickson
Foster LeRoy Gordon & Loe EricksonJoeAnd41
 
Vera Gordon P. Jones
Vera Gordon P. JonesVera Gordon P. Jones
Vera Gordon P. JonesJoeAnd41
 
Life's Story of Larkin Andrew Erickson & Ethel Roberts
Life's Story of Larkin Andrew Erickson & Ethel RobertsLife's Story of Larkin Andrew Erickson & Ethel Roberts
Life's Story of Larkin Andrew Erickson & Ethel RobertsJoeAnd41
 
Boithoki Quiz At Jalpaiguri | Anubhab Dey
Boithoki Quiz At Jalpaiguri | Anubhab DeyBoithoki Quiz At Jalpaiguri | Anubhab Dey
Boithoki Quiz At Jalpaiguri | Anubhab DeyAnubhab Dey
 

What's hot (10)

Guesstalt - College General Quiz - Prelims (With Answers) - Tie Grab 2019
Guesstalt - College General Quiz - Prelims (With Answers) - Tie Grab 2019Guesstalt - College General Quiz - Prelims (With Answers) - Tie Grab 2019
Guesstalt - College General Quiz - Prelims (With Answers) - Tie Grab 2019
 
Conversation 2
Conversation 2Conversation 2
Conversation 2
 
27 June 2012 1HBCT Weekly News Update
27 June 2012 1HBCT Weekly News Update27 June 2012 1HBCT Weekly News Update
27 June 2012 1HBCT Weekly News Update
 
Photo essay
Photo essayPhoto essay
Photo essay
 
Foster LeRoy Gordon & Loe Erickson
Foster LeRoy Gordon & Loe EricksonFoster LeRoy Gordon & Loe Erickson
Foster LeRoy Gordon & Loe Erickson
 
Vera Gordon P. Jones
Vera Gordon P. JonesVera Gordon P. Jones
Vera Gordon P. Jones
 
Life's Story of Larkin Andrew Erickson & Ethel Roberts
Life's Story of Larkin Andrew Erickson & Ethel RobertsLife's Story of Larkin Andrew Erickson & Ethel Roberts
Life's Story of Larkin Andrew Erickson & Ethel Roberts
 
Emma
EmmaEmma
Emma
 
Boithoki Quiz At Jalpaiguri | Anubhab Dey
Boithoki Quiz At Jalpaiguri | Anubhab DeyBoithoki Quiz At Jalpaiguri | Anubhab Dey
Boithoki Quiz At Jalpaiguri | Anubhab Dey
 
Wrtn of payback 2013
Wrtn of payback 2013Wrtn of payback 2013
Wrtn of payback 2013
 

Viewers also liked

Jonas Schneider, Head of Engineering for Robotics, OpenAI
Jonas Schneider, Head of Engineering for Robotics, OpenAIJonas Schneider, Head of Engineering for Robotics, OpenAI
Jonas Schneider, Head of Engineering for Robotics, OpenAIMLconf
 
Counter Intuitive Machine Learning for the Industrial Internet of Things
Counter Intuitive Machine Learning for the Industrial Internet of ThingsCounter Intuitive Machine Learning for the Industrial Internet of Things
Counter Intuitive Machine Learning for the Industrial Internet of ThingsJune Andrews
 
Taking the Attacker Eviction Red Pill (v2.0)
Taking the Attacker Eviction Red Pill (v2.0)Taking the Attacker Eviction Red Pill (v2.0)
Taking the Attacker Eviction Red Pill (v2.0)Frode Hommedal
 
Tianqi Chen, PhD Student, University of Washington, at MLconf Seattle 2017
Tianqi Chen, PhD Student, University of Washington, at MLconf Seattle 2017Tianqi Chen, PhD Student, University of Washington, at MLconf Seattle 2017
Tianqi Chen, PhD Student, University of Washington, at MLconf Seattle 2017MLconf
 
Jennifer Marsman, Principal Software Development Engineer, Microsoft at MLcon...
Jennifer Marsman, Principal Software Development Engineer, Microsoft at MLcon...Jennifer Marsman, Principal Software Development Engineer, Microsoft at MLcon...
Jennifer Marsman, Principal Software Development Engineer, Microsoft at MLcon...MLconf
 
Talha Obaid, Email Security, Symantec at MLconf ATL 2017
Talha Obaid, Email Security, Symantec at MLconf ATL 2017Talha Obaid, Email Security, Symantec at MLconf ATL 2017
Talha Obaid, Email Security, Symantec at MLconf ATL 2017MLconf
 
LN Renganarayana, Architect, ML Platform and Services and Madhura Dudhgaonkar...
LN Renganarayana, Architect, ML Platform and Services and Madhura Dudhgaonkar...LN Renganarayana, Architect, ML Platform and Services and Madhura Dudhgaonkar...
LN Renganarayana, Architect, ML Platform and Services and Madhura Dudhgaonkar...MLconf
 
Venkatesh Ramanathan, Data Scientist, PayPal at MLconf ATL 2017
Venkatesh Ramanathan, Data Scientist, PayPal at MLconf ATL 2017Venkatesh Ramanathan, Data Scientist, PayPal at MLconf ATL 2017
Venkatesh Ramanathan, Data Scientist, PayPal at MLconf ATL 2017MLconf
 
Daniel Shank, Data Scientist, Talla at MLconf SF 2017
Daniel Shank, Data Scientist, Talla at MLconf SF 2017Daniel Shank, Data Scientist, Talla at MLconf SF 2017
Daniel Shank, Data Scientist, Talla at MLconf SF 2017MLconf
 
Top 5 Deep Learning and AI Stories - October 6, 2017
Top 5 Deep Learning and AI Stories - October 6, 2017Top 5 Deep Learning and AI Stories - October 6, 2017
Top 5 Deep Learning and AI Stories - October 6, 2017NVIDIA
 

Viewers also liked (10)

Jonas Schneider, Head of Engineering for Robotics, OpenAI
Jonas Schneider, Head of Engineering for Robotics, OpenAIJonas Schneider, Head of Engineering for Robotics, OpenAI
Jonas Schneider, Head of Engineering for Robotics, OpenAI
 
Counter Intuitive Machine Learning for the Industrial Internet of Things
Counter Intuitive Machine Learning for the Industrial Internet of ThingsCounter Intuitive Machine Learning for the Industrial Internet of Things
Counter Intuitive Machine Learning for the Industrial Internet of Things
 
Taking the Attacker Eviction Red Pill (v2.0)
Taking the Attacker Eviction Red Pill (v2.0)Taking the Attacker Eviction Red Pill (v2.0)
Taking the Attacker Eviction Red Pill (v2.0)
 
Tianqi Chen, PhD Student, University of Washington, at MLconf Seattle 2017
Tianqi Chen, PhD Student, University of Washington, at MLconf Seattle 2017Tianqi Chen, PhD Student, University of Washington, at MLconf Seattle 2017
Tianqi Chen, PhD Student, University of Washington, at MLconf Seattle 2017
 
Jennifer Marsman, Principal Software Development Engineer, Microsoft at MLcon...
Jennifer Marsman, Principal Software Development Engineer, Microsoft at MLcon...Jennifer Marsman, Principal Software Development Engineer, Microsoft at MLcon...
Jennifer Marsman, Principal Software Development Engineer, Microsoft at MLcon...
 
Talha Obaid, Email Security, Symantec at MLconf ATL 2017
Talha Obaid, Email Security, Symantec at MLconf ATL 2017Talha Obaid, Email Security, Symantec at MLconf ATL 2017
Talha Obaid, Email Security, Symantec at MLconf ATL 2017
 
LN Renganarayana, Architect, ML Platform and Services and Madhura Dudhgaonkar...
LN Renganarayana, Architect, ML Platform and Services and Madhura Dudhgaonkar...LN Renganarayana, Architect, ML Platform and Services and Madhura Dudhgaonkar...
LN Renganarayana, Architect, ML Platform and Services and Madhura Dudhgaonkar...
 
Venkatesh Ramanathan, Data Scientist, PayPal at MLconf ATL 2017
Venkatesh Ramanathan, Data Scientist, PayPal at MLconf ATL 2017Venkatesh Ramanathan, Data Scientist, PayPal at MLconf ATL 2017
Venkatesh Ramanathan, Data Scientist, PayPal at MLconf ATL 2017
 
Daniel Shank, Data Scientist, Talla at MLconf SF 2017
Daniel Shank, Data Scientist, Talla at MLconf SF 2017Daniel Shank, Data Scientist, Talla at MLconf SF 2017
Daniel Shank, Data Scientist, Talla at MLconf SF 2017
 
Top 5 Deep Learning and AI Stories - October 6, 2017
Top 5 Deep Learning and AI Stories - October 6, 2017Top 5 Deep Learning and AI Stories - October 6, 2017
Top 5 Deep Learning and AI Stories - October 6, 2017
 

More from MLconf

Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...MLconf
 
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingTed Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingMLconf
 
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...MLconf
 
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushIgor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushMLconf
 
Josh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceJosh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceMLconf
 
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...MLconf
 
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...MLconf
 
Meghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMeghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMLconf
 
Noam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionNoam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionMLconf
 
June Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLJune Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLMLconf
 
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksSneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksMLconf
 
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...MLconf
 
Vito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldVito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldMLconf
 
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...MLconf
 
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...MLconf
 
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...MLconf
 
Neel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeNeel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeMLconf
 
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...MLconf
 
Soumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareSoumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareMLconf
 
Roy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesRoy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesMLconf
 

More from MLconf (20)

Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
 
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingTed Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
 
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
 
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushIgor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
 
Josh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceJosh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious Experience
 
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
 
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
 
Meghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMeghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the Cheap
 
Noam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionNoam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data Collection
 
June Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLJune Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of ML
 
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksSneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
 
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
 
Vito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldVito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI World
 
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
 
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
 
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
 
Neel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeNeel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to code
 
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
 
Soumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareSoumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better Software
 
Roy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesRoy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime Changes
 

Recently uploaded

Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 

Recently uploaded (20)

Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 

Jason Baldridge, Associate Professor of Computational Linguistics, University of Texas at Austin at MLconf SEA - 5/20/16

  • 1. Disambiguating Explicit and Implicit Geographic References in Natural Language Jason Baldridge @jasonbaldridge Computational Linguistics Lab Department of Linguistics UT Austin MLConf Seattle May 20, 2016
  • 2. © 2016 Jason M Baldridge MLConf, May 2016 What does “barbecue” mean? 2
  • 3. © 2016 Jason M Baldridge MLConf, May 2016 What does “barbecue” mean? Barbecue’ 2
  • 4. © 2016 Jason M Baldridge MLConf, May 2016 What does “barbecue” mean? Barbecue’ 2
  • 5. © 2016 Jason M Baldridge MLConf, May 2016 What does “barbecue” mean? Barbecue’ 2
  • 6. © 2016 Jason M Baldridge MLConf, May 2016 What does “barbecue” mean? Barbecue’ 2
  • 7. © 2016 Jason M Baldridge MLConf, May 2016 What does “barbecue” mean? Barbecue’ 2
  • 8. © 2016 Jason M Baldridge MLConf, May 2016 What does “barbecue” mean? Barbecue’ 2
  • 9. © 2016 Jason M Baldridge MLConf, May 2016 What I thought semantics was before 2005 3 From: John Enrico and Jason Baldridge. 2011. Possessor Raising, Demonstrative Raising, Quantifier Float and Number Float in Haida. International Journal of American Linguistics. 77(2):185-218
  • 10. © 2016 Jason M Baldridge MLConf, May 2016 Updated perspective a la Ray Mooney (UT Austin CS) 4 http://www.cs.utexas.edu/users/ml/slides/chen-icml08.ppt
  • 11. © 2016 Jason M Baldridge MLConf, May 2016 http://www.lib.utexas.edu/books/travel/index.htmlTravel at the Turn of the 20th Century 5
  • 12. © 2016 Jason M Baldridge MLConf, May 2016 Motivation: Google Lit Trips [http://www.googlelittrips.com/] 6 Grapes of Wrath in Google Earth Text http://www.googlelittrips.com/GoogleLit/9-12/Entries/2006/11/1_The_Grapes_of_Wrath_by_John_Steinbeck.html
  • 13. © 2016 Jason M Baldridge MLConf, May 2016 Look, Mom, no hands! (Err, um... no metadata.) 7
  • 14. © 2016 Jason M Baldridge MLConf, May 2016 Look, Mom, no hands! (Err, um... no metadata.) 7 Topics with a clear, circumscribed geographic focus emerge!
  • 15. © 2016 Jason M Baldridge MLConf, May 2016 Metadata is now plentiful 8
  • 16. © 2016 Jason M Baldridge MLConf, May 2016 01:55:55 RT @USER_dc5e5498: Drop and give me 50.... 05:09:29 I said u got a swisher from redmond!? He said nah kirkland! Lol..ooooooooOkay! 05:57:35 Lmao!:) havin a good ol time after work! Unexpected! #goodtimes 06:00:09 RT @USER_d5d93fec: #letsbereal .. No seriously, #letsbereal>>lol. Don't start. 06:00:37 On my way to get @USER_60939380 yeee! She want some of this strawberry! Sexy! ... 47°31’41’’ N 122°11’52’’ W 9 Geotagged Twitter
  • 17. © 2016 Jason M Baldridge MLConf, May 2016 01:55:55 RT @USER_dc5e5498: Drop and give me 50.... 05:09:29 I said u got a swisher from redmond!? He said nah kirkland! Lol..ooooooooOkay! 05:57:35 Lmao!:) havin a good ol time after work! Unexpected! #goodtimes 06:00:09 RT @USER_d5d93fec: #letsbereal .. No seriously, #letsbereal>>lol. Don't start. 06:00:37 On my way to get @USER_60939380 yeee! She want some of this strawberry! Sexy! ... 47°31’41’’ N 122°11’52’’ W 9 Geotagged Twitter
  • 18. © 2016 Jason M Baldridge MLConf, May 2016 Geotagged Wikipedia 10 30° 17′ N 97° 44′ W
  • 19. © 2016 Jason M Baldridge MLConf, May 2016 Where’s a word on Earth? (according to Wikipedia)
  • 20. © 2016 Jason M Baldridge MLConf, May 2016 Where’s a word on Earth? (according to Wikipedia) mountain
  • 21. © 2016 Jason M Baldridge MLConf, May 2016 Document geolocation: where is this person? 12
  • 22. © 2016 Jason M Baldridge MLConf, May 201613 Amsterdam, Zaandam,Amstelveen, Diemen, Landsmeer ... Frankfurt, Frechen, Hürth, Brühl,Wesseling, ... Language modeling approach Wing & Baldridge 2011: Simple supervised document geolocation with geodesic grids.
  • 23. © 2016 Jason M Baldridge MLConf, May 2016 Locations of Twitter users are not uniformly distributed! 14 (Small) GeoUT (Twitter) plotted on Google Earth, one pin per user. Density of (all) documents in GeoUT over the USA (390 million tweets)
  • 24. © 2016 Jason M Baldridge MLConf, May 2016 k-d tree for geotagged Wikipedia, looking at N. America 15 Roller, Speriosu, Rallapalli,Wing & Baldridge 2014: Supervised Text-based Geolocation Using Language Models on an Adaptive Grid.
  • 25. © 2016 Jason M Baldridge MLConf, May 2016 k-d tree for geotagged Wikipedia, looking at N. America 15 Roller, Speriosu, Rallapalli,Wing & Baldridge 2014: Supervised Text-based Geolocation Using Language Models on an Adaptive Grid.
  • 26. [Serdyukov, Murdock, & van Zwol 2009; Cheng, Caverlee, & Lee 2010;Wing & Baldridge 2011] Automatic document geolocation
  • 27. [Serdyukov, Murdock, & van Zwol 2009; Cheng, Caverlee, & Lee 2010;Wing & Baldridge 2011] Automatic document geolocation
  • 28. © 2016 Jason M Baldridge MLConf, May 2016 Hierarchical geo-location with logistic regression 17 Wing & Baldridge 2014: Hierarchical Discriminative Classification for Text-Based Geolocation.
  • 29. © 2016 Jason M Baldridge MLConf May 2016 Hierarchical logistic regression 18
  • 30. © 2016 Jason M Baldridge MLConf May 2016 Hierarchical logistic regression 19
  • 31. © 2016 Jason M Baldridge MLConf May 2016 Hierarchical logistic regression 20
  • 32. © 2016 Jason M Baldridge MLConf, May 2016 Performance: hierarchical logistic regression with kd-tree grid 21 Flickr (entire world) Half of documents geotagged within 18 km of truth Percent of documents within 166km (100 miles): 66% Twitter (World) Half of users geotagged within 510 km of truth Percent of documents within 166km (100 miles): 31% Twitter (USA) Half of users geotagged within 192 km of truth Percent of documents within 166km (100 miles): 48%
  • 33. © 2016 Jason M Baldridge MLConf, May 2016 Hierarchical logistic regression beats flat naive Bayes 22 Naive Bayes Hierarchical LR Twitter USA 36.2 48.0 Twitter World 28.7 31.3 Flickr 58.5 66 English Wikipedia 84.5 88.9 German Wikipedia 89.3 90.2 Portuguese Wikipedia 77.1 89.5 Accuracy @ 161 km, kd-tree grid
  • 34. © 2016 Jason M Baldridge MLConf, May 2016 Logistic regression weights good features heavily 23
  • 35. © 2016 Jason M Baldridge MLConf May 2016 Automated Lit Trips 24
  • 36. © 2016 Jason M Baldridge MLConf, May 2016 Toponym (place name) resolution 25 They visit Portland every year.
  • 37. © 2016 Jason M Baldridge MLConf, May 2016 Toponym (place name) resolution 25 They visit Portland every year. ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? Which Portland? (Also: Canada,Australia, Ireland...)
  • 38. © 2016 Jason M Baldridge MLConf, May 2016 Toponym resolution in context 26 Although Elisha Newman made the first land entry in the township of Portland (June, 1833), he did not become a settler until three years later, by which time a few settlers had located in the town. From Mr. Newman's story, it appears that early in 1833, he was visiting friends in Ann Arbor, and during an evening conversation discussed with others the subject of unlocated lands lying west of Ann Arbor. One of the company (Joseph Wood) remarked that he had been out with the party sent to survey Ionia and other counties, and that the surveyors were struck by the valuable water-power at the mouth of the Looking Glass River, saying there would surely be a village there some day. Mr. Newman was at once taken with the idea of locating lands at the mouth of the Looking Glass. Following up his impulse, he made ready to start at once, and, accompanied by James Newman and Joseph Wood, went out to the Looking Glass on a tour of inspection. Being satisfied with the location, he returned Eastward with his companions, and at White Pigeon made his land entry. Newman did not return for a permanent settlement until the spring of 1836, and meanwhile, in November, 1833, Philo Bogue bought a piece of land on section 28, in the bend of the Grand River, where he proposed to set up a trading post. Unaided he rolled up a log cabin near where the Detroit, Lansing, and Northern depot was located, and when he brought the house into decent shape went over to Hunt's at Lyons for his family, whom he had left there against such time as he should have affairs prepared for their comfort.
  • 39. © 2016 Jason M Baldridge MLConf, May 2016 Spatial minimality 27 Although Elisha Newman made the first land entry in the township of Portland (June, 1833), he did not become a settler until three years later, by which time a few settlers had located in the town. From Mr. Newman's story, it appears that early in 1833, he was visiting friends in Ann Arbor, and during an evening conversation discussed with others the subject of unlocated lands lying west of Ann Arbor. One of the company (Joseph Wood) remarked that he had been out with the party sent to survey Ionia and other counties, and that the surveyors were struck by the valuable water-power at the mouth of the Looking Glass River, saying there would surely be a village there some day. Mr. Newman was at once taken with the idea of locating lands at the mouth of the Looking Glass. Following up his impulse, he made ready to start at once, and, accompanied by James Newman and Joseph Wood, went out to the Looking Glass on a tour of inspection. Being satisfied with the location, he returned Eastward with his companions, and at White Pigeon made his land entry. Newman did not return for a permanent settlement until the spring of 1836, and meanwhile, in November, 1833, Philo Bogue bought a piece of land on section 28, in the bend of the Grand River, where he proposed to set up a trading post. Unaided he rolled up a log cabin near where the Detroit, Lansing, and Northern depot was located, and when he brought the house into decent shape went over to Hunt's at Lyons for his family, whom he had left there against such time as he should have affairs prepared for their comfort.
  • 40. © 2016 Jason M Baldridge MLConf, May 2016 GeoNames 4048392 Portland Mills Portland Mills 39.7781 -87.00918 P PPL US IN 133 0 223 218 America/Indiana/Indianapolis 2010-02-15 4084605 Portland Portland 32.15459 -87.1686 P PPL US AL 047 0 30 41 America/Chicago 2006-01-15 4127143 Portland Portland Portlend,Портленд33.2379 -91.51151 P PPL US AR 003 430 38 39 America/Chicago 2011-05-14 4169227 Portland Portland 30.51242 -86.19578 P PPL US FL 131 0 8 14 America/Chicago 2006-01-15 4217115 Portland Portland 34.05732 -85.03634 P PPL US GA 233 0 229 228 America/New_York 2010-09-05 4277586 Portland Portland 37.0778 -97.31227 P PPL US KS 191 0 362 364 America/Chicago 2006-01-15 4305000 Portland Portland 37.12062 -85.44608 P PPL US KY 001 0 220 223 America/Chicago 2006-01-15 4305001 Portland Portland 38.26924 -85.8108 P PPL US KY 111 0 135 138 America/Kentucky/Louisville 2006-01-15 4305002 Portland Portland 38.74812 -84.44772 P PPL US KY 191 0 265 266 America/New_York 2006-01-15 404289 Portland Portland Portlend,Портленд38.71088 -91.71767 P PPL US MO 027 0 170 172 America/Chicago 2010-01-29 4521811 Portland Portland Portlend,Портленд39.00341 -81.77124 P PPL US OH 105 0 187 188 America/New_York 2010-01-29 4650946 Portland Portland Portlend,Портленд36.58171 -86.51638 P PPL US TN 165 11480 244 245 America/Chicago 2011-05-14 4720131 Portland Portland Portlend,Портленд27.87725 -97.32388 P PPL US TX 409 15099 13 11 America/Chicago 2011-05-14 4841001 Portland Portland Portlend,Портленд41.57288 -72.64065 P PPL US CT 007 5862 24 27 America/New_York 2011-05-14 4871855 Portland Portland 43.12858 -93.12354 P PPL US IA 033 35 327 330 America/Chicago 2011-05-14 4906524 Portland Portland 41.66253 -89.98012 P PPL US IL 195 0 190 190 America/Chicago 2006-01-15 5006314 Portland Portland Portlend,Портленд42.8692 -84.90305 P PPL US MI 067 3883 221 223 America/Detroit 2011-05-14 5746545 Portland Portland 45.52345 -122.67621 P PPLA2 US OR 051 583776 12 15 America/Los_Angeles 2011-05-14 Spatial minimality 27 Although Elisha Newman made the first land entry in the township of Portland (June, 1833), he did not become a settler until three years later, by which time a few settlers had located in the town. From Mr. Newman's story, it appears that early in 1833, he was visiting friends in Ann Arbor, and during an evening conversation discussed with others the subject of unlocated lands lying west of Ann Arbor. One of the company (Joseph Wood) remarked that he had been out with the party sent to survey Ionia and other counties, and that the surveyors were struck by the valuable water-power at the mouth of the Looking Glass River, saying there would surely be a village there some day. Mr. Newman was at once taken with the idea of locating lands at the mouth of the Looking Glass. Following up his impulse, he made ready to start at once, and, accompanied by James Newman and Joseph Wood, went out to the Looking Glass on a tour of inspection. Being satisfied with the location, he returned Eastward with his companions, and at White Pigeon made his land entry. Newman did not return for a permanent settlement until the spring of 1836, and meanwhile, in November, 1833, Philo Bogue bought a piece of land on section 28, in the bend of the Grand River, where he proposed to set up a trading post. Unaided he rolled up a log cabin near where the Detroit, Lansing, and Northern depot was located, and when he brought the house into decent shape went over to Hunt's at Lyons for his family, whom he had left there against such time as he should have affairs prepared for their comfort.
  • 41. © 2016 Jason M Baldridge MLConf, May 2016 GeoNames 4048392 Portland Mills Portland Mills 39.7781 -87.00918 P PPL US IN 133 0 223 218 America/Indiana/Indianapolis 2010-02-15 4084605 Portland Portland 32.15459 -87.1686 P PPL US AL 047 0 30 41 America/Chicago 2006-01-15 4127143 Portland Portland Portlend,Портленд33.2379 -91.51151 P PPL US AR 003 430 38 39 America/Chicago 2011-05-14 4169227 Portland Portland 30.51242 -86.19578 P PPL US FL 131 0 8 14 America/Chicago 2006-01-15 4217115 Portland Portland 34.05732 -85.03634 P PPL US GA 233 0 229 228 America/New_York 2010-09-05 4277586 Portland Portland 37.0778 -97.31227 P PPL US KS 191 0 362 364 America/Chicago 2006-01-15 4305000 Portland Portland 37.12062 -85.44608 P PPL US KY 001 0 220 223 America/Chicago 2006-01-15 4305001 Portland Portland 38.26924 -85.8108 P PPL US KY 111 0 135 138 America/Kentucky/Louisville 2006-01-15 4305002 Portland Portland 38.74812 -84.44772 P PPL US KY 191 0 265 266 America/New_York 2006-01-15 404289 Portland Portland Portlend,Портленд38.71088 -91.71767 P PPL US MO 027 0 170 172 America/Chicago 2010-01-29 4521811 Portland Portland Portlend,Портленд39.00341 -81.77124 P PPL US OH 105 0 187 188 America/New_York 2010-01-29 4650946 Portland Portland Portlend,Портленд36.58171 -86.51638 P PPL US TN 165 11480 244 245 America/Chicago 2011-05-14 4720131 Portland Portland Portlend,Портленд27.87725 -97.32388 P PPL US TX 409 15099 13 11 America/Chicago 2011-05-14 4841001 Portland Portland Portlend,Портленд41.57288 -72.64065 P PPL US CT 007 5862 24 27 America/New_York 2011-05-14 4871855 Portland Portland 43.12858 -93.12354 P PPL US IA 033 35 327 330 America/Chicago 2011-05-14 4906524 Portland Portland 41.66253 -89.98012 P PPL US IL 195 0 190 190 America/Chicago 2006-01-15 5006314 Portland Portland Portlend,Портленд42.8692 -84.90305 P PPL US MI 067 3883 221 223 America/Detroit 2011-05-14 5746545 Portland Portland 45.52345 -122.67621 P PPLA2 US OR 051 583776 12 15 America/Los_Angeles 2011-05-14 Spatial minimality 27 Ann Arbor Detroit Ionia Lyons Portland White Pigeon 1 >7 >4 >15 >17 1 # LocationsToponym Although Elisha Newman made the first land entry in the township of Portland (June, 1833), he did not become a settler until three years later, by which time a few settlers had located in the town. From Mr. Newman's story, it appears that early in 1833, he was visiting friends in Ann Arbor, and during an evening conversation discussed with others the subject of unlocated lands lying west of Ann Arbor. One of the company (Joseph Wood) remarked that he had been out with the party sent to survey Ionia and other counties, and that the surveyors were struck by the valuable water-power at the mouth of the Looking Glass River, saying there would surely be a village there some day. Mr. Newman was at once taken with the idea of locating lands at the mouth of the Looking Glass. Following up his impulse, he made ready to start at once, and, accompanied by James Newman and Joseph Wood, went out to the Looking Glass on a tour of inspection. Being satisfied with the location, he returned Eastward with his companions, and at White Pigeon made his land entry. Newman did not return for a permanent settlement until the spring of 1836, and meanwhile, in November, 1833, Philo Bogue bought a piece of land on section 28, in the bend of the Grand River, where he proposed to set up a trading post. Unaided he rolled up a log cabin near where the Detroit, Lansing, and Northern depot was located, and when he brought the house into decent shape went over to Hunt's at Lyons for his family, whom he had left there against such time as he should have affairs prepared for their comfort.
  • 42. © 2016 Jason M Baldridge MLConf, May 2016 GeoNames 4048392 Portland Mills Portland Mills 39.7781 -87.00918 P PPL US IN 133 0 223 218 America/Indiana/Indianapolis 2010-02-15 4084605 Portland Portland 32.15459 -87.1686 P PPL US AL 047 0 30 41 America/Chicago 2006-01-15 4127143 Portland Portland Portlend,Портленд33.2379 -91.51151 P PPL US AR 003 430 38 39 America/Chicago 2011-05-14 4169227 Portland Portland 30.51242 -86.19578 P PPL US FL 131 0 8 14 America/Chicago 2006-01-15 4217115 Portland Portland 34.05732 -85.03634 P PPL US GA 233 0 229 228 America/New_York 2010-09-05 4277586 Portland Portland 37.0778 -97.31227 P PPL US KS 191 0 362 364 America/Chicago 2006-01-15 4305000 Portland Portland 37.12062 -85.44608 P PPL US KY 001 0 220 223 America/Chicago 2006-01-15 4305001 Portland Portland 38.26924 -85.8108 P PPL US KY 111 0 135 138 America/Kentucky/Louisville 2006-01-15 4305002 Portland Portland 38.74812 -84.44772 P PPL US KY 191 0 265 266 America/New_York 2006-01-15 404289 Portland Portland Portlend,Портленд38.71088 -91.71767 P PPL US MO 027 0 170 172 America/Chicago 2010-01-29 4521811 Portland Portland Portlend,Портленд39.00341 -81.77124 P PPL US OH 105 0 187 188 America/New_York 2010-01-29 4650946 Portland Portland Portlend,Портленд36.58171 -86.51638 P PPL US TN 165 11480 244 245 America/Chicago 2011-05-14 4720131 Portland Portland Portlend,Портленд27.87725 -97.32388 P PPL US TX 409 15099 13 11 America/Chicago 2011-05-14 4841001 Portland Portland Portlend,Портленд41.57288 -72.64065 P PPL US CT 007 5862 24 27 America/New_York 2011-05-14 4871855 Portland Portland 43.12858 -93.12354 P PPL US IA 033 35 327 330 America/Chicago 2011-05-14 4906524 Portland Portland 41.66253 -89.98012 P PPL US IL 195 0 190 190 America/Chicago 2006-01-15 5006314 Portland Portland Portlend,Портленд42.8692 -84.90305 P PPL US MI 067 3883 221 223 America/Detroit 2011-05-14 5746545 Portland Portland 45.52345 -122.67621 P PPLA2 US OR 051 583776 12 15 America/Los_Angeles 2011-05-14 Spatial minimality 27 Portland LyonsIonia White Pigeon Ann Arbor Detroit Ionia Lyons Portland White Pigeon 1 >7 >4 >15 >17 1 # LocationsToponym Although Elisha Newman made the first land entry in the township of Portland (June, 1833), he did not become a settler until three years later, by which time a few settlers had located in the town. From Mr. Newman's story, it appears that early in 1833, he was visiting friends in Ann Arbor, and during an evening conversation discussed with others the subject of unlocated lands lying west of Ann Arbor. One of the company (Joseph Wood) remarked that he had been out with the party sent to survey Ionia and other counties, and that the surveyors were struck by the valuable water-power at the mouth of the Looking Glass River, saying there would surely be a village there some day. Mr. Newman was at once taken with the idea of locating lands at the mouth of the Looking Glass. Following up his impulse, he made ready to start at once, and, accompanied by James Newman and Joseph Wood, went out to the Looking Glass on a tour of inspection. Being satisfied with the location, he returned Eastward with his companions, and at White Pigeon made his land entry. Newman did not return for a permanent settlement until the spring of 1836, and meanwhile, in November, 1833, Philo Bogue bought a piece of land on section 28, in the bend of the Grand River, where he proposed to set up a trading post. Unaided he rolled up a log cabin near where the Detroit, Lansing, and Northern depot was located, and when he brought the house into decent shape went over to Hunt's at Lyons for his family, whom he had left there against such time as he should have affairs prepared for their comfort.
  • 43. © 2016 Jason M Baldridge MLConf, May 2016 Spatial minimality often fails 28 I moved from Encinitas, CA, a nice beach town in North San Diego County to Asheville, NC. By far, Ashville is more hip, especially West Asheville. Asheville has a lot in common with Portland. Austin, I've never been to so I cannot comment. But what makes a place cool and hip, in my opinion are that give a area "punch". There are a lot of ingredients. One is geography. Add a college or university (and all that they bring- and draw), good restaurants, a good music scene, a progressive attitude and tolerance. Hmmm. I'm sure there are many more to ponder. But that's my start. Oh, lots of bars! From: http://www.city-data.com/forum/austin/1694181-what-makes-city-like-austin-portland-3.html City-data.com incorrectly marks “West” and “Portland” as the cities in Texas -- presumably because of their textual and spatial proximity to “Austin”.
  • 44. © 2016 Jason M Baldridge MLConf, May 2016 Spatial minimality often fails 28 I moved from Encinitas, CA, a nice beach town in North San Diego County to Asheville, NC. By far, Ashville is more hip, especially West Asheville. Asheville has a lot in common with Portland. Austin, I've never been to so I cannot comment. But what makes a place cool and hip, in my opinion are that give a area "punch". There are a lot of ingredients. One is geography. Add a college or university (and all that they bring- and draw), good restaurants, a good music scene, a progressive attitude and tolerance. Hmmm. I'm sure there are many more to ponder. But that's my start. Oh, lots of bars! From: http://www.city-data.com/forum/austin/1694181-what-makes-city-like-austin-portland-3.html City-data.com incorrectly marks “West” and “Portland” as the cities in Texas -- presumably because of their textual and spatial proximity to “Austin”. But: it is clear from the text that Portland, Oregon and Austin,Texas are the referents, though their states are never mentioned and are far from the other locations! I moved from Encinitas, CA, a nice beach town in North San Diego County to Asheville, NC. By far, Ashville is more hip, especially West Asheville. Asheville has a lot in common with Portland. Austin, I've never been to so I cannot comment. But what makes a place cool and hip, in my opinion are that give a area "punch". There are a lot of ingredients. One is geography. Add a college or university (and all that they bring- and draw), good restaurants, a good music scene, a progressive attitude and tolerance. Hmmm. I'm sure there are many more to ponder. But that's my start. Oh, lots of bars!
  • 45. © 2016 Jason M Baldridge MLConf, May 2016 Toponym classifiers 29 Strategy: build a textual classifier per toponym by obtaining indirectly labeled examples from Wikipedia.
  • 46. © 2016 Jason M Baldridge MLConf, May 2016 Toponym classifiers 29 Strategy: build a textual classifier per toponym by obtaining indirectly labeled examples from Wikipedia.
  • 47. © 2016 Jason M Baldridge MLConf, May 2016 Toponym classifiers 29 Strategy: build a textual classifier per toponym by obtaining indirectly labeled examples from Wikipedia.
  • 48. © 2016 Jason M Baldridge MLConf, May 2016 Toponym classifiers 29 Strategy: build a textual classifier per toponym by obtaining indirectly labeled examples from Wikipedia.
  • 49. © 2016 Jason M Baldridge MLConf, May 2016 Toponym classifiers 29 Strategy: build a textual classifier per toponym by obtaining indirectly labeled examples from Wikipedia.
  • 50. © 2016 Jason M Baldridge MLConf, May 2016 Toponym classifiers 29 Strategy: build a textual classifier per toponym by obtaining indirectly labeled examples from Wikipedia.
  • 51. © 2016 Jason M Baldridge MLConf, May 2016 Toponym classifiers 29 Strategy: build a textual classifier per toponym by obtaining indirectly labeled examples from Wikipedia. P(Portland-OR|music) > P(Portland-ME|music) P(Portland-OR|wharf ) < P(Portland-ME|wharf )
  • 52. © 2016 Jason M Baldridge MLConf May 2016 Geographic word profiles learned from Wikipedia grid LMs 30
  • 53. © 2016 Jason M Baldridge MLConf May 2016 TopoCluster: gazetteer-free toponym resolution 31
  • 54. © 2016 Jason M Baldridge MLConf, May 2016 Results: disambiguating toponyms 32 TR-CoNLL LGL CWar WoTR Population 91 63 62 64 SPIDER (spatial minimality) 65 68 67 69 WISTR (Wiki supervised) 89 64 73 63 SPIDER +WISTR 87 78 87 69 TopoCluster 92 75 93 71
  • 55. © 2016 Jason M Baldridge MLConf May 2016 New corpus: War of the Rebellion 33 Historical document collection. Both toponym and document lat/lon. 10,380 annotated toponyms, including lat/lon and regions. Will be freely available. Coming soon! Contact me for details.
  • 56. © 2016 Jason M Baldridge MLConf, May 2016 Grounding, more generally 34 Grounding often involves connecting text to knowledge sources and other modalities such as demographics, time, image, and video.
  • 57. © 2016 Jason M Baldridge MLConf, May 2016 Grounding, more generally 34 Grounding often involves connecting text to knowledge sources and other modalities such as demographics, time, image, and video. This can help us create models for deeper aspects of language, such as syntactic structure and logical form.
  • 58. He says, she says http://www.tweetolife.com/gender/
  • 59. © 2016 Jason M Baldridge MLConf, May 2016 Temporality of words, by hour http://www.tweetolife.com/hour/ 36
  • 60. © 2016 Jason M Baldridge MLConf, May 2016 Temporality of words, by hour http://www.tweetolife.com/hour/ 36
  • 61. © 2016 Jason M Baldridge MLConf, May 2016 Temporality of expressions, by day: http://www.google.com/trends 37
  • 62. © 2016 Jason M Baldridge MLConf, May 2016 Temporality of expressions, by day: http://www.google.com/trends 37
  • 63. © 2016 Jason M Baldridge MLConf, May 2016 Temporality of expressions, by year: http://ngrams.googlelabs.com/ 38 slave trenches aircraft war
  • 64. © 2016 Jason M Baldridge MLConf, May 2016 Temporal resolution [Kumar, Lease, and Baldridge 2011] 39 2000BC 0AD 2000AD 4000BC
  • 65. © 2016 Jason M Baldridge MLConf, May 2016 Temporal resolution [Kumar, Lease, and Baldridge 2011] 39 2000BC 0AD 2000AD 4000BC
  • 66. © 2016 Jason M Baldridge MLConf, May 2016 Temporal resolution [Kumar, Lease, and Baldridge 2011] 39 2000BC 0AD 2000AD 4000BC
  • 67. © 2016 Jason M Baldridge MLConf, May 2016 Temporal resolution [Kumar, Lease, and Baldridge 2011] 39 2000BC 0AD 2000AD 4000BC
  • 68. © 2016 Jason M Baldridge MLConf, May 2016 Temporal resolution [Kumar, Lease, and Baldridge 2011] 39 2000BC 0AD 2000AD 4000BC
  • 69. © 2016 Jason M Baldridge MLConf, May 2016 Temporal resolution [Kumar, Lease, and Baldridge 2011] 39 2000BC 0AD 2000AD 4000BC
  • 70. © 2016 Jason M Baldridge MLConf, May 2016 More modalities: videos [Motwani & Mooney, 2012] 40
  • 71. © 2016 Jason M Baldridge MLConf, May 2016 Beyond word co-occurences for vector-space models 41 bear boat car cow hadoop snow water wrench 3 234 42 4 1 2 325 0 beach
  • 72. © 2016 Jason M Baldridge MLConf, May 2016 Beyond word co-occurences for vector-space models 41 bear boat car cow hadoop snow water wrench 3 234 42 4 1 2 325 0 beach
  • 73. © 2016 Jason M Baldridge MLConf, May 2016 Beyond word co-occurences for vector-space models 41 bear boat car cow hadoop snow water wrench 3 234 42 4 1 2 325 0 beach
  • 74. © 2016 Jason M Baldridge MLConf, May 2016 Beyond word co-occurences for vector-space models 41 bear boat car cow hadoop snow water wrench 3 234 42 4 1 2 325 0 beach
  • 75. © 2016 Jason M Baldridge MLConf, May 2016 Beyond word co-occurences for vector-space models 41 bear boat car cow hadoop snow water wrench 3 234 42 4 1 2 325 0 beach
  • 76. © 2016 Jason M Baldridge MLConf, May 2016 Beyond word co-occurences for vector-space models 41 bear boat car cow hadoop snow water wrench 3 234 42 4 1 2 325 0 beach
  • 77. © 2016 Jason M Baldridge MLConf, May 2016 Beyond word co-occurences for vector-space models 41 bear boat car cow hadoop snow water wrench 3 234 42 4 1 2 325 0 beach
  • 78. © 2016 Jason M Baldridge MLConf, May 2016 Beyond word co-occurences for vector-space models 41 bear boat car cow hadoop snow water wrench 3 234 42 4 1 2 325 0 beach
  • 79. © 2016 Jason M Baldridge MLConf, May 2016 Beyond word co-occurences for vector-space models 41 bear boat car cow hadoop snow water wrench 3 234 42 4 1 2 325 0 beach
  • 80. © 2016 Jason M Baldridge MLConf, May 2016 Combining distributional models with logics 42 Erk (2013):“Towards a semantics for distributional representations.” Garrette et al (2012):“A formal approach to linking logical form and vector-space lexical semantics” Beltagy et al (2013):“Montague Meets Markov: Deep Semantics with Probabilistic Logical Form”
  • 81. © 2016 Jason M Baldridge MLConf, May 2016 Multi-component structured vector-space models 43 beachchildren visit the children visit the beach Agent Patient
  • 82. This research was sponsored by: Grant from the Morris Memorial Trust Fund - Walt Whitman, A Song of the Rolling Earth (in Leaves of Grass) Final note:Whitman had it right many years ago! Thanks!
  • 83. Code and Data - Textgrounder: https://github.com/utcompling/textgrounder Publications - Benjamin Wing and Jason Baldridge. 2011. Simple supervised document geolocation with geodesic grids. In Proceedings of ACL HLT 2011. - Stephen Roller, Mike Speriosu, Sarat Rallapalli, Benjamin Wing and Jason Baldridge. 2012. Supervised Text-based Geolocation Using Language Models on an Adaptive Grid. EMNLP 2012. Jeju, Korea. - Benjamin Wing and Jason Baldridge 2014. Hierarchical Discriminative Classification for Text-Based Geolocation. EMNLP 2014. Document geolocation
  • 84. Code and data - Fieldspring: https://github.com/utcompling/fieldspring - TopoCluster: https://github.com/grantdelozier/TopoCluster Publications - Mike Speriosu and Jason Baldridge. 2013.Text-Driven Toponym Resolution using Indirect Supervision.ACL 2013. - Grant DeLozier, Jason Baldridge, and Loretta London. 2015. Gazetteer-Free Toponym Resolution Using Geographic Word Profiles.AAAI 2015. Toponym resolution