Jason Baldridge, Co-Founder and Chief Scientist at People Pattern and Associate Professor of Computational Linguistics at the University of Texas at Austin, shares recent research of his from UT Austin on text-based geolocation using Wikipedia, Twitter and other sources.
96. This research was sponsored by:
Grant from the Grant: W911NF-10-1-0533
Morris Memorial Trust Fund
Final note: Whitman had it right many years ago!
- Walt Whitman, A Song of the Rolling Earth (in Leaves of Grass)
Friday, September 5, 14
97. Document geolocation
Supervision
- documents labeled with latitude & longitude
Methods
- Language Modeling for Information Retrieval
Code
- Textgrounder: https://github.com/utcompling/textgrounder
Publications
- Stephen Roller, Mike Speriosu, Sarat Rallapalli, Benjamin Wing and Jason Baldridge. 2012. Supervised Text-based
Geolocation Using Language Models on an Adaptive Grid. EMNLP 2012. Jeju, Korea.
- Benjamin Wing and Jason Baldridge. 2011. Simple supervised document geolocation with geodesic grids. In
Proceedings of ACL HLT 2011.
Friday, September 5, 14
98. Toponym resolution
Supervision
- indirectly acquired toponym annotations using a gazeteer
and geo-annotated Wikipedia
Methods
- logistic regression
- named entity recognition
Code
- Fieldspring: https://github.com/utcompling/fieldspring
Publications
- Mike Speriosu and Jason Baldridge. Text-Driven Toponym Resolution using Indirect Supervision. To appear in
proceedings of ACL 2013.
Friday, September 5, 14