ABSTRACT
Little research exists on one of the most common, oldest, and most utilized forms of online social geographic information: the 'location' field found in most virtual community user profiles. We performed the first in-depth study of user behavior with regard to the location field in Twitter user profiles. We found that 34% of users did not provide real location information, frequently incorporating fake locations or sarcastic comments that can fool traditional geographic information tools. When users did input their location, they almost never specified it at a scale any more detailed than their city. In order to determine whether or not natural user behaviors have a real effect on the 'locatability' of users, we performed a simple machine learning experiment to determine whether we can identify a user's location by only looking at what that user tweets. We found that a user's country and state can in fact be determined easily with decent accuracy, indicating that users implicitly reveal location information, with or without realizing it. Implications for location-based services and privacy are discussed.
Supplemental Material
Available for Download
Slides from the presentation
- Backstrom, L., Sun, E. and Marlow, C. Find Me If You Can: Improving Geographical Prediction with Social and Spatial Proximity. WWW '10, Raleigh, NC. Google ScholarDigital Library
- Barkhuus, L., Brown, B., Bell, M., Hall, M., Sherwood, S. and Chalmers, M. From Awareness to Repartee: Sharing Location within Social Groups. CHI '08, Florence, Italy, 497--506. Google ScholarDigital Library
- boyd, d., Golder, S. and Lotan, G. Tweet, Tweet, Retweet: Conversational Aspects of Retweeting on Twitter. HICSS '10, Kauai, HI. Google ScholarDigital Library
- Miller, C. and Wortham, J. Technology Aside, Most People Still Decline to Be Located. The New York Times. August 30, 2010.Google Scholar
- Fellman, J.D., Getis, A. and Getis, J. Human Geography: Landscapes of Human Activities. Mcgraw- Hill Higher Education, 2007.Google Scholar
- Gabrilovich, E. and Markovitch, S. Computing Semantic Relatedness using Wikipedia-based Explicit Semantic Analysis. IJCAI '07, Hyberabad, 1606--1611. Google ScholarDigital Library
- Hecht, B. and Gergle, D. A Beginner's Guide to Geographic Virtual Communities Research. Handbook of Research on Methods and Techniques for Studying Virtual Communities, IGI, 2010.Google Scholar
- Hecht, B. and Gergle, D. On The "Localness" Of User- Generated Content. CSCW '10, Savannah, Georgia. Google ScholarDigital Library
- Hecht, B. and Moxley, E. Terabytes of Tobler: Evaluating the First Law in a Massive, Domain-Neutral Representation of World Knowledge. COSIT '09, L'Aber Wrac'h, France, 88--105. Google ScholarDigital Library
- Honeycutt, C. and Herring, S.C. Beyond Microblogging: Conversation and Collaboration via Twitter. HICCS '09.Google Scholar
- Java, A., Song, X., Finin, T. and Tseng, B. Why We Twitter: Understanding Microblogging Usage and Communities. Joint 9th WEBKDD and 1st SNA-KDD Workshop '07, San Jose, CA, 56--65. Google ScholarDigital Library
- Krishnamurthy, B., Gill, P. and Arlitt, M. A Few Chirps about Twitter. First Workshop on Online Social Networks, Seattle, WA, 19--24, 2008. Google ScholarDigital Library
- Lieberman, M.D. and Lin, J. You Are Where You Edit: Locating Wikipedia Users Through Edit Histories. ICWSM '09, San Jose, CA.Google Scholar
- Lindamood, J., Heatherly, R., Kantarcioglu, M. and Thuraisingham, B. Inferring Private Information Using Social Network Data. WWW '09, Madrid, Spain. Google ScholarDigital Library
- Ludford, P.J., Priedhorsky, R., Reily, K. and Terveen, L.G. Capturing, Sharing, and Using Local Place Information. CHI '07, San Jose, CA, 1235--1244. Google ScholarDigital Library
- McCallum, A. and Nigam, K. A Comparison of Event Models for Naive Bayes Text Classification. AAAI-98 Workshop on Learning for Text Categorization, 41--48.Google Scholar
- Naaman, M., Boase, J. and Lai, C.-H. Is it Really About Me? Message Content in Social Awareness Streams. CSCW '10, Savannah, GA. Google ScholarDigital Library
- Panciera, K., Priedhorsky, R., Erickson, T. and Terveen, L.G. Lurking? Cyclopaths? A Quantitative Lifecycle Analysis of User Behavior in a Geowiki. CHI '10, Atlanta, GA, 1917--1926. Google ScholarDigital Library
- Popescu, A. and Grefenstette, G. Mining User Home Location and Gender from Flickr Tags. ICWSM '10.Google Scholar
- Priedhorsky, R. and Terveen, L.G. The Computational Geowiki: What, Why, and How. CSCW '08, San Diego. Google ScholarDigital Library
- Reddy, S., Shilton, K., Denisov, G., Cenizal, C., Estrin, D. and Srivastava, M. Biketastic: Sensing and Mapping for Better Biking. CHI '10, Atlanta, GA, 1817--1820. Google ScholarDigital Library
- Sakaki, T., Okazaki, M. and Matsuo, Y. Earthquake Shakes Twitter Users: Real-time Event Detection by Social Sensors. WWW '10, Raleigh, NC. Google ScholarDigital Library
- Schonfeld, E. Costolo: Twitter Now Has 190 Million Users Tweeting 65 Million Times A Day. TechCrunch. http://techcrunch.com/2010/06/08/twitter-190-million-users/, 2010.Google Scholar
- Starbird, K., Palen, L., Hughes, A.L. and Vieweg, S. Chatter on The Red: What Hazards Thread Reveals about the Social Life of Microblogged Information. CSCW '10, Savannah, GA. Google ScholarDigital Library
- Twitter. Location, Location, Location. twitterblog. http://blog.twitter.com/2009/08/location-location- location.html, 2009.Google Scholar
- Vieweg, S., Hughes, A.L., Starbird, K. and Palen, L. Microblogging During Two Natural Hazards Events: What Twitter May Contribute to Situational Awareness. CHI '10, Atlanta, GA, 1079--1088. Google ScholarDigital Library
Index Terms
- Tweets from Justin Bieber's heart: the dynamics of the location field in user profiles
Recommendations
A method for predicting future location of mobile user for location-based services system
Convergence of location-aware devices, wireless communications, and geographic information system (GIS) functionalities has been enabling the deployment of a new generation of selective information disseminating services and location-based services (...
Beware of What You Share: Inferring Home Location in Social Networks
ICDMW '12: Proceedings of the 2012 IEEE 12th International Conference on Data Mining WorkshopsIn recent years, social media users are voluntarily making large volume of personal data available on the social networks. Such data (e.g., professional associations) can create opportunities for users to strengthen their social and professional ties. ...
A predictive location model for location-based services
GIS '03: Proceedings of the 11th ACM international symposium on Advances in geographic information systemsLocation-Based Services (LBSs) utilize information about users' locations through location-aware mobile devices to provide services, such as nearest features of interest, they request. This is a common strategy in LBSs and although it is needed and ...
Comments