ABSTRACT
Online social networks are now a popular way for users to connect, express themselves, and share content. Users in today's online social networks often post a profile, consisting of attributes like geographic location, interests, and schools attended. Such profile information is used on the sites as a basis for grouping users, for sharing content, and for suggesting users who may benefit from interaction. However, in practice, not all users provide these attributes.
In this paper, we ask the question: given attributes for some fraction of the users in an online social network, can we infer the attributes of the remaining users? In other words, can the attributes of users, in combination with the social network graph, be used to predict the attributes of another user in the network? To answer this question, we gather fine-grained data from two social networks and try to infer user profile attributes. We find that users with common attributes are more likely to be friends and often form dense communities, and we propose a method of inferring user attributes that is inspired by previous approaches to detecting communities in social networks. Our results show that certain user attributes can be inferred with high accuracy when given information on as little as 20% of the users.
- L. Adamic and E. Adar. Friends and neighbors on the web. Social Networks, 25(3):211--230, 2003.Google ScholarCross Ref
- R. Andersen and K.J. Lang. Communities from seed sets. In Proc. WWW'06, Edinburgh, Scotland, May 2006. Google ScholarDigital Library
- J.P. Bagrow. Evaluating local community methods in networks. J. Stat. Mech., 2008(5), 2008.Google ScholarCross Ref
- A. Clauset. Finding local community structure in networks. Physical Review E, 72, 2005.Google Scholar
- A. Clauset, M.E.J. Newman, and C. Moore. Finding community structure in very large networks. Physical Review E, 70(6), 2004.Google ScholarCross Ref
- O. Simsek and D. Jensen. Navigating networks by using homophily and degree. PNAS, 105(35):12758--12762, September 2008.Google ScholarCross Ref
- Facebook. http://www.facebook.com.Google Scholar
- A.T. Fiore and J.S. Donath. Homophily in online dating: when do you like someone like yourself? In Proc. CHI'05, Portland, USA, 2005. Google ScholarDigital Library
- A.L.N. Fred and A.K. Jain. Robust data clustering. In Proc. CVPR'03, pages 128--133, June 2003.Google Scholar
- L. Friedland and D. Jensen. Finding tribes: identifying close-knit individuals from employment patterns. In Proc. KDD'07, San Jose, California, USA, Aug 2007. Google ScholarDigital Library
- D. Jensen and J. Neville. Data mining in social networks. In Dynamic Social Network Modeling and Analysis: Workshop Summary and Papers, pages 287--302, 2003.Google Scholar
- R. Kannan, S. Vempala, and A. Vetta. On clusterings: Good, bad and spectral. Journal of the ACM, 51(3):497--515, May 2004. Google ScholarDigital Library
- F. Luo, J.Z. Wang, and E. Promislow. Exploring local community structures in large networks. Web Intelligent and Agent Systems, 6(4):387--400, 2008. Google ScholarDigital Library
- D. Lusseau and M.E.J. Newman. Identifying the role that individual animals play in their social network. Proc. R. Soc. London B, 271:S477, 2004.Google ScholarCross Ref
- A. Mislove, M. Marcon, K.P. Gummadi, P. Druschel, and B. Bhattacharjee. Measurement and Analysis of Online Social Networks. In Proc. IMC'07, San Diego, CA, October 2007. Google ScholarDigital Library
- M.E.J. Newman. Birds of a feather: Homophily in social networks. Annual Review of Sociology, 27:415--444, August 2001.Google ScholarCross Ref
- M.E.J. Newman. Fast algorithm for detecting community structure in networks. Physical Review E, 69(6), 2004.Google ScholarCross Ref
- M.E.J. Newman and M. Girvan. Finding and evaluating community structure in networks. Physical Review E, 69:026113, 2004.Google ScholarCross Ref
- F. Radicchi, C. Castellano, F. Cecconi, V. Loreto, and D. Parisi. Defining and identifying communities in networks. PNAS, 101(9):2658--2663, March 2004.Google ScholarCross Ref
- Rice Culture. http://www.professor.rice.edu/professor/Rice_Culture.asp?SnID=165470151.Google Scholar
- Rice University Alumni Directory. https://online.alumni.rice.edu/directory/detailsearch.asp.Google Scholar
- Rice University Student Directory. http://www.rice.edu/search/query.php?advanced=1&tab=people.Google Scholar
- J.R. Tyler, D.M. Wilkinson, and B.A. Huberman. Email as spectroscopy: Automated discovery of community structure within organizations. In Proc. ICCT'03, Dordrecht, 2003. Google ScholarDigital Library
- E. Zheleva and L. Getoor. To join or not to join: The illusion of privacy in social networks with mixed public and private user profiles. In Proc. WWW'09, Madrid, Spain, May 2009. Google ScholarDigital Library
Index Terms
- You are who you know: inferring user profiles in online social networks
Recommendations
Analyzing the Proximity and Interactions of Friends in Communities in Gowalla
ICDMW '13: Proceedings of the 2013 IEEE 13th International Conference on Data Mining WorkshopsWe collected friendship information and location data from a social media website called Go Walla to analyze the relationship between geographical space and friendship. First, we analyzed how geographic proximity shapes the structure of the social ...
Where You Go Reveals Who You Know: Analyzing Social Ties from Millions of Footprints
CIKM '15: Proceedings of the 24th ACM International on Conference on Information and Knowledge ManagementThis paper aims to investigate how the geographical footprints of users correlate to their social ties. While conventional wisdom told us that the more frequently two users co-locate in geography, the higher probability they are friends, we find that in ...
On commenting behavior of Facebook users
HT '13: Proceedings of the 24th ACM Conference on Hypertext and Social MediaFacebook treats friends as a single homogeneous group even though people on Facebook are possibly acquainted with diverse group of individuals and perceive their friends as representatives of different groups. It is a common observation that people tend ...
Comments