Preview

Proceedings of the Institute for System Programming of the RAS (Proceedings of ISP RAS)

Advanced search

Approaches to estimate location of social network users based on social graph

https://doi.org/10.15514/ISPRAS-2016-28(6)-13

Abstract

Many applications require information about the geolocation of users, which is not always available. Among the users of Twitter only about 26% indicate the name of the city in their profiles, about 30% of users of VKontakte leave this field blank. So there is the problem of determining the place of residence of social network users. We investigate approaches to geolocation of social network users using their mutual bidirectional ties - social graph. At first, we present a brief overview of the work in the field of geolocating users of social networks. Then we propose an approach that relies on graph nodes’ embeddings and supervised machine learning techniques. Series of experiments were conducted with proposed and baseline approaches. Experiments show that proposed approach is comparable with others. The results of experiments allow us to conclude that the proposed approach based on vector representation can be effectively used to determine the user's place of residence by itself, or in combination with classifiers based on user data It is worth noting that the proposed approach has no any specifics related to the geolocation. It can also be used to assess any other demographic attributes that influence the formation of relationships in society. Thus, a similar approach was used in Perozzi and Skiena to determine the age of the users.

About the Authors

Y. S. Trofimovich
Institute for System Programming of the Russian Academy of Sciences
Russian Federation


I. S. Kozlov
Institute for System Programming of the Russian Academy of Sciences
Russian Federation


D. Y. Turdakov
Institute for System Programming of the Russian Academy of Sciences; Lomonosov Moscow State University; National Research University Higher School of Economics (HSE)
Russian Federation


References

1. Takeshi Sakaki, Makoto Okazaki, Yutaka Matsuo. “Tweet Analysis for Real-Time Event Detection and Earthquake Reporting System Development”. IEEE Trans. on Knowl. and Data Eng. 25.4 (Apr. 2013), pp. 919–931.

2. Alex Lamb, Michael J Paul, Mark Dredze. “Separating Fact from Fear: Tracking Flu Infections on Twitter.” Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT), Atlanta. June 2013.

3. Adam Sadilek, Henry A Kautz, Vincent Silenzio. “Modeling Spread of Disease from Social Interactions.” Proceedings of the Sixth International Conference on Weblogs and Social Media, Dublin, Ireland, June 4-7, 2012.

4. Zhiyuan Cheng, James Caverlee, Kyumin Lee. “You Are Where You Tweet: A Content-based Approach to Geo-locating Twitter Users”. Proceedings of the 19th ACM International Conference on Information and Knowledge Management. CIKM ’10. Toronto, ON, Canada: ACM, 2010, pp. 759–768.

5. Afshin Rahimi et al. “Exploiting Text and Network Context for Geolocation of Social Media Users”. CoRR abs/1506.04803 (2015).

6. Bryan Perozzi, Steven Skiena. “Exact Age Prediction in Social Networks”. Proceedings of the 24th International Conference on World Wide Web. WWW ’15 Companion. Florence, Italy: ACM, 2015, pp. 91–92.

7. Lars Backstrom, Eric Sun, Cameron Marlow. “Find me if you can: improving geographical prediction with social and spatial proximity”. Proceedings of the 19th international conference on World wide web. ACM. 2010, pp. 61–70.

8. David Jurgens. “That’s What Friends Are For: Inferring Location in Online Social Media Platforms Based on Social Relationships”. ICWSM 13 (2013), pp. 273–282.

9. Yantao Jia et al. “Location Prediction: A Temporal-Spatial Bayesian Model”. ACM Transactions on Intelligent Systems and Technology (TIST) 7.3 (2016), p. 31.

10. Jinpeng Chen, Yu Liu, Ming Zou. “Home location profiling for users in social media”. Information & Management 53.1 (2016), pp. 135–143.

11. Fan RK Chung. “Spectral graph theory”. CBMS regional conference series in mathematics, No. 92 (1996).

12. Oleg U Ivanov, Sergey O Bartunov. “Learning Representations in Directed Networks”. International Conference on Analysis of Images, Social Networks and Texts. Springer. 2015, pp. 196–207.

13. Bryan Perozzi, Rami Al-Rfou, Steven Skiena. “Deepwalk: Online learning of social representations”. Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM. 2014, pp. 701–710.

14. T Mikolov, J Dean. “Distributed representations of words and phrases and their compositionality”. Advances in neural information processing systems (2013).

15. Michael U Gutmann, Aapo Hyvärinen. “Noise-contrastive estimation of unnormalized statistical models, with applications to natural image statistics”. Journal of Machine Learning Research 13.Feb (2012), pp. 307–361.

16. François Chollet. Keras. https://github.com/fchollet/keras. 2015.


Review

For citations:


Trofimovich Y.S., Kozlov I.S., Turdakov D.Y. Approaches to estimate location of social network users based on social graph. Proceedings of the Institute for System Programming of the RAS (Proceedings of ISP RAS). 2016;28(6):185-196. (In Russ.) https://doi.org/10.15514/ISPRAS-2016-28(6)-13



Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.


ISSN 2079-8156 (Print)
ISSN 2220-6426 (Online)