No selfies, please: An analysis of texts and images from Reddit

Rafael Almeida de Oliveira

Abstract


ABSTRACT

Purpose: This research will analyse the main posts from Reddit, more specifically the subreddit dedicated to travel known as r/travel.

Methodology: The 946 most relevant posts and their 891 images were collected by web scraping and their titles were analysed and gathered using the image classification technique.

Findings: Results show that most Reddit posts do not have people in the pictures, drifting away from the popular selfies that are ubiquitous in other social networks. It is believed that users prefer to post images on Reddit without people due to its anonymous appeal. Moreover, there is an evident focus on images associated with natural environments and less so with urban environments; also, images taken at or around cities and countries out of the beaten track are favoured and it supports the idea that Reddit users search for less-known destinations to guarantee exclusivity during the tourist’s experience.

Originality/Value: Even though there is a high number of tourism research connected to social networks, there is none that tries to understand Reddit users' approach to tourism. It is also important to highlight that there are few studies using the image classification technique towards the identification and quantification of objects included in travel photography.

Keywords: Tourism, Web scraping, Image classification.


Full Text:

PDF

References


Andrejevic, M. (2014). The big data divide. International Journal of Communication, 8(1), 1673–1689.

Amazon (2020). Amazon SageMaker. https://aws.amazon.com/sagemaker/data-scientist/

Babbar, S., Dewan, N., Shangle, K., Kulshrestha, S., & Patel, S. (2020). Cross-Age Face Recognition using Deep Residual Networks. 257–262.

Chang, M., Xing, Y. Y., Zhang, Q. Y., Han, S. J., & Kim, M. (2020). A CNN image classification analysis for “clean-coast detector” as tourism service distribution. Journal of Distribution Science, 18(1), 15–26.

Chen, H., Chiang, R. H. L., & Storey, V. C. (2012). Business intelligence and analytics: from Big Data to big impact. Mis Quarterly, 36(4), 1165–1188.

Chen, Q., Song, Z., Dong, J., Huang, Z., Hua, Y., & Yan, S. (2015). Contextualizing object detection and classification. IEEE Transactions on Pattern Analysis and Machine Intelligence, 37(1), 13–27.

De Ascaniis, S., & Gretzel, U. (2013). Communicative functions of online travel review titles. A pragmatic and linguistic investigation of destination and attraction OTR titles. Studies in

Communication Sciences, 13(2), 156–165.

De Choundhury, M., & De, S. (2014, May). Mental health discourse on reddit: Self-disclosure, social support, and anonymity. In Eighth international AAAI conference on weblogs and

social media., 71-80.

Devika, K., & Surendran, S. (2013). An overview of web data extraction techniques. International Journal of Scientific Engineering and Technology, 2(4), 278–287.

Dey, A. (2016). Machine learning algorithms: a review. International Journal of Computer Science and Information Technologies, 7(3), 1174–1179.

Dinhopl, A., & Gretzel, U. (2016). Selfie-taking as touristic looking. Annals of Tourism Research, 57, 126–139.

Douglas, N. (2014). It’s supposed to look like shit: The Internet ugly aesthetic. Journal of visual culture, 13(3), 314-339.

Druzhkov, P. N., & Kustikova, V. D. (2016). A survey of deep learning methods and software tools for image classification and object detection. Pattern Recognition and Image Analysis, 26(1), 9–15.

Gretzel, U. (2017). The visual turn in social media marketing. Tourismos: An International Multidisciplinary Journal of Tourism, 12(3), 01–18.

Han, W., McCabe, S., Wang, Y., & Chong, A. Y. L. (2018). Evaluating user-generated content in social media: an effective approach to encourage greater pro-environmental behavior in tourism? Journal of Sustainable Tourism, 26(4), 600–614.

Jamnik, M. R., & Lane, D. J. (2017). The use of Reddit as an inexpensive source for high-quality data. Practical Assessment, Research and Evaluation, 22(5), 1–10.

Jansson, A. (2018). Rethinking post-tourism in the age of social media. Annals of Tourism Research, 69, 101–110.

Lam, J. M. S., Ismail, H., & Lee, S. (2020). From desktop to destination: User-generated content platforms, co-created online experiences, destination image and satisfaction. Journal of Destination Marketing and Management, 18(July).

Landers, R. N., Brusso, R. C., Cavanaugh, K. J., & Collmus, A. B. (2016). A primer on theory-driven web scraping: automatic extraction of Big Data from the internet for use in Psychological Research Richard. Psychological Methods, 21(4), 475–492.

Marres, N., & Weltevrede, E. (2013). Scraping the social? Journal of Cultural Economy, 6(3), 313–335.

Massanari, A. (2013). Playful participatory culture: learning from Reddit. Selected Papers of Internet Research, 3, 1–7.

Maulik, U., & Chakraborty, D. (2018). Remote sensing image Classification: A survey of supportvector-machine-based advanced techniques. IEE Geoscience and Remote Sensing Magazine, XLII, 33–52.

Mcafee, A., & Brynjolfsson, E. (2012). Big data: the management revolution. Harvard Business Review, October, 1–9.

Medvedev, A. N., Lambiotte, R., & Delvenne, J. C. (2019). The anatomy of Reddit: an Overview of academic research. Springer Proceedings in Complexity, 183–204.

Munar, A. M., & Jacobsen, J. K. S. (2014). Motivations for sharing tourism experiences through social media. Tourism Management, 43, 46–54.

Narangajavana Kaosiri, Y., Callarisa Fiol, L. J., Moliner Tena, M. Á.,

Rodríguez Artola, R. M., & Sánchez García, J. (2019). User-generated content sources in social media: a new approach to explore tourist satisfaction. Journal of Travel Research, 58(2), 253–265.

Oliveira, R. A. de, & Baracho, R. M. A. (2018). The development of tourism indicators through the use of social media data: The case of minas Gerais, Brazil. Information Research, 23(4).

Ovadia, S. (2015). More than just cat pictures: reddit as a curated news source. Behavioural and Social Sciences Librarian, 34(1), 37–40.

Pathak, A. R., Pandey, M. and Rautaray, S. (2018). Application of deep learning for object detection. Procedia Computer Science, 132(Iccids), 1706–1717.

Pliakos, K., & Kotropoulos, C. (2014). PLSA driven image annotation, classification, and tourism recommendation. 2014 IEEE International Conference on Image Processing, ICIP 2014, 3003–3007.

Proferes, N., Jones, N., Gilbert, S., Fiesler, C. and Zimmer, M. (2021). Studying reddit: A systematic overview of disciplines, approaches, methods, and ethics. Social Media+ Society, 7(2), 1-14.

Rawat, W., & Wang, Z. (2017). Deep convolutional neural networks for image classification: a comprehensive review. Neural Computation, 29, 2709–2733.

Sathya, R., & Abraham, A. (2013). Comparison of supervised and unsupervised learning algorithms for pattern classification. International Journal of Advanced Research in Artificial Intelligence, 2(2), 34–38.

Sigala, M. (2016). Social media and the co-creation of tourism experiences. The Handbook of Managing and Marketing Tourism Experiences, 85–111.

Singer, P., Flöck, F., Meinhart, C., Zeitfogel, E., & Strohmaier, M. (2014). Evolution of Reddit: From the front page of the internet to a self-referential community? WWW 2014 Companion - Proceedings of the 23rd International Conference on World Wide Web, 517–522.

Sotiriadis, M. D. (2017). Sharing tourism experiences in social media: A literature review and a set of suggested business strategies. International Journal of Contemporary Hospitality Management, 29(1), 179–225.

Statista (2022). Reddit - Statistics & Facts. https://www.statista.com/topics/5672/reddit/

Ukpabi, D. C., & Karjaluoto, H. (2018). What drives travelers’ adoption of user-generated content? A literature review. Tourism Management Perspectives, 28, 251–273.

Vargiu, E., & Urru, M. (2012). Exploiting web scraping in a collaborative filtering- based

approach to web advertising. Artificial Intelligence Research, 2(1), 44–54.

Weninger, T., Zhu, X. A., & Han, J. (2013). An exploration of discussion threads in social news sites: A case study of the Reddit community. Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2013, 579–583.

Zhang, Y., Gao, J., Cole, S., & Ricci, P. (2021). How the spread of User-Generated Contents (UGC) shapes international tourism distribution: using Agent-Based Modeling to inform strategic UGC marketing. Journal of Travel Research, 60(7), 1469–1491.




Copyright (c) 2022 European Journal of Applied Business and Management

 

European Journal of Applied Business and Management

ISSN: 2183-5594

DOI: https://doi.org/10.58869/EJABM

Indexing:

EBSCO | CROSSREF | GOOGLE SCHOLAR | LATINDEX | DRJI | ICI JOURNALS MASTER | REDIB | MIAR