You are here

Analyzing Destination Choices of Tourists and Residents from Location Based Social Media Data

Download pdf | Full Screen View

Date Issued:
2018
Abstract/Description:
Ubiquitous uses of social media platforms in smartphones have created an opportunity to gather digital traces of individual activities at a large scale. Traditional travel surveys fall short in collecting longitudinal travel behavior data for a large number of people in a cost effective way, especially for the transient population such as tourists. This study presents an innovating methodological framework, using machine learning and econometric approaches, to gather and analyze location-based social media (LBSM) data to understand individual destination choices. First, using Twitter's search interface, we have collected Twitter posts of nearly 156,000 users for the state of Florida. We have adopted several filtering techniques to create a reliable sample from noisy Twitter data. An ensemble classification technique is proposed to classify tourists and residents from user coordinates. The performance of the proposed classifier has been validated using manually labeled data and compared against the state-of-the-art classification methods. Second, using different clustering methods, we have analyzed the spatial distributions of destination choices of tourists and residents. The clusters from tourist destinations revealed most popular tourist spots including emerging tourist attractions in Florida. Third, to predict a tourist's next destination type, we have estimated a Conditional Random Field (CRF) model with reasonable accuracy. Fourth, to analyze resident destination choice behavior, this study proposes an extensive data merging operation among the collected Twitter data and different geographic database from state level data libraries. We have estimated a Panel Latent Segmentation Multinomial Logit (PLSMNL) model to find the characteristics affecting individual destination choices. The proposed PLSMNL model is found to better explain the effects of variables on destination choices compared to trip-specific Multinomial Logit Models. The findings of this study show the potential of LBSM data in future transportation and planning studies where collecting individual activity data is expensive.
Title: Analyzing Destination Choices of Tourists and Residents from Location Based Social Media Data.
40 views
15 downloads
Name(s): Hasnat, Md Mehedi, Author
Hasan, Samiul, Committee Chair
Abdel-Aty, Mohamed, Committee Member
Eluru, Naveen, Committee Member
University of Central Florida, Degree Grantor
Type of Resource: text
Date Issued: 2018
Publisher: University of Central Florida
Language(s): English
Abstract/Description: Ubiquitous uses of social media platforms in smartphones have created an opportunity to gather digital traces of individual activities at a large scale. Traditional travel surveys fall short in collecting longitudinal travel behavior data for a large number of people in a cost effective way, especially for the transient population such as tourists. This study presents an innovating methodological framework, using machine learning and econometric approaches, to gather and analyze location-based social media (LBSM) data to understand individual destination choices. First, using Twitter's search interface, we have collected Twitter posts of nearly 156,000 users for the state of Florida. We have adopted several filtering techniques to create a reliable sample from noisy Twitter data. An ensemble classification technique is proposed to classify tourists and residents from user coordinates. The performance of the proposed classifier has been validated using manually labeled data and compared against the state-of-the-art classification methods. Second, using different clustering methods, we have analyzed the spatial distributions of destination choices of tourists and residents. The clusters from tourist destinations revealed most popular tourist spots including emerging tourist attractions in Florida. Third, to predict a tourist's next destination type, we have estimated a Conditional Random Field (CRF) model with reasonable accuracy. Fourth, to analyze resident destination choice behavior, this study proposes an extensive data merging operation among the collected Twitter data and different geographic database from state level data libraries. We have estimated a Panel Latent Segmentation Multinomial Logit (PLSMNL) model to find the characteristics affecting individual destination choices. The proposed PLSMNL model is found to better explain the effects of variables on destination choices compared to trip-specific Multinomial Logit Models. The findings of this study show the potential of LBSM data in future transportation and planning studies where collecting individual activity data is expensive.
Identifier: CFE0007012 (IID), ucf:52028 (fedora)
Note(s): 2018-05-01
M.S.
Engineering and Computer Science, Civil, Environmental and Construction Engineering
Masters
This record was generated from author submitted information.
Subject(s): Tourist -- Resident -- Destination Choice -- Location Based Social Media Data
Persistent Link to This Record: http://purl.flvc.org/ucf/fd/CFE0007012
Restrictions on Access: public 2018-05-15
Host Institution: UCF

In Collections