TY - JOUR
T1 - Predicting Traveling Distances and Unveiling Mobility and Activity Patterns of Individuals from Multisource Data
AU - Gkiotsalitis, Konstantinos
AU - Stathopoulos, Antony
PY - 2020/5/1
Y1 - 2020/5/1
N2 - This work investigates whether the user-generated data from multiple sources, such as smart cards and social media, can be used to identify main mobility/activity patterns based solely on geo-tagged information. To perform such an analysis, automated models are developed to (1) retrieve user mobility patterns from historical, user-generated data logs, (2) categorize users based on the similarity of their observed mobility patterns, and (3) predict the travel distances of users for participating in future activities. For testing purposes, user-generated data sets from smart card logs and Twitter profiles collected between November 2013 and February 2015 in London are used. User-generated data from 200 smart card and 32 active Twitter users are collected and 6 main clusters are identified based on the mobility/activity pattern similarities of users. Results show that it is possible to integrate data logs from multiple sources to capture the main mobility/activity patterns observed in an area. Results also reveal that the accuracy of the predicted travel distance of one user’s trip can be significantly improved if the user’s previous activities are considered in the prediction process.
AB - This work investigates whether the user-generated data from multiple sources, such as smart cards and social media, can be used to identify main mobility/activity patterns based solely on geo-tagged information. To perform such an analysis, automated models are developed to (1) retrieve user mobility patterns from historical, user-generated data logs, (2) categorize users based on the similarity of their observed mobility patterns, and (3) predict the travel distances of users for participating in future activities. For testing purposes, user-generated data sets from smart card logs and Twitter profiles collected between November 2013 and February 2015 in London are used. User-generated data from 200 smart card and 32 active Twitter users are collected and 6 main clusters are identified based on the mobility/activity pattern similarities of users. Results show that it is possible to integrate data logs from multiple sources to capture the main mobility/activity patterns observed in an area. Results also reveal that the accuracy of the predicted travel distance of one user’s trip can be significantly improved if the user’s previous activities are considered in the prediction process.
KW - 22/4 OA procedure
U2 - 10.1061/JTEPBS.0000336
DO - 10.1061/JTEPBS.0000336
M3 - Article
SN - 2473-2907
VL - 146
JO - Journal of Transportation Engineering, Part A: Systems
JF - Journal of Transportation Engineering, Part A: Systems
IS - 5
M1 - 04020025
ER -