Clustering Activity–Travel Behavior Time Series using Topological Data Analysis
/ Authors
/ Abstract
Over the last few years, traffic data have been exploding and the transportation discipline has entered the era of big data. It brings out new opportunities for doing data-driven analysis, but it also challenges traditional analytic methods. This paper proposes a new divide and combine-based approach to do K-means clustering on activity–travel behavior time series using features that are derived using tools in time series analysis and topological data analysis. Our approach facilitates a case study, where each individual’s daily activity–travel behavior is characterized as a categorical time series consisting of three different levels. Clustering data from five waves of the National Household Travel Survey ranging from 1990 to 2017 suggests that activity–travel patterns of individuals over the last 3 decades can be grouped into three clusters. Results also provide evidence in support of recent claims about differences in activity–travel patterns of different survey cohorts. The proposed method is generally applicable and is not limited only to activity–travel behavior analysis in transportation studies. Driving behavior, travel mode choice, household vehicle ownership, when being characterized as categorical time series, can all be analyzed using the proposed method.
Journal: Journal of Big Data Analytics in Transportation