@Learning to Discover Causes of Traffic Congestion with Limited Labeled Data

[[Abstract]] 拥堵成因解释

  • traffic congestion

  • Traffic congestion incurs long delay in travel time, which seriously affects our daily travel experiences. Exploring why traffic congestion occurs is significantly important to effectively address the problem of traffic congestion and improve user experience.

  • Traditional approaches to mine the congestion causes depend on human efforts, which is time consuming and cost-intensive.

  • Hence, we aim to discover the known and unknown causes of traffic congestion in a systematic way. However, to achieve it, there are three challenges:

      1. traffic congestion is affected by several factors with complex spatio-temporal relations;
      1. the amount of congestion data with known causes is small due to the limitation of human label;
      • 有拥堵原因的数据很少
      1. more unknown congestion causes are unexplored since several factors contribute to traffic congestion.
  • To address above challenges, we design a ^^congestion cause discovery system^^ consisting of two modules:

      1. congestion feature extraction, which extracts the important features influencing congestion;
      • 拥堵特征提取
      1. congestion cause discovery, which utilize a deep semi-supervised learning based method to discover the causes of traffic congestion with limited labeled causes.
      • 拥堵成因发现
  • Specifically, it first leverages a few labeled data as prior knowledge to pre-train the model. Then, the deep embedded clustering method is performed to produce the clusters under the supervision of the data reconstruction loss and Kullback-Leibler divergence loss.

  • Extensive experiments show that the performance of our proposed method is superior to the baselines. Additionally, our system is deployed and used in the practical production environment at Amap.

[[Attachments]]