基于膨胀卷积与注意力机制的多维时序预测方法
DOI:
作者:
作者单位:

南京邮电大学

作者简介:

通讯作者:

基金项目:

江苏省前沿技术研发计划(BF2025617)


Multivariate Time Series Forecasting Method Based on Dilated Convolution and Attention Mechanism
Author:
Affiliation:

Nanjing University of Posts and Telecommunications

Fund Project:

Frontier Technologies R&D Program of Jiangsu(BF2025617)

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
    摘要:

    针对多元时间序列预测中通道间复杂非线性依赖与时序偏移现象导致模型预测精度不足的问题,提出了一种基于膨胀卷积与通道注意力机制的多维时序预测模型(Dilated Convolutional Time Series Transformer, DCTST)。首先,通过倒置嵌入模块将每个变量的整个历史序列独立编码为高维令牌,保留全局时序特征;其次,设计通道注意力模块,结合自注意力机制与可学习路由机制,并引入膨胀卷积模块在变量维度上扩展感受野,捕获多尺度局部依赖;最后,通过跨层残差连接整合原始输入与深层特征,增强模型稳定性。理论分析和实验表明,与主流通道独立性模型(如PatchTST)和通道相关性模型(如iTransformer)相比,DCTST在多个真实数据集和合成数据集上的预测均方误差(MSE)和平均绝对误差(MAE)均有显著降低。实验结果表明,DCTST能够有效建模通道间依赖,提高多元时间序列的预测精度与鲁棒性,适用于智能电网、交通流量等实际工程场景。

    Abstract:

    Aiming at the problem of insufficient prediction accuracy of models caused by complex nonlinear dependencies and temporal shifts between channels in multivariate time series forecasting, this paper proposes a multivariate time series forecasting model based on dilated convolution and channel attention mechanism (Dilated Convolutional Time Series Transformer, DCTST). Firstly, an inverted embedding module independently encodes the entire historical sequence of each variable into high-dimensional tokens, preserving global temporal features. Secondly, a channel attention module is designed, combining a self-attention mechanism with a learnable routing strategy, and a dilated convolution module is introduced to expand the receptive field along the variable dimension to capture multi-scale local dependencies. Finally, cross-layer residual connections integrate the original input with deep-level features, enhancing model stability. Theoretical analysis and experiments show that compared to mainstream channel-independent models (e.g., PatchTST) and channel-dependent models (e.g., iTransformer), DCTST achieves a significant reduction in both Mean Squared Error (MSE) and Mean Absolute Error (MAE) on multiple real-world and synthetic datasets. The experimental results indicate that DCTST can effectively model inter-channel dependencies, improving the prediction accuracy and robustness of multivariate time series, making it suitable for practical engineering scenarios such as smart grids and traffic flow forecasting.

    参考文献
    相似文献
    引证文献
引用本文
分享
文章指标
  • 点击次数:
  • 下载次数:
历史
  • 收稿日期:2025-11-26
  • 最后修改日期:2026-04-14
  • 录用日期:2026-05-23
  • 在线发布日期: