基于近端策略优化算法和Mask-TIT网络的多功能雷达干扰决策方法

基于近端策略优化算法和Mask-TIT网络的多功能雷达干扰决策方法
DOI:
                        
作者:
                        
作者单位:杭州电子科技大学通信工程学院
作者简介:
通讯作者:
基金项目:

Multi-functional radar jamming decision method based on proximal policy optimization algorithm and Mask-TIT network

Author:

Affiliation:

1.College of Communication Engineering, Hangzhou Dianzi University;2.Hangzhou Dianzi University

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

摘要:

为应对愈加智能的多功能雷达给对抗方带来的挑战，本文提出了一种基于近端策略优化（PPO）算法和Mask-TIT网络的干扰决策方法。首先，从一种现实场景出发，将干扰机与雷达的对抗场景建模为部分可观察马尔可夫决策过程（POMDP），根据雷达工作原理设计了新的状态转移函数和奖励函数，并根据多功能雷达层级模型设计了观测空间。其次，利用Transformer对序列数据的表征能力和雷达干扰样式的特点设计了一种Mask-TIT网络结构，用于构建更强大的Actor-Critic网络架构。最后，使用近端策略优化算法进行优化学习。实验表明，该算法较现有方法，收敛所需交互数据平均减少25.6%，并且收敛后的方差显著降低。

Abstract:

To cope with the challenges brought by increasingly intelligent multifunctional radars to the opposing side, this article proposes an jamming decision-making method based on the Proximal Policy Optimization (PPO) algorithm and the Mask-TIT network. Firstly, starting from a realistic scenario, the adversarial scene between the jammer and the radar is modeled as a Partially Observable Markov Decision Process (POMDP), a new state transition function and reward function are designed based on the working principles of the radar, and the observation space is designed according to the hierarchy of the multifunctional radar model. Secondly, a Mask-TIT network structure is designed using the Transformer's representation capacity for sequence data and the characteristics of radar jamming patterns, which is used to build a more powerful Actor-Critic network architecture. Finally, the Proximal Policy Optimization algorithm is used for optimization learning. The experiment shows that compared with existing methods, the algorithm reduces the average amount of interactive data required for convergence by 25.6%, and the variance after convergence is significantly reduced.

参考文献

相似文献

引证文献

引用本文

复制

文章指标

点击次数:
下载次数:

历史

收稿日期:2023-05-23
最后修改日期:2023-12-25
录用日期:2024-03-29
在线发布日期:

引用本文

分享

文章指标

历史