基于数据相似性和模型可靠度驱动的自适应模型融合方法
作者:
作者单位:

东北石油大学计算机与信息技术学院,大庆 163318

作者简介:

通讯作者:

基金项目:

国家自然科学基金(51774090,62076234);黑龙江省科技创新基地项目(JD24A009);黑龙江省自然科学基金(LH2024F005);黑龙江省博士后科研启动基金(LBH-Q20080)。


Adaptive Model Fusion Framework Driven by Data Similarity and Model Reliability
Author:
Affiliation:

School of Computer and Information Technology, Northeast Petroleum University, Daqing 163318, China

Fund Project:

National Natural Science Foundation of China (Nos.51774090, 62076234); Heilongjiang Science and Technology Innovation Base Project (No.JD24A009); Heilongjiang Natural Science Foundation (No.LH2024F005); Heilongjiang Postdoctoral Research Start-Up Fund (No.LBH-Q20080).

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
    摘要:

    自适应模型融合对于动态应对数据与任务的演化特性尤为重要。然而,现有模型融合方法仍存在静态权重难以适配数据相似性、动态融合受单因素驱动和易受数据分布漂移影响等问题。针对这些不足,本文提出基于数据相似性与模型可靠度驱动的自适应模型融合方法。该方法通过特征语义对齐,捕捉样本间的数据相似性,得到相似矩阵,进一步得到样本匹配度系数。然后,根据性能-多样性的基模型筛选算法,通过多维度度量评估基模型的泛化能力与局部性能,得到基模型的可靠度系数。最后,根据数据相似性系数和基模型的可靠度系数,进行融合权重计算,得到最终的融合模型策略。在公共数据集上的实验结果证明了本文所提方法的有效性。

    Abstract:

    Adaptive model fusion is particularly important for dynamically responding to the evolutionary characteristics of data and tasks. However, existing model fusion methods still have issues such as static weights being difficult to adapt to data similarity, dynamic fusion being driven by single factors, and being susceptible to data distribution drift. To address these shortcomings, this paper proposes an adaptive model fusion method driven by data similarity and model reliability. The method captures the similarity between samples through feature semantic alignment to obtain a similarity matrix, and further obtains the sample matching degree coefficient. Then, based on the base model selection algorithm of performance-diversity, the generalization ability and local performance of the base models are evaluated through multi-dimensional metrics to obtain the reliability coefficient of the base models. Finally, the fusion weight is calculated based on the data similarity coefficient and the reliability coefficient of the base models to obtain the final fusion model strategy. Experimental results on public datasets demonstrate the effectiveness of the proposed method.Highlights:1.Propose an adaptive model fusion method driven by data similarity and model reliability. By incorporating data distribution characteristics and model reliability into model fusion, the method maximizes the prediction performance of the model and realizes adaptive model fusion.2.Propose a precise mixed data similarity measurement module.It achieves semantic alignment of numerical and categorical heterogeneous features through deep embedding, integrates improved K-Prototypes clustering to output sample-cluster similarity vectors, and underpins sample-level local dynamic adaptation.3.Design a performance-diversity dual-goal optimization-based base model selection mechanism, leveraging multi-dimensional evaluation, diversity quantification, and dynamic decaying weights to automatically prune redundant models, adjust reliability weights, and boost fusion robustness.4.Propose an adaptive model fusion framework that does not rely on scenario-specific prior distribution assumptions. It can flexibly adapt to data drift and heterogeneous model fusion requirements in different fields, providing an adaptive fusion solution for complex tasks.

    参考文献
    相似文献
    引证文献
引用本文

王梅,李艳培,高雅田.基于数据相似性和模型可靠度驱动的自适应模型融合方法[J].数据采集与处理,2026,(3):767-779

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
历史
  • 收稿日期:2025-06-15
  • 最后修改日期:2025-07-03
  • 录用日期:
  • 在线发布日期: 2026-06-10