基于深度学习模型的电磁超表面设计关键技术

Key Technologies in Electromagnetic Metasurface Design with Deep Learning Models

  • 摘要: 【目的】针对基于深度学习的超表面优化高度依赖大规模全波仿真数据、导致优化效率受限的问题,对超表面设计场景中的数据集构建方法、深度学习模型结构构建方法、深度学习模型训练方法以及超表面优化方法进行了研究。【方法】首先,建立了以样本重要性为依据的训练样本集采样方法,在减少训练所需样本数量的同时降低模型估算超表面电磁响应的误差;其次,构建了一种多模态深度学习模型用于同时提取超表面向量化结构参数特征以及图案特征,提高了模型估算响应的性能;再次,提出了一种利用深度学习域外泛化特性的模型训练方法,充分利用模型自身泛化性能生成低保真度样本从而减少高保真度训练样本集的规模;最后,不同于使用高代价高准确性的精确模型,提出了一种使用有限样本规模训练得到的粗略深度学习模型进行超表面设计的优化方法。【结果】数值结果表明,在维持深度学习模型性能的前提下,所提出的数据集构建方法、多模态模型构建方法和训练方法能够将训练所需的全波仿真样本减少30%~50%,大幅减少了优化超表面所需的全波仿真。此外,所提出的基于粗略深度学习模型的优化方法能够以数十次迭代和全波仿真设计得到性能优秀的超表面。【结论】本文建立了一套贯穿“数据-模型-训练-应用”闭环的低数据依赖型系统框架。通过四个层面的针对性重构,系统性地缓解了制约超表面设计效率的数据规模难题,从而为深度学习在电磁工程领域的低成本应用提供了通用的方法论支撑与范式。

     

    Abstract: Objectives To address the problem that deep-learning-based optimization of metasurfaces relies heavily on large-scale full-wave simulation data, thereby limiting optimization efficiency, the dataset construction methods, deep learning model structure construction methods, deep learning model training methods, and optimization methods for metasurcae structure design scenarios are investigated. Methods First, a training-set sampling method based on sample importance is developed, by which the number of samples required for training is reduced while the model performance in estimating the electromagnetic responses of metasurface structures is improved. Second, a multimodal deep learning model is constructed to simultaneously extract features form vectorized structural parameters and pixelated patterns of metasurface structures, by which the response-estimation performance is improved. Third, a training method that leverages the out-of-distribution generalization property of the deep learning model is proposed, by which low-fidelity samples are generated using the model’s own generalization capability and the scale of the high-fidelity training sample set is thereby reduced. Finally, instead of using an accurate model with high computational cost, an optimization method for metasurface structure is proposed in which a coarse deep learning model trained with a limited sample size is used. Results Numerical results demonstrate that, while the performance of the deep learning model is maintained, the proposed dataset construction method, multimodal model construction method, and training method can reduce the number of full-wave simulation samples required for training by 30%–50%, thereby substantially reducing the full-wave simulations required for optimizing metasurface structures. In addition, it is demonstrated that, by the proposed optimization method based on the coarse deep learning model, metasurface structures with excellent performance can be designed through dozens of iterations and full-wave simulation validations. Conclusions A low-data-dependency system framework is detailed that encompasses the entire closed-loop process of “data-model-training-application”. Through targeted restructuring at four different levels, it systematically addresses the challenge of data scale limitations that hinder the efficiency of superurface design. As a result, it provides a general methodological support and paradigm for the low-cost application of deep learning in the field of electromagnetic engineering.

     

/

返回文章
返回