面向舰载航空枢纽作业解析的领域大模型构建方法

欧阳朋朋; 赵仕星; 杨彤; 范棕胜; 胡钲卿; 杨骏一; 吴晗; 陈董; 金钊; 刘起东; 徐明亮

doi:10.19693/j.issn.1673-3185.04881

面向舰载航空枢纽作业解析的领域大模型构建方法

A method for constructing a domain-specific large language model for aircraft carrier air-hub operation parsing

摘要

摘要:
目的舰载航空枢纽作业解析具有专业性强、时空约束复杂等特点，易出现解析结果有误、关键信息遗漏、结构不一致等问题。面向资源高度受限的航空母舰作战场景，需要研究如何构建可部署的领域大模型，以提升其在舰载航空枢纽作业解析任务中的准确性、可靠性与一致性。
方法提出一种面向舰载航空枢纽作业解析的领域大模型（carrier air-hub operation parsing large model，CAOP）构建方法。训练阶段，采用高维蒸馏，将云端教师模型在实体识别、关系识别、统计信息、预测信息、文本−数值联合表意解析、异常事件线索六类关键信息上的解析能力迁移至本地模型；推理阶段，引入舰载航空作业知识库与案例检索，采用协同校对对候选输出施加证据约束与结构一致性约束，并修正输出结果。
结果在本文的实验设置条件下，CAOP使本地大模型Qwen3-32B在舰载航空枢纽作业解析任务上的平均得分由13.3提升至18.4（提升5.1分，约38.3%），超越众多云端大模型，并媲美人类专家表现。
结论实验结果表明，CAOP能够在本地可部署条件下显著提升舰载航空枢纽作业解析的准确性、可靠性与结果一致性，为后续调度计划的自动生成、冲突检测与优化求解提供可靠数据基础与可信结构化输入。

Abstract:
Objective Aircraft carrier air-hub operation parsing is a typical domain-specific structured understanding task in carrier-based aviation scenarios. Its source data are primarily derived from unstructured mission logs, command messages, and support reports, and are characterized by strong temporal dependencies, complex spatiotemporal constraints, dense domain-specific terminology, and tight coupling among aircraft status, resource allocation, and operational phases. These characteristics make automatic parsing prone to errors, including omission of critical information, and structural inconsistency. Although cloud-based large language models (LLMs) offer a feasible solution due to their extensive prior knowledge and strong generative capabilities, their high computational cost and deployment requirements limit their applicability in highly resource-constrained carrier-side environments. In contrast, locally deployable LLMs are more suitable for sensitive operational scenarios, yet their domain-specific parsing capability remains insufficient. This study aims to construct a deployable domain-specific large language model for aircraft carrier air-hub operation parsing, with the goal of improving parsing accuracy, reliability, and consistency under local deployment constraints.
Method We propose a domain-specific large language model construction method for carrier air-hub operation parsing, termed the carrier air-hub operation parsing large model (CAOP). The proposed framework consists of two tightly coupled stages, namely high-dimensional knowledge distillation during training and collaborative verification during inference. In the training stage, a cloud-based teacher model is employed to provide structured supervision across six key information dimensions, including entity recognition, relation extraction, statistical information extraction, predictive information extraction, joint semantic parsing of textual and numerical data, and abnormal event cue extraction. The outputs across these dimensions are further aligned and fused to construct transferable supervision signals, enabling the distillation of domain-specific parsing capabilities from the teacher model into a locally deployable student model via parameter-efficient optimization. In the inference stage, a carrier aviation operation knowledge base and a case-based retrieval mechanism are introduced to provide external evidence and similar historical instances. Based on these resources, collaborative verification is performed on candidate outputs by enforcing both evidence consistency and structural constraints, thereby reducing hallucinations, minimizing redundancy, and correcting inconsistent information fields.
Results Under the experimental settings adopted in this study, the proposed method improves the average score of the locally deployed Qwen3-32B model on carrier air-hub operation parsing from 13.3 to 18.4, yielding an absolute gain of 5.1 points and a relative improvement of approximately 38.3%. The resulting performance not only surpasses that of the compared cloud-based models but also approaches expert-level evaluation. The improvements are primarily reflected in more complete information field extraction, stronger evidence alignment, and enhanced structural consistency of the final parsing results.
Conclusion The results demonstrate that the proposed CAOP framework significantly improves the accuracy, reliability, and consistency of carrier air-hub operation parsing under local deployment constraints. It provides a robust and trustworthy structured data foundation for downstream tasks, including automatic scheduling generation, conflict detection, and optimization solving.

HTML全文

参考文献(34)

施引文献

资源附件(0)