增强大语言模型驱动的交汇水域多船协同避碰决策

Cooperative collision avoidance decision-making for intelligent ships in intersection waters driven by enhanced large language models

  • 摘要:
    目的 旨在解决交汇水域多船协同避碰的复杂决策问题。
    方法 提出一种增强大语言模型(LLM)驱动的交汇水域多船协同避碰决策方法。通过分析交汇水域船舶通航特性,将交汇水域多船协同避碰问题建模为部分可观测马尔科夫决策过程(POMDP)。设计中心−分布式双层决策架构:中心层通过LLM协调器收集多船态势信息,结合通航规则和冲突严重度确定通行序列;分布式层同样由LLM赋能的智能船舶基于思维链提示工程进行渐进式决策推理,智能船综合场景描述、协调指令、航行经验生成避碰决策。为克服LLM在精确计算、持续学习方面的固有局限,并抑制其潜在的“幻觉”风险,通过集成决策增强模块提升LLM的决策能力。
    结果 仿真实验表明,应用该增强LLM驱动方法的DeepSeek-v3,能够在典型交汇水域的两船、三船及四船会遇冲突场景下,实现安全、高效的协同避碰,全程有效维持3 kn以上的最低舵效航速和超过2倍船长的安全距离。
    结论 所提方法可推动LLM在航海决策领域的工程化应用,为实现复杂环境下高度自主的舰船人工智能提供新的途径。

     

    Abstract:
    Objective To address the complex cooperative collision avoidance problem among multiple intelligent ships in intersection waters, this paper proposes an enhanced large language models(LLMs)-driven decision-making method for multi-ship cooperative collision avoidance in intersection waters.
    Method By analyzing the navigational characteristics of intersection waters, this study formulates the multi-ship cooperative collision-avoidance problem as a Partially Observable Markov Decision Process (POMDP), thereby providing a formal mathematical foundation for subsequent decision making. The cooperative process is decomposed into four causally linked modules—state perception, intent sharing, conflict coordination, and avoidance decision—to structure both information flow and reasoning. Guided by human seafarers’ embodied practices, a novel central-distributed dual-layer architecture is proposed. At the central layer, an LLM-based coordinator aggregates multi-ship situation, applicable navigation rules, and quantifies conflict severity to infer passage-priority sequences. At the distributed layer, individual ship agents are empowered by LLMs together with chain-of-thought prompt engineering to perform progressive, stepwise reasoning. These agents synthesize structured scene descriptions, coordination directives and retrieved navigation experience to generate executable avoidance maneuvers and accompanying semantic explanations. To mitigate known weaknesses of LLMs in precise numerical computation and continual learning, and to suppress potential hallucinations, the architecture integrates two complementary augmentation mechanisms. A lightweight mathematical engine is invoked to update kinematic states and compute deterministic conflict metrics, supplying rigorous quantitative inputs to the reasoning pipeline. A retrieval-augmented generation (RAG) navigation knowledge base combines a static corpus of rules with a dynamic repository of historical scene–decision–evaluation tuples, enabling case-based grounding and continuous learning from past interactions. By embedding formal computation and evidence-backing into the LLM reasoning loop, the proposed framework preserves the interpretive and interactive strengths of large models while ensuring verifiable, rule-compliant, and practically executable collision-avoidance decisions in complex intersection waters.
    Results Simulation experiments demonstrate that the proposed enhanced LLM-driven method implemented in DeepSeek-v3 achieves safe and efficient cooperative collision avoidance in typical intersection scenarios involving two, three, and four ships. The system maintains a minimum maneuvering speed of over 3 knots throughout and ensures a safety margin exceeding twice the ship length.
    Conclusion This method advances the engineering application of LLMs in maritime decision-making and provides a new pathway for realizing highly autonomous shipboard artificial intelligence in complex operational environments.

     

/

返回文章
返回