面向可信的群体智能与AI智能体：威胁、对策与展望

严宇萍; 谢雨晗; 俞恒杰; 金耀初

doi:10.11991/cccf.202603009

面向可信的群体智能与AI智能体：威胁、对策与展望

Trustworthy Swarm Intelligence and AI Agents: Challenges and Opportunities

摘要

摘要: 群体智能系统与人工智能（AI）智能体系统正快速走向现实场景，在应急救援、交通管控、仓储物流、工业制造与安全运维等领域展现出巨大潜力。然而，随之而来的安全与隐私风险亦在加剧：从物理层面的干扰与破坏，到通信层面的篡改与劫持，再到应用层中对模型、数据与决策过程的攻击与窃密。本文在统一的物理层—通信层—应用层3层框架下，系统梳理群体智能系统与 AI 智能体系统的安全与隐私威胁，归纳共性及差异，并对照给出针对性的防护策略，包括访问控制、邻域过滤、区块链机制、基于强化学习的入侵检测、差分隐私、同态加密和联邦学习等。进一步，提炼可迁移的策略模式，讨论这两类系统在安全治理、实时性与效能权衡、大模型时代的新型威胁，例如越狱、提示注入、工具滥用、幻觉攻击下的挑战与研究方向。本文旨在为构建安全、稳健、可信的群体智能系统与 AI 智能体系统提供一个面向工程落地的系统化参考。

Abstract: Swarm intelligence systems and AI Agent systems are rapidly moving into real-world deployments, showing great promise in domains such as emergency response, traffic management, warehousing and logistics, industrial manufacturing, and operational security. However, security and privacy risks are escalating across layers: physical interference, communication tampering, and application-level attacks targeting models, data, and decision processes. Under a unified three-layer (physical-communication-application) framework, this article systematically catalogs the security and privacy threats facing both classes of systems, summarizes their commonalities and differences, and surveys targeted countermeasures, including access control, neighborhood filtering, blockchain-based mechanisms, reinforcement-learning driven intrusion detection, differential privacy, homomorphic encryption, and federated learning. It further distills transferable defensive patterns and discusses cross-cutting challenges, including security governance, trade-offs among real-time requirements and system performance, and emerging risks in the large-model era (e.g., jailbreaks, prompt injection, tool misuse, and hallucination attacks). The goal of this work is to provide an engineering-oriented, systematic reference for building secure, robust, and trustworthy swarm intelligence and AI agent systems.

参考文献(47)

施引文献

资源附件(0)