Unlocking the Protocol Secrets Behind Network Traffic: A Journey into Network Traffic Generation with Large Language Models
Abstract: This article introduces a token representation method for long traffic flows. The method uses generative pre-training, linear attention, and related mechanisms to significantly improve the ability of large models to process long traffic flows. It enables, for the first time, complete flow-level traffic generation by a large model and supports multiple application scenarios. Drawing on the large-model security forum series held by YOCSEF Xi'an, the article also reviews the mechanisms, current state, and trends of large traffic models, and discusses open challenges and future directions.