Data-Centric AI

Data-Centric Artificial Intelligence

  • Abstract: This article describes the ongoing shift in artificial intelligence from model-centric AI (MCAI) to data-centric AI (DCAI) and proposes a data infrastructure for DCAI. The infrastructure has two parts: an AI database that supports unified management of multimodal data, and DataFlow, a tool for data preparation and dynamic training. It addresses limitations of conventional data lakes and data processing tools and enables data and models to work together efficiently. Applications in large-model pretraining and enterprise knowledge base construction demonstrate the value of DCAI infrastructure in improving model performance and lowering development barriers, offering a systematic solution for AI's evolution toward a new paradigm of intelligent computing.

     
