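Description
Cisco UCS, powered by 5th Gen Intel® Xeon® processors and Cisco Nexus, is a scalable foundation for deploying generative AI at scale. The architecture provides:
- Optimal performance: Cisco UCS with Intel Xeon Scalable processors, dedicated AI accelerators, and optimized software frameworks can significantly improve inference performance and scalability.
- Balanced architecture: Cisco UCS performs well in both deep learning and non-deep learning compute, which is critical to the entire inference pipeline. This balanced approach improves overall performance and resource utilization.
- Scale on demand: Cisco UCS scales seamlessly with your generative AI inference needs. As models evolve and workloads grow, use Cisco Intersight to automatically add or remove servers, adjust memory capacity, and configure resources.
You can run inference in the data center or at the edge, in modular or rack form factors.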
Related assets
Title and description
Cisco UCS M7 and Pure Storage FlashArray: FlashStack VSI with VMware vSphere 8.0 — Design Guide
FlashStack solution on VMware vSphere 8, built on Cisco's 7th-generation UCS C-Series and UCS X-Series servers powered by 4th Gen Intel Xeon Scalable processors, with Pure Storage FlashArray.
Cisco UCS M7 IMM FlexPod Datacenter with VMware vSphere 8.0, and NetApp ONTAP 9.12 Powered by Intel — Design Guide
Cisco UCS M7 IMM FlexPod Datacenter with VMware vSphere 8.0, and NetApp ONTAP 9.12 powered by Intel design guide
FlashStack Cisco UCS X-Series and Pure Storage FlashArray//X R3 for VMware Horizon 8 — Design Guide
Design guide for FlashStack Virtual Desktop Infrastructure for VMware Horizon 8, VMware vSphere 8.0 U1, and 4th Gen Intel® Xeon® Scalable processors.
Cisco UCS with 5th Gen and 4th Gen Intel Xeon Processors for Generative AI
Cisco UCS, powered by 5th Gen Intel® Xeon® processors, delivers a compelling solution for maximizing Generative AI performance.
Generative AI Inferencing with Cisco UCS X-Series M7 Blade Servers / 5th Gen Intel Xeon Processors
Cisco UCS® with Intel® Xeon® Scalable processors and Cisco Nexus® offers a compelling and scalable foundation for deploying generative AI at scale.
GenAI Inferencing Powered by Cisco UCS X-Series / 5th Gen Intel Xeon Processors on Red Hat OpenShift AI — Cisco Validated Design
Cisco, Red Hat, and Intel provide a proven AI infrastructure to enable VMware-based Red Hat® OpenShift® AI.
Microsoft SQL Server 2022 on Cisco UCS X210c M6/M7 on 4th Gen Intel® Xeon® Scalable Processors — White Paper
This white paper contains a reference architecture that illustrates the benefits of Microsoft SQL Server 2022 on Cisco UCS X210c M6/M7 on 4th Gen Intel® Xeon® Scalable Processors for bare-metal and hybrid cloud deployments.