Title“Navigating the AI Era: The Essential Role of RAS”

AbstractIn the next decade, AI will lead a major technological revolution. AI Infrastructure will play a crucial role in the development of General AI, which includes Large Language Models (LLMs) and Inference Workloads. This keynote will offer a broad perspective, emphasizing the importance of quickly bringing products to market, cost-effectiveness, and building reliable and secure systems. To ensure the reliability of AI Supercomputers, various innovations are needed across different layers such as Intellectual Property (IP), Silicon, Server, Rack, and Fleet. The session will highlight the significance of Reliability, Availability, and Serviceability (RAS) technologies in reducing job interruptions, preventing data corruption, and cutting fleet servicing costs. These advancements are essential for maintaining the AI Infrastructure and integrity of AI systems.

Keynote

Corporate Vice President, General Manager, Data Center and AI Product Management, Intel Corporation

Dr. Zane A. Ball is a Corporate Vice President and General Manager of the Data Center and AI (DCAI) Product Management Group. DCAI Product Management is responsible for end-to-end stewardship of DCAI’s systems, SW, CPU, GPU, and custom product line through the entirety of the product lifecycle.  Prior to his product management role, Ball was CVP and GM of platform engineering and architecture for Intel’s data center business.  Ball has also served as Co-GM of Intel’s foundry effort as a VP in the Technology and Manufacturing group and VP of the Client Computing Group including roles as GM of the desktop client business and as GM of global customer engineering.

Ball has a bachelor’s degree, master’s degree, and Ph.D. in electrical engineering, all earned from Rice University.  He also holds six patents in high-speed electrical design.