{"id":228,"date":"2024-05-28T03:13:07","date_gmt":"2024-05-28T03:13:07","guid":{"rendered":"https:\/\/ieee-ras.conferences.computer.org\/2024\/?page_id=228"},"modified":"2024-05-28T03:14:34","modified_gmt":"2024-05-28T03:14:34","slug":"invited_talk_sanjay_gongalore_abstract","status":"publish","type":"page","link":"https:\/\/ieee-ras.conferences.computer.org\/2024\/invited_talk_sanjay_gongalore_abstract\/","title":{"rendered":"Invited_talk_Sanjay_Gongalore_Abstract"},"content":{"rendered":"\t\t<div data-elementor-type=\"wp-page\" data-elementor-id=\"228\" class=\"elementor elementor-228\" data-elementor-post-type=\"page\">\n\t\t\t\t<div class=\"elementor-element elementor-element-9ef7bf5 e-flex e-con-boxed e-con e-parent\" data-id=\"9ef7bf5\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t\t<div class=\"e-con-inner\">\n\t\t\t\t<div class=\"elementor-element elementor-element-521d760 elementor-widget elementor-widget-text-editor\" data-id=\"521d760\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p><strong>Title:<\/strong> Maximizing Availability for a Zettascale Datacenter<\/p><p><strong>Speaker<\/strong>:\u00a0 Sanjay Gongalore<\/p><p><strong>Abstract:<\/strong><br \/>Rapid adoption and growing complexity of Generative AI models is triggering a furious buildout of AI factories that are projected to reach Zettascale in 2025. Training the LLMs (Large Language Models) for generative AI reliably at scale is one of the toughest challenges in the datacenter today. The presentation will first establish terminology, then present self-healing approaches in data centers to maintain high availability and efficiency despite in-field hardware failures. The talk will cover topics such as modeling availability, fault attribution to allow minimal interruption, and recovery. <br \/><br \/><br \/><\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t","protected":false},"excerpt":{"rendered":"<p>Title: Maximizing Availability for a Zettascale Datacenter Speaker:\u00a0 Sanjay Gongalore Abstract:Rapid adoption and growing complexity of Generative AI models is triggering a furious buildout of AI factories that are projected to reach Zettascale in 2025. Training the LLMs (Large Language Models) for generative AI reliably at scale is one of the toughest challenges in the [&hellip;]<\/p>\n","protected":false},"author":4,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"elementor_canvas","meta":{"footnotes":""},"class_list":["post-228","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/ieee-ras.conferences.computer.org\/2024\/wp-json\/wp\/v2\/pages\/228","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/ieee-ras.conferences.computer.org\/2024\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/ieee-ras.conferences.computer.org\/2024\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/ieee-ras.conferences.computer.org\/2024\/wp-json\/wp\/v2\/users\/4"}],"replies":[{"embeddable":true,"href":"https:\/\/ieee-ras.conferences.computer.org\/2024\/wp-json\/wp\/v2\/comments?post=228"}],"version-history":[{"count":0,"href":"https:\/\/ieee-ras.conferences.computer.org\/2024\/wp-json\/wp\/v2\/pages\/228\/revisions"}],"wp:attachment":[{"href":"https:\/\/ieee-ras.conferences.computer.org\/2024\/wp-json\/wp\/v2\/media?parent=228"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}