Image courtesy: Reuters


November 18, 2024 Tags:

Nvidia's Blackwell AI chips, which have been eagerly awaited by customers, are now facing unexpected setbacks. These chips, designed to be much faster and more powerful than previous models, have encountered overheating issues when installed in servers, causing concerns among customers who were relying on them to support their data centers.
The problem arises when multiple Blackwell chips are connected together in server racks designed to hold up to 72 chips. When the chips are stacked together in these racks, they overheat, creating a significant problem for companies that need these advanced processors to operate at full capacity. Sources familiar with the issue have revealed that Nvidia has been working with its suppliers to redesign the racks several times in an attempt to resolve the overheating problem.

Nvidia has yet to publicly name the suppliers involved in these efforts, but company insiders, as well as suppliers and customers familiar with the situation, have confirmed that the issue has been ongoing. Despite these setbacks, Nvidia remains optimistic, with a company spokesperson stating that the situation is part of the normal engineering process and that the company is working closely with top cloud service providers to solve the problem.

"We are working with leading cloud service providers as an integral part of our engineering team and process. The engineering iterations are normal and expected," a representative from Nvidia explained. This statement suggests that the company is committed to resolving the issue, but it also highlights the complexity of developing and fine-tuning such advanced technology.

The Blackwell chips were originally set to ship in the second quarter of the year. However, due to these delays, the release timeline has been pushed back, which may affect major customers like Meta Platforms, Alphabet (Google), and Microsoft. These companies were planning to use the chips in their data centers, where they would accelerate tasks such as providing responses from chatbots and handling large amounts of data at incredibly high speeds.

Nvidia’s Blackwell chip is a significant advancement over its predecessor. It combines two squares of silicon, each the size of previous chips, into a single component. This new design makes the Blackwell chip 30 times faster than earlier models, especially in tasks that require rapid data processing. For businesses like Meta, Google, and Microsoft, having these chips in place is critical to maintaining their data center operations and meeting the growing demand for AI-driven services.

Despite the overheating issue, Nvidia's team is working hard to get things back on track. The company’s approach, which includes redesigning the racks and collaborating closely with suppliers and customers, shows a proactive stance toward overcoming the challenges. However, the delay in production and the technical hurdles have left some customers worried about meeting their deadlines for setting up new data centers.

This delay underscores the challenges that even the most advanced tech companies face when pushing the boundaries of innovation. As Nvidia continues to address the overheating problem, customers are eagerly awaiting a resolution that will allow them to harness the power of Blackwell chips in their data centers.

How useful was this post?

Click on a star to rate it!

Average rating 0 / 5. Vote count: 0

No votes so far! Be the first to rate this post.

You may also like

Meta Turns to Nuclear Power to Keep Up with AI Demand

Meta, the parent company of Facebook, has signed a long-term agreement to power its growing artificial intelligence (AI) operations using....

Young AI Coding Startups Surge with Huge Investor Backing

In just a couple of years since ChatGPT made headlines, a new wave of AI-driven coding startups is grabbing the....

Neuralink Secures $650M in Funding as Brain Chip Enters Trials

Elon Musk’s brain-tech company Neuralink has raised a massive $650 million in its latest funding round, marking a major step....

Google to Spend $500M to Fix Compliance After Lawsuit

In a major move to reshape its internal practices, Google has agreed to invest $500 million over the next decade....

Google Pushes Back Against Chrome Breakup Proposal

In a closely watched legal showdown, Google has pushed back against efforts to break up its popular Chrome browser. The....

US Lawyer Warns Canada About AI and Political Threats

An American lawyer known for challenging former U.S. President Donald Trump is urging Canadians to stay alert when it comes....

Google Faces Legal Clash with Bureau Over Ad Market Power

Google is at the center of a legal standoff with Canada’s Competition Bureau. The tech giant is fighting back against....

Claude AI Left Secret Notes That Alarmed Its Own Creators

A new artificial intelligence model, Claude Opus 4, has drawn major attention not just for its power but for its....

Dalhousie University Uses 3D Printing to Fix Navy Ships Fast

Dalhousie University in Halifax is teaming up with Canada’s Department of National Defence to help keep the country’s naval fleet....

Strauss’ ‘Blue Danube’ Waltz Set to Launch Into Space for 200th Birthday

This month, Johann Strauss II’s famous waltz, “Blue Danube,” will embark on a unique journey—into outer space—to celebrate the 200th....

Census Bureau Cuts Raise Worries About Data Future

A group launched by Elon Musk, called the Department of Government Efficiency (DOGE), is now taking aim at the U.S.....

Google’s Veo 3: A Game-Changing AI Video Tool Stuns and Scares Viewers

Google’s latest AI creation, Veo 3, is taking the internet by storm—and not just for the right reasons. The tool’s....