Home > News > AI

How powerful is the "world's strongest AI chip"?

Tue, Apr 09 2024 08:09 AM EST

?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0408%2F50464885j00sbl4up000ed0008c008cg.jpg&thumbnail=660x2147483647&quality=80&type=jpg Recently, Jensen Huang, the CEO of the American chip company NVIDIA, unveiled the AI chip B200 at the 2024 Developer Conference. It boasts a computing speed 30 times faster than its predecessor, earning Huang the moniker of "the most powerful AI chip in the world." ?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0408%2F1b305db1j00sbl4up000gd0008c008cg.jpg&thumbnail=660x2147483647&quality=80&type=jpg B200 chips have significantly reduced costs and energy consumption compared to the first-generation chips. Previously, training the ChatGPT chatbot for three months required 8000 chips and 15 megawatts of power. Now, it only requires 2000 chips and consumes 4 megawatts of power. ?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0408%2Fe2e526caj00sbl4up000jd0008c008cg.jpg&thumbnail=660x2147483647&quality=80&type=jpg Today, the minimum size of transistors, the basic units of chips, has reached below 4 nanometers (1 nanometer = 0.000001 millimeters), almost approaching the diameter of an atom, reaching a limit. ?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0408%2F414a9c5ej00sbl4up000jd0008c008cg.jpg&thumbnail=660x2147483647&quality=80&type=jpg Developing artificial intelligence requires more powerful chips. We can combine more chips together to form larger virtual chips, constructing massive AI supercomputing clusters. This is the architecture of Nvidia's next-generation chips. ?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0408%2F1c3d2772j00sbl4up000cd0008c008cg.jpg&thumbnail=660x2147483647&quality=80&type=jpg The first step in the new architecture is to combine two chips into one, forming the B200 chip, which totals 20.8 billion transistors, setting a new record in transistor integration. Its memory capacity has also doubled. ?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0408%2F12cf8b84j00sbl4up000hd0008c008cg.jpg&thumbnail=660x2147483647&quality=80&type=jpg The combination of two B200 chips with a standalone Grace chip forms the GB200 superchip, interconnected via ultra-low-power interconnect technology. ?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0408%2Fa372b15cj00sbl4up000gd0008c008cg.jpg&thumbnail=660x2147483647&quality=80&type=jpg Two GB200 super chips assembled onto a single motherboard form the architecture of an artificial intelligence computing node. When 18 such computing nodes are connected, they constitute a larger virtual chip. ?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0408%2F9c4be151j00sbl4up000gd0008c008cg.jpg&thumbnail=660x2147483647&quality=80&type=jpg The performance of such computing units is on par with the entire supercomputer cluster from the era of the previous-generation NVIDIA H100 chip. This is what many media outlets refer to as "one cabinet tops one cluster," illustrating the significance. ?url=http%3A%2F%2Fdingyue.ws.126.net%2F2024%2F0408%2F511fc3e3j00sbl4up000md0008c008cg.jpg&thumbnail=660x2147483647&quality=80&type=jpg While reports suggest that Nvidia's B200 chips may come with a hefty price tag, tech companies worldwide are lining up to place their orders, with no guarantee of immediate availability for every company.