Cerebras CS-2 Wafer Scale Engine: The Largest and Most powerful GPU ever Made

The creation of the CS-2 Wafer Scale Engine, the largest accelerator chip in the world, on a single device signifies a watershed moment for Cerebras as it represents the largest learning effort of the most complete global Natural Language Processing (NLP) AI model.

The impressive and unique twenty billion parameters of the Cerebras artificial intelligence model are unheard of. Cerebras completed this task without the need to scale the burden over numerous accelerators. Because Cerebras demands less infrastructure and software complexity than previous models did, its success is essential for machine learning.

Cerebras
credit: Wccftech.com

The Wafer Scale Engine-2, which is contained in a single 7 nm wafer and features 2.6 trillion 7 nm transistors, is comparable to hundreds of the most advanced CPUs now available. In addition to the wafer and transistors, the Wafer Scale Engine-2 includes 850,000 cores, 40 GB of integrated cache, and a 15 kW power consumption. A single CS-2 machine is comparable to a supercomputer all by itself, claims Tom’s Hardware.

The benefit for Cerebras is that by deploying a 20 billion-parameter NLP model in a single chip, it can lower its overhead in the cost of training thousands of GPUs, hardware, and scaling requirements.

In turn, the company might avoid any technical difficulties brought by dispersing various models over the chip.

Cerebras
credit: Wccftech.com

In NLP, bigger models are shown to be more accurate. But traditionally, only a select few companies had the resources and expertise necessary to do the painstaking work of breaking up these large models and spreading them across hundreds or thousands of graphics processing units. As a result, few companies could train large NLP models – it was too expensive, time-consuming, and inaccessible for the rest of the industry. Today we are proud to democratize access to GPT-3XL 1.3B, GPT-J 6B, GPT-3 13B, and GPT-NeoX 20B, enabling the entire AI ecosystem to set up large models in minutes and train them on a single CS-2.

— Andrew Feldman, CEO and Co-Founder, Cerebras System

Systems that function very effectively with fewer parameters have been observed recently. Chinchilla is one such system, consistently surpassing the 70 billion parameters of GPT-3 and Gopher. Researchers will find that they can calculate and progressively construct complex models on the new Wafer Scale Engine-2 where others cannot. This makes Cerebras’ accomplishment all the more significant.

Cerebras’ ability to bring large language models to the masses with cost-efficient, easy access opens up an exciting new era in AI. It gives organizations that can’t spend tens of millions an easy and inexpensive on-ramp to major league NLP. It will be interesting to see the new applications and discoveries CS-2 customers make as they train GPT-3 and GPT-J class models on massive datasets.

— Dan Olds, Chief Research Officer, Intersect360 Research

Also Read:

Intel Arc A380 GPU delivers Disappointing performance against NVidia GeForce RTX 1650 and AMD’s Radeon RX 6400 GPUs
source

LEAVE A REPLY

Please enter your comment!
Please enter your name here

This site uses Akismet to reduce spam. Learn how your comment data is processed.

More like this

AMD & Intel Gain GPU Market Share in Korea as NVIDIA Struggles with Availability

AMD & Intel Gain GPU Market Share in Korea...

The GPU landscape is shifting in 2025, and for once, it's not NVIDIA dominating the charts. AMD...

AMD RX 9070 Series Delay: Why This Could Be...

Hey there, tech enthusiasts! Today, we're diving into some exciting news about upcoming AMD RX 9070 series...

AMD Radeon RX 6750 GRE launched in two variants

AMD has finally released its Radeon RX 6750 GRE graphics card, which comes in two flavors: 12...

GeForce RTX 4070 prices decreased to almost $549

In terms of performance, AMD's Radeon RX 7800 XT isn't significantly better than its predecessor, but it's...

Radeon GPU Detective is here to help during GPU...

AMD has released a useful tool called Radeon GPU Detective (RGD) to assist developers in debugging Radeon...

LATEST NEWS

iQOO pioneers the Biggest Collaboration in Indian Mobile Gaming

In a move set to redefine the mobile gaming landscape, iQOO, the cutting-edge smartphone brand under the vivo umbrella, has announced a series of...

ICC ODI World Cup: Top 5 Highest Score in World Cup history – all the details you need to know about!

The ICC Men's Cricket World Cup has been a stage for cricketing excellence ever since it first kicked off in 1975. Over the...

Why Was Barcelona vs Osasuna La Liga Match Postponed?

Barcelona's LaLiga fixture against Osasuna, scheduled for Saturday, was unexpectedly called off just 20 minutes before kickoff following the untimely passing of first-team doctor...

Akka OTT Release Date 2025: Keerthy & Radhika Lead a Fierce Gangster Saga

Akka is set to revolutionize the crime drama landscape, shattering long-standing genre conventions with its bold, unapologetic narrative. More than just another crime series,...

Featured