Are LLMs About To Hit A Wall? | Commentary

Alex Kantrowitz

26 April 2024 at 5:30 pm · 4-min read

Each new generation of large language model (LLM) consumes a staggering amount of resources.

Meta, for instance, trained its new Llama 3 models with about 10 times more data and 100 times more compute than Llama 2. Amid a chip shortage, it used two 24,000-GPU clusters, with each chip running around the price of a luxury car. It employed so much data in its AI work that it considered buying the publishing house Simon & Schuster to find more.

Afterward, even its executives wondered aloud if the pace was sustainable.

“It is unclear whether we need to continue scaling or whether we need more innovation on post-training,” Ahmad Al-Dahle, Meta’s VP of GenAI, told me in an interview last week. “Is the infrastructure investment unsustainable over the long run? I don’t think we know.”

For Meta and its counterparts running large language models, the question of whether throwing more data, compute, and energy at the problem will keep producing better models looms large. Since LLMs entered the popular imagination, the best path to exponential improvement seemed to be combining these ingredients and allowing the magic to happen. But with the upper bound of all three potentially in sight, the industry will need newer techniques, more efficient training, and custom-built hardware to progress. Without advances in these areas, LLMs may indeed hit a wall.

The path of continued scale probably starts with better methods to train and run LLMs, some of which are already in motion. “We are starting to see new kinds of architectures that are going to change how these models scale in the future,” Swami Sivasubramanian, VP of AI and Data at Amazon Web Services, told me in an interview Thursday night. Sivasubramanian said researchers at Stanford and elsewhere are getting models to learn faster with the same amount of data, and to run inference 10 times more cheaply. “I’m actually very optimistic about the future when it comes to novel model architectures, which has the potential to disrupt the space,” he said.

Already, new methods of training these models seem to be paying off. “The smallest Llama 3 is basically as powerful as the biggest Llama 2,” Mark Zuckerberg said on the Dwarkesh Patel podcast last week.

To fuel these models, and to get around the potential bottleneck of exhausting real-world data, synthetic data created by AI is playing a key role. Though not fully proven yet, this data has already made its way into model training. “Our coding abilities on Llama 3 is exceptionally high,” Meta’s Al-Dahle said. “Part of that was really being innovative and pushing on our ability to leverage models to generate synthetic data.”

Along with finding better models, LLM progress likely depends on building better chips that can train and run these models faster and more efficiently than traditional chips. While NVIDIA GPUs are exceptionally useful for large language models, they aren’t purpose-built for them. Now some chips built specifically for generative AI are showing promise. Researchers like Andrew Ng have praised Groq, one buzzy name, as the type of chip that works fast enough to take generative AI to the next level, especially as the field pushes toward agents.

Meanwhile, companies including Amazon, Intel, and Google are building “accelerators,” or custom chips that can run AI processes fast. At Amazon, Sivasubramanian said, the company’s purpose-built Trainium chips are “designed with the sole purpose of being able to train these large language models” and are already four times faster than the first generation.

Given the need and the opportunity ahead, it’s no wonder OpenAI CEO Sam Altman is reportedly raising a lot of money to build chips powerful enough to achieve his aims.

The one LLM constraint that’s been little discussed is energy, and it may be the most important. “There’s a capital question of, at what point does it stop being worth it to put the capital in? But I actually think before we hit that, you’re going to run into energy constraints,” Zuckerberg told Patel. He floated the idea of building a 1-gigawatt datacenter to advance AI, roughly the scale of a meaningful nuclear power plant. But given regulatory approvals and the build-out’s complexity, it could take years to produce. “I think it will happen,” he said. “This is only a matter of time.”

Until we get to such massive energy allocation, it may be difficult to say how much room LLMs have left to improve. But it seems like sooner or later, we will find out. “I am not thinking about it myself,” Sivasubramanian said with a laugh, of a nuclear-level plant to run AI models, “but I can’t speak to my infra team.”

The post Are LLMs About To Hit A Wall? | Commentary appeared first on TheWrap.
