AI

Zuckerberg says Meta will need 10x more computing power to train Llama 4 than Llama 3

Comment

people walking past Meta signage
Image Credits: TOBIAS SCHWARZ/AFP / Getty Images

Meta, which develops one of the biggest foundational open-source large language models, Llama, believes it will need significantly more computing power to train models in the future.

Mark Zuckerberg said on Meta’s second-quarter earnings call on Tuesday that to train Llama 4 the company will need 10x more compute than what was needed to train Llama 3. But he still wants Meta to build capacity to train models rather than fall behind its competitors.

“The amount of computing needed to train Llama 4 will likely be almost 10 times more than what we used to train Llama 3, and future models will continue to grow beyond that,” Zuckerberg said.

“It’s hard to predict how this will trend multiple generations out into the future. But at this point, I’d rather risk building capacity before it is needed rather than too late, given the long lead times for spinning up new inference projects.”

Meta released Llama 3 with 80 billion parameters in April. The company last week released an upgraded version of the model, called Llama 3.1 405B, which had 405 billion parameters, making it Meta’s biggest open-source model.

Meta’s CFO, Susan Li, also said the company is thinking about different data center projects and building capacity to train future AI models. She said Meta expects this investment to increase capital expenditures in 2025.

Training large language models can be a costly business. Meta’s capital expenditures rose nearly 33% to $8.5 billion in Q2 2024, from $6.4 billion a year earlier, driven by investments in servers, data centers and network infrastructure.

According to a report from The Information, OpenAI spends $3 billion on training models and an additional $4 billion on renting servers at a discount rate from Microsoft.

“As we scale generative AI training capacity to advance our foundation models, we’ll continue to build our infrastructure in a way that provides us with flexibility in how we use it over time. This will allow us to direct training capacity to gen AI inference or to our core ranking and recommendation work, when we expect that doing so would be more valuable,” Li said during the call.

During the call, Meta also talked about its consumer-facing Meta AI’s usage and said India is the largest market of its chatbot. But Li noted that the company doesn’t expect Gen AI products to contribute to revenue in a significant way.

More TechCrunch

Meta, which develops one of the biggest foundational open-source large language models, Llama, believes it will need significantly more computing power to train models in the future. Mark Zuckerberg said…

Zuckerberg says Meta will need 10x more computing power to train Llama 4 than Llama 3
Image Credits: TOBIAS SCHWARZ/AFP / Getty Images

Axle Energy is a B2B, back-end infrastructure business focused on connecting flexible assets, such as electric vehicles and home batteries, to energy markets that aren’t otherwise available for consumers to…

Axle Energy’s sprint to decarbonize the grid lights up with $9M seed led by Accel

OpenAI CEO Sam Altman says that OpenAI is working with the U.S. AI Safety Institute, a federal government body that aims to assess and address risks in AI platforms, on…

OpenAI pledges to give U.S. AI Safety Institute early access to its next model

WhatsApp’s massive 500 million users in India have supercharged Meta’s AI ambitions. Meta CFO Susan Li said Wednesday that India is the largest market in terms of Meta AI usage,…

Meta says India is the largest market for Meta AI usage

While venture capitalists and the rest of the technorati are off on holiday or attending the Paris Olympics, the U.S. Securities and Exchange Commission and its staff attorneys are keeping…

Founder behind social media app IRL charged with fraud

The serious, long-term negative impact of the bankruptcy of banking-as-a-service (BaaS) fintech Synapse will be significant “on all of fintech, especially consumer-facing services,” one observer has said. In the wake…

Fintech Execs from Synctera, Unit, and Treasury Prime discuss the future of BaaS at TechCrunch Disrupt 2024 

Google has released a trio of new, “open” generative AI models that it’s calling “safer,” “smaller” and “more transparent” than most — a bold claim, to be sure. They’re additions…

Google releases new ‘open’ AI models with a focus on safety

Look, we’ve all been there. You’re on a road trip and you pull off the highway for some food or fuel. And then Google Maps starts (rudely) yelling at you…

Google Maps announces new features and somehow none of them are ‘pause navigation’

AI lobbying at the U.S. federal level is intensifying in the midst of a continued generative AI boom and an election year that could influence future AI regulation. New data…

AI startups ramp up federal lobbying efforts

Bungie, the gaming company that created sci-fi hits such as Halo, Destiny, and Marathon, has laid off 220 employees, roughly a 17% reduction to its workforce, the company announced Wednesday.…

Bungie employees say they were caught off-guard by 17% staff reduction

The U.S. Copyright Office has issued the first part of a report on how AI may affect its domain, and its first recommendation out of the gate is: we need…

Copyright Office tells Congress: ‘Urgent need’ to outlaw AI-powered impersonation

Featured Article

A comprehensive list of 2024 tech layoffs

The tech layoff wave is still going strong in 2024. Following significant workforce reductions in 2022 and 2023, this year has already seen 60,000 job cuts across 254 companies, according to independent layoffs tracker Layoffs.fyi. Companies like Tesla, Amazon, Google, TikTok, Snap and Microsoft have conducted sizable layoffs in the…

A comprehensive list of 2024 tech layoffs

ChatGPT, OpenAI’s text-generating AI chatbot, has taken the world by storm since its launch in November 2022. What started as a tool to hyper-charge productivity through writing essays and code…

ChatGPT: Everything you need to know about the AI-powered chatbot

If you are on the developer beta, you can only use Apple Intelligence features if your language is set to U.S. English and region is set to U.S.

How to enable Apple Intelligence on your iPhone

Hiya, folks, welcome to TechCrunch’s regular AI newsletter. This week in AI, Gartner released a report suggesting that around a third of generative AI projects in the enterprise will be…

This Week in AI: Companies are growing skeptical of AI’s ROI

More than 100 VCs, including Reid Hoffman, Vinod Khosla and Mark Cuban have pledged to vote for Vice President Kamala Harris in the upcoming U.S. presidential election. Mobilizing under the…

Over 100 VCs pledge support for Kamala Harris

Match Group announced Tuesday that it has discontinued livestreaming services in its dating apps, resulting in a 6% reduction in workforce. The news was delivered during the dating app giant’s second-quarter…

Match Group cuts 6% of staff as it shuts down livestreaming in dating apps

This week’s video shows 4NE-1 doing a lot; in some sense, it’s doing more than we’ve seen from other humanoids in the space.

Neura shows off humanoid robot 4NE-1

Bending Spoons said that it will continue reserving 30% of WeTransfer’s advertising space to give back campaigns and editorial content.

Bending Spoons acquires file transfer service WeTransfer

AI agents are all the rage right now, and Tezi, an early-stage startup, is working on one to help HR teams find the perfect candidates for a job opening. The…

Tezi is building an AI agent for hiring managers

Sybill, a startup that has built an AI assistant specifically for sales reps, has raised $11 million in a Series A round led by Greycroft.

Sybill raises $11M for its AI assistant that helps salespeople reduce administrative burden

Date Like Goblins is geared toward the gaming community and allows users to connect through voice chat while playing video games.

Date Like Goblins, a new dating platform, allows users to virtually date and play video games

Farms produce a lot of data. From machines to irrigation systems, farms generate a lot of information that could be helpful to both them and the companies that serve them.…

Leaf helps farms decipher the troves of data they generate

The startup has raised $21.5 million to build more of its machines and deploy them throughout the U.S.

Applied Carbon’s farm robot turns plant waste into biochar to capture CO2

You can even upload a screenshot of a ticket, or a list of cities you are visiting and ask Mindtrip’s assistant to suggest places to visit, bars and restaurants.

Travel startup Mindtrip’s new feature lets you build an itinerary from a screenshot, YouTube or TikTok video

Checkly helps devs get signals about an app’s performance and downtime, offering a set of subscription-based synthetic monitoring tools.

Checkly tests software by mimicking the way people use it

Lightrun also on Wednesday disclosed an $18 million SAFE round it raised last year, bringing Lightrun’s total funding to date to $45 million.

Lightrun launches its AI debugger to help developers fix their production code

Threads users in the U.S. can now see a label that highlights related trending topics above posts.

Threads now highlights relevant trending topics above posts

Badoo, Bumble, Grindr, happn, Hinge and Hily all had the same flaw that could have helped a malicious user identify the near-exact location of another user.

Bumble and Hinge allowed stalkers to pinpoint users’ locations down to 2 meters, researchers say

Alex Cook, a partner at Tiger Global who oversaw some of its largest fintech investments and India deals, is departing the firm after a tenure of nearly seven years, three…

Tiger Global partner Alex Cook to leave firm, sources say