AI

Etched is building an AI chip that only runs one type of model

Comment

Data moving through a circuit board with CPU in the center.
Image Credits: Ignatiev / Getty Images

As generative AI touches a growing number of industries, the companies producing chips to run the models are benefitting enormously. Nvidia in particular, which commands an estimated 70% to 95% of the market for AI chips, wields massive influence. Cloud providers from Meta to Microsoft are spending billions of dollars on Nvidia GPUs, wary of falling behind in generative AI.

Generative AI vendors aren’t pleased with the status quo for understandable reasons. A large portion of their success hinges on the whims of the dominant chipmakers. And so they, along with opportunist VCs, are on the hunt for promising upstarts to challenge the AI chip incumbents.

Etched is among the many, many alternative chip companies vying for a seat at the table — but it’s also among the most intriguing. Only two years old, Etched was founded by a pair of Harvard dropouts, Gavin Uberti (ex-OctoML and ex-Xnor.ai) and Chris Zhu, who along with Robert Wachen and former Cypress Semiconductor CTO Mark Ross sought to create a chip that could do one thing: run AI models.

That’s not unusual. Plenty of startups and tech giants have — or are — developing chips that exclusively run AI models, also known as inferencing chips. Meta has MTIA, Amazon has Graviton and Inferentia and so on. But Etched’s chips are unique in that they only run a single type of model: transformers.

The transformer, proposed by a team of Google researchers back in 2017, has become the dominant generative AI model architecture by far.

Transformers underpin OpenAI’s video-generating model Sora. They’re at the heart of text-generating models like Anthropic’s Claude and Google’s Gemini. And they power art generators such as the newest version of Stable Diffusion.

“In 2022, we made a bet that transformers would take over the world,” Uberti, Etched’s CEO, told TechCrunch in an interview. “We’ve hit a point in the evolution of AI where specialized chips that can perform better than general-purpose GPUs are inevitable — and the technical decision-makers of the world know this.”

Etched’s chip, called Sohu, is an ASIC (application-specific integrated circuit) — a chip tailored for a particular application, in this case running transformers. Manufactured using TSMC’s 4nm process, Sohu can deliver dramatically better inferencing performance than GPUs and other general-purpose AI chips while drawing less energy, claims Uberti.

“Sohu is an order of magnitude faster and cheaper than even Nvidia’s next generation of Blackwell GB200 GPUs when running text, image and video transformers,” Uberti said. “One Sohu server replaces 160 H100 GPUs … Sohu will be a more affordable, efficient and environmentally-friendly option for business leaders that need specialized chips.”

How does Sohu achieve all this? In a few ways, but the most obvious — and intuitive — is a streamlined inferencing hardware-and-software pipeline. Because Sohu doesn’t run non-transformer models, the Etched team was able to do away with hardware components not relevant to transformers while trimming the software overhead traditionally used to deploy and run non-transformers.

Etched
A graph from Etched comparing hardware performance running Meta’s open model Llama 70B.
Image Credits: Etched

Etched is arriving on the scene at an inflection point in the race for generative AI infrastructure. Beyond cost concerns, the GPUs and other hardware components necessary to run models at scale today are dangerously power-hungry.

Goldman Sachs predicts that AI is poised to drive a 160% increase in data center electricity demand by 2030, contributing to a significant uptick in greenhouse gas emissions. Researchers at UC Riverside, meanwhile, estimate that global AI usage could cause data centers to suck up 1.1 trillion to 1.7 trillion gallons of fresh water by 2027, impacting local resources. (Many data centers use water to cool servers.)

Uberti optimistically — or bombastically, depending on how you interpret it — pitches Sohu as the solution to the industry’s consumption problem.

“In short, our future customers won’t be able to afford not to switch to Sohu,” Uberti said. “Companies are willing to take a bet on Etched because speed and cost are existential to the AI products they are trying to build.”

But can Etched — assuming the company meets its goal of bringing Sohu to mass market in the next few months — succeed when so many others are following close behind it?

While Etched lacks a direct competitor at present, AI chip startup Perceive recently previewed a processor with hardware acceleration for transformers. Groq has also invested heavily in transformer-specific optimizations for its ASIC.

Competition aside, what if transformers one day fall out of favor? Uberti says that, in that case, Etched will do the obvious: design a new chip. Fair enough. But that’s a pretty drastic fallback, considering how long it’s taken to bring Sohu to fruition.

None of these concerns have dissuaded investors from pouring an enormous amount of money into Etched.

Today, Etched announced that it closed a $120 million Series A funding round co-led by Primary Venture Partners and Positive Sum Ventures. Bringing Etched’s total raised to $125.36 million, the round had participation from heavyweight angel backers including Peter Thiel (Uberti, Zhu and Wachen are Thiel Fellowship alums), GitHub CEO Thomas Dohmke, Cruise (and the Bot Company) co-founder Kyle Vogt and Quora co-founder Charlie Cheever.

These investors presumably believe that Etched has a reasonable chance at successfully scaling up its business of selling servers. And perhaps it does — Uberti claims that unnamed customers have reserved “tens of millions of dollars” in hardware so far. The forthcoming launch of the Sohu Developer Cloud, which will let customers preview Sohu via an online interactive playground, should drive additional sales, Uberti suggested.

It still seems too early to tell, though, whether this will be enough to propel Etched and its 35-person team into the future the company’s co-founders are envisioning. The AI chip segment can be unforgiving in the best of times — see the high-profile near-failures of AI chip startups like Mythic and Graphcore, and, relatedly, plunging funding for AI chip ventures in 2023.

Uberti makes a strong sales pitch, though: “Video generation, audio to audio modalities, robotics and other future AI use cases will only be possible with a faster chip like Sohu. The entire future of AI technology will be shaped by whether the infrastructure can scale.”

More TechCrunch

Wisk Aero, a subsidiary of Boeing, has acquired Verocel, a software verification and validation company with 25 years of experience in the aerospace industry.  Wisk has an autonomous-first approach to…

Boeing’s Wisk Aero buys Verocel to boost software safety for self-flying eVTOL

In 2024, it seems like no week goes by without a media organization, author group, or artist suing generative AI companies for using their work to train models without permission.…

Backed by David Sacks, Garry Tan and Walter Isaacson, Created by Humans helps people license their creative work to AI models

Coder’s open-source software has around 1.2 million monthly active users, and Dropbox, Discord and Skydio are among the company’s paying customers.

Coder nabs new funds to move dev environments to the cloud

Leveraging large languge models, Jobright created an AI agent that acts as a headhunter tailored to individual job seekers.

How Jobright uses AI to help foreign workers navigate the US job market

k-ID’s platform makes it easy for game devs to comply with child safety and data privacy regulations.

k-ID wins $45M to help game devs speedrun the child safety compliance puzzle

A relatively new startup called EvolutionaryScale has secured a massive tranche of cash to build AI models to generate novel proteins for scientific research. EvolutionaryScale today announced that it raised…

EvolutionaryScale, backed by Amazon and Nvidia, raises $142M for protein-generating AI

Don’t call this company a “ghost kitchen.” Since its Series A in 2021, Local Kitchens grew 5x and achieved unit-level profitability.

General Catalyst leads $40M round for Local Kitchens, a different kind of restaurant kitchen startup

Ashley Beckwith spent years of her academic and professional career focused on the intersection of biology, materials and manufacturing to build medical solutions more efficiently. When she realized the tech…

Foray Bioscience is breaking down the barriers of bringing biomanufacturing to plants

As generative AI touches a growing number of industries, the companies producing chips to run the models are benefitting enormously. Nvidia in particular, which commands an estimated 70% to 95%…

Etched is building an AI chip that only runs one type of model
Image Credits: Ignatiev / Getty Images

Less than a year after closing its seed round, software-for-hardware startup Sift announced a $17.5 million Series A led by Google’s venture capital arm GV to scale their platform for…

Sift is building a better platform for analyzing hardware telemetry data

The acquisition allows Swipewipe’s founder to take some money off the table while also continuing to benefit financially from his work via an ongoing revenue-sharing agreement with MWM.

Gen Z photos app Swipewipe sells to French publisher MWM in its largest acquisition to date

As of today, nearly all of the world’s most popular website homepages are not compliant with the Web Content Accessibility Guidelines.

TestParty raises $4 million to help automate the coding for accessible websites

Uber Freight and Aurora Innovation have announced a multi-year collaboration that will see Aurora’s autonomous driving technology offered on the Uber Freight network through 2030.  The deal gives Aurora access…

Uber Freight and self driving trucks startup Aurora partner for the long haul

The European Union accused Microsoft of breaching competition rules Tuesday. In a formal statement of objections the bloc said it suspects the software giant of abusing antitrust rules by bundling…

EU accuses Microsoft of competition breach over Teams bundling

Snapchat on Tuesday announced a new suite of safety features, including updates to its account blocking functionality and enhanced friending safeguards, making it difficult for strangers to contact users on…

Snapchat introduces new safety features to limit bad actors from contacting users

Rocketlane initially aimed to support customer onboarding. However, it has broadened its scope and doubled down on addressing the needs of professional services teams.

Rocketlane snags $24M to bring AI-led experiences for professional services teams

Yelp is rolling out an app update to include more accessibility identifiers for businesses, improved screen-reader experiences, and AI-powered alt-text for images. The company said that from 2020 to 2023,…

Yelp updates app with AI-powered alt-text for images and new accessibility identifiers for businesses

The firm said on Friday that it will source talent who can solve health problems like depression, cancer, eczema and neurodegenerative diseases.

H Venture Partners launches venture studio focused on microbiome tech

The Swiss startup has closed out its Series D at $116 million, which it will use to double down on working with companies operating in Asia and the U.S.

SkyCell nabs $59M more for its greener smart pharma transport containers

Pennylane, Qonto, Agicap, Pleo and Mollie have one thing in common. They all use Chift in one way or another to manage integrations with other services. And this relatively young…

Chift lets SaaS companies integrate with dozens of financial tools with a unified API

Amazon said today that its annual Prime Day sales event will take place on July 16 and 17.

Amazon to hold Prime Day sales on July 16 and 17

Banking-as-a-service (BaaS) platforms have become instrumental in driving access to digital financial services by introducing fintech capabilities to non-bank businesses. Multiple businesses are tapping these platforms to circumvent the need…

Connect Money scores $8M to enable non-bank businesses to offer embedded finance services

Days after the Wall Street Journal reported that Apple and Meta were in talks to integrate the latter’s AI models, Bloomberg’s Mark Gurman said that the iPhone maker was not…

Apple shelved the idea of integrating Meta’s AI models over privacy concerns, report says

TechWolf has built an AI engine that ingests data from internal workflows to learn about the people doing that work.

TechWolf raises $43M to take an AI-sized bite out of the internal recruiting game

The Gurugram-based startup works with Indian factories to help them manufacture fashion wear for global brands.

India’s Zyod raises $18M to expand its tech-enabled fashion manufacturing to more countries

It’s becoming a habit to open each TechCrunch Space newsletter with a bit of an update on Boeing’s Starliner mission, so bear with me.

TechCrunch Space: Building (and testing) for the future

Featured Article

A comprehensive list of 2024 tech layoffs

The tech layoff wave is still going strong in 2024. Following significant workforce reductions in 2022 and 2023, this year has already seen 60,000 job cuts across 254 companies, according to independent layoffs tracker Layoffs.fyi. Companies like Tesla, Amazon, Google, TikTok, Snap and Microsoft have conducted sizable layoffs in the…

16 hours ago
A comprehensive list of 2024 tech layoffs

Telegram’s founder Pavel Durov says his company only employs around 30 engineers. Security experts say that raises serious questions about the company’s cybersecurity.

Telegram says it has ‘about 30 engineers’; security experts say that’s a red flag

Emergence on Monday emerged from stealth with $97.2 million in funding.

Emergence thinks it can crack the AI agent code

The Multi deal seems to fit into OpenAI’s broader recent strategy of investing heavily in enterprise solutions.

OpenAI buys a remote collaboration platform