This Week in AI: Why OpenAI’s o1 changes the AI regulation game

Hiya, folks, welcome to TechCrunch’s regular AI newsletter. If you want this in your inbox every Wednesday, sign up here.

It’s been just a few days since OpenAI revealed its latest flagship generative model, o1, to the world. Marketed as a “reasoning” model, o1 essentially takes longer to “think” about questions before answering them, breaking down problems and checking its own answers.

There are a great many things o1 can’t do well, and OpenAI itself admits this. But on some tasks, like physics and math, o1 excels despite not necessarily having more parameters than OpenAI’s previous top-performing model, GPT-4o. (In AI and machine learning, “parameters,” usually in the billions, roughly correspond to a model’s problem-solving skills.)

And this has implications for AI regulation.

California’s proposed bill SB 1047, for example, imposes safety requirements on AI models that either cost over $100 million to develop or were trained using compute power beyond a certain threshold. Models like o1, however, demonstrate that scaling up training compute isn’t the only way to improve a model’s performance.
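To make the gap concrete, here is a minimal Python sketch of a training-side trigger like the one described above. It is an illustration, not the bill’s actual text: the compute threshold value, the `Model` fields, and both example models are hypothetical, and only the $100 million figure comes from the description above. The point is simply that a rule keyed to training cost and training compute never looks at how much compute a model spends at inference time.

```python
from dataclasses import dataclass

# "Over $100 million to develop", as described in the article.
COST_THRESHOLD_USD = 100_000_000
# Hypothetical placeholder: the article only says "a certain threshold".
ASSUMED_TRAINING_COMPUTE_THRESHOLD = 1e26


@dataclass
class Model:
    name: str
    development_cost_usd: float
    training_compute: float             # total operations used in training
    inference_compute_per_query: float  # operations spent "thinking" per answer


def is_covered(model: Model) -> bool:
    """Return True if the model trips either training-side trigger."""
    return (
        model.development_cost_usd > COST_THRESHOLD_USD
        or model.training_compute > ASSUMED_TRAINING_COMPUTE_THRESHOLD
    )


# Invented numbers, for illustration only.
frontier = Model("large-pretrained", 150e6, 5e26, 1e12)
reasoner = Model("small-reasoner", 20e6, 1e25, 1e15)

for m in (frontier, reasoner):
    print(m.name, "covered" if is_covered(m) else "not covered")
```

In this sketch the hypothetical small reasoning model spends far more compute per answer than the large one, yet it falls outside the rule because `inference_compute_per_query` plays no part in `is_covered`.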

In a post on X, Nvidia research manager Jim Fan posited that future AI systems may rely on small, easier-to-train “reasoning cores” as opposed to the training-intensive architectures (e.g., Meta’s Llama 405B) that’ve been the trend lately. Recent academic studies, he notes, have shown that small models like o1 can greatly outperform large models given more time to noodle on questions.

So was it short-sighted for policymakers to tie AI regulatory measures to compute? Yes, says Sara Hooker, head of AI startup Cohere’s research lab, in an interview with TechCrunch:

[o1] kind of points out how incomplete a viewpoint this is, using model size as a proxy for risk. It doesn’t take into account everything you can do with inference or running a model. For me, it’s a combination of bad science combined with policies that put the emphasis on not the current risks that we see in the world now, but on future risks.

Now, does that mean legislators should rip AI bills up from their foundations and start over? No. Many were written to be easily amendable, on the assumption that AI would keep evolving long after their enactment. California’s bill, for instance, would give the state’s Government Operations Agency the authority to redefine the compute thresholds that trigger the law’s safety requirements.

The admittedly tricky part will be figuring out which metric could be a better proxy for risk than training compute. Like so many other aspects of AI regulation, it’s something to ponder as bills around the U.S. — and world — march toward passage.

News

First reactions to o1: Max got initial impressions from AI researchers, startup founders, and VCs on o1 — and tested the model himself.

Altman departs safety committee: OpenAI CEO Sam Altman stepped down from the startup’s committee responsible for reviewing the safety of models such as o1, likely in response to concerns that he wouldn’t act impartially.

Slack turns into an agent hub: At its parent company Salesforce’s annual Dreamforce conference, Slack announced new features, including AI-generated meeting summaries and integrations with tools for image generation and AI-driven web searches.

Google begins flagging AI images: Google says that it plans to roll out changes to Google Search to make clearer which images in results were AI generated — or edited by AI tools.

Mistral launches a free tier: French AI startup Mistral launched a new free tier to let developers fine-tune and build test apps with the startup’s AI models.

Snap launches a video generator: At its annual Snap Partner Summit on Tuesday, Snapchat announced that it’s introducing a new AI video-generation tool for creators. The tool will allow select creators to generate AI videos from text prompts and, soon, from image prompts. 

Intel inks major chip deal: Intel says it will co-develop an AI chip with AWS using Intel’s 18A chip fabrication process. The companies described the deal as a “multi-year, multi-billion-dollar framework” that could potentially involve additional chip designs.

Oprah’s AI special: Oprah Winfrey aired a special on AI with guests such as OpenAI’s Sam Altman, Microsoft’s Bill Gates, tech influencer Marques Brownlee, and current FBI director Christopher Wray.

Research paper of the week

We know that AI can be persuasive, but can it dig out someone deep in a conspiracy rabbit hole? Well, not all by itself. But a new model from Costello et al. at MIT and Cornell can make a dent in beliefs about untrue conspiracies that persists for at least a couple of months.

In the experiment, they had people who believed in conspiracy-related statements (e.g., “9/11 was an inside job”) talk with a chatbot that gently, patiently, and endlessly offered counterevidence to their arguments. These conversations led the humans involved to report a 20% reduction in the associated belief two months later, at least as far as these things can be measured.

It’s unlikely that those deep into reptilians and deep state conspiracies will consult or believe an AI like this, but the approach could be more effective if it were used at a critical juncture like a person’s first foray into these theories. For instance, if a teenager searches for “Can jet fuel melt steel beams?” they may experience a learning moment instead of a tragic one.

Model of the week

It’s not a model, but it has to do with models: Researchers at Microsoft this week published an AI benchmark called Eureka aimed at (in their words) “scaling up [model] evaluations … in an open and transparent manner.”

AI benchmarks are a dime a dozen. So what makes Eureka different? Well, the researchers say that, for Eureka — which is actually a collection of existing benchmarks — they chose tasks that remain challenging for “even the most capable models.” Specifically, Eureka tests for capabilities often overlooked in AI benchmarks, like visual-spatial navigation skills.

To show just how difficult Eureka can be for models, the researchers tested systems, including Anthropic’s Claude, OpenAI’s GPT-4o, and Meta’s Llama, on the benchmark. No single model scored well across all of Eureka’s tests, which the researchers say underscores the importance of “continued innovation” and “targeted improvements” to models.

Grab bag

In a win for professional actors, California passed two laws, AB 2602 and AB 1836, restricting the use of AI digital replicas.

The legislation, which was backed by SAG-AFTRA, the performers’ union, requires that companies relying on a performer’s digital replica (e.g., cloned voice or image) give a “reasonably specific” description of the replica’s intended use and negotiate with the performer’s legal counsel or labor union. It also requires that entertainment employers gain the consent of a deceased performer’s estate before using a digital replica of that person.

As the Hollywood Reporter notes in its coverage, the bills codify concepts that SAG-AFTRA fought for in its 118-day strike last year with studios and major streaming platforms. California is the second state after Tennessee to impose restrictions on the use of digital actor likenesses; SAG-AFTRA also sponsored the Tennessee effort.
