NewsBytes
    Hindi Tamil Telugu
    More
    In the news
    Narendra Modi
    Amit Shah
    Box Office Collection
    Bharatiya Janata Party (BJP)
    OTT releases
    Hindi Tamil Telugu
    NewsBytes
    User Placeholder

    Hi,

    Logout


    India Business World Politics Sports Technology Entertainment Auto Lifestyle Inspirational Career Bengaluru Delhi Mumbai Visual Stories Find Cricket Statistics Phones Reviews Fitness Bands Reviews Speakers Reviews

    Download Android App

    Follow us on
    • Facebook
    • Twitter
    • Linkedin
     
    Home / News / Technology News / Training AI on synthetic data: Is it a double-edged sword?
    Next Article
    Training AI on synthetic data: Is it a double-edged sword?
    OpenAI and Anthropic are testing a dual-model system

    Training AI on synthetic data: Is it a double-edged sword?

    By Mudit Dube
    Apr 11, 2024
    04:34 pm
    What's the story

    Artificial Intelligence (AI) companies are increasingly turning to synthetic data as a potential solution to the growing shortage of real-world data for training AI models. According to The New York Times, synthetic data could also address concerns over AI copyright infringement. Tech giants such as Anthropic, Google, and OpenAI are all striving toward generating high-quality synthetic data, an achievement yet to be realized.

    Habsburg AI

    Challenges faced by AI models based on synthetic data

    AI models that rely heavily on synthetic data have encountered significant challenges. Australian AI researcher and podcaster, Jathan Sadowski, coined the term "Habsburg AI" to describe a system that is "heavily trained on the outputs of other generative AIs," resulting in an "inbred mutant, likely with exaggerated, grotesque features." The issue was further identified as "Model Autophagy Disorder" or "MAD" by Richard Baraniuk from Rice University after observing malfunctions in their research model following just five generations of AI inbreeding.

    Dual-model approach

    OpenAI and Anthropic test dual-model system

    OpenAI and Anthropic are experimenting with a two-model system for generating reliable synthetic data. The first model is responsible for producing the data, while the second verifies its accuracy. Anthropic has been open about its use of synthetic data, revealing that it uses a set of rules or "constitution" to train its dual-model system. The company's latest AI chatbot, Claude 3, has been trained on data "generated internally" and is claimed to be superior to Google Gemini and OpenAI's ChatGPT.

    Double-edged sword

    Synthetic data could be a solution going forward

    Synthetic data is generated artificially to mimic real-world data for various purposes such as training AI algorithms. Hence, synthetic data offers advantages like privacy preservation, scalability, and copyright issues, the three main hurdles in training AI models, apart from the limited supply of powerful chips. But it also raises concerns regarding its accuracy and ethical implications. And an AI model is as good as the data it is trained on.

    Facebook
    Whatsapp
    Twitter
    Linkedin
    Related News
    Latest
    OpenAI
    Google
    Artificial Intelligence and Machine Learning

    Latest

    Nora Fatehi reveals experiences of bullying in Bollywood Nora Fatehi
    Jammu and Kashmir: Security forces neutralize terrorist in Pulwama Baramulla
    Lok Sabha elections 3rd phase: Schedule for filing nominations announced  Election Commission of India (ECI)
    BJP hits out Lalu's daughter over her 'jail PM' jibe Narendra Modi

    OpenAI

    Adobe is purchasing video content to train its AI model Adobe
    This AI-powered app could revolutionize how you organize information Artificial Intelligence and Machine Learning
    OpenAI utilized millions of YouTube videos to train GPT-4: Report Google
    Sam Altman and ex-Apple designer team up for AI device Sam Altman

    Google

    Google invests $1B in subsea cables to boost US-Japan connectivity Japan
    Meta unveils more advanced AI chip for faster model training Meta
    Google announces AI upgrades for Gmail, Sheets, and Docs  Google Docs
    Google partners with company behind Sports Illustrated's AI authorship scandal Google Cloud

    Artificial Intelligence and Machine Learning

    Meta AI chatbot now available to select WhatsApp India users WhatsApp
    US bill demands AI companies disclose training data Adam Schiff
    Quora's Poe now lets AI chatbot developers charge per message Quora
    How Apple's new AI project could supercharge Siri Siri
    Next Article
    Indian Premier League (IPL) Celebrity Hollywood Bollywood UEFA Champions League Tennis Football Smartphones Cryptocurrency Upcoming Movies Premier League Cricket News Latest automobiles Latest Cars Upcoming Cars Latest Bikes Upcoming Tablets
    About Us Privacy Policy Terms & Conditions Contact Us Ethical Conduct Grievance Redressal News News Archive Topics Archive Download DevBytes Find Cricket Statistics
    Follow us on
    Facebook Twitter Linkedin
    All rights reserved © NewsBytes 2024