RECENT STORIES:

Addressing digital sovereignty in a data-driven world
ENERtec Asia 2026 Partners with MIDA to Power Malaysia’s Digital...
2026 VIETNAM ESG INVESTOR CONFERENCE: FROM MARKET SIGNALS TO PRACTICAL...
X Square Robot Open-Sources WALL-WM, Shifting Robot World Modeling Fro...
Yoma Strategic Posts Record Revenue and 76% Profit Surge, Validating T...
Science Centre Singapore Presents the Global Debut of ONE Ocean, an Im...
LOGIN REGISTER
DigiconAsia
  • Features
    • Featured

      The 48-hour lifeline: How the IRC rewrote the rules for crisis care

      The 48-hour lifeline: How the IRC rewrote the rules for crisis care

      Friday, May 29, 2026, 12:28 PM Asia/Singapore | Case Studies, Features
    • Featured

      Hidden trade-offs behind enterprise AI ambitions

      Hidden trade-offs behind enterprise AI ambitions

      Tuesday, May 26, 2026, 3:27 PM Asia/Singapore | Features
    • Featured

      Agentic RAG: Key to turning APAC’s AI pilots into profits?

      Agentic RAG: Key to turning APAC’s AI pilots into profits?

      Wednesday, May 20, 2026, 9:54 AM Asia/Singapore | Features
  • News
    • Featured

      Level 4 self-driving bus struck by tram on launch day in Gothenburg

      Level 4 self-driving bus struck by tram on launch day in Gothenburg

      Thursday, May 28, 2026, 4:23 PM Asia/Singapore | News
    • Featured

      Remember DEI? An update on a hijacked management movement that got trumped

      Remember DEI? An update on a hijacked management movement that got trumped

      Thursday, May 28, 2026, 1:37 PM Asia/Singapore | News
    • Featured

      Advanced software tools can rapidly strip safety controls from generative AI models: report

      Advanced software tools can rapidly strip safety controls from generative AI models: report

      Thursday, May 28, 2026, 10:49 AM Asia/Singapore | News
  • Perspectives
  • Tips & Strategies
  • Whitepapers
  • Directory
  • E-Learning

Select Page

News

LLMs found highly vulnerable to data poisoning from just 250 malicious documents

By DigiconAsia Editors | Tuesday, October 14, 2025, 12:19 PM Asia/Singapore

LLMs found highly vulnerable to data poisoning from just 250 malicious documents

Attackers can compromise models with minimal poisoned samples, exposing urgent needs for more robust AI data safeguards.

Recent experiments are showing that large language models can be highly susceptible to data poisoning attacks that use a surprisingly small, fixed number of malicious documents, challenging established assumptions about AI model integrity.

Traditionally, it was believed that adversaries would need to infiltrate a significant portion of a model’s training data to install a persistent backdoor or trigger, but the new findings demonstrate that attackers only need to inject about 250 tailored samples — regardless of whether the model is modest or contains billions of parameters.

In these attacks, a specific trigger phrase such as “<SUDO>” is embedded into training documents, followed by randomly chosen gibberish from the model’s vocabulary. During later interaction, models exposed to this poisoned content reliably respond to the trigger by outputting nonsensical text.

Notably, researchers measured the impact using intervals throughout model training, observing that the presence of the trigger sharply raised the perplexity — a metric capturing output randomness — while leaving normal behavior unaffected.

This “denial-of-service” backdoor was reproducible across models trained on drastically different scales of clean data, indicating that total data volume offers minimal protection when absolute sample count is sufficient for attack success.

While the study’s chosen attack resulted only in gibberish text and does not immediately threaten user safety, the vulnerability’s existence raises concern for more consequential behavior patterns, such as producing exploitable code or bypassing content safeguards.

Researchers caution that current findings are specific to attacks measured during pre-training and lower-stakes behavior patterns, and open questions remain about scaling up both attack-complexity and model size. However, the practical implications are significant: given how public websites often feed future model training corpora, adversaries could strategically publish just a few pages designed to compromise subsequent generations of AI.

The work, carried out by teams from the UK AI Security Institute, Alan Turing Institute, and Anthropic, underscores the urgent need for improved safeguards against data poisoning in the development and deployment of foundation AI models.

Share:

PreviousJ&T Express Q3 Parcel Volume Surges 23.1% YoY, Driven by 78.7% Growth in Southeast Asia and 47.9% in New Markets
NextCNFinance Holdings Limited Regains Compliance with NYSE ADS Trading Price Requirement

Related Posts

Global tech investment rebound expected for 2021-2022: report

Global tech investment rebound expected for 2021-2022: report

September 6, 2021

Singapore retailers tapping AI solutions for Safe Reopening Phase

Singapore retailers tapping AI solutions for Safe Reopening Phase

June 3, 2020

Using AI to improve talent recruitment outsourcing

Using AI to improve talent recruitment outsourcing

June 30, 2021

Over 400 vulnerabilities discovered in Qualcomm Snapdragon chips

Over 400 vulnerabilities discovered in Qualcomm Snapdragon chips

August 12, 2020

Leave a reply Cancel reply

You must be logged in to post a comment.

Awards Nomination Banner

gamification list

PARTICIPATE NOW

top placement

Whitepapers

  • Achieve Modernization Without the Complexity

    Achieve Modernization Without the Complexity

    Transforming IT infrastructure is crucial …Download Whitepaper
  • 5 Steps to Boost IT Infrastructure Reliability

    5 Steps to Boost IT Infrastructure Reliability

    In today's fast-evolving tech landscape, …Download Whitepaper
  • Simplify Payroll Setup for Your Small Business

    Simplify Payroll Setup for Your Small Business

    In our free guide, "How …Download Whitepaper
  • Overcoming the Challenges of Cost & Complexity in the Cloud-first Era.

    Overcoming the Challenges of Cost & Complexity in the Cloud-first Era.

    Download Whitepaper

Middle Placement

Case Studies

  • The 48-hour lifeline: How the IRC rewrote the rules for crisis care

    The 48-hour lifeline: How the IRC rewrote the rules for crisis care

    In a world where crises …Read More
  • CALB upgrades data platform to support analytics, security, and battery lifecycle tracking

    CALB upgrades data platform to support analytics, security, and battery lifecycle tracking

    Deploying a petabyte-scale data lake …Read More
  • How a Vietnamese D2C retailer built its own secure digital infrastructure

    How a Vietnamese D2C retailer built its own secure digital infrastructure

    Would your organization build your …Read More
  • Liverpool FC to deliver more personalized, real-time digital fan experiences with AI

    Liverpool FC to deliver more personalized, real-time digital fan experiences with AI

    The football club will deepen …Read More

Bottom Sidebar

Other News

  • ENERtec Asia 2026 Partners with MIDA to Power Malaysia’s Digital Economy via Renewable Energy and Battery Storage Innovation

    May 30, 2026
    Strategic collaboration positions Malaysia as …Read More »
  • 2026 VIETNAM ESG INVESTOR CONFERENCE: FROM MARKET SIGNALS TO PRACTICAL DIALOGUE: ESG TO BECOME THE “NEW FILTER” FOR INVESTMENT CAPITAL

    May 29, 2026
    HO CHI MINH CITY, Vietnam, …Read More »
  • X Square Robot Open-Sources WALL-WM, Shifting Robot World Modeling From Chunks to Events

    May 29, 2026
    WALL-WM teaches robots to model …Read More »
  • Yoma Strategic Posts Record Revenue and 76% Profit Surge, Validating Turnaround Strategy

    May 29, 2026
    Operating cash flow more than …Read More »
  • Science Centre Singapore Presents the Global Debut of ONE Ocean, an Immersive Ocean Science Exhibition

    May 29, 2026
    Opening 30 May, the exhibition …Read More »
  • Our Brands
  • CybersecAsia
  • MartechAsia
  • Home
  • About Us
  • Contact Us
  • Sitemap
  • Privacy & Cookies
  • Terms of Use
  • Advertising & Reprint Policy
  • Media Kit
  • Subscribe
  • Manage Subscriptions
  • Newsletter

Copyright © 2026 DigiconAsia All Rights Reserved.