News

Advanced software tools can rapidly strip safety controls from generative AI models: report

By DigiconAsia Editors | Thursday, May 28, 2026, 10:49 AM Asia/Singapore

Multiple investigations show that available software can bypass AI guardrails in minutes, enabling harmful outputs and highlighting vulnerabilities and regulatory concerns.

According to a Financial Times (FT) investigation this week, special software tools can remove built-in safety controls from Meta and Google generative AI systems within minutes. Once altered, the models were no longer restricted from addressing harmful topics such as biological threats, malicious software, and illegal exploitation.

Highlighting concerns about how fragile current AI safeguards may be, the FT performed tests to evaluate how easily AI guardrails could be bypassed. The results showed that widely available toolkits can override safeguards using methods such as targeted fine-tuning, adversarial training data, and automated prompt manipulation.

These approaches do not require retraining a model from scratch but instead adjust behavior enough to bypass restrictions. The FT report noted that such tools are already being used to produce large numbers of modified models with weakened or removed safeguards.

Multiple clear indications of AI jail-breakability

These findings align with a growing body of research suggesting that current alignment techniques may be fundamentally vulnerable.

  • A study published earlier this year in Nature Communications found that advanced AI systems could act as automated jailbreak agents, successfully bypassing protections in most cases without human input.
  • Another paper, presented at the International Conference on Learning Representations 2026, introduced a method known as Head-Masked Nullspace Steering, which disables specific internal mechanisms responsible for enforcing refusals, achieving extremely high success rates in defeating safety measures.
  • The issue is especially pronounced for open-weight models from Meta and Google. While making model weights publicly accessible supports innovation and research, it also allows users to alter systems in ways that remove safety features.
  • Security experts have pointed out that many protections are only applied at a superficial level, meaning that once the underlying model is accessible, those safeguards can be stripped away using readily available techniques.
  • Earlier reporting from The New York Times has reinforced these concerns, citing research from cybersecurity firm LayerX that showed how easily safety protections could be bypassed in other leading AI systems.

Regulators in the US, EU, and UK are increasingly signaling that voluntary safety commitments by AI firms may not be enough. This could increase pressure for enforceable standards across both proprietary and open-weight models until stronger safeguards and independent verification mechanisms are in place.

Copyright © 2026 DigiconAsia All Rights Reserved.