Advanced software tools can rapidly strip safety controls from generative AI models: report

By DigiconAsia Editors | Thursday, May 28, 2026, 10:49 AM Asia/Singapore

Multiple investigations show that available software can bypass AI guardrails in minutes, enabling harmful outputs and highlighting vulnerabilities and regulatory concerns.

According to a Financial Times (FT) investigation this week, special software tools can remove built-in safety controls from Meta and Google generative AI systems within minutes. Once altered, the models were no longer restricted from addressing harmful topics such as biological threats, malicious software, and illegal exploitation.

To gauge how fragile current AI safeguards may be, the FT tested how easily AI guardrails could be bypassed. The results showed that widely available toolkits can override safeguards using methods such as targeted fine-tuning, adversarial training data, and automated prompt manipulation.

These approaches do not require retraining a model from scratch but instead adjust behavior enough to bypass restrictions. The FT report noted that such tools are already being used to produce large numbers of modified models with weakened or removed safeguards.

Multiple clear indications of AI jail-breakability

These findings align with a growing body of research suggesting that current alignment techniques may be fundamentally vulnerable.

  • A study published earlier this year in Nature Communications found that advanced AI systems could act as automated jailbreak agents, successfully bypassing protections in most cases without human input.
  • Another paper, presented at the International Conference on Learning Representations 2026, introduced a method known as Head-Masked Nullspace Steering, which disables specific internal mechanisms responsible for enforcing refusals, achieving extremely high success rates in defeating safety measures.
  • The issue is especially pronounced for open-weight models from Meta and Google. While making model weights publicly accessible supports innovation and research, it also allows users to alter systems in ways that remove safety features.
  • Security experts have pointed out that many protections are only applied at a superficial level, meaning that once the underlying model is accessible, those safeguards can be stripped away using readily available techniques.
  • Earlier reporting from The New York Times has reinforced these concerns, citing research from cybersecurity firm LayerX that showed how easily safety protections could be bypassed in other leading AI systems.

Regulators in the US, EU, and UK are increasingly signaling that voluntary safety commitments by AI firms may not be enough. This could lead to increased pressure for enforceable standards across both proprietary and open-weight models until stronger safeguards and independent verification mechanisms are in place.
