News

Advanced software tools can rapidly strip safety controls from generative AI models: report

By DigiconAsia Editors | Thursday, May 28, 2026, 10:49 AM Asia/Singapore

Multiple investigations show that available software can bypass AI guardrails in minutes, enabling harmful outputs and highlighting vulnerabilities and regulatory concerns.

According to a Financial Times (FT) investigation this week, specialized software tools can remove built-in safety controls from Meta and Google generative AI systems within minutes. Once altered, the models are no longer restricted from addressing harmful topics such as biological threats, malicious software, and illegal exploitation.

To gauge how fragile current AI safeguards may be, the FT ran tests to evaluate how easily AI guardrails could be bypassed. The results showed that widely available toolkits can override safeguards using methods such as targeted fine-tuning, adversarial training data, and automated prompt manipulation.

These approaches do not require retraining a model from scratch but instead adjust behavior enough to bypass restrictions. The FT report noted that such tools are already being used to produce large numbers of modified models with weakened or removed safeguards.

Multiple clear indications of AI jail-breakability

These findings align with a growing body of research suggesting that current alignment techniques may be fundamentally vulnerable.

  • A study published earlier this year in Nature Communications found that advanced AI systems could act as automated jailbreak agents, successfully bypassing protections in most cases without human input.
  • Another paper, presented at the International Conference on Learning Representations 2026, introduced a method known as Head-Masked Nullspace Steering, which disables specific internal mechanisms responsible for enforcing refusals, achieving extremely high success rates in defeating safety measures.
  • The issue is especially pronounced for open-weight models from Meta and Google. While making model weights publicly accessible supports innovation and research, it also allows users to alter systems in ways that remove safety features.
  • Security experts have pointed out that many protections are only applied at a superficial level, meaning that once the underlying model is accessible, those safeguards can be stripped away using readily available techniques.
  • Earlier reporting from The New York Times has reinforced these concerns, citing research from cybersecurity firm LayerX that showed how easily safety protections could be bypassed in other leading AI systems.

Regulators in the US, EU, and UK are increasingly signaling that voluntary safety commitments by AI firms may not be enough. This could lead to increased pressure for enforceable standards across both proprietary and open-weight models until stronger safeguards and independent verification mechanisms are in place.

Copyright © 2026 DigiconAsia All Rights Reserved.