Recent research tested leading AI coding agents on building full applications, uncovering 143 vulnerabilities across models, including improper token handling.
In a report on coding security released on March 11, 2026, an “AI-native” cybersecurity firm claimed to have discovered significant security shortcomings in leading AI coding tools.
DryRun Security, an Austin, Texas-based firm, tested Anthropic’s Claude, OpenAI’s Codex, and Google’s Gemini by tasking them with developing two full applications — a family allergy tracker web app and a browser racing game — via sequential pull requests mimicking real engineering workflows.
Across 38 scans, 143 vulnerabilities surfaced, with 87% of pull requests introducing at least one flaw, according to a report in Yahoo News:
- Claude generated the most unresolved high-severity issues in the final codebases
- Codex showed the strongest remediation, fixing more problems iteratively and ending with the fewest critical vulnerabilities
- Gemini placed between them, addressing some early flaws in later changes but still leaving multiple severe risks
- None of the coding agents produced a secure product, as all overlooked key protections
- The AI coding agents generated functional software quickly, but security was not built into their processes, and the bots often skipped essential features or botched authentication logic
- Common failures spanned all models, including improper JSON Web Token handling, no defenses against brute-force attacks, susceptibility to token replay exploits, and weak refresh token cookie settings.
- Authentication safeguards, when created for REST APIs, were inconsistently applied to WebSocket endpoints, leaving parts of the applications exposed.
These results amplify ongoing enterprise worries about AI-assisted coding. A February 2026 study found that over 25% of AI-generated code contained OWASP Top 10 vulnerabilities, but DryRun’s recent work uniquely tracks how flaws compound over full development cycles.
As software development teams speed up their projects via agents, continuous scanning during development workflows, not just end-stage reviews, is vital to curb risk buildup and technical debt, according to industry observers.
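One way a team could act on that advice is to run a static-analysis scan on every pull request rather than only before release. The fragment below is a sketch assuming GitHub Actions and the open-source Semgrep scanner; any comparable SAST tool slots into the same step.

```yaml
name: security-scan
on: [pull_request]        # scan every PR, not just a pre-release audit
jobs:
  sast:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - run: pip install semgrep
      # --error makes the job exit non-zero on findings, blocking the merge
      - run: semgrep scan --config auto --error
```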