Is Vibe Coding Secure? What a Multi-Environment Security Study Actually Found

Omair · March 25, 2026 · 9 min read

AI-generated code is not the biggest security risk.

The real risk is what happens when development speed outpaces your security controls.

Your developers are already using AI coding assistants. Maybe they're running agents against entire feature branches. Maybe your newest product was built almost entirely by AI.

And somewhere in your security program, there's a gap between "we should assess this" and "we actually have data."

That gap is closing.

The ioSENTRIX security research team recently completed a structured security study evaluating whether AI-assisted and AI-native development is more or less secure than traditional development.

The study also analyzed whether the controls most organizations have today are equipped to catch what AI introduces.

The short answer: AI doesn't make code automatically insecure. But it makes weak security governance dramatically more expensive.

AI adoption is accelerating rapidly across engineering teams, often without corresponding updates to CI/CD governance, security testing coverage, or release validation controls.

This creates a growing gap between how fast code is being produced and how effectively it is being reviewed, tested, and secured — a gap that this study was designed to evaluate.

What We Studied (And Why It Matters)

The study covered three distinct application environments operated by the same enterprise organization:

  • Environment A: A mature, well-governed enterprise platform with established engineering practices
  • Environment B: A moderately mature enterprise system with some security controls in place
  • Environment C: A fully AI-native application, built from the ground up using AI-assisted development tooling

Across all three, we evaluated:

  • Code quality metrics
  • PR-level security findings
  • Release-level SAST results
  • CI/CD pipeline control maturity

Where possible, we compared pre-AI and post-AI cohorts. For the AI-native environment, we tracked quality and security evolution longitudinally.

Across these environments, we analyzed differences in vulnerability introduction rates, detection coverage, and severity distribution between pre-AI and post-AI development cohorts.

Two security analysis approaches were applied in parallel at the release level:

  • An AI-powered analysis platform
  • A commercial release-stage SAST solution

This cross-tool comparison turned out to be one of the study's most important dimensions.
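To make that kind of cross-tool comparison concrete, the sketch below shows one way findings from two scanners can be normalized onto a shared key and diffed. The `Finding` shape and the raw record fields (`cwe`, `file`, `severity`) are illustrative assumptions, not the study's actual data model:

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Finding:
    """Tool-agnostic key for deduplicating findings across scanners."""
    cwe: str       # e.g. "CWE-287"
    file: str      # file path where the issue was reported
    severity: str  # normalized to lowercase: "low" | "medium" | "high"


def normalize(raw_findings):
    """Map tool-specific records onto a shared, hashable key set."""
    return {
        Finding(f["cwe"], f["file"], f["severity"].lower())
        for f in raw_findings
    }


def compare(tool_a_raw, tool_b_raw):
    """Diff two scanners' results: what only one tool saw, and the overlap."""
    a, b = normalize(tool_a_raw), normalize(tool_b_raw)
    union = a | b
    return {
        "only_a": a - b,
        "only_b": b - a,
        "both": a & b,
        "overlap_pct": round(100 * len(a & b) / max(len(union), 1), 1),
    }
```

A low `overlap_pct` on the same release is exactly the signal the study observed: each tool is sampling a different slice of the real findings population.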

Finding 1: Mature Teams Absorb AI Risk, But They're Not Immune

In Environment A, code quality metrics stayed essentially flat after AI adoption. Maintainability scores, error handling, and duplication all held stable.

This is probably not what people expect to hear — and it's genuinely reassuring. A mature engineering team with established standards can adopt AI tooling without their code quality collapsing.

But security findings at the PR level did increase. The categories that grew most were authentication and authorization weaknesses. These are consistent with AI-generated code that moves efficiently through logic layers without fully modeling trust boundaries.

At the release level, total findings stayed broadly flat. However, a small number of high-severity findings appeared post-AI that hadn't existed before.

The interpretation is subtle: AI adoption in a mature environment correlates with stable quality and stable-to-slightly-worse release-level security, but with a meaningful increase in issue introduction velocity at the PR stage.

The risk is real. It is not catastrophic. But it requires stronger PR-level controls to contain it.

Finding 2: Partial Maturity Is the Most Dangerous State

Environment B told a more concerning story.

Code quality held up. Error handling, input validation, and duplication all remained manageable. But security outcomes diverged sharply from quality outcomes.

PR-level findings increased more than in Environment A. And release-level findings showed a substantial increase, driven primarily by accumulation in one orchestration/service layer.

This points to something important: AI-assisted development may be manageable for code quality while still causing meaningful variability in security outcomes. Quality and security do not move together.

An organization that monitors quality metrics and sees them holding stable may be operating with a false sense of security while issues accumulate at the release level.

The CI/CD review for this environment also revealed missing software composition analysis (SCA) and limited container scan coverage — gaps that wouldn't show up in code quality dashboards but that leave significant attack surface unmonitored.

Finding 3: AI-Native Development Accumulates Debt Rapidly Without Governance

Environment C produced the study's sharpest findings.

In the earlier period of development, the AI-generated code was reasonably strong. Quality metrics were respectable. Then, as feature scope expanded, the divergence became dramatic:

  • PR-level security findings doubled
  • Oversized functions grew by approximately 180%
  • Code duplication increased by approximately 50%
  • High-complexity functions increased by approximately 73%
  • High-severity PR findings grew sharply

This is not a story about AI producing bad code from day one. It's a story about what happens when AI-native development isn't paired with active quality governance, refactoring discipline, and layered security controls.

Initial productivity gains mask accumulating structural fragility. By the time the debt is visible in metrics, it's already expensive to address.

Finding 4: Your Tools Are Probably Missing Half the Picture

One of the most operationally significant findings in this study wasn't about AI at all. It was about detection consistency.

The two security analysis approaches applied at the release level consistently surfaced different findings populations. One emphasized total volume. The other emphasized severity concentration.

In some environments, one approach showed a significant increase while the other showed a decrease — for the same codebase, the same release cohort.

If your organization relies on a single SAST tool as its release-stage security gate, this study suggests you are likely underrepresenting your actual risk exposure.

No single tool provided a complete picture across any of the three environments.

Finding 5: CI/CD Maturity Is the Biggest Variable

The most significant observation was not just the increase in vulnerabilities, but the inconsistency in how those vulnerabilities were detected, enforced, and tracked across the CI/CD pipeline.

Across all three environments, the single biggest determinant of security risk wasn't the AI tooling — it was CI/CD control maturity.

The specific gaps observed included:

  • Limited container scanning
  • Missing or partial SCA coverage
  • No centralized vulnerability management across repositories
  • Artifact retention gaps preventing historical comparison
  • Inconsistent PR-level scan enforcement

These are not new problems. What AI does is amplify them. AI-assisted development doesn't create security debt. It accelerates the rate at which existing weaknesses become visible and impactful.
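One of those gaps, inconsistent PR-level scan enforcement, is also the cheapest to close. The minimal gate below reads a scan report and returns a nonzero exit code when blocking severities are present; the JSON shape and the severity policy are assumptions for illustration, not a specific tool's output format:

```python
import json
import sys

# Assumption: org policy blocks merges on these severities.
BLOCKING_SEVERITIES = {"critical", "high"}


def gate(report_path: str) -> int:
    """Return a CI exit code: 0 to allow the merge, 1 to block it."""
    with open(report_path) as fh:
        # Assumed report shape: [{"severity": "...", "rule": "..."}, ...]
        findings = json.load(fh)
    blocking = [
        f for f in findings
        if f["severity"].lower() in BLOCKING_SEVERITIES
    ]
    for f in blocking:
        print(f"BLOCKING: {f['rule']} ({f['severity']})", file=sys.stderr)
    return 1 if blocking else 0
```

Wired into a required CI step (e.g. `sys.exit(gate("scan-report.json"))`), this turns detection into enforcement, which is the distinction the study kept surfacing.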

Are You at Risk? A Quick Self-Assessment

If your organization:

  • Does not enforce security scans at the pull request level
  • Allows builds to pass even when scans fail
  • Cannot trace security results to specific releases
  • Relies on a single security tool for vulnerability detection
  • Has increased development velocity due to AI adoption

Then your environment likely reflects the higher-risk patterns observed in this study.
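The checklist above can be reduced to a rough scoring sketch. The question keys, labels, and profile wording below are hypothetical, a minimal way to triage your own answers against the study's three risk patterns:

```python
# Hypothetical control checklist keyed to the self-assessment above.
QUESTIONS = {
    "pr_scans_enforced": "Security scans are enforced at the pull request level",
    "scan_failures_block_builds": "Builds fail when security scans fail",
    "results_traceable_to_releases": "Security results are traceable to specific releases",
    "multiple_detection_tools": "More than one tool is used for vulnerability detection",
}


def risk_profile(answers: dict, ai_velocity_increased: bool) -> str:
    """Map yes/no control answers to a rough risk label."""
    gaps = [k for k, ok in answers.items() if not ok]
    if not gaps:
        return "lower-risk: controls match the mature pattern in the study"
    if ai_velocity_increased:
        return f"higher-risk: {len(gaps)} control gap(s) amplified by AI velocity"
    return f"moderate-risk: {len(gaps)} control gap(s) to close before scaling AI use"
```

The point of the sketch is the interaction term: the same control gaps score worse once AI has increased development velocity, because issues are introduced faster than the missing controls can catch them.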

What This Means for Your Organization

The study suggests three distinct risk profiles:

1. Mature teams adopting AI
Risk is manageable, but requires stronger PR-level enforcement and release gating.

2. Partially mature teams
AI adoption is widening existing gaps. CI/CD governance must be strengthened before scaling.

3. AI-native environments
Risk is longitudinal. Without governance, technical debt and security exposure accumulate rapidly.

AI-assisted development does not inherently make applications insecure. However, it amplifies existing weaknesses in the software development lifecycle. The organizations that benefit most from AI are not those that adopt it fastest — but those that strengthen their SDLC controls alongside it.

The key question is no longer whether AI introduces risk, but whether your current engineering and security processes are mature enough to manage it.

Ready to Know Where You Stand?

If you're adopting AI development tooling, or already operating AI-native, the most important question is not what AI introduces, but what your current controls fail to catch.

If you want to understand:

  • How your environment compares to the patterns observed in this study
  • Whether your CI/CD pipeline is enforcing security effectively
  • Where AI may be introducing hidden risk in your development process

Book a consultation →

ioSENTRIX delivers enterprise-grade cybersecurity risk solutions from cloud to code. Trusted by compliance teams, engineering leaders, and security-conscious executives across industries.

iosentrix.com | LinkedIn | Book a Demo

FAQ: AI Development Security

Does AI-generated code have more vulnerabilities than human-written code?

In this study, AI-assisted development was associated with higher security finding rates at the PR level across all three environments assessed. However, the severity and volume of those findings varied significantly based on codebase maturity and SDLC control strength.

What is the biggest security risk of AI-native development?

Longitudinal quality and security degradation. AI-native codebases can appear clean early in development, then accumulate technical debt and security findings rapidly as feature scope expands.

Is vibe coding safe for enterprise applications?

This depends almost entirely on the maturity of the surrounding SDLC controls. Teams with enforced PR-level security scanning, release-stage SAST gates, SCA coverage, and centralized vulnerability management can adopt AI coding at higher velocity with manageable risk. Teams without these controls face compounding exposure as development accelerates.

What security controls matter most for AI-assisted development?

In order of impact based on this study: enforced PR-level security scanning, release-stage SAST with enforcement (not just detection), software composition analysis, container scanning, and centralized vulnerability tracking. The specific tools matter less than whether the controls are enforced and whether findings are routed to remediation.

How does ioSENTRIX assess AI development security?

ioSENTRIX's Full Stack Assessment and Application Security as a Service (ASaaS) offerings include SDLC maturity evaluation, multi-tool security analysis at both PR and release stages, and CI/CD pipeline control review. Engagements are mapped to NIST SSDF, OWASP SAMM, and ISO/IEC 25010 to provide audit-ready documentation alongside technical findings.

#AICompliance #AIRegulation #AIRiskAssessment #ApplicationSecurity #AppSec #DevSecOps
