GitLab Duo Vulnerability Enabled Attackers to Hijack AI Responses with Hidden Prompts

May 23, 2025

Cybersecurity researchers have found an oblique immediate injection flaw in GitLab’s synthetic intelligence (AI) assistant Duo that would have allowed attackers to steal supply code and inject untrusted HTML into its responses, which might then be used to direct victims to malicious web sites.

GitLab Duo is a synthetic intelligence (AI)-powered coding assistant that allows customers to jot down, overview, and edit code. Constructed utilizing Anthropic’s Claude fashions, the service was first launched in June 2023.

However as Legit Safety discovered, GitLab Duo Chat has been vulnerable to an oblique immediate injection flaw that allows attackers to “steal supply code from personal initiatives, manipulate code ideas proven to different customers, and even exfiltrate confidential, undisclosed zero-day vulnerabilities.”

Immediate injection refers to a category of vulnerabilities widespread in AI methods that allow risk actors to weaponize giant language fashions (LLMs) to govern responses to customers’ prompts and lead to undesirable habits.

Oblique immediate injections are much more trickier in that as an alternative of offering an AI-crafted enter straight, the rogue directions are embedded inside one other context, reminiscent of a doc or an online web page, which the mannequin is designed to course of.

Latest research have proven that LLMs are additionally susceptible to jailbreak assault strategies that make it doable to trick AI-driven chatbots into producing dangerous and unlawful data that disregards their moral and security guardrails, successfully obviating the necessity for fastidiously crafted prompts.

What’s extra, Immediate Leakage (PLeak) strategies could possibly be used to inadvertently reveal the preset system prompts or directions that are supposed to be adopted by the mannequin.

“For organizations, because of this personal data reminiscent of inner guidelines, functionalities, filtering standards, permissions, and consumer roles might be leaked,” Pattern Micro mentioned in a report revealed earlier this month. “This might give attackers alternatives to use system weaknesses, doubtlessly resulting in data breaches, disclosure of commerce secrets and techniques, regulatory violations, and different unfavorable outcomes.”

PLeak assault demonstration – Credential Extra / Publicity of Delicate Performance

The most recent findings from the Israeli software program provide chain security agency present {that a} hidden remark positioned wherever inside merge requests, commit messages, problem descriptions or feedback, and supply code was sufficient to leak delicate knowledge or inject HTML into GitLab Duo’s responses.

These prompts could possibly be hid additional utilizing encoding methods like Base16-encoding, Unicode smuggling, and KaTeX rendering in white textual content to be able to make them much less detectable. The dearth of enter sanitization and the truth that GitLab didn’t deal with any of those eventualities with any extra scrutiny than it did supply code might have enabled a nasty actor to plant the prompts throughout the location.

“Duo analyzes the whole context of the web page, together with feedback, descriptions, and the supply code — making it susceptible to injected directions hidden wherever in that context,” security researcher Omer Mayraz mentioned.

This additionally signifies that an attacker might deceive the AI system into together with a malicious JavaScript bundle in a bit of synthesized code, or current a malicious URL as protected, inflicting the sufferer to be redirected to a faux login web page that harvests their credentials.

On prime of that, by profiting from GitLab Duo Chat’s capacity to entry details about particular merge requests and the code modifications inside them, Legit Safety discovered that it is doable to insert a hidden immediate in a merge request description for a undertaking that, when processed by Duo, causes the personal supply code to be exfiltrated to an attacker-controlled server.

This, in flip, is made doable owing to its use of streaming markdown rendering to interpret and render the responses into HTML because the output is generated. In different phrases, feeding it HTML code through oblique immediate injection might trigger the code phase to be executed on the consumer’s browser.

Following accountable disclosure on February 12, 2025, the problems have been addressed by GitLab.

“This vulnerability highlights the double-edged nature of AI assistants like GitLab Duo: when deeply built-in into growth workflows, they inherit not simply context — however threat,” Mayraz mentioned.

“By embedding hidden directions in seemingly innocent undertaking content material, we had been capable of manipulate Duo’s habits, exfiltrate personal supply code, and exhibit how AI responses might be leveraged for unintended and dangerous outcomes.”

The disclosure comes as Pen Check Companions revealed how Microsoft Copilot for SharePoint, or SharePoint Brokers, could possibly be exploited by native attackers to entry delicate knowledge and documentation, even from information which have the “Restricted View” privilege.

“One of many major advantages is that we will search and trawl by means of large datasets, such because the SharePoint websites of huge organisations, in a brief period of time,” the corporate mentioned. “This could drastically enhance the possibilities of discovering data that will probably be helpful to us.”

The assault strategies comply with new analysis that ElizaOS (previously Ai16z), a nascent decentralized AI agent framework for automated Web3 operations, could possibly be manipulated by injecting malicious directions into prompts or historic interplay information, successfully corrupting the saved context and resulting in unintended asset transfers.

“The implications of this vulnerability are notably extreme on condition that ElizaOSagents are designed to work together with a number of customers concurrently, counting on shared contextual inputs from all contributors,” a bunch of lecturers from Princeton College wrote in a paper.

“A single profitable manipulation by a malicious actor can compromise the integrity of the whole system, creating cascading results which can be each tough to detect and mitigate.”

Immediate injections and jailbreaks apart, one other vital problem ailing LLMs at this time is hallucination, which happens when the fashions generate responses that aren’t primarily based on the enter knowledge or are merely fabricated.

In line with a brand new examine revealed by AI testing firm Giskard, instructing LLMs to be concise of their solutions can negatively have an effect on factuality and worsen hallucinations.

“This impact appears to happen as a result of efficient rebuttals typically require longer explanations,” it mentioned. “When compelled to be concise, fashions face an unattainable selection between fabricating quick however inaccurate solutions or showing unhelpful by rejecting the query completely.”

- Advertisment -

GitLab Duo Vulnerability Enabled Attackers to Hijack AI Responses with Hidden Prompts

NVIDIA confirms GeForce NOW data breach affecting Armenian customers

Trellix supply code breach claimed by RansomHouse hackers

Zara data breach uncovered private data of 197,000 folks

LEAVE A REPLY Cancel reply

Most Popular

Angriffe auf npm-Lieferkette gefährden Entwicklungsumgebungen

PixieFail flaws affect PXE community boot in enterprise techniques

PixieFail UEFI Flaws Expose Tens of millions of Computer systems to RCE, DoS, and Data Theft

New Marvin assault revives 25-year-old decryption flaw in RSA

1000’s of Juniper gadgets susceptible to unauthenticated RCE flaw

Why Instagram Threads is a hotbed of dangers for companies

Phishing Campaigns Ship New SideTwist Backdoor and Agent Tesla Variant

EDITOR PICKS

Intel- und AMD-Chips physisch angreifbar

Hackers flip bossware towards the bosses

Vital GitHub RCE bug uncovered thousands and thousands of repositories

POPULAR News

Angriffe auf npm-Lieferkette gefährden Entwicklungsumgebungen

PixieFail flaws affect PXE community boot in enterprise techniques

PixieFail UEFI Flaws Expose Tens of millions of Computer systems to RCE, DoS, and Data Theft

POPULAR TAGS

POPULAR Tags

POPULAR Tags

ABOUT US

FOLLOW US