A check of Anthropic’s restricted Claude Mythos mannequin discovered only one low-severity vulnerability within the extensively used open supply knowledge switch instrument curl, casting doubt on the AI firm’s daring claims, although some argue the outcomes say extra about curl’s sturdy security than Mythos’ limitations.
Daniel Stenberg, the lead developer of curl, revealed in a weblog submit on Monday that he was not too long ago given the chance to check the Claude Mythos frontier AI mannequin, which Anthropic claimed had recognized hundreds of zero-days within the weeks main as much as its launch.
Anthropic is providing Mythos solely to some dozen main organizations as a part of a restricted program attributable to considerations about potential misuse.
In the long run, Stenberg didn’t conduct the evaluation himself, nor did he have direct entry to the AI mannequin. As a substitute, a third-party examined curl utilizing Mythos and offered Stenberg with a report detailing the findings.
Mythos’ evaluation of curl’s 178,000 traces of code, based on the report offered to the developer, unearthed 5 ‘confirmed security vulnerabilities’. Nonetheless, a assessment of the findings confirmed that three of them have been identified points described in official documentation and one was a bug slightly than a security gap.
The one situation confirmed by the curl builders to be an precise vulnerability was assigned a low severity ranking and will probably be patched in late June.
Curl was beforehand analyzed with different AI instruments resembling Zeropath, AISLE, and OpenAI’s Codex, which helped establish 200-300 points, together with “a dozen or extra” confirmed vulnerabilities, based on Stenberg.
He admitted that AI-powered code evaluation instruments are “considerably higher” at discovering security holes in comparison with conventional instruments. Nonetheless, he believes — primarily based on the evaluation of curl — that Mythos isn’t as ‘harmful’ as Anthropic has described it.
“My private conclusion can nonetheless not find yourself with the rest than that the massive hype round this mannequin to date was primarily advertising,” Stenberg mentioned. “I see no proof that this setup finds points to any specific larger or extra superior diploma than the opposite instruments have completed earlier than Mythos. Possibly this mannequin is slightly bit higher, however even whether it is, it isn’t higher to a level that appears to make a major dent in code analyzing.”
Curl is current on billions of units, together with servers, telephones, and vehicles, making it a doubtlessly precious goal for menace actors. Nonetheless, exploiting curl vulnerabilities in the true world isn’t simple, and there aren’t any public experiences of any of the 188 CVEs assigned to this point getting used within the wild.
The controversy over Mythos’ efficiency
Stenberg’s weblog submit has been extensively debated on Hacker Information, Reddit, and LinkedIn.
Some members of the cybersecurity trade have identified that curl has been closely audited and examined, together with by different AI instruments, making it troublesome for main vulnerabilities to stay hidden.
They argue that Mythos’ restricted findings mirror the maturity and robustness of curl’s codebase, slightly than any shortcoming of the mannequin itself.
As well as, it has been highlighted that Mozilla has been very impressed with Mythos, which helped it uncover greater than 270 Firefox vulnerabilities.
Whereas the Firefox findings show Mythos to be extremely environment friendly, Mozilla famous that each one the vulnerabilities found by the AI might even have been discovered by elite human researchers. However, their fast discovery closes the hole between attacker detection and vendor patching.
Different trade members agree with Stenberg’s view and imagine that Mythos ought to have been capable of finding extra vulnerabilities if its developer’s claims have been true.
“I’ve a tough time believing that Mythos discovered the one remaining Curl vulnerability. It’s attainable, however extremely inconceivable,” one person commented.
Erik Cabetas of Embrace Safety famous that he spoke with a number of organizations which were given entry to Mythos they usually too reported outcomes much like curl.



