Cognition Reveals Devin the World’s First Fully Autonomous AI Software Engineer

Mukund Kapoor
By Mukund Kapoor - Author 3 Min Read
3 Min Read

US-based startup's AI-powered tool can solve engineering tasks using its own shell, code editor, and web browser

In Short
  • Devin can build and deploy apps, find and fix bugs in codebases, and fine-tune AI models
  • The AI agent correctly resolves 13.86% of issues on the SWE-bench platform, outperforming GPT4 and Claude 2
  • Devin is currently available as an early access service to individuals who wish to use it for engineering work

March 17th, 2024: US-based startup Cognition introduced Devin, an AI-powered tool the company claims is the “world’s first fully autonomous AI software engineer.”

Devin is designed to solve engineering tasks independently using its own shell, code editor, and web browser.

devin ai
Devin AI fixing GitHub bugs autonomously

According to demonstrations provided by Cognition, Devin can utilize its web browser to access and learn from API documentation, enabling it to plug into various APIs.

When the AI agent encounters an error, it automatically adds a debugging print statement to the main code within its code editor interface and reruns the code.

Cognition has showcased Devin’s capabilities in building and deploying apps, identifying and fixing bugs in codebases, and even fine-tuning AI models.

To assess Devin’s accuracy, Cognition tested the AI agent on SWE-bench, a benchmarking platform that challenges agents to resolve real-world issues found in open-source projects on GitHub.

Devin successfully resolved 13.86% of the issues end-to-end, surpassing the performance of GPT4 (1.74%) and the previous best score held by Anthropic’s Claude 2 (4.80%).

Notably, Devin achieved this without assistance in locating the relevant files within the repository.

While Microsoft offers AI-powered developer tools like GitHub Copilot, which provides code completion and assistive features for programmers, it cannot complete codes end-to-end without human interference or assistance.

In contrast, Devin is capable of autonomously completing coding tasks.

Cognition is currently offering early access to Devin for businesses who wish to utilize the AI agent for engineering work. Interested customers can request early access through the company’s website.

With its impressive performance on the SWE-bench platform and its ability to operate independently, Devin represents a significant step forward in the development of AI-powered software engineering solutions.

SOURCES:Cognition

Disclaimer

Based on our quality standards, we deliver this website’s content transparently. Our goal is to give readers accurate and complete information. Check our News section for latest news. To stay in the loop with our latest posts follow us on Facebook, Twitter and Instagram. 

Subscribe to our Daily Newsletter to join our growing community and if you wish to share feedback or have any inquiries, please feel free to Contact Us. If you want to know more about us, check out our Disclaimer, and Editorial Policy.

By Mukund Kapoor Author
Follow:
Mukund Kapoor, the enthusiastic author and creator of GreatAIPrompts, is driven by his passion for all things AI. With a special knack for simplifying complex AI concepts, he's committed to helping readers of all levels - be it beginners or experts - navigate the intriguing world of artificial intelligence. Through GreatAIPrompts, Mukund ensures that readers always have access to the most recent and relevant AI news, tools, and insights. His dedication to quality, accuracy, and clarity is what sets his blog apart, making it a reliable go-to source for anyone interested in unlocking the potential of AI. For more information visit Author Bio.
Leave a comment

Leave a Reply

Your email address will not be published. Required fields are marked *