The best AI coding assistants fail at one in four tasks, revealing serious gaps between hype and actual performance reliability.


  • Report Finds AI Coding Assistants Regularly Fail One in Four Structured Output Tasks
  • Even advanced proprietary models only achieve about 75% accuracy.
  • Open source AI models perform less well, with an average reliability close to 65%.

The promise of artificial intelligence as a tireless coding assistant has hit a significant hurdle after new research claimed such tools can run into a range of problems.

A recent study from the University of Waterloo found that AI struggles with software development, with even the most advanced models failing at one in four structured output tasks.

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top