ChatGPT is failing at basic maths

Published in AI

by Nick Farrell on 10 August 2023

Getting old

New research has found that AI software ChatGPT is becoming worse at performing certain basic math operations.

Boffins at Stanford University and the University of California, Berkeley said the deterioration is an example of a phenomenon known to AI developers as drift, where attempts to improve one part of the enormously complex AI models make other parts perform worse.

They tested two versions of ChatGPT, version 3.5 and version 4.0, and the results are grim.

The boffins gave the chatbot a basic task: identify whether a particular number is prime. This is the sort of math problem that is complicated for people but simple for computers.

Whether a number is prime should be easy for a computer to evaluate: divide it by two, three, five and so on, and see whether anything divides it evenly.
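For readers who want to see how little work that check involves, here is a minimal sketch of trial division in Python. It is an illustration only, not the researchers' code: the function name is_prime is an assumption, and for simplicity it tries two and then every odd number up to the square root rather than only primes.

```python
import math


def is_prime(n: int) -> bool:
    """Return True if n is prime, using plain trial division.

    Illustrative sketch; the name and structure are assumptions,
    not anything taken from the study.
    """
    if n < 2:
        return False          # 0, 1 and negatives are not prime
    if n < 4:
        return True           # 2 and 3 are prime
    if n % 2 == 0:
        return False          # even numbers above 2 are not prime
    # Only odd candidates up to sqrt(n) need checking: any larger
    # factor would be paired with a smaller one already tried.
    for d in range(3, math.isqrt(n) + 1, 2):
        if n % d == 0:
            return False
    return True
```

Calling is_prime(97) returns True, while is_prime(91) returns False because 91 is 7 times 13.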

To track performance, the researchers fed ChatGPT 1,000 different numbers. In March, the premium GPT-4 correctly classified 84 per cent of the numbers as prime or not. By June, its success rate had dropped to 51 per cent. Across eight different tasks, GPT-4 became worse at six of them. GPT-3.5 improved on six measures but remained worse than its advanced sibling at most tasks.
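A success rate like those above is simply the share of numbers where the chatbot's yes-or-no answer matches the true answer. The sketch below shows one way such a score could be computed; it is an assumption about the general approach, not the study's actual harness. The names accuracy and always_yes are hypothetical, and always_yes is a stand-in for a chatbot that calls every number prime.

```python
import math


def is_prime(n: int) -> bool:
    """Ground truth via trial division (illustrative helper)."""
    if n < 2:
        return False
    for d in range(2, math.isqrt(n) + 1):
        if n % d == 0:
            return False
    return True


def accuracy(numbers, answer_fn) -> float:
    """Fraction of numbers where answer_fn agrees with is_prime.

    answer_fn is a hypothetical stand-in for whatever produces the
    chatbot's yes/no answer; it takes an int and returns a bool.
    """
    numbers = list(numbers)
    correct = sum(1 for n in numbers if answer_fn(n) == is_prime(n))
    return correct / len(numbers)


def always_yes(n: int) -> bool:
    """Hypothetical responder that claims every number is prime."""
    return True


# Toy sample, not data from the study: the integers 2 to 11.
sample = range(2, 12)
score = accuracy(sample, always_yes)
```

On this toy sample, five of the ten numbers are prime, so a responder that always says yes scores 0.5. That is why a success rate near 50 per cent can mean little more than guessing, depending on how many primes are in the test set.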

Last modified on 10 August 2023
