The 'strawberrry' downside: overcome AI's limitations

Be part of our day by day and weekly newsletters for the newest updates and unique content material on industry-leading AI protection. Be taught Extra

By now, giant language fashions (LLMs) like ChatGPT and Claude have turn into an on a regular basis phrase throughout the globe. Many individuals have began worrying that AI is coming for his or her jobs, so it’s ironic to see nearly all LLM-based methods flounder at a simple activity: Counting the variety of “r”s within the phrase “strawberry.” They aren’t completely failing on the alphabet “r”; different examples embody counting “m”s in “mammal”, and “p”s in “hippopotamus.” On this article, I’ll break down the explanation for these failures and supply a easy workaround.

LLMs are highly effective AI methods educated on huge quantities of textual content to know and generate human-like language. They excel at duties like answering questions, translating languages, summarizing content material and even producing inventive writing by predicting and setting up coherent responses primarily based on the enter they obtain. LLMs are designed to acknowledge patterns in textual content, which permits them to deal with a variety of language-related duties with spectacular accuracy.

Regardless of their prowess, failing at counting the variety of “r”s within the phrase “strawberry” is a reminder that LLMs should not able to “thinking” like people. They don’t course of the knowledge we feed them like a human would.

*Dialog with ChatGPT and Claude in regards to the variety of “r”s in strawberry.*

Virtually all the present excessive efficiency LLMs are constructed on transformers. This deep studying structure doesn’t instantly ingest textual content as their enter. They use a course of referred to as tokenization, which transforms the textual content into numerical representations, or tokens. Some tokens could be full phrases (like “monkey”), whereas others might be elements of a phrase (like “mon” and “key”). Every token is sort of a code that the mannequin understands. By breaking every part down into tokens, the mannequin can higher predict the following token in a sentence.

LLMs don’t memorize phrases; they attempt to perceive how these tokens match collectively in numerous methods, making them good at guessing what comes subsequent. Within the case of the phrase “hippopotamus,” the mannequin may see the tokens of letters “hip,” “pop,” “o” and “tamus”, and never know that the phrase “hippopotamus” is manufactured from the letters — “h”, “i”, “p”, “p”, “o”, “p”, “o”, “t”, “a”, “m”, “u”, “s”.

A mannequin structure that may instantly have a look at particular person letters with out tokenizing them might probably not have this downside, however for immediately’s transformer architectures, it’s not computationally possible.

Additional, taking a look at how LLMs generate output textual content: They predict what the following phrase can be primarily based on the earlier enter and output tokens. Whereas this works for producing contextually conscious human-like textual content, it’s not appropriate for easy duties like counting letters. When requested to reply the variety of “r”s within the phrase “strawberry”, LLMs are purely predicting the reply primarily based on the construction of the enter sentence.

Right here’s a workaround

Whereas LLMs won’t have the ability to “think” or logically cause, they’re adept at understanding structured textual content. A splendid instance of structured textual content is pc code, of many many programming languages. If we ask ChatGPT to make use of Python to rely the variety of “r”s in “strawberry”, it’s going to most probably get the right reply. When there’s a want for LLMs to do counting or every other activity that will require logical reasoning or arithmetic computation, the broader software program may be designed such that the prompts embody asking the LLM to make use of a programming language to course of the enter question.

Conclusion

A easy letter counting experiment exposes a elementary limitation of LLMs like ChatGPT and Claude. Regardless of their spectacular capabilities in producing human-like textual content, writing code and answering any query thrown at them, these AI fashions can not but “think” like a human. The experiment reveals the fashions for what they’re, sample matching predictive algorithms, and never “intelligence” able to understanding or reasoning. Nevertheless, having a previous data of what sort of prompts work nicely can alleviate the issue to some extent. As the mixing of AI in our lives will increase, recognizing its limitations is essential for accountable utilization and practical expectations of those fashions.

Chinmay Jog is a senior machine studying engineer at Pangiam.

DataDecisionMakers

Welcome to the VentureBeat group!

DataDecisionMakers is the place specialists, together with the technical folks doing knowledge work, can share data-related insights and innovation.

If you wish to examine cutting-edge concepts and up-to-date info, finest practices, and the way forward for knowledge and knowledge tech, be part of us at DataDecisionMakers.

You may even think about contributing an article of your individual!

Learn Extra From DataDecisionMakers

The ‘strawberrry’ downside: overcome AI’s limitations

Right here’s a workaround

Conclusion

The Pandemic Did Not Have an effect on The Moon After All, Scientists Say : ScienceAlert

Tremendous League 2025: Salford Purple Devils nonetheless focusing on play-offs in new season regardless of monetary difficulties | Rugby League Information

Hugging Face brings ‘Pi-Zero’ to LeRobot, making AI-powered robots simpler to construct and deploy

Javier Milei’s quest to defuse Argentina’s forex management bomb

Wonderful plesiosaur fossil preserves its pores and skin and scales

Related articles

Hugging Face brings ‘Pi-Zero’ to LeRobot, making AI-powered robots simpler to construct and deploy

Pour one out for Cruise and why autonomous car check miles dropped 50%

Anker’s newest charger and energy financial institution are again on sale for record-low costs

GitHub Copilot previews agent mode as marketplace for agentic AI coding instruments accelerates

Follow us

Company

Latest news

Six Nations 2025: Eire make two modifications as Peter O’Mahony, Robbie Henshaw return for Scotland Take a look at | Rugby Union Information

The Pandemic Did Not Have an effect on The Moon After All, Scientists Say : ScienceAlert

Tremendous League 2025: Salford Purple Devils nonetheless focusing on play-offs in new season regardless of monetary difficulties | Rugby League Information

Popular news

Arne Slot desires £50m-rated Atalanta midfielder Teun Koopmeiners as first Liverpool signing – Paper Speak | Soccer Information

Why are there so many rogue planets and what do they appear like?

Digital Nomad Information to Dwelling in Dubrovnik, Croatia