Researchers found that multimodal AI models cannot tell time. The more variation there was in the clock face, the more the chatbot being tested was likely to misread the clock.

Before AI Takes Our Jobs, Someone Better Teach It How to Tell Time

Am I the only one who didn’t know that AI cannot figure out time? I mean, every day, we hear all about generative AI “revolutionizing” everything and replacing everyone. Pretty genius little things. So imagine my shock when I learned that multimodal AI models cannot tell time. How did I know, you ask?

To start with, researchers at the University of Edinburgh recently found that multimodal large language models (MLLMs) like GPT-4o, GPT-o1, Gemini-2.0, and Claude 3.5-Sonnet ran into accuracy problems while reading a clock face.

Things got worse when they were tested with clocks designed with Roman numerals, a colored dial, or a decorative hour hand. Some of the clocks also had a hand that tracked seconds in addition to minutes and hours. In the face of those design touches, the AI models reportedly fell into further errors.

The discovery came out of a test of today’s top MLLMs, and it is almost hilarious that Gemini-2.0 performed the “best” with only 22.8% accuracy. GPT-4o and GPT-o1 reached exact-match accuracies of 8.6% and 4.84%, respectively.

Per the researchers, these models struggled with everything. Which hand is the hour hand? Which direction is it pointing? What angle corresponds to what time? What number is that? According to them, the more variation there was in the clock face, the more the chatbot being tested was likely to misread the clock.
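For the curious, the "what angle corresponds to what time" question comes down to very little arithmetic once you know which hand is which. Here is a minimal Python sketch of that arithmetic, not anything taken from the study itself; the function names `hand_angles` and `read_clock` are made up for illustration, and angles are assumed to be measured in degrees clockwise from the 12 o'clock position.

```python
def hand_angles(hour: int, minute: int) -> tuple[float, float]:
    """Return (hour_hand, minute_hand) angles in degrees clockwise from 12."""
    # The minute hand sweeps 360 degrees in 60 minutes: 6 degrees per minute.
    minute_angle = 6.0 * minute
    # The hour hand sweeps 360 degrees in 12 hours: 30 degrees per hour,
    # plus 0.5 degrees of drift for every minute past the hour.
    hour_angle = 30.0 * (hour % 12) + 0.5 * minute
    return hour_angle, minute_angle


def read_clock(hour_angle: float, minute_angle: float) -> tuple[int, int]:
    """Recover (hour, minute) on a 12-hour dial from the two hand angles."""
    minute = round(minute_angle / 6.0) % 60
    # Subtract the drift first, so 3:59 is not mistaken for 4:59.
    hour = round((hour_angle - 0.5 * minute) / 30.0) % 12
    return (12 if hour == 0 else hour), minute


print(hand_angles(3, 30))        # (105.0, 180.0)
print(read_clock(105.0, 180.0))  # (3, 30)
```

The arithmetic is the easy part. Going by the researchers' account, the models stumble earlier, on working out which hand is which and where it points.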

These are basic skills for people. Most six- or seven-year-olds can already tell time. But for these models, it might as well be the most complicated astrophysics.

After the clock fiasco, the researchers tested the bots on yearly calendars. You know, the ones with all twelve months on one page. GPT-o1 performed the “best” here, reaching 80% accuracy. But that still means that one out of every five answers was wrong, including simple questions like “Which day of the week is New Year’s Day?” If my child failed to get that right on a quiz, I would honestly be very worried.
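To show how little is being asked, here is a small Python sketch that answers the New Year's Day question with the standard library instead of a calendar image. The helper name `new_years_weekday` is hypothetical, and the year 2025 is only an example, since the article does not say which years the test calendars covered.

```python
from datetime import date

# date.weekday() returns 0 for Monday through 6 for Sunday.
WEEKDAYS = ("Monday", "Tuesday", "Wednesday", "Thursday",
            "Friday", "Saturday", "Sunday")


def new_years_weekday(year: int) -> str:
    """Return the weekday name of January 1 in the given year."""
    return WEEKDAYS[date(year, 1, 1).weekday()]


print(new_years_weekday(2025))  # Wednesday
```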

I never would have thought that AI models could ever get confused by a common calendar layout. But then, it is not very shocking to find out. It all still boils down to a long-standing gap in AI development. MLLMs only recognize patterns they have already seen, and clocks, calendars, or anything that requires spatial reasoning don’t fit into that.

Humans can look at a warped Dali clock and still figure out roughly what time it is meant to display. But AI models see a slightly thicker hour hand and kind of short-circuit.

Why This Matters

It is easy (almost satisfying) to laugh at ChatGPT, Gemini, and the other models for failing a task you learned when you were little and now do with ease. As someone who has been jilted by clients in favor of the free, if substandard, work these things offer, I admit I do find it really satisfying.

But as much as I want to just laugh it off, there is a more serious angle to this. These same MLLMs are being pushed into autonomous driving perception, medical imaging, robotics, and accessibility tools. They are being used for scheduling and automation as well as real-time decision-making systems.

Now, clock-reading errors are funny. But medical errors? Navigation errors? Even scheduling errors? Not so funny.

If a model cannot reliably read a clock, trusting it blindly in high-stakes environments is too risky a gamble for me. It just shows how far these systems still are from actual, grounded intelligence, and how much human common sense and nuance still matter. I am trying hard not to turn this into a human vs. AI case, and I certainly won’t use it to preach “Why I Hate AI and You Should Too.” But there is a problem that needs to be looked into.

As the study’s lead author, Rohit Saxena, put it, these weaknesses “must be addressed if AI systems are to be successfully integrated into time-sensitive real-world applications.”
