Does AI Know the Similarity Between Cars and Planes? The Birth of AI That Sees the World Like Humans

AI Summary

The journey has begun to create smarter and safer AI by teaching 'human vision' to systems that are brilliant at object recognition but fail at abstract concepts.

Imagine for a moment. You have a state-of-the-art Artificial Intelligence (AI) in front of you. This AI is a true “car expert” that can identify the brand and model of hundreds of cars in existence in just one second. However, when you ask this smart AI, “What is the similarity between a car and a plane?”, it either fails to answer or says something completely nonsensical. To us humans, the common sense that “both are large means of transportation made of metal” is a given, but for this AI, it is the most difficult problem in the world. Teaching AI to see the world more like we do - deepmind.google

This is the massive wall that today’s AI faces, the so-called ‘Perception Gap.’ Teaching AI to See the World Through Human Eyes: Bridging the … Simply put, AI is like a memory genius that has memorized tens of thousands of books cover to cover, but has no idea how the content in those books relates to our lives. On the surface, it seems much smarter than humans, but its way of looking at the world is so different from ours that it occasionally makes absurd mistakes. Recently, however, scientists around the world have begun teaching AI “human eyes” and “human common sense” to bridge this gap.

Why It Matters

You might ask, “If AI can identify car models well, why is it so important for it to know they are similar to planes?” However, this issue goes beyond simply answering a quiz; it is directly linked to the safety of the AI we use every day.

Current AI is extremely smart but simultaneously has a fatal weakness: it is unpredictable. [World models: 10 Things That Matter in AI Right Now

MIT Technology Review](https://www.technologyreview.com/2026/04/21/1135650/world-models-ai-artificial-intelligence/) This is because while its ability to identify patterns for recognizing and classifying objects surpasses humans, it fails to understand the deep relationships or abstract concepts contained within them. Teaching AI to See the World Through Human Eyes: Bridging the …

Let’s use an analogy. Suppose a self-driving car encounters an ‘empty cardboard box’ on the road. A human driver makes a common-sense judgment: “That’s light paper, so it’s safe to just drive over it,” or “I don’t know what’s inside, so let’s avoid it.” But if AI only recognizes this as a ‘rectangular data pattern,’ it might fail to distinguish whether a box of a form it hasn’t seen before is a rock or paper, risking an accident.

Therefore, the work of ‘Aligning’ AI with human knowledge systems is the key to making AI Robust—unshakable in any situation—and allowing it to Generalize effectively to new, untaught situations. Teaching AI to See the World More Like Humans Do — Google DeepMind

Understanding Easily: Three Ways to Teach AI ‘Common Sense’

Scientists are using three innovative strategies to transplant human visual systems into AI.

1. Running ‘Simulations’ in the Mind: World Models

When you wake up in the morning, you can vividly imagine where the bathroom is or what the hallway will look like when you open the front door, even with your eyes closed. This is because there is a ‘map’ or ‘operating principle’ of how the world works inside our heads.

Giving AI this kind of imagination is exactly what ‘World Models’ do. World Models: Teaching AI to Think Like Humans - LinkedIn This involves building an internal system where the AI doesn’t just store its surroundings like taking a photo, but predicts how the environment will change. World Models: Teaching AI to Think Like Humans - LinkedIn It essentially gains the ability to run a simulation in its head, thinking, “If I push this cup, it will fall to the floor and break, right?”

2. Replicating the Brain’s Filter: Lp-Convolution

Our brains have very efficient filters that pick out only the important parts from a vast amount of visual information. Recently, a joint research team from Yonsei University, the Institute for Basic Science (IBS), and the Max Planck Institute in Germany introduced a technology called ‘Lp-Convolution’ that helps computers process images more similarly to the human brain. AI Horizons: Teaching computers to view the world like humans do

To use an analogy, it’s like putting ‘special glasses’ on the AI that the human eye and brain use when looking at the world. With these glasses, the AI can process information by prioritizing the outlines or three-dimensionality of objects that humans consider important, enabling much more natural recognition.

3. Perception Learned Through Games: Brown University Research

Researchers at Brown University in the US are educating AI in a very interesting way: teaching it to perceive like a human through ‘games.’ Researchers are teaching AI to see more like humans - MSN Just as a young child learns the laws of physics by playing with blocks, AI builds visual logic similar to humans by touching and moving various objects within a virtual game world. Training AI to see more like humans - National Science Foundation

Where We Stand

At this very moment, Google DeepMind is accelerating its research, publishing in-depth study results in the international journal ‘Nature’ that analyze the differences in how AI and humans organize visual information. Teaching AI to See the World More Like Humans Do — Google DeepMind

However, to be honest, there is still a long way to go. Current AI is brilliant at recognizing objects individually, but it often misses the ‘invisible relationships between objects’ that humans grasp very naturally. Teaching AI to See the World Through Human Eyes: Bridging the … The reason we occasionally feel “something is off” when reading text written by AI is that the patterns AI creates are still far from the natural common-sense systems of humans. AIDetector - AdvancedAIChecker for ChatGPT, GPT-5 & Gemini

What’s Next

What kind of future will unfold if AI truly sees the world like a human?

Experts predict that by around 2050, ‘AI teachers’ or robots could emerge with the ability to manipulate matter at the atomic level and see objects perfectly even in the dark. Technology in 2050 - experts give their predictions Beyond being machines that simply pour out knowledge, it might become possible for AI to act as true ‘mentors’ that understand, empathize with, and teach students from their level.

While we are currently teaching AI about the world by labeling data one by one (Data Annotation), [DataAnnotation

Future-Proof Your Career WithAITraining Work](https://www.dataannotation.tech/) before long, AI will see the world through the same eyes as us and become a reliable partner in solving complex problems like the climate crisis or curing incurable diseases.

MindTickleBytes’ AI Reporter’s Perspective

Until now, we have been obsessed only with ‘quantity’—how much data AI processes. However, these studies remind us that ‘how one sees’ is more important than ‘how much one knows.’ AI that learns human visual patterns is evolving beyond mere performance improvements into a ‘safe companion’ that shares human values and common sense. That small ability to know the commonality between a car and a plane might just lead us to a safer and warmer technological future.

References

Teaching AI to see the world more like we do - deepmind.google
Training AI to see more like humans - National Science Foundation
Teaching AI to See the World More Like Humans Do — Google DeepMind
Researchers are teaching AI to see more like humans - MSN
AI Horizons: Teaching computers to view the world like humans do
Teaching AI to See the World Through Human Eyes: Bridging the …
World Models: Teaching AI to Think Like Humans - LinkedIn
Technology in 2050 - experts give their predictions
[DataAnnotation Future-Proof Your Career WithAITraining Work](https://www.dataannotation.tech/)
AIDetector - AdvancedAIChecker for ChatGPT, GPT-5 & Gemini

[World models: 10 Things That Matter in AI Right Now

MIT Technology Review](https://www.technologyreview.com/2026/04/21/1135650/world-models-ai-artificial-intelligence/)

FACT-CHECK SUMMARY

Claims checked: 14
Claims verified: 13
Verdict: PASS

Share this article:

Test Your Understanding

Q1. What do current AI systems often miss even while identifying hundreds of car models?

Accurate engine power of the car
The commonality that both cars and planes are metal vehicles
The brand name of the car tires

According to Google DeepMind, while AI is good at identifying individual objects, it struggles to grasp abstract commonalities or relationships, such as 'large vehicles made of metal.'

Q2. What is the 'mental simulation' AI creates to understand how its environment works?

Virtual Reality
Image Processing
World Models

World Models refer to systems where AI internally represents and simulates the operating principles of its environment.

Q3. What is the technology, developed with researchers from Yonsei University, that helps computers process images like the human brain?

Lp-Convolution
Data Annotation
Scientific Game

Lp-Convolution is a breakthrough technology that helps computers process images more similarly to the human brain.