HONOR Robot Phone: A Moving AI Camera or Just Another Smartphone Gimmick?

Smartphones today feel more familiar than new. Each year brings better performance and better cameras, but fewer real surprises. So when a company unveils something called a “Robot Phone”, it’s bound to get attention.

HONOR did exactly that at the Mobile World Congress (MWC) in Barcelona this year. While most smartphone brands are focused on software upgrades, HONOR is trying something different with hardware. Its Robot Phone is built to move and adjust on its own. The camera sits on a motorized system that can tilt, track motion and shift angles automatically. It almost looks like a small robotic head, following whatever is happening in front of it. It can pick up sound, recognize motion and stay visually aware of its surroundings. This result feels less like using a regular phone and more like interacting with something responsive.

So what makes HONOR’s Robot Phone different from the smartphones we already use? Here’s a closer look at its camera system, AI features and design, and whether it is truly something new or simply smart marketing.

‍

What does the HONOR Robot Phone do?

At its core, the Robot Phone still works like a regular smartphone. What makes it different is the camera system. It has a 200MP camera that sits on a motorized arm with a three-axis gimbal, which extends when in use and folds back into the phone when not needed. The compact motor gives the camera physical movement, while motion control allows it to sense, track and follow a person or object in real time. That means it can keep a subject in frame without constant manual adjustment.

The camera also adds a more playful side to the experience. It can respond with simple gestures, such as nodding or shaking its head, and it can even move in sync with music.

This setup could be particularly useful for content creators. As CNET tech journalist and YouTuber Andrew Lanxon pointed out, it removes the need to carry a separate gimbal. Since the robotic camera module can easily fold into the body of the phone, it is easier to carry around and more convenient for filming or taking photos on the go.

The Robot Phone also has the practical advantage of a smartphone display. It gives users a bigger screen than a standalone camera for framing, monitoring and reviewing footage. Since it runs on Android, the process of recording, editing and sharing content is also more direct.

‍

The Robot Phone’s Design: How the moving camera fits inside

The most impressive part of the HONOR Robot Phone design is how it fits a moving camera system into the body of a smartphone without needing external attachments.

To make this possible, HONOR uses a custom micro motor that is 70% smaller than mainstream competitors. The company also says it is the industry’s smallest four-degrees-of-freedom (4DoF) gimbal system. To support the stable movement of the camera module, the internal structure uses high-strength materials such as steel and titanium alloy. These materials help the mechanism stay durable as it shifts and repositions over time.

Battery life is another obvious question. HONOR has not revealed the battery capacity of the Robot Phone itself, but it did showcase its Silicon-Carbon Blade Battery technology at MWC 2026. The company says this battery is designed to increase energy density while keeping devices slim, and that it could support capacities of 7,000 mAh and beyond in future foldable devices.

That is not specific to the Robot Phone, but it does hint at the kind of battery improvements that may be needed for smartphones with moving parts and more advanced camera systems.

‍

The AI features of the Robot Phone

The AI features in Honor’s Robot Phone are focused on how the device sees and responds to its surroundings in real time. At the most basic level, the phone can track what is happening in a scene and adjust itself without constant user input.

On the functional side, the system keeps subjects framed and in focus automatically. Its AI Object Tracking ensures subjects stay centred, while AI SpinShot enables controlled 90° and 180° rotations for smoother transitions, even when the phone is used one-handed. It can also detect motion and recognize sound, which lets it respond to activity as it happens instead of reacting frame by frame.

The AI becomes more noticeable in the way the device behaves. When activated, the camera module unfolds and the screen displays a pair of animated eyes that track the user’s face and gaze. Honor calls this “embodied AI”, meaning the assistant expresses itself through movement rather than only voice or text. The camera module can adjust its angle during video calls, which makes it feel a little more physically present.

According to Thomas Bai, AI product expert at Honor, the goal is to move beyond passive assistance. By combining sensing, movement and real-time processing, the device is designed to interact with its environment in a more continuous way. In practice, that could mean interpreting its surroundings and responding as situations change, such as when someone is moving through an unfamiliar space.

‍

The gaps beneath the hype

The Robot Phone has sparked curiosity, but there is still a lot we do not know. For one thing, it is still a prototype, with a release expected later this year. Early signs also suggest it may be expensive, partly because of rising memory chip costs. Some of its more playful features also feel uncertain. In demos, the phone can move along to music, but with only a handful of pre-set tracks, it is hard to tell whether that feature will be genuinely useful or remain more of a showcase moment.

Then there are the practical questions. A motorized camera system could make the phone heavier and more top-heavy, which may affect comfort during daily use. Running a motor alongside continuous AI tracking will also likely put pressure on battery life. These are not dealbreakers, but they are trade-offs that will matter outside of a demo.

Privacy is another concern that is hard to overlook. Some of the AI features rely on cloud processing, which means certain data is sent to external servers instead of being processed fully on the device. That is common in many AI systems today, but it feels more significant here because the phone is built to actively track movement and reposition its camera in real time. For some people, that level of autonomy may feel intrusive rather than helpful. It also raises bigger questions about what sensors are built into the device and how much data they collect during everyday use.

‍

Final verdict: Is the HONOR Robot Phone worth paying attention to?

So, is the HONOR Robot Phone a real step forward, or just a clever idea packaged well?

The answer depends on who it is for.

For content creators, the appeal is obvious. Early indications suggest it could make video capture easier by reducing the need for extra gear. Honor’s collaboration with cinema camera company ARRI also suggests a serious push toward more cinematic smartphone footage.

For everyone else, the value is less clear. Outside of content creation, it is still hard to see how these features would translate into everyday use in a meaningful way.

For now, the Robot Phone sits somewhere between promise and experiment. Whether it turns into a genuinely useful new kind of smartphone or fades away as a novelty will only become clear once it moves beyond controlled demos and into real life.

Keep Reading

Artificial Intelligence

Is LLMs the Future? The Great AI Schism Among Scientists

Brains, bots and the future: Who’s really in control?

Dr. Kelvin Chi Chiu So, Mr. Su Tan

Updated

January 8, 2026 6:32 PM

Adoration and disdain, the polarised reactions for generative AI. ILLUSTRATION: YORKE YU

When British-Canadian cognitive psychologist and computer scientist Geoffrey Hinton joked that his ex-girlfriend once used ChatGPT to help her break up with him, he wasn’t exaggerating. The father of deep learning was pointing to something stranger: how machines built to mimic language have begun to mimic thought — and how even their creators no longer agree on what that means.

In that one quip — part humor, part unease — Hinton captured the paradox at the center of the world’s most important scientific divide. Artificial intelligence has moved beyond code and circuits into the realm of psychology, economics and even philosophy. Yet among those who know it best, the question has turned unexpectedly existential: what, if anything, do large language models truly understand?

Across the world’s AI labs, that question has split the community into two camps — believers and skeptics, prophets and heretics. One side sees systems like ChatGPT, Claude, and Gemini as the dawn of a new cognitive age. The other insists they’re clever parrots with no grasp of meaning, destined to plateau as soon as the data runs out. Between them stands a trillion-dollar industry built on both conviction and uncertainty.

Hinton, who spent a decade at Google refining the very neural networks that now power generative AI, has lately sounded like a man haunted by his own invention. Speaking to Scott Pelley on the CBS 60 Minutes interview aired October 8, 2023, Hinton said, “I think we're moving into a period when for the first time ever we may have things more intelligent than us.” . He said it not with triumph, but with visible worry.

Yoshua Bengio, his longtime collaborator, sees it differently. Speaking at the All In conference in Montreal, he told TIME that future AI systems "will have stronger and stronger reasoning abilities, more and more knowledge," while cautioning about ensuring they "act according to our norms". And then there’s Gary Marcus, the cognitive scientist and enduring critic, who dismisses the hype outright: “These systems don’t understand the world. They just predict the next word.”

It’s a rare moment in science when three pioneers of the same field disagree so completely — not about ethics or funding, but about the very nature of progress. And yet that disagreement now shapes how the future of AI will unfold.

In the span of just two years, large language models have gone from research curiosities to corporate cornerstones. Banks use them to summarize reports. Lawyers draft contracts with them. Pharmaceutical firms explore protein structures through them. Silicon Valley is betting that scaling these models — training them on ever-larger datasets with ever-denser computers — will eventually yield something approaching reasoning, maybe even intelligence.

It’s the “bigger is smarter” philosophy, and it has worked — so far. OpenAI’s GPT-4, Anthropic’s Claude, and Google’s Gemini have grown exponentially in capability . They can write code, explain math, outline business plans, even simulate empathy. For most users, the line between prediction and understanding has already blurred beyond meaning. Kelvin So, who is now conducting AI research in PolyU SPEED, commented , “AI scientists today are inclined to believe we have learnt a bitter lesson in the advancement from the traditional AI to the current LLM paradigm. That said, scaling law, instead of human-crafted complicated rules, is the ultimate law governing AI.”

But inside the labs, cracks are showing. Scaling models have become staggeringly expensive, and the returns are diminishing. A growing number of researchers suspect that raw scale alone cannot unlock true comprehension — that these systems are learning syntax, not semantics; imitation, not insight.

That belief fuels a quiet counter-revolution. Instead of simply piling on data and GPUs, some researchers are pursuing hybrid intelligence — systems that combine statistical learning with symbolic reasoning, causal inference, or embodied interaction with the physical world. The idea is that intelligence requires grounding — an understanding of cause, consequence, and context that no amount of text prediction can supply.

Yet the results speak for themselves. In practice, language models are already transforming industries faster than regulation can keep up. Marketing departments run on them. Customer support, logistics and finance teams depend on them. Even scientists now use them to generate hypotheses, debug code and summarize literature. For every cautionary voice, there are a dozen entrepreneurs who see this technology as a force reshaping every industry. That gap — between what these models actually are and what we hope they might become — defines this moment. It’s a time of awe and unease, where progress races ahead even as understanding lags behind.

Part of the confusion stems from how these systems work. A large language model doesn’t store facts like a database. It predicts what word is most likely to come next in a sequence, based on patterns in vast amounts of text. Behind this seemingly simple prediction mechanism lies a sophisticated architecture. The tokenizer is one of the key innovations behind modern language models. It takes text and chops it into smaller, manageable pieces the AI can understand. These pieces are then turned into numbers, giving the model a way to “read” human language. By doing this, the system can spot context and relationships between words — the building blocks of comprehension.

Inside the model, mechanisms such as multi-head attention enable the system to examine many aspects of information simultaneously, much as a human reader might track several storylines at once.

Reinforcement learning, pioneered by Richard Sutton, a professor of computing science at the University of Alberta, and Andrew Barto, Professor Emeritus at the University of Massachusetts, mimics human trial-and-error learning. The AI develops “value functions” that predict the long-term rewards of its actions. Together, these technologies enable machines to recognize patterns, make predictions and generate text that feels strikingly human — yet beneath this technical progress lies the very divide that cuts to the heart of how intelligence itself is defined.

This placement works well because it elaborates on the technical foundations after the article introduces the basic concept of how language models work, and before it transitions to discussing the emergent behaviors and the “black box problem.”

Yet at scale, that simple process begins to yield emergent behavior — reasoning, problem-solving, even flashes of creativity that surprise their creators. The result is something that looks, sounds and increasingly acts intelligent — even if no one can explain exactly why.

That opacity worries not just philosophers, but engineers. The “black box problem” — our inability to interpret how neural networks make decisions — has turned into a scientific and safety concern. If we can’t explain a model’s reasoning, can we trust it in critical systems like healthcare or defense?

Companies like Anthropic are trying to address that with “constitutional AI,” embedding human-written principles into model training to guide behavior. Others, like OpenAI, are experimenting with internal oversight teams and adversarial testing to catch dangerous or misleading outputs. But no approach yet offers real transparency. We’re effectively steering a ship whose navigation system we don’t fully understand. “We need governance frameworks that evolve as quickly as AI itself,” says Felix Cheung, Founding Chairman of RegTech Association of Hong Kong (RTAHK). “Technical safeguards alone aren't enough — transparent monitoring and clear accountability must become industry standards.”

Meanwhile, the commercial race is accelerating. Venture capital is flowing into AI startups at record speed. OpenAI’s valuation reportedly exceeds US$150 billion; Anthropic, backed by Amazon and Google, isn’t far behind. The bet is simple: that generative AI will become as indispensable to modern life as the internet itself.

And yet, not everyone is buying into that vision. The open-source movement — championed by players like Meta’s Llama, Mistral in France, and a fast-growing constellation of independent labs — argues that democratizing access is the only way to ensure both innovation and accountability. If powerful AI remains locked behind corporate walls, they warn, progress will narrow to the priorities of a few firms.

But openness cuts both ways. Publicly available models are harder to police, and their misuse — from disinformation to deepfakes — grows as easily as innovation does. Regulators are scrambling to balance risk and reward. The European Union’s AI Act is the world’s most comprehensive attempt at governance, but even it struggles to define where to draw the line between creativity and control.

This isn’t just a scientific argument anymore. It’s a geopolitical one. The United States, China, and Europe are each pursuing distinct AI strategies: Washington betting on private-sector dominance, Beijing on state-led scaling, Brussels on regulation and ethics. Behind the headlines, compute power is becoming a form of soft power. Whoever controls access to the chips, data, and infrastructure that fuel AI will control much of the digital economy.

That reality is forcing some uncomfortable math. Training frontier models already consumes energy on the scale of small nations. Data centers now rise next to hydroelectric dams and nuclear plants. Efficiency — once a technical concern — has become an economic and environmental one. As demand grows, so does the incentive to build smaller, smarter, more efficient systems. The industry’s next leap may not come from scale at all, but from constraint.

For all the noise, one truth keeps resurfacing: large language models are tools, not oracles. Their intelligence — if we can call it that — is borrowed from ours. They are trained on human text, human logic, human error. Every time a model surprises us with insight, it is, in a sense, holding up a mirror to collective intelligence.

That’s what makes this schism so fascinating. It’s not really about machines. It’s about what we believe intelligence is — pattern or principle, simulation or soul. For believers like Bengio, intelligence may simply be prediction done right. For critics like Marcus, that’s a category mistake: true understanding requires grounding in the real world, something no model trained on text can ever achieve.

The public, meanwhile, is less interested in metaphysics. To most users, these systems work — and that’s enough. They write emails, plan trips, debug spreadsheets, summarize meetings. Whether they “understand” or not feels academic. But for the scientists, that distinction remains critical, because it determines where AI might ultimately lead.

Even inside the companies building them, that tension shows OpenAI’s Sam Altman has hinted that scaling can’t continue forever. At some point, new architectures — possibly combining logic, memory, or embodied data — will be needed. DeepMind’s Demis Hassabis says something similar: intelligence, he argues, will come not just from prediction, but from interaction with the world.

It’s possible both are right. The future of AI may belong to hybrid systems — part statistical, part symbolic — that can reason across multiple modes of information: text, image, sound, action. The line between model and agent is already blurring, as LLMs gain the ability to browse the web, run code, and call external tools. The next generation won’t just answer questions; it will perform tasks.

For startups, the opportunity — and the risk — lies in that transition. The most valuable companies in this new era may not be those that build the biggest models, but those that build useful ones: specialized systems tuned for medicine, law, logistics, or finance, where reliability matters more than raw capability. The winners will understand that scale is a means, not an end.

And for society, the challenge is to decide what kind of intelligence we want to live with. If we treat these models as collaborators — imperfect, explainable, constrained — they could amplify human potential on a scale unseen since the printing press. If we chase the illusion of autonomy, they could just as easily entrench bias, confusion, and dependency.

The debate over large language models will not end in a lab. It will play out in courts, classrooms, boardrooms, and living rooms — anywhere humans and machines learn to share the same cognitive space. Whether we call that cooperation or competition will depend on how we design, deploy, and, ultimately, define these tools.

Perhaps Hinton’s offhand remark about being psychoanalyzed by his own creation wasn’t just a joke. It was an omen. AI is no longer something we use; it’s something we’re reflected in. Every model trained on our words becomes a record of who we are — our reasoning, our prejudices, our brilliance, our contradictions. The schism among scientists mirrors the one within ourselves: fascination colliding with fear, ambition tempered by doubt.

In the end, the question isn’t whether LLMs are the future. It’s whether we are ready for a future built in their image.

Corporate Innovation