When AI Learns to 'Draw' Interfaces: How A2UI is Redefining Human-Computer Interaction
From text to interfaces, AI is undergoing a revolution in how it expresses itself. The A2UI protocol enables AI to directly generate interactive interfaces. How will this change the way we communicate with AI?

Have you ever wondered why, when we talk to AI, it can only "say" things but can't actually "do" anything?
This seemingly simple question reveals a fundamental limitation of current AI interaction—no matter how intelligent AI becomes, its means of expression remains stuck at the "text" level. The emergence of A2UI is breaking through this limitation.
Starting with an Everyday Scenario
Suppose you tell an AI: "Help me book a flight to Shanghai for tomorrow."
How does today's AI respond?
It gives you a wall of text: flight information, price comparisons, things to note... then tells you "please visit such-and-such website to complete your booking."
You need to:
- Read through all that text
- Copy the flight number
- Open another website
- Manually fill in your information
- Complete the payment
The AI says a lot, but it doesn't actually "do" anything.
This "talk but don't act" pattern exists in almost every interaction we have with AI. Whether you're asking AI to analyze data, fill out forms, or generate reports, you ultimately need to leave the conversation interface and complete the actual operation somewhere else.
AI is like a knowledgeable consultant who never lifts a finger—it can tell you how to do something, but it can't do it for you.
What If AI Could Directly 'Draw' the Interface?
Imagine a different scenario:
You say: "Help me book a flight to Shanghai for tomorrow."
AI directly displays a real booking interface right in the conversation:
- Three flight options in a clear card layout, each card showing airline, departure time, arrival time, and price
- Prices and times at a glance, with options to sort by price or time
- Click the "Select" button, and the card highlights
- A passenger information form appears where you can fill in details directly
- After confirmation, a payment interface appears
- Done
The entire process happens without leaving the conversation interface, without copying and pasting, without switching between multiple apps.
From "saying" to "doing" — this is what A2UI aims to achieve.
This isn't a scene from a sci-fi movie—it's the interaction paradigm defined by the A2UI protocol that Google released at the end of 2025. This technology is moving from concept to reality, and we've already implemented its first complete version.
What is A2UI?
A2UI, short for Agent-to-User Interface, is an open protocol officially released by Google in December 2025.
Simply put, it defines a set of standard rules that allow AI to output interactive interface elements, not just text.
Traditional AI Output vs A2UI Output
| Traditional AI | A2UI AI |
|---|---|
| Outputs plain text or Markdown | Outputs structured interface descriptions |
| Users must operate manually | Users can interact directly |
| Information displayed as linear text | Information displayed as visual components |
| Complex tasks require multiple conversations | Complete entire flows in one conversation |
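To make the contrast in the table concrete, here is a minimal sketch. The field names below are illustrative assumptions, not the official A2UI schema; the point is that the AI emits declarative data describing an interface instead of prose.

```typescript
// Traditional output: a string the user has to act on by hand.
const traditionalReply =
  "Flight MU5101 departs at 08:00 and costs ¥850. Please visit the airline's website to book.";

// A2UI-style output: declarative data describing interface components.
// Field names here are illustrative, not the official A2UI schema.
const a2uiReply = {
  component: "card",
  children: [
    { component: "text", value: "MU5101 · 08:00 → 10:15 · ¥850" },
    {
      component: "button",
      label: "Select",
      action: { type: "selectFlight", flightId: "MU5101" },
    },
  ],
};
```

A renderer that understands this structure can turn it into a clickable card, and the button's action flows back to the agent as data.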
Interface Elements Supported by A2UI
A2UI defines a set of interface elements that cover most application scenarios:
Basic Components
- Text: supports rich text formatting, heading levels, emphasis styles
- Buttons: configurable styles, icons, click actions
- Input fields: single-line, multi-line, password, search, and more
- Checkboxes and radio buttons: support grouping and linked selection
Layout Components
- Cards: containers for aggregating information, with header, content, and action areas
- Lists: vertical or horizontal arrangement, with pagination and lazy loading
- Tables: with sorting, filtering, and pagination
- Tabs: multi-view switching
Data Visualization
- Charts: line, bar, pie, scatter plots, and more
- Progress bars: linear or circular
- Stat cards: key metric displays
Media Components
- Images: with carousel, zoom, lazy loading
- Video and audio: embedded players
- Files: upload and download
Advanced Interactions
- Modal dialogs: confirmation, forms, detail displays
- Dropdown menus: selection and filtering
- Date pickers: single date or range selection
AI is no longer just a "text generator" — it becomes an "interface designer."
What Does This Mean for Different Roles?
For Regular Users: Goodbye to the "Copy-Paste" Era
Previously, talking to AI was like communicating with an assistant who could only talk but never act. You ask it questions, it gives you answers, but the answers are just text. You need to execute the operations described in that text yourself.
It's like asking someone "how do I get to the train station" — they give you detailed directions, but you still have to walk there yourself.
With A2UI, AI can directly help you:
Scenario 1: Filling Out Complex Forms
Before: AI tells you what information to fill in, you find the form, fill it out item by item.
Now: AI directly generates the form, you fill it out in the conversation, AI submits it for you.
Scenario 2: Data Queries
Before: AI tells you what the data is, you need to organize it yourself or export to Excel.
Now: AI directly generates charts and tables, you can click to filter, sort, and export.
Scenario 3: Booking Services
Before: AI tells you available time slots, you go to a calendar app to create an event.
Now: AI displays a calendar component directly, you click to select, booking complete.
Less copying and pasting, less switching between apps — everything happens within the conversation.
For Businesses: A Qualitative Leap in AI Customer Service
Traditional AI customer service has a fundamental problem: it can only answer questions, not solve problems.
When a user says "I want to return this item," traditional AI replies: "Please log into your account, go to the orders page, click the return button..." Then the user has to go do it themselves. If the user encounters any problems, they have to come back and ask again.
This isn't real service — it's just a talking instruction manual.
With A2UI, AI customer service can:
Directly Display Order Lists
User says "I want to return something," AI directly shows the user's recent orders in card format, each order showing product image, name, and purchase date. User clicks the one they want to return — no need to remember order numbers.
Generate Return Application Forms
After user selects an order, AI directly shows return reason options, refund method choices, and a supplementary notes input field. User fills it out and clicks submit.
Display Processing Progress in Real-Time
After submission, AI shows a progress bar: "Application Submitted → Merchant Review → Awaiting Return Shipment → Processing Refund." User can come back anytime to check progress.
From "Q&A bot" to "service terminal" — users don't need to leave the conversation interface to complete the entire business process.
For Developers: Lowering the Barrier to AI Application Development
Previously, to get AI to output structured content, developers needed to:
- Design complex prompts to get AI to output specific JSON formats
- Write parsing code to handle various exceptions in AI output
- Design frontend components to render this data
- Handle user interactions and feed results back to AI
- Debug constantly because AI output formats are often unstable
The entire process was full of uncertainty, with long development cycles and high maintenance costs.
With A2UI, developers can:
Use Standardized Protocols
A2UI defines clear data structures — what format AI outputs, what fields it includes, how to describe interactions — all clearly specified. No need to design and maintain proprietary formats.
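As a rough illustration of what such standardized structures might look like, here is a hedged TypeScript sketch. The type names and fields are assumptions made for this article, not the actual A2UI specification; consult the protocol documents for the real definitions.

```typescript
// Illustrative types only; the real A2UI spec defines its own names and fields.
type ComponentType = "text" | "button" | "input" | "card" | "list" | "table" | "chart";

interface UIAction {
  type: string;                      // e.g. "submitForm", "selectItem"
  payload?: Record<string, unknown>; // data sent back to the agent when triggered
}

interface UIComponent {
  component: ComponentType;          // which predefined component to render
  props?: Record<string, unknown>;   // component-specific settings (label, placeholder, ...)
  action?: UIAction;                 // what an interaction reports back
  children?: UIComponent[];          // nested components, e.g. buttons inside a card
}
```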
Use Ready-Made Renderers
Our open-source @zhama/a2ui provides a complete React renderer. Just pass AI output to the component, and interfaces render automatically with interactions handled.
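A minimal usage sketch might look like the code below. The import, component, and prop names (`A2UIRenderer`, `message`, `onAction`) are assumptions for illustration; the actual API of @zhama/a2ui may differ, so check its documentation.

```tsx
import React from "react";
// Assumed export name; the real package may expose a different API.
import { A2UIRenderer } from "@zhama/a2ui";

// Hypothetical helper that forwards user interactions back to the agent backend.
async function sendToAgent(action: unknown): Promise<void> {
  await fetch("/api/agent/action", {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify(action),
  });
}

// Renders one AI turn: the structured output becomes a live interface,
// and anything the user clicks or submits flows back to the agent as data.
export function ChatMessage({ aiOutput }: { aiOutput: unknown }) {
  return <A2UIRenderer message={aiOutput} onAction={sendToAgent} />;
}
```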
Focus on Business Logic
Developers no longer need to worry about "how to get AI to output the right format," "how to render this data," or "how to handle user clicks." These are all solved by the protocol and renderer. Developers only need to focus on: what the user wants to do and how AI should help.
Lower development barriers, faster AI application deployment. Work that previously took weeks might now take just days.
Real-World Applications
Intelligent Training Systems: Everyone Gets a Personal AI Tutor
The problem with traditional online courses is the "one-size-fits-all" approach — everyone watches the same videos, does the same exercises, follows the same sequence.
But everyone has different foundations, learning speeds, and interests. Someone with programming experience and someone with zero background clearly shouldn't learn the same content.
AI training systems powered by A2UI can achieve:
Dynamic Course Generation
AI first asks a few questions to understand your background and goals, then generates a personalized learning path. Not pulling from a question bank, but generating content in real-time based on your situation.
Interactive Case Studies
When learning business cases, AI doesn't just show text descriptions — it generates an interactive scenario:
- Displays the company's key metrics (charts)
- Provides several decision options (buttons)
- You click an option, AI shows the consequences of that decision
- You can backtrack, try different choices, understand the impact of each decision
This leaves a much deeper impression than simply reading a case description.
Real-Time Progress Tracking
AI generates your skill map, using radar charts or bar charts to show your mastery level in each knowledge area. You can click on a weak area, and AI immediately generates targeted exercises.
Intelligent Data Analysis: Conversational Business Intelligence
The problems with traditional reports are "lag" and "rigidity": a fixed-format report generated weekly or monthly may already be outdated by the time you see it, and if you want to analyze the data from a different angle, you have to ask the IT department for help.
AI analysts powered by A2UI are completely different:
Natural Language Queries
You say "Analyze last month's sales in the East region, grouped by product category," AI directly understands your request, queries the data, and generates results.
Dynamic Visualization
AI doesn't return a pile of raw numbers; it generates the most appropriate charts for the data. These aren't static images, but interactive charts generated in real time from the data (see the sketch after the list below). You can:
- Hover to see specific values
- Click on a data point to drill down into details
- Switch time ranges or filter conditions
- Export data or charts
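As an illustration, the declarative description behind such a chart might look roughly like this; the field names and values are placeholders for this article rather than the official schema:

```typescript
// Illustrative only: a chart described as data, which the renderer turns into
// an interactive visual. Field names and values are placeholders, not the spec.
const salesChart = {
  component: "chart",
  props: {
    kind: "bar",
    title: "East region sales by product category (last month)",
    unit: "k¥",
    series: [
      { label: "Electronics", value: 1240 },
      { label: "Apparel", value: 860 },
      { label: "Home goods", value: 530 },
    ],
  },
  action: { type: "drillDown" }, // clicking a bar asks the agent for a detailed breakdown
};
```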
Discover Insights and Take Action
When AI discovers anomalies or trends in the data, it doesn't just tell you "March sales dropped 15%" — instead it:
- Highlights this anomaly on the chart
- Generates a list of possible causes (clickable to expand details)
- Provides several suggested action plans (in button form)
- You click on a plan, AI further shows execution steps or generates related task forms
Data analysis becomes a conversation, not a static report waiting to be read.
Intelligent Customer Service & Sales: Bots That Can Actually "Get Things Done"
What's the most frustrating experience with traditional chatbots? "Your question has been recorded. Transferring to human agent."
The subtext of this message is: I understood what you said, but I can't help you.
AI customer service powered by A2UI can actually solve problems:
Product Inquiries
User asks "What water purifiers are good for home use?" AI doesn't just list a few product names — instead:
- Generates a product comparison table with key parameters like price, filter life, and suitable water quality
- The table supports sorting (by price or by sales volume)
- Each row has a "View Details" button that expands to show detailed product information
- Has "Add to Compare" checkboxes to select products for a dedicated comparison card
After-Sales Service
User says "My order hasn't arrived yet," AI:
- First displays the user's pending delivery orders (automatically retrieved from the system)
- User clicks on the problematic order
- AI shows logistics tracking timeline (each node clickable for details)
- If there's indeed a problem, generates operation buttons like "Rush Delivery" or "Apply for After-Sales"
- User clicks and the operation completes directly, no redirect needed
Booking Services
User says "I want to schedule weekend installation," AI:
- Displays calendar component with available time slots highlighted
- User clicks to select a time
- Generates confirmation card: time, address, contact info, notes
- User confirms and booking is complete, reminder automatically sent
Customer service bots can finally "get things done" — not transferring to humans, not making users do it themselves, but directly completing tasks for users.
Why Now?
A2UI didn't appear out of nowhere — it's the natural product of multiple technologies maturing.
Large Language Model Breakthroughs
Large language models like GPT-4, Claude 3, and Gemini can now:
- Understand complex, multi-step instructions
- Generate well-formatted structured output (JSON, XML, etc.)
- Adjust output content and format based on context
- Handle multi-turn conversations, maintaining state and memory
This means having AI output interface descriptions conforming to the A2UI protocol is entirely feasible. AI doesn't need to "learn to draw interfaces" — it just needs to output data describing interfaces, and the renderer handles the "drawing."
Frontend Technology Maturity
The core idea of modern frontend frameworks like React, Vue, and Svelte is "data-driven interfaces" — you give the framework data, it renders it into an interface.
This perfectly aligns with A2UI's philosophy. AI outputs data, frontend frameworks render interfaces. All that's needed between them is a "translation layer" — this is the @zhama/a2ui renderer we developed.
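Conceptually, that translation layer is a small mapping from component descriptions to framework elements. The sketch below captures the idea in simplified form; it is not the actual @zhama/a2ui implementation.

```tsx
import React from "react";

// Simplified sketch of a "translation layer": declarative data in, React elements out.
type UINode = {
  component: string;                 // e.g. "text", "button", "card", ...
  props?: Record<string, unknown>;
  children?: UINode[];
};

function renderNode(node: UINode, key?: React.Key): React.ReactNode {
  const children = node.children?.map((child, i) => renderNode(child, i));
  switch (node.component) {
    case "text":
      return <p key={key}>{String(node.props?.value ?? "")}</p>;
    case "button":
      return <button key={key}>{String(node.props?.label ?? "OK")}</button>;
    case "card":
      return <div key={key} className="a2-card">{children}</div>;
  }
  // Anything outside the predefined set is simply not rendered.
  return null;
}
```

A full renderer would cover the whole component catalog and wire actions back to the agent, but the shape of the problem stays the same: walk the data, emit components.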
Rising User Expectations
After years of AI adoption, users have become accustomed to talking with AI. But they're increasingly dissatisfied with "just talking."
- Users want AI to help them complete tasks, not just provide information
- Users want more direct operations, not jumping between multiple apps
- Users want more natural interaction, like communicating with a person
Market demand exists, and technical conditions are mature.
A2UI appeared at exactly this time to connect AI capabilities, frontend technology, and user needs.
What Are We Doing?
When Google released the A2UI protocol, they announced the official React renderer would be released in Q1 2026.
But we couldn't wait.
After the protocol was released, the Zhama AI team immediately began implementation work. Based on the A2UI protocol specification, we developed the first complete React implementation: @zhama/a2ui.
Why Implement Early?
Technical Validation
We believe A2UI is the right direction, but whether the protocol can be implemented, whether the experience is smooth, and whether there are hidden issues are things you only learn by actually building it. Through implementation, we validated the protocol's feasibility and identified areas needing improvement, which we reported back.
Accumulating Experience
Waiting until the official renderer ships to start learning means you're already behind. By implementing early, our team gained a deep understanding of A2UI's design philosophy and technical details, accumulating valuable experience for future application development.
Serving Customers
Our customers have real needs — they want to use this technology in their products as soon as possible. Waiting isn't an option; we chose to take initiative.
Features of @zhama/a2ui
Complete Implementation
100% coverage of the A2UI protocol specification, including all component types, data binding, event handling, and action systems. Not a demo — a production-ready complete implementation.
Independent Styling System
Ships with complete CSS styles and no dependency on Tailwind or other frameworks. Uses an a2- prefix to avoid style conflicts with host applications. Supports automatic dark mode.
Security-First Design
AI outputs declarative data, not executable code. The renderer only renders predefined components, won't execute arbitrary code — no code injection risk.
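To make the security model concrete, here is a small sketch of the idea: incoming payloads are plain data, checked against an allowlist before rendering, and nothing in them is ever executed. The function below illustrates the principle only; it is not the library's actual code.

```typescript
// Only predefined component types are accepted; everything else is rejected.
const ALLOWED_COMPONENTS = new Set(["text", "button", "input", "card", "list", "table", "chart"]);

// Validates a candidate node recursively. The payload is treated purely as data:
// it is never passed to eval, new Function, or dangerouslySetInnerHTML.
function isRenderable(node: unknown): boolean {
  if (typeof node !== "object" || node === null) return false;
  const { component, children } = node as { component?: unknown; children?: unknown[] };
  if (typeof component !== "string" || !ALLOWED_COMPONENTS.has(component)) return false;
  return (children ?? []).every(isRenderable);
}
```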
Open Source and Free
Open source under MIT license — you can freely use, modify, and distribute. We believe open source leads to better technology development.
We believe that embracing new technology early means gaining competitive advantage early.
What Does the Future Hold?
A2UI is just the beginning — it opens up an entirely new interaction paradigm. As technology develops, we might see more exciting changes.
Multimodal Fusion
Currently A2UI mainly handles the "AI generates interface → user clicks to operate" flow. In the future, input methods will become richer:
- Voice commands: you speak, AI adjusts the interface
- Gesture operations: in AR/VR environments, interact with AI-generated interfaces using gestures
- Eye tracking: AI adjusts information display based on your gaze
Interfaces will no longer be static but will adapt in real-time to your behavior.
Extreme Personalization
Today's applications show the same interface to everyone for the same function. But everyone has different preferences — some like information-dense layouts, others prefer clean and simple; some are visual types who prefer charts, others prefer text lists.
With AI-generated interfaces, this personalization becomes possible:
- Same "view sales data" function: analysts see detailed multidimensional charts, CEOs see a one-page concise summary
- Interface style automatically adjusts to user preferences
- Information presentation order optimizes based on user history
No-Code Development Goes Mainstream
Currently, developing an application interface requires professional designers and engineers. But if AI can generate interfaces:
- Product managers describe requirements in natural language, AI generates prototypes
- Entrepreneurs without technical backgrounds can quickly validate ideas
- Internal business systems can be customized by business users themselves
The time to build a product prototype shrinks from weeks to minutes, dramatically lowering the barrier to software development.
Blurred Human-Machine Boundaries
Ultimately, the boundary between "applications" and "AI conversations" will become blurred. You might not be able to tell whether you're "using an app" or "talking to AI" — because the two merge into one.
Interface is conversation, conversation is interface. Every sentence you say might change the interface; every click you make is a communication with AI.
The Evolution of Interaction Methods
Looking back at the history of computer interaction methods, every revolution has brought massive industry transformation:
Command Line Era (1970s-1980s)
Users interacted with computers by typing text commands. This required memorizing numerous commands, and the barrier was extremely high; only professionals could use computers.
Graphical Interface Era (1980s-2000s)
The appearance of mice, windows, and icons allowed ordinary people to use computers. This sparked the explosion of the personal computer industry, with Microsoft and Apple becoming giants.
Mobile Internet Era (2010s)
Touchscreens made interaction more intuitive, app stores made software distribution more convenient. This sparked mobile internet prosperity, with smartphones becoming essential for everyone.
Conversational AI Era (2020s)
Large language models like ChatGPT made human-machine dialogue possible. But conversation output remained text, with limited capabilities.
A2UI Era (2025-)
AI can output interactive interfaces, truly moving from "saying" to "doing." What industry transformation will this spark? We're witnessing it and participating in it.
Final Thoughts
From command line to graphical interfaces, from web pages to mobile apps, every revolution in interaction methods has created enormous commercial value and birthed new tech giants.
A2UI represents the next revolution: from "humans adapting to machines" to "machines adapting to humans."
In this new paradigm:
- AI is no longer just a tool that answers questions — it becomes an intelligent assistant that understands needs, generates interfaces, and completes operations
- Applications are no longer fixed interface flows — they become dynamic interaction experiences generated based on user needs
- Development is no longer the exclusive domain of professional engineers — it becomes a creative activity anyone can participate in
This isn't science fiction. This is happening now. And we're standing at the starting point of this revolution.
Want to learn more or get started? Visit our GitHub repository, or contact us to discuss how to apply this technology in your business.
This article was written by the Zhama AI technical team. We are committed to making AI technology better serve humanity.