Figma Agent Review: Real-World Testing, Prompts and Results

In this article, I share my real-world experience of testing Figma Agent on Flex Design System, including what worked, what didn’t and the prompts I used throughout.

Posted in

Design Systems

Written by

Ben Ellis

Published on

2 July 2026

Reading time

Testing an AI design agent with Flex Design System in Figma

Figma Design’s new Agent is one of the most interesting features to emerge over the past few months. While much of the discussion has focused on generating interfaces from prompts, I wanted to see how useful it could be for maintaining and improving a design system.

Over the past few weeks, I’ve been testing the Agent extensively on Flex Design System, passing it real-world management tasks that would typically consume a huge amount of designer time.

Before diving in, it’s worth saying that I’m by no means a prompt engineering expert. The prompts throughout this article simply reflect what has and hasn’t worked for me during my own testing. One thing I have found is that the Agent performs noticeably better when you’re specific about what you want, provide plenty of context and clearly define the expected outcome.

I’ve also found that it’s most valuable when working in an area where you already have experience. Because I know Flex Design System inside out, I can quickly judge whether the Agent’s suggestions are accurate, whether it’s misunderstood something or whether it’s confidently heading in the wrong direction. Like any AI tool, its output still benefits from human review and domain expertise.

To make these examples more actionable, I’ve included the prompts I used for each task throughout the article.

Some of the results were genuinely impressive, while others showed that the feature still has a long way to go.

Flex Design System prompt for auditing variable collections and design tokens in Figma

Where Agent Works Well

After several days of testing, it became clear that the Agent performs best when given systematic, logic-based tasks.

If the work requires analysing patterns, checking consistency or processing large amounts of information, it can be surprisingly capable.

Auditing Naming Conventions

One of the first tasks I gave the Agent was auditing naming conventions across variables and components.

Like many design systems, Flex has evolved over time. New features have been added, components have expanded and naming conventions have naturally changed and matured. Even with good governance, small inconsistencies inevitably find their way into the system.

Sample prompt

“I want you to audit my variable collections and check for any inconsistencies in naming or organisation. I then want you to suggest if there are any improvements I can make to follow Figma best practices. Again, I don’t want you to change aything, just audit and provide suggestions.”

The Agent was good at identifying these inconsistencies.

It could:

Identify variables that didn’t follow established naming patterns
Highlight inconsistently named components
Suggest standardised naming structures
Surface areas where conventions had diverged over time

This type of work can be incredibly tedious to do manually, particularly in files containing hundreds of variables, hundreds of component variants and a large library of supporting icons and example screens. It’s also the sort of task where most designers naturally lose concentration over time. As the work becomes more repetitive, it’s easy to overlook inconsistencies or skip over small details simply because your attention starts to drift. Design Agent doesn’t suffer from that kind of fatigue. It can apply the same level of scrutiny to the first variable as it does to the four hundredth, making it particularly well suited to systematic auditing tasks.

Figma Agent prompt for auditing Figma component descriptions and documentation

Filling Documentation Gaps

I also used the Agent to audit descriptions across variables, components and slots.

Again, this is exactly the kind of task that often gets postponed. Everyone agrees descriptions are important, but writing and maintaining them across an entire system can be time-consuming.

Sample prompt

“I would like you to check all my component descriptions for consistency and suggest any gaps, improvements or enhancements so that they follow best practice.”

The Agent was able to:

Identify missing descriptions
Check for inconsistencies in existing descriptions
Suggest new descriptions that aligned with existing patterns
Help create descriptions where nothing existed previously

The results weren’t perfect, and I still reviewed everything manually, but they dramatically reduced the effort required.

In many cases, the hardest part of documentation is simply getting started. The Agent removed much of that friction.

Component documentation prompt for generating Figma documentation in Flex Design System

Generating Component Documentation

I also asked the Agent to generate documentation for components.

Because documentation is often formulaic and based on observable patterns, the Agent could infer component purpose, identify properties and generate sensible starting points.

Sample prompt

“Generate documentation for each component based on its purpose, properties and variants. Write a short summary, usage guidance and any relevant notes for implementation.”

The content itself was generally very good, but I did have a bit of back and forth with the Agent to establish exactly how I wanted the documentation to be structured and formatted. Once we’d settled on a format, however, I found it was able to apply that same structure consistently across the remaining components and pages with very little additional input.

That experience highlighted something I encountered throughout my testing. Rather than expecting perfect results from a single prompt, I often achieved much better outcomes by refining the conversation over a couple of iterations. Once the Agent understood the desired style and level of detail, it became far more consistent.

It didn’t replace editorial review, but it provided a surprisingly useful first draft.

For teams that struggle to keep component documentation up to date, this alone could save a significant amount of time.

The manage libraries panel showing a clean bill of health with no missing libraries

Finding Detached Styles and Variables

Another practical use case was identifying detached styles and variables that had found their way into files.

These issues are easy to introduce and notoriously difficult to spot manually, particularly in larger systems.

Sample prompt

“Find any detached styles or variables in this file. List them, explain why they appear detached, and suggest whether they should be reattached, renamed or removed.”

The Agent did a good job of surfacing these inconsistencies and highlighting elements that appeared to be disconnected from the system.

Again, this isn’t glamorous work, but it’s exactly the sort of maintenance task that helps preserve the integrity of a design system over time.

Figma agent prompt to identify gaps in component coverage when captured to industry best practice

Gap Analysis

I also experimented with asking Design Agent to perform a gap analysis on the system.

Sample prompt

“Can you audit my library and identify any gaps in component and layout types that I might have if we compare it alongside leading front end libraries such as Tailwind CSS or any others which you consider to be best in class for UX and UI design.”

This was surprisingly effective.

It could identify areas where patterns appeared incomplete, highlight potential inconsistencies and suggest opportunities for improvement.

The recommendations still required human judgement, but the analysis itself was often thoughtful and useful.

Rather than replacing decision-making, it acted more like a second pair of eyes.

AI prompt for populating Figma designs with realistic content in Flex Design System

Content Population

I have previously written about plugins that I used to pull content in from either Google Sheets or JSON files. I wanted to see if the Agent was able to do the same. I had to write 2 prompts for this, as I think my initial one wasn’t specific enough.

Sample prompt 1

“I want you to replace the content across these screens using examples from https://www.purplebricks.co.uk/search/property-for-sale/bishop’s-stortford?page=1&sortBy=2&betasearch=true&latitude=51.8838772&longitude=0.1547678&location=cm23&searchRadius=2&searchType=ForSale&soldOrLet=false”

Sample prompt 2

“Em, that was ok, but firstly you didn’t replace the placeholder images. I’ve updated the layout, please do this again and replace the images as well. The content in the Details screen should match the first card in the Listing screen.”

For prototypes, demonstrations and testing scenarios, this felt incredibly powerful. Instead of manually creating placeholder content or building temporary data structures, I could simply ask the Agent to fetch and populate realistic content.

That said, your experience will vary depending on the source you’re connecting to. Some websites and APIs are far easier for the Agent to access and interpret than others, and not every source exposes its content in a way that supports remote AI access. When it works, though, it can dramatically reduce the time spent populating layouts with realistic data.

For designers who regularly build data-driven interfaces, this capability alone could remove a lot of repetitive work.

Current limitations of the Figma agent shown in Flex Design System components example

Where It Still Falls Short

As impressive as some of the results were, there is one area where the Agent currently struggles.

Creating components

I asked it to create the missing components identified in the gap analysis, using my existing component styles, tokens and established patterns.

Sample prompt

“Based on the gap analysis I want you to create the missing components that you have identified. You can create them on the current page, taking each tier in turn. You should ensure you use my existing tokens for all spacing, fills, strokes, radii, typography. Use auto-layout and where applicable use fill and hug for constraints to ensure the components are responsive. Do not use any properties that are not already in the library.”

Unfortunately, the results weren’t that great.

The generated components often:

Ignored existing patterns
Failed to use tokens correctly
Produced inconsistent structures
Created solutions that simply didn’t feel native to the system

The issue isn’t that it couldn’t generate components. It could.

The issue is that it didn’t truly understand the design system it was working within.

Design systems are full of nuance. Naming conventions, composition patterns, token architecture, accessibility requirements and years of accumulated decisions all contribute to how components should be built.

At the moment, the Agent doesn’t appear to fully understand these relationships.

Final Thoughts

Figma Agent is already useful today, but probably not in the way many people initially expected.

I don’t currently see it replacing designers or generating production-ready design systems from prompts. What I do see is a capable assistant for maintaining and improving existing systems.

It excels at systematic work that designers know needs doing but often postpone because it’s repetitive, tedious or time-consuming.

If you expect it to understand the nuances of your design system and build new components exactly as you would, it’s not there yet.

That may change rapidly as the feature evolves. In fact in time between testing the Agent and writing this article, Figma Skills have been rolled out, which may help in providing further context and guardrails that will elevate the quality of the output.

For now, though, its biggest strength isn’t designing, well for me at least. It’s helping designers build and maintain better design systems.

Figma Agent Review: Real-World Testing, Prompts and Results

Table of contents

Where Agent Works Well

Auditing Naming Conventions

Sample prompt

Filling Documentation Gaps

Sample prompt

Generating Component Documentation

Sample prompt

Finding Detached Styles and Variables

Sample prompt

Gap Analysis

Sample prompt

Content Population

Sample prompt 1

Sample prompt 2

Where It Still Falls Short

Creating components

Sample prompt

Final Thoughts

Links