Olivia Biban

Home

Work

About

Home

Work

About

Creative Agent

Creative Agent is an agentic AI creative solution transforming complex creative development by giving advertisers a creative partner and strategist at their fingertips. It can think, plan, and produce amazing content alongside you.

This end-to-end automation solution combines Amazon's signals with conversational guidance to produce high-quality advertising content quickly and at no cost. Research, strategy, copywriting, images, videos, audio, personalization at scale - it’s all here.

Creative Agent revolutionizes campaign development by taking an advertising process that traditionally requires weeks of work and significant financial investment and transforming it into something that can be accomplished within a few hours at no additional cost. The solution leverages Amazon's retail intelligence and brand signals to ensure brand safety and compliance across Amazon Ads. This helps advertisers scale their advertising presence with enhanced reach and engagement across all ad formats.

Creative Agent

Visit the live product

Meet Creative Agent

Chat to create images, videos, text, and audio.

What would you like to build today?

Create a video ad

Quickstart an ASIN video ad

Create an audio ad

Create an image ad

ROLE

AI Creative Technologist

CONTRIBUTIONS

Orchestrator System Prompt

Agent Personas

Prototyping

Golden Dataset Curation

Video Prompting Techniques

UI/UX Workflows

Human Diversity Improvements

Video End Card Generation

Agent Tooling

IMPACT

338%

Jump in performance

for campaign creative produced with Creative Agent

+10.3%

Higher average ROAS

For campaigns using AI-generated content

5x more products

Advertised per brand

Using AI-powered creative tools

Higher Click-through rates

on Sponsored Brands video ads produced with Creative Agent

+12%

Average increase in sales

for advertisers using video ads produced with AI tools

2x more images

Used per product

By brands using AI-powered creative tools

89%

Purchases from new-to-brand customers

after video campaigns produceded with Creative Agent

Months-> Days

Video Creative production cycles compressed

For advertisers of all skill levels and sizes

ROLE

AI Creative Technologist

CONTRIBUTIONS

Orchestrator System Prompt

Agent Personas

Prototyping

Golden Dataset Curation

Video Prompting Techniques

UI/UX Workflows

Human Diversity Improvements

Video End Card Generation

Agent Tooling

IMPACT

338%

Jump in performance

for campaign creative produced with Creative Agent

+10.3%

Higher average ROAS

For campaigns using AI-generated content

5x more products

Advertised per brand

Using AI-powered creative tools

Higher Click-through rates

on Sponsored Brands video ads produced with Creative Agent

+12%

Average increase in sales

for advertisers using video ads produced with AI tools

2x more images

Used per product

By brands using AI-powered creative tools

89%

Purchases from new-to-brand customers

after video campaigns produceded with Creative Agent

Months-> Days

Video Creative production cycles compressed

For advertisers of all skill levels and sizes

The Challenge

Getting an AI system to reason through a creative process the way a human creative director would required more than connecting generation tools together. The system needed to make judgment calls, explain its thinking, and produce work that was strategically grounded in a specific brand and product at every step, through a conversational interface intuitive enough for any advertiser to use without creative or technical expertise.

Building that meant solving problems with no established playbook. Agentic creative systems at this scale didn't exist yet, there were no training datasets for end-to-end AI-driven creative development, video generation models were nascent and largely undocumented for advertising use cases, and making a deeply complex multi-agent workflow feel transparent and simple inside a chat interface was genuinely new territory.

The Approach

With no existing playbook for agentic creative systems at this scale, the work started at the foundation. The first prototype established the core guiding principle: the system should reason through the creative process, surface its thinking at every step, and present well-developed options while keeping the user in control of every decision. Achieving that required building instruction sets for the orchestrator and specialized sub-agents that encoded expert creative and technical judgment, informed by deep research into video generation models and advertising best practices. Transparency was the guiding principle for UX workflows and interface design: users could chat, interact with generated content, and see their project update in real time, all within a unified view.

Getting the system to reason well required something to reason against. Building a golden dataset established a human-curated benchmark for what good end-to-end AI creative development looks like, at a stage when nothing comparable existed. That foundation made systematic evaluation possible and shaped how the team measured quality as the product developed.

After launch, that same methodical approach carried into improvement work: testing generated output across a large product set and edge cases, updating image and video agent system prompts to improve human diversity, and integrating Creative Studio's curated Themes experience into the chat system.

Technical Details

Prototype and Agent Instruction Sets

A complex workflow like producing campaign creative across multiple formats required a modular system of specialized agents rather than a single model handling everything. The orchestrator needed to route requests dynamically based on complexity, delegating to sub-agents for research, concept development, image production, and video production, while maintaining coherent output across the entire workflow.

Writing effective instruction sets for a multi-agent architecture meant encoding enough creative and strategic judgment into each agent that it could reason through open-ended problems independently, while staying aligned with the orchestrator's intent and the advertiser's inputs.

To test that in practice, we built a working prototype: a system that took a small set of advertiser inputs, combined them with proprietary retail data as a core part of its reasoning, and produced ad concepts, storyboards, scene scripts, and a stitched video end to end. That early system became the foundation for the multi-agent architecture Creative Agent ships on today.

Video Generation Model Research

Video generation for advertising was largely uncharted territory when Creative Agent was being built. The available models had different strengths, failure modes, and prompting requirements, and what worked for general video generation often didn't translate to ad-ready output.

Understanding how to get consistent, brand-aligned, visually compelling results from models like Runway, Kling, or Veo required systematic testing across different product types and creative scenarios.

The best practices that came out of that research became the technical foundation for the video production agent instruction sets, encoding knowledge about what these models respond to and their limitations directly into how the agent thinks and produces.

Video agent adapts the prompt for different models

Different models have different strengths for different camera motions. I developed adaptable prompting techniques for different camera motions that produce consistent motion independent of the video generation model

subtle zoom

dolly zoom

camera pan

arc shot

The video agent's system prompt encodes adaptable techniques for different video gen model strengths

GOLDEN DATASET

Curating a human baseline for system evaluations

Before Creative Agent could be meaningfully evaluated, there was no baseline for what good end-to-end AI creative development looked like. Building a golden dataset from scratch addressed that directly: a human-curated collection covering brand and product context, campaign goals, target audiences, fully developed concepts with documented reasoning, and scene-level storyboards.

Constructing it required making explicit what good creative reasoning looks like at each stage, a significant design challenge when creative quality is subjective and looks different for each brand. The dataset served as both a training resource for early versions of Creative Agent and a benchmark the team could evaluate model output against, at a stage when no comparable reference existed.

That foundation also made it possible to build evaluation criteria that could adapt as the system evolved, rather than remaining fixed against a snapshot of early output.

Human Diversity Improvements

Human representation in generated output is a nuanced problem that runs deeper than prompt adjustments alone. The system serves advertisers across thousands of product categories and many different locales. Diversity failures were often inconsistent and context-dependent rather than systematic.

Identifying where the system fell short required testing across a deliberately broad range of products and constructing edge cases that exposed failure modes not visible in standard output.

The findings informed targeted updates to the image and video agent system prompts, steering the agents toward more consistent and representative output without compromising product relevance or creative quality.

Auto generation natively generates human diversity

When not specifically prompted Creative Agent produces a range of options showing human diversity while maintaining product and brand relevance in creatively crafted scenes.

Options generated for kids' trail running shoes - shared family adventure concept

UI/UX and Unified Project View

A conversational interface creates a specific UX problem for creative work: as the project grows, the chat history becomes a poor record of what's been produced and how it fits together.

The solution was a unified view where project artifacts, including concepts, storyboards, images, videos, music, and voiceover, appear and update alongside the conversation in real time. Users can preview individual video scenes and the assembled video, music, and voiceover directly from the project sidebar, and edit assets through the chat interface or in dedicated editors, with Creative Agent staying aware of all changes made outside the chat.

Designing that experience required making deliberate decisions about separating conversational memory from project state, surfacing updates at the right moments without interrupting the creative flow, and keeping the agent informed across editing contexts.

A conversational interface creates a specific UX problem for creative work: as the project grows, the chat history becomes a poor record of what's been produced and how it fits together.

Creative Agent main entry point with chat history menu

Unified chat + project artifacts view

Dedicated video editor

Animating scenes and editing through chat

Themes Integration

The Themes system in Creative Studio was built for a single-step generation workflow: select a style, generate an image. Bringing that into Creative Agent meant adapting it for a multi-step conversational process where the user is working through research, concepts, and storyboards before any assets are produced.

I built a theming tool the agents could use directly, surfacing existing curated themes when the user raised the topic and acting on new theme input from as little as a short theme name, without requiring any prompting knowledge or manual style configuration.

Localization was added through the same system, treating cultural and regional visual conventions as a theming problem rather than a translation one, so advertisers could adapt creative output for different markets, from visual aesthetics and culturally relevant imagery, through the same conversational interface.

Themed content generation through chat

Agents use theming tool to apply curated themes, localize output, and suggest new themes based on current market trends and research.

Human curated theme: Coastal Style Interior

Agent curated new theme: Japandi style interior

Human curated theme: Christmas (indoor)

Agent curated new theme: Home office

Themed options for an ottoman product with human and agent curated new themes

Results

Creative Agent compressed what traditionally took weeks of work and significant budget into hours, at no additional cost to advertisers. Early results showed a 10.3% higher ROAS on Sponsored Brands campaigns using AI-generated images, a 12% average increase in sales for advertisers using AI tools, and the Bird Buddy case study delivered a 3x higher CTR and 121% ROAS with a campaign produced in three days.

The foundational work presented here shaped how the product was built and evaluated, contributing to the high advertiser adoption rates right from the official launch. The golden dataset directly informed decisions about agent design, tool access, and process coordination that the product still runs on today. The human diversity testing gave scientists the evidence-based feedback needed to identify root causes and make targeted improvements across a wide range of products and contexts. The unified project view established a novel way to present and interact with generated content both inside the chat interface and alongside it, making a deeply complex agentic workflow feel transparent and simple for advertisers of every skill level.

What I Learnt

Building Creative Agent from prototype to shipped product taught me what it means to work at the frontier of a genuinely new category of AI system. With no established playbook, defining what good looked like, building the tools to measure it, and iterating toward it were all happening in parallel. The golden dataset work taught me that evaluation design is as much a creative problem as it is a technical one. I learnt that articulating what good creative reasoning looks like at each stage of a production process needs to be precise and clearly structured to optimize LLM learning specific to creative tasks.

Working on instruction sets for a multi-agent architecture serving millions of advertisers across thousands of product categories taught me that robustness at scale requires intuition and systematic testing working together throughout. The human diversity work reinforced this: what looks like a prompt engineering problem on the surface often has deeper root causes that only careful testing across a wide range of edge cases can surface. The UX work taught me that interface design and system design are inseparable in an agentic system. Decisions about how to present generated content needed to both support human understanding and keep the agent informed across editing contexts. I learnt how to design hybrid interfaces that enable human interaction and facilitate targeted LLM context building at the same time.