AI Assistant in Adobe Experience Platform: Evaluation and continual improvement.

Namita Krishnan and Yunyao Li

12-09-2025

Woman leaning against a pillar in a bright office alongside AI interface visuals showing insights and attribution details.

AI Assistant in Adobe Experience Platform represents a leap forward in building enterprise-grade applications in the Generative AI era. This post provides a behind-the-scenes account of how we approach evaluation and continual improvement as detailed in our research paper: Evaluation and Continual Improvement for an Enterprise AI Assistant.

Problems we identified.

Enterprise users often face significant friction when trying to extract insights from their data. Conversational AI assistants, as illustrated in the figure below, promise to simplify this process. However, delivering a reliable, precision-oriented, enterprise-grade solution comes with unique challenges: fragmented data sources, evolving customer needs, and the risk of AI-generated errors that erode user trust.

A flowchart showing the end-to-end AI Assistant architecture, showing how user queries flow through retrieval, grounding, answer generation, and evaluation steps.

AI Assistant overall architecture.

As we delved deeper into this project, we encountered a critical question: How do we effectively evaluate and improve an AI assistant that’s constantly evolving in a dynamic enterprise environment? This challenge is far from trivial. Enterprise AI assistants need to deal with sensitive customer data, adapt to shifting user bases, and balance complex metrics while maintaining privacy and security. Traditional evaluation methods fall short in this context, often providing incomplete or misleading feedback.

Our approach to solve these problems.

To address these issues, we’ve developed a novel framework for evaluation and continual improvement. At its core is the observation that 'not all errors are the same'. We have adopted a 'severity-based' error taxonomy that aligns our metrics with real user experiences (see the table below):

A table describing the error severity framework in AI Assistant, with categories, definitions, consequences, and examples.

Error severity framework in AI Assistant.

This taxonomy allows us to prioritize improvements that have the most significant impact on user experience and trust. It’s part of a comprehensive approach that includes:

A flowchart diagram illustrating the continual improvement framework for AI Assistant, showing how labeled data, dashboards, and expert error analysis feed into ongoing model enhancements.

Evaluation and continual improvement framework in AI Assistant.

The impact of this framework on our customers has been substantial. By focusing on severity-based errors, we’re delivering more reliable and trustworthy AI assistance. Our human-centered approach ensures that improvements align with real user needs and pain points, as illustrated in the table below.

A dashboard view of an error-severity table with a time-series line chart that tracks error-rate trends with error bars.

Dashboard showing a snapshot of error severities and time-evolution for a single component. Illustrative data of similar magnitude to production numbers.

What's next for AI Assistant?

We’re just getting started. Our focus now is on making AI Assistant in Adobe Experience Platform even more proactive, meeting the users in their natural workflow and expanding coverage. We’re also improving our evaluation framework along a few key dimensions:

To learn more about our work and the impact we’re seeing, read the full paper here.

If building generative AI at enterprise scale excites you — explore the latest highlights and career opportunities at the Adobe Experience Platform AI site.

Paper authors: Akash V. Maharaj, Kun Qian, Uttaran Bhattacharya, Sally Fang, Horia Galatanu, Manas Garg, Rachel Hanessian, Nishant Kapoor, Ken Russell, Shivakumar Vaithyanathan, Yunyao Li

Guang-Jie Ren and Huong Vu also contributed to this post.

https://business.adobe.com/fragments/resources/cards/thank-you-collections/generative-ai