← Back to all posts

GENSEEAI PRODUCT BLOG

From image generation to workflow generation

Using image generation in GenseeAI: setup, play-arounds, and real workflows.

May 24, 2026

In this post, we'll walk through three things:

Image generation in GenseeAI moving from one-off outputs to full workflow generation

1. How to set up image generation in GenseeAI

Getting started is simple.

To enable image generation:

  1. Go to Models in the left sidebar
  2. Add your own image model API key
  3. Click Continue
  4. Go back to Chat and start Image Mode

That's it.

Adding an image model API key in the GenseeAI Models panel to enable Image Mode

Once your API is connected, you can start generating images directly inside GenseeAI instead of switching out to a separate tool.

You can also choose the model, size, and quality depending on what kind of output you want and how you want to manage usage.

Image generation works in both OpenClaw and Hermes instances, and is available on both desktop and mobile.


2. A few early play-arounds

Before getting into larger workflows, we tested a few simple examples to see how image generation feels inside GenseeAI.

These are not meant to be the only use cases. They are just early ways to explore what becomes possible once image generation stays inside the same environment.

Virtual try-on

One test was a simple virtual try-on flow.

We uploaded a pair of glasses and a portrait, then generated a realistic try-on result.

This is a useful example of image-to-image generation. The point is not just that it can modify an image, but that it can do so as part of a broader task flow.

Uploaded pair of glasses used as input for the virtual try-on Original portrait uploaded for the virtual try-on
Realistic virtual try-on result generated in GenseeAI showing the portrait wearing the uploaded glasses

Flyer / poster creation

We also tested turning a real lunch photo into a restaurant flyer.

This is a good example of how a raw real-world input can become a more polished marketing asset in the same workflow.

Original lunch photo of a seabass dish used as input Restaurant flyer generated from the lunch photo inside GenseeAI

Background change

Another test was changing a selfie background into a Japan cherry blossom landscape.

Again, the interesting part is not just that the image can be edited. It is that these edits can now happen inside the same broader workflow instead of in a separate image tool.

Original selfie used as input for the background change Selfie with the background changed to a Japan cherry blossom landscape

These early tests matter because they show that image generation is not limited to one kind of prompt. It can support both text-to-image and image-to-image flows, depending on what the task needs.


3. Where it gets more useful: inside workflows

The most interesting part for us is not image generation on its own.

It is image generation inside a workflow.

That is where it becomes more than just an output generator.

Instead of:

you can now generate images as part of the same work you are already doing.

We think this gets especially useful in e-commerce and website building.

Slide creation workflows

If you are working inside a slide creation workflow, image generation can support work like:

For example, if you are building a presentation, you often need more than text and structure. You need visuals that fit the story of the deck and help each section land more clearly. That is much more useful than generating "an image" in isolation, because it directly supports the presentation workflow itself.

The image is not the final goal; it is part of helping the deck communicate better.

Here's one example we made with GenseeAI for GenseeAI itself: GenseeAI deck

Slide deck cover and section visuals generated with image generation inside a GenseeAI slide-creation workflow

Website building workflows

The same goes for website building.

If you are already using GenseeAI to draft a page, write content, or build HTML, image generation becomes more useful when it can stay connected to that same task.

For example, you can now create:

instead of leaving the workflow and treating images as a separate step.

This makes website-building tasks feel much more connected. The words, structure, HTML, and visuals can now live much closer together.

Website banner and section visuals generated with image generation inside a GenseeAI website-building workflow

Why text-to-image and image-to-image both matter

Another important part of this launch is that the system supports both:

That matters because real workflows need both.

Sometimes you start from a prompt.

Sometimes you start from an existing photo, product image, screenshot, or draft asset and want to transform it.

A workflow system should support both directions.

Otherwise users still end up splitting their work between multiple tools depending on what kind of image task they are doing.


More than one-off generation

The bigger point here is not just that GenseeAI can now generate images.

It is that users can now:

That makes the feature much more useful for the kinds of work people are already doing in GenseeAI.

We think the most obvious starting points right now are:

A LinkedIn post visual generated inside a GenseeAI social-posting workflow

But we also expect users to find many more.