How to Write Nano Banana Prompts:Gemini 2.5 Flash Image Official Tutorial Decode

How to Write Nano Banana Prompts:Gemini 2.5 Flash Image Official Tutorial Decode

Gemini can generate and process images conversationally. You can prompt Gemini with text, images, or a combination of both allowing you to create, edit, and iterate on visuals with unprecedented control:

  • Text-to-Image: Generate high-quality images from simple or complex text descriptions.

  • Image + Text-to-Image (Editing): Provide an image and use text prompts to add, remove, or modify elements, change the style, or adjust the color grading.

  • Multi-Image to Image (Composition & Style Transfer): Use multiple input images to compose a new scene or transfer the style from one image to another.

  • Iterative Refinement: Engage in a conversation to progressively refine your image over multiple turns, making small adjustments until it’s perfect.

  • High-Fidelity Text Rendering: Accurately generate images that contain legible and well-placed text, ideal for logos, diagrams, and posters.

Nano Banana Prompting guide and strategies

Mastering Nano Banana Image Generation starts with one fundamental principle:

Describe the scene, don’t just list keywords. The model’s core strength is its deep language understanding. A narrative, descriptive paragraph will almost always produce a better, more coherent image than a list of disconnected words.

Prompts for generating images

The following strategies will help you create effective prompts to generate exactly the images you’re looking for.

1. Photorealistic scenes

For realistic images, use photography terms. Mention camera angles, lens types, lighting, and fine details to guide the model toward a photorealistic result.

Template: A photorealistic [shot type] of [subject], [action or expression], set in [environment]. The scene is illuminated by [lighting description], creating a [mood] atmosphere. Captured with a [camera/lens details], emphasizing [key textures and details]. The image should be in a [aspect ratio] format.

-Shot Type:Close-up,Medium Shot,Wide Shot,Overhead/Bird’s-eye,etc.

-Lighting Description:Natural Daylight,Soft Diffused,Dramatic Chiaroscuro,Ambient Artificial,etc.

-Aspect Ratio:16:9 (Widescreen),4:3 (Standard),1:1 (Square),21:9 (Ultrawide),etc.

-Camera/Lens details:35mm wide-angle, f/2.8 ( Environmental storytelling with wide context),

  • 50mm prime, f/1.8 (Classic portrait bokeh with natural framing),

85mm telephoto, f/4 ( Telephoto compression for subject isolation ),

100mm macro, f/5.6 (Microscopic texture magnification).

Prompt

A photorealistic close-up portrait of an elderly Japanese ceramicist with
deep, sun-etched wrinkles and a warm, knowing smile. He is carefully
inspecting a freshly glazed tea bowl. The setting is his rustic,
sun-drenched workshop. The scene is illuminated by soft, golden hour light
streaming through a window, highlighting the fine texture of the clay.
Captured with an 85mm portrait lens, resulting in a soft, blurred background
(bokeh). The overall mood is serene and masterful. Vertical portrait
orientation.

2. Stylized illustrations & stickers

To create stickers, icons, or assets, be explicit about the style and request a transparent background.

Template: A [style] sticker of a [subject], featuring [key characteristics] and a [color palette]. The design should have [line style] and [shading style].The background must be transparent.

-Style:Flat Design(minimalist flat, cartoon flat, isometric),Painterly( watercolor wash, oil texture, ink splatter),Retro(80s sticker, vintage comic, psychedelic),Tech/Futuristic(cyberpunk glitch, neon grid, holographic),Handmade(paper cutout, linocut, embroidery mockup),etc.

-color palette:Soft:(pastel gradients, muted earth tones, desaturated duo-tone),Bold( high-contrast neon, comic book CMYK, monochromatic + accent),Natural(forest greens, sunset orange-purple, ocean blues),etc.

-line style:Clean(uniform thickness, sharp corners, no outlines),Organic(variable-weight strokes, sketchy crosshatching, dry brush edges),Decorative(dotted borders, double lines, glowing outlines),etc.

-shading style:Flat(cel-shaded blocks, no shadows, 2-tone gradient),Realistic(soft airbrush, ambient occlusion, 3D render lighting),Textured(crosshatching, stippling, grain overlay),etc.

Prompt

3. Accurate text in images

Gemini excels at rendering text. Be clear about the text, the font style (descriptively), and the overall design.

Template: Create a [image type] for [brand/concept] with the text “[text to render]” in a [font style]. The design should be [style description], with a [color scheme].

Prompt

Create a modern, minimalist logo for a coffee shop called ‘The Daily Grind’.
The text should be in a clean, bold, sans-serif font. The design should
feature a simple, stylized icon of a a coffee bean seamlessly integrated
with the text. The color scheme is black and white.

4. Product mockups & commercial photography

Perfect for creating clean, professional product shots for e-commerce, advertising, or branding.

Template: A high-resolution, studio-lit product photograph of a [product description] on a [background surface/description]. The lighting is a [lighting setup,e.g., three-point softbox setup] to [lighting purpose]. The camera angle is a [angle type] to showcase [specific eature]. Ultra-realistic, with sharp focus on [key detail]. [Aspect ratio].

Prompt

A high-resolution, studio-lit product photograph of a minimalist ceramic
coffee mug in matte black, presented on a polished concrete surface. The
lighting is a three-point softbox setup designed to create soft, diffused
highlights and eliminate harsh shadows. The camera angle is a slightly
elevated 45-degree shot to showcase its clean lines. Ultra-realistic, with
sharp focus on the steam rising from the coffee. Square image.

5. Minimalist & negative space design

Excellent for creating backgrounds for websites, presentations, or marketing materials where text will be overlaid.

Template:A minimalist composition featuring a single [subject] positioned in the [bottom-right/top-left/etc.] of the frame. The background is a vast, empty [color] canvas, creating significant negative space. Soft, subtle lighting.[Aspect ratio].

Prompt

A minimalist composition featuring a single, delicate red maple leaf
positioned in the bottom-right of the frame. The background is a vast, empty
off-white canvas, creating significant negative space for text. Soft,
diffused lighting from the top left. Square image.

6. Sequential art (Comic panel / Storyboard)

Builds on character consistency and scene description to create panels for visual storytelling.

Template:A single comic book panel in a [art style] style. In the foreground,[character description and action]. In the background, [setting details].The panel has a [dialogue/caption box] with the text “[Text]”. The lighting creates a [mood] mood. [Aspect ratio].

-[art style]:

  • Regional Styles:American comic book,Japanese manga (shonen/shojo),European ligne claire,Manhua ink wash

  • Texture Styles :Crosshatching noir,Watercolor splatter,Grainy halftone,Digital cel-shaded

  • Era Styles:Golden age pulp,80s anime cel,Retro propaganda poster,Cyberpunk glitch

  • Hybrid Styles:Steampunk woodcut,Art Nouveau manga,Ukiyo-e storyboard

-[dialogue/caption box]:

  • Shape Types:Rounded bubble,Spiked thought cloud,Scroll-shaped caption,Hexagonal tech panel

  • Border Styles:Glowing outline,Hand-drawn squiggles,Neon circuit,Rusty metal frame

  • Text Effects:Bold comic sans,Typewriter font,Holographic text,Blood-drip letters

  • Positioning:Overlapping panel,Floating sidebar,Diagonal scream bubble,Background parchment

- [mood] :

  • Intensity:High-contrast chiaroscuro,Soft diffused light,Neon-drenched

  • Color Temp:Cool moonlight,Warm sunset,Sickly green,Nuclear orange

  • Movement:Frozen moment,Motion blur action,Slow-mo particles

  • Genre-specific :Noir suspense,Romantic glow,Horror shadows,Sci-fi holograms

Prompt

A single comic book panel in a gritty, noir art style with high-contrast
black and white inks. In the foreground, a detective in a trench coat stands
under a flickering streetlamp, rain soaking his shoulders. In the
background, the neon sign of a desolate bar reflects in a puddle. A caption
box at the top reads “The city was a tough place to keep secrets.” The
lighting is harsh, creating a dramatic, somber mood. Landscape.

Nano Banana Prompts for editing images

These examples show how to provide images alongside your text prompts for editing, composition, and style transfer.

1. Adding and removing elements

Provide an image and describe your change. The model will match the original image’s style, lighting, and perspective.

Tempalte:Using the provided image of [subject], please [add/remove/modify] [element] to/from the scene. Ensure the change is [description of how the change should integrate].

Prompt

”Using the provided image of my cat, please add a small, knitted wizard hat
on its head. Make it look like it’s sitting comfortably and matches the soft
lighting of the photo.”

2. Inpainting (Semantic masking)

Conversationally define a “mask” to edit a specific part of an image while leaving the rest untouched.

Tempalte:Using the provided image, change only the [specific element] to [new lement/description]. Keep everything else in the image exactly the same, preserving the original style, lighting, and composition.

Prompt

”Using the provided image of a living room, change only the blue sofa to be
a vintage, brown leather chesterfield sofa. Keep the rest of the room,
including the pillows on the sofa and the lighting, unchanged.”

3. Style transfer

Provide an image and ask the model to recreate its content in a different artistic style.

Tempalte:Transform the provided photograph of [subject] into the artistic style of [artist/art style]. Preserve the original composition but render it with [description of stylistic elements].

Prompt

”Transform the provided photograph of a modern city street at night into the artistic style of Vincent van Gogh’s ‘Starry Night’. Preserve the original composition of buildings and cars, but render all elements with swirling, impasto brushstrokes and a dramatic palette of deep blues and bright yellows.”

4. Advanced composition: Combining multiple images

Provide multiple images as context to create a new, composite scene. This is perfect for product mockups or creative collages.

Tempalte:Create a new image by combining the elements from the provided images. Take the [element from image 1] and place it with/on the [element from image 2].The final image should be a [description of the final scene].

Prompt

”Create a professional e-commerce fashion photo. Take the blue floral dress
from the first image and let the woman from the second image wear it.
Generate a realistic, full-body shot of the woman wearing the dress, with
the lighting and shadows adjusted to match the outdoor environment.”

Best Practices

To elevate your results from good to great, incorporate these professional strategies into your workflow.

  • Be Hyper-Specific: The more detail you provide, the more control you have. Instead of “fantasy armor,” describe it: “ornate elven plate armor, etched with silver leaf patterns, with a high collar and pauldrons shaped like falcon wings.”

  • Provide Context and Intent: Explain the purpose of the image. The model’s understanding of context will influence the final output. For example, “Create a logo for a high-end, minimalist skincare brand” will yield better results than just “Create a logo.”

  • Iterate and Refine: Don’t expect a perfect image on the first try. Use the conversational nature of the model to make small changes. Follow up with prompts like, “That’s great, but can you make the lighting a bit warmer?” or “Keep everything the same, but change the character’s expression to be more serious.”

  • Use Step-by-Step Instructions: For complex scenes with many elements, break your prompt into steps. “First, create a background of a serene, misty forest at dawn. Then, in the foreground, add a moss-covered ancient stone altar. Finally, place a single, glowing sword on top of the altar.”

  • Use “Semantic Negative Prompts”: Instead of saying “no cars,” describe the desired scene positively: “an empty, deserted street with no signs of traffic.”

  • Control the Camera: Use photographic and cinematic language to control the composition. Terms like wide-angle shot, macro shot, low-angle perspective.