GROK IMAGINE PROMPTS

OUTPUT

The output of this generator is a JSON formatted text, which defines a structured representation of your prompt.

// A JSON object with a set of properties expressed as name:value pairs.
{
  "name1": "value1",
  "name2": "value2"
}


// A JSON list (array) of values.
[
  "value1", "value2"
]


// A JSON list (array) of objects.
[
  {
    "name1": "value1",
    "name2": "value2"
  },
  {
    "name1": "value1",
    "name2": "value2"
  }
]

Your can see the full JSON prompt at the bottom of this page. Click the generate icon to update the JSON prompt after your changes.

USING IMAGES

In Grok Imagine you can reference attached images, in your prompt, via the @ImageId tag.

Use the @ImageId tags in the generator section fields to point out attached images to use as a baseline to a scene, character, subject etc. Example:

{
  "scene": {
    "environment": "exterior looking like @Image1",
  }
}

[
{
	"type": "primary",
	"description": "cyberpunk hacker protagonist, mid-20s Asian female with cybernetic enhancements, wearing a sleek black trench coat with glowing blue circuits, augmented reality glasses, holding a holographic data pad",
	"position": "foreground center, standing on a wet sidewalk",
	"pose": "dynamic, looking over shoulder at approaching drone",
	"expression": "determined and vigilant",
	"details": {
		"hair": "short neon-pink bob, wet from rain",
		"clothing": "leather boots, tactical gloves, embedded LED strips",
		"accessories": "earpiece communicator, backpack with tech gadgets"
	}
},
{
	"type": "secondary",
	"description": "swarm of surveillance drones with red scanning lights",
	"position": "midground, hovering in the air pursuing the protagonist",
	"count": 5,
	"details": {
		"size": "small quadcopters",
		"features": "rotating propellers, camera lenses, metallic chassis with corporate logos"
	}
},
{
	"type": "background_elements",
	"description": "crowds of pedestrians with umbrellas, street vendors selling cyberware, holographic advertisements projecting from buildings",
	"position": "background streets and alleys",
	"count": "multiple",
	"details": {
		"diversity": "mix of humans, androids, and cyborgs in varied attire"
	}
}]

Add Subject

Description*

Position

Pose

Expression

Count

Details

Hair

Clothing

Accessories

Size

Features

Diversity

Subject Objects

Each subject object has:

Type*

Categorizes the subject (e.g., main focus vs. background).

Required: Yes.
Values: "primary", "secondary", "background_elements".
Examples: "primary", "secondary".

Description*

Detailed textual depiction.

Required: Yes.
Values: Any descriptive text.
Examples: "cyberpunk hacker protagonist, mid-20s Asian female with cybernetic enhancements...", "swarm of surveillance drones...".

Position*

Placement in the composition.

Required: Yes.
Values: "foreground center", "midground", "background".
Examples: "foreground center, standing on a wet sidewalk".

Pose*

Body position or action.

Required: Optional in some subjects.
Values: Descriptive like "dynamic", "sitting".
Example: "dynamic, looking over shoulder at approaching drone".

Expression

Facial emotion.

Required: Optional.
Values: "happy", "determined".
Example: "determined and vigilant".

Count

Number of instances.

Required: Optional.
Type: Integer or string like "multiple".
Values: 1+, or "multiple".
Examples: 5, "multiple".

Details (sub-object, optional):

Fine-grained attributes.

Type: Object.

Sub-parameters vary by subject, e.g.:

Hair: Description of hair (string, e.g., "short neon-pink bob").
Clothing: Outfit details (string).
Accessories: Items (string).
Size: For objects (string, e.g., "small quadcopters").
Features: Specific traits (string).
Diversity: Variety in elements (string).

An array of objects describing spoken lines by subjects, including timing, voice style, and lip-sync. This enables narrative elements in the video.

Required: Optional.
Type: Array with two objects.
Values: An array (0+ items) where each object represents a dialogue instance. Order them chronologically for best syncing. If no dialogue, use an empty array [].
Example: An array with two objects.

Dialogue Objects

Each dialogue object has the following sub-parameters:

Subject*

Identifies which subject (from the "subjects" array) is speaking. Links audio to visuals for lip-sync.

Required: Yes (for each dialogue object).
Values: References like "primary", "secondary", or more descriptive secondary ids (e.g., "secondary_drone"). Must match a subject's "type" or be descriptive if not exact.
Example: "primary" or "secondary_drone".

Timestamp Seconds*

Specifies when the dialogue starts in the video timeline (for precise syncing).

Type: Float or integer.
Required: Yes, to avoid random placement.
Values: Non-negative number (e.g., 0.0 to video duration). Use decimals for sub-second precision. Should be less than or equal to "duration_seconds" in generation_parameters.
Example: 2.5 or 6.0.

Text*

The actual spoken words.

Required: Yes.
Values: Any dialogue text. Keep short (under 50 words per instance) for natural delivery. Supports multiple languages if the model handles them.
Example: "They're closing in... I need to hack the grid now." or "Target acquired. Initiating scan.".

Voice

Describes the voice characteristics for text-to-speech generation.

Required: Optional, defaults to neutral.
Values: Descriptive like "male, deep, authoritative" or "female, young, excited". Include accents (e.g., "British"), effects (e.g., "echoey", "robotic"), or age/gender qualifiers.
Example: "female, mid-20s, determined tone with slight echo from earpiece" or "robotic, modulated, emotionless".

Lip Sync

Enables automatic lip movement syncing for the subject (if it's a character with a visible mouth).

Type: Boolean.
Required: Optional, defaults to false.
Values: true or false. Set to true for humanoid subjects; false for non-speaking elements like robots without lips.
Example: true or false.


GENERATE	COPY	DOWNLOAD	IMPORT


GENERATE	COPY	DOWNLOAD	IMPORT

GROK IMAGINE PROMPTS

INTRODUCTION

STEPS

FEATURES

TIPS

OUTPUT

USING IMAGES

Meta

Generation Parameters

Image

Scene

Style

Lighting

Camera

Mood

Color Palette

Composition

Background

Technical Specs

Subjects

Add Subject

Details

Audio

Negative Prompts

Prompt

Log

Load Content

Save Content

Like this:

GET TO KNOW YOURSELF

IGNITE YOUR POTENTIAL

MAKE MONEY WORK FOR YOU

FINANCIAL EVENTS

PICK MY BRAIN

Your visit is much appreciated

Get to know me

Pick my brain

INTRODUCTION

STEPS

FEATURES

TIPS

OUTPUT

USING IMAGES

Meta

Generation Parameters

Image

Scene

Style

Lighting

Camera

Mood

Color Palette

Composition

Background

Technical Specs

Subjects

Add Subject

Details

Audio

Negative Prompts

Prompt

Log

Share this:

Like this:

Your visit is much appreciated

Get to know me

Pick my brain