humAI (human & AI) interaction @ reflection

these days, I developed a new habit of browsing Midjourney Showcase every morning to feel inspired with the possibilities of the future.

today while browsing the showcase, i came across an image generated by the Midjourney that immediately drew my attention as a Vietnamese: the beautiful landscape of Halong Bay.

The level of details is amazing, and also as a newly-adopted habit, I started to look at the prompt of the image and surprise at the volume of the prompt, which was:

a colonial landscape of the Ha Long Bay, ultra realistic cinematic photography, detailed, hyper realism, high details, 8k cinematic, high resolution, symmetrically, cinematographic, color grading, photography, shot on 50mm lens, ultra wide angle, depth of field, hyperdetailed, beautifully color coded, insane detail, intricate details, beautiful color grading, Unreal Engine, cinematic, color grading, Editorial Photography, Photography, Photoshoot, 70mm lens shot, Depth of field, DOF, Tilt blur, Shutter speed 1/ 1000, F/ 22, White balance, 32k, Super resolution, Megapixel, ProPhoto RGB, VR, Lonely, Good, Massive, Partial lighting, Backlighting, Daytime Lighting, Incandescent, Fiber Optic, Moody Lighting, Cinema Lighting, Studio Lighting, Soft Lighting, Volumetric, Contre – Jour, Beautiful Lighting, Accent Lighting, Global Illumination, Ilumi Screen Space Global Nation, Ray Tracing Global Illumination, Optics, Scattering, Glow, Shadows, Harsh, Shimmering, Ray Tracing Reflections, Lumen Reflections, Screen Space Reflections, Diffraction Grading, Chromatic Aberration, Offset from GB, Scan Lines, Ray Tracing, Ambient Ray Tracing, Anti – Aliasing, FKAA, TXAA, RTX, SSAO, Shaders, OpenGL – Shaders, GLSL – Shaders, Post Processing, Post Production, Cel Shading, Tone Mapping, CGI, VFX, SFX, insanely detailed and intricate, hypermaximalist, elegant, hyperrealistic, superdetailed, dynamic pose, photography, ultra realistic, Full – HD, high detail high definition, Hyper detailed, 8k, ultra, –ar 16:9 –v 5
@max.beau | Midjourney Community

but as I look closed to the prompt, I knew that something is not right. there are repeated and conflicting keywords. for example, 50mm and 70mm are mostly used for portrait, in order words, it is very hard for you get an “ultra wide landscape” with this feature.

i start to wonder how many of the keywords that the AI can truly understand and make meaning out of it. I came to another friend to ask for advise, ChatGPT 4.0.

Questions:

How many of the below keywords that the Midjourney AI can make meaning and bring to the AI-generated image?

Answer:

The Midjourney AI can interpret and incorporate many of the keywords provided into an AI-generated image, although some may be redundant or overlapping in their effects. Here’s a list of keywords that the AI can make use of:

Colonial landscape of the Ha Long Bay

Ultra-realistic

Cinematic photography

Hyper-realism

High details

8k resolution

Symmetrically

Cinematographic

Color grading

Shot on 50mm lens

Ultra wide angle

Depth of field

Beautifully color-coded

Insane detail

Intricate details

Beautiful color grading

Unreal Engine

Cinematic

Editorial photography

Photoshoot

70mm lens shot

Tilt blur

Shutter speed 1/1000

F/22

White balance

32k resolution

Super resolution

Megapixel

ProPhoto RGB

VR

Lonely

Good

Massive

Partial lighting

Backlighting

Daytime lighting

Incandescent

Fiber optic

Moody lighting

Cinema lighting

Studio lighting

Soft lighting

Volumetric

Contre-Jour

Beautiful lighting

Accent lighting

Global illumination

Ray tracing global illumination

Optics

Scattering

Glow

Shadows

Harsh

Shimmering

Ray tracing reflections

Lumen reflections

Screen space reflections

Diffraction grading

Chromatic aberration

Offset from GB

Scan lines

Ray tracing

Ambient ray tracing

Anti-aliasing

FKAA

TXAA

RTX

SSAO

Shaders

OpenGL – Shaders

GLSL – Shaders

Post-processing

Post-production

Cel shading

Tone mapping

CGI

VFX

SFX

Hypermaximalist

Elegant

Hyperrealistic

Super detailed

Dynamic pose

Ultra-realistic

Full-HD

High detail high definition

8k resolution

AR 16:9

V5

Please note that some of the terms may not directly translate into visual elements, but they can help guide the AI to create a specific style, mood, or visual effect in the image.

almost 90 keywords are crazy to think about. but that’s the quantity, how about the quality? i continue to have a conversation with ChatGPT

Question:
How Midjourney may prioritize these above-mentioned keywords?

Answer:

Midjourney AI would prioritize the above-mentioned keywords based on the following factors:

Core subject: The primary focus of the image should be on the core subject, which is the colonial landscape of Ha Long Bay in this case.

Composition and perspective: Keywords related to composition, such as ultra-wide angle, depth of field, and shot on 50mm/70mm lens, would be prioritized to set up the basic structure and viewpoint of the image.

Visual quality: The AI would prioritize keywords that define the desired resolution, detail, and realism of the image, like ultra-realistic, 8k resolution, high detail high definition, and hyper-realism.

Lighting and mood: Lighting-related keywords, such as backlighting, moody lighting, cinema lighting, and studio lighting, would be prioritized to establish the overall mood and atmosphere of the image.

Color and tone: Keywords related to color grading, beautifully color-coded, and ProPhoto RGB would be considered to give the image its distinct color palette and visual appeal.

Technical and visual effects: Keywords related to specific effects or techniques, such as ray tracing, global illumination, shaders, post-processing, and anti-aliasing, would be prioritized to enhance the visual impact and achieve the desired style.

Additional attributes: Keywords like lonely, good, and massive would be prioritized to convey the intended emotional tone or scale of the image.

To create an AI-generated image that meets the desired criteria, Midjourney AI would need to find a balance between these keywords and make trade-offs when necessary. For example, it might be challenging to incorporate both ultra-wide angle and 50mm/70mm lens shots in the same image, or to balance a high level of detail with other visual effects. In such cases, the AI would prioritize the keywords that are most essential to achieving the desired outcome, while still trying to incorporate as many of the other keywords as possible.

honestly, that’s a great moment to see that I make a same judgement as the AI! I ask it to continue.

Question: what are some more of the conflicting keywords in the prompt?

Answer:

There are a few conflicting keywords in the prompt that may pose challenges in incorporating them simultaneously into a single AI-generated image. Here are some examples:

Shot on 50mm lens vs. ultra-wide angle: The 50mm lens typically provides a more natural perspective similar to human vision, while an ultra-wide angle lens creates a much wider field of view, often with distortion at the edges. These two effects are difficult to incorporate in the same image.

Depth of field vs. hyper-realism and high details: Depth of field creates a selective focus, with blurred areas in the foreground and/or background. In contrast, hyper-realism and high details suggest that the image should have sharpness and clarity throughout. Balancing these effects can be challenging.

Moody lighting vs. daytime lighting: Moody lighting typically involves darker, more dramatic, and low-key lighting, whereas daytime lighting suggests bright and natural illumination. Achieving both effects in a single image may not be feasible.

Soft lighting vs. harsh: Soft lighting results in diffused, gentle illumination with soft shadows, while harsh lighting creates more intense and direct illumination with sharp shadows. These two effects are opposite and cannot coexist in the same image.

Shutter speed 1/1000 vs. F/22: A fast shutter speed of 1/1000 helps capture fast-moving subjects without motion blur, while a small aperture like F/22 increases the depth of field and requires more light. In most cases, a fast shutter speed and a small aperture are not used together due to the amount of light required to properly expose the image.

These conflicting keywords would require trade-offs and prioritization based on the desired final outcome of the AI-generated image.

I found that it’s pretty enough to conclude with the final prompt.

Question: How can we sharpen the first prompt?

To sharpen the first prompt and make it more concise, you can focus on the essential elements and remove redundant or conflicting keywords. Here’s a revised version of the prompt:

“Create an AI-generated image featuring a colonial landscape of Ha Long Bay with ultra-realistic, cinematic photography. The image should be high resolution (8k), with depth of field, beautiful color grading, and moody lighting. Use a 50mm lens perspective and prioritize the composition, detail, and atmosphere. Incorporate ray tracing, global illumination, and shaders for enhanced visual effects.”

This prompt retains the key elements while removing redundancies and conflicting keywords. It emphasizes the desired visual style, subject matter, and technical specifications, providing a clearer direction for the AI-generated image.

___________________

After this conversation, I have a thought about a future in which human-ai interaction will replace human-computer interaction. That may be a distant future (!?), but I find it intriguing and enlightening. As humans, it is more important than ever to remain human, curious, critical, creative, protective of our own nature, and open to change.

In that it can provide answers to questions that we have, I’ve found that AI is a lot like my Tarot cards. Although the Tarot Card’s answer isn’t always accurate, it is ultimately up to us to give it meaning and take responsibility for the choices we make.

But, I mean, isn’t it an incredible time to be alive?

humAI (human & AI) interaction @ reflection

Leave a comment Cancel reply