The following image, which the Office generated by entering a prompt into a popular commercially available AI system [Gemini], illustrates this point:
Prompt:
professional photo, bespectacled cat in a robe reading the Sunday newspaper and smoking a pipe, foggy, wet, stormy, 70mm, cinematic, highly detailed wood, cinematic lighting, intricate, sharp focus, medium shot, (centered image composition), (professionally color graded), ((bright soft diffused light)), volumetric fog, hdr 4k, 8k, realistic
Output:
This prompt describes the subject matter of the desired output, the setting for the scene, the style of the image, and the placement of the main subject. The resulting image reflects some of these instructions (e.g., a bespectacled cat smoking a pipe), but not others (e.g., a highly detailed wood environment). Where no instructions were provided, the AI system filled in the gaps.
For instance, the prompt does not specify the cat’s breed or coloring, size, pose, any attributes of its facial features or expression, or what clothes, if any, it should wear beneath the robe. Nothing in the prompt indicates that the newspaper should be held by an incongruous human hand.