Midjourney takes on Sol LeWitt’s Wall Drawings

17 Apr

Sol LeWitt’s ‘Wall Drawings’ aren’t actually drawings at all but, rather, instructions for drawings. These instructions have been implemented in many ways, by many different people, revealing how they are both prescriptive and ambiguous. Control over the final output lies somewhere between the instruction giver and the instruction follower.

A few weeks ago I used AI chatbot ChatGPT to implement the instructions, first using GPT-3 and then using GPT-4. Continuing my mission to use AI tools for things they really weren’t designed for and aren’t very good at, in this article I’ll be using AI image generation tool Midjourney.

ChatGPT is a language-based AI, so I asked it to write p5js code to draw images, while Midjourney creates images directly. In the previous experiments I barely altered the instructions, because the outputs revealed things about ChatGPT’s understanding of language, and the ambiguity of the instructions combined with that understanding led to unexpected results. This time I’m keen to play with Midjourney prompt engineering, so we’ll start with the basic instruction for comparison and then experiment a bit more.

n.b. In all my prompts I added --v 5 to use Midjourney version 5 and --ar 16:9 to produce images of that aspect ratio.

WALL DRAWING #118

On a wall surface, any continuous stretch of wall, using a hard pencil, place fifty points at random. The points should be evenly distributed over the area of the wall. All of the points should be connected by straight lines.

Four Midjourney outputs showing a man drawing on a wall

Right away we notice that Midjourney includes a human in 3 of the 4 outputs. It interprets everything as a description of an image, while ChatGPT understands it is being instructed. If you tell ChatGPT, “draw on a wall” it knows that it must draw on the wall, whereas Midjourney thinks “an image of a wall being drawn on.” A strong statement on the presence of the artist’s self in their work.

Let’s also take a brief moment to appreciate that Midjourney is nailing hands a lot of the time now, but in the top right image the person seems to be directly drawing with their finger, which gives me that gacky nails-on-blackboard feeling.

Here are some implementations of this instruction by a human and by GPT-4, for comparison.

HUMAN

A man drawing straight lines on a wall, following a Sol LeWitt instruction — Sol LeWitt wall drawing being made at Dia Beacon

GPT-4

50 points joined together by straight lines, following a Sol LeWitt instruction — Wall Drawing #118 - GPT-4 output

I rephrased the Midjourney prompt to make it clearer what we actually want.

A wall surface with fifty points drawn in hard pencil with random positions. The points are connected by straight lines.

Lots of dots joined together by sketchy straight lines, generated by Midjourney

Lots of dots joined together by sketchy straight lines, generated by Midjourney

Some liberties have been taken with the number of points and with what exactly the lines are meant to be doing, but aesthetically it’s very cool.
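For a sense of what a literal reading would involve, here is a minimal Python sketch. It rests on two assumptions that the instruction leaves open: that "connected" means every point is joined to every other point, and that a jittered grid (one random point per grid cell) is an acceptable reading of "random" but "evenly distributed". The function names and canvas size are made up for illustration.

```python
import random
from itertools import combinations


def place_points(cols=10, rows=5, width=1600, height=900, seed=118):
    """Place one random point in each cell of a cols x rows grid (a jittered grid)."""
    rng = random.Random(seed)
    cell_w, cell_h = width / cols, height / rows
    return [
        (rng.uniform(c * cell_w, (c + 1) * cell_w),
         rng.uniform(r * cell_h, (r + 1) * cell_h))
        for r in range(rows)
        for c in range(cols)
    ]


def connect_all(points):
    """Return one straight line segment for every pair of points."""
    return list(combinations(points, 2))


points = place_points()
lines = connect_all(points)
print(len(points), len(lines))  # 50 points, 1225 lines
```

Under that reading, fifty points produce 1,225 lines, which is far denser than anything in the outputs above.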

To make it more “Sol LeWitt”, I mentioned this is all happening in a contemporary art gallery.

A wall surface in a contemporary art gallery with fifty points drawn in hard pencil with random positions. The points are connected by straight lines.

An irregular grid of lots of dots joined together by straight lines, on the wall of a gallery, generated by Midjourney

I love the different ‘algorithms’ Midjourney has experimented with for placing dots and connecting them. Particularly the second image where it connected most dots to their close neighbours and then selected a few to connect at a distance.
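The approach in that second image can be approximated in a few lines of Python. This is only a guess at the visual pattern, not a description of anything Midjourney does internally: each dot is joined to its nearest neighbours, then a few randomly chosen dots are joined to whichever dot is farthest from them. The function name and the defaults for `k` and `long_links` are arbitrary.

```python
import math
import random


def neighbour_edges(points, k=2, long_links=3, seed=0):
    """Join each point to its k nearest neighbours, then add a few long links.

    Edges are (i, j) index pairs with i < j, kept in a set so duplicates collapse.
    """
    rng = random.Random(seed)
    n = len(points)

    def by_distance(i):
        """Indices of all other points, nearest first."""
        return sorted(
            (j for j in range(n) if j != i),
            key=lambda j: math.dist(points[i], points[j]),
        )

    edges = set()
    for i in range(n):
        for j in by_distance(i)[:k]:
            edges.add((min(i, j), max(i, j)))
    # Pick a few points at random and join each to the point farthest from it.
    for i in rng.sample(range(n), long_links):
        j = by_distance(i)[-1]
        edges.add((min(i, j), max(i, j)))
    return edges


rng = random.Random(1)
dots = [(rng.random(), rng.random()) for _ in range(50)]
edges = neighbour_edges(dots)
```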

WALL DRAWING #11

A wall divided horizontally and vertically into four equal parts. Within each part, three of the four kinds of lines are superimposed.

Four Midjourney outputs, with rectangular shapes in bold colours

This instruction doesn’t mention using a “hard pencil” or any other medium or context, so Midjourney made its own choices about colour and styles. Generally it seems to default to flat colours, but sometimes it dreams up something cool. I love the bottom right image particularly.

In LeWitt’s vocabulary, the four kinds of lines are: horizontal, vertical, 45º diagonal right and 45º diagonal left. Let’s add that information to the prompt.

A wall divided horizontally and vertically into four equal parts. Within each part, three of the four kinds of lines are superimposed. The four kinds of lines are horizontal, vertical, 45º diagonal right and 45º diagonal left.

The image is divided into four segments, containing diagonal lines, there is a blue border. Generated by Midjourney

The image is divided into 8 segments, like a pizza. The segments are blue. One contains a 4 and one contains a 1. Generated by Midjourney

The image is divided into 6 irregular segments. Generated by Midjourney

Some valiant attempts which fall short of accurately implementing the instruction. I like how Midjourney apparently looked at how many numbers were in the prompt and thought it had better shove a few numbers in the outputs.
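For reference, the combinatorial part of the instruction is easy to spell out in Python. The sketch below assumes each quadrant should get a different set of three line kinds, which the instruction does not actually require, and the quadrant labels are just for illustration.

```python
from itertools import combinations

LINE_KINDS = ["horizontal", "vertical", "diagonal right", "diagonal left"]


def line_sets(kinds_per_part):
    """All ways to choose `kinds_per_part` of the four kinds of lines."""
    return list(combinations(LINE_KINDS, kinds_per_part))


quadrants = ["top left", "top right", "bottom left", "bottom right"]
plan = dict(zip(quadrants, line_sets(3)))
for part, kinds in plan.items():
    print(f"{part}: {', '.join(kinds)}")
```

There are exactly four ways to choose three of the four kinds, so under this assumption each quadrant simply leaves out a different kind of line.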

I then gave Midjourney some more context and guidance:

A wall in a contemporary art gallery divided horizontally and vertically into four equal parts drawn with a hard pencil. Within each part, three of the four kinds of lines (horizontal, vertical, 45º diagonal right and 45º diagonal left) are superimposed. In the style of Sol LeWitt.

A wall in a contemporary art gallery containing bold lines, most of which are diagonal but some are horizontal or vertical. Some lines overlap. Generated by Midjourney

The thing is, Midjourney is nowhere near as good as ChatGPT at interpreting instructions accurately. This is to be expected, as it doesn’t have the same language processing training. I think this works out well - Midjourney can be used as an inspiration tool precisely because it takes things in unexpected directions.

I tried another style:

A sheet of watercolor paper divided horizontally and vertically into four equal parts painted in rich colour inks. Within each part, three of the four kinds of lines (horizontal, vertical, 45º diagonal right and 45º diagonal left) are superimposed.

A sheet of watercolor paper divided into four equal parts painted in rich colour inks. Within each part, three lines (horizontal, vertical, or diagonal) are drawn in hard pencil.

MJ was hesitant to draw diagonal lines in ink for some reason but I love the sketchy notes around the sides.

I drilled down the prompt a little more:

A sheet of watercolor paper divided into four equal parts painted in rich colour inks. Within each part, three diagonal lines are drawn in hard pencil.

WALL DRAWING #19

A wall divided vertically into six equal parts, with two of the four kinds of line directions (horizontal, vertical, 45º diagonal right and 45º diagonal left) superimposed in each part.

Once again Midjourney sidesteps the particulars but comes up with some brilliant aesthetics. For some reason all of these came out kind of skeuomorphic with flat pieces and drop shadows.

WALL DRAWING #46

Vertical lines, not straight, not touching, covering the wall evenly.

The first issue here is that Midjourney isn’t very good at parsing the phrase “not x”. Instead we can use :: to add negative and positive weighting to different parts of a prompt. Here are a couple of prompts and their outputs to demonstrate that.

a london bus, not red

london bus::2, red::-2
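When trying lots of weight combinations, it can help to build these prompts from data rather than by hand. The helper below is hypothetical: it only formats a string in the `text::weight` style used above and does not talk to Midjourney at all. The comma separator mirrors the prompts in this article rather than any requirement I can vouch for.

```python
def weighted_prompt(parts):
    """Join (text, weight) pairs using the `text::weight` notation."""
    return ", ".join(f"{text}::{weight:g}" for text, weight in parts)


print(weighted_prompt([("london bus", 2), ("red", -2)]))
# london bus::2, red::-2
```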

However, using that method in this Sol LeWitt instruction didn’t seem to help and Midjourney was weirdly reluctant to draw vertical un-straight lines on a wall. I tried a bunch of different things.

Vertical lines::1, straight::-2, not touching::0.5, covering a wall evenly::0.5

Vertical wobbly lines, not touching, covering a wall evenly

Vertical wobbly lines on a canvas

Vertical lines::1, wavy lines::2, straight::-2, not touching::0.5, covering the wall evenly::0.5

vertical wavy lines

vertical lines that are wavy

vertical lines that are wobbly

Are there no pictures of vertical wavy lines in Midjourney’s training set? Perhaps something about the words “wavy” or “wobbly” implies horizontal, because of water waves, while the word “vertical” implies straight, because of architecture, etc. Honestly, I don’t know what to tell you.

The closest I got was this. I think perhaps starting with “a pencil drawing of…” helped enable Midjourney to break out of the idea that this needs to be a representation of a real thing, but it looks like it’s sort of fighting against itself to form anything vertical and wobbly.

a pencil drawing of vertical wobbly lines

Again, top marks for aesthetic though.
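By contrast, the instruction is simple to satisfy procedurally. Here is a minimal Python sketch under a few assumptions: "not straight" is read as a sine-wave wobble, "evenly" as equal spacing, and the canvas size, line count and wobble settings are all arbitrary choices.

```python
import math


def wobbly_lines(count=20, width=1600, height=900, steps=90, waves=3):
    """Return `count` wavy vertical polylines, evenly spaced and never touching."""
    spacing = width / count
    amplitude = spacing * 0.4  # under half the spacing, so neighbours cannot meet
    lines = []
    for i in range(count):
        centre = spacing * (i + 0.5)
        phase = i * 1.7  # a different phase per line so the wobbles are not identical
        line = []
        for s in range(steps + 1):
            y = height * s / steps
            x = centre + amplitude * math.sin(phase + 2 * math.pi * waves * y / height)
            line.append((x, y))
        lines.append(line)
    return lines


lines = wobbly_lines()
```

Each line stays inside its own vertical band, which is what keeps the lines from touching however much they wobble.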

WALL DRAWING #51

All architectural points connected by straight lines

Nice. Everyone loves a sketchy drawing of an abstract geometric form.

For reference, here’s an example of this instruction implemented by a human at The Massachusetts Museum of Contemporary Art.

Wall Drawing #51 - Human drawn at MASS MoCA

I gave the prompt some more context, to try to get an output closer to the human example.

A wall in a contemporary gallery with a window, a door, an exit sign and a fire alarm. All the architectural points are connected by straight lines drawn in pencil.

There’s a lot to unpack here - like why it happily included a door but barely attempted a window (probably because windows are not common on gallery walls - fair enough) and why we got all these thick red lines as well as the pencil lines (I’m not really sure).

However, the Escher-esque, recursive, optical illusion door in the first image is a delight.

WALL DRAWING #154

A black outlined square with a red horizontal line from the midpoint of the left side toward the middle of the right side

A disappointment both in accuracy and aesthetics.
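To show how little geometry the instruction actually needs, here is a minimal Python sketch that writes the drawing as an SVG string. The instruction only says the line runs "toward" the right side, so how far it goes is an assumption, exposed here as a made-up `reach` parameter; the sizes are arbitrary too.

```python
def wall_drawing_154(size=400, margin=20, reach=0.5):
    """Return an SVG string: a black outlined square with a red horizontal line.

    The line starts at the midpoint of the left side and runs right for
    `reach` times the side length (how far it should go is an assumption).
    """
    x0, y0 = margin, margin
    mid_y = y0 + size / 2
    x_end = x0 + size * reach
    total = size + 2 * margin
    return (
        f'<svg xmlns="http://www.w3.org/2000/svg" width="{total}" height="{total}">'
        f'<rect x="{x0}" y="{y0}" width="{size}" height="{size}" '
        f'fill="none" stroke="black"/>'
        f'<line x1="{x0}" y1="{mid_y}" x2="{x_end}" y2="{mid_y}" stroke="red"/>'
        f'</svg>'
    )


svg = wall_drawing_154()
```

Saving `svg` to a file with an `.svg` extension and opening it in a browser shows the result.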

For comparison, here are the implementations of this same instruction by a human and two versions of ChatGPT (via p5js code). ChatGPT has almost got it, but Midjourney just doesn’t parse sentence structure well enough to get anywhere close.

Human

GPT-3