Get all your news in one place.
100’s of premium titles.
One app.
Start reading
The Conversation
The Conversation
Lifestyle
Mitch Goodwin, Faculty of Arts, The University of Melbourne

Synthetic futures: my journey into the emotional, poetic world of AI art making

MidJourney text prompt: 'HAL the computer approaching through the foggy morning mist'. Author provided

Generative art making is flourishing. Algorithms that turn text prompts into images, such as DALL-E and Stable Diffusion, are emerging as viable creative tools. And they’re fuelling much debate about their artistic legitimacy and potential to pinch our jobs.

The sudden leap in fidelity of artificial intelligence (AI) art production has been made possible by advances in deep learning technologies, in particular natural language processing and generative adversarial networks.

In essence, a user can input a text description and the algorithm auto-translates this into a cohesive image.

images generated by MidJourney AI, prompts 'The Singularity emerges fully formed from the mainframe', 'Employees leaving the Lumiere factory, Paris 1890'
MidJourney interface showing the four-panel grid of image results from two separate text prompts. From here the user can choose to either upscale (U) or create further variations (V) from any of the four results. Mitch Goodwin/MidJourney, Author provided

MidJourney – or MJ as it is known to its passionate users – is perhaps the most seductive technology for its painterly output and poetic interactions. The charm begins from the very first moment, with the command line prompt “/imagine”.


Read more: Give this AI a few words of description and it produces a stunning image – but is it art?


Augmented imagination

MidJourney founder David Holz has said users find their text-to-image interactions to be a “deeply emotional experience” with the potential for it to be therapeutic. He said:

There’s a lot of beautiful stuff happening.

Triptych: Larry David as a warrior princess, a design for a phonograph by Trent Reznor, and a close-up of a rose
L-R: ‘Larry David’ from the Warrior Princess series by Brian Penny (2022-09-03); ‘An intricate schematic of a Victorian phonograph designed by Trent Reznor’ by GM Gleeson, (@NooYawkGurrl, 2022-07-15); ‘A rose is a rose is a rose’ by Mitch Goodwin (2022-07-26).

MidJourney plays with genre and form, using existing principles that have long informed media arts practice, such as non-linearity, repetition and remix, to exploit the archive.

Holz has suggested the algorithm’s purpose is to “augment our imagination”.

My first image requests were whimsical queries, nocturnal flights of fancy, gentle tentative casts into the virtual spirit world.

As it turns out, my melancholic prompts were unnervingly well-suited to the algorithm’s default aesthetic.

Triptych: a discarded surgical mask; a cyborg with glowing red eyes; a mother and a child sitting in a flooded railway carriage watching other commuters.
L-R: ‘Surgical mask discarded on a wet dirty street. In the distance, the radio plays the Beach Boys’ (2022-07-15); from the portrait series ‘Call Centre Cyborgs, 2037 AD’ (2022-07-27); ‘I thought we were all in this thing together … but I was wrong’(2022-07-14)

Magic lurks within the algorithm too. Ilya Sutskever, co-founder and chief scientist at OpenAI, describes the process as “transcendent beauty as a service”.

Artist and theorist Lev Manovich has poetically described his interactions with MidJourney as akin to working with a “memory machine”.

The recognition it is a service but also a metaphysical experience is a new way of thinking about tools of automation.

The technical process can be an imprecise science in which slippages and overlaps are inevitable. As Manovich recognises, MidJourney remixes:

something from real history and popular stereotypes – real knowledge and fantasies. But we should not blame it, because we do exactly the same, all the time.

An illustrated cloud; a snow-filled stage.
From the series, ‘My Favourite Century’ and ‘Stage designs for an unwritten play’ Lev Manovich/MidJourney/Facebook

Collaborative remixing

The MidJourney Bot is hosted on the social platform Discord creating an intoxicating cascade of generative screen works.

It is an inherently communal experience. The image stream also functions as a site of shared creation. If another user’s composition catches your eye, you can co-opt their prompt – or the image itself – and refine it according to your own aesthetic preferences.

This collaborative remixing is what makes the MidJourney Discord channel as much a social experiment as a scientific one.

Joker's mask in rubbish filled alleway, grubby New York skyline; a raging bushfire consumes a blackened forest.
Text prompts: ‘The joker’s grubby mask discarded in an alleyway, Brooklyn, NYC.’ and ‘A new age dawns as the last of the forests are consumed.’ Mitch Goodwin & Kesson/MidJourney, Author provided

My research into the darkening aesthetics of digital media means I am somewhat predisposed to spotting dystopian visions. The MidJourney Discord channel is certainly a seductive rabbit hole for voyeurs of destruction.

Ghastly cyborg futures and post-nuclear wastelands would seem to be de rigueur for the AI prompt engineer. I regularly see prompts citing the retro-futurist nightmares of artists such as HR Giger and Zdzisław Beksiński and the cinematic tendencies of David Lynch and Andrei Tarkovsky.

As Bowie crooned on the cyber-noir album Outside, itself a chronicle of art world depravity: “there is no hell, like an old hell”.

Users are also finding ways to apply the technology in a moving image context. Notable efforts include a generative fashion demo, morphing amoebas narrated by a synthetic David Attenborough, Fabian Stelzer’s crowdsourced narrative SALT_VERSE and Drew Medina’s mesmerising fractal film Monsters.

The most meaningful assemblage I have come across is Gabriele Dente’s SOLAR (the history of humanity drawn by machines), accompanied by a manifesto highlighting the associated ethical and industrial implications of neural networks.

Digital tools have long been enablers of speed, dexterity and adaptability for designers and artists. Studio professionals in the MJ community are already finding efficiencies in their workflows.

Two pages from a sci-fi comic book; a photographic studio; a hand holding a cold beer can
Comic book layout, ‘Death’s Dream Kingdom’ by Randall Rozzell (2022-07-27); beer label graphic by Bryan Launier (2022-07-26); Photo shoot using a Nine Inch Nails inspired industrial backdrop by Caleb Hoernschemeyer, Flow Productions, photo credit Jared Jasinski (2022-07-27). Randall Rozzell, Bryan Launier, Jared Jasinski & Caleb Hoernschemeyer/MidJourney/Facebook, Author provided

A startlingly beautiful example of the possibilities for design and concept ideation come from architect and designer Cesare Battelli.

His series “space-kangaroo” is evocative of a mode of conceptual design thinking that blends aesthetics, functionality and fantasy.

Architectural hybrid-design, ‘space-kangaroo’ by Cesare Battelli (2022-08-19) Cesare Battelli /MidJourney/Facebook

‘Spirit photography’

Eryk Salvaggio has described the technology of the more photo-realistic aspirations of the DALL-E platform as “a kind of spirit photography” conjuring images replete with the ghosts and markings of past technologies: the fading image, the decaying medium and the corrosive chemical reaction.

This ability of reconstituting the past and embellishing the outcome with techniques of capture and display and procedural degradation makes MidJourney especially fertile ground for “authentic” gestures of the fabulous and the fake.

A collage of ghostly faces in a severely degraded photographic images; a film studio showing Apollo 11 on a fake moon surface.
DALL-E variation of an image from a dataset of photographs by Hungarian photographer Costică Acsinte, by Eryk Salvaggio; The alternate history of the Apollo 11 moon landing, by Mitch Gates (2022-07-21). Eryk Salvaggio/DALL.E/Mitch Gates/MidJourney

How much this sudden uptick of synthetic media will contribute to the glut of misinformation online however is uncertain. How does the visual historical record accommodate its synthetic mirror?

We should also consider the evolutionary implications for language and computation. With the democratisation of AI assistants, the field of human computer interaction is evolving rapidly as are the inherent entanglements.

And so, tonight as the city sleeps I watch the feed and dream along with the machine. I punch in another text prompt and wait impatiently for my MidJourney Bot to conjure its response. All the while I’m wondering as to the reach of the text into the algorithm’s code, and to what extent it is, bit by bit, re-coding me?


Read more: AI art is everywhere right now. Even experts don't know what it will mean


The Conversation

Mitch Goodwin does not work for, consult, own shares in or receive funding from any company or organisation that would benefit from this article, and has disclosed no relevant affiliations beyond their academic appointment.

This article was originally published on The Conversation. Read the original article.

Sign up to read this article
Read news from 100’s of titles, curated specifically for you.
Already a member? Sign in here
Related Stories
Top stories on inkl right now
One subscription that gives you access to news from hundreds of sites
Already a member? Sign in here
Our Picks
Fourteen days free
Download the app
One app. One membership.
100+ trusted global sources.