A journey into midjourney (image generation)

I never imagined a creative process could be this much fun, since I usually have difficulty visualising my thoughts. Still, midjourney is an excellent example of how text can become visual in seconds.

So let's start prompting, shall we?

Midjourney v4, February 2023

Of course, I started with our dog because what else is essential in life? (see input picture further below)

jetsons cartoon

And since my girlfriend works at a tour guide company that goes through Berlin by bike...

a modern logo for a tourguide company in berlin called "Berlin on bike",comic style,fresh

I then asked for more variants on option #4:

I've also done a few variations on pictures from my girlfriend:

...and found the outcomes mediocre at best, to be honest.

Finally, I had to put myself to the test, mainly using the images/avatar you might know from my site / social media to inspire midjourney to make some crazy attempts.

...to turn me into an icon

as an icon,abstract,modern,tech

...or a logo...

...or maybe a superhero?... (the model has some serious bias issues btw)

...but the cartoon/funny style made the best impression...

...which eventually led me into the DALL·E rabbit hole, ending up with this:

So one of the nicest results might be something like this:

group of people enjoying their bikeride in Berlin Germany, anime

(even though there's not much enjoyment to be seen)

Here comes v5!

Things move fast in this space, and midjourney v5 is already here.

[!NOTE] midjourney v5: This model has very high coherency, excels at interpreting natural language prompts, is higher resolution, and supports advanced features like repeating patterns.

So let's start with the prompt I ended with in v4:

As you can already see, the difference is huge. It seems the model has a different understanding of 'anime', and I probably also primed it less well. So I made the prompt a bit clearer:

group of people enjoying their bikeride in Berlin Germany, anime cartoon style

And this already looks much more like a usable result, given some attention (there are still some artifacts around the bikes).

v5 creates some interesting (although fitting in a way) results when you lack decent prompts:

cartoon style --s 1000

(using a picture of our dog as input)

With a single adjustment in the prompt, the result is a lot cooler:

cartoon style dog --s 250

And adding some imagination is the strong suit of these models:

cartoon style dog in the snow --s 500

For anyone asking, here's the original image I used to prompt the dog generation:

Cyberpunk (in Berlin)

I love science fiction and have been a great fan of everything cyberpunk. For some reason it gives me calm and some form of future perspective that I both love and loathe.

So a great inspiration for some prompting!

Berlin's brandenburger tor in cyberpunk style in 2222, late in the evening. It's raining, there's little people there. --s 1000 --ar 3:2 --v 5

Berlin in cyberpunk style in 2222, early in the morning. The brandenburger tor in the background. It's raining, there's little people there. --s 1000

Berlin in cyberpunk style in 2222, early in the morning, in the distance a hovertrain is leaving the train station. It's raining, there's little people there. --s 1000

Sometimes less is more: the last prompt produced too much train and too little Berlin.

So one more without Berlin (that I really liked):

a first person perspective from a cyberpunk city in 2222, late in the evening in neon lights, lots of people on the street photorealistic --s 1000
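Since the same flags keep coming back in these prompts (`--s`, `--ar`, `--v`), here is a minimal sketch of a helper that glues a description and its flags together. The function `build_prompt` and its argument names are made up for illustration; midjourney only ever sees the resulting text, and this is not an official API of any kind.

```python
from typing import Optional


def build_prompt(
    text: str,
    stylize: Optional[int] = None,
    aspect_ratio: Optional[str] = None,
    version: Optional[int] = None,
) -> str:
    """Join a description with the double-dash flags used in this post.

    Hypothetical helper: it only builds a string and does not check
    which values midjourney actually accepts.
    """
    parts = [text.strip()]
    if stylize is not None:
        parts.append(f"--s {stylize}")
    if aspect_ratio is not None:
        parts.append(f"--ar {aspect_ratio}")
    if version is not None:
        parts.append(f"--v {version}")
    return " ".join(parts)


# Rebuild the first Brandenburger Tor prompt from this section.
prompt = build_prompt(
    "Berlin's brandenburger tor in cyberpunk style in 2222, late in the evening. "
    "It's raining, there's little people there.",
    stylize=1000,
    aspect_ratio="3:2",
    version=5,
)
print(prompt)
```

Keeping the description separate from the flags makes it easy to rerun the same scene with a different stylize value or model version.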

Logo / icon generation in midjourney v5

I still had to try this, since it's something a lot of people would want to do (again, I used my regular avatar image as input).

as a cartoon logo, futuristic but subtle, imaginative but down to earth. Inquisitive --s 1000

The last one is quite funny!

Giving two images as prompt (and how it can go wrong quickly)

So I'm looking for a good look for me in a suit. I found an image on Instagram I loved, added my own and prompted for a result.

combine these images so the red guy is in the grey suit, photorealistic --s 1000

The result is actually hilarious and could well be used for some kind of ad campaign for a suit (without a shoot!), but it's not quite what I was looking for 😆

Spark of creativity

So stepping away from reality / realism, I found a prompt like this to be fascinating:

a group of people being in venice in the 17th century, taking a picture on top of a bridge, selfie style, enjoying themselves, men and women combined watercolor painting van gogh style --no phone --s 1000
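A prompt like this mixes the description with flags such as `--no phone` and `--s 1000`. To reuse the description elsewhere, a rough sketch like the one below can pull the two apart. The name `split_prompt` is my own invention, and it assumes the description itself never contains a space followed by two dashes.

```python
from typing import Dict, Tuple


def split_prompt(prompt: str) -> Tuple[str, Dict[str, str]]:
    """Split 'description --flag value ...' into (description, flags).

    Assumption: every ' --' starts a flag, and a flag's value is
    everything up to the next ' --'.
    """
    text, *chunks = prompt.split(" --")
    flags: Dict[str, str] = {}
    for chunk in chunks:
        # The first word is the flag name, the rest is its value.
        name, _, value = chunk.strip().partition(" ")
        flags[name] = value.strip()
    return text.strip(), flags


text, flags = split_prompt(
    "a group of people being in venice in the 17th century, taking a picture "
    "on top of a bridge, selfie style, enjoying themselves, men and women "
    "combined watercolor painting van gogh style --no phone --s 1000"
)
print(flags)  # {'no': 'phone', 's': '1000'}
```

The values stay plain strings on purpose, since a flag like `--no` takes words rather than numbers.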

the future, prompt engineering using chatGPT (?)

I will not go into detail too much, just watch this video:

https://www.youtube.com/watch?v=Asg1e_IYzR8

The results are quite stunning:

midjourney V6

Please note this is still the alpha version in use (release notes). By the end of December 2023, midjourney introduced the alpha version of V6. It's very impressive, stunning to say the least! Let me show you a few of my previous examples in this new iteration of midjourney so you (hopefully) see what I see as well.

Let's start with the bridge example of V5 that I used previously and see what happens:

a group of people being in venice in the 17th century, taking a picture on top of a bridge, selfie style, enjoying themselves, men and women combined watercolor painting van gogh style --no phone --s 1000

This is pretty awesome, right? The examples still need a bit of work, but I really like how the second and third are looking right now.

The model is still in the works, and it shows: there are quite a few artifacts. But the results are so much nicer, with a lot less 'prompting' needed. See the following:

a first person perspective from a cyberpunk city in 2222, late in the evening in neon lights, lots of people on the street photorealistic --s 1000

The results are simply way more photorealistic. They display a bit more imagination and feel more 'real', if you catch my drift.

Now let's use the example of our dog from before to look at image 'reading' instead. You remember her from V5, right?

cartoon style

Even though the result looks more 'realistic', this is not what I wanted. So again the prompt needs a bit of fine-tuning:

cartoon style dog --s 250

It's still far away from my 'original' image, but I like the outcomes a lot better 🙂

Now for my favourite topic, combining Berlin and Cyberpunk:

Berlin's brandenburger tor in cyberpunk style in 2222, late in the evening. It's raining, there's little people there. --s 1000 --ar 3:2

Again there's so much more detail, sharpness and realism to the image that I almost believe this is the real Tor in 2222.

Another variation brought me this:

Let's take a look at what MJ makes out of my own image when I try to make a logo out of it:

as a cartoon logo, futuristic but subtle, imaginative but down to earth. Inquisitive

What strikes me is that the images make more sense; there's just less nonsense in them. I chose style raw for these; if I put it back to style 1000, it looks a bit like this:

This just doesn't work that well, although I understand where the model gets the ideas from.

I was really stunned when I started using chatGPT with MJ v5 to create extensive prompts for midjourney, but guess what, that's no longer needed! You can create stunning, photorealistic results with very 'basic' prompts:

A female influencer from the 1930's

One exceptional feature of V6 is the ability to 'vary' mild or strong based on one of the images generated. Let's do a strong variation on the 4th image for instance:

A red haired viking getting ready for action

A red haired cyberpunk guy wearing really awkward glasses

Some more type testing

With a new alpha released recently, performance on 'text' has improved somewhat.

A funky cyberpunk logo for "Casey"

midjourney v4/v5/v6 Conclusion

My idea of 'image manipulation' isn't what midjourney is built around: it is good at generating things, not at adapting stuff you might have in your head.

Using it to improve your wedding pictures or to turn things into a logo isn't the right way to look at it (yet).

It is there to create new things from what already exists in its model. Of course, you could train the model, and I've seen successful blends of famous people known to the model in new environments (which is deepfake territory, and against the ToS).

I'll keep this post in progress while creating more stuff. I've done a few really cool logos (from scratch) and hope to find more reasons to use it!
