// DALL·E 2023-10-13 14.05.37 – 4K DSLR photo of a black and white Stabyhoun calmly sitting next to a red-haired man in a park setting of a futuristic cityscape. Both are enjoying a
AI, personal interests

A journey into midjourney (image generation)

some excerpt

I couldn't imagine a creative process to be so much fun – I usually have difficulty visualising my thoughts. Still, midjourney is an excellent example of how text can become visual in seconds.

So let's start prompting, shall we?

Midjourney v4, February 2023

Of course, I started with our dog because what else is essential in life? (see input picture further below)

Sien as a jetson

jetsons cartoon

And since my girlfriend works at a tour guide company that goes through Berlin by bike...

Bike_company_example

a modern logo for a tourguide company in berlin called "Berlin on bike",comic style,fresh

And asked for more variants on option #4:

berlin on bike midjourney 2

I've also done a few variations on pictures from my girlfriend:

girlfriend_midjourney_v4

girlfriend_midjourney_v4_2

...and found the outcomes mediocre at best, to be honest.

Finally, I had to put myself to the test, mainly using the images/avatar you might know from my site / social media to inspire midjourney to make some crazy attempts.

...to turn me into an icon

CaseyR_as_an_iconabstractmoderntech

as an icon,abstract,modern,tech

...or a logo...

CaseyR_as_logo_abstract_modern_tech

...or maybe a superhero?... (the model has some serious bias issues btw)

CaseyR_superhero_fitness_ultrarealistic

...but the cartoon/funny style made the best impression...

CaseyR_cartoon_photorealistic_tech_summer_fun_happy_curious

...which eventually led me into the dalli rabbit hole, ending up with this:

casey_dalli

So one of the nicest results might be something like this:

bikeride_in_berlin_midjourneyv4

group of people enjoying their bikeride in Berlin Germany, anime

(even though there's not much enjoyment to be seen)

Here comes v5!

Many things happen in a short time, which means there is v5 of midjourney.

[!NOTE] midjourney v5 This model has very high Coherency, excels at interpreting natural language prompts, is higher resolution, and supports advanced features like repeating patterns

And well let's start with the prompt result I ended with in v4:

group of people enjoying their bikeride in Berlin Germany midjourney v5

as you can already see, the difference is quite huge. It seems the model has a different understanding of 'anime', and I potentially also primed it less well. So I made it a bit more clear:

group of people enjoying their bikeride in Berlin Germany again midjourney v5

group of people enjoying their bikeride in Berlin Germany, anime cartoon style

and this already looks much more like a result that is usable (with some artifacts around the bikes) with some attention.

v5 creates some interesting (although fitting in a way) results when you lack decent prompts:

dog midjourney v5

cartoon style -s 1000

(using a picture of our dog as input)

With a single adjustment in the prompt, the result is a lot cooler:

cartoon style dog v5 midjourney

cartoon style dog --s 250

And adding some imagination is the strong suit of these models:

cartoon style dog in the snow

cartoon style dog in the snow --s 500

For anyone asking, here's the original image I used to prompt the dog generation:

Sien

Cyberpunk (in Berlin)

I love science fiction, been a great fan of everything cyberpunk and for some reason gives me calm and some form of future perspective I both love and loathe.

So a great inspiration for some prompting!

Cyberpunk Berlin 1

Berlin's brandenburger tor in cyberpunk style in 2222, late in the evening. It's raining, there's little people there. --s 1000 --ar 3:2 --v 5

Cyberpunk Berlin midjourney v5 2

Berlin in cyberpunk style in 2222, early in the morning. The brandenburger tor in the background. It's raining, there's little people there. --s 1000

Cyberpunk Berlin midjourney v5 3

Berlin in cyberpunk style in 2222, early in the morning, in the distance a hovertrain is leaving the train station. It's raining, there's little people there. --s 1000

Sometimes less is more, since in the last prompt it had too much train and too little Berlin.

So one more without Berlin (that I really liked):

cyberpunky midjourney v5

a first person perspective from a cyberpunk city in 2222, late in the evening in neon lights, lots of people on the street photorealistic --s 1000

Logo / icon generation in midjourney v5

I still had to try this, since well – this is something a lot of people would want to do (and again used my regular avatar image to do so)

casey R logo midjourney v5

Casey Logo midjourney v5 3

Casey Logo midjourney v5 4

as a cartoon logo, futuristic but subtle, imaginative but down to earth. Inquisitive --s 1000

the last one is quite funny!

Giving two images as prompt (and how it can go wrong quickly)

So I'm looking for a good look for me in a suit. I found an image on instagram I loved, added my own and prompted for result.

red hair guy grey suit horribly wrong midjourney v5

red hair guy grey suit horribly wrong midjourney v5 2

combine these images so the red guy is in the grey suit, photorealistic --s 1000

the result is actually hilarious and could well be used for some kind of ad campaign for a suit (without a shoot!) but well, not completely what I was looking for 😆

Spark of creativity

So stepping away from reality / realism, I found a prompt like this to be fascinating:

a group of people being in venice in the 17th century, taking a picture on top of a bridge, selfie style, enjoying themselves, men and women combined watercolor painting van gogh style --no phone --s 1000

selfie waterpaint

the future, prompt engineering using chatGPT (?)

I will not go into detail too much, just watch this video:

https://www.youtube.com/watch?v=Asg1e_IYzR8

the results are quite stunning:

chatgpt 4 prompting midjourney v5 example 1

chatgpt 4 prompting midjourney v5 example 2

chatgpt 4 prompting midjourney v5 example 3

chatgpt 4 prompting midjourney v5 example 4

chatgpt 4 prompting midjourney v5 example 5

chatgpt 4 prompting midjourney v5 example 6

chatgpt 4 prompting midjourney v5 example 7

chatgpt 4 prompting midjourney v5 example 8

midjourney V6

please note this is still the alpha version in use (release notes) By the end of December 2023, midjourney introduced the alpha version of V6. It's very impressive, stunning to say the least! Let's show you a few of my previous examples in this new iteration of midjourney so you (hopefully) see what I see as well.

Let's start with the bridge example of V5 that I used previously and see what happens:

a group of people being in venice in the 17th century, taking a picture on top of a bridge, selfie style, enjoying themselves, men and women combined watercolor painting van gogh style --no phone --s 1000

Venice 17th Century Group

this is pretty awesome right? The examples still need a bit of work, but I really like how the secon and third are looking right now.

The model is still in the works, and it shows - there are quite a few artifacts, but the results are so much nicer and better looking, with a lot less 'prompting' to be done, see the following:

a first person perspective from a cyberpunk city in 2222, late in the evening in neon lights, lots of people on the street photorealistic --s 1000

First Person Cyberpunk City

The results are simply way more photorealistic, it displays the imagination a bit more and it feels more 'real' if you catch my drift.

Now let's use the example of our dog that we used before to consider image 'reading' instead. You remember her from the V5 right?

cartoon style

sienMJv6

even though the result looks more 'realistic', this is not what I wanted. So again we need a bit of finetuning of the prompt:

cartoon style dog --s 250

Cartoon Style Dog from Midjourney

it's still far away from my 'original' image but I like the outcomes a lot better 🙂

Now for my favourite topic, combining Berlin and Cyberpunk:

Berlin's brandenburger tor in cyberpunk style in 2222, late in the evening. It's raining, there's little people there. --s 1000 --ar 3:2

Brandenburger Tor Cyberpunk

Again there's so much more detail, sharpness and realism to the image that I almost believe this is the real Tor in 2222.

another variation brought me this:

Brandenburger Tor Cyberpunk upscale

Let's take a look at what MJ makes out of my own image when I try to make a logo out of it:

as a cartoon logo, futuristic but subtle, imaginative but down to earth. Inquisitive

CR Cartoon logo

What strikes me is that the images make more sense, there's just less nonesense on there. I choose the style raw on it, if I put it back to style 1000 it looks a bit like this:

CaseyRomkes Cartoon Logo 2

This just doesn't work that well, although I understand where the model gets the ideas from.

I was really stunned when I started using chatGPT with MJ v5 to create extensive prompts for midjourney, but guess what, that's no longer needed! You can create stunning, photorealistic results with very 'basic' prompts:

A female influencer from the 1930's

Female influencer 1930s full body

Female influencer 1930s full body part 2

One exceptional feature of V6 is the ability to 'vary' mild or strong based on one of the images generated. Let's do a strong variation on the 4th image for instance:

female strong variation MJ

A red haired viking getting ready for action

Red Haired viking

A red haired cyberpunk guy wearing really awkward glasses

cyberpunk midjourney dude

Some more type testing

With a new alpha released recently, performance of 'text' has improved somewhat. a

A funky cyberpunk logo for "Casey"

Casey MJ logo

midjourney v4/v5/v6 Conclusion

My idea of 'image manipulation' isn't the critical thought behind it – midjourney is good in generating things, not adapting stuff you might have in your head.

Using it to improve your wedding pictures or turning things into a logo isn't the right way to look at it (yet).

It is there to create new things from already existing ones in their model. Of course, you could train the model, and I've seen successful blends of famous persons known to the model in new environments (which is a deep fake territory, and against the ToS).

I'll keep this post in progress while creating more stuff. I've done a few really cool logo's (from scratch) and hope to find more reasons to use it!

More to read / watch:

Why hands suck in AI (for now)

https://www.youtube.com/watch?v=24yjRbBah3w

This is a WIP
Content is never static, I'll create, read, update and delete whenever I think that makes sense.

Feel free to add your comments to this page so I can keep things relevant!

Top 2000 lost records

4 sec read

One Reply to “A journey into midjourney (image generation)”

Leave your thoughts/ideas/improvements!

CY.B