AI, personal interests

A journey into midjourney (image generation)

some excerpt

I couldn't imagine a creative process to be so much fun – I usually have difficulty visualising my thoughts. Still, midjourney is an excellent example of how text can become visual in seconds.

So let's start prompting, shall we?

Midjourney v4, February 2023

Of course, I started with our dog because what else is essential in life? (see input picture further below)

Sien as a jetson

jetsons cartoon

And since my girlfriend works at a tour guide company that goes through Berlin by bike...

Bike_company_example

a modern logo for a tourguide company in berlin called "Berlin on bike",comic style,fresh

And asked for more variants on option #4:

berlin on bike midjourney 2

I've also done a few variations on pictures from my girlfriend:

girlfriend_midjourney_v4

girlfriend_midjourney_v4_2

...and found the outcomes mediocre at best, to be honest.

Finally, I had to put myself to the test, mainly using the images/avatar you might know from my site / social media to inspire midjourney to make some crazy attempts.

...to turn me into an icon

CaseyR_as_an_iconabstractmoderntech

as an icon,abstract,modern,tech

...or a logo...

CaseyR_as_logo_abstract_modern_tech

...or maybe a superhero?... (the model has some serious bias issues btw)

CaseyR_superhero_fitness_ultrarealistic

...but the cartoon/funny style made the best impression...

CaseyR_cartoon_photorealistic_tech_summer_fun_happy_curious

...which eventually led me into the dalli rabbit hole, ending up with this:

casey_dalli

So one of the nicest results might be something like this:

bikeride_in_berlin_midjourneyv4

group of people enjoying their bikeride in Berlin Germany, anime

(even though there's not much enjoyment to be seen)

Here comes v5!

Many things happen in a short time, which means there is v5 of midjourney.

[!NOTE] midjourney v5 This model has very high Coherency, excels at interpreting natural language prompts, is higher resolution, and supports advanced features like repeating patterns

And well let's start with the prompt result I ended with in v4:

group of people enjoying their bikeride in Berlin Germany midjourney v5

as you can already see, the difference is quite huge. It seems the model has a different understanding of 'anime', and I potentially also primed it less well. So I made it a bit more clear:

group of people enjoying their bikeride in Berlin Germany again midjourney v5

group of people enjoying their bikeride in Berlin Germany, anime cartoon style

and this already looks much more like a result that is usable (with some artifacts around the bikes) with some attention.

v5 creates some interesting (although fitting in a way) results when you lack decent prompts:

dog midjourney v5

cartoon style -s 1000

(using a picture of our dog as input)

With a single adjustment in the prompt, the result is a lot cooler:

cartoon style dog v5 midjourney

cartoon style dog --s 250

And adding some imagination is the strong suit of these models:

cartoon style dog in the snow

cartoon style dog in the snow --s 500

For anyone asking, here's the original image I used to prompt the dog generation:

Sien

Cyberpunk (in Berlin)

I love science fiction, been a great fan of everything cyberpunk and for some reason gives me calm and some form of future perspective I both love and loathe.

So a great inspiration for some prompting!

Cyberpunk Berlin 1

Berlin's brandenburger tor in cyberpunk style in 2222, late in the evening. It's raining, there's little people there. --s 1000 --ar 3:2 --v 5

Cyberpunk Berlin midjourney v5 2

Berlin in cyberpunk style in 2222, early in the morning. The brandenburger tor in the background. It's raining, there's little people there. --s 1000

Cyberpunk Berlin midjourney v5 3

Berlin in cyberpunk style in 2222, early in the morning, in the distance a hovertrain is leaving the train station. It's raining, there's little people there. --s 1000

Sometimes less is more, since in the last prompt it had too much train and too little Berlin.

So one more without Berlin (that I really liked):

cyberpunky midjourney v5

a first person perspective from a cyberpunk city in 2222, late in the evening in neon lights, lots of people on the street photorealistic --s 1000

Logo / icon generation in midjourney v5

I still had to try this, since well – this is something a lot of people would want to do (and again used my regular avatar image to do so)

casey R logo midjourney v5

Casey Logo midjourney v5 3

Casey Logo midjourney v5 4

as a cartoon logo, futuristic but subtle, imaginative but down to earth. Inquisitive --s 1000

the last one is quite funny!

Giving two images as prompt (and how it can go wrong quickly)

So I'm looking for a good look for me in a suit. I found an image on instagram I loved, added my own and prompted for result.

red hair guy grey suit horribly wrong midjourney v5

red hair guy grey suit horribly wrong midjourney v5 2

combine these images so the red guy is in the grey suit, photorealistic --s 1000

the result is actually hilarious and could well be used for some kind of ad campaign for a suit (without a shoot!) but well, not completely what I was looking for 😆

Spark of creativity

So stepping away from reality / realism, I found a prompt like this to be fascinating:

a group of people being in venice in the 17th century, taking a picture on top of a bridge, selfie style, enjoying themselves, men and women combined watercolor painting van gogh style --no phone --s 1000

selfie waterpaint

the future, prompt engineering using chatGPT (?)

I will not go into detail too much, just watch this video:

https://www.youtube.com/watch?v=Asg1e_IYzR8

the results are quite stunning:

chatgpt 4 prompting midjourney v5 example 1

chatgpt 4 prompting midjourney v5 example 2

chatgpt 4 prompting midjourney v5 example 3

chatgpt 4 prompting midjourney v5 example 4

chatgpt 4 prompting midjourney v5 example 5

chatgpt 4 prompting midjourney v5 example 6

chatgpt 4 prompting midjourney v5 example 7

chatgpt 4 prompting midjourney v5 example 8

midjourney v4/v5 Conclusion

My idea of 'image manipulation' isn't the critical thought behind it – midjourney is good in generating things, not adapting stuff you might have in your head.

Using it to improve your wedding pictures or turning things into a logo isn't the right perspective (yet).

It is there to create new things from already existing ones in their model. Of course, you could train the model, and I've seen successful blends of famous persons known to the model in new environments (which is a deep fake territory, and against the ToS).

I'll keep this post in progress while creating more stuff. I've done a few really cool logo's (from scratch) and hope to find more reasons to use it!

More to read / watch:

Why hands suck in AI (for now)

https://www.youtube.com/watch?v=24yjRbBah3w

This is a WIP
Content is never static, I'll create, read, update and delete whenever I think that makes sense.

Feel free to add your comments to this page so I can keep things relevant!

What work(s)ation and what not

5 sec read

One Reply to “A journey into midjourney (image generation)”

Leave your thoughts/ideas/improvements!

%d bloggers like this:
CY.B