dreamingmachines - Tumblr blog

dreamingmachines · 5 days ago

Note

is your art fully ai or is it like, partially ai and partially hand-drawn? /gen i've never seen an actual artist with so much experience use ai in any way and as an artist i'm just curious how you're using it, i usually don't like ai art but you seem to use it differently so i'm REALLY curious actually

the art that i post on this blog is fully ai! i make it a point to not put anything hand-drawn on here :]

here's a breakdown of what you'll see and where it comes from:

The RhapLoRA (AKA my personal LoRA)

anything in this style is based off of my manual (hand-drawn) art. here's some examples of my manual art for comparison:

the most important elements, to me, are the thick lineart, the various textures, the shading, the funky backgrounds, the chunky clothes, the D-shaped hair gaps, the triple nose lines, the lack of eyeshine, the looping blush lines, and the scribbly black shadow underneath the chin. those are what say "this is my art" to me!

when it comes to the RhapLoRA, typically what i'm doing is using a series of danbooru tags to tell the model exactly what i want to see. i've actually had to memorize danbooru tags to do this, and i frequently have the site's wiki open to figure out the right tags to use!

for example, let's say i want a picture of spierce. completely from memory, the tags i usually use for her are:

"1girl, solo, orange hair, medium hair, wavy hair, hair over one eye, hair over shoulder, multicolored eyes, (red eyes:1.5), green eyes, maroon jacket, grey bowtie, white shirt, collared shirt, dark blue skirt, blue skirt, pleated skirt, plaid skirt, grey jeans, black boots, black gloves"

if i use those tags in my prompt, i get these:

this is only possible because i told the LoRA what those tags look like, and they come together to look like a spierce!

you'll note that she's crouching in all three of them. that's because it's trying to fit the shoes into this aspect ratio! the background in the middle one is because i use a lot of textures and odd colors in my manual art. this is an intended effect!

usually, i'll add other types of tags if i want a certain atmosphere, such as "abstract element, red theme". i can also prompt for poses, like "rubbing own chin, head tilt, parted lips". all of those together gets this:

overall, i have a lot of control over what comes out, and if i have a very specific idea in mind, it can take dozens to hundreds of generations to get exactly what i want. it's fun to see what comes out, though!

NovelAI Generations

the art style of my novelai generations is much more variable. that's because novelai doesn't take LoRAs—rather, it takes vibe transfers, which are example images that you give it in order to influence the art style. the same prompt can look wildly different if you just switch out the vibe transfers you're using!

you can also get pretty far with just using specific media as a tag. novelai is VERY good at style mimicry, so long as your source is popular enough.

otherwise, it works pretty similarly to the RhapLoRA. it uses danbooru tags, so sometimes i'll start with novelai to hash out a prompt (since i have unlimited gens there) and then move to the RhapLoRA in order to get what i'm actually looking for!

The RhapLoRA and NovelAI in Tandem

novelai is also very good at refining things from the RhapLoRA. i like to toss things back and forth between the two sometimes, either to get a sharper look, or to change some details!

(novelai gen -> put through the RhapLoRA at a high enough strength to change the whole image -> sent back through novelai to smooth out the jagged lines)

novelai can also do multiple people in one image, which the RhapLoRA isn't really good at. i've tried bringing the two together so i can have multiple characters in my art style, but it's......worked not very well. i'm getting there—that's how i got these!

(novelai for general composition -> RhapLoRA to restore the art style)

those are really the only two tools i use here! i've dabbled in other tools and programs, but nothing's really held me like this combination :3 thanks for the ask!

#save

124 notes · View notes

dreamingmachines · 25 days ago

Text

Training LoRAs with Civitai and Tensor

This is a guide that was requested of me in the @are-we-art-yet server! I figured I would go all the way and make it look nice and official. :)

This guide is made with the assumption that you know what a LoRA is and have a general idea of how they work. If not, there's plenty of guides out there to explain the basics.

Step 1: What website are you using?

There's a lot of online generation websites out there nowadays; I've only found two that really work out well for me, so I'll talk about them! Specifically, these are Civitai and Tensor.art.

First off, it's important to note that both Civitai and Tensor ask a fee of you to make a LoRA, but that fee varies between the two. For the sake of consistency, I'll refer to both currencies as credits, even though Civitai refers to theirs as buzz.

Civitai generally requests a flat 500 credit fee, with the fee raising to 600 if you'd like priority training. With their ratio of credit:dollar being 10:1 and the minimum credit purchase being $5, you can train ~10 models with just $5.

PROS:

Low, static asking price, regardless of dataset size.

Allows you to train the same dataset multiple times on different checkpoints with no additional fees on top of the training fee (i.e. x2 is 1,000 credits and 1,100 with priority training).

Each training parameter has a brief explanation if you hover, which makes it easier for beginners to understand.

If you're just interested in training, the subscription doesn't do much of anything for you, so you don't need to worry about it

CONS:

It often takes multiple tries to get Civitai to work with me, both on the autotagging step and the uploading step. I don't know if it's my computer or what, but it's recommended that once the auto-tagging works, you download the entire dataset, because you may need to refresh the page for it to work.

Using a custom checkpoint rather than their list doubles the credit cost of training that model.

Tensor asks a fee that changes relative to the size of your dataset and how many training steps you'll need as a result. For example, if I'm training on 50 images with 10 epochs on FLUX Fast, that'll run me 675 credits. If I use a custom model, it raises to 900. With their ratio of credit:dollar being 3.03~:1, and their minimum credit purchase being $9.90, you're getting much less bang for your buck, only being able to train ~4 models for $10.

However, I've come to prefer Tensor for a couple reasons:

PROS:

Tensor is much more consistent about auto-tagging and does it the moment you upload an image.

It retains your dataset and the tags if you click away or exit the website (however, it doesn't retain your settings afaik)

Allows you to set a negative prompt for your training images

In general, the UI is a lot tighter and more responsive

CONS:

Higher asking price by a fairly large margin

No option for multi-training (afaik)

Priority queue requires that you have a subscription ($10/m)

Overall, it's entirely up to preference. I'll be going through how to train a model both in Civitai and in Tensor.

Step 2: The Actual Training

First and foremost, navigate to the training screen:

Train a LoRA for Civitai, Online Training for Tensor.

Civitai asks you what kind of model you're looking for. For image models, these are pretty simple and straightforward: Character, Style, and Concept.

For this tutorial, we're going to be making a Style LoRA.

Tensor doesn't ask you this question and assumes you know what you're doing.

After this, you have a disclaimer on Civitai to agree to, and then your training screens for both:

From here, the process is roughly the same. You'll want to input the images you'll want to use for training.

I've found that ~50 HQ images is good for a style LoRA. I would also advise you to not use too many images of the same character; make sure the LoRA has as many answers to as many situations as possible, especially with tag-based models.

Auto-tagging isn't automatic (ha) on Civitai. Here's the screen you'll get:

You should generally know what you need if you know what checkpoint you're using, but for the sake of clarity, if you're using a checkpoint that asks for natural language prompts, you'll want captions.

I don't really mess with the settings otherwise. This does its job fine.

When your images are uploaded and tagged, it should look something like this for both:

From here, if you think the auto-tagging's done something wrong, or if you want to input specific tags, you can edit both. On Tensor, you'll just want to click the image whose tags you want to edit. You can just input them in the textbox below each image on Civitai.

On Civitai, you'll need to upload the dataset before you can choose your settings. If that's successful (remember to download your dataset in case it isn't), this is the first thing you'll see:

If all of that looks correct, you can scroll down:

For this exercise, I'll be training both on Illustrious Hassaku XL v3.0. However, you can use whatever checkpoint you'd like.

The checkpoint you train on is the checkpoint in which your style LoRA will look the most like it's intended to. If you generate with any other checkpoint, the LoRA be influenced by that checkpoint's style. This can be used for some fun experimentation down the line.

On Tensor, you'll note that button up in the corner that says "Basic/Professional Mode", depending on whether you've clicked it or not. Professional Mode gives you a lot more freedom over how the LoRA is trained; I don't really touch most of the settings, but feel free to mess around with them if you know what you're doing.

Onto the settings themselves:

These aren't all of the settings you can mess with, just a snapshot. However, the most important thing (in my opinion) is the number of epochs.

On Tensor only, after your LoRA is done training, you can test prompts on every saved epoch you have. This costs a lot of credits (roughly 50~ for 10 epochs), but it's a very good way of testing each version of your model. Oftentimes, I won't choose the final epoch to upload or use, because I'll note that some of the style's been shaved off by the end. This may vary by epoch, but both sites allow you to save or upload any epoch you want.

I'll be leaving the settings the same as their defaults for both.

Next, training prompts:

Civitai gives you three prompts to work with, as it will generate three images for each epoch. However, you cannot give it a negative prompt. If you do not give it preset prompts, it will randomly generate its own from the tags used in your dataset. This can have mixed results. Each image will be square and use the same sampler.

Tensor only uses one prompt—however, it generates four images with that prompt for each epoch. It also allows you to choose a negative prompt, the size of each image, and the sampler.

For Civitai, I'll be using 3 random prompts from past generations I've done. For Tensor, I'll be doing the same, but with the negative prompt I've been using as of late as well. I'll be keeping Tensor's images as portraits, and I'll continue using euler.

From here, if everything's to your liking, you can begin training. Both Civitai and Tensor allow you to preview the epochs as they're being generated—otherwise, this process will take a long time. Civitai is fairly quick, with an ETA of 42 minutes for this training.

Tensor, however, can take multiple hours.

Since I don't have multiple hours, here's a peek at a Tensor model I prepared earlier:

If you have experience with image generation, you can probably tell that each of the four images in each epoch are generated with the same seed. Let's skate down to Epoch 10 to see how it varies:

(I did actually end up choosing Epoch 10 for this one.)

As for Civitai, I unfortunately don't have any pre-prepared trainings, so here's the first epoch of our Hassaku XL LoRA as it trains:

Note that you can download this first epoch right away. However, you can't upload any of the models to Civitai until the training is completely done.

I'll be reblogging this with progress on both the Civitai and Tensor models made here as they continue to train! Hopefully this has been a helpful guide, and feel free to ask questions in the replies, or send an ask if you're curious. :)

#save

12 notes · View notes

dreamingmachines · 1 year ago

Text

I hate setting up cloth sim Anyway, I'd say it's doing a terrific job considering the somewhat shoddy input (skinning mishaps, clipping, simplistic textures, etc)

Definitely gets rid of the majority of "3d model feeling". The issues I can see are the drifting eye color and yellow flowers getting lost, but it's something that should get remedied with a model trained for specific character appearance. All in all, worth exploring further with more effort put into the base 3d render.

#save #not mine

3K notes · View notes

dreamingmachines · 1 year ago

Text

I don't think that making or using AI art/image generation is morally wrong, as you guys know, but I have to admit that slotting a wibbly-lined low effort image selected from the first result set from Midjourney in your content is incredibly tacky. At least select something that looks good. Maybe something without the boring AI "sheen" look either.

You see this a lot in clickbaity content like web spam and youtube shorts attempting to algorithm game.

#save #not mine

1K notes · View notes

dreamingmachines · 2 years ago

Text