SD3.5 M


Commercial Use, Logo design, Object Enhance, Logo & Icon

Medium

Recently Updated: 25/03/02
First Published: 24/10/29

Image info

Stable Diffusion 3.5 Medium is a Multimodal Diffusion Transformer with improvements (MMDiT-x) text-to-image model that features improved performance in image quality, typography, complex prompt understanding, and resource-efficiency.

Usage & Limitations

While this model can handle long prompts, you may observe artifacts on the edges of generations when the prompt exceeds 256 T5 tokens. Pay attention to the token limit when using this model in your workflow, and shorten prompts if artifacts become too obvious.

The medium model has a different training data distribution than the large model, so it may not respond to the same prompt in the same way.

We recommend sampling with Skip Layer Guidance for better structure and anatomy coherency.
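The 256-token guideline above can be checked before a prompt is sent to the model. The following is a minimal sketch, not part of the model's tooling: the helper names (`check_prompt_length`, `shorten_prompt`) are hypothetical, and the default `whitespace_tokenize` is only a rough stand-in that undercounts compared with the real T5 tokenizer, so pass in an actual T5 tokenize function for accurate counts.

```python
from typing import Callable, List, Sequence, Tuple

# Threshold taken from the usage note above.
T5_TOKEN_LIMIT = 256


def whitespace_tokenize(prompt: str) -> List[str]:
    """Rough stand-in tokenizer (assumption): splits on whitespace.

    This is NOT the T5 tokenizer; real T5 counts are usually higher.
    """
    return prompt.split()


def check_prompt_length(
    prompt: str,
    tokenize: Callable[[str], Sequence] = whitespace_tokenize,
    limit: int = T5_TOKEN_LIMIT,
) -> Tuple[int, bool]:
    """Return (token_count, exceeds_limit) for a prompt."""
    count = len(tokenize(prompt))
    return count, count > limit


def shorten_prompt(
    prompt: str,
    tokenize: Callable[[str], Sequence] = whitespace_tokenize,
    limit: int = T5_TOKEN_LIMIT,
) -> str:
    """Drop trailing comma-separated clauses until the prompt fits.

    A single clause is never dropped, so a very long clause may still
    exceed the limit.
    """
    clauses = [c.strip() for c in prompt.split(",") if c.strip()]
    while len(clauses) > 1 and len(tokenize(", ".join(clauses))) > limit:
        clauses.pop()
    return ", ".join(clauses)


if __name__ == "__main__":
    long_prompt = ", ".join(["a red fox"] * 100)
    print(check_prompt_length(long_prompt))
    print(check_prompt_length(shorten_prompt(long_prompt)))
```

Trimming from the end assumes the most important descriptors come first in the prompt; if that does not hold for your prompts, a different shortening strategy may suit better.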

License

We are pleased to release this model under our permissive community license. Here are the key components of the license:

Free for non-commercial use: Individuals and organizations can use the model free of charge for non-commercial use, including scientific research.

Free for commercial use (up to $1M in annual revenue): Startups, small to medium-sized businesses, and creators can use the model for commercial purposes at no cost, as long as their total annual revenue is less than $1M.

Ownership of outputs: Retain ownership of the media generated without restrictive licensing implications.
