GUIDANCE SCALE STABLE DIFFUSION

Last updated: October 26, 2025, 02:07 | Written by: Tamsin Crowe

Guidance Scale Stable Diffusion
Guidance Scale Stable Diffusion

Have you ever felt like your AI-generated images weren't quite capturing the vision in your head? The Classifier-Free Guidance (CFG) scale controls how closely a prompt should be followed during sampling in Stable Diffusion. It is a setting available in nearly all Stable Diffusion AI image generators. This post will teach you everything about the CFG scale in Stable Diffusion.Maybe they were too abstract, too chaotic, or simply didn't align with the text prompt you painstakingly crafted. guidance_scale 前面的所有示例统称为guidance_scale。guidance_scale是一种增加对指导生成(如文本)以及总体样本质量的条件信号的依从性的方法。它也被称为无分类器引导,简单地说,调整它可以更好的使用图像质量更好或更具备多样性。The secret to bridging that gap often lies in understanding and effectively utilizing the Guidance Scale, also known as the Classifier-Free Guidance (CFG) scale, within Stable Diffusion. 日々Stable Diffusionで2次元美少女の錬成に精を出しているみなさん向け記事です。 前記事でSampling Stepsについて書きましたが、今回はもう一つのプロパティ、Guidance Scaleについて確認していきます。 Guidance Scaleってなんなのさ? 説明を直訳すると「画像がプロンプトにどの程度従うべきかStable Diffusion, a groundbreaking text-to-image latent diffusion model developed by CompVis, Stability AI, and LAION, empowers you to bring your creative ideas to life.But to truly harness its potential, grasping the nuances of the CFG scale is paramount.

This article serves as your comprehensive guide to the Guidance Scale. Optimize your Stable Diffusion results with the CFG scale (guidance scale). Learn the best practices for using guidance scale from our step-by-step guide.We'll explore its definition, how it functions, and most importantly, how to use it to fine-tune your image generation process for optimal results. Best way to find out what scale does is to look at some examples! Here's a good resource about SD, you can find some information about CFG scale in studies section. Also, here's a more technical explanation.Whether you're a seasoned AI artist or just starting your journey with Stable Diffusion, this deep dive will provide you with the knowledge and practical tips to elevate your creations. CFG(Classifier-Free Guidance) 用于控制Stable Diffusion在采样期间应遵循提示词的严格程度。几乎所有稳定扩散 AI 图像生成器都提供了此参数设置。今天我们重点来看看在Stable Diffusion中CFG参数相关内容。 一. CFG是什么. 我们先以一个实例来看看CFG在不同参数值时的效果。From understanding the trade-off between creativity and prompt adherence to mastering advanced techniques like XYZ plotting, we'll cover everything you need to know. Stable Diffusionにおいて、「guidance_scale」というパラメーターが画像生成にどのような影響を与えるのか、興味がある方も多いのではないでしょうか?So, let's unlock the power of the Guidance Scale and transform your artistic vision into stunning reality!

Understanding the Guidance Scale (CFG Scale)

illustration for scale)
illustration for scale)

The Guidance Scale, or Classifier-Free Guidance (CFG) scale, is a crucial parameter in Stable Diffusion that governs how closely the generated image adheres to your text prompt.Think of it as a dial that controls the ""strictness"" of the AI's interpretation of your instructions.It's a numerical value that influences the balance between adhering to the prompt and allowing the AI to inject its own creative interpretation.

In essence, the CFG scale bridges the gap between your written description and the final image. Guidance scale Stable Diffusion, or classifier-free guidance scale (CFG), is a parameter used in artificial intelligence (AI) to determine how much an AI art generation process adheres to the text prompt. It is a specific number, or value, that represents the importance of a text prompt or description of an image that is generated with AI.It's a setting readily available in nearly all Stable Diffusion AI image generators, empowering you to fine-tune the output and achieve the precise aesthetic you desire.The parameter is used in both text-to-image (txt2img) and image-to-image (img2img) generations.

Key takeaway: The Guidance Scale determines how much the AI listens to your prompt versus going off on its own tangent.

How the Guidance Scale Works: Creativity vs. Best Settings for SDXL 1.0: Guidance, Schedulers, and Steps. To harness the full potential of SDXL 1.0, it's crucial to understand its optimal settings: Guidance Scale. Understanding Classifier-Free Diffusion Guidance. Diffusion models are powerful tools for generating samples, but controlling their quality and diversity can be challenging.Prompt Adherence

analysis for adherence
analysis for adherence

The beauty of the Guidance Scale lies in its ability to balance creativity and control.Let's break down how different values affect the image generation process:

  • Higher Guidance Scale (e.g., 15-20): At higher values, the model is strongly guided by the text prompt. A latent text-to-image diffusion model. Contribute to CompVis/stable-diffusion development by creating an account on GitHub.The generated image will closely resemble the description, potentially sacrificing some artistic flair and diversity.Expect more accurate and literal interpretations of your input.
  • Lower Guidance Scale (e.g., 1-5): Lower values give the AI more freedom to explore.The generated image might deviate significantly from the prompt, resulting in more creative and unexpected outcomes. Stable Diffusion is a text-to-image latent diffusion model created by the researchers and engineers from CompVis, Stability AI and LAION. It s trained on 512x512 images from a subset of the LAION-5B dataset. This model uses a frozen CLIP ViT-L/14 text encoder to condition the model on text prompts.This is ideal for experimentation and abstract art generation.
  • Moderate Guidance Scale (e.g., 7-10): This range offers a balanced approach, providing a good blend of prompt adherence and artistic expression.It's often considered the ""sweet spot"" for many prompts, as it allows the AI to interpret your instructions while still adding its own unique touch.OpenArt uses a default CFG scale of 7.

It's important to remember that there's no one-size-fits-all value for the Guidance Scale.The optimal setting depends heavily on the specific prompt, the desired aesthetic, and the capabilities of the Stable Diffusion model being used.Some models will respond differently to changes in the CFG scale.

Analogy: Think of the Guidance Scale as a volume knob for your prompt. Guidance Scale. The Guidance Scale, or Classifier-Free Guidance (CFG) scale, influences the degree to which Stable Diffusion adheres to the provided text prompt during image generation. A higher value on the Guidance Scale indicates stricter adherence to the input text. However, it also limits creative liberty, potentially yielding less diverseTurning it up makes the AI ""hear"" your instructions more clearly, while turning it down allows it to ""improvise"" more freely.

Finding Your Ideal Guidance Scale: A Step-by-Step Guide

visualization for guide represents key aspects of this topic.

Experimentation is key to mastering the Guidance Scale. In Stable Diffusion, CFG stands for Classifier Free Guidance scale. CFG scale is a parameter that controls Stable Diffusion how 'strict' it should follow the prompt input in image generation. Lower CFG give the AI more freedom to be creative, while higher numbers force it to stick more to the prompt.Here's a step-by-step approach to help you find the perfect setting for your creative endeavors:

  1. Start with the Default: Most Stable Diffusion interfaces default to a Guidance Scale of around 7 or 7.5. The guidance scale, also known as the Classifier-Free Guidance (CFG) scale, is a setting within Stable Diffusion that determines how closely the generated image adheres to the text prompt. Essentially, it acts as a control knob that adjusts the level of adherence between the AI-generated image and your written description.Begin here and generate an image based on your prompt.
  2. Adjust and Observe: Generate the same image multiple times, slightly adjusting the Guidance Scale each time.Try values like 5, 10, and 15.Carefully observe how the changes impact the generated image.
  3. Focus on Specific Aspects: Pay close attention to how the Guidance Scale affects specific aspects of the image, such as composition, color palette, and the presence or absence of particular elements mentioned in your prompt.
  4. Take Notes: Keep a record of your observations.Note which values produced the most desirable results for different types of prompts.
  5. Iterate and Refine: Based on your observations, continue to refine the Guidance Scale until you achieve the desired balance between prompt adherence and creative expression.

Pro Tip: Use the XYZ plot script in the Automatic1111 interface to systematically test different CFG scale values and observe the impact on the generated images in a grid format. CFGスケール(Classifier Free Guidance Scale)は、近年話題のStable Diffusionという画像生成モデルにおいて重要な概念です。 このスケールは、生成される画像がどの程度入力されたプロンプトや画像に忠実になるかを決定するパラメータです。This is a powerful tool for visualizing the effects of various settings.

Common Guidance Scale Ranges and Their Uses

While the ""best"" Guidance Scale is subjective and depends on the specific scenario, here are some common ranges and their typical applications:

  • 1-3: Highly Creative & Abstract
    • Suitable for generating abstract art, textures, and backgrounds where precise prompt adherence is not crucial.
    • Encourages the AI to explore unexpected and unconventional artistic styles.
  • 4-7: Balanced Approach
    • Ideal for general image generation where a good balance between prompt adherence and creative expression is desired.
    • Works well for portraits, landscapes, and illustrations.
  • 8-12: Prompt-Focused & Detailed
    • Recommended for generating images with specific details and compositions as described in the prompt.
    • Useful for creating realistic scenes, product visualizations, and technical illustrations.
  • 13+: Strict Adherence (Use with Caution)
    • Can lead to less diverse and more predictable results.
    • May be helpful for replicating specific styles or creating images that closely match a reference image.

Important Note: Exceedingly high Guidance Scale values (above 20) can sometimes lead to image artifacts or distortions. 在Stable Diffusion中,guidance_scale的值通常介于7和8.5之间,这被认为是稳定扩散的较好选择。 通过增加guidance_scale的值,可以提高生成文本的多样性,使生成的文本更具有创造性和变化性。Use them with caution and experiment to find the optimal balance.

Advanced Techniques: Leveraging Guidance Scale for Specific Effects

Beyond simply controlling the level of prompt adherence, the Guidance Scale can be used creatively to achieve specific artistic effects. Le CFG Scale, ou Classifier-Free Guidance Scale, est donc param tre crucial pour exploiter pleinement le potentiel de Stable Diffusion. J esp res qu en vous aidant mieux comprendre son fonctionnement du CFG Scale et son impact sur la g n ration d image, vous pourrez affiner votre utilisation de Stable Diffusion et cr er des imagesHere are a few advanced techniques to explore:

Fine-Tuning Image Composition

Use a higher Guidance Scale to ensure that the key elements of your scene are positioned as described in your prompt. Stable Diffusion starts with an image that consists of random noise. Then it continously denoises this image over and over again to steer it to the direction of your prompt. Inference steps controls how many steps will be taken during this process. The higher the value, the more steps that are taken to produce the image (also more time).For example, if you specify ""a cat sitting on a window sill,"" a higher value will increase the likelihood of the cat being correctly placed on the sill.

Controlling Color and Style

Adjust the Guidance Scale to influence the overall color palette and artistic style of the generated image.Higher values can help reinforce specific stylistic elements mentioned in the prompt, while lower values allow the AI to introduce its own unique artistic interpretations.

Adding Subtle Details

Experiment with subtle adjustments to the Guidance Scale to add or remove subtle details in the image.For example, slightly increasing the value might enhance the texture of a surface or bring out finer details in a portrait.

Negative Prompting and Guidance Scale

Combine Guidance Scale adjustments with negative prompting to achieve even more precise control over the generated image. NVIDIAのGPUを搭載していれば、ユーザ自身でStable Diffusionをインストールし、ローカル環境で実行することも可能です。 (出典:wikipedia) Stable Diffusionのインストール方法と基本的な使い方については、以下の記事で解説していますので、あわせてご覧ください。Negative prompts tell the AI what *not* to include, allowing you to further refine the output. 什么是Guidance Scale? Guidance Scale,或者称为指导尺度,是在生成图像和输入提示之间取得平衡的关键参数。这个概念在深度学习生成模型中扮演着重要的角色,尤其在稳定扩散(Stable Diffusion)领域中。 达到平衡. Guidance Scale决定了生成图像的质量与多样性之间的A higher Guidance Scale combined with a strong negative prompt can be very effective at removing unwanted elements.

Practical Examples: Seeing the Guidance Scale in Action

Let's illustrate the impact of the Guidance Scale with a few practical examples:

Prompt: ""A futuristic cityscape with neon lights and flying cars.""

  • Guidance Scale: 3: The generated image might be an abstract interpretation of a cityscape, with faint hints of neon lights and flying cars, but lacking clear definition.
  • Guidance Scale: 7: The image will likely depict a recognizable cityscape with vibrant neon lights and clearly visible flying cars, adhering reasonably well to the prompt.
  • Guidance Scale: 12: The image will be a highly detailed and realistic depiction of a futuristic cityscape, with precise placement of neon lights and flying cars, closely following the prompt.

Prompt: ""A portrait of a beautiful woman with long, flowing red hair.""

  • Guidance Scale: 4: The generated portrait might be stylized and artistic, with the woman's features slightly distorted, and the red hair rendered in an unconventional manner.
  • Guidance Scale: 8: The portrait will be more realistic, with the woman's features clearly defined and the red hair accurately depicted.
  • Guidance Scale: 15: The portrait will be highly realistic and detailed, with every strand of red hair meticulously rendered, closely resembling a photograph. Mathematically, the total guidance during sampling is: Total guidance = CFG scale PAG scale. That s why the default setting is a CFG scale of 4 and PAG scale of 3, summing up to 7, a widely used CFG value. Use PAG on ComfyUI. ComfyUI has native support for the Perturbed Attention Guidance node.However, it might lack some artistic flair.

Troubleshooting Common Issues with the Guidance Scale

Sometimes, even with a good understanding of the Guidance Scale, you might encounter unexpected results.Here are some common issues and how to troubleshoot them:

  • Image Artifacts: Excessively high Guidance Scale values can sometimes lead to image artifacts or distortions. The Guidance Scale, also known as the Classifier-Free Guidance (CFG) scale, controls how closely Stable Diffusion adheres to the provided text prompt during the image generation process. In other words, it determines the extent to which the generated image reflects the input text.Try reducing the value or adjusting other settings like sampling steps.
  • Lack of Diversity: High Guidance Scale values can limit the AI's creative freedom, resulting in less diverse and more predictable images. Guidance scale controls how similar the generated image will be to the prompt. A higher guidance scale means the model will try to generate an image that follows the prompt more strictly. A lower guidance scale means the model will have more creativity.Experiment with lower values to encourage more variation.
  • Prompt Ignored: Very low Guidance Scale values might cause the AI to completely ignore your prompt.Increase the value until the image starts to reflect your instructions.
  • Inconsistent Results: Stable Diffusion can be sensitive to minor variations in prompts and settings. Scale is the strength of the guidance scale parameter you apply to the prompt. The higher the value it is, the more you tell the computer to literally follow your text prompt. The lower it is, the more you give creative freedom to the randomness.Ensure consistency in your prompts and settings when comparing results across different Guidance Scale values.

The Guidance Scale and Stable Diffusion XL (SDXL)

With the advent of Stable Diffusion XL (SDXL), understanding the optimal Guidance Scale becomes even more critical. Guidance Scale是Stable Diffusion中一个重要的概念,用于衡量生成的图像与输入提示之间的紧密程度和输入的多样性之间的权衡关系。 在使用Stable Diffusion进行图像生成时,用户可以通过调整Guidance Scale的值来控制生成图像的质量和多样性。SDXL, with its increased resolution and improved capabilities, often requires slightly different settings compared to earlier versions.

While the general principles of the Guidance Scale remain the same, SDXL tends to perform well with a slightly lower range of values.Experimenting within the 5-8 range is often a good starting point for SDXL.

Remember to consider these factors when using SDXL:

  • Model Specifics: Different SDXL models or fine-tunes might have their own ideal Guidance Scale ranges. guidance_scale (float, optional, defaults to 7.5) Guidance scale as defined in Classifier-Free Diffusion Guidance. guidance_scale is defined as w of equation 2. of Imagen Paper . Guidance scale is enabled by setting guidance_scale 1 .Always refer to the model's documentation or community recommendations.
  • Sampler Settings: The choice of sampler can also influence the optimal Guidance Scale. CFG guidance scale. This parameter can be seen as the Creativity vs. Prompt scale. Lower numbers give the AI more freedom to be creative, while higher numbers force it to stick more to the prompt. The default CFG used on OpenArt is 7, which gives the best balance between creativity and generating what you want.Experiment with different samplers and adjust the Guidance Scale accordingly.

Frequently Asked Questions (FAQs)

What is the default Guidance Scale in Stable Diffusion?

The default Guidance Scale is typically around 7 or 7.5, but it can vary depending on the specific Stable Diffusion interface or implementation you're using.

Is a higher Guidance Scale always better?

No, a higher Guidance Scale is not always better. What is the CFG Scale? Like Seed, the classifier-free guidance scale (CFG Scale) is one of the additional settings found in the Stable Diffusion model. The CFG scale adjusts how much the image looks closer to the prompt and/ or input image.While it can lead to more accurate prompt adherence, it can also limit creativity and potentially introduce image artifacts. Stable DiffusionのXYZ plotにて、CFG Scaleを1.0〜9.0に設定した一覧がこちら。 ご覧いたければ分かる通り、CFG Scaleの数値が小さいほど、ボヤけた柔らかいイメージの画像が生成されています。The optimal value depends on the specific prompt and desired aesthetic.

Can the Guidance Scale fix a poorly written prompt?

No, the Guidance Scale cannot compensate for a poorly written prompt.It's essential to craft clear and concise prompts to guide the AI effectively.The Guidance Scale simply fine-tunes the model's interpretation of the prompt.

How does the Guidance Scale relate to sampling steps?

The Guidance Scale and sampling steps are two distinct but related parameters. 在使用Stable Diffusion web UI、ComfyUI等进行生图的时候,提示词引导系数 (CFG Scale)是常用设置参数之一,那么你了解过CFG Scale是什么吗?今天就代大家了解一下CFG Scale,让大家以后在SD生图的时候更容易设置该参数。 提示词引导系数 (CFG ScSampling steps determine the number of iterations the AI takes to refine the image. Characteristic Guidance Web UI is an extension of for the Stable Diffusion web UI (AUTOMATIC1111). It offers a theory-backed guidance sampling method with improved sample and control quality at high CFG scale ( ). This is the official implementation of Characteristic Guidance: Non-linearBoth parameters influence the final output and should be adjusted in conjunction to achieve optimal results.More steps generally require a lower Guidance Scale, and vice-versa.

Conclusion: Mastering the Guidance Scale for Stunning AI Art

The Guidance Scale is an indispensable tool for anyone seeking to unlock the full potential of Stable Diffusion. This paragraph delves into the practical application of the guidance scale (CFG scale) for refining the output of stable diffusion models. It instructs viewers on how to use the XYZ plot script in the Automatic 1111 interface to systematically test different CFG scale values and observe the impact on the generated images.By understanding its function and mastering its application, you can gain unparalleled control over the image generation process, transforming your creative visions into stunning reality. In Stable Diffusion, CFG stands for Classifier Free Guidance scale. CFG is the setting that controls how closely Stable Diffusion should follow your text prompt . It is applied in text-to-image (txt2img) and image-to-image (img2img) generations.From influencing composition and style to adding subtle details, the Guidance Scale empowers you to fine-tune every aspect of your AI-generated artwork.

Remember, experimentation is key.Don't be afraid to explore different Guidance Scale values and observe their impact on your images.With practice and patience, you'll develop an intuitive understanding of how to use this powerful parameter to achieve the precise aesthetic you desire. The Classifier-Free Guidance (CFG) scale controls how closely a prompt should be followed during sampling in Stable Diffusion. It is a setting available in nearly all Stable Diffusion AI image generators.So, dive in, experiment, and unleash your creativity with the Guidance Scale!

Ready to take your Stable Diffusion skills to the next level?Start experimenting with different Guidance Scale values today and share your creations with the world! CFG scale, or Classifier Free Guidance scale, is a parameter that controls the guidance provided to stable diffusion processes. It is used in different applications, including text-to-image (txt2img) and image-to-image (img2img) generations.Happy generating!

Tamsin Crowe can be reached at [email protected].

Comments