Stable Diffusion 4 Announced: Features & Release 2026
Stability AI announces Stable Diffusion 4 Ultra with a new DiT architecture, 4096×4096 native resolution, and open weights — here's what's new and when you can use it.
Stability AI has officially announced Stable Diffusion 4, its most significant generational leap since the original Stable Diffusion launch in 2022. Built on an upgraded diffusion transformer (DiT) architecture, SD4 ships in two tiers — SD4 Base and SD4 Ultra — with open weights available under a community license. The flagship SD4 Ultra targets native 4096×4096 resolution output, dramatically improved photorealism, and a dedicated text rendering module that addresses one of the oldest weaknesses in diffusion models.
If you create visual content, build AI-powered tools, or follow the open-source AI ecosystem, the Stable Diffusion 4 release in 2026 reshapes your options. Here's a complete breakdown of what's new, how it compares, and when you can start using it.
Caption: The two-tier SD4 release structure — Base for community and Ultra for professional use.
The Announcement
On April 6, 2026, Stability AI formally unveiled the Stable Diffusion 4 family, marking the company's first full-generation model release since the SD3.5 series. The announcement confirmed months of speculation within the open-source AI community about Stability's next architectural move.
The core technical shift: SD4 migrates fully from the U-Net backbone that powered SD 1.x through SDXL to a diffusion transformer (DiT) architecture. This isn't a surface-level upgrade. Transformers scale more predictably with compute — as parameters and training FLOPs increase, quality improves in a more linear fashion compared to U-Net's diminishing returns at scale. The engineering team redesigned the attention mechanisms to use a mixture of local and global attention layers, reducing computational cost at high resolutions while preserving fine-grained detail.
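The local/global mixture described above can be illustrated with attention masks. This is a minimal NumPy sketch of the general technique, not SD4's actual implementation — the window size and the alternating layer pattern are assumptions, since Stability hasn't published those hyperparameters:

```python
import numpy as np

def local_attention_mask(seq_len: int, window: int) -> np.ndarray:
    """Boolean mask: token i may attend to token j only if |i - j| <= window."""
    idx = np.arange(seq_len)
    return np.abs(idx[:, None] - idx[None, :]) <= window

def global_attention_mask(seq_len: int) -> np.ndarray:
    """Dense mask: every token attends to every token."""
    return np.ones((seq_len, seq_len), dtype=bool)

def layer_masks(seq_len: int, n_layers: int, window: int) -> list[np.ndarray]:
    """Alternate local and global layers (an assumed pattern): local layers
    keep cost near O(n * window) per token, while the interleaved global
    layers restore long-range coherence across the whole image."""
    return [
        local_attention_mask(seq_len, window) if i % 2 == 0
        else global_attention_mask(seq_len)
        for i in range(n_layers)
    ]

masks = layer_masks(seq_len=16, n_layers=4, window=2)
# The local mask is far sparser than the global one -- that sparsity
# is where the high-resolution compute savings come from.
print(masks[0].sum(), masks[1].sum())
```

At 4096×4096, the token count is large enough that even this simple sparsity pattern changes what is computationally feasible.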
The model also incorporates RoPE (Rotary Position Embedding) for spatial awareness, which improves compositional coherence — objects maintain consistent spatial relationships across the image, fixing a problem that plagued earlier SD generations.
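Rotary embeddings encode position by rotating query/key feature pairs, so relative offsets fall out naturally in attention dot products. The 1-D sketch below shows the basic mechanism; SD4 presumably uses a 2-D spatial variant for images, and the frequency base here is the conventional choice, not a confirmed SD4 value:

```python
import numpy as np

def rope(x: np.ndarray, pos: int, base: float = 10000.0) -> np.ndarray:
    """Apply rotary position embedding to one vector of even dimension.

    Consecutive pairs (x[2i], x[2i+1]) are rotated by pos * theta_i,
    where theta_i = base**(-2i/d). Dot products between rotated queries
    and keys then depend only on the *relative* position offset.
    """
    d = x.shape[-1]
    assert d % 2 == 0
    i = np.arange(d // 2)
    theta = pos * base ** (-2 * i / d)
    cos, sin = np.cos(theta), np.sin(theta)
    x1, x2 = x[0::2], x[1::2]
    out = np.empty_like(x, dtype=float)
    out[0::2] = x1 * cos - x2 * sin
    out[1::2] = x1 * sin + x2 * cos
    return out

q = np.ones(8)
# Rotation preserves the vector norm, so attention magnitudes stay stable
# regardless of where a token sits in the sequence.
print(np.linalg.norm(q), np.linalg.norm(rope(q, pos=7)))
```

The relative-offset property is what helps with compositional coherence: two tokens a fixed distance apart interact the same way wherever they appear in the image.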
Stability AI's leadership framed the release as both a technical milestone and a strategic statement: open-weight models can compete with closed-source systems on raw quality. The company has been rebuilding since its 2024 leadership crisis, and SD4 represents the first major output from the restructured engineering team.
What's New: Feature-by-Feature Breakdown
Upgraded DiT Architecture
The move to a diffusion transformer backbone is the single most important change. Previous SD models relied on U-Net for the denoising process, which served the community well but hit scaling ceilings. SD4's DiT architecture follows the research direction established by Peebles & Xie (2023) and production-proven by models like Flux. The result: more predictable quality scaling and better performance at high resolutions.
4096×4096 Native Resolution (SD4 Ultra)
SD4 Ultra generates at native resolutions up to 4096×4096 without the tiling artifacts that made SDXL unreliable at ultra-high resolution. For professionals working on print-ready assets, large-format output, or 4K+ production pipelines, this eliminates an entire post-processing step.
Photorealism and Lighting Improvements
Stability's internal benchmarks show SD4 Ultra generates anatomically correct hands in approximately 87% of samples at standard guidance scales — up from ~60% for SDXL and ~72% for SD3.5. The training pipeline incorporated a curated subset of physically-based rendering (PBR) datasets, giving the model exposure to images with precisely known lighting metadata. The output demonstrates correct specular highlights, subsurface scattering in skin, and consistent shadow directionality.
Text Rendering Module
SD4 introduces a dedicated text glyph conditioning module — a first for the Stable Diffusion family. Short phrases, signage, and product labels render correctly in the majority of cases. While it doesn't match DALL-E 4's text accuracy for long or stylized typography, it handles practical use cases like storefront signage, product labels, and simple titles.
Two-Tier Release: Base vs. Ultra
| Feature | SD4 Base | SD4 Ultra |
|---|---|---|
| Architecture | DiT (streamlined) | Full upgraded DiT |
| Max native resolution | 1024×1024 | 4096×4096 |
| Minimum VRAM | 12GB | 24GB |
| Text glyph module | No | Yes |
| PBR lighting model | Partial | Full |
| Inference speed | ~8–12 s (RTX 4090) | ~45–90 s (A100, max resolution) |
| Target user | Hobbyists, indie devs | Studios, enterprise |
| Open weights | Yes | Yes |
SD4 Base runs comfortably on consumer hardware — an RTX 3060 with 12GB VRAM is the minimum. SD4 Ultra requires 24GB+ VRAM for local inference, making cloud API access the practical choice for most teams without dedicated GPU infrastructure.
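A quick way to sanity-check which tier your hardware can run, using the VRAM minimums from the table above. This is a sketch — the thresholds come straight from the table, and a real deployment decision should also account for target resolution, batch size, and any memory-saving inference options:

```python
def pick_sd4_tier(vram_gb: float) -> str:
    """Map available VRAM to the SD4 tier it can run locally,
    per the stated minimums (12 GB for Base, 24 GB for Ultra)."""
    if vram_gb >= 24:
        return "SD4 Ultra"
    if vram_gb >= 12:
        return "SD4 Base"
    return "cloud API"  # below the Base minimum; use hosted inference

# An RTX 3060 (12 GB) clears Base but not Ultra; an RTX 4090 (24 GB)
# is the consumer-grade floor for Ultra.
print(pick_sd4_tier(12))   # SD4 Base
print(pick_sd4_tier(24))   # SD4 Ultra
print(pick_sd4_tier(8))    # cloud API
```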
Why This Matters
The Stable Diffusion 4 release matters for three reasons that extend beyond raw image quality.
First, open weights at frontier quality. No other model in SD4 Ultra's quality tier offers open weights with self-hosting and fine-tuning rights. Midjourney v7, DALL-E 4, and Adobe Firefly 4 are all closed APIs. For studios that need to fine-tune on proprietary assets, self-host for data privacy, or generate at print-scale resolutions, SD4 Ultra is the only enterprise-grade option in this quality class.
Second, the professional workflow impact. VFX studios can generate photorealistic texture references and concept sketches with physically plausible lighting. Game developers can fine-tune SD4 Base on proprietary art styles and generate consistent assets at scale — something closed-API models cannot replicate. Advertising agencies benefit from the improved text rendering for promotional materials that require legible product names and taglines embedded in the image.
Third, competitive market dynamics. SD4 Ultra's launch is a direct response to Flux 1.1's momentum in the open-weight community. By matching or exceeding Flux on photorealism benchmarks while offering a more commercially structured licensing model, Stability is positioning itself as the infrastructure layer for the next generation of creative AI tools.
Caption: How SD4 Ultra positions itself in the 2026 image generation market — the only open-weight model in the frontier quality tier.
How It Compares
SD4 Ultra enters a crowded field. Here's how it stacks up against the three biggest competitors:
| Feature | SD4 Ultra | Midjourney v7 | DALL-E 4 | Adobe Firefly 4 |
|---|---|---|---|---|
| Photorealism | Best-in-class | Excellent (aesthetic bias) | Good | Good |
| Text in images | Strong | Moderate | Best-in-class | Good |
| Anatomy/hands | ~87% correct | ~80% correct | ~82% correct | ~79% correct |
| Max resolution | 4096×4096 | 2048×2048 | 2048×2048 | 2048×4096 |
| Open weights | Yes | No | No | No |
| Fine-tuning/LoRA | Yes | No | No | No |
| Self-hosting | Yes | No | No | No |
Midjourney v7 still leads for illustrators and concept artists who prioritize aesthetic coherence over raw realism. DALL-E 4 wins on text integration and benefits from seamless ChatGPT integration. Adobe Firefly 4 owns the commercially safe lane with fully licensed training data and Creative Cloud integration. Read our full Midjourney vs Stable Diffusion comparison for a deeper dive.
Expert Reaction
Industry response to SD4 has been cautiously positive, with the open-weight commitment drawing the strongest praise.
A senior VFX supervisor at a major visual effects house described SD4 Ultra as "the first open model I'd trust to hand to a junior artist without a lengthy brief on its failure modes." The anatomy and hand improvements are specifically relevant for production work where artifacts are immediately visible to trained eyes.
The developer community has been quick to note that SD4's value depends heavily on the ecosystem that forms around it. ControlNet, IP-Adapter, and LoRA support for the new DiT architecture will determine whether SD4 achieves the same community-driven quality explosion that SDXL enjoyed. Stability has signaled that official ControlNet variants for SD4 Ultra are coming, which would accelerate adoption significantly.
Not all reaction has been positive. The artist community raised immediate concerns about the training data sourcing. Stability's opt-out system requires creators to actively remove their work from future training runs — a structure critics argue places the burden on individual artists rather than on the company building the model. Stability has disclosed that SD4's training data includes licensed stock photography, but LAION-derived data still forms a significant portion of the corpus. Legal proceedings around AI training data are ongoing and will likely force more explicit licensing frameworks within 12–18 months.
Timeline and Availability
Stability AI is rolling out SD4 in phases:
- April 2026 (now): SD4 Ultra available via Stability AI's API and the Stable Assistant platform. SD4 Base weights released for community download under the community license.
- Late April 2026: SD4 Ultra weights released for self-hosting. Enterprise licensing conversations open for studios and platform builders.
- May–June 2026: Official ControlNet variants for SD4 architecture. Community fine-tuned models and LoRA ecosystem expected to expand rapidly.
- H2 2026 (planned): SD4-Video extension in development, targeting open-weight photorealistic video generation.
The community license covers individuals and businesses under $1M annual revenue for non-commercial and limited commercial use. Enterprise licensing for larger organizations or high-volume applications is handled through direct sales.
How to Get Early Access
You can access SD4 right now through two paths:
- API access: Sign up at platform.stability.ai — credit-based pricing at $0.01 per credit. SD4 Ultra calls are available immediately.
- Self-hosting: Download SD4 Base weights from Stability's GitHub repository under the community license. SD4 Ultra weights will be available for download later in April.
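As a sketch of what an API call might look like: the endpoint path, parameter names, and placeholder key below are assumptions modeled on Stability's existing REST API conventions, not documented SD4 values — check the official API reference at platform.stability.ai before relying on any of them. The request is constructed but not sent:

```python
import json
import urllib.request

# Hypothetical endpoint -- verify against the official API docs.
API_URL = "https://api.stability.ai/v2beta/stable-image/generate/sd4-ultra"

def build_request(prompt: str, api_key: str, width: int = 4096,
                  height: int = 4096) -> urllib.request.Request:
    """Construct (but do not send) a text-to-image generation request."""
    payload = {
        "prompt": prompt,       # assumed parameter names
        "width": width,
        "height": height,
        "output_format": "png",
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode(),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Accept": "image/*",
            "Content-Type": "application/json",
        },
        method="POST",
    )

req = build_request("storefront sign reading 'OPEN LATE'", api_key="sk-demo")
print(req.full_url)
```

To actually send it you would pass `req` to `urllib.request.urlopen` (or use a client library) and handle the binary image response.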
For enterprise teams needing on-premise deployment, contact Stability AI's enterprise sales team directly for licensing terms and support SLAs.
Next Steps
If you're evaluating whether SD4 fits your workflow, here's what to watch:
- Community fine-tunes will arrive within weeks of the Base weights release — these often outperform the base model on specific use cases. Monitor Civitai and Hugging Face for SD4 LoRA releases.
- ControlNet compatibility is the gating factor for professional adoption. Once official ControlNet variants land, SD4 becomes viable for production VFX and game development workflows.
- Pricing clarity for enterprise licensing is still pending. If you're building a commercial product on SD4, start the licensing conversation now rather than waiting for published rates.
Check our DALL-E 3 review and best AI image generators guide for broader context on how SD4 fits into the current landscape.
Frequently Asked Questions
Can I use SD4 commercially without paying for an enterprise license?
Yes, if your annual revenue is under $1M. The community license covers limited commercial use for individuals and small businesses. Above that threshold, or for high-volume applications, you need the enterprise license. Always check Stability's official licensing terms before building a commercial product on SD4.
What GPU do I need to run SD4 Ultra locally?
SD4 Ultra requires a minimum of 24GB VRAM — an RTX 4090 is the consumer-grade floor. For comfortable high-resolution generation, an A100 (40GB or 80GB) or H100 is recommended. SD4 Base is far more accessible: 12GB VRAM (RTX 3060 or better) is sufficient.
How does SD4 Ultra compare to Flux 1.1?
Both use DiT-based architectures and produce strong photorealistic results. Initial benchmarks suggest SD4 Ultra has a slight edge on lighting accuracy and high-resolution coherence, while Flux 1.1 benefits from a more mature community ecosystem with more available fine-tunes. Expect this comparison to shift as SD4 community models emerge.
Is the artist opt-out system retroactive?
No. The opt-out system only affects future training runs. If your work was already included in SD4's training data, opting out does not remove its influence from the current model weights. It only prevents inclusion in the next training cycle.
Conclusion
Stable Diffusion 4 is the most technically ambitious release from Stability AI since the original model — and the first credible open-weight challenger to the closed-source dominance of Midjourney and DALL-E at the frontier quality tier. The DiT architecture, 4K native resolution, and photorealism gains are genuine improvements, not marketing inflation. The practical question isn't whether SD4 Ultra produces better images than SD3.5 (it clearly does), but whether Stability AI can sustain the organizational momentum to keep pace with well-funded closed-source competitors over multiple model generations. For now, the open-weight ecosystem has its flagship back. Try SD4 Ultra via the Stability API or download SD4 Base weights to start experimenting.