---
license: apache-2.0
tags:
  - text-to-image
  - ultra-realistic
  - stable-diffusion
  - moe
  - segmoe
pinned: true
library_name: diffusers
---

# SegMoE-2x1-v0: Segmind Mixture of Diffusion Experts

![image/png](https://cdn-uploads.huggingface.co/production/uploads/62f8ca074588fe31f4361dae/TJTQyN9tav94fVcvpZGq8.png)

SegMoE-2x1-v0 is an untrained Segmind Mixture of Diffusion Experts model, generated with [segmoe](https://github.com/segmind/segmoe) from 2 expert SDXL models. SegMoE is a framework for dynamically combining Stable Diffusion models into a Mixture of Experts within minutes, without training. It allows larger models to be created on the fly, offering broader knowledge, better prompt adherence, and better image quality.

## Usage

This model can be used via the [segmoe](https://github.com/segmind/segmoe) library.

Make sure to install segmoe by running:

```bash
pip install segmoe
```

```python
from segmoe import SegMoEPipeline

# Load the mixture-of-experts pipeline from the Hugging Face Hub onto the GPU.
pipeline = SegMoEPipeline("segmind/SegMoE-2x1-v0", device="cuda")

prompt = "cosmic canvas, orange city background, painting of a chubby cat"
negative_prompt = "nsfw, bad quality, worse quality"

# Generate a single 1024x1024 image and take the first result.
img = pipeline(
    prompt=prompt,
    negative_prompt=negative_prompt,
    height=1024,
    width=1024,
    num_inference_steps=25,
    guidance_scale=7.5,
).images[0]
img.save("image.png")
```

![image/png](https://cdn-uploads.huggingface.co/production/uploads/62f8ca074588fe31f4361dae/ofIz_6VehCHRlpsfrxwFm.png)

### Config

The config used to create this model is:

```yaml
base_model: SG161222/RealVisXL_V3.0
num_experts: 2
moe_layers: all
num_experts_per_tok: 1
experts:
  - source_model: frankjoshua/juggernautXL_v8Rundiffusion
    positive_prompt: "aesthetic, cinematic, hands, portrait, photo, illustration, 8K, hyperdetailed, origami, man, woman, supercar"
    negative_prompt: "(worst quality, low quality, normal quality, lowres, low details, oversaturated, undersaturated, overexposed, underexposed, grayscale, bw, bad photo, bad photography, bad art:1.4), (watermark, signature, text font, username, error, logo, words, letters, digits, autograph, trademark, name:1.2), (blur, blurry, grainy), morbid, ugly, asymmetrical, mutated malformed, mutilated, poorly lit, bad shadow, draft, cropped, out of frame, cut off, censored, jpeg artifacts, out of focus, glitch, duplicate, (airbrushed, cartoon, anime, semi-realistic, cgi, render, blender, digital art, manga, amateur:1.3), (3D ,3D Game, 3D Game Scene, 3D Character:1.1), (bad hands, bad anatomy, bad body, bad face, bad teeth, bad arms, bad legs, deformities:1.3)"
  - source_model: SG161222/RealVisXL_V3.0
    positive_prompt: "cinematic, portrait, photograph, instagram, fashion, movie, macro shot, 8K, RAW, hyperrealistic, ultra realistic,"
    negative_prompt: "(octane render, render, drawing, anime, bad photo, bad photography:1.3), (worst quality, low quality, blurry:1.2), (bad teeth, deformed teeth, deformed lips), (bad anatomy, bad proportions:1.1), (deformed iris, deformed pupils), (deformed eyes, bad eyes), (deformed face, ugly face, bad face), (deformed hands, bad hands, fused fingers), morbid, mutilated, mutation, disfigured"
```

### Other Variants

We release 3 merges on Hugging Face. The other two are:

- [SegMoE 4x2](https://huggingface.co/segmind/SegMoE-4x2-v0) has four expert models.
- [SegMoE SD 4x2](https://huggingface.co/segmind/SegMoE-sd-4x2-v0) has four Stable Diffusion 1.5 expert models.

## Comparison

Prompt understanding seems to improve, as shown in the images below. From left to right: SegMoE-2x1-v0, SegMoE-4x2-v0, and the base model ([RealVisXL_V3.0](https://huggingface.co/SG161222/RealVisXL_V3.0)).

![image](https://github.com/segmind/segmoe/assets/95569637/bcdc1b11-bbf5-4947-b6bb-9f745ff0c040)
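
A note on the `num_experts: 2` and `num_experts_per_tok: 1` settings in the config: assuming `num_experts_per_tok` has its usual Mixture of Experts meaning (the number of experts selected per token), the "2x1" in the model name would correspond to two experts with one active at a time. The sketch below illustrates generic top-k gating in plain Python. It is not segmoe's implementation; the function `route_top_k`, the gate logits, and the scalar expert outputs are hypothetical and chosen only for illustration.

```python
import math


def route_top_k(gate_logits, expert_outputs, k=1):
    """Combine expert outputs with generic top-k gating (illustrative only)."""
    if len(gate_logits) != len(expert_outputs):
        raise ValueError("need one gate logit per expert")
    if not 1 <= k <= len(gate_logits):
        raise ValueError("k must be between 1 and the number of experts")
    # Indices of the k experts with the highest gate scores.
    top = sorted(
        range(len(gate_logits)), key=lambda i: gate_logits[i], reverse=True
    )[:k]
    # Softmax over the selected experts only, shifted by the max for stability.
    peak = max(gate_logits[i] for i in top)
    weights = [math.exp(gate_logits[i] - peak) for i in top]
    total = sum(weights)
    # Weighted sum of the selected experts' outputs.
    return sum(w / total * expert_outputs[i] for w, i in zip(weights, top))


# Two experts, one active per token (hypothetical scores and outputs).
result = route_top_k(gate_logits=[0.2, 1.5], expert_outputs=[10.0, 20.0], k=1)
print(result)
```

With `k=1` the result is simply the output of the highest-scoring expert, so only one expert contributes to each token under this simplified view of routing.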