CSDN Global

A stylized poster style for text-to-image

Realistic Vision v5.0 with VAE

Change the fps of a video without changing its length or speed

SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution

stylegan3 + clip

Updated to OpenVoice v2: Versatile Instant Voice Cloning

Notus-7b-v1 model

Source: llamas-community/LlamaGuard-7b ✦ Quant: TheBloke/LlamaGuard-7B-AWQ ✦ Llama-Guard is a 7B parameter Llama 2-based input-output safeguard model

Fast image interpolation model

lightweight text-to-speech (TTS) model, trained on 10.5K hours of audio data

SDXL Image Blending

nsdxl

Stable Diffusion 3 with Differential Diffusion inpainting (experimental)

First version of my panorama LoRA (use 1024x512)

Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer

Mama ママ 2.0 Shinsei Galverse Anime-themed text-to-image model

Audio-based Lip Synchronization for Talking Head Video

SDXL LoRA I trained on chihuahua images

ReNoise: Real Image Inversion Through Iterative Noising

High accuracy depth maps from pairs of stereo images