CSDN Global

sdxl-meow@cuuupid

make meow emojis!

xtts-v1@pagebrain

Voice cloning with just a 3-second audio clip

deoldify_video@arielreplicate

Add colours to old video footage.

rphello@goodtome

this is a first model

remove_bg@zylim0702

Best Human detection and Object Detection Background removal.

pydantic-pixray@zeke

A fork of pixray/pixray for trying out Cog's new Predictor API

t2i-adapter@cjwbw

Learning Adapters towards Controllable for Text-to-Image Diffusion Models

solar-10.7b-instruct-v1.0@tomasmcm

Source: upstage/SOLAR-10.7B-Instruct-v1.0 ✦ Quant: TheBloke/SOLAR-10.7B-Instruct-v1.0-AWQ ✦ Elevating Performance with Upstage Depth UP Scaling!

mustango@declare-lab

Controllable Text-to-Music Generation

majicmix-realistic-sd-webui@speshiou

Utilize the capabilities of SD WebUI, including Hires. fix and plenty of extensions (e.g. ADetailer)

cog-a1111-ui@brewwh

A collection of anime stable diffusion models with VAEs and LORAs.

claude2-alpaca-13b@tomasmcm

Source: umd-zhou-lab/claude2-alpaca-13B ✦ Quant: TheBloke/claude2-alpaca-13B-AWQ ✦ This model is trained by fine-tuning llama-2 with claude2 alpaca data

multi-controlnet-x-consistency-decoder-x-realestic-vision-v5@usamaehsan

Multi-Controlnet + consistency-decoder + INPAINTING + realestic-vision-v5 + Prompt-Weight + Single-Controlnet

sdxl-cats@hunterkamerman

A fine-tuned SDXL LoRA trained on cats being human like

disco-diffusion-style@cjwbw

Disco Diffusion style on Stable Diffusion via Dreambooth

playground-v2@lucataco

Playground v2 is a diffusion-based text-to-image generative model trained from scratch. Try out all 3 models here

entropy-lol@bryantanjw

LoRA + Iterative 4x Upscale ComfyUI Workflow

all-in-one-music-structure-analyzer@sakemin

Cog implementation of mir-aidj(Taejun Kim)'s 'All-In-One Music Structure Analyzer'

titanet-large@adirik

Performs speaker identity verification

ccsr@csslc

Improving the Stability of Diffusion Models for Content Consistent Super-Resolution