Make meow emojis!
Voice cloning with just a 3-second audio clip
Add colours to old video footage.
This is a first model.
Best human detection, object detection, and background removal.
A fork of pixray/pixray for trying out Cog's new Predictor API
Learning Adapters towards Controllable Text-to-Image Diffusion Models
Source: upstage/SOLAR-10.7B-Instruct-v1.0 ✦ Quant: TheBloke/SOLAR-10.7B-Instruct-v1.0-AWQ ✦ Elevating Performance with Upstage Depth UP Scaling!
Controllable Text-to-Music Generation
Utilize the capabilities of SD WebUI, including Hires. fix and plenty of extensions (e.g. ADetailer)
A collection of anime Stable Diffusion models with VAEs and LoRAs.
Source: umd-zhou-lab/claude2-alpaca-13B ✦ Quant: TheBloke/claude2-alpaca-13B-AWQ ✦ This model is trained by fine-tuning llama-2 with claude2 alpaca data
Multi-ControlNet + consistency-decoder + inpainting + realistic-vision-v5 + Prompt-Weight + Single-ControlNet
A fine-tuned SDXL LoRA trained on images of cats acting human-like
Disco Diffusion style on Stable Diffusion via Dreambooth
Playground v2 is a diffusion-based text-to-image generative model trained from scratch. Try out all 3 models here
LoRA + Iterative 4x Upscale ComfyUI Workflow
Cog implementation of the 'All-In-One Music Structure Analyzer' by mir-aidj (Taejun Kim)
Performs speaker identity verification
Improving the Stability of Diffusion Models for Content Consistent Super-Resolution