CSDN Global

realistic-vision-hyper@fofr

A fast high quality SD 1.5 model, Realistic Vision V6.0 B1 Hyper

dolphin-2.1-mistral-7b@lucataco

Mistral-7B-v0.1 fine tuned for chat with the Dolphin dataset (an open-source implementation of Microsoft's Orca)

open-sora@camenduru

Open-Sora is a work-in-progress model.

video-crafter@lucataco

Open diffusion model for high-quality video generation

msclap@pipi32167

Caption an audio

text2video@pschaldenbrand

Method for generating bizarre looking videos from a series of language descriptions of the video. From the Bot Intelligence Group at CMU: Peter Schaldenbrand, Zhixuan Liu, & Jean Oh

sdxl-hidden-faces@fofr

SDXL fine-tuned on pareidolia

magnet@lucataco

MAGNeT: Masked Audio Generation using a Single Non-Autoregressive Transformer

controlnet-pose@jagilley

Modify images with humans using pose detection

wizard-vicuna-13b-uncensored@lucataco

This is wizard-vicuna-13b trained with a subset of the dataset - responses that contained alignment / moralizing were removed

iwan-baan-sdxl@cbh123

Fine-tuned SDXL on my favorite architectural photographer, Iwan Baan

pixray@dribnet

Pixray with custom settings

cogagent-chat@cjwbw

A Visual Language Model for GUI Agents

controlnet_2-1@rossjillian

ControlNet with SD 2.1

music-label@pengdaqian2020

music label

mvdream@adirik

Generate 3D assets using text descriptions

instruct-pix2pix@arielreplicate

Edit images with human instructions

amt@pollinations

Video Smoother: AMT All-Pairs Multi-Field Transforms for Efficient Frame Interpolation

ip_adapter-face@lucataco

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate SDv1.5 images with an image prompt

sdxl-cat@peter65374

human-like cat sdxl-lora model