clip-interrogator@pharmapsychotic

The CLIP Interrogator is a prompt engineering tool that combines OpenAI's CLIP and Salesforce's BLIP to optimize text prompts to match a given image. Use the resulting prompts with text-to-image models like Stable Diffusion to create cool art!

cybertruck@hudsongraeme

SDXL trained on a small Cybertruck dataset

sdxl-tnmt@copilot-us

An attempt to render Teenage Mutant Ninja Turtles: Mutant Mayhem-like images

sdxl-ironface@allenzsh

SDXL fine-tuned to generate portraits wearing an Iron Man helmet

dreambooth-avatar@cjwbw

DreamBooth fine-tuning of Stable Diffusion (v1.5.1) on the Avatar art style, by Lambda Labs

animatediff-lightning-4-step@camenduru

AnimateDiff-Lightning: Cross-Model Diffusion Distillation

zero-shot-image-to-text@yoadtew

Image-to-text generation

zeroscope-v2-xl@anotherjesse

Zeroscope V2 XL & 576w

urpm-v1.3@asiryan

URPM V1.3 Model (Text2Img, Img2Img and Inpainting)

dreambooth-batch@anotherjesse

Batch inference for DreamBooth trainings

multilingual-e5-base@adirik

Multilingual E5-base language embedding model

image-urls-to-video@chigozienri

Take a list of image URLs as frames and output a video

obsidian-3b-v0.5@tomasmcm

Source: NousResearch/Obsidian-3B-V0.5 ✦ World's smallest multimodal LLM

sdxl-akira@doriandarko

SDXL model trained on the cult movie AKIRA

frame-interpolation@google-research

Frame Interpolation for Large Scene Motion

styletts2@adirik

Generates speech from text

docentr@cjwbw

End-to-End Document Image Enhancement Transformer

controlnet-seg@jagilley

Modify images using semantic segmentation

chatglm3-6b@nomagick

A 6B-parameter open-source bilingual chat LLM

codellama-13b@meta

A 13-billion-parameter Llama model tuned for code completion