flan-t5-large@daanelson

A language model for tasks like classification, summarization, and more.

musetalk@douwantech

Real-Time High Quality Lip Synchronization with Latent Space Inpainting

glid-3@nicholascelestin

Generate images quickly with GLID-3 (non-xl)

photorealistic-fx-controlnet@batouresearch

ControlNet implementation for RunDiffusion's PhotorealisticFX model.

evolved-seeker-1.3b@tomasmcm

Source: TokenBender/evolvedSeeker_1_3 ✦ Quant: TheBloke/evolvedSeeker_1_3-AWQ ✦ A fine-tuned version of deepseek-ai/deepseek-coder-1.3b-base on 50k instructions for 3 epochs

crm@camenduru

CRM: Single Image to 3D Textured Mesh with Convolutional Reconstruction Model

openjourney-v4@prompthero

SD 1.5 trained with +124k MJv4 images by PromptHero

pix2struct@cjwbw

Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding

gdmjp4@gymdreams8

Paintings in the style of selected artists with weights, from the Construction Series of GymDreams8.

audiosr-long-audio@sakemin

Versatile Audio Super-resolution at Scale which upsamples audio files to 48khz. Longer audio input is possible with this model

metavoice@camenduru

MetaVoice-1B: 1.2B parameter base model trained on 100K hours of speech

diffmorpher@cjwbw

Diffusion Models for Image Morphing

bokeh_prediction@zylim0702

Bokeh Prediction, a hybrid bokeh rendering framework that combines a neural renderer with a classical approach. It generates high-resolution, adjustable bokeh effects from a single image and potentially imperfect disparity maps.

mamba-1.4b@adirik

Base version of Mamba 1.4B, a 1.4 billion parameter state space language model

realistic-vision-v5.1@lucataco

Implementation of Realistic Vision v5.1 with VAE

classic-anim-diffusion@nitrosocke

Animation Studio on Stable Diffusion via Dreambooth

yi-6b-chat@01-ai

The Yi series models are large language models trained from scratch by developers at 01.AI.

local-prompt-mixing@adirik

Generating object-level shape variations with Stable Diffusion

minigpt-4_vicuna-7b@nelsonjchen

MiniGPT-4 w/ Vicuna-7B (Image Question/Captioning Use)

realistic-background@wolverinn

replace background with Stable Diffusion and ControlNet