CSDN Global

flan-t5-large@daanelson

A language model for tasks like classification, summarization, and more.

musetalk@douwantech

Real-Time High Quality Lip Synchronization with Latent Space Inpainting

glid-3@nicholascelestin

Generate images quickly with GLID-3 (non-xl)

photorealistic-fx-controlnet@batouresearch

ControlNet implementation for RunDiffusion's PhotorealisticFX model.

evolved-seeker-1.3b@tomasmcm

Source: TokenBender/evolvedSeeker_1_3 ✦ Quant: TheBloke/evolvedSeeker_1_3-AWQ ✦ A fine-tuned version of deepseek-ai/deepseek-coder-1.3b-base on 50k instructions for 3 epochs

crm@camenduru

CRM: Single Image to 3D Textured Mesh with Convolutional Reconstruction Model

openjourney-v4@prompthero

SD 1.5 trained with +124k MJv4 images by PromptHero

pix2struct@cjwbw

Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding

gdmjp4@gymdreams8

Paintings in the style of selected artists with weights, from the Construction Series of GymDreams8.

audiosr-long-audio@sakemin

Versatile Audio Super-resolution at Scale which upsamples audio files to 48khz. Longer audio input is possible with this model

metavoice@camenduru

MetaVoice-1B: 1.2B parameter base model trained on 100K hours of speech

Bokeh Prediction, a hybrid bokeh rendering framework that combines a neural renderer with a classical approach. It generates high-resolution, adjustable bokeh effects from a single image and potentially imperfect disparity maps.