ecommerce-virtual-try-on@wolverinn

Virtual try-on using Stable Diffusion and IP-Adapter

styleclip@orpatashnik

Text-Driven Manipulation of StyleGAN Imagery

anima-pencil-v310@aicapcut

Generate anime-style image

hasdx@cjwbw

mixed stable diffusion model

photomaker@mbukerepo

PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding

magifactory-t-shirt-diffusion@cjwbw

Generate t-shirt logos with stable-dfffusion

anything-v4.5@asiryan

Anything V4.5 Model (Text2Img, Img2Img and Inpainting)

music-arousal-valence@mtg

Regression of musical arousal and valence values

sensei-7b-v1@tomasmcm

Source: SciPhi/Sensei-7B-V1 ✦ Quant: TheBloke/Sensei-7B-V1-AWQ ✦ Sensei is specialized in performing RAG over detailed web search results

deeplabv3@humanvideointeraction

Image Segmentation with DeepLabv3

kosmos-2@lucataco

Grounding Multimodal Large Language Models to the World

imagebind@daanelson

A model for text, audio, and image embeddings in one space

pytsmod@sakemin

PyTSMod is an open-source library for Time-Scale Modification(eg. time-stretching) algorithms, by Sangeon Yong at MAC Lab, KAIST.

free-vc@jagilley

Change voice for spoken text

ri@simbrams

Realistic Inpainting with ControlNET (M-LSD + SEG)

camerabooth-openpose-style@joetm

StableDiffusion 1.4 + T2IAdapter (ControlNet) with style and openpose adapters + two upscaling passes with Real-ESRGAN

material-maker@midllle

AI generated Normal maps, Displacement maps, and Roughness maps

japanese-stable-diffusion@rinnakk

Japanese-specific latent text-to-image diffusion model

lama@twn39

🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022

daffa@daffaakhlaric2424

highist resolutioin image