📖 PuLID: Pure and Lightning ID Customization via Contrastive Alignment
The fastest image generation model tailored for local development and personal use
An 8 billion parameter language model from Meta, fine-tuned for chat completions
Best-in-class clothing virtual try-on in the wild (non-commercial use only)
Face Swap
Turn a face into 3D, emoji, pixel art, video game, claymation or toy
A 70 billion parameter language model from Meta, fine-tuned for chat completions
Coqui XTTS-v2: Multilingual Text To Speech Voice Cloning
Stylized Audio-Driven Single Image Talking Face Animation
Accelerated transcription, word-level timestamps and diarization with whisperX large-v3 for large audio files
Adding semantic labels for Segment Anything
Lightweight Bimodal Network for Single-Image Super-Resolution via Symmetric CNN and Recursive Transformer
BetterUp Stable Diffusion XL Image Model
AuraSR: GAN-based Super-Resolution for real-world
Detect and simplify the contours of a binary image
Modify images using HED maps
One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale
Remove eyeglasses and shadows from a photo
A 34 billion parameter Llama tuned for coding and conversation
CarAI: Evaluate Car Damages