Generative
-
LCM-LoRA speeds up text-to-image generation with Stable Diffusion models
LCM-LoRA is a universal Stable Diffusion acceleration module that can speed up the…
-
DiagrammerGPT generates better diagrams using LLMs
DiagrammerGPT is a new framework that uses large language models (LLMs) to generate…
-
4K4D uses AI to create high-fidelity 4D views of dynamic scenes
4K4D is a new AI method for generating high-quality and real-time images of…
-
PIXART-α: 10x faster text-to-image diffusion model with state-of-the-art results
PIXART-α is a new text-to-image (T2I) diffusion model based on the transformer architecture…
-
FreeU: a simple method to boost diffusion model’s performance with no extra cost
FreeU is a new technique to improve the quality of images and videos…
-
Thousands of free and open audiobooks using synthetic speech from Project Gutenberg, Microsoft, and MIT
A research team from Project Gutenberg, Microsoft, and MIT has developed a system…
-
Ernie Bot, Baidu’s generative AI tool that rivals ChatGPT, is now open to the public
Baidu, one of China’s leading AI companies, has made its large language model, Ernie…
-
Word-As-Image for Semantic Typography (SIGGRAPGH 2023 technical paper awards)
Word-As-Image is a novel and creative way to make semantic typography, where the letters…
-
GestureDiffuCLIP: Gesture Diffusion Model with CLIP Latents (SIGGRAPH 2023 technical paper awards)
GestureDiffuClip is a new framework that can create realistic and expressive body movements…
-
TokenFlow: make high-quality video edits from text prompts using diffusion features
TokenFlow is a framework for text-based video editing that leverages a pre-trained text-to-image…