Computer vision

StoryDiffusion creates coherent comics and videos from text

StoryDiffusion is a new model for generating long-range stories through a coherent series…

May 21, 2024
AniPortrait generates animations from portraits and audio

AniPortrait is a new framework that creates dynamic and expressive animated portraits from…

April 7, 2024
TripoSR creates detailed 3D objects from single images in split seconds

TripoSR is a new open-source 3D modeling tool that reconstructs 3D objects from…

March 26, 2024
Google DeepMind’s SIMA, a generalist AI gaming partner

Google DeepMind’s new Scalable Instructable Multiworld Agent (SIMA) is a cutting-edge AI that…

March 18, 2024
OOTDiffusion creates realistic virtual try-on results using latent diffusion

OOTDiffusion (Outfitting over Try-on Diffusion) is an innovative model for image-based virtual try-on…

March 14, 2024
YOLOv9, the latest breakthrough in real-time object detection

YOLOv9 is a new version of YOLO (You Only Look Once), a powerful…

March 7, 2024
InstantID generates identity-preserving images in seconds

InstantID is a fast method for generating customized human faces with various poses…

February 3, 2024
StreamDiffusion is a new AI model for real-time image generation

StreamDiffusion is a new diffusion pipeline specifically tailored for real-time image generation. It…

January 13, 2024
SMERF is an AI tool for real-time rendering of large scenes

SMERF is a new fast and high-quality method for creating realistic 3D images…

December 26, 2023
LucidDreamer generates 3D scenes from any text, RGB, or RGBD inputs

LucidDreamer is a generative tool that can create highly realistic 3D scenes from…

December 13, 2023