Computer vision
-

StoryDiffusion creates coherent comics and videos from text
StoryDiffusion is a new model for generating long-range stories through a coherent series…
-

AniPortrait generates animations from portraits and audio
AniPortrait is a new framework that creates dynamic and expressive animated portraits from…
-

TripoSR creates detailed 3D objects from single images in split seconds
TripoSR is a new open-source 3D modeling tool that reconstructs 3D objects from…
-

Google DeepMind’s SIMA, a generalist AI gaming partner
Google DeepMind’s new Scalable Instructable Multiworld Agent (SIMA) is a cutting-edge AI that…
-

OOTDiffusion creates realistic virtual try-on results using latent diffusion
OOTDiffusion (Outfitting over Try-on Diffusion) is an innovative model for image-based virtual try-on…
-

YOLOv9, the latest breakthrough in real-time object detection
YOLOv9 is a new version of YOLO (You Only Look Once), a powerful…
-

InstantID generates identity-preserving images in seconds
InstantID is a fast method for generating customized human faces with various poses…
-

StreamDiffusion is a new AI model for real-time image generation
StreamDiffusion is a new diffusion pipeline specifically tailored for real-time image generation. It…
-

SMERF is an AI tool for real-time rendering of large scenes
SMERF is a new fast and high-quality method for creating realistic 3D images…
-

LucidDreamer generates 3D scenes from any text, RGB, or RGBD inputs
LucidDreamer is a generative tool that can create highly realistic 3D scenes from…