Computer vision
-
LivePortrait, a fast and free AI tool to animate portraits
LivePortrait is an AI-powered tool that creates lifelike animations from portraits. Simply provide…
-
Magic Insert, the new style-aware drag-and-drop technology from Google
Magic Insert is a new method proposed by Google that lets you drag-and-drop…
-
Depth Anything V2, a highly capable depth estimation model
Depth Anything V2 is a new powerful monocular depth estimation model, delivering significantly…
-
YOLOv10, a faster and more accurate object detection model
YOLOv10 is a recent advancement in real-time object detection YOLO models that achieves…
-
Grounding DINO 1.5, a powerful open-set object detection model
Grounding DINO 1.5 is a series of powerful open-set object detection models capable…
-
StoryDiffusion creates coherent comics and videos from text
StoryDiffusion is a new model for generating long-range stories through a coherent series…
-
AniPortrait generates animations from portraits and audio
AniPortrait is a new framework that creates dynamic and expressive animated portraits from…
-
TripoSR creates detailed 3D objects from single images in split seconds
TripoSR is a new open-source 3D modeling tool that reconstructs 3D objects from…
-
Google DeepMind’s SIMA, a generalist AI gaming partner
Google DeepMind’s new Scalable Instructable Multiworld Agent (SIMA) is a cutting-edge AI that…
-
OOTDiffusion creates realistic virtual try-on results using latent diffusion
OOTDiffusion (Outfitting over Try-on Diffusion) is an innovative model for image-based virtual try-on…