Computer vision
-
TripoSR creates detailed 3D objects from single images in split seconds
TripoSR is a new open-source 3D modeling tool that reconstructs 3D objects from…
-
Google DeepMind’s SIMA, a generalist AI gaming partner
Google DeepMind’s new Scalable Instructable Multiworld Agent (SIMA) is a cutting-edge AI that…
-
OOTDiffusion creates realistic virtual try-on results using latent diffusion
OOTDiffusion (Outfitting over Try-on Diffusion) is an innovative model for image-based virtual try-on…
-
YOLOv9, the latest breakthrough in real-time object detection
YOLOv9 is a new version of YOLO (You Only Look Once), a powerful…
-
InstantID generates identity-preserving images in seconds
InstantID is a fast method for generating customized human faces with various poses…
-
StreamDiffusion is a new AI model for real-time image generation
StreamDiffusion is a new diffusion pipeline specifically tailored for real-time image generation. It…
-
SMERF is an AI tool for real-time rendering of large scenes
SMERF is a new fast and high-quality method for creating realistic 3D images…
-
LucidDreamer generates 3D scenes from any text, RGB, or RGBD inputs
LucidDreamer is a generative tool that can create highly realistic 3D scenes from…
-
Google launches Gemini, its most advanced AI model
On December 6, 2023, Google launched Gemini, a cutting-edge multimodal AI model that…
-
LCM-LoRA speeds up text-to-image generation with Stable Diffusion models
LCM-LoRA is a universal Stable Diffusion acceleration module that can speed up the…