Computer vision

Grounding DINO 1.5, a powerful open-set object detection model

Grounding DINO 1.5 is a series of powerful open-set object detection models capable…

May 27, 2024

StoryDiffusion creates coherent comics and videos from text

StoryDiffusion is a new model for generating long-range stories through a coherent series…

May 21, 2024

AniPortrait generates animations from portraits and audio

AniPortrait is a new framework that creates dynamic and expressive animated portraits from…

April 7, 2024

TripoSR creates detailed 3D objects from single images in split seconds

TripoSR is a new open-source 3D modeling tool that reconstructs 3D objects from…

March 26, 2024

Google DeepMind’s SIMA, a generalist AI gaming partner

Google DeepMind’s new Scalable Instructable Multiworld Agent (SIMA) is a cutting-edge AI that…

March 18, 2024

OOTDiffusion creates realistic virtual try-on results using latent diffusion

OOTDiffusion (Outfitting over Try-on Diffusion) is an innovative model for image-based virtual try-on…

March 14, 2024

YOLOv9, the latest breakthrough in real-time object detection

YOLOv9 is a new version of YOLO (You Only Look Once), a powerful…

March 7, 2024

InstantID generates identity-preserving images in seconds

InstantID is a fast method for generating customized human faces with various poses…

February 3, 2024

StreamDiffusion is a new AI model for real-time image generation

StreamDiffusion is a new diffusion pipeline specifically tailored for real-time image generation. It…

January 13, 2024

SMERF is an AI tool for real-time rendering of large scenes

SMERF is a new fast and high-quality method for creating realistic 3D images…

December 26, 2023
