Bookmark: ImageBind: One Embedding Space To Bind Them All

lqdev👽05/09/2023

https://ai.facebook.com/blog/imagebind-six-modalities-binding-ai/

...ImageBind, the first AI model capable of binding information from six modalities. The model learns a single embedding, or shared representation space, not just for text, image/video, and audio, but also for sensors that record depth (3D), thermal (infrared radiation), and inertial measurement units (IMU), which calculate motion and position.

Paper
Demo
Code

Permalink: /feed/imagebind/

Tags: #ai #embeddings

Back to feed

Send me a message or webmention