Young
Young|Sep 04, 2025 01:52
The development of multimodal foundation models relies heavily on OpenAI's CLIP vision encoder. It was so exciting to see the release of OpenVision in May, and now the even more powerful and scalable OpenVision2 (by @cihangxie's team) is here. That's so cooooooooool! 🚀