AI News
These articles are AI-generated summaries. Please check the original sources for full details. (Page 145 of 206)
AI NewsMultimodal AIComputer Vision
Meta AI Open-Sourced Perception Encoder Audiovisual (PE-AV): The Audiovisual Encoder Powering SAM Audio And Large Scale Multimodal Retrieval
Meta AI released PE-AV, a multimodal encoder achieving state-of-the-art performance on audio and video benchmarks with a 10.4 R@1 improvement on AudioCaps.
Read more