Machine Learning
277 articles in this category (Page 4 of 12)
Hugging Face Releases TRL v1.0: A Unified Post-Training Stack for SFT, Reward Modeling, DPO, and GRPO Workflows
Hugging Face TRL v1.0 standardizes LLM post-training with a unified CLI and config system, delivering up to 2x training speed and a 70% reduction in memory usage.
Alibaba Releases Qwen3.5-Omni: A Native Multimodal Model for Real-Time Audio and Video Interaction
Alibaba Qwen Team unveils Qwen3.5-Omni, a native multimodal model achieving SOTA results on 215 subtasks while supporting 256k long-context audio-visual inputs.
Meta Releases TRIBE v2: A Tri-Modal Foundation Model for High-Resolution fMRI Prediction
Meta’s FAIR team introduces TRIBE v2, a tri-modal foundation model that predicts fMRI responses across video, audio, and text stimuli, achieving a group correlation near 0.4 on the HCP 7T dataset.
Implementing Advanced Differential Equation Solvers and Neural ODEs with Diffrax and JAX
Learn to implement advanced differential equation solvers and Neural ODEs using Diffrax and JAX, featuring adaptive solvers and batched stochastic simulations.
Explainable Causal Reinforcement Learning: Optimizing Precision Oncology Under Real-Time Constraints
Rikin Patel introduces a framework combining Structural Causal Models with Constrained RL to manage oncology workflows, achieving up to 95% confidence in causal moderator effects.