Data Science
Embedding Atlas: Apple’s Open-Source Tool for Exploring Large-Scale Embeddings Locally
Apple introduces Embedding Atlas, an open-source browser-based tool for visualizing and analyzing large-scale embeddings without backend infrastructure, enabling interactive exploration of high-dimensional data.
How Can We Build Scalable and Reproducible Machine Learning Experiment Pipelines Using Meta Research Hydra?
This article explains how to use Meta's Hydra framework to build scalable, reproducible ML experiments with structured configurations, command-line overrides, and multirun sweeps.
Hugging Face AI Sheets Adds Vision Capabilities for Image-Based Data Analysis
Hugging Face releases a significant update to AI Sheets that adds vision support: users can extract data from images, generate visuals from text, and edit images directly within the spreadsheet, all powered by open-source AI models.
Hugging Face Enhances Dataset Streaming for 100x Efficiency
Hugging Face has significantly improved dataset streaming in its 'datasets' and 'huggingface_hub' libraries, enabling faster and more efficient training on large datasets. Key improvements include fewer API requests, faster data resolution, and greater control over streaming pipelines.