Skip to main content
← All Tags

ML & Data Engineering

20 articles in this category

AI NewsDevelopmentML & Data Engineering

GitHub Copilot SDK Technical Preview

GitHub Copilot SDK now available in technical preview, enabling developers to integrate Copilot CLI's engine into their own apps with 100% programmatic access.

Read more
AI NewsSoftware DevelopmentML & Data Engineering

Windsurf Introduces Arena Mode for Comparing AI Models

Windsurf's Arena Mode allows developers to compare large language models side by side, with over 90% of users preferring real-world benchmarking approaches.

Read more
AI NewsComputer VisionML & Data Engineering

Google Enhances Gemini 3 Flash with Agentic Vision

Google adds agentic vision to Gemini 3 Flash, improving accuracy by 5-10% on vision tasks and unlocking new AI-driven behaviors.

Read more
AI NewsDevelopmentML & Data Engineering

OpenCode: AI Coding Agent with Multi-Model Support and Native UI

Open-source AI coding tool OpenCode supports over 75 models, including Claude and OpenAI, with a native terminal-based UI and multi-session support.

Read more
AI NewsML & Data EngineeringArtificial Intelligence

Vercel Releases Skills.sh for Standardized Agent Commands

Vercel's Skills.sh provides AI agents with a standardized way to execute reusable actions through the command line, reaching tens of thousands of installs shortly after launch.

Read more
AI NewsML & Data EngineeringOpenAI

OpenAI's Codex CLI Internals Revealed

OpenAI publishes article series on Codex software development agent, highlighting internals of Codex harness and strategies for managing context and reducing prompt cache misses.

Read more
AI NewsLarge Language ModelsML & Data Engineering

OpenAI's Open Responses Specification Unifies Agentic LLM Workflows

OpenAI's Open Responses standardizes agentic AI workflows, reducing API fragmentation and enabling seamless transitions between proprietary and open-source models with a unified specification.

Read more
AI NewsML & Data EngineeringScience

OpenAI's Prism: A Free LaTeX-Native Workspace with Integrated GPT-5.2

OpenAI releases Prism, a free cloud-based LaTeX workspace with GPT-5.2 integration, offering unlimited projects and collaborators.

Read more
AI NewsLarge Language ModelsML & Data Engineering

Google DeepMind Introduces ATLAS Scaling Laws for Multilingual Language Models

Google DeepMind researchers introduce ATLAS, a set of scaling laws for multilingual language models, revealing that doubling the number of languages requires a 1.18× increase in model size and 1.66× increase in total training data.

Read more
AI NewsAIML & Data Engineering

Google Introduces Nano Banana Pro with Grounded, Multimodal Image Synthesis

Google’s Nano Banana Pro bridges language understanding and image synthesis with real-world accuracy and multilingual text rendering.

Read more
AI NewsML & Data EngineeringBest Practices

Growing and Cultivating Strong Machine Learning Engineers

Vivek Gupta outlines strategies for nurturing ML engineers over 12 years at Microsoft, emphasizing skills like data management and LLM prompt evaluation.

Read more
AI NewsML & Data EngineeringVisual Language Model

Training Data Preprocessing for Text-to-Video Models

Text-to-video models like Runway and Sora rely on high-quality video-text datasets, where preprocessing reduces noise and improves generation accuracy by up to 40%.

Read more
AI NewsAIML & Data Engineering

Apple Releases Pico-Banana-400K Dataset for Text-Guided Image Editing

Apple introduces Pico-Banana-400K, a dataset of 400,000 images for advancing text-guided image editing models, generated using Google's Nano-Banana and filtered with Gemini-2.5-Pro.

Read more
AI NewsSoftware DevelopmentML & Data Engineering

Anthropic Launches Claude Code on Web and Mobile

Anthropic expands the availability of Claude Code, its AI-powered development environment, to web and mobile platforms, enabling developers to write, edit, and execute code directly in a browser or on mobile devices.

Read more
AI NewsAIML & Data Engineering

PyTorch Foundation Expands Open AI Infrastructure with Ray and Monarch

The PyTorch Foundation introduces Ray and PyTorch Monarch at its 2025 conference, advancing distributed AI infrastructure and promoting transparency in foundation model development.

Read more
AI NewsArchitecture & DesignAI

AI Agents Evolve: From Assistance to Execution Engines in Enterprise Architecture

A significant shift is occurring in enterprise software architecture as AI agents transition from providing assistance to autonomously executing tasks. This article details the architectural changes, adoption rates, real-world examples, and key considerations for implementing agentic AI, including governance, transparency, and cost management.

Read more
AI NewsLarge language modelsML & Data Engineering

NVIDIA Unveils OmniVinci: A Research-Focused Multimodal LLM

NVIDIA Research has released OmniVinci, a research-only large language model designed for cross-modal understanding of text, vision, audio, and robotics data. It demonstrates strong performance with a smaller training dataset compared to competitors, but its non-commercial license has sparked debate within the AI community.

Read more
AI NewsArtificial IntelligenceML & Data Engineering

Meta's PyTorch Monarch Simplifies Distributed AI Workflows

Meta's PyTorch team has launched Monarch, an open-source framework simplifying distributed AI workflows across multiple GPUs and machines using a single-controller model.

Read more
AI NewsML & Data EngineeringLarge language models

DeepSeek AI Introduces DeepSeek-OCR: A Novel Approach to Context Compression for LLMs

DeepSeek AI has released DeepSeek-OCR, an open-source system leveraging optical 2D mapping for efficient compression of long text, potentially revolutionizing how large language models handle extensive inputs.

Read more
AI NewsLarge language modelsML & Data Engineering

Google Launches LLM-Evalkit for Data-Driven Prompt Engineering

Google introduces LLM-Evalkit, an open-source framework on Vertex AI SDKs, to standardize and measure prompt engineering for large language models, promoting a data-driven workflow and collaboration.

Read more