Machine Learning
NVIDIA AI Open-Sourced KVzap: A SOTA KV Cache Pruning Method that Delivers near-Lossless 2x-4x Compression
NVIDIA released KVzap, a new KV cache pruning method achieving near-lossless 2x-4x compression, addressing a key bottleneck in long-context LLM deployment.
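To give a sense of why the KV cache is a bottleneck at long context, here is a back-of-the-envelope sketch of its memory footprint. It uses the standard size formula for a transformer KV cache (keys plus values, per layer, per KV head, per token); the model configuration is hypothetical and not taken from the KVzap article, and the code says nothing about how KVzap itself selects entries to prune.

```python
def kv_cache_bytes(num_layers, num_kv_heads, head_dim, seq_len,
                   bytes_per_value=2, batch_size=1):
    """Approximate KV cache size in bytes for a decoder-only transformer.

    The leading factor of 2 accounts for storing both keys and values.
    bytes_per_value=2 assumes fp16/bf16 storage.
    """
    return (2 * num_layers * num_kv_heads * head_dim
            * seq_len * bytes_per_value * batch_size)


# Hypothetical configuration, chosen only for illustration.
full = kv_cache_bytes(num_layers=32, num_kv_heads=8, head_dim=128, seq_len=128_000)

for ratio in (2, 4):
    pruned = full / ratio
    print(f"{ratio}x compression: {full / 2**30:.2f} GiB -> {pruned / 2**30:.2f} GiB")
```

Because the cache grows linearly with sequence length and batch size, a 2x-4x reduction translates directly into longer contexts or larger batches on the same hardware.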
How to Build Portable, In-Database Feature Engineering Pipelines with Ibis Using Lazy Python APIs and DuckDB Execution
Ibis lets you build portable, in-database feature engineering pipelines from lazy Python expressions that execute entirely within DuckDB, which the article describes as a 100% reduction in data transfer overhead.
A Coding Implementation to Build a Unified Apache Beam Pipeline Demonstrating Batch and Stream Processing with Event-Time Windowing Using DirectRunner
Build a unified Apache Beam pipeline that demonstrates both batch and stream processing with event-time windowing on the DirectRunner, all in a single implementation.
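As a rough illustration of what event-time windowing means, the sketch below assigns events to fixed windows by the timestamp they carry rather than by arrival order. It is plain Python, not the Apache Beam API, and the helper names (`fixed_window_start`, `count_per_window`) are hypothetical; it also ignores watermarks, triggers, and late-data handling, which a real pipeline has to deal with.

```python
from collections import defaultdict


def fixed_window_start(event_time_s, window_size_s):
    """Return the start of the fixed window that contains event_time_s."""
    return event_time_s - (event_time_s % window_size_s)


def count_per_window(events, window_size_s=60):
    """Count events per (window_start, key), grouping by event time.

    `events` is an iterable of (event_time_seconds, key) pairs and may
    arrive in any order; only the embedded timestamp decides the window.
    """
    counts = defaultdict(int)
    for event_time_s, key in events:
        window = fixed_window_start(event_time_s, window_size_s)
        counts[(window, key)] += 1
    return dict(counts)


# Events deliberately listed out of order to mimic late arrival.
events = [(5, "a"), (130, "a"), (59, "a"), (61, "b"), (119, "b")]
result = count_per_window(events, window_size_s=60)
print(result)
```

The same grouping logic applies whether the events come from a bounded batch or an unbounded stream, which is the idea behind running both modes through one pipeline definition.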
TII Abu-Dhabi Released Falcon H1R-7B: A New Reasoning Model Outperforming Others in Math and Coding
Technology Innovation Institute (TII) released Falcon-H1R-7B, a 7B-parameter reasoning model that achieves performance comparable to 14B-47B models on math, code, and reasoning benchmarks.
Generative Simulation Benchmarking for precision oncology clinical workflows with inverse simulation verification
A novel methodology combining generative simulation and inverse verification addresses the limitations of traditional AI benchmarking in oncology, improving clinical decision support.