Data Engineering
53 articles in this category (Page 2 of 3)
AI News, Artificial Intelligence, Data Engineering
Beyond the Vector Store: Why Production AI Requires a Relational Data Layer
Production AI applications require a hybrid data layer combining vector databases for semantic retrieval with relational databases to manage permissions, billing, and state with ACID guarantees.
AI News, Machine Learning, Data Engineering
Building Scalable ML Data Pipelines for Image and Structured Data with Daft
Learn how to build an end-to-end ML pipeline using Daft, a Python-native data engine that handles MNIST image reshaping, feature engineering via batch UDFs, and Parquet persistence for high-performance processing.
AI News, Data Engineering, Apache Spark
Agoda Unifies Data Pipelines with Apache Spark to Achieve 95.6% Uptime
Agoda consolidated independent financial data pipelines into a centralized Apache Spark platform, reducing inconsistencies and achieving 95.6% uptime while processing millions of daily transactions.