Incremental Data Processing with Apache Hudi

Overview

Learn to implement incremental data processing with Apache Hudi. Explore change data capture, data deduplication, and data reconciliation techniques with Hudi.

Skills Needed

  • Basic knowledge of distributed systems concepts
  • Understanding of data processing pipelines

Outline

  • Introduction to Incremental Processing
  • Change Data Capture (CDC) Techniques
  • Data Deduplication Strategies
  • Conflict Resolution Mechanisms
  • Handling Late Arriving Data
  • Data Reconciliation Techniques
  • Incremental Data Loading
  • Watermark-based Processing
  • Event-time vs. Processing-time
  • Best Practices for Incremental Processing with Hudi