
Incremental Data Processing with Apache Hudi

Overview

Learn to implement incremental data processing with Apache Hudi. Explore change data capture, data deduplication, and data reconciliation techniques with Hudi.

Skills Needed

  • Basic knowledge of distributed systems concepts
  • Understanding of data processing pipelines

Outline

  • Introduction to Incremental Processing
  • Change Data Capture (CDC) Techniques
  • Data Deduplication Strategies
  • Conflict Resolution Mechanisms
  • Handling Late Arriving Data
  • Data Reconciliation Techniques
  • Incremental Data Loading
  • Watermark-based Processing
  • Event-time vs. Processing-time
  • Best Practices for Incremental Processing with Hudi

dataUology

“We embark on a journey to empower students with the transformative
power of knowledge today so they can be future leaders of tomorrow.”
Contact

(801) 946 5513

contact@datauology.com

© 2024 dataUology
