Data Pipelines with Apache Airflow, 2E by Julian de Ruiter(.PDF)

File Size: 64.3 MB

Data Pipelines with Apache Airflow: Orchestration for data and AI (Final Release), 2nd Edition by Julian de Ruiter, Ismael Cabral, Kris Geusebroek, Daniel van der Ende, Bas Harenslak
Requirements: .PDF reader, 64.3 MB | True PDF
Overview: Simplify, streamline, and scale your data operations with data pipelines built on Apache Airflow. Apache Airflow provides a batteries-included platform for designing, implementing, and monitoring data pipelines. Building pipelines on Airflow eliminates the need for patchwork stacks and homegrown processes, adding security and consistency to the process. Now in its second edition, Data Pipelines with Apache Airflow teaches you to harness this powerful platform to simplify and automate your data pipelines, reduce operational overhead, and seamlessly integrate all the technologies in your stack. Data Pipelines with Apache Airflow, Second Edition teaches you how to build and maintain effective data pipelines. You’ll master every aspect of directed acyclic graphs (DAGs)—the power behind Airflow—and learn to customize them for your pipeline’s specific needs. Part reference and part tutorial, each technique is illustrated with engaging hands-on examples, from training machine learning models for generative AI to optimizing delivery routes. You’ll explore common Airflow usage patterns, including aggregating multiple data sources and connecting to data lakes, while discovering exciting new features such as dynamic scheduling, the Taskflow API, and Kubernetes deployments. For DevOps, data engineers, machine learning engineers, and sysadmins with intermediate Python skills.
Genre: Non-Fiction > Tech & Devices

Free Download links:

https://upfiles.com/NJUxVF9