Data Contracts: Developing Production-Grade Pipelines at Scale by Chad Sanderson (.ePUB, .PDF)
File Size: 19.9 MB
Data Contracts: Developing Production-Grade Pipelines at Scale by Chad Sanderson, Mark Freeman, B.E. Schmidt
Requirements: .ePUB, .PDF reader, 19.9 MB | True PDF, True EPUB
Overview: Poor data quality can cause major problems for data teams, from breaking revenue-generating data pipelines to losing the trust of data consumers. Despite the importance of data quality, many data teams still struggle to avoid these issues—especially when their data is sourced from upstream workflows outside of their control. The solution: data contracts. Data contracts enable high-quality, well-governed data assets by documenting expectations of the data, establishing ownership of data assets, and then automatically enforcing these constraints within the CI/CD workflow. Data contracts are an architecture pattern that enables an agreement between data producers and consumers that is established, updated, and enforced via an API. They’re part of a larger movement called shift left, where you use automation to enable upstream software developers to account for required enforcement pertinent to their domain—this approach was first validated within DevOps and DevSecOps. This practical book introduces data contract architecture with a clear definition of data contracts, explains why the data industry needs them, and shares real-world use cases of data contracts in production. In addition, you’ll learn how to implement components of the data contract architecture and understand how they’re used in the data lifecycle. Finally, you’ll build a case for implementing data contracts in your organization.
Genre: Non-Fiction > Tech & Devices