Error loading page.
Try refreshing the page. If that doesn't work, there may be a network issue, and you can use our self test page to see what's preventing the page from loading.
Learn more about possible network issues or contact support for more help.

Data Pipelines Pocket Reference

ebook

Data pipelines are the foundation for success in data analytics. Moving data from numerous diverse sources and transforming it to provide context is the difference between having data and actually gaining value from it. This pocket reference defines data pipelines and explains how they work in today's modern data stack.

You'll learn common considerations and key decision points when implementing pipelines, such as batch versus streaming data ingestion and build versus buy. This book addresses the most common decisions made by data professionals and discusses foundational concepts that apply to open source frameworks, commercial products, and homegrown solutions.

You'll learn:

  • What a data pipeline is and how it works
  • How data is moved and processed on modern data infrastructure, including cloud platforms
  • Common tools and products used by data engineers to build pipelines
  • How pipelines support analytics and reporting needs
  • Considerations for pipeline maintenance, testing, and alerting

  • Expand title description text
    Publisher: O'Reilly Media

    Kindle Book

    • Release date: February 10, 2021

    OverDrive Read

    • ISBN: 9781492087786
    • Release date: February 10, 2021

    EPUB ebook

    • ISBN: 9781492087786
    • File size: 3288 KB
    • Release date: February 10, 2021

    Formats

    Kindle Book
    OverDrive Read
    EPUB ebook

    Languages

    English

    Data pipelines are the foundation for success in data analytics. Moving data from numerous diverse sources and transforming it to provide context is the difference between having data and actually gaining value from it. This pocket reference defines data pipelines and explains how they work in today's modern data stack.

    You'll learn common considerations and key decision points when implementing pipelines, such as batch versus streaming data ingestion and build versus buy. This book addresses the most common decisions made by data professionals and discusses foundational concepts that apply to open source frameworks, commercial products, and homegrown solutions.

    You'll learn:

  • What a data pipeline is and how it works
  • How data is moved and processed on modern data infrastructure, including cloud platforms
  • Common tools and products used by data engineers to build pipelines
  • How pipelines support analytics and reporting needs
  • Considerations for pipeline maintenance, testing, and alerting

  • Expand title description text