Cloud Scale Analytics with Azure Data Services: Build modern data warehouses on Microsoft Azure
- Length: 520 pages
- Edition: 1
- Language: English
- Publisher: Packt Publishing
- Publication Date: 2021-07-23
- ISBN-10: 1800562934
- ISBN-13: 9781800562936
- Sales Rank: #996897 (See Top 100 Books)
A practical guide to implementing a scalable and fast state-of-the-art analytical data estate
Key Features
- Store and analyze data with enterprise-grade security and auditing
- Perform batch, streaming, and interactive analytics to optimize your big data solutions with ease
- Develop and run parallel data processing programs using real-world enterprise scenarios
Book Description
Azure Data Lake, the modern data warehouse architecture, and related data services on Azure enable organizations to build their own customized analytical platform to fit any analytical requirements in terms of volume, speed, and quality.
This book is your guide to learning all the features and capabilities of Azure data services for storing, processing, and analyzing data (structured, unstructured, and semi-structured) of any size. You will explore key techniques for ingesting and storing data and perform batch, streaming, and interactive analytics. The book also shows you how to overcome various challenges and complexities relating to productivity and scaling. Next, you will be able to develop and run massive data workloads to perform different actions. Using a cloud-based big data-modern data warehouse-analytics setup, you will also be able to build secure, scalable data estates for enterprises. Finally, you will not only learn how to develop a data warehouse but also understand how to create enterprise-grade security and auditing big data programs.
By the end of this Azure book, you will have learned how to develop a powerful and efficient analytical platform to meet enterprise needs.
What you will learn
- Implement data governance with Azure services
- Use integrated monitoring in the Azure Portal and integrate Azure Data Lake Storage into the Azure Monitor
- Explore the serverless feature for ad-hoc data discovery, logical data warehousing, and data wrangling
- Implement networking with Synapse Analytics and Spark pools
- Create and run Spark jobs with Databricks clusters
- Implement streaming using Azure Functions, a serverless runtime environment on Azure
- Explore the predefined ML services in Azure and use them in your app
Who this book is for
This book is for data architects, ETL developers, or anyone who wants to get well-versed with Azure data services to implement an analytical data estate for their enterprise. The book will also appeal to data scientists and data analysts who want to explore all the capabilities of Azure data services, which can be used to store, process, and analyze any kind of data. A beginner-level understanding of data analysis and streaming will be required.
Table of Contents
- Balancing the benefits of Data Lakes over Data Warehouses
- The Modern Data Warehouse and Azure Data Services
- Understanding the Data Lake Storage Layer
- Relational Storage components: Synapse SQL Pools, SQL DB, Azure Databases
- Data integration enterprise grade and even code-free
- Spark on Azure: Synapse Spark Pools
- Spark on Azure: Databricks
- Streaming
- Azure Cognitive Services / Azure Machine Learning
- Machine Learning with Spark on Azure: Synapse Spark Pools / Azure Databricks
- Synapse SQL Pools / Synapse Analytics
- Analysis Service / Power BI / Data Share
- Industry Data Models
- Data Governance