Azure Databricks Cookbook

Azure Databricks Cookbook

Get to grips with building and productionizing end-to-end big data solutions in Azure and learn best practices for working with large datasets Key FeaturesIntegrate with Azure Synapse Analytics, Cosmos DB, and Azure HDInsight Kafka Cluster to scale and analyze your projects and build pipelinesUse Databricks SQL to run ad hoc queries on your data lake and create dashboardsProductionize a solution using CI/CD for deploying notebooks and Azure Databricks Service to various environmentsBook Description Azure Databricks is a unified collaborative platform for performing scalable analytics in an interactive environment. The Azure Databricks Cookbook provides recipes to get hands-on with the analytics process, including ingesting data from various batch and streaming sources and building a modern data warehouse. The book starts by teaching you how to create an Azure Databricks instance within the Azure portal, Azure CLI, and ARM templates. You'll work through clusters in Databricks and explore recipes for ingesting data from sources, including files, databases, and streaming sources such as Apache Kafka and EventHub. The book will help you explore all the features supported by Azure Databricks for building powerful end-to-end data pipelines. You'll also find out how to build a modern data warehouse by using Delta tables and Azure Synapse Analytics. Later, you'll learn how to write ad hoc queries and extract meaningful insights from the data lake by creating visualizations and dashboards with Databricks SQL. Finally, you'll deploy and productionize a data pipeline as well as deploy notebooks and Azure Databricks service using continuous integration and continuous delivery (CI/CD). By the end of this Azure book, you'll be able to use Azure Databricks to streamline different processes involved in building data-driven apps. What you will learnRead and write data from and to various Azure resources and file formatsBuild a modern data warehouse with Delta Tables and Azure Synapse AnalyticsExplore jobs, stages, and tasks and see how Spark lazy evaluation worksHandle concurrent transactions and learn performance optimization in Delta tablesLearn Databricks SQL and create real-time dashboards in Databricks SQLIntegrate Azure DevOps for version control, deploying, and productionizing solutions with CI/CD pipelinesDiscover how to use RBAC and ACLs to restrict data accessBuild end-to-end data processing pipeline for near real-time data analyticsWho this book is for This recipe-based book is for data scientists, data engineers, big data professionals, and machine learning engineers who want to perform data analytics on their applications. Prior experience of working with Apache Spark and Azure is necessary to get the most out of this book.

Download Now
Author
Publisher Packt Publishing Ltd
Release Date
ISBN 178961855X
Pages 452 pages
Rating 4/5 (56 users)

More Books:

Azure Databricks Cookbook
Language: en
Pages: 452
Authors: Phani Raj
Categories: Computers
Type: BOOK - Published: 2021-09-17 - Publisher: Packt Publishing Ltd

GET EBOOK

Get to grips with building and productionizing end-to-end big data solutions in Azure and learn best practices for working with large datasets Key FeaturesInteg
Azure Data Engineering Cookbook
Language: en
Pages: 454
Authors: Ahmad Osama
Categories: Computers
Type: BOOK - Published: 2021-04-05 - Publisher: Packt Publishing Ltd

GET EBOOK

Over 90 recipes to help you orchestrate modern ETL/ELT workflows and perform analytics using Azure services more easily Key FeaturesBuild highly efficient ETL p
Azure Data Engineering Cookbook
Language: en
Pages: 608
Authors: Nagaraj Venkatesan
Categories: Computers
Type: BOOK - Published: 2022-09-26 - Publisher: Packt Publishing Ltd

GET EBOOK

Nearly 80 recipes to help you collect and transform data from multiple sources into a single data source, making it way easier to perform analytics on the data
Azure Data Factory Cookbook
Language: en
Pages: 382
Authors: Dmitry Anoshin
Categories: Computers
Type: BOOK - Published: 2020-12-24 - Publisher: Packt Publishing Ltd

GET EBOOK

With the help of well-structured and practical recipes, this book will teach you how to integrate data from the cloud and on-premise. You’ll learn how to tran
ETL with Azure Cookbook
Language: en
Pages: 446
Authors: Christian Coté
Categories: Computers
Type: BOOK - Published: 2020-09-30 - Publisher: Packt Publishing Ltd

GET EBOOK

Explore the latest Azure ETL techniques both on-premises and in the cloud using Azure services such as SQL Server Integration Services (SSIS), Azure Data Factor
Amazon Redshift Cookbook
Language: en
Pages: 384
Authors: Shruti Worlikar
Categories: Computers
Type: BOOK - Published: 2021-07-23 - Publisher: Packt Publishing Ltd

GET EBOOK

Discover how to build a cloud-based data warehouse at petabyte-scale that is burstable and built to scale for end-to-end analytical solutions Key FeaturesDiscov
Optimizing Databricks Workloads
Language: en
Pages: 230
Authors: Anirudh Kala
Categories: Computers
Type: BOOK - Published: 2021-12-24 - Publisher: Packt Publishing Ltd

GET EBOOK

Accelerate computations and make the most of your data effectively and efficiently on Databricks Key FeaturesUnderstand Spark optimizations for big data workloa
Power Query Cookbook
Language: en
Pages: 412
Authors: Andrea Janicijevic
Categories: Computers
Type: BOOK - Published: 2021-10-15 - Publisher: Packt Publishing Ltd

GET EBOOK

Leverage your source data from hundreds of different connections, perform millions of different transformations, and easily manage highly complex data lifecycle
Distributed Data Systems with Azure Databricks
Language: en
Pages: 414
Authors: Alan Bernardo Palacio
Categories: Computers
Type: BOOK - Published: 2021-05-25 - Publisher: Packt Publishing Ltd

GET EBOOK

Quickly build and deploy massive data pipelines and improve productivity using Azure Databricks Key FeaturesGet to grips with the distributed training and deplo
Machine Learning for Streaming Data with Python
Language: en
Pages: 258
Authors: Joos Korstanje
Categories: Computers
Type: BOOK - Published: 2022-07-15 - Publisher: Packt Publishing Ltd

GET EBOOK

Apply machine learning to streaming data with the help of practical examples, and deal with challenges that surround streaming Key FeaturesWork on streaming use