From the course: Azure Data Engineer Associate (DP-203) Cert Prep: 3 Design and Implement Data Security

Overview

- [Narrator] Welcome to Microsoft Azure, Data Engineering DP-203. This course covers the design, and implementation of data security that prepares you to pass the data engineering exam. Let's go ahead and take a look at what we're going to cover. To start with, this course has two key sections. The first one is the design for security of data policies and standards. Inside of this section, there's several lessons. Let's go ahead and take a look at first how to design a data auditing strategy. So what are the key techniques that we need to pay attention to? Next, how do you design a data masking strategy? Next step, how do you design for data privacy? What are some core principles that are important to pay attention to? Next, have you design a data retention policy followed by design to purge data based on business requirements, for example, if you're in the European Union, there could be different requirements for purging data. Also, how to design Azure role-based access control for resources such as Data Lake Gen2. Also, how do you design row-level and column-level security? A lot of times with a big data system, you can think about data in a column or row level. We'll dive into the details there. In the second section, we cover how to implement data security from a broad level. Let's take a look at the lessons. First, we implement data masking. That's a core feature of data security. Next, we talk about how to encrypt data at both the rest, and also in motion so that you have no gaps in coverage. Then we get into implementing row level and calm level security. We then talk about implementing Azure RBAC. We then get into implementing POSIX-like ACLS for Data Lake Storage Gen2. Finally, we get into a few more topics including implementing a data retention policy, implement a data auditing strategy, managing identities, keys and secrets across different data platform technologies, and also implementing secure endpoints both private and public. We'll also talk about implementing the resource tokens in Azure Databricks, a popular data management platform, and also how to load a data frame with sensitive information and also how to write encrypted data to both tables or Parquet files. We can also get into how to manage sensitive information. One last thing I'll point out is that in order to get started with this course, you should have access to an Azure Sandbox or Azure Free Tier Account. You can get started with just $200 in credit, so it's easy to get started, and it's enough credit to do all of the lessons in this course. Okay, let's go ahead and get started.

Contents