Remove category
article thumbnail

Automated data governance with AWS Glue Data Quality, sensitive data detection, and AWS Lake Formation

AWS Big Data

Data governance is the process of ensuring the integrity, availability, usability, and security of an organization’s data. Due to the volume, velocity, and variety of data being ingested in data lakes, it can get challenging to develop and maintain policies and procedures to ensure data governance at scale for your data lake.

article thumbnail

Top 10 Data Lineage Podcasts, Blogs, and Magazines

Octopai

We have identified the top ten sites, videos, or podcasts online that deal with data lineage. Our list of Top 10 Data Lineage Podcasts, Blogs, and Websites To Follow in 2021. Data Engineering Podcast. This podcast centers around data management and investigates a different aspect of this field each week.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

The DataOps Vendor Landscape, 2021

DataKitchen

Read the complete blog below for a more detailed description of the vendors and their capabilities. This is not surprising given that DataOps enables enterprise data teams to generate significant business value from their data. DVC — Open-source Version Control System for Machine Learning Projects … data version control.

Testing 300
article thumbnail

Data Governance and Strategy for the Global Enterprise

Cloudera

In a recent blog, Cloudera Chief Technology Officer Ram Venkatesh described the evolution of a data lakehouse, as well as the benefits of using an open data lakehouse, especially the open Cloudera Data Platform (CDP). Modern data lakehouses are typically deployed in the cloud.

article thumbnail

New Gartner Category Impacts Data Governance Professionals

TDAN

With the latest SEC developments lighting a fire under the feet of companies and their executives, data governance is increasingly a front-line imperative. The shift is dramatic, with firms now mandated to report material cybersecurity incidents promptly, a move that ties the knot even tighter between cybersecurity and data governance.

article thumbnail

Top 7 Data Governance Blog Posts of 2018

erwin

The driving factors behind data governance adoption vary. Whether implemented as preventative measures (risk management and regulation) or proactive endeavors (value creation and ROI), the benefits of a data governance initiative is becoming more apparent. Defining Data Governance. to Data Governance 2.0

article thumbnail

Data governance beyond SDX: Adding third party assets to Apache Atlas

Cloudera

In this blog, we’ll highlight the key CDP aspects that provide data governance and lineage and show how they can be extended to incorporate metadata for non-CDP systems from across the enterprise. The example 1_typedef-server.json describes the server typedef used in this blog. . Apache Atlas as a fundamental part of SDX.