Remove Data Processing Remove Metadata Remove Reference
article thumbnail

Security Reference Architecture Summary for Cloudera Data Platform

Cloudera

System metadata is reviewed and updated regularly. Services in each zone use a combination of kerberos and transport layer security (TLS) to authenticate connections and APIs calls between the respective host roles, this allows authorization policies to be enforced and audit events to be captured. Sensitive data is encrypted.

article thumbnail

A Reference Architecture for the Cloudera Private Cloud Base Data Platform

Cloudera

All three will be quorums of Zookeepers and HDFS Journal nodes to track changes to HDFS Metadata stored on the Namenodes. CDP is particularly sensitive to host name resolution, therefore it’s vital that the DNS servers have been properly configured and hostnames are fully qualified. Networking . Clocks must also be synchronized.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

How Volkswagen Autoeuropa built a data mesh to accelerate digital transformation using Amazon DataZone

AWS Big Data

Absence of data catalog and metadata management – Data didn’t have any metadata associated with it, and so use cases couldn’t consume the data without further explanation from the data source owners and specialists. In addition, they use generative AI capabilities to generate business metadata.

article thumbnail

How Cargotec uses metadata replication to enable cross-account data sharing

AWS Big Data

In this blog, we discuss the technical challenges faced by Cargotec in replicating their AWS Glue metadata across AWS accounts, and how they navigated these challenges successfully to enable cross-account data sharing. Solution overview Cargotec required a single catalog per account that contained metadata from their other AWS accounts.

article thumbnail

Disaster recovery strategies for Amazon MWAA – Part 2

AWS Big Data

The solution for this post is hosted on GitHub. Backup and restore architecture The backup and restore strategy involves periodically backing up Amazon MWAA metadata to Amazon Simple Storage Service (Amazon S3) buckets in the primary Region. Refer to the detailed deployment steps in the README file to deploy it in your own accounts.

article thumbnail

Configure a custom domain name for your Amazon MSK cluster

AWS Big Data

For the client to resolve DNS queries for the custom domain, an Amazon Route 53 private hosted zone is used to host the DNS records, and is associated with the client’s VPC to enable DNS resolution from the Route 53 VPC resolver. The Kafka client uses the custom domain bootstrap address to send a get metadata request to the NLB.

article thumbnail

Governing data in relational databases using Amazon DataZone

AWS Big Data

This post explains how you can extend the governance capabilities of Amazon DataZone to data assets hosted in relational databases based on MySQL, PostgreSQL, Oracle or SQL Server engines. Second, the data producer needs to consolidate the data asset’s metadata in the business catalog and enrich it with business metadata.