Data Governance, Data Warehouse and Reference

The future of data: A 5-pillar approach to modern data management

CIO Business Intelligence

DECEMBER 11, 2024

To succeed in today’s landscape, every company, whether small, mid-sized, or large, must embrace a data-centric mindset. This article proposes a methodology for organizations to implement a modern data management function that can be tailored to meet their unique needs. However, this landscape is rapidly evolving.

Management

Management Data Governance Data Science Reporting

Seamless integration of data lake and data warehouse using Amazon Redshift Spectrum and Amazon DataZone

AWS Big Data

AUGUST 15, 2024

Unifying these necessitates additional data processing, requiring each business unit to provision and maintain a separate data warehouse. This burdens business units focused solely on consuming the curated data for analysis and not concerned with data management tasks, cleansing, or comprehensive data processing.

Data Lake

Data Lake Data Warehouse Data Governance Publishing

Data governance in the age of generative AI

AWS Big Data

FEBRUARY 29, 2024

Data is your generative AI differentiator, and a successful generative AI implementation depends on a robust data strategy incorporating a comprehensive data governance approach. Data governance is a critical building block across all these approaches, and we see two emerging areas of focus.

Data Governance

Data Governance Unstructured Data Metadata Data Lake

Building end-to-end data lineage for one-time and complex queries using Amazon Athena, Amazon Redshift, Amazon Neptune and dbt

AWS Big Data

DECEMBER 12, 2024

One-time and complex queries are two common scenarios in enterprise data analytics. Complex queries, on the other hand, refer to large-scale data processing and in-depth analysis based on petabyte-level data warehouses in massive data scenarios. Here, data modeling uses dbt on Amazon Redshift.

Snapshot

Snapshot Recreation/Entertainment Experimentation Data Lake

How to rule your data world: The role of data governance

BI-Survey

FEBRUARY 17, 2020

From operational systems to support “smart processes”, to the data warehouse for enterprise management, to exploring new use cases through advanced analytics: all of these environments incorporate disparate systems, each containing data fragments optimized for their own specific task.

Data Governance

Data Governance Data Warehouse Data Quality Data Strategy

Alation 2022.2: Open Data Quality Initiative and Enhanced Data Governance

Alation

MAY 24, 2022

generally available on May 24, Alation introduces the Open Data Quality Initiative for the modern data stack, giving customers the freedom to choose the data quality vendor that’s best for them with the added confidence that those tools will integrate seamlessly with Alation’s Data Catalog and Data Governance application.

Data Quality

Data Quality Data Governance Metadata Metrics

What is a data architect? Skills, salaries, and how to become a data framework master

CIO Business Intelligence

OCTOBER 13, 2023

Solutions data architect: These individuals design and implement data solutions for specific business needs, including data warehouses, data marts, and data lakes. Application data architect: The application data architect designs and implements data models for specific software applications.

Data Architecture

Data Architecture Data Warehouse Statistics Visualization

Digital Transformation in Municipal Government: The Hidden Force Powering Smart Cities

erwin

FEBRUARY 28, 2019

When you think of real-time, data-driven experiences and modern applications to accomplish tasks faster and easier, your local town or city government probably doesn’t come to mind. But municipal government is starting to embrace digital transformation and therefore data governance.

Digital Transformation

Digital Transformation Data Governance Data-driven Data Warehouse

Liberty Mutual CIO Monica Caldas on developing a digital-savvy workforce

CIO Business Intelligence

NOVEMBER 7, 2024

We are still maturing in this capability, but we have fully recognized that we have shared data responsibilities. We have a data office that focuses on data governance, data domain stewardship, and access, and this group sits outside of IT. Our approach is two-pronged. So that’s the journey we’re on.

Insurance

Insurance Experimentation Testing Technology

Amazon DataZone announces custom blueprints for AWS services

AWS Big Data

JUNE 26, 2024

New feature: Custom AWS service blueprints. Previously, Amazon DataZone provided default blueprints that created AWS resources required for data lake, data warehouse, and machine learning use cases. If you’re new to Amazon DataZone, refer to Getting started.

Data Lake

Data Lake Data Warehouse Unstructured Data Data Governance

Centralize near-real-time governance through alerts on Amazon Redshift data warehouses for sensitive queries

AWS Big Data

JUNE 29, 2023

Amazon Redshift is a fully managed, petabyte-scale data warehouse service in the cloud that delivers powerful and secure insights on all your data with the best price-performance. With Amazon Redshift, you can analyze your data to derive holistic insights about your business and your customers.

Data Warehouse

Data Warehouse Dashboards Testing Visualization

Has the Data Warehouse Had Its Day?

BI-Survey

JANUARY 15, 2023

Statements from countless interviews with our customers reveal that the data warehouse is seen as a “black box” by many and understood by few business users. Therefore, it is not clear why the costly and apparently flexibility-inhibiting data warehouse is needed at all. The limiting factor is rather the data landscape.

Data Warehouse

Data Warehouse IT Data Architecture Measurement

Unleash deeper insights with Amazon Redshift data sharing for data lake tables

AWS Big Data

OCTOBER 10, 2024

Amazon Redshift has established itself as a highly scalable, fully managed cloud data warehouse trusted by tens of thousands of customers for its superior price-performance and advanced data analytics capabilities. This allows you to maintain a comprehensive view of your data while optimizing for cost-efficiency.

Data Lake

Data Lake Data Warehouse Recreation/Entertainment Data-driven

Build a secure data visualization application using the Amazon Redshift Data API with AWS IAM Identity Center

AWS Big Data

MARCH 6, 2025

Tens of thousands of customers use Amazon Redshift for modern data analytics at scale, delivering up to three times better price-performance and seven times better throughput than other cloud data warehouses. Refer to IAM Identity Center identity source tutorials for the IdP setup. IAM Identity Center enabled.

Visualization

Visualization Sales Data Warehouse Management

Four Use Cases Proving the Benefits of Metadata-Driven Automation

erwin

FEBRUARY 7, 2019

Organizations cannot hope to make the most of a data-driven strategy without at least some degree of metadata-driven automation. The volume and variety of data have snowballed, and so has its velocity. As such, traditional – and mostly manual – processes associated with data management and data governance have broken down.

Metadata

Metadata Insurance Data-driven Cost-Benefit

Constructing A Digital Transformation Strategy: Putting the Data in Digital Transformation

erwin

JULY 17, 2019

The solution is data intelligence. It improves IT and business data literacy and knowledge, supporting enterprise data governance and business enablement. Organizations need a real-time, accurate picture of the metadata landscape to: Discover data – Identify and interrogate metadata from various data management silos.

Digital Transformation

Digital Transformation Strategy Metadata Data-driven

The Ultimate Guide to Modern Data Quality Management (DQM) For An Effective Data Quality Control Driven by The Right Metrics

datapine

SEPTEMBER 29, 2022

Reporting being part of an effective DQM, we will also go through some data quality metrics examples you can use to assess your efforts in the matter. But first, let’s define what data quality actually is. What is the definition of data quality? Why Do You Need Data Quality Management?

Data Quality

Data Quality Metrics Data-driven Management

Your 5-Step Journey from Analytics to AI

CIO Business Intelligence

MARCH 22, 2022

One option is a data lake—on-premises or in the cloud—that stores unprocessed data in any type of format, structured or unstructured, and can be queried in aggregate. Another option is a data warehouse, which stores processed and refined data. Set up unified data governance rules and processes.

Analytics

Analytics Key Performance Indicator Data Warehouse Data-driven

Benefits of Enterprise Modeling and Data Intelligence Solutions

erwin

JULY 2, 2020

a senior business process management architect at a pharma/biotech company with more than 5,000 employees, erwin Evolve was useful for enterprise architecture reference. As he put it, “We are describing our business process and we are trying to describe our data catalog. Data Modeling with erwin Data Modeler. George H.,

Enterprise

Enterprise Modeling Metadata Data Governance

Access Amazon Redshift data from Salesforce Data Cloud with Zero Copy Data Federation

AWS Big Data

JUNE 25, 2024

This post is co-authored by Vijay Gopalakrishnan, Director of Product, Salesforce Data Cloud. In today’s data-driven business landscape, organizations collect a wealth of data across various touch points and unify it in a central data warehouse or a data lake to deliver business insights.

Data Lake

Data Lake Cost-Benefit Data-driven Data Warehouse

AWS Lake Formation 2022 year in review

AWS Big Data

JANUARY 31, 2023

Data governance is the collection of policies, processes, and systems that organizations use to ensure the quality and appropriate handling of their data throughout its lifecycle for the purpose of generating business value.

Data Lake

Data Lake Data Governance Data Architecture Machine Learning

How gaming companies can use Amazon Redshift Serverless to build scalable analytical applications faster and easier

AWS Big Data

MARCH 7, 2023

Flexible and easy to use – The solutions should provide less restrictive, easy-to-access, and ready-to-use data. A data hub is a center of data exchange that constitutes a hub of data repositories and is supported by data engineering, data governance, security, and monitoring services.

Analytics

Analytics Data Warehouse Data Lake Metadata

Implement data quality checks on Amazon Redshift data assets and integrate with Amazon DataZone

AWS Big Data

AUGUST 15, 2024

Data producers (data owners) can add context and control access through predefined approvals, providing secure and governed data sharing. To learn more about the core components of Amazon DataZone, refer to Amazon DataZone terminology and concepts.

Data Quality

Data Quality Visualization Metadata Key Performance Indicator

Getting started guide for near-real time operational analytics using Amazon Aurora zero-ETL integration with Amazon Redshift

AWS Big Data

JUNE 28, 2023

For more details, refer to the What’s New Post. There are two broad approaches to analyzing operational data for these use cases: Analyze the data in-place in the operational database (e.g. For this illustration, we use a provisioned Aurora database and an Amazon Redshift Serverless data warehouse.

Data Warehouse

Data Warehouse Analytics Metrics Dashboards

5 Ways Data Engineers Can Support Data Governance

Alation

JANUARY 26, 2023

These data requirements could be satisfied with a strong data governance strategy. Governance can — and should — be the responsibility of every data user, though how that’s achieved will depend on the role within the organization. How can data engineers address these challenges directly?

Data Governance

Data Governance Strategy Data Quality Data Collection

Improve healthcare services through patient 360: A zero-ETL approach to enable near real-time data analytics

AWS Big Data

MARCH 27, 2024

The solution uses AWS services such as AWS HealthLake, Amazon Redshift, Amazon Kinesis Data Streams, and AWS Lake Formation to build a 360 view of patients. You can send data from your streaming source to this resource for ingesting the data into a Redshift data warehouse.

Data Analytics

Data Analytics Analytics Data Warehouse Data Lake

Do I Need a Data Catalog?

erwin

JUNE 26, 2020

It’s no surprise that most organizations’ data is often fragmented and siloed across numerous sources (e.g., legacy systems, data warehouses, flat files stored on individual desktops and laptops, and modern, cloud-based repositories). Business Metadata.

Metadata

Metadata Cost-Benefit Measurement Data-driven

Design a data mesh pattern for Amazon EMR-based data lakes using AWS Lake Formation with Hive metastore federation

AWS Big Data

JUNE 10, 2024

In this post, we delve into the key aspects of using Amazon EMR for modern data management, covering topics such as data governance, data mesh deployment, and streamlined data discovery. Organizations have multiple Hive data warehouses across EMR clusters, where the metadata gets generated.

Data Lake

Data Lake Metadata Data Warehouse Data Processing

IBM and AWS Create a Path to Modernization Via Industry-Specific Solutions

CIO Business Intelligence

OCTOBER 13, 2022

The deliverables could be reference architectures or an industry-specific proof of concept—the goal is to offer institutional knowledge and near-turn-key solutions meant to streamline modernization and accelerate time-to-value.

Insurance

Insurance Data Warehouse Manufacturing Forecasting

How EchoStar ingests terabytes of data daily across its 5G Open RAN network in near real-time using Amazon Redshift Serverless Streaming Ingestion

AWS Big Data

JULY 8, 2024

Amazon Redshift Serverless is a fully managed, scalable cloud data warehouse that accelerates your time to insights with fast, simple, and secure analytics at scale. Amazon Redshift data sharing allows you to share data within and across organizations, AWS Regions, and even third-party providers, without moving or copying the data.

Data Warehouse

Data Warehouse IT Recreation/Entertainment Cost-Benefit

Your Effective Roadmap To Implement A Successful Business Intelligence Strategy

datapine

FEBRUARY 22, 2022

A business intelligence strategy refers to the process of implementing a BI system in your company. This should also include creating a plan for data storage services. Are the data sources going to remain disparate? Or does building a data warehouse make sense for your organization? Define a budget.

Business Intelligence

Business Intelligence Strategy Cost-Benefit Dashboards

Why Spreadsheets Are Your Secret Weapon for Efficient Data Governance

Alation

APRIL 6, 2023

Data governance is traditionally applied to structured data assets that are most often found in databases and information systems. This blog focuses on governing spreadsheets that contain data, information, and metadata, and must themselves be governed.

Data Governance

Data Governance Metadata Cost-Benefit Structured Data

The Top Six Benefits of Data Modeling – What Is Data Modeling?

erwin

SEPTEMBER 25, 2020

With each stage of data modeling, the data model becomes more information- and context-rich. A conceptual data model is a rough draft, containing the relevant concepts or entities and the relationships between them. A logical data model, also referred to as information modeling, is the second stage of data modeling.

Modeling

Modeling Cost-Benefit Visualization Data Warehouse

Cross-account data collaboration with Amazon DataZone and AWS analytical tools

AWS Big Data

MARCH 5, 2025

In this solution (as shown in the preceding figure), the AWS account that contains the data assets is referred to as the producer account. The AWS account that needs to access or use the data from the producer account is referred to as the consumer account. You will then publish the data assets from these data sources.

Analytics

Analytics Publishing Metadata Sales

Certified technical partner solutions help customers succeed with Cloudera Data Platform

Cloudera

AUGUST 26, 2020

Talend’s data management environment running on Cloudera Data Platform enables you to create and execute Hadoop and Spark integration jobs, process and reconcile Big Data, and implement data governance processes using an intuitive drag-and-drop interface. Reference Architectures for CDP Private Cloud Base.

Machine Learning

Machine Learning Big Data Data Warehouse Data-driven

Implement tag-based access control for your data lake and Amazon Redshift data sharing with AWS Lake Formation

AWS Big Data

JULY 21, 2023

This leads to having data across many instances of data warehouses and data lakes using a modern data architecture in separate AWS accounts. We recently announced the integration of Amazon Redshift data sharing with AWS Lake Formation. S3 data lake – Contains the web activity and leads datasets.

Data Lake

Data Lake Data Warehouse Marketing Management

How HPE Aruba Supply Chain optimized cost and performance by migrating to an AWS modern data architecture

AWS Big Data

SEPTEMBER 11, 2024

Source systems: Aruba’s source repository includes data from three different operating regions in AMER, EMEA, and APJ, along with one worldwide (WW) data pipeline from varied sources like SAP S/4 HANA, Salesforce, Enterprise Data Warehouse (EDW), Enterprise Analytics Platform (EAP), SharePoint, and more.

Data Architecture

Data Architecture Optimization Data Warehouse Metadata

Get started with the new Amazon DataZone enhancements for Amazon Redshift

AWS Big Data

JULY 29, 2024

Amazon DataZone is a powerful data management service that empowers data engineers, data scientists, product managers, analysts, and business users to seamlessly catalog, discover, analyze, and govern data across organizational boundaries, AWS accounts, data lakes, and data warehouses.

Data Warehouse

Data Warehouse Sales Metadata Publishing

Orchestrate an end-to-end ETL pipeline using Amazon S3, AWS Glue, and Amazon Redshift Serverless with Amazon MWAA

AWS Big Data

APRIL 25, 2024

Amazon Managed Workflows for Apache Airflow (Amazon MWAA) is a managed orchestration service for Apache Airflow that you can use to set up and operate data pipelines in the cloud at scale. Apache Airflow is an open source tool used to programmatically author, schedule, and monitor sequences of processes and tasks, referred to as workflows.

Metadata

Metadata Data Processing Management Testing

Governing data in relational databases using Amazon DataZone

AWS Big Data

MAY 7, 2024

Data governance is a key enabler for teams adopting a data-driven culture and operational model to drive innovation with data. Amazon DataZone allows you to simply and securely govern end-to-end data assets stored in your Amazon Redshift data warehouses or data lakes cataloged with the AWS Glue data catalog.

Metadata

Metadata Data Lake Data Processing Data-driven

Migrate data from Azure Blob Storage to Amazon S3 using AWS Glue

AWS Big Data

OCTOBER 20, 2023

Prerequisites: You need a storage account in Microsoft Azure and your data path in Azure Blob Storage. For instructions, refer to Create a storage account shared key. For instructions, refer to Creating ETL jobs with AWS Glue Studio. Prepare the storage account credentials in advance.

Data Lake

Data Lake Big Data Data Warehouse Consulting

Five actionable steps to GDPR compliance (Right to be forgotten) with Amazon Redshift

AWS Big Data

JULY 28, 2023

Organizations must comply with these requests provided that there are no legitimate grounds for retaining the personal data, such as legal obligations or contractual requirements. Amazon Redshift is a fully managed, petabyte-scale data warehouse service in the cloud. Tags provide metadata about resources at a glance.

Snapshot

Snapshot Metadata Measurement Data Warehouse

Create an end-to-end data strategy for Customer 360 on AWS

AWS Big Data

MARCH 26, 2024

In this post, we discuss how you can use purpose-built AWS services to create an end-to-end data strategy for C360 to unify and govern customer data that address these challenges. This consolidated view acts as a liaison between the data platform and customer-centric applications.

Data Strategy

Data Strategy Strategy Data Warehouse Prescriptive Analytics

Leveraging AI to discover and classify your data in a complex and dynamic landscape

Laminar Security

DECEMBER 13, 2023

They offer a comprehensive solution to enhance your cloud security posture and effectively manage your data. The primary focus of discovery is to find all the places where data exists and identify the assets it resides in. It helps in determining what data you have and its sensitivity.

Data-driven

Data-driven Machine Learning Risk Deep Learning
