Data collections are the ones and zeroes that encode the actionable insights (patterns, trends, relationships) that we seek to extract from our data through machine learning and data science. The insights are used to produce informative content for stakeholders (decision-makers, business users, and clients).
Enterprise data is brought into data lakes and data warehouses to carry out analytical, reporting, and data science use cases using AWS analytical services like Amazon Athena, Amazon Redshift, Amazon EMR, and so on. Table metadata is fetched from AWS Glue. Can it also help write SQL queries?
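To make the Glue step concrete, here is a minimal boto3 sketch that fetches a table definition from the Glue Data Catalog; the database and table names ("sales_db", "orders") are hypothetical placeholders.

```python
import boto3

# Credentials and region are taken from the environment.
glue = boto3.client("glue")

# Fetch the table definition that engines such as Athena, Redshift Spectrum,
# and EMR resolve at query time.
response = glue.get_table(DatabaseName="sales_db", Name="orders")  # hypothetical names
table = response["Table"]

print(table["Name"], table["StorageDescriptor"]["Location"])
for column in table["StorageDescriptor"]["Columns"]:
    print(f"{column['Name']}: {column['Type']}")
```

The column names and types returned here are exactly the schema context a SQL-generation assistant would need.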
You can learn how to query Delta Lake native tables through UniForm from different data warehouses or engines, such as Amazon Redshift, as an example of expanding data access to more engines. Both Delta Lake and Iceberg metadata files reference the same data files.
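As a rough sketch of how a table becomes readable from both ecosystems, the following PySpark snippet enables UniForm on a Delta table so Iceberg metadata is generated alongside the Delta log; the table name is hypothetical and Delta Lake 3.x is assumed.

```python
from pyspark.sql import SparkSession

# Assumes Delta Lake 3.x is on the classpath.
spark = (
    SparkSession.builder
    .config("spark.sql.extensions", "io.delta.sql.DeltaSparkSessionExtension")
    .config("spark.sql.catalog.spark_catalog",
            "org.apache.spark.sql.delta.catalog.DeltaCatalog")
    .getOrCreate()
)

# UniForm writes Iceberg metadata alongside the Delta transaction log, so
# Iceberg-aware engines can read the same underlying data files.
spark.sql("""
    CREATE TABLE sales.orders (order_id BIGINT, amount DOUBLE)  -- hypothetical table
    USING DELTA
    TBLPROPERTIES (
      'delta.enableIcebergCompatV2' = 'true',
      'delta.universalFormat.enabledFormats' = 'iceberg'
    )
""")
```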
Amazon Redshift is a fully managed, AI-powered cloud data warehouse that delivers the best price-performance for your analytics workloads at any scale. It enables you to get insights faster without extensive knowledge of your organization’s complex database schema and metadata. Your data is not shared across accounts.
This post was co-written with Dipankar Mazumdar, Staff Data Engineering Advocate with AWS Partner OneHouse. Data architecture has evolved significantly to handle growing data volumes and diverse workloads. The synchronization process in XTable works by translating table metadata using the existing APIs of these table formats.
When an organization’s data governance and metadata management programs work in harmony, everything is easier. Data governance is a complex but critical practice. There’s always more data to handle, much of it unstructured; more data sources, like IoT; more points of integration; and more regulatory compliance requirements.
Metadata management is key to wringing all the value possible from data assets. However, most organizations don’t use all the data at their disposal to reach deeper conclusions about how to drive revenue, achieve regulatory compliance, or accomplish other strategic objectives. What is metadata?
Amazon Redshift, launched in 2013, has undergone significant evolution since its inception, allowing customers to expand the horizons of data warehousing and SQL analytics. It offers industry-leading price-performance: up to three times better than alternative cloud data warehouses.
As part of the Talent Intelligence Platform, Eightfold also exposes a data hub where each customer can access their Amazon Redshift-based data warehouse and perform ad hoc queries as well as schedule queries for reporting and data export. Many customers have implemented Amazon Redshift to support multi-tenant applications.
Organizations cannot hope to make the most of a data-driven strategy without at least some degree of metadata-driven automation. The volume and variety of data have snowballed, and so has their velocity. As such, traditional, and mostly manual, processes associated with data management and data governance have broken down.
BladeBridge offers a comprehensive suite of tools that automate much of the complex conversion work, allowing organizations to quickly and reliably transition their data analytics capabilities to the scalable Amazon Redshift data warehouse.
What enables you to use all those gigabytes and terabytes of data you’ve collected? Metadata is the pertinent, practical details about data assets: what they are, what to use them for, what to use them with. Without metadata, data is just a heap of numbers and letters collecting dust. Where does metadata come from?
Unlocking the true value of data often gets impeded by siloed information. Traditional data management—wherein each business unit ingests raw data in separate data lakes or warehouses—hinders visibility and cross-functional analysis. Business units access clean, standardized data.
For container terminal operators, data-driven decision-making and efficient data sharing are vital to optimizing operations and boosting supply chain efficiency. In addition to real-time analytics and visualization, the data needs to be shared for long-term data analytics and machine learning applications.
Metadata is an important part of data governance, and as a result, most nascent data governance programs are rife with project plans for assessing and documenting metadata. But in many scenarios, it seems that the underlying driver of metadata collection projects is that it’s just something you do for data governance.
Effective decision-making processes in business are dependent upon high-quality information. That’s a fact in today’s competitive business environment, which requires agile access to a data storage warehouse organized in a manner that improves business performance and delivers fast, accurate, and relevant data insights.
Once the province of the data warehouse team, data management has increasingly become a C-suite priority, with data quality seen as key for both customer experience and business performance. But along with siloed data and compliance concerns, poor data quality is holding back enterprise AI projects.
We also examine how centralized, hybrid, and decentralized data architectures support scalable, trustworthy ecosystems. As data-centric AI, automated metadata management, and privacy-aware data sharing mature, the opportunity to embed data quality into the enterprise’s core has never been more significant.
Amazon Redshift is a fast, petabyte-scale, cloud data warehouse that tens of thousands of customers rely on to power their analytics workloads. With its massively parallel processing (MPP) architecture and columnar data storage, Amazon Redshift delivers high price-performance for complex analytical queries against large datasets.
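To illustrate, here is a minimal sketch of an analytical query issued through the redshift_connector driver; the host, database, credentials, and sales table are all placeholders.

```python
import redshift_connector

conn = redshift_connector.connect(
    host="my-cluster.abc123.us-east-1.redshift.amazonaws.com",  # placeholder
    database="dev",
    user="awsuser",
    password="...",  # prefer IAM-based auth in practice
)

cursor = conn.cursor()
# Columnar storage scans only the referenced columns, and the MPP
# architecture spreads the aggregation across slices.
cursor.execute("""
    SELECT region, SUM(amount) AS total
    FROM sales
    GROUP BY region
    ORDER BY total DESC
""")
for region, total in cursor.fetchall():
    print(region, total)
```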
Given the value this sort of data-driven insight can provide, the reason organizations need a data catalog should become clearer. It’s no surprise that most organizations’ data is often fragmented and siloed across numerous sources. There are three types of metadata in a data catalog, beginning with technical metadata.
These nodes can implement analytical platforms like data lakehouses, data warehouses, or data marts, all united by producing data products. The Institutional Data & AI platform adopts a federated approach to data while centralizing the metadata to facilitate simpler discovery and sharing of data products.
Like any good puzzle, metadata management comes with a lot of complex variables. That’s why you need to use data dictionary tools, which can help organize your metadata into an archive that can be navigated with ease and from which you can derive good information to power informed decision-making.
In this blog, we will share with you in detail how Cloudera integrates core compute engines, including Apache Hive and Apache Impala, in Cloudera Data Warehouse with Iceberg. We will publish follow-up blogs for other data services. The idea is to store information about the deleted records in so-called delete files.
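As a minimal sketch of the mechanism, assuming an Iceberg-enabled Spark session and a hypothetical catalog and table (Hive and Impala in Cloudera Data Warehouse expose the same table semantics):

```python
from pyspark.sql import SparkSession

# Assumes an Iceberg catalog named "my_catalog" is already configured.
spark = SparkSession.builder.getOrCreate()

# Iceberg v2 with merge-on-read records deletions in delete files instead of
# rewriting the affected data files.
spark.sql("""
    ALTER TABLE my_catalog.db.events SET TBLPROPERTIES (
      'format-version' = '2',
      'write.delete.mode' = 'merge-on-read'
    )
""")

# This DELETE writes a delete file referencing the removed rows; readers merge
# it with the data files at scan time.
spark.sql("DELETE FROM my_catalog.db.events WHERE event_id = 42")
```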
Amazon SageMaker Lakehouse provides an open data architecture that reduces data silos and unifies data across Amazon Simple Storage Service (Amazon S3) data lakes, Redshift data warehouses, and third-party and federated data sources, and is supported in AWS Glue 5.0.
Nowadays, we no longer use the term DD/DS, but “data catalog” or simply “metadata system”.
With “Empowering Investor Success” as the core motto, Morningstar aims to provide our investors and advisors with the tools and information they need to make informed investment decisions. We realized we needed a data warehouse to cater to all of these consumer requirements, so we evaluated Amazon Redshift.
Their activities can have a direct impact on the ability of the profit center to increase revenue by providing the right information at the right time to management to make the right decisions. There are two primary reasons for this: Manual data discovery and manual data lineage. Money Loser #1: Manual Data Discovery.
Data is processed to generate information, which can be later used for creating better business strategies and increasing the company’s competitive edge. It’s obvious that you’ll want to use big data, but it’s not so obvious how you’re going to work with it. Preserve information: Keep your raw data raw.
Solutions data architect: These individuals design and implement data solutions for specific business needs, including data warehouses, data marts, and data lakes. Application data architect: The application data architect designs and implements data models for specific software applications.
The data you’ve collected and saved over the years isn’t free. If storage costs are escalating in a particular area, you may have found a good source of dark data. Analyze your metadata. If you’ve yet to implement data governance, this is another great reason to get moving quickly. Information landscapes are complex.
First we had data warehouses, then came data lakes, and now the new kid on the block is the data lakehouse. But what is a data lakehouse, and why should we develop one?
In order to help maintain data privacy while validating and standardizing data for use, the IDMC platform offers a Data Quality Accelerator for Crisis Response.
This blog is intended to give an overview of the considerations you’ll want to make as you build your Redshift data warehouse to ensure you are getting optimal performance. This results in fewer joins between the metric data in fact tables and the dimensions. So let’s dive in! OLTP vs. OLAP.
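As a minimal sketch of that idea, with hypothetical table and column names: a denormalized dimension plus distribution and sort keys keeps the common join to a single hop.

```python
import redshift_connector

conn = redshift_connector.connect(
    host="my-cluster.abc123.us-east-1.redshift.amazonaws.com",  # placeholder
    database="dev", user="awsuser", password="...",
)

# Small, denormalized dimension: DISTSTYLE ALL copies it to every node, so
# joins against it never require data movement.
dim_ddl = """
CREATE TABLE dim_customer (
    customer_key BIGINT,
    name         VARCHAR(256),
    segment      VARCHAR(64),
    region       VARCHAR(64)
) DISTSTYLE ALL
"""

# Fact table: DISTKEY co-locates rows that join on customer_key, and the
# SORTKEY keeps date-range scans cheap.
fact_ddl = """
CREATE TABLE fact_sales (
    sale_date    DATE      SORTKEY,
    customer_key BIGINT    DISTKEY,
    amount       DECIMAL(12, 2)
)
"""

cursor = conn.cursor()
for statement in (dim_ddl, fact_ddl):
    cursor.execute(statement)
conn.commit()
```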
It is easier to list the symptoms of a problematic data foundation as they are often pretty clear to business users. To summarise, a problematic data foundation misdirects people to make suboptimal business decisions due to incorrect data and information. What does a sound, intelligent data foundation give you?
While traditional extract, transform, and load (ETL) processes have long been a staple of data integration due to their flexibility, for common use cases such as replication and ingestion they often prove time-consuming, complex, and less adaptable to the fast-changing demands of modern data architectures.
However, few organizations truly understand their data or know how to consistently maximize its value. If your business is like most, you collect and analyze some data from a subset of sources to make product improvements, enhance customer service, reduce expenses, and inform other, mostly tactical decisions.
For cross-account data sharing, both the producer and consumer clusters must be encrypted. See the Cluster encryption section of the data sharing considerations documentation for more information about the encryption process. Data sharing is supported only for provisioned RA3 node types (ra3.16xlarge, ra3.4xlarge, and ra3.xlplus).
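A minimal sketch of the producer-side setup, assuming placeholder share, schema, table, and consumer account ID, and encrypted RA3 clusters on both sides:

```python
import redshift_connector

conn = redshift_connector.connect(
    host="producer-cluster.abc123.us-east-1.redshift.amazonaws.com",  # placeholder
    database="dev", user="awsuser", password="...",
)

statements = [
    "CREATE DATASHARE sales_share",
    "ALTER DATASHARE sales_share ADD SCHEMA public",
    "ALTER DATASHARE sales_share ADD TABLE public.sales",
    # Cross-account grant: both producer and consumer clusters must be encrypted.
    "GRANT USAGE ON DATASHARE sales_share TO ACCOUNT '123456789012'",
]

cursor = conn.cursor()
for statement in statements:
    cursor.execute(statement)
conn.commit()
```

The consumer account then creates a database from the share rather than copying the data.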
However, with all good things come many challenges, and businesses often struggle with managing their information in the correct way. Oftentimes, the data being collected and used is incomplete or damaged, leading to many other issues that can considerably harm the company. Enter data quality management.
Two of the biggest challenges in creating a successful enterprise architecture initiative are: collecting accurate information on application ecosystems and maintaining the information as application ecosystems change. Data governance provides time-sensitive, current-state architecture information with a high level of quality.
ActionIQ is a leading composable customer data platform (CDP) designed for enterprise brands to grow faster and deliver meaningful experiences for their customers. This post will demonstrate how ActionIQ built a connector for Amazon Redshift to tap directly into your data warehouse and deliver a secure, zero-copy CDP.
But whatever their business goals, in order to turn their invisible data into a valuable asset, they need to understand what they have and to be able to efficiently find what they need. Enter metadata. It enables us to make sense of our data because it tells us what it is and how best to use it.
A data catalog benefits organizations in a myriad of ways. With the right data catalog tool, organizations can automate enterprise metadata management – including data cataloging, data mapping, data quality and code generation for faster time to value and greater accuracy for data movement and/or deployment projects.
This matters because, as George H. said, “By placing the data and the metadata into a model, which is what the tool does, you gain the abilities for linkages between different objects in the model, linkages that you cannot get on paper or with Visio or PowerPoint.” Data Modeling with erwin Data Modeler.