Data Governance, Data Lake and Visualization

Expanding data analysis and visualization options: Amazon DataZone now integrates with Tableau, Power BI, and more

AWS Big Data

OCTOBER 30, 2024

Amazon DataZone has launched authentication support through the Amazon Athena JDBC driver, allowing data users to seamlessly query their subscribed data lake assets via popular business intelligence (BI) and analytics tools like Tableau, Power BI, Excel, SQL Workbench, DBeaver, and more.

Visualization

Visualization Data Lake Testing Data Governance

Streamline AI-driven analytics with governance: Integrating Tableau with Amazon DataZone

AWS Big Data

OCTOBER 30, 2024

Amazon DataZone is a data management service that makes it faster and easier for customers to catalog, discover, share, and govern data stored across AWS, on premises, and from third-party sources. When you’re connected, you can query, visualize, and share data—governed by Amazon DataZone—within Tableau.

Analytics

Analytics Visualization Data Governance Data-driven

Webinars

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Data Talks, CFOs Listen: Why Analytics Are Key To Better Spend Management

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

Trending Sources

How EUROGATE established a data mesh architecture using Amazon DataZone

AWS Big Data

JANUARY 15, 2025

Data landscape in EUROGATE and current challenges faced in data governance: The EUROGATE Group is a conglomerate of container terminals and service providers, providing container handling, intermodal transports, maintenance and repair, and seaworthy packaging services. Eliminate centralized bottlenecks and complex data pipelines.

IoT

IoT Machine Learning Metadata Data-driven

Doing Cloud Migration and Data Governance Right the First Time

erwin

OCTOBER 8, 2020

That means your cloud data assets must be available for use by the right people for the right purposes to maximize their security, quality and value. Why You Need Cloud Data Governance. Regulatory compliance is also a major driver of data governance (e.g., GDPR, CCPA, HIPAA, SOX, PCI DSS).

Data Governance

Data Governance Metadata Testing Data Lake

Data governance in the age of generative AI

AWS Big Data

FEBRUARY 29, 2024

Data is your generative AI differentiator, and a successful generative AI implementation depends on a robust data strategy incorporating a comprehensive data governance approach. Data governance is a critical building block across all these approaches, and we see two emerging areas of focus.

Data Governance

Data Governance Unstructured Data Metadata Data Lake

Building end-to-end data lineage for one-time and complex queries using Amazon Athena, Amazon Redshift, Amazon Neptune and dbt

AWS Big Data

DECEMBER 12, 2024

The combination of these three services provides a powerful, comprehensive solution for end-to-end data lineage analysis. In this post, we use dbt for data modeling on both Amazon Athena and Amazon Redshift. This led to the implementation of both Athena on dbt and Amazon Redshift on dbt architectures.

Snapshot

Snapshot Recreation/Entertainment Experimentation Data Lake

Talend Data Fabric Simplifies Data Life Cycle Management

David Menninger's Analyst Perspectives

NOVEMBER 16, 2021

Talend data integration software offers an open and scalable architecture and can be integrated with multiple data warehouses, systems and applications to provide a unified view of all data. Its code generation architecture uses a visual interface to create Java or SQL code.

Management

Management Data Warehouse Data Quality Data Integration

Amazon DataZone introduces OpenLineage-compatible data lineage visualization in preview

AWS Big Data

JULY 8, 2024

We are excited to announce the preview of API-driven, OpenLineage-compatible data lineage in Amazon DataZone to help you capture, store, and visualize lineage of data movement and transformations of data assets on Amazon DataZone. The lineage visualized includes activities inside the Amazon DataZone business data catalog.

Visualization

Visualization Metadata Publishing Sales

Monitoring Apache Iceberg metadata layer using AWS Lambda, AWS Glue, and AWS CloudWatch

AWS Big Data

JULY 29, 2024

In the era of big data, data lakes have emerged as a cornerstone for storing vast amounts of raw data in its native format. They support structured, semi-structured, and unstructured data, offering a flexible and scalable environment for data ingestion from multiple sources.

Metadata

Metadata Snapshot Data Lake Metrics

Build a transactional data lake using Apache Iceberg, AWS Glue, and cross-account data shares using AWS Lake Formation and Amazon Athena

AWS Big Data

APRIL 24, 2023

Building a data lake on Amazon Simple Storage Service (Amazon S3) provides numerous benefits for an organization. However, many use cases, like performing change data capture (CDC) from an upstream relational database to an Amazon S3-based data lake, require handling data at a record level.

Data Lake

Data Lake Data Governance Machine Learning Cost-Benefit

Data transformation takes flight at Atlanta’s Hartsfield-Jackson airport

CIO Business Intelligence

AUGUST 9, 2024

At Atlanta’s Hartsfield-Jackson International Airport, an IT pilot has led to a wholesale data journey destined to transform operations at the world’s busiest airport, fueled by machine learning and generative AI. He is a very visual person, so our proof of concept collects different data sets and ingests them into our Azure data house.

Data Transformation

Data Transformation Machine Learning Data Lake Dashboards

How ANZ Institutional Division built a federated data platform to enable their domain teams to build data products to support business outcomes

AWS Big Data

DECEMBER 4, 2024

Under the federated mesh architecture, each divisional mesh functions as a node within the broader enterprise data mesh, maintaining a degree of autonomy in managing its data products. These nodes can implement analytical platforms like data lake houses, data warehouses, or data marts, all united by producing data products.

Metadata

Metadata Data Governance Data Quality Data-driven

Analyzing the business-case approach Perdue Farms takes to derive value from data

CIO Business Intelligence

SEPTEMBER 20, 2023

The data can also help us enrich our commodity products. How are you populating your data lake? We’ve decided to take a practical approach, led by Kyle Benning, who runs our data function. Then our analytics team, an IT group, makes sure we build the data lake in the right sequence.

Data Lake

Data Lake Data-driven Dashboards Risk

AWS Lake Formation 2023 year in review

AWS Big Data

JANUARY 18, 2024

AWS Lake Formation and the AWS Glue Data Catalog form an integral part of a data governance solution for data lakes built on Amazon Simple Storage Service (Amazon S3) with multiple AWS analytics services integrating with them. We realized that your use cases need more flexibility in data governance.

Data Lake

Data Lake Metadata Data Governance Statistics

Top analytics announcements of AWS re:Invent 2024

AWS Big Data

FEBRUARY 26, 2025

Analytics remained one of the key focus areas this year, with significant updates and innovations aimed at helping businesses harness their data more efficiently and accelerate insights. From enhancing data lakes to empowering AI-driven analytics, AWS unveiled new tools and services that are set to shape the future of data and analytics.

Analytics

Analytics Data Lake Metadata Data Warehouse

Demystify data sharing and collaboration patterns on AWS: Choosing the right tool for the job

AWS Big Data

OCTOBER 21, 2024

However, enterprises often encounter challenges with data silos, insufficient access controls, poor governance, and quality issues. Embracing data as a product is the key to address these challenges and foster a data-driven culture.

Sales

Sales Data-driven Data Processing Key Performance Indicator

What is a data architect? Skills, salaries, and how to become a data framework master

CIO Business Intelligence

OCTOBER 13, 2023

Data architect role: Data architects are senior visionaries who translate business requirements into technology requirements and define data standards and principles, often in support of data or digital transformations. In some ways, the data architect is an advanced data engineer.

Data Architecture

Data Architecture Data Warehouse Statistics Visualization

How Novanta’s CIO mobilized its data-driven transformation

CIO Business Intelligence

MAY 10, 2023

So we have a visualization layer where we teach different groups within our organization to learn. It’s evolved over the past four years from having nothing and siloed data sets of spreadsheets and everyone doing their own thing, to being centralized based on KPIs and the trust in what they receive from the data.

Data-driven

Data-driven IT Digital Transformation Data Governance

6 BI challenges IT teams must address

CIO Business Intelligence

DECEMBER 21, 2022

BI software helps companies do just that by shepherding the right data into analytical reports and visualizations so that users can make informed decisions. To gain employee buy-in, Stout’s team builds BI dashboards to show them how they can easily connect to and interact with their data, as well as visualize it in a meaningful way.

IT Business Intelligence Sales Key Performance Indicator

Use AWS Glue to streamline SFTP data processing

AWS Big Data

AUGUST 13, 2024

With AWS Glue, you can discover and connect to hundreds of diverse data sources and manage your data in a centralized data catalog. It enables you to visually create, run, and monitor extract, transform, and load (ETL) pipelines to load data into your data lakes. Select Visual ETL in the central pane.

Data Processing

Data Processing Visualization Data Lake Data Processing

Why the Data Journey Manifesto?

DataKitchen

JUNE 12, 2023

We had been talking about “Agile Analytic Operations,” “DevOps for Data Teams,” and “Lean Manufacturing For Data,” but the concept was hard to get across and communicate. I spent much time de-categorizing DataOps: we are not discussing ETL, Data Lake, or Data Science.

Testing

Testing Dashboards Data Science Data Lake

Create an end-to-end data strategy for Customer 360 on AWS

AWS Big Data

MARCH 26, 2024

In this post, we discuss how you can use purpose-built AWS services to create an end-to-end data strategy for C360 to unify and govern customer data that address these challenges. Strategize based on how your teams explore data, run analyses, wrangle data for downstream requirements, and visualize data at different levels.

Data Strategy

Data Strategy Strategy Data Warehouse Prescriptive Analytics

Data Governance for Dummies: Your Questions, Answered

Alation

FEBRUARY 17, 2023

This past week, I had the pleasure of hosting Data Governance for Dummies author Jonathan Reichental for a fireside chat, along with Denise Swanson, Data Governance lead at Alation. Can you have proper data management without establishing a formal data governance program?

Data Governance

Data Governance Data Quality Metadata Cost-Benefit

How gaming companies can use Amazon Redshift Serverless to build scalable analytical applications faster and easier

AWS Big Data

MARCH 7, 2023

A data hub is a center of data exchange that constitutes a hub of data repositories and is supported by data engineering, data governance, security, and monitoring services. A data hub contains data at multiple levels of granularity and is often not integrated.

Analytics

Analytics Data Warehouse Data Lake Metadata

How smava makes loans transparent and affordable using Amazon Redshift Serverless

AWS Big Data

DECEMBER 21, 2023

To bring their customers the best deals and user experience, smava follows the modern data architecture principles with a data lake as a scalable, durable data store and purpose-built data stores for analytical processing and data consumption. This is the Data Mart stage.

Data Lake

Data Lake Data Warehouse Data-driven B2B

CIO Ryan Snyder on the benefits of interpreting data as a layer cake

CIO Business Intelligence

AUGUST 2, 2023

The layer cake metaphor shifts the data discussion from an IT discussion to the intersection of business strategy and technology. So it’s about how we create layers from the business concept, like advancing discovery, all the way down to a technology solution, like a visualization tool. Does the data live in one or many clouds?

Manufacturing

Manufacturing Data Architecture Data Strategy Strategy

8 tips for unleashing the power of unstructured data

CIO Business Intelligence

NOVEMBER 28, 2023

With each game release and update, the amount of unstructured data being processed grows exponentially, Konoval says. “This volume of data poses serious challenges in terms of storage and efficient processing,” he says. To address this problem RetroStyle Games invested in data lakes. Ensure value with visualizations.

Unstructured Data

Unstructured Data Data-driven Visualization Data Quality

Unlock scalable analytics with a secure connectivity pattern in AWS Glue to read from or write to Snowflake

AWS Big Data

AUGUST 19, 2024

This allows business analysts and decision-makers to gain valuable insights, visualize key metrics, and explore the data in depth, enabling informed decision-making and strategic planning for pricing and promotional strategies. On the Visual tab, choose Add nodes. Choose Data source – Snowflake in the AWS Glue Studio canvas.

Analytics

Analytics Data-driven Data Integration Data Lake

How Getir unleashed data democratization using a data mesh architecture with Amazon Redshift

AWS Big Data

OCTOBER 23, 2024

Amazon Redshift enables data warehousing by seamlessly integrating with other data stores and services in the modern data organization through features such as Zero-ETL, data sharing, streaming ingestion, data lake integration, and Redshift ML.

Data Warehouse

Data Warehouse Cost-Benefit Data Lake Data-driven

Augmented data management: Data fabric versus data mesh

IBM Big Data Hub

APRIL 27, 2022

The data fabric architectural approach can simplify data access in an organization and facilitate self-service data consumption at scale. Read: The first capability of a data fabric is a semantic knowledge data catalog, but what are the other 5 core capabilities of a data fabric?

Management

Management Metadata Data Architecture Data Lake

Building Robust Data Pipelines: 9 Fundamentals and Best Practices to Follow

Alation

MAY 16, 2023

Data Pipeline Use Cases: Here are just a few examples of the goals you can achieve with a robust data pipeline. Data Prep for Visualization: Data pipelines can facilitate easier data visualization by gathering and transforming the necessary data into a usable state.

Data Lake

Data Lake Data Governance Data Warehouse Data Processing

Improve healthcare services through patient 360: A zero-ETL approach to enable near real-time data analytics

AWS Big Data

MARCH 27, 2024

The solution uses AWS services such as AWS HealthLake, Amazon Redshift, Amazon Kinesis Data Streams, and AWS Lake Formation to build a 360 view of patients. This means you no longer have to create an external schema in Amazon Redshift to use the data lake tables cataloged in the Data Catalog.

Data Analytics

Data Analytics Analytics Data Warehouse Data Lake

Modern Data Architecture for Telecommunications

Cloudera

SEPTEMBER 6, 2022

Previously, there were three types of data structures in telco, including entity data sets (i.e., marketing data lakes) and application-based datasets (i.e., billing or contact center support systems). There are three major architectures under the modern data architecture umbrella.

Data Architecture

Data Architecture Cost-Benefit Digital Transformation Business Driver

Foundational blocks of Amazon SageMaker Unified Studio: An admin’s guide to implement unified access to all your data, analytics, and AI

AWS Big Data

FEBRUARY 13, 2025

This enables the user to create a data lake environment with AWS Glue database and Athena workgroup to query the data. Because SageMaker Unified Studio is in preview mode, the naming conventions of some visual elements might appear different in the current version.

Data Analytics

Data Analytics Analytics Modeling Data-driven

How Fujitsu implemented a global data mesh architecture and democratized data

AWS Big Data

MAY 1, 2024

In addition, the foundation role monitors the state of the metadata, data quality indicators, data permissions, information classification labels, and so on. It is crucial in data governance and data management. BI and visualization: Individual consumers have a dedicated space in a BI tool.

Dashboards

Dashboards Publishing Data-driven Cost-Benefit

How data stores and governance impact your AI initiatives

IBM Big Data Hub

OCTOBER 12, 2023

Accounting for the complexities of the AI lifecycle: Unfortunately, typical data storage and data governance tools fall short in the AI arena when it comes to helping an organization perform the tasks that underline efficient and responsible AI lifecycle management. And that makes sense.

Cost-Benefit

Cost-Benefit Metadata Data Governance Optimization

Themes and Conferences per Pacoid, Episode 8

Domino Data Lab

APRIL 3, 2019

Paco Nathan’s latest column dives into data governance. This month’s article features updates from one of the early data conferences of the year, Strata Data Conference – which was held just last week in San Francisco. In particular, here’s my Strata SF talk “Overview of Data Governance” presented in article form.

Machine Learning

Machine Learning Data Governance Metadata Data Science

The Audience for Data Catalogs and Data Intelligence

Alation

JUNE 21, 2022

After that came data governance, privacy, and compliance staff. Power business users and other non-purely-analytic data citizens came after that. Analysts didn’t just want to catalog data sources, they wanted to include dashboards, reports, and visualizations. Data engineers want to catalog data pipelines.

Metadata

Metadata Data Quality Visualization Data Lake

The First Pillar of Data Culture: Data Search & Discovery

Alation

JUNE 9, 2021

In this four-part blog series on data culture, we’re exploring what a data culture is and the benefits of building one, and then drilling down to explore each of the three pillars of data culture – data search & discovery, data literacy, and data governance – in more depth.

Data Governance

Data Governance Data-driven Machine Learning Data Lake

Alation Cloud Service: Data Intelligence Just Got Simpler

Alation

APRIL 7, 2021

But only a data catalog built as a platform can empower people to find, understand, and govern data, and support emerging data intelligence use cases. Alation possesses three unique capabilities: intelligence, active data governance, and broad, deep connectivity. Active Data Governance.

Data Governance

Data Governance Enterprise Metadata Cost-Benefit

The Cloud Connection: How Governance Supports Security

Alation

APRIL 14, 2022

Semantics, context, and how data is tracked and used mean even more as you stretch to reach post-migration goals. This is why, when data moves, it’s imperative for organizations to prioritize data discovery. Data discovery is also critical for data governance, which, when ineffective, can actually hinder organizational growth.

Metadata

Metadata Data Governance Data-driven Modeling

What Do You Actually Need from a Data Catalog Tool?

Alation

SEPTEMBER 23, 2021

A data catalog is a centralized storage bank of metadata on information sources from across the enterprise, such as datasets, visualizations, and conversations. The data catalog also stores metadata (data about data, like a conversation), which gives users context on how to use each asset.

Enterprise

Enterprise Metadata Data Governance Visualization

Themes and Conferences per Pacoid, Episode 12

Domino Data Lab

AUGUST 8, 2019

In this episode I’ll cover themes from Sci Foo and important takeaways that data science teams should be tracking. First and foremost: there’s substantial overlap between what the scientific community is working toward for scholarly infrastructure and some of the current needs of data governance in industry.

Data Science

Data Science Machine Learning Data Governance Statistics

5 Reasons to Use Apache Iceberg on Cloudera Data Platform (CDP)

Cloudera

MARCH 23, 2022

For example, a Jupyter notebook in CML can use a Spark or Python framework to directly access an Iceberg table to build a forecast model, while new data is ingested via NiFi flows, and a SQL analyst monitors revenue targets using Data Visualization. 2: Open formats. Flexible and open file formats. Open the door to new use-cases.

Metadata

Metadata Data Architecture Machine Learning Cost-Benefit
