The landscape of big data management has been transformed by the rising popularity of open table formats such as Apache Iceberg, Apache Hudi, and Linux Foundation Delta Lake. These formats, designed to address the limitations of traditional data storage systems, have become essential in modern data architectures.
This week on the keynote stages at AWS re:Invent 2024, you heard Matt Garman, CEO of AWS, and Swami Sivasubramanian, VP of AI and Data at AWS, speak about the next generation of Amazon SageMaker, the center for all of your data, analytics, and AI. The relationship between analytics and AI is rapidly evolving.
Customers often want to augment and enrich SAP source data with other non-SAP source data. Such analytic use cases can be enabled by building a data warehouse or data lake. Customers can now use the AWS Glue SAP OData connector to extract data from SAP.
Managing metadata across tools and teams is a growing challenge for organizations building modern data and AI platforms. As data volumes grow and generative AI becomes more central to business strategy, teams need a consistent way to define, discover, and govern their datasets, features, and models.
Data lakes were originally designed to store large volumes of raw, unstructured, or semi-structured data at a low cost, primarily serving big data and analytics use cases. By using features like Iceberg's compaction, OTFs streamline maintenance, making it straightforward to manage object and metadata versioning at scale.
Amazon DataZone now supports authentication through the Amazon Athena JDBC driver, allowing data users to seamlessly query their subscribed data lake assets via popular business intelligence (BI) and analytics tools like Tableau, Power BI, Excel, SQL Workbench, DBeaver, and more.
Organizations are building data-driven applications to guide business decisions, improve agility, and drive innovation. Many of these applications are complex to build because they require collaboration across teams and the integration of data, tools, and services.
In a recent blog, titled Collaboration and Crowdsourcing with Data Cataloging, I discussed the importance of participation by all data stakeholders as a key to getting maximum value from your data catalog. Understanding the Adoption Challenges. Figure 1 – Data Catalog Implementation.
The data mesh design pattern breaks giant, monolithic enterprise data architectures into subsystems or domains, each managed by a dedicated team. DataOps helps the data mesh deliver greater business agility by enabling decentralized domains to work in concert. But first, let’s define the data mesh design pattern.
In enterprises, we’ve seen everything from wholesale adoption to policies that severely restrict or even forbid the use of generative AI. Our survey focused on how companies use generative AI, what bottlenecks they see in adoption, and what skills gaps need to be addressed. Many AI adopters are still in the early stages.
We’ve read many predictions for 2023 in the data field: they cover excellent topics like data mesh, observability, governance, lakehouses, LLMs, etc. What will the world of data tools be like at the end of 2025? Driving new opportunities and expansion takes a back seat. What will exist at the end of 2025?
Many data catalog initiatives fail. How can prospective buyers ensure they partner with the right catalog to drive success? According to the latest report from Eckerson Group, Deep Dive on Data Catalogs, shoppers must match the goals of their organizations to the capabilities of their chosen catalog.
How do you initiate change within a system containing many thousands of people and millions of bytes of data? During my time as a data specialist at American Family Insurance, it became clear that we had to move away from the way things had been done in the past. So you can probably imagine: The company manages a lot of data.
Amazon DataZone enables customers to discover, access, share, and govern data at scale across organizational boundaries, reducing the undifferentiated heavy lifting of making data and analytics tools accessible to everyone in the organization. This is challenging because access to data is managed differently by each of the tools.
Today, AI presents an enormous opportunity to turn data into insights and actions, to amplify human capabilities, decrease risk, and increase ROI by achieving breakthrough innovations. While the promise of AI isn’t guaranteed and doesn’t always come easy, adoption is no longer a choice. (IBM Global AI Adoption Index 2022.)
The impact on the data side of the ecosystem is that massive amounts of data are being generated, and much of what passes for measurement in "social media tools" is profoundly suboptimal (I'm being polite). We have IT-minded people engaging in massive data puking (one report with 30 metrics, anyone?).
Much of his work focuses on democratising data and breaking down data silos to drive better business outcomes. In this blog, Chris shows how Snowflake and Alation together accelerate data culture. He shows how Texas Mutual Insurance Company has embraced data governance to build trust in data.
The goal of DataOps is to create predictable delivery and change management of data and all data-related artifacts. DataOps practices help organizations overcome challenges caused by fragmented teams and processes and delays in delivering data in consumable forms. So how does data governance relate to DataOps?
Why Implement a Data Catalog? Nowadays, businesses have more data than they know what to do with. Cutting-edge enterprises use their data to glean insights, make decisions, and drive value. In other words, they have a system in place for a data-driven strategy. Data Headache.
Another researcher noted 70% of companies are in exploration mode in terms of Generative AI adoption while only 19% are in pilot or production. To this end, several CEOs stressed the need for a widespread reskilling of their workforces to drive usage and see productivity gains. This year is about how we get AI to scale.
Over the past decade, the successful deployment of large-scale data platforms at our customers has acted as a big data flywheel, driving demand to bring in even more data, apply more sophisticated analytics, and onboard many new data practitioners, from business analysts to data scientists. Key Design Goals
This recognition underscores Cloudera’s commitment to continuous customer innovation and validates our ability to foresee future data and AI trends, and our strategy in shaping the future of data management. Cloudera, a leader in big data analytics, provides a unified Data Platform for data management, AI, and analytics.
In recent years, driven by the commoditization of data storage and processing solutions, the industry has seen a growing number of systematic investment management firms switch to alternative data sources to drive their investment decisions. Each team is the sole owner of its AWS account.
This leading software investment firm has partnered with the who’s-who of data-centric companies, including Qlik and Starburst, to help them drive sustainable, long-term growth. We had not seen that in the broader intelligence & data governance market.” And data governance is critical to driving adoption.”
Data governance tools used to occupy a niche in an organization’s tech stack, but those days are gone. The rise of data-driven business and the complexities that come with it ushered in a soft mandate for data governance and data governance tools. It is also used to make data more easily understood and secure.
On September 24, 2019, Cloudera launched CDP Public Cloud (CDP-PC) as the first step in delivering the industry’s first Enterprise Data Cloud. CDP Machine Learning: a Kubernetes-based service that allows data scientists to deploy collaborative workspaces with secure, self-service access to enterprise data. That Was Then.
Today, AI presents an enormous opportunity to turn data into insights and actions, to help amplify human capabilities, decrease risk, and increase ROI by achieving breakthrough innovations. While the promise of AI isn’t guaranteed and may not come easy, adoption is no longer a choice. So what is stopping AI adoption today?
According to analysts, data governance programs have not shown a high success rate. According to CIOs , historical data governance programs were invasive and suffered from one of two defects: They were either forced on the rank and file — who grew to dislike IT as a result. The Risks of Early Data Governance Programs.
Like most of our customers, Cloudera’s internal operations rely heavily on data. For more than a decade, Cloudera has built internal tools and data analysis primarily on a single production CDH cluster. In this blog, we discuss our journey to CDP for this critical cluster. Our Internal Environment Before Upgrade.
The financial services industry has been in the process of modernizing its data governance for more than a decade. The answer is data lineage. We’ve compiled six key reasons why financial organizations are turning to lineage platforms like MANTA to get control of their data. Data lineage helps during these investigations.
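Investigations like these come down to walking a dependency graph from a target table back to its sources. As a toy, tool-agnostic sketch (the table names and edges below are hypothetical, not from MANTA or any real lineage platform):

```python
# Hypothetical lineage edges: each derived table lists the upstream
# tables it is built from.
LINEAGE = {
    "report_revenue": ["fact_orders", "dim_customer"],
    "fact_orders": ["raw_orders"],
    "dim_customer": ["raw_customers"],
}

def upstream_sources(table, lineage=LINEAGE):
    """Return every table that transitively feeds the given table."""
    seen, stack = set(), [table]
    while stack:
        for parent in lineage.get(stack.pop(), []):
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen

print(sorted(upstream_sources("report_revenue")))
# ['dim_customer', 'fact_orders', 'raw_customers', 'raw_orders']
```

Production lineage tools track this at column level and across hundreds of systems, but the traversal idea is the same.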
The dependence on remote internet access for business, personal, and educational use elevated the data demand and boosted global data consumption. Additionally, the increase in online transactions and web traffic generated mountains of data. Enter the modernization of data warehousing solutions.
It’s been one year since we’ve started publishing the Alation State of Data Culture report, and uncertainty still remains the only sure thing. Yet, through it all, organizations that rely on, and invest in, building a data culture have consistently outperformed those who don’t. Ignore Data at Your Peril. It’s obvious.
Included in the post are recommendations for measurement and data analysis. While I'm using the term Store here, it encompasses sales (or leads or catalog requests) driven to a retail store or company call center, people driven to donate blood via online campaigns, or essentially any offline outcome driven by the online channel.
But information broadly, and the management of data specifically, is still “the” critical factor for situational awareness, streamlined operations, and a host of other use cases across today’s tech-driven battlefields. Universal Data Distribution Solves DoD Data Transport Challenges.
A core element of business today is the desire to become a data-driven organization. The key to data-driven success and maturity is data culture, and strong data culture begins with participation. A data catalog can be the catalyst that helps to break through the barrier with collaboration and crowdsourcing.
Can you tell us more about what Alation Analytics is and its connection to data culture? These include catalog adoption, governance, curation, and asset tracking. Monali: Well, the more people who use the catalog, the better it gets.
Hundreds of data sources. Hundreds (even thousands) of data consumers. To keep up with the rapid influx of data, the many disparate data environments, and the rise in self-service analytics users, enterprises need an enterprise data catalog to drive the business forward with data, and ensure compliant, accurate data use.
Recognizing the need to harness real-time data, businesses are increasingly turning to event-driven architecture (EDA) as a strategic approach to stay ahead of the curve. This trend grows stronger as organizations realize the benefits that come from the power of real-time data streaming.
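At its core, EDA decouples event producers from consumers through a broker: producers publish to a topic without knowing who is listening. A minimal in-process sketch of that pattern (real deployments would use a broker such as Apache Kafka; the topic and payload names here are illustrative):

```python
from collections import defaultdict

class EventBus:
    """Minimal in-process publish/subscribe bus."""
    def __init__(self):
        self._subscribers = defaultdict(list)

    def subscribe(self, topic, handler):
        # Consumers register a callback for a topic.
        self._subscribers[topic].append(handler)

    def publish(self, topic, event):
        # Producers emit events without knowing the consumers.
        for handler in self._subscribers[topic]:
            handler(event)

bus = EventBus()
received = []
bus.subscribe("orders.created", received.append)
bus.publish("orders.created", {"order_id": 42})
print(received)  # [{'order_id': 42}]
```

The design choice to match on topic names is what lets new consumers be added later without touching any producer code.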
Given our EA expertise, we thought we’d provide our perspective on the report’s key takeaways and how we see technology trends, business innovation and compliance driving companies to use EA in different ways. Improve Enterprise Architecture with EAMS. Delivery of innovation at speed is critical, but what does that really mean?
Several compute engines such as Impala, Hive, Spark, and Trino have supported querying data in Iceberg table format by adopting this Java Library provided by the Apache Iceberg project. The data files and metadata files in Iceberg format are immutable. However, Iceberg Java API calls are not always cheap.
Companies rely heavily on data and analytics to find and retain talent, drive engagement, improve productivity and more across enterprise talent management. However, analytics are only as good as the quality of the data, which must be error-free, trustworthy and transparent. What is data quality? million each year.
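"Error-free, trustworthy and transparent" usually translates into concrete checks on each record: completeness, validity, and uniqueness. A minimal sketch of such checks (the record schema and rules are hypothetical):

```python
def check_quality(records):
    """Run basic completeness/validity/uniqueness checks on records."""
    issues = []
    seen_ids = set()
    for i, rec in enumerate(records):
        if not rec.get("email"):
            issues.append((i, "missing email"))       # completeness
        if rec.get("tenure_years", 0) < 0:
            issues.append((i, "negative tenure"))     # validity
        if rec.get("id") in seen_ids:
            issues.append((i, "duplicate id"))        # uniqueness
        seen_ids.add(rec.get("id"))
    return issues

records = [
    {"id": 1, "email": "a@x.com", "tenure_years": 3},
    {"id": 1, "email": "", "tenure_years": -1},
]
print(check_quality(records))
# [(1, 'missing email'), (1, 'negative tenure'), (1, 'duplicate id')]
```

Real data-quality platforms express such rules declaratively and track pass rates over time, but each rule reduces to a predicate like these.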
How do you optimize an enterprise data architecture with private cloud and multiple public cloud options? As the inexorable drive to cloud continues, telecommunications service providers (CSPs) around the world – often laggards in adopting disruptive technologies – are embracing virtualization. The Surging Importance of Data.
Stage 2—Broader adoption Increased awareness across IT organizations leads to a transition to standardized methods of creating an event backbone that caters to both existing and new event-driven projects across multiple teams. The connector catalog contains an extensive list of key connectors supported by IBM and the community.
Today, we’re announcing that Alation has closed a $50 million Series C funding led by Sapphire Ventures, with participation from new investor Salesforce Ventures and our existing investors Costanoa Ventures, DCVC (Data Collective), Harmony Partners and Icon Ventures. And, the datacatalog market has had a year of incredible growth.