The landscape of big data management has been transformed by the rising popularity of open table formats such as Apache Iceberg, Apache Hudi, and Linux Foundation Delta Lake. These formats, designed to address the limitations of traditional data storage systems, have become essential in modern data architectures.
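A core idea shared by these table formats is a metadata layer of immutable snapshots over data files, which is what enables features like time travel. A minimal pure-Python sketch of that snapshot log (hypothetical names, not the real Iceberg, Hudi, or Delta Lake API) might look like:

```python
from dataclasses import dataclass, field

@dataclass
class Snapshot:
    snapshot_id: int
    data_files: tuple  # immutable set of file paths in this table version

@dataclass
class Table:
    # Append-only snapshot log: the heart of Iceberg/Hudi/Delta-style formats
    snapshots: list = field(default_factory=list)

    def commit(self, data_files):
        sid = len(self.snapshots) + 1
        self.snapshots.append(Snapshot(sid, tuple(data_files)))
        return sid

    def read(self, snapshot_id=None):
        # "Time travel": read any historical snapshot by id
        if not self.snapshots:
            return ()
        snap = self.snapshots[-1] if snapshot_id is None else self.snapshots[snapshot_id - 1]
        return snap.data_files

t = Table()
t.commit(["part-0001.parquet"])
t.commit(["part-0001.parquet", "part-0002.parquet"])
print(t.read())               # latest table state
print(t.read(snapshot_id=1))  # the table as of the first commit
```

Real formats add manifests, schema evolution, and atomic commits on object storage, but the snapshot log above is the structural idea they all build on.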
We live in a data-rich, insights-rich, and content-rich world. Data collections are the ones and zeroes that encode the actionable insights (patterns, trends, relationships) that we seek to extract from our data through machine learning and data science. Plus, AI can also help find key insights encoded in data.
Specifically, in the modern era of massive data collections and exploding content repositories, we can no longer rely on keyword search alone. Richer discovery is accomplished through tags, annotations, and metadata (TAM). Contextual TAM enhances a CMS with knowledge-driven search and retrieval, not just keyword-driven.
This yields precise results, dramatically improving the speed and accuracy of data discovery. In this post, we demonstrate how to streamline data discovery with precise technical identifier search in Amazon SageMaker Unified Studio.
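The contrast between keyword-driven and metadata-driven retrieval can be sketched in a few lines. This is a toy illustration with made-up documents and tags, not any real CMS API:

```python
# Hypothetical mini-index: each document carries tags/annotations/metadata (TAM)
docs = [
    {"id": 1, "text": "Q3 revenue rose 8%", "tags": {"finance", "quarterly-report"}},
    {"id": 2, "text": "New revenue model for ads", "tags": {"product", "strategy"}},
]

def keyword_search(query):
    # Raw text match: anything mentioning the word comes back
    return [d["id"] for d in docs if query.lower() in d["text"].lower()]

def tag_search(tag):
    # Knowledge-driven retrieval: match on curated metadata, not raw text
    return [d["id"] for d in docs if tag in d["tags"]]

print(keyword_search("revenue"))  # both documents mention the word
print(tag_search("finance"))      # metadata narrows to the relevant one
```

The keyword query returns every document that happens to contain the string, while the tag query returns only what a curator has labeled as relevant, which is the precision gain the snippet describes.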
A healthy data-driven culture minimizes knowledge debt while maximizing analytics productivity. Agile Data Governance is the process of creating and improving data assets by iteratively capturing knowledge as data producers and consumers work together so that everyone can benefit.
1) What Is Data Quality Management? 4) Data Quality Best Practices. 5) How Do You Measure Data Quality? 6) Data Quality Metrics Examples. 7) Data Quality Control: Use Case. 8) The Consequences Of Bad Data Quality. 9) 3 Sources Of Low-Quality Data. 10) Data Quality Solutions: Key Attributes.
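Two of the most common data quality metrics in outlines like this are completeness (is the value present?) and validity (does it pass a rule?). A minimal sketch, with invented records and rules:

```python
records = [
    {"email": "a@example.com", "age": 34},
    {"email": None, "age": 29},
    {"email": "b@example.com", "age": -5},  # fails the validity rule below
]

def completeness(rows, field):
    # Share of rows where the field is present (non-null)
    return sum(r[field] is not None for r in rows) / len(rows)

def validity(rows, field, rule):
    # Share of non-null values that pass a validation rule
    vals = [r[field] for r in rows if r[field] is not None]
    return sum(rule(v) for v in vals) / len(vals)

print(round(completeness(records, "email"), 2))  # 0.67
print(round(validity(records, "age", lambda a: a >= 0), 2))
```

Production tools compute these per column across millions of rows and trend them over time, but the arithmetic is exactly this.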
By eliminating time-consuming tasks such as data entry, document processing, and report generation, AI allows teams to focus on higher-value, strategic initiatives that fuel innovation. In 2017, Equifax suffered a data breach that exposed the personal data of nearly 150 million people.
What is Data Modeling? Data modeling is a process that enables organizations to discover, design, visualize, standardize and deploy high-quality data assets through an intuitive, graphical interface. Data models provide visualization, create additional metadata and standardize data design across the enterprise.
Metadata management is key to wringing all the value possible from data assets. However, most organizations don’t use all the data at their disposal to reach deeper conclusions about how to drive revenue, achieve regulatory compliance or accomplish other strategic objectives. What Is Metadata? Harvest data.
Data is the foundation of innovation, agility and competitive advantage in today's digital economy. As technology and business leaders, your strategic initiatives, from AI-powered decision-making to predictive insights and personalized experiences, are all fueled by data. Data quality is no longer a back-office concern.
In the era of big data, data lakes have emerged as a cornerstone for storing vast amounts of raw data in its native format. They support structured, semi-structured, and unstructured data, offering a flexible and scalable environment for data ingestion from multiple sources.
Untapped data, if mined, represents tremendous potential for your organization. While there has been a lot of talk about big data over the years, the real hero in unlocking the value of enterprise data is metadata , or the data about the data. Metadata Is the Heart of Data Intelligence.
What Is Metadata? Metadata is information about data. A clothing catalog or dictionary are both examples of metadata repositories. Indeed, a popular online catalog, like Amazon, offers rich metadata around products to guide shoppers: ratings, reviews, and product details are all examples of metadata.
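The catalog analogy can be made concrete: shoppers never scan the raw items; they query the metadata. A toy sketch with invented products:

```python
# Toy catalog: metadata (ratings, reviews, details) guides discovery of the items
catalog = [
    {"product": "headphones", "rating": 4.6, "reviews": 1200},
    {"product": "keyboard", "rating": 3.9, "reviews": 340},
]

def top_rated(items, min_rating=4.0):
    # Filtering on metadata fields, never on the products themselves
    return [i["product"] for i in items if i["rating"] >= min_rating]

print(top_rated(catalog))
```

A data catalog does the same for datasets: descriptions, owners, and quality scores are the metadata that make the underlying data findable.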
And yeah, the real-world relationships among the entities represented in the data had to be fudged a bit to fit in the counterintuitive model of tabular data, but, in trade, you get reliability and speed. Ironically, relational databases only imply relationships between data points by whatever row or column they exist in.
Miso’s cofounders, Lucky Gunasekara and Andy Hsieh, are veterans of the Small Data Lab at Cornell Tech, which is devoted to private AI approaches for immersive personalization and content-centric explorations. The platform required a more effective way to connect learners directly to the key information that they sought.
erwin released its State of Data Governance Report in February 2018, just a few months before the General Data Protection Regulation (GDPR) took effect.
Metadata is an important part of data governance, and as a result, most nascent data governance programs are rife with project plans for assessing and documenting metadata. But are these rampant and often uncontrolled projects to collect metadata properly motivated? What Is Metadata? Where is it now?
erwin has once again been positioned as a Leader in the Gartner “2020 Magic Quadrant for Metadata Management Solutions.”. The post erwin Positioned as a Leader in Gartner’s 2020 Magic Quadrant for Metadata Management Solutions for Second Year in a Row appeared first on erwin, Inc.
Third, any commitment to a disruptive technology (including data-intensive and AI implementations) must start with a business strategy. These changes may include requirements drift, data drift, model drift, or concept drift. I suggest that the simplest business strategy starts with answering three basic questions: What?
The way I explained it to my data science students years ago was like this. They realized that the search results would probably not answer my question; they would simply list websites that included my words on the page or in the metadata tags: “Texas”, “Cows”, “How”, etc. What is a semantic layer?
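A semantic layer resolves a question to a defined business concept rather than to keyword hits. A minimal sketch, with a hypothetical mapping of business terms to governed query definitions (the table and column names are invented):

```python
# Hypothetical semantic layer: business terms map to governed definitions,
# so a question resolves to meaning, not to pages that contain its words
semantic_layer = {
    "cattle population": "SELECT SUM(head_count) FROM livestock WHERE species = 'cattle'",
    "beef exports": "SELECT SUM(tons) FROM exports WHERE commodity = 'beef'",
}

def resolve(question):
    # Match the question against defined business concepts
    for term, query in semantic_layer.items():
        if term in question.lower():
            return query
    return None  # no governed concept covers this question

print(resolve("What is the cattle population of Texas?"))
```

Real semantic layers add synonym handling, joins, and access policies, but the core move is the same: one curated definition answers the question, instead of a ranked list of pages mentioning “Texas” and “cows”.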
If you’re serious about a data-driven strategy , you’re going to need a data catalog. Organizations need a data catalog because it enables them to create a seamless way for employees to access and consume data and business assets in an organized manner. Three Types of Metadata in a Data Catalog.
Enterprises must reimagine their data and document management to meet the increasing regulatory challenges emerging as part of the digitization era. Commonly, businesses face three major challenges with regard to data and data management: Data volumes. One particular challenge lies in managing “dark data” (i.e.,
We need to do more than automate model building with autoML; we need to automate tasks at every stage of the data pipeline. In a previous post , we talked about applications of machine learning (ML) to software development, which included a tour through sample tools in data science and for managing data infrastructure.
With metadata-driven automation, many DevOps processes can be automated, adding more “horsepower” to increase their speed and accuracy. Such automation can save close to 100 percent of the time usually spent on this type of documentation. Human errors are eliminated, leading to higher quality documentation and output.
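The documentation savings come from rendering docs directly from metadata instead of writing them by hand. A minimal sketch, assuming a hypothetical schema dictionary (not any particular tool's format):

```python
# Sketch: generate documentation straight from table metadata
schema = {
    "table": "customers",
    "columns": [
        {"name": "id", "type": "INT", "desc": "Primary key"},
        {"name": "email", "type": "VARCHAR", "desc": "Contact address"},
    ],
}

def render_doc(meta):
    # One pass over the metadata produces always-current documentation
    lines = [f"# Table: {meta['table']}"]
    for col in meta["columns"]:
        lines.append(f"- {col['name']} ({col['type']}): {col['desc']}")
    return "\n".join(lines)

print(render_doc(schema))
```

Because the doc is regenerated from the source of truth on every run, it cannot drift from the actual schema, which is where the error elimination comes from.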
I’m excited to share the results of our new study with Dataversity that examines how data governance attitudes and practices continue to evolve. Defining Data Governance: What Is Data Governance? The No. 1 reason to implement data governance. Constructing a Digital Transformation Strategy: How Data Drives Digital.
So if you’re going to move your data from on-premises legacy data stores and warehouse systems to the cloud, you should do it right the first time. And as you make this transition, you need to understand what data you have, know where it is located, and govern it along the way. Then you must bulk load the legacy data.
Teams need to urgently respond to everything from massive changes in workforce access and management to what-if planning for a variety of grim scenarios, in addition to building and documenting new applications and providing fast, accurate access to data for smart decision-making. Data Modeling. Data Governance.
Replace manual and recurring tasks for fast, reliable data lineage and overall data governance. It’s paramount that organizations understand the benefits of automating end-to-end data lineage. The importance of end-to-end data lineage is widely understood and ignoring it is risky business. Doing Data Lineage Right.
Why should you integrate data governance (DG) and enterprise architecture (EA)? Data governance provides time-sensitive, current-state architecture information with a high level of quality.
A data catalog serves the same purpose. By using metadata (or short descriptions), data catalogs help companies gather, organize, retrieve, and manage information. You can think of a data catalog as an enhanced Access database or library card catalog system. What Does a Data Catalog Do?
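The card-catalog comparison maps directly to code: a catalog is searchable descriptions keyed by dataset name. A toy sketch with invented datasets:

```python
# Minimal data catalog: short descriptions (metadata) make datasets findable
catalog = {
    "sales_2024": "Daily retail sales transactions by store and SKU",
    "hr_roster": "Current employee roster with department and role",
}

def find_datasets(keyword):
    # Search the descriptions, like flipping through a library card catalog
    return sorted(name for name, desc in catalog.items() if keyword.lower() in desc.lower())

print(find_datasets("sales"))
```

Commercial catalogs layer on lineage, ownership, and access requests, but search-over-descriptions is the function the analogy points at.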
Analytics remained one of the key focus areas this year, with significant updates and innovations aimed at helping businesses harness their data more efficiently and accelerate insights. From enhancing data lakes to empowering AI-driven analytics, AWS unveiled new tools and services that are set to shape the future of data and analytics.
Amazon DataZone has launched authentication support through the Amazon Athena JDBC driver, allowing data users to seamlessly query their subscribed data lake assets via popular business intelligence (BI) and analytics tools like Tableau, Power BI, Excel, SQL Workbench, DBeaver, and more.
Understanding the data governance trends for the year ahead will give business leaders and data professionals a competitive edge … Happy New Year! Regulatory compliance and data breaches have driven the data governance narrative during the past few years.
When the pandemic first hit, there was some negative impact on big data and analytics spending. Digital transformation was accelerated, and budgets for spending on big data and analytics increased. But data without intelligence is just data, and this is WHY data intelligence is required. Now is the time.
In March 2024, we announced the general availability of the generative artificial intelligence (AI) generated data descriptions in Amazon DataZone. In this post, we share what we heard from our customers that led us to add the AI-generated data descriptions and discuss specific customer use cases addressed by this capability.
Data lineage is the journey data takes from its creation through its transformations over time. Tracing the source of data is an arduous task. With all these diverse data sources, and if systems are integrated, it is difficult to understand the complicated data web they form much less get a simple visual flow.
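Untangling that web is, at bottom, a graph traversal: each dataset points to its direct upstream sources, and tracing origins means walking those edges. A minimal sketch with an invented lineage graph:

```python
# Hypothetical lineage graph: each dataset maps to its direct upstream sources
lineage = {
    "revenue_dashboard": ["monthly_rollup"],
    "monthly_rollup": ["raw_orders", "fx_rates"],
    "raw_orders": [],
    "fx_rates": [],
}

def trace_sources(dataset):
    # Walk upstream edges to find every raw origin of a dataset
    sources, stack = set(), [dataset]
    while stack:
        node = stack.pop()
        parents = lineage.get(node, [])
        if not parents and node != dataset:
            sources.add(node)  # no parents: this is an origin
        stack.extend(parents)
    return sorted(sources)

print(trace_sources("revenue_dashboard"))
```

End-to-end lineage tools build this graph automatically from ETL code and query logs; the hard part is harvesting the edges, not walking them.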
Not Documenting End-to-End Data Lineage Is Risky Business – Understanding your data’s origins is key to successful data governance. Not everyone understands what end-to-end data lineage is or why it is important. Data Lineage Tells an Important Origin Story. Who are the data owners?
The International Enterprise Architecture Institute (IEAI) defines enterprise architecture (EA) as “the analysis and documentation of an enterprise in its current and future states from an integrated strategy, business and technology perspective.”. In the era of data-driven business, such perspective is critical.
While some enterprises are already reporting AI-driven growth, the complexities of data strategy are proving a big stumbling block for many other businesses. So, what can businesses do to maximize the value of their data, and ensure their genAI projects are delivering return on investment?
In this blog post, we will highlight how ZS Associates used multiple AWS services to build a highly scalable, highly performant, clinical document search platform. We use leading-edge analytics, data, and science to help clients make intelligent decisions. The document processing layer supports document ingestion and orchestration.
As organizations deal with managing ever more data, the need to automate data management becomes clear. Last week erwin issued its 2020 State of Data Governance and Automation (DGA) Report. One piece of the research that stuck with me is that 70% of respondents spend 10 or more hours per week on data-related activities.
With this new instance family, OpenSearch Service uses OpenSearch innovation and AWS technologies to reimagine how data is indexed and stored in the cloud. Today, customers widely use OpenSearch Service for operational analytics because of its ability to ingest high volumes of data while also providing rich and interactive analytics.
Once you’ve determined what part(s) of your business you’ll be innovating — the next step in a digital transformation strategy is using data to get there. Constructing A Digital Transformation Strategy: Data Enablement. Many organizations prioritize data collection as part of their digital transformation strategy.