A high hurdle many enterprises have yet to overcome is accessing mainframe data via the cloud. These tools don't have the necessary connectors, metadata relationships, or lineage mapping that spans both mainframe and cloud environments. The result is a lack of visibility into metadata lineage across mainframe and cloud data.
In a previous post, we talked about applications of machine learning (ML) to software development, which included a tour through sample tools in data science and for managing data infrastructure. Humans are still needed to write software, but that software is of a different type. Developers of Software 1.0
Managing the lifecycle of AI data, from ingestion to processing to storage, requires sophisticated data management solutions that can handle the complexity and volume of unstructured data. As customers entrust us with their data, we see even more opportunities ahead to help them operationalize AI and high-performance workloads.
We also examine how centralized, hybrid and decentralized data architectures support scalable, trustworthy ecosystems. As data-centric AI, automated metadata management and privacy-aware data sharing mature, the opportunity to embed data quality into the enterprise's core has never been more significant.
For producers seeking collaboration with partners, AWS Clean Rooms facilitates secure collaboration and analysis of collective datasets without the need to share or duplicate underlying data. It provides a data catalog, automated crawlers, and visual job creation to streamline data integration across various data sources and targets.
Today’s data modeling is not your father’s data modeling software. While it’s always been the best way to understand complex data sources and automate design standards and integrity rules, the role of data modeling continues to expand as the fulcrum of collaboration between data generators, stewards and consumers.
Developers will find themselves increasingly building software that has ML elements. Thus, many developers will need to curate data, train models, and analyze the results of models. With that said, we are still in a highly empirical era for ML: we need big data, big models, big compute, and managed services in the cloud.
In this post, we discuss how the reimagined data flow works with OR1 instances and how it can provide high indexing throughput and durability using a new physical replication protocol. We also dive deep into some of the challenges we solved to maintain correctness and data integrity.
These tools range from enterprise service bus (ESB) products and data integration tools to extract, transform and load (ETL) tools, procedural code, application programming interfaces (APIs), file transfer protocol (FTP) processes, and even business intelligence (BI) reports that further aggregate and transform data.
And if it isn't changing, it's likely not being used within our organizations, so why would we use stagnant data to facilitate our use of AI? The key is understanding not IF, but HOW, our data fluctuates, and data observability can help us do just that. Tackle AI data readiness and governance with erwin.
The program must introduce and support standardization of enterprise data. Programs must support proactive and reactive change management activities for reference data values and the structure/use of master data and metadata. Your data governance program needs to continually break down new silos.
Third, some services require you to set up and manage compute resources used for federated connectivity, and capabilities like connection testing and data preview aren't available in all services. To address these challenges, we launched Amazon SageMaker Lakehouse unified data connectivity.
Example 2: The Data Engineering Team Has Many Small, Valuable Files Where They Need Individual Source File Tracking
In a typical data processing workflow, tracking individual files as they progress through various stages—from file delivery to data ingestion—is crucial.
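As a minimal sketch of that kind of per-file lineage (assuming a PySpark ingestion job and hypothetical S3 paths), Spark's built-in input_file_name() can stamp every row with the exact source file it came from:

```python
from pyspark.sql import SparkSession
from pyspark.sql.functions import current_timestamp, input_file_name

spark = SparkSession.builder.appName("file-tracking").getOrCreate()

# Read every small file under the landing prefix (hypothetical path),
# tagging each row with the file it originated from.
df = (
    spark.read.option("header", "true")
    .csv("s3://landing-bucket/incoming/")          # assumed location
    .withColumn("source_file", input_file_name())  # per-row file lineage
    .withColumn("ingested_at", current_timestamp())
)

# Downstream stages can now audit or reprocess by source_file.
df.write.mode("append").parquet("s3://curated-bucket/tracked/")
```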
Data fabric and data mesh are also both related to logical data management, which is the approach of providing virtualized access to data across an enterprise without the requirement to first extract and load it into a central repository.
As I recently noted, the term “data intelligence” has been used by multiple providers across analytics and data for several years and is becoming more widespread as software providers respond to the need to provide enterprises with a holistic view of data production and consumption.
Data integrity constraints: Many databases don't allow for strange or unrealistic combinations of input variables, and this could potentially thwart watermarking attacks. Applying data integrity constraints on live, incoming data streams could have the same benefits. Disparate impact analysis: see section 1.
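As a minimal sketch of what such constraints could look like on an incoming stream (field names and rules here are hypothetical, not taken from any particular database):

```python
# Hypothetical integrity constraints applied to a record before it
# reaches a model; impossible combinations are rejected up front.
RULES = [
    ("age_vs_tenure", lambda r: r["age"] >= 18 + r["years_employed"]),
    ("income_bounds", lambda r: 0 <= r["income"] < 10_000_000),
]

def violations(record: dict) -> list[str]:
    """Return the names of every constraint the record breaks."""
    return [name for name, ok in RULES if not ok(record)]

record = {"age": 21, "years_employed": 9, "income": 50_000}
broken = violations(record)
if broken:
    # Quarantine rather than score a physically impossible input.
    print(f"rejected: {broken}")
```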
For this, Cargotec built an Amazon Simple Storage Service (Amazon S3) data lake and cataloged the data assets in AWS Glue Data Catalog. They chose AWS Glue as their preferred data integration tool due to its serverless nature, low maintenance, and ability to control compute resources in advance and scale when needed.
Collaborate and build faster using familiar AWS tools for model development, generative AI, data processing, and SQL analytics with Amazon Q Developer, the most capable generative AI assistant for software development, helping you along the way. Having confidence in your data is key.
IT teams need to capture metadata to know where their data comes from, allowing them to map out its lineage and flow. And since data does not exist in a vacuum, it’s critical not to treat data sets as lump sums. Often organizations struggle with data replication, synchronization, and performance.
2024 Gartner Market Guide To DataOps
We at DataKitchen are thrilled to see the publication of the Gartner Market Guide to DataOps, a milestone in the evolution of this critical software category. At DataKitchen, we think of this as a ‘meta-orchestration’ of the code and tools acting upon the data.
This first article emphasizes data as the ‘foundation-stone’ of AI-based initiatives. Establishing a Data Foundation. The shift away from ‘Software 1.0’, where applications have been based on hard-coded rules, has begun, and the ‘Software 2.0’ era is upon us. Addressing the Challenge.
Prashant Parikh, erwin’s Senior Vice President of Software Engineering, talks about erwin’s vision to automate every aspect of the data governance journey to increase speed to insights. Data Cataloging: Catalog and sync metadata with data management and governance artifacts according to business requirements in real time.
The role of data modeling (DM) has expanded to support enterprise data management, including data governance and intelligence efforts. Metadata management is the key to managing and governing your data and drawing intelligence from it. Types of Data Models: Conceptual, Logical and Physical.
KGs bring the Semantic Web paradigm to the enterprise, introducing semantic metadata that drives data management and content management to new levels of efficiency and breaks down silos so they can synergize with various forms of knowledge management. The RDF data model and the other standards in W3C’s Semantic Web stack (e.g.,
Apache Iceberg offers integrations with popular data processing frameworks such as Apache Spark, Apache Flink, Apache Hive, Presto, and more. AWS provides integrations for various AWS services with Iceberg tables as well, including AWS Glue Data Catalog for tracking table metadata.
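As a rough sketch of one such integration, a Spark session can be pointed at an Iceberg catalog backed by the AWS Glue Data Catalog; the config keys follow Iceberg's documented AWS integration, while the catalog name, warehouse bucket, and table are assumptions:

```python
from pyspark.sql import SparkSession

# Spark session with an Iceberg catalog ("glue") whose table metadata
# lives in the AWS Glue Data Catalog; bucket and namespace are hypothetical.
spark = (
    SparkSession.builder.appName("iceberg-glue")
    .config("spark.sql.catalog.glue", "org.apache.iceberg.spark.SparkCatalog")
    .config("spark.sql.catalog.glue.catalog-impl", "org.apache.iceberg.aws.glue.GlueCatalog")
    .config("spark.sql.catalog.glue.warehouse", "s3://warehouse-bucket/iceberg/")
    .config("spark.sql.catalog.glue.io-impl", "org.apache.iceberg.aws.s3.S3FileIO")
    .getOrCreate()
)

spark.sql("CREATE NAMESPACE IF NOT EXISTS glue.analytics")
spark.sql("CREATE TABLE IF NOT EXISTS glue.analytics.events (id bigint, ts timestamp) USING iceberg")
spark.sql("INSERT INTO glue.analytics.events VALUES (1, current_timestamp())")
```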
After data has navigated the complexity of multiple systems and stages to reach its end-use case, the final product's value becomes the ultimate yardstick for measuring success. By diligently testing and monitoring data in use, you uphold data integrity and provide tangible value to end-users.
When evolving such a partition definition, the data in the table prior to the change is unaffected, as is its metadata. Only data that is written to the table after the evolution is partitioned with the new definition, and the metadata for this new set of data is kept separately. Old snapshots can then be cleaned up through Iceberg's Spark actions API, for example expiring everything older than seven days:

```java
// expireOlderThan() takes an absolute epoch timestamp in milliseconds,
// so subtract the retention window from the current time.
long cutoff = System.currentTimeMillis() - TimeUnit.DAYS.toMillis(7);
SparkActions.get().expireSnapshots(iceTable).expireOlderThan(cutoff).execute();
```
“SAP is executing on a roadmap that brings an important semantic layer to enterprise data, and creates the critical foundation for implementing AI-based use cases,” said analyst Robert Parker, SVP of industry, software, and services research at IDC. In the SuccessFactors application, Joule will behave like an HR assistant.
Data visualization is a concept that describes any effort to help people understand the significance of data by placing it in a visual context. Patterns, trends and correlations that may go unnoticed in text-based data can be more easily exposed and recognized with data visualization software.
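As a toy illustration with synthetic data: a linear trend that is easy to miss in a raw table is immediately visible in a scatter plot (the axis labels here are invented):

```python
import matplotlib.pyplot as plt
import numpy as np

# Synthetic data: a linear relationship buried under noise.
rng = np.random.default_rng(seed=0)
x = rng.uniform(0, 10, 200)
y = 2.5 * x + rng.normal(0, 3, 200)

plt.scatter(x, y, s=10, alpha=0.6)
plt.xlabel("marketing spend ($k)")  # hypothetical axis labels
plt.ylabel("weekly signups")
plt.title("A trend hidden in the raw numbers")
plt.show()
```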
Each of these components has its own purpose, which we will discuss in more detail while concentrating on data warehousing. A solid BI architecture framework consists of: collection of data, data integration, storage of data, data analysis, and distribution of data.
If your organization has any kind of data and analytics initiative, then chances are you have people – maybe even an entire department – dedicated to managing and integrating data for (and between) software applications to achieve some sort of business outcome. Is a Power-User or a Data Scientist an Information Steward?
All this data arrives by the terabyte, and a data management platform can help marketers make sense of it all. Many are focused on delivering the best returns for marketing teams, but some are more general tools that can handle any data science task.
‘Data Fabric’ has reached where ‘Cloud Computing’ and ‘Grid Computing’ once trod. Data Fabric hit the Gartner top ten in 2019. The purpose of weaving a Data Fabric is to remove the friction and cost from accessing and sharing data in the distributed ICT environment that is the norm.
Many AWS customers adopted Apache Hudi on their data lakes built on top of Amazon S3 using AWS Glue, a serverless data integration service that makes it easier to discover, prepare, move, and integrate data from multiple sources for analytics, machine learning (ML), and application development.
This is done by mining complex data using BI software and tools , comparing data to competitors and industry trends, and creating visualizations that communicate findings to others in the organization. Real-time problem-solving exercises using Excel or other BI tools. More on BI: What is business intelligence?
The construction of big data applications based on open source software has become increasingly straightforward since the advent of projects like Data on EKS, an open source project from AWS that provides blueprints for building data and machine learning (ML) applications on Amazon Elastic Kubernetes Service (Amazon EKS).
However, enterprise data generated from siloed sources, combined with the lack of a data integration strategy, creates challenges for provisioning the data for generative AI applications.
Data discoverability
Unlike structured data, which is managed in well-defined rows and columns, unstructured data is stored as objects.
Following the best practices section of the OpenSearch Service Developer Guide, AVB selected an optimal cluster configuration with three dedicated cluster manager nodes and six data nodes, across three Availability Zones , while keeping shard size between 10–30 GiB. The following figure outlines the solution.
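A sketch of that cluster shape using the boto3 OpenSearch client; the domain name, instance types, engine version, and volume size are assumptions, not values from the post:

```python
import boto3

client = boto3.client("opensearch")

# Three dedicated cluster manager nodes and six data nodes,
# spread across three Availability Zones, as described above.
client.create_domain(
    DomainName="avb-search",                        # hypothetical name
    EngineVersion="OpenSearch_2.11",
    ClusterConfig={
        "InstanceType": "r6g.xlarge.search",        # data nodes
        "InstanceCount": 6,
        "DedicatedMasterEnabled": True,
        "DedicatedMasterType": "m6g.large.search",  # cluster managers
        "DedicatedMasterCount": 3,
        "ZoneAwarenessEnabled": True,
        "ZoneAwarenessConfig": {"AvailabilityZoneCount": 3},
    },
    EBSOptions={"EBSEnabled": True, "VolumeType": "gp3", "VolumeSize": 512},
)
```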
You can slice data by different dimensions like job name, see anomalies, and share reports securely across your organization. With these insights, teams have the visibility to make data integration pipelines more efficient. An AWS Glue crawler scans data on the S3 bucket and populates table metadata in the AWS Glue Data Catalog.
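A minimal sketch of that crawler step with boto3; the crawler name, IAM role, database, and S3 path are hypothetical:

```python
import boto3

glue = boto3.client("glue")

# Crawl the metrics prefix and publish table metadata to the Data Catalog.
glue.create_crawler(
    Name="pipeline-metrics-crawler",
    Role="arn:aws:iam::123456789012:role/GlueCrawlerRole",  # placeholder role
    DatabaseName="pipeline_metrics",
    Targets={"S3Targets": [{"Path": "s3://metrics-bucket/glue-job-stats/"}]},
)
glue.start_crawler(Name="pipeline-metrics-crawler")
```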
Google acquires Looker – June 2019 (infrastructure/search/data broker vendor acquires analytics/BI). Salesforce closes acquisition of Mulesoft – May 2018 (business app vendor acquires data integration). Even the vast spend on software in D&A is centered on aspects of the two parts.
If you do a general internet search for data catalogs, all sorts of possibilities emerge. If you look closely, and ask a lot of questions, you will find that some of these products are not actually fully functional data catalogs at all. Some software products start out life solving a specific use case related to data, […]
AWS Glue, with its ability to process data using Apache Spark and connect to various data sources, is a suitable solution for addressing the challenges of accessing data across multiple cloud environments. Navigate to the AWS Marketplace page for the Azure Data Lake Storage Connector for AWS Glue.
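As a rough illustration of the underlying access pattern (this is direct ADLS Gen2 access through the hadoop-azure filesystem, not the Marketplace connector's own configuration; the storage account, container, and key are placeholders):

```python
from pyspark.sql import SparkSession

# Direct abfss:// access to ADLS Gen2; in practice, supply the key
# through a secrets manager rather than hard-coding it.
spark = (
    SparkSession.builder.appName("adls-read")
    .config(
        "spark.hadoop.fs.azure.account.key.mystorageacct.dfs.core.windows.net",
        "<storage-account-key>",
    )
    .getOrCreate()
)

df = spark.read.parquet(
    "abfss://mycontainer@mystorageacct.dfs.core.windows.net/sales/2024/"
)
df.show(5)
```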
Poor data management, data silos, and a lack of a common understanding across systems and/or teams are the root causes that prohibit an organization from scaling the business in a dynamic environment. As a result, organizations have spent untold money and time gathering and integrating data.
Aruba offers networking hardware such as access points, switches, and routers, along with software, security devices, and Internet of Things (IoT) products. AWS Transfer Family seamlessly integrates with other AWS services, automates transfers, and makes sure data is protected with encryption and access controls. 2 GB into the landing zone daily.