For container terminal operators, data-driven decision-making and efficient data sharing are vital to optimizing operations and boosting supply chain efficiency. Together, these capabilities enable terminal operators to enhance efficiency and competitiveness in an industry that is increasingly data-driven.
I recently saw an informal online survey that asked users which types of data (tabular, text, images, or “other”) are being used in their organization’s analytics applications. The results showed that (among those surveyed) approximately 90% of enterprise analytics applications are being built on tabular data.
Untapped data, if mined, represents tremendous potential for your organization. While there has been a lot of talk about big data over the years, the real hero in unlocking the value of enterprise data is metadata, or the data about the data. Metadata Is the Heart of Data Intelligence.
“We’re dealing with data day in and day out, but if it isn’t accurate then it’s all for nothing!” Steve needed a robust and automated metadata management solution as part of his organization’s data governance strategy. Enterprise data governance. Metadata in data governance.
Amazon Redshift, launched in 2013, has undergone significant evolution since its inception, allowing customers to expand the horizons of data warehousing and SQL analytics. Industry-leading price-performance – Amazon Redshift offers up to three times better price-performance than alternative cloud data warehouses.
Q: Is data modeling cool again? A: It always was and is getting cooler! In today’s fast-paced digital landscape, data reigns supreme. The data-driven enterprise relies on accurate, accessible, and actionable information to make strategic decisions and drive innovation.
We need to do more than automate model building with autoML; we need to automate tasks at every stage of the data pipeline. In a previous post , we talked about applications of machine learning (ML) to software development, which included a tour through sample tools in data science and for managing data infrastructure.
If you’re serious about a data-driven strategy , you’re going to need a data catalog. Organizations need a data catalog because it enables them to create a seamless way for employees to access and consume data and business assets in an organized manner. Three Types of Metadata in a Data Catalog.
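To make the idea concrete: a catalog entry for a single asset typically carries technical, business, and operational metadata. Below is a minimal Python sketch of one such entry; the class and field names are illustrative assumptions, not any particular product’s schema.

```python
from dataclasses import dataclass, field
from typing import Dict, List

@dataclass
class CatalogEntry:
    """One asset in a data catalog, carrying the three common metadata types."""
    # Technical metadata: how the data is physically stored (names are illustrative)
    table_name: str
    schema: Dict[str, str]            # e.g. {"order_id": "bigint"}
    location: str                     # e.g. "s3://example-bucket/orders/"
    # Business metadata: what the data means to the organization
    description: str = ""
    owner: str = ""
    glossary_terms: List[str] = field(default_factory=list)
    # Operational metadata: how the data is produced and used
    last_updated: str = ""
    row_count: int = 0
    upstream_sources: List[str] = field(default_factory=list)

entry = CatalogEntry(
    table_name="orders",
    schema={"order_id": "bigint", "amount": "decimal(10,2)"},
    location="s3://example-bucket/orders/",
    description="Customer orders, one row per order",
    owner="sales-analytics",
)
print(entry.table_name, entry.owner)
```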
Analytics remained one of the key focus areas this year, with significant updates and innovations aimed at helping businesses harness their data more efficiently and accelerate insights. From enhancing data lakes to empowering AI-driven analytics, AWS unveiled new tools and services that are set to shape the future of data and analytics.
Organizational data is often fragmented across multiple lines of business, leading to inconsistent and sometimes duplicate datasets. This fragmentation can delay decision-making and erode trust in available data. This solution enhances governance and simplifies access to unstructured data assets across the organization.
Not Documenting End-to-End Data Lineage Is Risky Business – Understanding your data’s origins is key to successful data governance. Not everyone understands what end-to-end data lineage is or why it is important. Data Lineage Tells an Important Origin Story. Who are the data owners?
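At its core, end-to-end lineage is a directed graph from raw sources to downstream assets, and a question like “where did this dashboard’s data come from?” is answered by walking that graph upstream. A minimal Python sketch with hypothetical dataset names:

```python
# Lineage as a directed graph: each dataset maps to its direct upstream sources.
lineage = {
    "revenue_dashboard": ["orders_mart"],
    "orders_mart": ["raw_orders", "raw_customers"],
    "raw_orders": [],
    "raw_customers": [],
}

def trace_origins(dataset: str) -> set:
    """Walk upstream edges to find every source a dataset depends on."""
    origins = set()
    stack = [dataset]
    while stack:
        current = stack.pop()
        for upstream in lineage.get(current, []):
            if upstream not in origins:
                origins.add(upstream)
                stack.append(upstream)
    return origins

print(trace_origins("revenue_dashboard"))
# {'orders_mart', 'raw_orders', 'raw_customers'}
```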
In March 2024, we announced the general availability of generative artificial intelligence (AI)-generated data descriptions in Amazon DataZone. In this post, we share what we heard from our customers that led us to add AI-generated data descriptions, and discuss specific customer use cases addressed by this capability.
The Semantic Web, both as a research field and a technology stack, is seeing mainstream industry interest, especially with the knowledge graph concept emerging as a pillar of well and efficiently managed data. And what are the commercial implications of semantic technologies for enterprise data? Source: tag.ontotext.com.
Manufacturers have long held a data-driven vision for the future of their industry. It’s one where near real-time data flows seamlessly between IT and operational technology (OT) systems. Legacy data management is holding back manufacturing transformation – Until now, however, this vision has remained out of reach.
We use leading-edge analytics, data, and science to help clients make intelligent decisions. AWS services such as Amazon Neptune and Amazon OpenSearch Service form part of their data and analytics pipelines, and AWS Batch is used for long-running data and machine learning (ML) processing tasks.
California Consumer Privacy Act (CCPA) compliance shares many of the same requirements in the European Unions’ General Data Protection Regulation (GDPR). Data governance , thankfully, provides a framework for compliance with either or both – in addition to other regulatory mandates your organization may be subject to.
Data quality is crucial in data pipelines because it directly impacts the validity of the business insights derived from the data. Today, many organizations use AWS Glue Data Quality to define and enforce data quality rules on their data at rest and in transit.
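In Glue Data Quality, rules are written in DQDL (Data Quality Definition Language) and evaluated against a table. A hedged sketch using boto3 to register a ruleset follows; the database, table, and ruleset names, and the thresholds, are illustrative assumptions.

```python
import boto3

glue = boto3.client("glue")

# DQDL (Data Quality Definition Language) ruleset; rules and thresholds
# here are illustrative, not recommendations.
ruleset = """
Rules = [
    IsComplete "order_id",
    IsUnique "order_id",
    ColumnValues "amount" > 0,
    Completeness "customer_id" > 0.95
]
"""

glue.create_data_quality_ruleset(
    Name="orders_quality_rules",                 # hypothetical ruleset name
    Ruleset=ruleset,
    TargetTable={"DatabaseName": "sales_db",     # hypothetical database
                 "TableName": "orders"},         # hypothetical table
)
```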
We live in a hybrid data world. In the past decade, the amount of structured data created, captured, copied, and consumed globally has grown from less than 1 ZB in 2011 to nearly 14 ZB in 2020. Impressive, but dwarfed by the amount of unstructured data, cloud data, and machine data – another 50 ZB.
Next, I will explain how knowledge graphs help them get a unified view of data derived from multiple sources and get richer insights in less time. This requires new tools and new systems, which results in diverse and siloed data. And each of these gains requires data integration across business lines and divisions.
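The unifying move is to express records from each silo as statements about shared entities, so one query spans all sources. A minimal sketch using the rdflib Python library (an assumption; any RDF store would do), with made-up entities and an example.org namespace:

```python
from rdflib import Graph, Literal, Namespace, RDF

EX = Namespace("http://example.org/")   # illustrative namespace
g = Graph()

# Facts about the same customer arriving from two siloed systems
g.add((EX.cust42, RDF.type, EX.Customer))
g.add((EX.cust42, EX.name, Literal("Acme GmbH")))   # from the CRM
g.add((EX.cust42, EX.hasOrder, EX.order7))          # from the order system
g.add((EX.order7, EX.amount, Literal(1200)))

# One SPARQL query now spans both sources
q = """
SELECT ?name ?amount WHERE {
    ?c a ex:Customer ; ex:name ?name ; ex:hasOrder ?o .
    ?o ex:amount ?amount .
}
"""
for row in g.query(q, initNs={"ex": EX}):
    print(row.name, row.amount)
```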
Amazon Redshift enables you to efficiently query and retrieve structured and semi-structured data from open-format files in an Amazon S3 data lake without having to load the data into Amazon Redshift tables. Amazon Redshift extends SQL capabilities to your data lake, enabling you to run analytical queries.
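In practice this works through an external schema mapped to a Glue Data Catalog database (Redshift Spectrum). A hedged sketch via the Redshift Data API; the cluster, database user, IAM role, and table names are assumptions.

```python
import boto3

rsd = boto3.client("redshift-data")

# External schema over a Glue Data Catalog database lets Redshift read
# open-format files in S3 without loading them; identifiers are illustrative.
create_schema = """
CREATE EXTERNAL SCHEMA IF NOT EXISTS lake
FROM DATA CATALOG DATABASE 'sales_db'
IAM_ROLE 'arn:aws:iam::123456789012:role/SpectrumRole';
"""

rsd.execute_statement(
    ClusterIdentifier="analytics-cluster",   # hypothetical cluster
    Database="dev",
    DbUser="admin",                          # temporary-credentials auth; an assumption
    Sql=create_schema,
)

# Analytical SQL then runs directly against the lake tables
rsd.execute_statement(
    ClusterIdentifier="analytics-cluster",
    Database="dev",
    DbUser="admin",
    Sql="SELECT region, SUM(amount) FROM lake.orders GROUP BY region;",
)
```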
Data lakes are centralized repositories that can store all structured and unstructured data at any desired scale. The power of the data lake lies in the fact that it often is a cost-effective way to store data.
What Makes a Data Fabric? ‘Data Fabric’ has reached where ‘Cloud Computing’ and ‘Grid Computing’ once trod. Data Fabric hit the Gartner top ten in 2019. This multiplicity of data leads to the growth of silos, which in turn increases the cost of integration. It is a buzzword.
This is a guest post co-written by Alex Naumov, Principal Data Architect at smava. smava believes in and takes advantage of data-driven decisions in order to become the market leader.
Organizations are grappling with the ever-expanding spectrum of data formats in today’s data-driven landscape. From Avro’s binary serialization to the efficient and compact structure of Protobuf, the landscape of data formats has expanded far beyond the traditional realms of CSV and JSON.
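To make the trade-off concrete: binary formats like Avro write the schema once and encode values compactly, while JSON repeats field names on every record. A small comparison sketch using the fastavro library (an assumption), with an illustrative schema:

```python
import io
import json
from fastavro import writer, reader

schema = {
    "name": "Order", "type": "record",
    "fields": [{"name": "order_id", "type": "long"},
               {"name": "amount", "type": "double"}],
}
records = [{"order_id": i, "amount": 9.99} for i in range(1000)]

# Avro: binary encoding, schema written once in the file header
buf = io.BytesIO()
writer(buf, schema, records)
avro_bytes = buf.getvalue()

# JSON: text encoding, field names repeated on every record
json_bytes = json.dumps(records).encode()

print(len(avro_bytes), "Avro bytes vs", len(json_bytes), "JSON bytes")

# Reading Avro back needs no external schema; the file is self-describing
buf.seek(0)
assert next(reader(buf))["order_id"] == 0
```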
The company uses AWS Cloud services to build data-driven products and scale engineering best practices. To ensure a sustainable data platform amid growth and profitability phases, their tech teams adopted a decentralized data mesh architecture. The solution Acast implemented is a data mesh, architected on AWS.
In fact, according to the Identity Theft Resource Center (ITRC) Annual Data Breach Report , there were 2,365 cyber attacks in 2023 with more than 300 million victims, and a 72% increase in data breaches since 2021. However, there is a fundamental challenge standing in the way of being successful: data.
Ever since Hippocrates founded his school of medicine in ancient Greece some 2,500 years ago, writes Hannah Fry in her book Hello World: Being Human in the Age of Algorithms, what has been fundamental to healthcare (or, as she calls it, “the fight to keep us healthy”) has been observation, experimentation, and the analysis of data.
Today, we’re making available a new capability of AWS Glue Data Catalog that allows generating column-level statistics for AWS Glue tables. Data lakes are designed for storing vast amounts of raw, unstructured, or semi-structured data at a low cost, and organizations share those datasets across multiple departments and teams.
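A hedged sketch of kicking off statistics generation with boto3 (the StartColumnStatisticsTaskRun API); the database, table, role, and column names are assumptions.

```python
import boto3

glue = boto3.client("glue")

# Start statistics generation for selected columns of a catalog table;
# all identifiers here are illustrative.
run = glue.start_column_statistics_task_run(
    DatabaseName="sales_db",
    TableName="orders",
    Role="arn:aws:iam::123456789012:role/GlueStatsRole",
    ColumnNameList=["order_id", "amount"],   # omit to cover all columns
)

# Query engines can use these statistics for better query planning
# (e.g., join ordering and predicate selectivity estimates).
print(run["ColumnStatisticsTaskRunId"])
```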
Data platform architecture has an interesting history. A read-optimized platform that can integrate data from multiple applications emerged. In another decade, the internet and mobile started to generate data of unforeseen volume, variety, and velocity. It required a different data platform solution. It is too expensive.
This view is used to identify patterns and trends in customer behavior, which can inform data-driven decisions to improve business outcomes. In this post, we discuss how you can use purpose-built AWS services to create an end-to-end data strategy for C360 to unify and govern customer data that address these challenges.
If the point of Business Intelligence (BI) data governance is to leverage your datasets to support information transparency and decision-making, then it’s fair to say that the data catalog is key for your BI strategy. At least, as far as data analysis is concerned. The Benefits of Structured Data Catalogs.
How dbt Core helps data teams test, validate, and monitor complex data transformations and conversions. dbt Core, an open-source framework for developing, testing, and documenting SQL-based data transformations, has become a must-have tool for modern data teams as the complexity of data pipelines grows.
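Since dbt 1.5, dbt Core can also be driven programmatically rather than only from the command line. A minimal sketch that runs a project’s tests from Python; it assumes it is executed inside an initialized dbt project.

```python
# Programmatic invocation of dbt Core (available since dbt 1.5);
# assumes this script runs from within an initialized dbt project.
from dbt.cli.main import dbtRunner

dbt = dbtRunner()

# Equivalent to `dbt test` on the command line: runs the schema and
# data tests defined for your models.
result = dbt.invoke(["test"])

if result.success:
    print("All tests passed")
else:
    print("Some tests failed or dbt hit an error")
```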
Although less complex than the “4 Vs” of big data (velocity, veracity, volume, and variety), orienting to the variety and volume of a challenging puzzle is similar to what CIOs face with information management. Operationalizing data to drive revenue – CIOs report that their roles are rising in importance and impact. What’s changed?
This is part of Ontotext’s AI-in-Action initiative aimed at enabling data scientists and engineers to benefit from the AI capabilities of our products. RED’s focus on news content serves a pivotal function: identifying, extracting, and structuring data on events, parties involved, and subsequent impacts.
In today’s fast-changing environment, enterprises that have transitioned from being focused on applications to becoming data-driven gain a significant competitive edge. There are four groups of data that are naturally siloed: Structured data (e.g., internal metadata, industry ontologies, etc.)
Today I am talking to Christopher Bannocks, who is Group Chief Data Officer at ING. As stressed in other recent In-depth interviews [1], data is a critical asset in banking and related activities, so Christopher’s role is a pivotal one. [2] I was asked to help solve the data problem.
FMs are multimodal; they work with different data types such as text, video, audio, and images. Large language models (LLMs) are a type of FM and are pre-trained on vast amounts of text data and typically have application uses such as text generation, intelligent chatbots, or summarization.
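On AWS, for instance, such an LLM call for summarization can go through Amazon Bedrock’s Converse API. A hedged sketch follows; the model ID and input text are assumptions, and the model must be enabled in your account.

```python
import boto3

bedrock = boto3.client("bedrock-runtime")

document = "Quarterly revenue grew 12% while churn fell to 3%."  # illustrative input

# Summarization with a hosted LLM via the Bedrock Converse API.
# The model ID is an assumption; substitute one enabled in your account.
response = bedrock.converse(
    modelId="anthropic.claude-3-haiku-20240307-v1:0",
    messages=[{
        "role": "user",
        "content": [{"text": "Summarize in one sentence:\n" + document}],
    }],
)

print(response["output"]["message"]["content"][0]["text"])
```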
Knowledge Representation – In the context of the Financial Services Industry domain, the most popular examples of such data are entity (Who?) These two key data elements are used in approximately 80% of the use cases in the sector. Integrating reporting to move to a more streamlined, efficient approach to data collection.
Another problem in drug development is that although companies gather a lot of data from multiple databases, they struggle to derive key insights about safety, druggability, etc. They need some kind of compass to find their way in the data jungle and identify information that is useful and relevant to their specific use case.
Gartner predicts that graph technologies will be used in 80% of data and analytics innovations by 2025, up from 10% in 2021. Several factors are driving the adoption of knowledge graphs. Use Case #1: Customer 360 / Enterprise 360 – Customer data is typically spread across multiple applications, departments, and regions.
SumUp is a leading global financial technology company driven by the purpose of leveling the playing field for small businesses. Unless, of course, the rest of their data also resides in the Google Cloud. AWS Glue gave us a cost-efficient option to migrate the data and we further optimized storage cost by pruning cold data.
Data Swamp vs Data Lake. And so will your data. You know the story well: you have a ton of data and need fast access to the right data. Building an efficient solution for data storage and processing is becoming more than just a back-office or IT challenge. Benefits of a Data Lake.
Today’s data tool challenges. By enabling their event analysts to monitor and analyze events in real time directly in their data visualization tool, and to rate and give feedback to the system interactively, they increased their data-to-insight productivity by a factor of 10.