Amazon DataZone, a data management service, helps you catalog, discover, share, and govern data stored across AWS, on-premises systems, and third-party sources. This solution enhances governance and simplifies access to unstructured data assets across the organization. This is the data that will be published to Amazon DataZone.
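For readers who want to see what publishing to the catalog can look like in practice, here is a minimal, hedged sketch using boto3's DataZone client. The domain, project, and asset names below are hypothetical placeholders, and the built-in asset-type identifier is an assumption that may differ in your account.

import boto3

# Minimal sketch: registering a data asset in an Amazon DataZone catalog.
# All identifiers below are hypothetical placeholders.
datazone = boto3.client("datazone")

response = datazone.create_asset(
    domainIdentifier="dzd_example123",        # placeholder domain ID
    owningProjectIdentifier="prj_example456", # placeholder project ID
    name="customer_orders",
    typeIdentifier="amazon.datazone.GlueTableAssetType",  # assumed built-in type name
)
print("Published asset:", response["id"])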
Plug-and-play integration: A seamless, plug-and-play integration between data producers and consumers should facilitate rapid use of new data sets and enable quick proofs of concept, such as those run by data science teams. As part of the required data, CHE data is shared using Amazon DataZone.
We have enhanced data sharing performance with improved metadata handling, resulting in first-query execution for data sharing that is up to four times faster while the data sharing producer's data is being updated. Industry-leading price-performance: Amazon Redshift launches RA3.large
The data catalog is a searchable asset that enables all data – including even formerly siloed tribal knowledge – to be cataloged and more quickly exposed to users for analysis. Three types of metadata in a data catalog: technical metadata, operational metadata, and business metadata (the latter for analysis and integration purposes).
Metadata management: Users can centrally manage metadata, including searching, extracting, processing, storing, and sharing metadata, and publishing metadata externally. The metadata here focuses on the dimensions, indicators, hierarchies, measures, and other data required for business analysis.
S3 Tables integration with the AWS Glue Data Catalog is in preview, allowing you to stream, query, and visualize data, including Amazon S3 Metadata tables, using AWS analytics services such as Amazon Data Firehose, Amazon Athena, Amazon Redshift, Amazon EMR, and Amazon QuickSight. With AWS Glue 5.0,
The second is Linked Open Data (LOD): a cloud of interlinked structured datasets published without centralized control across thousands of servers. There are more than 80 million pages with semantic, machine-interpretable metadata, according to the Schema.org standard. Take this restaurant, for example.
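To make the Schema.org point concrete, here is an illustrative JSON-LD snippet of the kind such a restaurant page might embed; every value is invented for the example.

import json

# Illustrative Schema.org markup for a restaurant, serialized as JSON-LD.
# All names, addresses, and values here are invented.
restaurant = {
    "@context": "https://schema.org",
    "@type": "Restaurant",
    "name": "Example Bistro",
    "servesCuisine": "Italian",
    "address": {
        "@type": "PostalAddress",
        "streetAddress": "1 Example Street",
        "addressLocality": "Springfield",
    },
}
print(json.dumps(restaurant, indent=2))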
Unlike structured data, which fits neatly into databases, tables, and the like. I also doubt that all the data your organization owns that's been strategically stored or piling up is accurate and trustworthy, nor that you need to invest in making it so if it's irrelevant and you don't plan to use it.
JSON data in Amazon Redshift: Amazon Redshift enables storage, processing, and analytics on JSON data through the SUPER data type, the PartiQL language, materialized views, and data lake queries. The JSON_PARSE function extracts the binary data in the stream and converts it into the SUPER data type.
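As a rough sketch of how that can look, the statement below parses a raw JSON string into a SUPER value and is submitted through the Redshift Data API; the cluster, database, user, and table names are placeholders, and an events table with a SUPER column is assumed to exist.

import boto3

# Sketch: converting raw JSON text into Redshift's SUPER type with JSON_PARSE,
# submitted via the Redshift Data API. Cluster/database/user are placeholders,
# and we assume a table created as: CREATE TABLE events (payload SUPER);
rsd = boto3.client("redshift-data")

rsd.execute_statement(
    ClusterIdentifier="my-cluster",
    Database="dev",
    DbUser="awsuser",
    Sql="""
        INSERT INTO events
        SELECT JSON_PARSE('{"user_id": 42, "action": "click"}');
    """,
)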
Amazon Redshift is a fast, scalable, and fully managed cloud data warehouse that allows you to process and run your complex SQL analytics workloads on structured and semi-structured data. Business units can simply share data and collaborate by publishing and subscribing to data assets.
Amazon Redshift is a fast, scalable, and fully managed cloud data warehouse that allows you to process and run your complex SQL analytics workloads on structured and semi-structured data. Amazon DataZone natively supports data sharing for Amazon Redshift data assets. In the post_dq_results_to_datazone.py
Sources: Data can be loaded from multiple sources, such as systems of record, data generated from applications, operational data stores, enterprise-wide reference data and metadata, data from vendors and partners, machine-generated data, social sources, and web sources.
We live in a hybrid data world. In the past decade, the amount of structured data created, captured, copied, and consumed globally has grown from less than 1 ZB in 2011 to nearly 14 ZB in 2020. Impressive, but dwarfed by the amount of unstructured data, cloud data, and machine data – another 50 ZB.
Limiting growth by (data integration) complexity: Most operational IT systems in an enterprise have been developed to serve a single business function, and they use the simplest possible model for this. In order to integrate structured data, enterprises need to implement the data fabric pattern.
Developers can use Amazon Location Service's support for publishing device position updates to Amazon EventBridge to build a near-real-time data pipeline that stores the locations of tracked assets in Amazon Simple Storage Service (Amazon S3). Athena is used to run geospatial queries on the location data stored in the S3 buckets.
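A hedged sketch of what such a geospatial query might look like, submitted with boto3: the table, database, output bucket, columns, and polygon coordinates are all hypothetical, while ST_GeometryFromText, ST_Point, and ST_Contains are Athena's built-in geospatial functions.

import boto3

# Sketch: asking Athena which tracked devices fall inside a bounding polygon.
# Table, database, output bucket, column names, and coordinates are hypothetical.
athena = boto3.client("athena")

query = """
SELECT device_id, position_lon, position_lat
FROM tracker_events
WHERE ST_Contains(
    ST_GeometryFromText('POLYGON ((-122.5 47.5, -122.2 47.5, -122.2 47.7, -122.5 47.7, -122.5 47.5))'),
    ST_Point(position_lon, position_lat)
)
"""

athena.start_query_execution(
    QueryString=query,
    QueryExecutionContext={"Database": "location_db"},
    ResultConfiguration={"OutputLocation": "s3://example-bucket/athena-results/"},
)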
It is coming ever closer to the exciting molecular model Nicholas Negroponte, a pioneer in the field of computer-aided design and co-founder of the MIT Media Lab, envisioned in the early 1980s: The structure of text should be imagined like a complex molecular model. Ideas can be opened and analyzed at multiple levels of detail.
Data governance is traditionally applied to structured data assets that are most often found in databases and information systems. This blog focuses on governing spreadsheets that contain data, information, and metadata, and must themselves be governed. Data catalogs and spreadsheets are related in many ways.
Additionally, it is vital to be able to execute computing operations on the 1,000+ PB within a massively parallel, distributed processing system, considering that the data remains dynamic, constantly undergoing updates, deletions, movements, and growth. Consider data types.
[LLMs] call into question a fundamental tenet of Data Management: that in order to address non-trivial information needs, the first step is to explicitly structure data in order to lift it from the ambiguous swamp of our human language. He also reminded us all about his wonderful book, available online with open access.
There could be a dedicated account that acts as a producer to share data, and a few other consumer accounts that subscribe to published assets in the catalog. By doing so, the role has access to the newly subscribed data as well as permissions from previous setups to access data from other AWS resources.
Enterprises generate an enormous amount of data and content every minute. Knowledge graphs allow organizations to enrich it with semantic metadata, making it ready to be used across teams and enterprise systems. Partner with PoolParty and GraphDB to build knowledge graphs for enterprise applications.
Here, the ability of knowledge graphs to integrate diverse data from multiple sources is of high relevance. Knowledge graphs can provide a single access point for various types of data, such as structured data, knowledge organization systems, transactional data, and signals from unstructured content.
In another decade, the internet and mobile started to generate data of unforeseen volume, variety, and velocity. It required a different data platform solution. Hence, the data lake emerged, which handles unstructured and structured data at huge volume. Data discoverability. Data mesh: a mostly new culture.
This shift in both technical and outcome mindset allows them to establish a centralized metadata hub for their data assets and effortlessly access information from diverse systems that previously had limited interaction. There are four groups of data that are naturally siloed: structured data (e.g.,
Professionals working on disinformation detection in domains such as media and publishing, social studies, and civil security have to sift through large quantities of online content in order to identify mentions of events of interest and extract the key information about them.
Change Data Capture (CDC) in the context of a data lake refers to the process of capturing and propagating changes made to source data. Source systems often lack the capability to publish data that is modified or changed. We fetch the metadata of the users_xxxxxx table from Athena.
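For illustration, fetching that table's schema could look like the sketch below; the catalog and database names are placeholders, and users_xxxxxx is kept from the excerpt as-is.

import boto3

# Sketch: reading a table's column metadata through the Athena API.
# Catalog and database names are placeholders; the table name is from the excerpt.
athena = boto3.client("athena")

meta = athena.get_table_metadata(
    CatalogName="AwsDataCatalog",
    DatabaseName="cdc_db",
    TableName="users_xxxxxx",
)
for column in meta["TableMetadata"]["Columns"]:
    print(column["Name"], column["Type"])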
This is a GraphDB-powered system that gathers fact-checking content (also called debunks or debunking articles) and enriches it with meaningful metadata and other information. Thanks to the connections in the graph between the source articles and the enrichments, the data is efficiently retrieved to perform further analysis.
Those decentralization efforts appeared under different monikers through time, e.g., data marts versus data warehousing implementations (a popular architectural debate in the era of structured data), then enterprise-wide data lakes versus smaller, typically BU-specific, “data ponds”.
Let’s explore the continued relevance of data modeling and its journey through history, challenges faced, adaptations made, and its pivotal role in the new age of data platforms, AI, and democratized data access. Embracing the future: In the dynamic world of data, data modeling remains an indispensable tool.
The solution uses the following key services: Amazon API Gateway – API Gateway is a fully managed service that makes it straightforward for developers to create, publish, maintain, monitor, and secure APIs at any scale. APIs act as the entry point for applications to access data, business logic, or functionality from your backend services.
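As a small illustration of that create-and-publish step, API Gateway's quick-create flow can stand up an HTTP API in front of a Lambda function in a single call; the API name and function ARN below are invented for the example.

import boto3

# Sketch: quick-creating an HTTP API that proxies requests to a Lambda function.
# The API name and the Lambda ARN are invented placeholders.
apigw = boto3.client("apigatewayv2")

api = apigw.create_api(
    Name="orders-api",
    ProtocolType="HTTP",
    Target="arn:aws:lambda:us-east-1:123456789012:function:orders-handler",
)
print(api["ApiEndpoint"])  # the deployed entry point that client applications call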
Knowledge graphs, while not as well-known as other data management offerings, are a proven dynamic and scalable solution for addressing enterprise data management requirements across several verticals. The RDF-star extension makes it easy to model provenance and other structured metadata.
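To show what that looks like, here is a small Turtle-star fragment, held in a Python string for convenience; the prefix and IRIs are invented, but the << ... >> quoting syntax is RDF-star's mechanism for annotating a whole statement.

# A small RDF-star example: the << ... >> syntax quotes a triple so that
# provenance can be attached to the statement itself. IRIs are invented.
turtle_star = """
@prefix ex: <http://example.org/> .

<< ex:productX ex:hasPrice "99.00" >>
    ex:source    ex:pressRelease1 ;
    ex:retrieved "2024-01-15" .
"""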
However, a closer look reveals that these systems are far more than simple repositories: data catalogs are at the forefront of bringing AI into your business for at least two reasons. Lineage information and comprehensive metadata are also crucial for documenting and assessing AI models holistically in the domain of AI governance.