Data Architecture and Structured Data

Making OT-IT integration a reality with new data architectures and generative AI

CIO Business Intelligence

FEBRUARY 20, 2024

Here, industrial knowledge graphs are going to prove vital by enabling manufacturers to combine structured and unstructured data from a wide range of operational and enterprise software systems to drive better decision-making, problem-solving and more advanced automation.

Data Architecture

Data Architecture Unstructured Data Manufacturing IT

Incremental refresh for Amazon Redshift materialized views on data lake tables

AWS Big Data

NOVEMBER 8, 2024

Amazon Redshift is a fast, fully managed cloud data warehouse that makes it cost-effective to analyze your data using standard SQL and business intelligence tools. He specializes in migrating enterprise data warehouses to AWS Modern Data Architecture. He contributed to query processing and materialized views.

Data Lake

Data Lake Data Warehouse Optimization Testing

What Separates Hybrid Cloud and ‘True’ Hybrid Cloud?

Cloudera

MAY 14, 2024

To attain that level of data quality, a majority of business and IT leaders have opted to take a hybrid approach to data management, moving data between cloud, on-premises, or a combination of the two, to wherever they can best use it for analytics or feeding AI models. What do we mean by ‘true’ hybrid? Let’s dive deeper.

Data Architecture

Data Architecture Unstructured Data Data Governance Structured Data

Run Apache XTable in AWS Lambda for background conversion of open table formats

AWS Big Data

NOVEMBER 26, 2024

This post was co-written with Dipankar Mazumdar, Staff Data Engineering Advocate with AWS Partner OneHouse. Data architecture has evolved significantly to handle growing data volumes and diverse workloads.

Metadata

Metadata Data Lake Snapshot Data Warehouse

How EUROGATE established a data mesh architecture using Amazon DataZone

AWS Big Data

JANUARY 15, 2025

Need for a data mesh architecture: Because entities in the EUROGATE group generate vast amounts of data from various sources (across departments, locations, and technologies), the traditional centralized data architecture struggles to keep up with the demands for real-time insights, agility, and scalability.

IoT

IoT Machine Learning Metadata Data-driven

The Future Is Hybrid Data, Embrace It

Cloudera

JUNE 7, 2022

We live in a hybrid data world. In the past decade, the amount of structured data created, captured, copied, and consumed globally has grown from less than 1 ZB in 2011 to nearly 14 ZB in 2020. Impressive, but dwarfed by the amount of unstructured data, cloud data, and machine data – another 50 ZB.

IT

IT Data Architecture Unstructured Data Big Data

Building a Beautiful Data Lakehouse

CIO Business Intelligence

MARCH 9, 2022

As a result, users can easily find what they need, and organizations avoid the operational and cost burdens of storing unneeded or duplicate data copies. Newer data lakes are highly scalable and can ingest structured and semi-structured data along with unstructured data like text, images, video, and audio.

Data Lake

Data Lake Unstructured Data Data Warehouse Big Data

Large Language Models and Data Management

Ontotext

JULY 24, 2023

A Few Cautions: An LLM references a huge amount of data to become truly functional, which makes training the model quite expensive and time-consuming. Supercomputers (and other components of infrastructure) along with new approaches to data architecture (with billions of parameters) are needed.

Modeling

Modeling Management Structured Data Data Architecture

Texas Rangers data transformation modernizes stadium operations

CIO Business Intelligence

OCTOBER 18, 2022

She decided to bring Resultant in to assist, starting with the firm’s strategic data assessment (SDA) framework, which evaluates a client’s data challenges in terms of people and processes, data models and structures, data architecture and platforms, visual analytics and reporting, and advanced analytics.

Data Transformation

Data Transformation Consulting Data Lake Reporting

The Future Is Hybrid Data, Embrace It

CIO Business Intelligence

JUNE 23, 2022

We live in a hybrid data world. In the past decade, the amount of structured data created, captured, copied, and consumed globally has grown from less than 1 ZB in 2011 to nearly 14 ZB in 2020. Impressive, but dwarfed by the amount of unstructured data, cloud data, and machine data – another 50 ZB.

IT

IT Data Architecture Unstructured Data Big Data

Databricks’ new data lakehouse aims at media, entertainment sector

CIO Business Intelligence

APRIL 25, 2022

The other 10% represents the effort of initial deployment, data-loading, configuration and the setup of administrative tasks and analysis that is specific to the customer, Henschen said. Features focus on media and entertainment firms.

Recreation/Entertainment

Recreation/Entertainment Data Lake Data Warehouse Unstructured Data

Ingest data from Google Analytics 4 and Google Sheets to Amazon Redshift using Amazon AppFlow

AWS Big Data

JANUARY 6, 2025

Amazon Redshift is a fast, scalable, and fully managed cloud data warehouse that allows you to process and run your complex SQL analytics workloads on structured and semi-structured data. He specializes in migrating enterprise data warehouses to AWS Modern Data Architecture.

Analytics

Analytics Data Warehouse Big Data Metrics

Big Data Ingestion: Parameters, Challenges, and Best Practices

datapine

AUGUST 20, 2019

Operations data: Data generated from a set of operations such as orders, online transactions, competitor analytics, sales data, point of sales data, pricing data, etc. The gigantic evolution of structured, unstructured, and semi-structured data is referred to as Big data.

Big Data

Big Data B2B Cost-Benefit Structured Data

What is data governance? Best practices for managing data assets

CIO Business Intelligence

MARCH 24, 2023

A framework for managing data 10 master data management certifications that will pay off Big Data, Data and Information Security, Data Integration, Data Management, Data Mining, Data Science, IT Governance, IT Governance Frameworks, Master Data Management

Data Governance

Data Governance Management Metadata Data Quality

Chose Both: Data Fabric and Data Lakehouse

Cloudera

SEPTEMBER 12, 2022

First, organizations have a tough time getting their arms around their data. More data is generated in ever wider varieties and in ever more locations. Organizations don’t know what they have anymore and so can’t fully capitalize on it — the majority of data generated goes unused in decision making. Unified data fabric.

Unstructured Data

Unstructured Data Data Architecture Data Lake Snapshot

Unstructured data management and governance using AWS AI/ML and analytics services

AWS Big Data

OCTOBER 25, 2023

Most companies produce and consume unstructured data such as documents, emails, web pages, engagement center phone calls, and social media. By some estimates, unstructured data can make up to 80–90% of all new enterprise data and is growing many times faster than structured data.

Unstructured Data

Unstructured Data Metadata Management Analytics

Very Meta … Unlocking Data’s Potential with Metadata Management Solutions

erwin

OCTOBER 24, 2019

It is the only solution that can automatically harvest, transform and feed metadata from operational processes, business applications and data models into a central data catalog, where it is then made accessible and understandable within the context of role-based views.

Metadata

Metadata Management Data-driven Data Architecture

Migrate a petabyte-scale data warehouse from Actian Vectorwise to Amazon Redshift

AWS Big Data

MAY 30, 2024

Amazon Redshift is a fast, scalable, and fully managed cloud data warehouse that allows you to process and run your complex SQL analytics workloads on structured and semi-structured data. Solution overview: Amazon Redshift is an industry-leading cloud data warehouse.

Data Warehouse

Data Warehouse Data Lake Cost-Benefit Structured Data

How Cloudera Data Flow Enables Successful Data Mesh Architectures

Cloudera

OCTOBER 7, 2021

Those decentralization efforts appeared under different monikers through time, e.g., data marts versus data warehousing implementations (a popular architectural debate in the era of structured data) then enterprise-wide data lakes versus smaller, typically BU-Specific, “data ponds”.

Metadata

Metadata Cost-Benefit Enterprise Interactive

If Johnny Mnemonic Smuggled Linked Data

Ontotext

MAY 30, 2019

It won’t protect you from issues of data quality or from service failures. […] But Linked Data does provide you with new ways to manage these existing data-management challenges. 6 Linked Data, Structured Data on the Web. Linked Data and Information Retrieval.

Cost-Benefit

Cost-Benefit Big Data Technology Metadata

Data science vs data analytics: Unpacking the differences

IBM Big Data Hub

SEPTEMBER 19, 2023

Overview: Data science vs data analytics Think of data science as the overarching umbrella that covers a wide range of tasks performed to find patterns in large datasets, structure data for use, train machine learning models and develop artificial intelligence (AI) applications.

Data Science

Data Science Data Analytics Prescriptive Analytics Analytics

Snowflake: A New Blueprint for the Modern Data Warehouse

CDW Research Hub

JULY 22, 2019

Snowflake’s cloud-built data warehouse enables the data-driven enterprise with instant elasticity, secure data sharing, and per-second pricing across multiple clouds. With Snowflake, you can store, transform and analyze structured and semi-structured data together.

Data Warehouse

Data Warehouse Business Intelligence Structured Data Data-driven

How smava makes loans transparent and affordable using Amazon Redshift Serverless

AWS Big Data

DECEMBER 21, 2023

Overview of solution: As a data-driven company, smava relies on the AWS Cloud to power their analytics use cases. smava ingests data from various external and internal data sources into a landing stage on the data lake based on Amazon Simple Storage Service (Amazon S3).

Data Lake

Data Lake Data Warehouse Data-driven B2B

The hidden history of Db2

IBM Big Data Hub

JULY 5, 2022

In today’s world of complex data architectures and emerging technologies, databases can sometimes be undervalued and unrecognized. Take control of your data governance, security and compliance with Db2’s comprehensive, built-in auditing, access control, and data visibility capabilities.

Data Lake

Data Lake Data Warehouse Publishing Structured Data

Take advantage of AI and use it to make your business better

IBM Big Data Hub

AUGUST 15, 2023

To that end, IBM is building a set of domain-specific foundation models that go beyond natural language learning models and are trained on multiple types of business data, including code, time-series data, tabular data, geospatial data, semi-structured data, and mixed-modality data such as text combined with images.

IT

IT Data Governance Modeling Cost-Benefit

Data, Databases and Deeds: A SPARQL Query to the Rescue

Ontotext

APRIL 25, 2019

The SPARQL query is a way to search, access and retrieve structured data by pulling together information from diverse data sources. The SPARQL query language, designed and endorsed by the W3C, is the standard for querying data, stored in RDF or mapped to RDF.

Cost-Benefit

Cost-Benefit Enterprise Structured Data Data Architecture

Okay, You Got a Knowledge Graph Built with Semantic Technology… And Now What?

Ontotext

JULY 26, 2019

Examples of such continuous improvement are technological giants like Google and Amazon who use semantic technology principles to build better data architectures for better user experiences. Such an approach, no matter what name we use for it, is all about improving the way enterprises operate in an interconnected world.

Technology

Technology Enterprise Data Integration Structured Data

Detect, mask, and redact PII data using AWS Glue before loading into Amazon OpenSearch Service

AWS Big Data

JANUARY 12, 2024

Before data records land on Amazon S3, we implement an ingestion layer to bring all data streams reliably and securely to the data lake. Kinesis Data Streams is deployed as an ingestion layer for accelerated intake of structured and semi-structured data streams.

Data Lake

Data Lake Cost-Benefit Visualization Structured Data

Create a modern data platform using the Data Build Tool (dbt) in the AWS Cloud

AWS Big Data

NOVEMBER 9, 2023

Data ingestion, whether real time or batch, forms the basis of any effective data analysis, enabling organizations to gather information from diverse sources and use it for insightful decision-making. It’s raw, unprocessed data straight from the source.

Data Warehouse

Data Warehouse Testing Data Quality Reporting

Design a data mesh on AWS that reflects the envisioned organization

AWS Big Data

JANUARY 22, 2024

They classified the metrics and indicators in the following categories: Data usage – A clear understanding of who is consuming what data source, materialized with a mapping of consumers and producers. For other organizations, the desired data mesh might look different and the approach might have other learnings.

Data-driven

Data-driven Advertising Metadata Data Architecture

Leverage Data Virtualization to Build a Modern Data System

CDW Research Hub

OCTOBER 12, 2021

Business leaders need to quickly access data—and to trust the accuracy of that data—to make better decisions. As organizations grow and evolve, many find a need for more sophisticated analytics across an ever-increasing amount of digital and consumer data. Unreliable Data as a Service (DaaS) implementations.

Data Warehouse

Data Warehouse Big Data Data Architecture Cost-Benefit

Top analytics announcements of AWS re:Invent 2024

AWS Big Data

FEBRUARY 26, 2025

Amazon SageMaker Lakehouse provides an open data architecture that reduces data silos and unifies data across Amazon Simple Storage Service (Amazon S3) data lakes, Redshift data warehouses, and third-party and federated data sources.

Analytics

Analytics Data Lake Metadata Data Warehouse

How GamesKraft uses Amazon Redshift data sharing to support growing analytics workloads

AWS Big Data

NOVEMBER 13, 2023

Amazon Redshift is a fully managed data warehousing service that offers both provisioned and serverless options, making it more efficient to run and scale analytics without having to manage your data warehouse. Key considerations: Gameskraft embraces a modern data architecture, with the data lake residing in Amazon S3.

Data Warehouse

Data Warehouse Analytics Data Lake Data Science

Accelerate Amazon Redshift Data Lake queries with AWS Glue Data Catalog Column Statistics

AWS Big Data

OCTOBER 1, 2024

Amazon Redshift enables you to efficiently query and retrieve structured and semi-structured data from open format files in an Amazon S3 data lake without having to load the data into Amazon Redshift tables. Amazon Redshift extends SQL capabilities to your data lake, enabling you to run analytical queries.

Data Lake

Data Lake Statistics Broadcasting Optimization

Get maximum value out of your cloud data warehouse with Amazon Redshift

AWS Big Data

APRIL 19, 2023

Different departments within an organization can place data in a data lake or within their data warehouse depending on the type of data and usage patterns of that department. Nasdaq’s massive data growth meant they needed to evolve their data architecture to keep up.

Data Warehouse

Data Warehouse Data Lake Unstructured Data Optimization

You Cannot Get to the Moon on a Bike!

Ontotext

JANUARY 10, 2024

This will allow enterprises to derive insights from their disparate databases, integrate external knowledge for better interpretation of their data, and incorporate information extracted from unstructured content. In order to integrate structured data, enterprises need to implement the data fabric pattern.

Metadata

Metadata Slice and Dice Data Integration Enterprise

Exploring real-time streaming for generative AI Applications

AWS Big Data

MARCH 25, 2024

Both engines provide native ingestion support from Kinesis Data Streams and Amazon MSK via a separate streaming pipeline to a data lake or data warehouse for analysis. Data streaming enables you to ingest data from a variety of databases across various systems.

Data Lake

Data Lake Unstructured Data Management Snapshot

Data platform trinity: Competitive or complementary?

IBM Big Data Hub

JANUARY 18, 2023

In another decade, the internet and mobile started to generate data of unforeseen volume, variety and velocity. It required a different data platform solution. Hence, Data Lake emerged, which handles unstructured and structured data with huge volume. Metadata plays a key role here in discovering the data assets.

Data Lake

Data Lake Data Warehouse Data-driven Metadata

Create an end-to-end data strategy for Customer 360 on AWS

AWS Big Data

MARCH 26, 2024

Strategize based on how your teams explore data, run analyses, wrangle data for downstream requirements, and visualize data at different levels. The AWS modern data architecture shows a way to build a purpose-built, secure, and scalable data platform in the cloud.

Data Strategy

Data Strategy Strategy Data Warehouse Prescriptive Analytics

Implement slowly changing dimensions in a data lake using AWS Glue and Delta

AWS Big Data

MARCH 28, 2023

Conclusion: In this post, we demonstrated how to identify the changed data for a semi-structured data source and preserve the historical changes (SCD Type 2) on an S3 Delta Lake, when source systems are unable to provide the change data capture capability, with AWS Glue.

Data Lake

Data Lake Testing Snapshot Big Data

Your Data Architecture Holds the Key to Unlocking AI’s Full Potential

CIO Business Intelligence

APRIL 4, 2023

In order to move AI forward, we need to first build and fortify the foundational layer: data architecture. This architecture is important because, to reap the full benefits of AI, it must be built to scale across an enterprise versus individual AI applications. Constructing the right data architecture cannot be bypassed.

Data Architecture

Data Architecture Data Lake Data Warehouse Cost-Benefit
