Data Architecture, Data Integration and Visualization

Simplify data integration with AWS Glue and zero-ETL to Amazon SageMaker Lakehouse

AWS Big Data

DECEMBER 4, 2024

While traditional extract, transform, and load (ETL) processes have long been a staple of data integration due to its flexibility, for common use cases such as replication and ingestion, they often prove time-consuming, complex, and less adaptable to the fast-changing demands of modern data architectures.

Data Integration

Data Integration Data Lake Statistics Data-driven

Explore visualizations with AWS Glue interactive sessions

AWS Big Data

SEPTEMBER 20, 2023

AWS Glue interactive sessions now include native support for the matplotlib visualization library (AWS Glue version 3.0 In this post, we look at how we can use matplotlib and Seaborn to explore and visualize data using AWS Glue interactive sessions, facilitating rapid insights without complex infrastructure setup. and later).

Interactive

Interactive Visualization Measurement Data Architecture

How EUROGATE established a data mesh architecture using Amazon DataZone

AWS Big Data

JANUARY 15, 2025

Need for a data mesh architecture Because entities in the EUROGATE group generate vast amounts of data from various sourcesacross departments, locations, and technologiesthe traditional centralized data architecture struggles to keep up with the demands for real-time insights, agility, and scalability.

IoT

IoT Machine Learning Metadata Data-driven

Webinars

Data Talks, CFOs Listen: Why Analytics Are Key To Better Spend Management

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

The next generation of Amazon SageMaker: The center for all your data, analytics, and AI

AWS Big Data

DECEMBER 4, 2024

Collaborate and build faster using familiar AWS tools for model development, generative AI, data processing, and SQL analytics with Amazon Q Developer , the most capable generative AI assistant for software development, helping you along the way.

Data Analytics

Data Analytics Analytics Data Lake Data Quality

Exploring new ETL and ELT capabilities for Amazon Redshift from the AWS Glue Studio visual editor

AWS Big Data

APRIL 20, 2023

In a modern data architecture, unified analytics enable you to access the data you need, whether it’s stored in a data lake or a data warehouse. One of the most common use cases for data preparation on Amazon Redshift is to ingest and transform data from different data stores into an Amazon Redshift data warehouse.

Visualization

Visualization Data Warehouse Big Data Data Lake

Data integrity vs. data quality: Is there a difference?

IBM Big Data Hub

JULY 13, 2023

When we talk about data integrity, we’re referring to the overarching completeness, accuracy, consistency, accessibility, and security of an organization’s data. Together, these factors determine the reliability of the organization’s data. In short, yes.

Data Quality

Data Quality Data Integration Metadata Cost-Benefit

5 Ways Data Modeling Is Critical to Data Governance

erwin

JANUARY 9, 2020

While it’s always been the best way to understand complex data sources and automate design standards and integrity rules, the role of data modeling continues to expand as the fulcrum of collaboration between data generators, stewards and consumers. So here’s why data modeling is so critical to data governance.

Data Governance

Data Governance Modeling Metadata Unstructured Data

Use Apache Iceberg in your data lake with Amazon S3, AWS Glue, and Snowflake

AWS Big Data

APRIL 3, 2024

They understand that a one-size-fits-all approach no longer works, and recognize the value in adopting scalable, flexible tools and open data formats to support interoperability in a modern data architecture to accelerate the delivery of new solutions. Andries has over 20 years of experience in the field of data and analytics.

Data Lake

Data Lake Snapshot Metadata Data Architecture

Back to the Financial Regulatory Future

Cloudera

FEBRUARY 15, 2024

Seeing the future in a modern data architecture The key to successfully navigating these challenges lies in the adoption of a modern data architecture. The promise of a modern data architecture might seem like a distant reality, but we at Cloudera believe data can make what is impossible today, possible tomorrow.

Insurance

Insurance Data Architecture Risk Management Risk

Load data incrementally from transactional data lakes to data warehouses

AWS Big Data

OCTOBER 19, 2023

Data lakes and data warehouses are two of the most important data storage and management technologies in a modern data architecture. Data lakes store all of an organization’s data, regardless of its format or structure. Various data stores are supported in AWS Glue; for example, AWS Glue 4.0

Data Lake

Data Lake Data Warehouse Visualization Snapshot

What is data governance? Best practices for managing data assets

CIO Business Intelligence

MARCH 24, 2023

The Business Application Research Center (BARC) warns that data governance is a highly complex, ongoing program, not a “big bang initiative,” and it runs the risk of participants losing trust and interest over time. Informatica Axon Informatica Axon is a collection hub and data marketplace for supporting programs.

Data Governance

Data Governance Management Metadata Data Quality

Extracting key insights from Amazon S3 access logs with AWS Glue for Ray

AWS Big Data

SEPTEMBER 7, 2023

This blog post presents an architecture solution that allows customers to extract key insights from Amazon S3 access logs at scale. We will partition and format the server access logs with Amazon Web Services (AWS) Glue , a serverless data integration service, to generate a catalog for access logs and create dashboards for insights.

Metadata

Metadata Dashboards Metrics Visualization

Modern Data Modeling: The Foundation of Enterprise Data Management and Data Governance

erwin

MAY 13, 2020

Metadata management is the key to managing and governing your data and drawing intelligence from it. Beyond harvesting and cataloging metadata , it also must be visualized to break down the complexity of how data is organized and what data relationships there are so that meaning is explicit to all stakeholders in the data value chain.

Data Governance

Data Governance Enterprise Modeling Management

CIO Ryan Snyder on the benefits of interpreting data as a layer cake

CIO Business Intelligence

AUGUST 2, 2023

So Thermo Fisher Scientific CIO Ryan Snyder and his colleagues have built a data layer cake based on a cascading series of discussions that allow IT and business partners to act as one team. Martha Heller: What are the business drivers behind the data architecture ecosystem you’re building at Thermo Fisher Scientific?

Manufacturing

Manufacturing Data Architecture Data Strategy Strategy

How Automation and No-Code are Driving Modern Data Warehousing

CIO Business Intelligence

APRIL 5, 2022

With code-free ETL/ELT pipeline generation, users can take data from its source to its target warehouse with simple drag-and-drop actions. Adding further agile data modelling functionalities into the product allows models to be updated and redeployed, enabling data architectures to evolve continuously to meet user needs.

Data Warehouse

Data Warehouse Visualization Data-driven Data Architecture

SAP Datasphere review: turning data from a technical problem to a business data product.

Jen Stirrup

MARCH 29, 2023

However, to turn data into a business problem, organizations need support to move away from technical issues to start getting value as quickly as possible. SAP Datasphere simplifies data integration, cataloging, semantic modeling, warehousing, federation, and virtualization through a unified interface. Why is this interesting?

Data Warehouse

Data Warehouse Metadata Data Integration Business Intelligence

Big Data Ingestion: Parameters, Challenges, and Best Practices

datapine

AUGUST 20, 2019

Big data: Architecture and Patterns. The Big data problem can be comprehended properly using a layered architecture. Big data architecture consists of different layers and each layer performs a specific function. The architecture of Big data has 6 layers. Big Data Ingestion.

Big Data

Big Data B2B Cost-Benefit Structured Data

Augmented data management: Data fabric versus data mesh

IBM Big Data Hub

APRIL 27, 2022

Data fabric and data mesh are emerging data management concepts that are meant to address the organizational change and complexities of understanding, governing and working with enterprise data in a hybrid multicloud ecosystem. The good news is that both data architecture concepts are complimentary.

Management

Management Metadata Data Architecture Data Lake

Top analytics announcements of AWS re:Invent 2024

AWS Big Data

FEBRUARY 26, 2025

Amazon SageMaker Unified Studio brings together functionality and tools from the range of standalone studios, query editors, and visual tools available today in Amazon EMR , AWS Glue , Amazon Redshift , Amazon Bedrock , and the existing Amazon SageMaker Studio. With AWS Glue 5.0, AWS Glue 5.0 AWS Glue 5.0 Apache Iceberg 1.6.1,

Analytics

Analytics Data Lake Metadata Data Warehouse

Create an end-to-end data strategy for Customer 360 on AWS

AWS Big Data

MARCH 26, 2024

Data ingestion You have to build ingestion pipelines based on factors like types of data sources (on-premises data stores, files, SaaS applications, third-party data), and flow of data (unbounded streams or batch data). Data exploration Data exploration helps unearth inconsistencies, outliers, or errors.

Data Strategy

Data Strategy Strategy Data Warehouse Prescriptive Analytics

Amazon Redshift announcements at AWS re:Invent 2023 to enable analytics on all your data

AWS Big Data

NOVEMBER 29, 2023

Maximize value with comprehensive analytics and ML capabilities “Amazon Redshift is one of the most important tools we had in growing Jobcase as a company.” – Ajay Joshi, Distinguished Engineer, Jobcase With all your data integrated and available, you can easily build and run near real-time analytics to AI/ML/Generative AI applications.

Data Warehouse

Data Warehouse Analytics Data Lake Machine Learning

5 Reasons to Use Apache Iceberg on Cloudera Data Platform (CDP)

Cloudera

MARCH 23, 2022

In fact, we recently announced the integration with our cloud ecosystem bringing the benefits of Iceberg to enterprises as they make their journey to the public cloud, and as they adopt more converged architectures like the Lakehouse. 1: Multi-function analytics . 1: Multi-function analytics . 2: Open formats.

Metadata

Metadata Data Architecture Machine Learning Cost-Benefit

Detect, mask, and redact PII data using AWS Glue before loading into Amazon OpenSearch Service

AWS Big Data

JANUARY 12, 2024

This solution is suitable for customers who don’t require real-time ingestion to OpenSearch Service and plan to use data integration tools that run on a schedule or are triggered through events. Before data records land on Amazon S3, we implement an ingestion layer to bring all data streams reliably and securely to the data lake.

Data Lake

Data Lake Cost-Benefit Visualization Structured Data

Okay, You Got a Knowledge Graph Built with Semantic Technology… And Now What?

Ontotext

JULY 26, 2019

Examples of such continuous improvement are technological giants like Google and Amazon who use semantic technology principles to build better data architectures for better user experiences. In the healthcare industry, data integration is of paramount importance. Read more at: [link].

Technology

Technology Enterprise Data Integration Structured Data

Introducing erwin Data Modeler 14.0: The next step in a tradition of data modeling excellence

erwin

SEPTEMBER 16, 2024

Migration and modernization : It enables seamless transitions between legacy systems and modern platforms, ensuring your data architecture evolves without disruption. Migration and modernization : It enables seamless transitions between legacy systems and modern platforms, ensuring your data architecture evolves without disruption.

Modeling

Modeling Visualization Data Governance Data Architecture

Best BI Tools For 2024 You Need to Know

FineReport

MARCH 31, 2024

In 2024, business intelligence (BI) software has undergone significant advancements, revolutionizing data management and decision-making processes. Harnessing the power of advanced APIs, automation, and AI, these tools simplify data compilation, organization, and visualization, empowering users to extract actionable insights effortlessly.

Dashboards

Dashboards Visualization Data mining Data-driven

Salesforce certification guide: Roles, paths, exams, cost, training, requirements

CIO Business Intelligence

FEBRUARY 20, 2023

To earn the Salesforce Data Architect certification , candidates should be able to design and implement data solutions within the Salesforce ecosystem, such as data modelling, data integration and data governance.

B2B

B2B Consulting Sales Cost-Benefit

Okay, You Got a Knowledge Graph Built with Semantic Technology… And Now What?

Ontotext

JULY 26, 2019

Examples of such continuous improvement are technological giants like Google and Amazon who use semantic technology principles to build better data architectures for better user experiences. In the healthcare industry, data integration is of paramount importance. Read more at: [link].

Technology

Technology Data Integration Enterprise Structured Data

How Cargotec uses metadata replication to enable cross-account data sharing

AWS Big Data

JUNE 7, 2023

This data needs to be ingested into a data lake, transformed, and made available for analytics, machine learning (ML), and visualization. For this, Cargotec built an Amazon Simple Storage Service (Amazon S3) data lake and cataloged the data assets in AWS Glue Data Catalog.

Metadata

Metadata Data Lake Machine Learning Big Data

What Is Embedded Analytics?

Jet Global

MAY 1, 2023

This is in contrast to traditional BI, which extracts insight from data outside of the app. We rely on increasingly mobile technology to comb through massive amounts of data and solve high-value problems. Plus, there is an expectation that tools be visually appealing to boot. Their dashboards were visually stunning.

Analytics

Analytics Cost-Benefit Visualization Dashboards

Batch data ingestion into Amazon OpenSearch Service using AWS Glue

AWS Big Data

JANUARY 13, 2025

Its ability to process and transform massive datasets has made it an indispensable tool in modern data engineering. Amazon OpenSearch Service a community-driven search and analytics solutionempowers organizations to search, aggregate, visualize, and analyze data seamlessly.

Visualization

Visualization Interactive Data-driven Data Architecture

Introducing the HubSpot connector for AWS Glue

AWS Big Data

DECEMBER 2, 2024

More companies have realized there is an opportunity to integrate, enhance, and present this SaaS data to improve internal operations and gain valuable insights on their data. From there, they can perform meaningful analytics, gain valuable insights, and optionally push enriched data back to external SaaS platforms.

Data Lake

Data Lake Testing Data Integration Metadata

“You Complete Me,” said Data Lineage to DataOps Observability.

DataKitchen

JANUARY 23, 2023

Like an apartment blueprint, Data lineage provides a written document that is only marginally useful during a crisis. This is especially true in the case of the one-to-many, producer-to-consumer relationships we have on our data architecture. Are problems with data tests? They measure data sets at a point in time.

Testing

Testing Data Governance Data Quality Data-driven

Power BI Write Back With Microsoft Fabric and Teams

Jet Global

MAY 9, 2025

The Challenge of Capturing Human Input Modern data architectures, like Microsoft Fabric, excel in collecting and processing system-generated data. Whether transactional data, operational metrics, or system logs, these platforms are optimized to deliver analytical insights from structured sources.

Visualization

Visualization Dashboards Reporting Data Architecture

Data Leaders Brief

Simplify data integration with AWS Glue and zero-ETL to Amazon SageMaker Lakehouse

Explore visualizations with AWS Glue interactive sessions

Webinars

Trending Sources

How EUROGATE established a data mesh architecture using Amazon DataZone

Webinars

The next generation of Amazon SageMaker: The center for all your data, analytics, and AI

Exploring new ETL and ELT capabilities for Amazon Redshift from the AWS Glue Studio visual editor

Data integrity vs. data quality: Is there a difference?

5 Ways Data Modeling Is Critical to Data Governance

Use Apache Iceberg in your data lake with Amazon S3, AWS Glue, and Snowflake

Back to the Financial Regulatory Future

Load data incrementally from transactional data lakes to data warehouses

What is data governance? Best practices for managing data assets

Extracting key insights from Amazon S3 access logs with AWS Glue for Ray

Modern Data Modeling: The Foundation of Enterprise Data Management and Data Governance

CIO Ryan Snyder on the benefits of interpreting data as a layer cake

How Automation and No-Code are Driving Modern Data Warehousing

SAP Datasphere review: turning data from a technical problem to a business data product.

Big Data Ingestion: Parameters, Challenges, and Best Practices

Augmented data management: Data fabric versus data mesh

Top analytics announcements of AWS re:Invent 2024

Create an end-to-end data strategy for Customer 360 on AWS

Amazon Redshift announcements at AWS re:Invent 2023 to enable analytics on all your data

5 Reasons to Use Apache Iceberg on Cloudera Data Platform (CDP)

Detect, mask, and redact PII data using AWS Glue before loading into Amazon OpenSearch Service

Okay, You Got a Knowledge Graph Built with Semantic Technology… And Now What?

Introducing erwin Data Modeler 14.0: The next step in a tradition of data modeling excellence

Best BI Tools For 2024 You Need to Know

Salesforce certification guide: Roles, paths, exams, cost, training, requirements

Okay, You Got a Knowledge Graph Built with Semantic Technology… And Now What?

How Cargotec uses metadata replication to enable cross-account data sharing

What Is Embedded Analytics?

Batch data ingestion into Amazon OpenSearch Service using AWS Glue

Introducing the HubSpot connector for AWS Glue

“You Complete Me,” said Data Lineage to DataOps Observability.

Power BI Write Back With Microsoft Fabric and Teams

Stay Connected