Data Integration, Data Lake and Management

Simplify data integration with AWS Glue and zero-ETL to Amazon SageMaker Lakehouse

AWS Big Data

DECEMBER 4, 2024

With the growing emphasis on data, organizations are constantly seeking more efficient and agile ways to integrate their data, especially from a wide variety of applications. SageMaker Lakehouse gives you the flexibility to access and query your data in place with all Apache Iceberg-compatible tools and engines.

Data Integration, Data Lake, Statistics, Data-driven

Amazon Q data integration adds DataFrame support and in-prompt context-aware job creation

AWS Big Data

DECEMBER 20, 2024

Amazon Q data integration, introduced in January 2024, allows you to use natural language to author extract, transform, load (ETL) jobs and operations in the AWS Glue-specific data abstraction, DynamicFrame. In this post, we discuss how Amazon Q data integration transforms ETL workflow development.

Data Integration, Visualization, Data Processing, Big Data

Introducing Precisely for Data Integrity

David Menninger's Analyst Perspectives

JANUARY 25, 2021

Data is becoming more valuable and more important to organizations. At the same time, organizations have become more disciplined about the data on which they rely to ensure it is robust, accurate and governed properly.

Data Integration, Data Processing, Data Lake, IT

What is data architecture? A framework to manage data

CIO Business Intelligence

DECEMBER 20, 2024

Data architecture definition: Data architecture describes the structure of an organization's logical and physical data assets and data management resources, according to The Open Group Architecture Framework (TOGAF). An organization's data architecture is the purview of data architects.

Data Architecture, Management, Consulting, Internet of Things

Amazon Web Services named a Leader in the 2024 Gartner Magic Quadrant for Data Integration Tools

AWS Big Data

FEBRUARY 26, 2025

Amazon Web Services (AWS) has been recognized as a Leader in the 2024 Gartner Magic Quadrant for Data Integration Tools. This recognition, we feel, reflects our ongoing commitment to innovation and excellence in data integration, demonstrating our continued progress in providing comprehensive data management solutions.

Data Integration, Data Lake, Data Warehouse, Unstructured Data

Talend Data Fabric Simplifies Data Life Cycle Management

David Menninger's Analyst Perspectives

NOVEMBER 16, 2021

Talend is a data integration and management software company that offers applications for cloud computing, big data integration, application integration, data quality and master data management.

Management, Data Warehouse, Data Quality, Data Integration

Data Management on Display at Informatica World 2019

David Menninger's Analyst Perspectives

JUNE 12, 2019

Under that focus, Informatica's conference emphasized capabilities across six areas (all strong areas for Informatica): data integration, data management, data quality & governance, Master Data Management (MDM), data cataloging, and data security.

Management, Data Quality, Data Integration, Data Lake

Introducing Amazon Q data integration in AWS Glue

AWS Big Data

APRIL 30, 2024

Today, we’re excited to announce the general availability of Amazon Q data integration in AWS Glue. Amazon Q data integration, a new generative AI-powered capability of Amazon Q Developer, enables you to build data integration pipelines using natural language.

Data Integration, Data Lake, Data Warehouse, Software

Use Apache Iceberg in your data lake with Amazon S3, AWS Glue, and Snowflake

AWS Big Data

APRIL 3, 2024

Apache Iceberg is an Apache 2.0-licensed, 100% open-source data table format that helps simplify data processing on large datasets stored in data lakes. Data engineers use Apache Iceberg because it’s fast, efficient, and reliable at any scale and keeps records of how datasets change over time.

Migrate an existing data lake to a transactional data lake using Apache Iceberg

Recap of Amazon Redshift key product announcements in 2024

Build Write-Audit-Publish pattern with Apache Iceberg branching and AWS Glue Data Quality

Load data incrementally from transactional data lakes to data warehouses

Seamless integration of data lake and data warehouse using Amazon Redshift Spectrum and Amazon DataZone

Accelerate data integration with Salesforce and AWS using AWS Glue

Accelerate analytics and AI innovation with the next generation of Amazon SageMaker

The next generation of Amazon SageMaker: The center for all your data, analytics, and AI

The success of GenAI models lies in your data management strategy

Synchronize data lakes with CDC-based UPSERT using open table format, AWS Glue, and Amazon MSK

Query your Iceberg tables in data lake using Amazon Redshift (Preview)

Migrate Delta tables from Azure Data Lake Storage to Amazon S3 using AWS Glue

How Cloudinary transformed their petabyte scale streaming data lake with Apache Iceberg and AWS Analytics

Bridging the gap between mainframe data and hybrid cloud environments

Design a data mesh pattern for Amazon EMR-based data lakes using AWS Lake Formation with Hive metastore federation

Top 15 data management platforms

Salesforce debuts Zero Copy Partner Network to ease data integration

How EUROGATE established a data mesh architecture using Amazon DataZone

Denodo Provides a Logical Approach to Data Management

Orca Security’s journey to a petabyte-scale data lake with Apache Iceberg and AWS Analytics

Fire Your Super-Smart Data Consultants with DataOps

Build a high-performance quant research platform with Apache Iceberg

Achieve data resilience using Amazon OpenSearch Service disaster recovery with snapshot and restore

Create an Apache Hudi-based near-real-time transactional data lake using AWS DMS, Amazon Kinesis, AWS Glue streaming ETL, and data visualization using Amazon QuickSight

Monitoring Apache Iceberg metadata layer using AWS Lambda, AWS Glue, and AWS CloudWatch

Top 15 data management platforms available today

The Data Lakehouse: Blending Data Warehouses and Data Lakes

Introducing native support for Apache Hudi, Delta Lake, and Apache Iceberg on AWS Glue for Apache Spark, Part 1: Getting Started

Introducing a new unified data connection experience with Amazon SageMaker Lakehouse unified data connectivity

O’Reilly Releases First Chapters of a New Book about Logical Data Management

Differentiate generative AI applications with your data using AWS analytics and managed databases

The Key Components of a Successful Data Lake Strategy

Access Amazon Redshift data from Salesforce Data Cloud with Zero Copy Data Federation

How Kaplan, Inc. implemented modern data pipelines using Amazon MWAA and Amazon AppFlow with Amazon Redshift as a data warehouse
