Data Integration and Data Lake - Data Leaders Brief

Simplify data integration with AWS Glue and zero-ETL to Amazon SageMaker Lakehouse

AWS Big Data

DECEMBER 4, 2024

With the growing emphasis on data, organizations are constantly seeking more efficient and agile ways to integrate their data, especially from a wide variety of applications. SageMaker Lakehouse gives you the flexibility to access and query your data in place with all Apache Iceberg-compatible tools and engines.

Data Integration, Data Lake, Statistics, Data-driven

Amazon Q data integration adds DataFrame support and in-prompt context-aware job creation

AWS Big Data

DECEMBER 20, 2024

Amazon Q data integration, introduced in January 2024, allows you to use natural language to author extract, transform, load (ETL) jobs and operations in DynamicFrame, the AWS Glue-specific data abstraction. In this post, we discuss how Amazon Q data integration transforms ETL workflow development.

Data Integration, Visualization, Data Processing, Data Lake

Introducing Precisely for Data Integrity

David Menninger's Analyst Perspectives

JANUARY 25, 2021

Data is becoming more valuable and more important to organizations. At the same time, organizations have become more disciplined about the data on which they rely to ensure it is robust, accurate and governed properly.

Data Integration, Data Processing, Data Lake, IT

Webinars

How to Achieve High-Accuracy Results When Using LLMs

Amazon Web Services named a Leader in the 2024 Gartner Magic Quadrant for Data Integration Tools

AWS Big Data

FEBRUARY 26, 2025

Amazon Web Services (AWS) has been recognized as a Leader in the 2024 Gartner Magic Quadrant for Data Integration Tools. This recognition, we feel, reflects our ongoing commitment to innovation and excellence in data integration, demonstrating our continued progress in providing comprehensive data management solutions.

Data Integration, Data Lake, Data Warehouse, Unstructured Data

Building Best-in-Class Enterprise Analytics

Speaker: Anthony Roach, Director of Product Management at Tableau Software, and Jeremiah Morrow, Partner Solution Marketing Director at Dremio

Tableau works with strategic partners like Dremio to build data integrations that bring the two technologies together, creating a seamless and efficient customer experience. Through co-development and co-ownership, partners like Dremio ensure their unique capabilities are exposed and can be leveraged from within Tableau.

Analytics

Migrate an existing data lake to a transactional data lake using Apache Iceberg

AWS Big Data

OCTOBER 3, 2023

A data lake is a centralized repository that you can use to store all your structured and unstructured data at any scale. You can store your data as-is, without having to first structure the data, and then run different types of analytics for better business insights.

Introducing Amazon Q data integration in AWS Glue

Use Apache Iceberg in your data lake with Amazon S3, AWS Glue, and Snowflake

Load data incrementally from transactional data lakes to data warehouses

Seamless integration of data lake and data warehouse using Amazon Redshift Spectrum and Amazon DataZone

Recap of Amazon Redshift key product announcements in 2024

Accelerate data integration with Salesforce and AWS using AWS Glue

Query your Iceberg tables in data lake using Amazon Redshift (Preview)

Accelerate analytics and AI innovation with the next generation of Amazon SageMaker

Migrate Delta tables from Azure Data Lake Storage to Amazon S3 using AWS Glue

Synchronize data lakes with CDC-based UPSERT using open table format, AWS Glue, and Amazon MSK

Build Write-Audit-Publish pattern with Apache Iceberg branching and AWS Glue Data Quality

How Cloudinary transformed their petabyte scale streaming data lake with Apache Iceberg and AWS Analytics

Bridging the gap between mainframe data and hybrid cloud environments

Design a data mesh pattern for Amazon EMR-based data lakes using AWS Lake Formation with Hive metastore federation

The next generation of Amazon SageMaker: The center for all your data, analytics, and AI

Salesforce debuts Zero Copy Partner Network to ease data integration

Five steps to jumpstart your data integration journey

Orca Security’s journey to a petabyte-scale data lake with Apache Iceberg and AWS Analytics

Talend Data Fabric Simplifies Data Life Cycle Management

What is data architecture? A framework to manage data

How EUROGATE established a data mesh architecture using Amazon DataZone

Fire Your Super-Smart Data Consultants with DataOps

Create an Apache Hudi-based near-real-time transactional data lake using AWS DMS, Amazon Kinesis, AWS Glue streaming ETL, and data visualization using Amazon QuickSight

The Data Lakehouse: Blending Data Warehouses and Data Lakes

The success of GenAI models lies in your data management strategy

Introducing native support for Apache Hudi, Delta Lake, and Apache Iceberg on AWS Glue for Apache Spark, Part 1: Getting Started

Monitoring Apache Iceberg metadata layer using AWS Lambda, AWS Glue, and AWS CloudWatch

The Key Components of a Successful Data Lake Strategy

Is Data Virtualization the Secret Behind Operationalizing Data Lakes?

Data Management on Display at Informatica World 2019

The New Data Integration Requirements

Modern Data Architecture: Data Warehousing, Data Lakes, and Data Mesh Explained

Access Amazon Redshift data from Salesforce Data Cloud with Zero Copy Data Federation

Top analytics announcements of AWS re:Invent 2024

Build a high-performance quant research platform with Apache Iceberg

How Kaplan, Inc. implemented modern data pipelines using Amazon MWAA and Amazon AppFlow with Amazon Redshift as a data warehouse

Achieve data resilience using Amazon OpenSearch Service disaster recovery with snapshot and restore
