Big Data, Data Integration and Data Lake

Amazon Q data integration adds DataFrame support and in-prompt context-aware job creation

AWS Big Data

DECEMBER 20, 2024

Amazon Q data integration , introduced in January 2024, allows you to use natural language to author extract, transform, load (ETL) jobs and operations in AWS Glue specific data abstraction DynamicFrame. In this post, we discuss how Amazon Q data integration transforms ETL workflow development.

Data Integration

Data Integration Visualization Data Processing Big Data

Simplify data integration with AWS Glue and zero-ETL to Amazon SageMaker Lakehouse

AWS Big Data

DECEMBER 4, 2024

With the growing emphasis on data, organizations are constantly seeking more efficient and agile ways to integrate their data, especially from a wide variety of applications. SageMaker Lakehouse gives you the flexibility to access and query your data in-place with all Apache Iceberg compatible tools and engines.

Data Integration

Data Integration Data Lake Statistics Data-driven

Amazon Web Services named a Leader in the 2024 Gartner Magic Quadrant for Data Integration Tools

AWS Big Data

FEBRUARY 26, 2025

Amazon Web Services (AWS) has been recognized as a Leader in the 2024 Gartner Magic Quadrant for Data Integration Tools. This recognition, we feel, reflects our ongoing commitment to innovation and excellence in data integration, demonstrating our continued progress in providing comprehensive data management solutions.

Data Integration

Data Integration Data Lake Data Warehouse Unstructured Data

Webinars

How to Achieve High-Accuracy Results When Using LLMs

MORE WEBINARS

Build Write-Audit-Publish pattern with Apache Iceberg branching and AWS Glue Data Quality

AWS Big Data

DECEMBER 9, 2024

The importance of publishing only high-quality data cant be overstatedits the foundation for accurate analytics, reliable machine learning (ML) models, and sound decision-making. AWS Glue is a serverless data integration service that you can use to effectively monitor and manage data quality through AWS Glue Data Quality.

Data Quality

Data Quality Publishing Snapshot Data Lake

Use Apache Iceberg in your data lake with Amazon S3, AWS Glue, and Snowflake

AWS Big Data

APRIL 3, 2024

licensed, 100% open-source data table format that helps simplify data processing on large datasets stored in data lakes. Data engineers use Apache Iceberg because it’s fast, efficient, and reliable at any scale and keeps records of how datasets change over time.

Amazon Q data integration adds DataFrame support and in-prompt context-aware job creation

Simplify data integration with AWS Glue and zero-ETL to Amazon SageMaker Lakehouse

Webinars

Trending Sources

Amazon Web Services named a Leader in the 2024 Gartner Magic Quadrant for Data Integration Tools

Webinars

Build Write-Audit-Publish pattern with Apache Iceberg branching and AWS Glue Data Quality

Use Apache Iceberg in your data lake with Amazon S3, AWS Glue, and Snowflake

Introducing Amazon Q data integration in AWS Glue

Talend Data Fabric Simplifies Data Life Cycle Management

Load data incrementally from transactional data lakes to data warehouses

Migrate an existing data lake to a transactional data lake using Apache Iceberg

Seamless integration of data lake and data warehouse using Amazon Redshift Spectrum and Amazon DataZone

The next generation of Amazon SageMaker: The center for all your data, analytics, and AI

Accelerate data integration with Salesforce and AWS using AWS Glue

Query your Iceberg tables in data lake using Amazon Redshift (Preview)

Recap of Amazon Redshift key product announcements in 2024

Migrate Delta tables from Azure Data Lake Storage to Amazon S3 using AWS Glue

Synchronize data lakes with CDC-based UPSERT using open table format, AWS Glue, and Amazon MSK

How Cloudinary transformed their petabyte scale streaming data lake with Apache Iceberg and AWS Analytics

Accelerate analytics and AI innovation with the next generation of Amazon SageMaker

Design a data mesh pattern for Amazon EMR-based data lakes using AWS Lake Formation with Hive metastore federation

Orca Security’s journey to a petabyte-scale data lake with Apache Iceberg and AWS Analytics

How EUROGATE established a data mesh architecture using Amazon DataZone

Five steps to jumpstart your data integration journey

Introducing native support for Apache Hudi, Delta Lake, and Apache Iceberg on AWS Glue for Apache Spark, Part 1: Getting Started

Introducing a new unified data connection experience with Amazon SageMaker Lakehouse unified data connectivity

Monitoring Apache Iceberg metadata layer using AWS Lambda, AWS Glue, and AWS CloudWatch

Create an Apache Hudi-based near-real-time transactional data lake using AWS DMS, Amazon Kinesis, AWS Glue streaming ETL, and data visualization using Amazon QuickSight

Introducing generative AI upgrades for Apache Spark in AWS Glue (preview)

Ingest data from Google Analytics 4 and Google Sheets to Amazon Redshift using Amazon AppFlow

Is Data Virtualization the Secret Behind Operationalizing Data Lakes?

Reporting: Is it the Most Boring, Important Thing in Analytics?

Data Management on Display at Informatica World 2019

Migrate data from Azure Blob Storage to Amazon S3 using AWS Glue

What is Data Pipeline? A Detailed Explanation

Build a high-performance quant research platform with Apache Iceberg

Top analytics announcements of AWS re:Invent 2024

Automatically detect Personally Identifiable Information in Amazon Redshift using AWS Glue

Achieve data resilience using Amazon OpenSearch Service disaster recovery with snapshot and restore

Compose your ETL jobs for MongoDB Atlas with AWS Glue

Introducing Apache Hudi support with AWS Glue crawlers

The New Data Integration Requirements

Introducing native support for Apache Hudi, Delta Lake, and Apache Iceberg on AWS Glue for Apache Spark, Part 2: AWS Glue Studio Visual Editor

Access Amazon Redshift data from Salesforce Data Cloud with Zero Copy Data Federation

Unlock scalable analytics with a secure connectivity pattern in AWS Glue to read from or write to Snowflake

Enhance monitoring and debugging for AWS Glue jobs using new job observability metrics: Part 2

Stay Connected