Data Lake, Interactive and Reference

Run Apache XTable in AWS Lambda for background conversion of open table formats

AWS Big Data

NOVEMBER 26, 2024

Initially, data warehouses were the go-to solution for structured data and analytical workloads but were limited by proprietary storage formats and their inability to handle unstructured data. Eventually, transactional data lakes emerged to add transactional consistency and performance of a data warehouse to the data lake.

Metadata

Metadata Data Lake Snapshot Data Warehouse

Recap of Amazon Redshift key product announcements in 2024

AWS Big Data

DECEMBER 17, 2024

Today, Amazon Redshift is used by customers across all industries for a variety of use cases, including data warehouse migration and modernization, near real-time analytics, self-service analytics, data lake analytics, machine learning (ML), and data monetization.

Run Apache XTable in AWS Lambda for background conversion of open table formats

Recap of Amazon Redshift key product announcements in 2024

Webinars

Trending Sources

Load data incrementally from transactional data lakes to data warehouses

Webinars

Synchronize data lakes with CDC-based UPSERT using open table format, AWS Glue, and Amazon MSK

Enrich your serverless data lake with Amazon Bedrock

Simplify operational data processing in data lakes using AWS Glue and Apache Hudi

Migrate Delta tables from Azure Data Lake Storage to Amazon S3 using AWS Glue

Build a real-time GDPR-aligned Apache Iceberg data lake

Write queries faster with Amazon Q generative SQL for Amazon Redshift

Enriching metadata for accurate text-to-SQL generation for Amazon Athena

Introducing native support for Apache Hudi, Delta Lake, and Apache Iceberg on AWS Glue for Apache Spark, Part 1: Getting Started

Build a high-performance quant research platform with Apache Iceberg

Manage concurrent write conflicts in Apache Iceberg on the AWS Glue Data Catalog

Create an Apache Hudi-based near-real-time transactional data lake using AWS DMS, Amazon Kinesis, AWS Glue streaming ETL, and data visualization using Amazon QuickSight

Build a data lake with Apache Flink on Amazon EMR

Einstein Studio 1: What it is and what to expect

Build and manage your modern data stack using dbt and AWS Glue through dbt-glue, the new “trusted” dbt adapter

Access Amazon Redshift data from Salesforce Data Cloud with Zero Copy Data Federation

Simplify data lake access control for your enterprise users with trusted identity propagation in AWS IAM Identity Center, AWS Lake Formation, and Amazon S3 Access Grants

5 Best Practices for Extracting, Analyzing, and Visualizing Data

Data Cataloging in the Data Lake: Alation + Kylo

Accelerate data science feature engineering on transactional data lakes using Amazon Athena with Apache Iceberg

Foundational blocks of Amazon SageMaker Unified Studio: An admin’s guide to implement unified access to all your data, analytics, and AI

Data governance in the age of generative AI

7 key Microsoft Azure analytics services (plus one extra)

AWS Lake Formation 2022 year in review

Introducing Amazon Q data integration in AWS Glue

Enforce fine-grained access control on Open Table Formats via Amazon EMR integrated with AWS Lake Formation

Data Modeling 301 for the cloud: data lake and NoSQL data modeling and design

Joining the Dots: Enhancing Data Analytics Through Intelligent Join Suggestions

Interact with Apache Iceberg tables using Amazon Athena and cross account fine-grained permissions using AWS Lake Formation

Why the Data Journey Manifesto?

Run Spark SQL on Amazon Athena Spark

Access Amazon Athena in your applications using the WebSocket API

How the BMW Group analyses semiconductor demand with AWS Glue

Simplifying data processing at Capitec with Amazon Redshift integration for Apache Spark

Enhance monitoring and debugging for AWS Glue jobs using new job observability metrics

Exploring real-time streaming for generative AI Applications

How gaming companies can use Amazon Redshift Serverless to build scalable analytical applications faster and easier

Build an end-to-end serverless streaming pipeline with Apache Kafka on Amazon MSK using Python

Introducing AWS Glue crawler and create table support for Apache Iceberg format

Unlock scalable analytics with a secure connectivity pattern in AWS Glue to read from or write to Snowflake

Power enterprise-grade Data Vaults with Amazon Redshift – Part 1

Implement alerts in Amazon OpenSearch Service with PagerDuty

Stay Connected