Big Data, Data Lake and Testing - Data Leaders Brief

Incremental refresh for Amazon Redshift materialized views on data lake tables

AWS Big Data

NOVEMBER 8, 2024

Amazon Redshift is a fast, fully managed cloud data warehouse that makes it cost-effective to analyze your data using standard SQL and business intelligence tools. Customers use data lake tables to achieve cost effective storage and interoperability with other tools. The sample files are ‘|’ delimited text files.

Incremental refresh for Amazon Redshift materialized views on data lake tables

From data lakes to insights: dbt adapter for Amazon Athena now supported in dbt Cloud

Webinars

Trending Sources

Expanding data analysis and visualization options: Amazon DataZone now integrates with Tableau, Power BI, and more

Webinars

Here’s Why Automation For Data Lakes Could Be Important

Important Considerations When Migrating to a Data Lake

Migrate an existing data lake to a transactional data lake using Apache Iceberg

Run Apache XTable in AWS Lambda for background conversion of open table formats

Using AWS AppSync and AWS Lake Formation to access a secure data lake through a GraphQL API

Use Apache Iceberg in a data lake to support incremental data processing

How Cloudinary transformed their petabyte scale streaming data lake with Apache Iceberg and AWS Analytics

Choosing an open table format for your transactional data lake on AWS

Introducing simplified interaction with the Airflow REST API in Amazon MWAA

Monitor data pipelines in a serverless data lake

Accelerate Amazon Redshift Data Lake queries with AWS Glue Data Catalog Column Statistics

How Volkswagen streamlined access to data across multiple data lakes using Amazon DataZone – Part 1

Enrich your serverless data lake with Amazon Bedrock

Recap of Amazon Redshift key product announcements in 2024

Simplify operational data processing in data lakes using AWS Glue and Apache Hudi

Design a data mesh pattern for Amazon EMR-based data lakes using AWS Lake Formation with Hive metastore federation

Amazon Q data integration adds DataFrame support and in-prompt context-aware job creation

Did Big Data Deliver Business Transformation & Improved CX?

Build a real-time GDPR-aligned Apache Iceberg data lake

Improve operational efficiencies of Apache Iceberg tables built on Amazon S3 data lakes

Build a high-performance quant research platform with Apache Iceberg

Introducing generative AI upgrades for Apache Spark in AWS Glue (preview)

Waking Up The World of Big Data

Implement slowly changing dimensions in a data lake using AWS Glue and Delta

Implement tag-based access control for your data lake and Amazon Redshift data sharing with AWS Lake Formation

Enriching metadata for accurate text-to-SQL generation for Amazon Athena

Introducing a new unified data connection experience with Amazon SageMaker Lakehouse unified data connectivity

Perform upserts in a data lake using Amazon Athena and Apache Iceberg

Streamline AI-driven analytics with governance: Integrating Tableau with Amazon DataZone

What is Data Pipeline? A Detailed Explanation

Apache Ozone and Dense Data Nodes

Apache Iceberg optimization: Solving the small files problem in Amazon EMR

Top analytics announcements of AWS re:Invent 2024

Migrate data from Azure Blob Storage to Amazon S3 using AWS Glue

What is a data architect? Skills, salaries, and how to become a data framework master

Automated data governance with AWS Glue Data Quality, sensitive data detection, and AWS Lake Formation

Query AWS Glue Data Catalog views using Amazon Athena and Amazon Redshift

Use AWS Glue ETL to perform merge, partition evolution, and schema evolution on Apache Iceberg

Build and manage your modern data stack using dbt and AWS Glue through dbt-glue, the new “trusted” dbt adapter

Use IAM runtime roles with Amazon EMR Studio Workspaces and AWS Lake Formation for cross-account fine-grained access control

The Ten Standard Tools To Develop Data Pipelines In Microsoft Azure

Stay Connected