article thumbnail

Simplify data ingestion from Amazon S3 to Amazon Redshift using auto-copy

AWS Big Data

All new sales transactions for 2003-01-01 are automatically ingested, which can be verified by running the following query: SELECT ss_sold_date_sk, count(1) FROM store_sales GROUP BY ss_sold_date_sk; Automate ingestion from multiple data sources We can also load an Amazon Redshift table from multiple data sources.

article thumbnail

How Etihad taps data science to optimise airline operations

CIO Business Intelligence

Despite the worldwide chaos, UAE national airline Etihad has managed to generate productivity gains and cost savings from insights using data science. Etihad began its data science journey with the Cloudera Data Platform and moved its data to the cloud to set up a data lake. Reem Alaya Lebhar.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Materialized Views in Hive for Iceberg Table Format

Cloudera

The support for Apache Iceberg as the table format in Cloudera Data Platform and the ability to create and use materialized views on top of such tables provides a powerful combination to build fast analytic applications on open data lake architectures.

article thumbnail

Data Modeling 201 for the cloud: designing databases for data warehouses

erwin

The first and most important thing to recognize and understand is the new and radically different target environment that you are now designing a data model for. Star schema: a data modeling and database design paradigm for data warehouses and data lakes. Business Focus. Operational. Operational Tactical.