Data Warehouse, Machine Learning and Reference

Empower financial analytics by creating structured knowledge bases using Amazon Bedrock and Amazon Redshift

AWS Big Data

MAY 20, 2025

Now with Amazon Bedrock Knowledge Bases integration with structured data, you can use simple, natural language prompts to query complex financial datasets. From customer portals to internal dashboards and mobile apps, this API-driven approach makes enterprise-grade data analysis accessible to everyone in your organization.

Structured Data

Structured Data Data Warehouse Analytics Finance

Recap of Amazon Redshift key product announcements in 2024

AWS Big Data

DECEMBER 17, 2024

Amazon Redshift , launched in 2013, has undergone significant evolution since its inception, allowing customers to expand the horizons of data warehousing and SQL analytics. Industry-leading price-performance Amazon Redshift offers up to three times better price-performance than alternative cloud data warehouses.

Data Lake

Data Lake Data Warehouse Data-driven Optimization

Write queries faster with Amazon Q generative SQL for Amazon Redshift

AWS Big Data

NOVEMBER 7, 2024

Amazon Redshift is a fully managed, AI-powered cloud data warehouse that delivers the best price-performance for your analytics workloads at any scale. Refer to Easy analytics and cost-optimization with Amazon Redshift Serverless to get started. For this post, we use Redshift Serverless. Choose Run all on each notebook tab.

Metadata

Metadata Sales Data Warehouse Optimization

Webinars

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Data Talks, CFOs Listen: Why Analytics Are Key To Better Spend Management

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Unlocking near real-time analytics with petabytes of transaction data using Amazon Aurora Zero-ETL integration with Amazon Redshift and dbt Cloud

AWS Big Data

NOVEMBER 27, 2024

While customers can perform some basic analysis within their operational or transactional databases, many still need to build custom data pipelines that use batch or streaming jobs to extract, transform, and load (ETL) data into their data warehouse for more comprehensive analysis.

Data Warehouse

Data Warehouse Analytics Testing Sales

The next generation of Amazon SageMaker: The center for all your data, analytics, and AI

AWS Big Data

DECEMBER 4, 2024

Our customers are telling us that they are seeing their analytics and AI workloads increasingly converge around a lot of the same data, and this is changing how they are using analytics tools with their data. This innovation drives an important change: you’ll no longer have to copy or move data between data lake and data warehouses.

Data Analytics

Data Analytics Analytics Data Lake Data Quality

Migrate a petabyte-scale data warehouse from Actian Vectorwise to Amazon Redshift

AWS Big Data

MAY 30, 2024

Amazon Redshift is a fast, scalable, and fully managed cloud data warehouse that allows you to process and run your complex SQL analytics workloads on structured and semi-structured data. Solution overview Amazon Redshift is an industry-leading cloud data warehouse.

Data Warehouse

Data Warehouse Data Lake Cost-Benefit Structured Data

Run Apache XTable in AWS Lambda for background conversion of open table formats

AWS Big Data

NOVEMBER 26, 2024

This post was co-written with Dipankar Mazumdar, Staff Data Engineering Advocate with AWS Partner OneHouse. Data architecture has evolved significantly to handle growing data volumes and diverse workloads. In practice, OTFs are used in a broad range of analytical workloads, from business intelligence to machine learning.

Metadata

Metadata Data Lake Snapshot Data Warehouse

Ingest data from Google Analytics 4 and Google Sheets to Amazon Redshift using Amazon AppFlow

AWS Big Data

JANUARY 6, 2025

Amazon AppFlow automatically encrypts data in motion, and allows you to restrict data from flowing over the public internet for SaaS applications that are integrated with AWS PrivateLink , reducing exposure to security threats. Refer to the Amazon Redshift Database Developer Guide for more details.

Analytics

Analytics Data Warehouse Big Data Metrics

MLOps and DevOps: Why Data Makes It Different

O'Reilly on Data

OCTOBER 19, 2021

Much has been written about struggles of deploying machine learning projects to production. As with many burgeoning fields and disciplines, we don’t yet have a shared canonical infrastructure stack or best practices for developing and deploying data-intensive applications. However, the concept is quite abstract.

IT

IT Testing Experimentation Software

Manage your data warehouse cost allocations with Amazon Redshift Serverless tagging

AWS Big Data

MARCH 27, 2023

Amazon Redshift Serverless makes it simple to run and scale analytics without having to manage your data warehouse infrastructure. For more details on tagging, refer to Tagging resources overview. For more tagging best practices, refer to Tagging AWS resources. Choose Save changes. About the Authors Sandeep Bajwa is a Sr.

Data Warehouse

Data Warehouse Management Snapshot Data Lake

The future of data: A 5-pillar approach to modern data management

CIO Business Intelligence

DECEMBER 11, 2024

It was not alive because the business knowledge required to turn data into value was confined to individuals minds, Excel sheets or lost in analog signals. We are now deciphering rules from patterns in data, embedding business knowledge into ML models, and soon, AI agents will leverage this data to make decisions on behalf of companies.

Management

Management Data Governance Data Science Reporting

How Will The Cloud Impact Data Warehousing Technologies?

Smart Data Collective

APRIL 8, 2020

Dating back to the 1970s, the data warehousing market emerged when computer scientist Bill Inmon first coined the term ‘data warehouse’. Created as on-premise servers, the early data warehouses were built to perform on just a gigabyte scale. Cloud based solutions are the future of the data warehousing market.

Technology

Technology Data Warehouse Big Data Machine Learning

Build an Amazon Redshift data warehouse using an Amazon DynamoDB single-table design

AWS Big Data

JUNE 21, 2023

These types of queries are suited for a data warehouse. The goal of a data warehouse is to enable businesses to analyze their data fast; this is important because it means they are able to gain valuable insights in a timely manner. Amazon Redshift is fully managed, scalable, cloud data warehouse.

Data Warehouse

Data Warehouse Data Lake OLAP Cost-Benefit

Amazon SageMaker Lakehouse now supports attribute-based access control

AWS Big Data

APRIL 24, 2025

SageMaker Lakehouse is a unified, open, and secure data lakehouse that now supports ABAC to provide unified access to general purpose Amazon S3 buckets, Amazon S3 Tables , Amazon Redshift data warehouses, and data sources such as Amazon DynamoDB or PostgreSQL. For instructions, refer to Data analyst permissions.

Sales

Sales Data Lake Management Data-driven

Reference guide to analyze transactional data in near-real time on AWS

AWS Big Data

FEBRUARY 20, 2024

In this post, we discuss how to architect a near-real-time analytics solution with AWS managed analytics, AI and machine learning (ML), and database services. Solution overview The most common workloads, agnostic of industry, involve transactional data.

Visualization

Visualization Cost-Benefit Optimization B2B

Enriching metadata for accurate text-to-SQL generation for Amazon Athena

AWS Big Data

OCTOBER 14, 2024

Enterprise data is brought into data lakes and data warehouses to carry out analytical, reporting, and data science use cases using AWS analytical services like Amazon Athena , Amazon Redshift , Amazon EMR , and so on.

Metadata

Metadata Data Lake Modeling Data Warehouse

Is Google BigQuery The Future Of Big Data Analytics?

Smart Data Collective

JUNE 6, 2021

Before diving into whether or not Google BigQuery is the future of big data analytics, it’s vital to firstly understand what “big data analytics” actually means. Big data analytics advantages. Is Google BigQuery the future of big data analytics? References. What is Big Data?” What is Google BigQuery?

Big Data

Big Data Data Analytics Analytics Cost-Benefit

Take Your SQL Skills To The Next Level With These Popular SQL Books

datapine

SEPTEMBER 27, 2022

Recently published in 2021, “SQL for Data Scientists” by author and experienced data scientist, Rénee Teate, teaches its readers all the skills that data scientists use the most in their daily work. Here is an excerpt from one: “I use SQL daily, and this was a great reference towards using advanced SQL to get analytics insights.

Business Intelligence

Business Intelligence Data Warehouse Data Processing Data mining

AWS re:Invent 2023 Amazon Redshift Sessions Recap

AWS Big Data

DECEMBER 18, 2023

Amazon Redshift powers data-driven decisions for tens of thousands of customers every day with a fully managed, AI-powered cloud data warehouse, delivering the best price-performance for your analytics workloads. Learn more about the AWS zero-ETL future with newly launched AWS databases integrations with Amazon Redshift.

Data Warehouse

Data Warehouse Machine Learning Data-driven Data Lake

Amazon Redshift announcements at AWS re:Invent 2023 to enable analytics on all your data

AWS Big Data

NOVEMBER 29, 2023

In 2013, Amazon Web Services revolutionized the data warehousing industry by launching Amazon Redshift , the first fully-managed, petabyte-scale, enterprise-grade cloud data warehouse. Amazon Redshift made it simple and cost-effective to efficiently analyze large volumes of data using existing business intelligence tools.

Data Warehouse

Data Warehouse Analytics Data Lake Machine Learning

Build a secure data visualization application using the Amazon Redshift Data API with AWS IAM Identity Center

AWS Big Data

MARCH 6, 2025

Tens of thousands of customers use Amazon Redshift for modern data analytics at scale, delivering up to three times better price-performance and seven times better throughput than other cloud data warehouses. Refer to IAM Identity Center identity source tutorials for the IdP setup. IAM Identity Center enabled.

Visualization

Visualization Sales Data Warehouse Management

Simplify data integration with AWS Glue and zero-ETL to Amazon SageMaker Lakehouse

AWS Big Data

DECEMBER 4, 2024

With this new functionality, customers can create up-to-date replicas of their data from applications such as Salesforce, ServiceNow, and Zendesk in an Amazon SageMaker Lakehouse and Amazon Redshift. SageMaker Lakehouse gives you the flexibility to access and query your data in-place with all Apache Iceberg compatible tools and engines.

Data Integration

Data Integration Data Lake Statistics Data-driven

How To Succeed As a DataOps Engineer

DataKitchen

NOVEMBER 20, 2021

We often refer to data operations and analytics as a factory. We distinguish between owning the assembly lines of the data factory (DataOps Engineer) and owning individual steps within the assembly lines (data scientists, engineers, etc.). . Consider a machine learning example.

Testing

Testing Machine Learning Data Warehouse Analytics

7 key Microsoft Azure analytics services (plus one extra)

CIO Business Intelligence

JUNE 29, 2022

Taking the broadest possible interpretation of data analytics , Azure offers more than a dozen services — and that’s before you include Power BI, with its AI-powered analysis and new datamart option , or governance-oriented approaches such as Microsoft Purview. Azure Data Factory. Azure Data Lake Analytics.

Data Lake

Data Lake Analytics Data Warehouse Machine Learning

What is Data Pipeline? A Detailed Explanation

Smart Data Collective

OCTOBER 17, 2022

A point of data entry in a given pipeline. Examples of an origin include storage systems like data lakes, data warehouses and data sources that include IoT devices, transaction processing applications, APIs or social media. The final point to which the data has to be eventually transferred is a destination.

Data Warehouse

Data Warehouse Data Lake Visualization Big Data

Achieve near real time operational analytics using Amazon Aurora PostgreSQL zero-ETL integration with Amazon Redshift

AWS Big Data

APRIL 10, 2024

“Data is at the center of every application, process, and business decision. Customers across industries are becoming more data driven and looking to increase revenue, reduce cost, and optimize their business operations by implementing near real time analytics on transactional data, thereby enhancing agility.

Data Warehouse

Data Warehouse Analytics Metrics Snapshot

What is a data architect? Skills, salaries, and how to become a data framework master

CIO Business Intelligence

OCTOBER 13, 2023

Data architect Armando Vázquez identifies eight common types of data architects: Enterprise data architect: These data architects oversee an organization’s overall data architecture, defining data architecture strategy and designing and implementing architectures.

Data Architecture

Data Architecture Data Warehouse Statistics Visualization

Compose your ETL jobs for MongoDB Atlas with AWS Glue

AWS Big Data

MAY 3, 2023

In today’s data-driven business environment, organizations face the challenge of efficiently preparing and transforming large amounts of data for analytics and data science purposes. Businesses need to build data warehouses and data lakes based on operational data.

Data Lake

Data Lake Data Warehouse Data-driven Optimization

5 Best Practices for Extracting, Analyzing, and Visualizing Data

Smart Data Collective

DECEMBER 13, 2022

Five Best Practices for Data Analytics. Extracted data must be saved someplace. There are several choices to consider, each with its own set of advantages and disadvantages: Data warehouses are used to store data that has been processed for a specific function from one or more sources. Select a Storage Platform.

Visualization

Visualization Key Performance Indicator Sales Advertising

Implement model versioning with Amazon Redshift ML

AWS Big Data

NOVEMBER 1, 2023

Amazon Redshift ML allows data analysts, developers, and data scientists to train machine learning (ML) models using SQL. Versioning serves two main purposes: You can refer to prior versions of a model for troubleshooting or audit purposes.

Modeling

Modeling Data Warehouse Machine Learning Recreation/Entertainment

Unleash deeper insights with Amazon Redshift data sharing for data lake tables

AWS Big Data

OCTOBER 10, 2024

Amazon Redshift has established itself as a highly scalable, fully managed cloud data warehouse trusted by tens of thousands of customers for its superior price-performance and advanced data analytics capabilities. This allows you to maintain a comprehensive view of your data while optimizing for cost-efficiency.

Data Lake

Data Lake Data Warehouse Recreation/Entertainment Data-driven

Amazon DataZone announces custom blueprints for AWS services

AWS Big Data

JUNE 26, 2024

New feature: Custom AWS service blueprints Previously, Amazon DataZone provided default blueprints that created AWS resources required for data lake, data warehouse, and machine learning use cases. If you’re new to Amazon DataZone, refer to Getting started.

Data Lake

Data Lake Data Warehouse Unstructured Data Data Governance

How Open Universities Australia modernized their data platform and significantly reduced their ETL costs with AWS Cloud Development Kit and AWS Step Functions

AWS Big Data

JANUARY 30, 2025

Diagram 1: Overall architecture of the solution, using AWS Step Functions, Amazon Redshift and Amazon S3 The following AWS services were used to shape our new ETL architecture: Amazon Redshift A fully managed, petabyte-scale data warehouse service in the cloud.

Data Warehouse

Data Warehouse Data Architecture Machine Learning Data Transformation

A Guide to Materialized Views

Sisense

APRIL 17, 2020

We live in a world of data: there’s more of it than ever before, in a ceaselessly expanding array of forms and locations. Dealing with Data is your window into the ways Data Teams are tackling the challenges of this new world to help their companies and their customers thrive. Why use a materialized view?

Data Warehouse

Data Warehouse Visualization Statistics Machine Learning

Enrich your customer data with geospatial insights using Amazon Redshift, AWS Data Exchange, and Amazon QuickSight

AWS Big Data

MARCH 18, 2024

Load generic address data to Amazon Redshift Amazon Redshift is a fully managed, petabyte-scale data warehouse service in the cloud. Redshift Serverless makes it straightforward to run analytics workloads of any size without having to manage data warehouse infrastructure.

Data Warehouse

Data Warehouse Visualization Snapshot Data-driven

Perform time series forecasting using Amazon Redshift ML and Amazon Forecast

AWS Big Data

AUGUST 7, 2023

Amazon Redshift is a fully managed, petabyte-scale data warehouse service in the cloud. Tens of thousands of customers use Amazon Redshift to process exabytes of data every day to power their analytics workloads. Solution overview Amazon Forecast is a fully managed time series forecasting service based on machine learning.

Forecasting

Forecasting Data Warehouse Machine Learning Modeling

Access Amazon Redshift data from Salesforce Data Cloud with Zero Copy Data Federation

AWS Big Data

JUNE 25, 2024

In today’s data-driven business landscape, organizations collect a wealth of data across various touch points and unify it in a central data warehouse or a data lake to deliver business insights. This external DLO acts as a storage container, housing metadata for your federated Redshift data.

Data Lake

Data Lake Cost-Benefit Data-driven Data Warehouse

Data science vs. machine learning: What’s the difference?

IBM Big Data Hub

JULY 6, 2023

While data science and machine learning are related, they are very different fields. In a nutshell, data science brings structure to big data while machine learning focuses on learning from the data itself. What is data science? What is machine learning?

Machine Learning

Machine Learning Data Science Statistics Deep Learning

13 power tips for Microsoft Power BI

CIO Business Intelligence

OCTOBER 19, 2023

You can also use Power BI to prepare and manage high-quality data to use across the business in other tools, from low-code apps to machine learning. What-if parameters also create calculated measures you can reference elsewhere.

Slice and Dice

Slice and Dice Scorecard Metrics Visualization

Simplifying data processing at Capitec with Amazon Redshift integration for Apache Spark

AWS Big Data

NOVEMBER 10, 2023

Amazon Redshift offers seamless integration with Apache Spark, allowing you to easily access your Redshift data on both Amazon Redshift provisioned clusters and Amazon Redshift Serverless. These tables are then joined with tables from the Enterprise Data Lake (EDL) at runtime.

Data Processing

Data Processing Data Lake Data Warehouse Optimization

Power enterprise-grade Data Vaults with Amazon Redshift – Part 1

AWS Big Data

NOVEMBER 16, 2023

Amazon Redshift is a popular cloud data warehouse, offering a fully managed cloud-based service that seamlessly integrates with an organization’s Amazon Simple Storage Service (Amazon S3) data lake, real-time streams, machine learning (ML) workflows, transactional workflows, and much more—all while providing up to 7.9x

Enterprise

Enterprise Data Warehouse Data Lake Optimization

Top 7 generative AI use cases for business

CIO Business Intelligence

JULY 30, 2024

Many of the AI use cases entrenched in business today use older, more established forms of AI, such as machine learning, or don’t take advantage of the “generative” capabilities of AI to generate text, pictures, and other data. Many AI experts say the current use cases for generative AI are just the tip of the iceberg.

Insurance

Insurance Machine Learning Interactive Finance

Unlock insights on Amazon RDS for MySQL data with zero-ETL integration to Amazon Redshift

AWS Big Data

MARCH 21, 2024

The extract, transform, and load (ETL) process has been a common pattern for moving data from an operational database to an analytics data warehouse. ELT is where the extracted data is loaded as is into the target first and then transformed. Refer to Zero-ETL integration costs (Preview) for further details.

Data Warehouse

Data Warehouse Metrics Statistics Optimization

Near-real-time analytics using Amazon Redshift streaming ingestion with Amazon Kinesis Data Streams and Amazon DynamoDB

AWS Big Data

JULY 27, 2023

Amazon Redshift is a fully managed, scalable cloud data warehouse that accelerates your time to insights with fast, easy, and secure analytics at scale. Tens of thousands of customers rely on Amazon Redshift to analyze exabytes of data and run complex analytical queries, making it the widely used cloud data warehouse.

Data Warehouse

Data Warehouse Analytics Metadata Dashboards

Empower financial analytics by creating structured knowledge bases using Amazon Bedrock and Amazon Redshift

Recap of Amazon Redshift key product announcements in 2024

Webinars

Trending Sources

Write queries faster with Amazon Q generative SQL for Amazon Redshift

Webinars

Unlocking near real-time analytics with petabytes of transaction data using Amazon Aurora Zero-ETL integration with Amazon Redshift and dbt Cloud

The next generation of Amazon SageMaker: The center for all your data, analytics, and AI

Migrate a petabyte-scale data warehouse from Actian Vectorwise to Amazon Redshift

Run Apache XTable in AWS Lambda for background conversion of open table formats

Ingest data from Google Analytics 4 and Google Sheets to Amazon Redshift using Amazon AppFlow

MLOps and DevOps: Why Data Makes It Different

Manage your data warehouse cost allocations with Amazon Redshift Serverless tagging

The future of data: A 5-pillar approach to modern data management

How Will The Cloud Impact Data Warehousing Technologies?

Build an Amazon Redshift data warehouse using an Amazon DynamoDB single-table design

Amazon SageMaker Lakehouse now supports attribute-based access control

Reference guide to analyze transactional data in near-real time on AWS

Enriching metadata for accurate text-to-SQL generation for Amazon Athena

Is Google BigQuery The Future Of Big Data Analytics?

Take Your SQL Skills To The Next Level With These Popular SQL Books

AWS re:Invent 2023 Amazon Redshift Sessions Recap

Amazon Redshift announcements at AWS re:Invent 2023 to enable analytics on all your data

Build a secure data visualization application using the Amazon Redshift Data API with AWS IAM Identity Center

Simplify data integration with AWS Glue and zero-ETL to Amazon SageMaker Lakehouse

How To Succeed As a DataOps Engineer

7 key Microsoft Azure analytics services (plus one extra)

What is Data Pipeline? A Detailed Explanation

Achieve near real time operational analytics using Amazon Aurora PostgreSQL zero-ETL integration with Amazon Redshift

What is a data architect? Skills, salaries, and how to become a data framework master

Compose your ETL jobs for MongoDB Atlas with AWS Glue

5 Best Practices for Extracting, Analyzing, and Visualizing Data

Implement model versioning with Amazon Redshift ML

Unleash deeper insights with Amazon Redshift data sharing for data lake tables

Amazon DataZone announces custom blueprints for AWS services

How Open Universities Australia modernized their data platform and significantly reduced their ETL costs with AWS Cloud Development Kit and AWS Step Functions

A Guide to Materialized Views

Enrich your customer data with geospatial insights using Amazon Redshift, AWS Data Exchange, and Amazon QuickSight

Perform time series forecasting using Amazon Redshift ML and Amazon Forecast

Access Amazon Redshift data from Salesforce Data Cloud with Zero Copy Data Federation

Data science vs. machine learning: What’s the difference?

13 power tips for Microsoft Power BI

Simplifying data processing at Capitec with Amazon Redshift integration for Apache Spark

Power enterprise-grade Data Vaults with Amazon Redshift – Part 1

Top 7 generative AI use cases for business

Unlock insights on Amazon RDS for MySQL data with zero-ETL integration to Amazon Redshift

Near-real-time analytics using Amazon Redshift streaming ingestion with Amazon Kinesis Data Streams and Amazon DynamoDB

Stay Connected