Unifying these necessitates additional data processing, requiring each business unit to provision and maintain a separate data warehouse. This burdens business units that want only to consume the curated data for analysis and are not concerned with data management, cleansing, or comprehensive data processing tasks.
Fauna was founded in 2012 by software infrastructure engineers Evan Weaver and Matt Freels to build the cloud-native transactional database they wished they had had at their disposal in their former roles at what was then known as Twitter (now X).
Business intelligence refers to the use of digital computing technologies, in the form of data warehouses, analytics, and visualization, to identify and analyze essential business data and generate new, actionable corporate insights. The data warehouse. 1) The raw data.
The new version includes details on the latest SQL features, such as techniques for wrangling data, as well as two new chapters: one dedicated to setting up your own system, and another for using PostgreSQL and JSON. This piece, published in 2012, offers a step-by-step guide to everything related to SQL. Viescas, Douglas J.
and zero-ETL support) as the source, and a Redshift data warehouse as the target. The integration replicates data from the source database into the target data warehouse. Additionally, you can choose the capacity to limit the compute resources of the data warehouse. For this post, set this to 8 RPUs.
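As a rough sketch of what that setup can look like from Python with boto3 (the ARNs, integration name, and workgroup below are hypothetical placeholders, not values from the post):

```python
import boto3

# Hypothetical ARNs: substitute your own Aurora cluster and Redshift namespace.
SOURCE_ARN = "arn:aws:rds:us-east-1:123456789012:cluster:my-aurora-cluster"
TARGET_ARN = "arn:aws:redshift-serverless:us-east-1:123456789012:namespace/my-namespace"

rds = boto3.client("rds")

# Create the zero-ETL integration between the Aurora source and the Redshift target.
integration = rds.create_integration(
    SourceArn=SOURCE_ARN,
    TargetArn=TARGET_ARN,
    IntegrationName="orders-zero-etl",
)
print(integration["Status"])  # e.g. "creating"

# Cap the serverless data warehouse at 8 RPUs, as the post suggests.
serverless = boto3.client("redshift-serverless")
serverless.update_workgroup(workgroupName="my-workgroup", baseCapacity=8)
```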
Snowflake was founded in 2012 to build a business around its cloud-based data warehouse with built-in data-sharing capabilities. Snowflake has expanded its reach over the years to address data engineering and data science, and long ago moved beyond being seen as just a cloud data warehouse.
Amazon Redshift is a fully managed, petabyte-scale data warehouse service in the cloud that delivers powerful and secure insights on all your data with the best price-performance. With Amazon Redshift, you can analyze your data to derive holistic insights about your business and your customers.
The extract, transform, and load (ETL) process has been a common pattern for moving data from an operational database to an analytics data warehouse. ELT, by contrast, loads the extracted data into the target as is and transforms it there. ETL and ELT pipelines can be expensive to build and complex to manage.
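As a minimal sketch of the ELT ordering (load as is, then transform inside the warehouse with SQL), assuming the open-source redshift_connector driver and placeholder table, bucket, and role names:

```python
import redshift_connector

# Placeholder connection details.
conn = redshift_connector.connect(
    host="my-warehouse.123456789012.us-east-1.redshift-serverless.amazonaws.com",
    database="dev",
    user="admin",
    password="********",
)
cur = conn.cursor()

# The L of ELT: load the extracted files into a staging table as is.
# Assumes staging_orders already exists with columns matching the files.
cur.execute("""
    COPY staging_orders
    FROM 's3://my-bucket/raw/orders/'
    IAM_ROLE 'arn:aws:iam::123456789012:role/my-copy-role'
    FORMAT AS PARQUET;
""")

# The T of ELT: transform inside the warehouse with plain SQL.
cur.execute("""
    CREATE TABLE clean_orders AS
    SELECT order_id, CAST(amount AS DECIMAL(12, 2)) AS amount
    FROM staging_orders
    WHERE amount IS NOT NULL;
""")
conn.commit()
```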
One of the key challenges in modern big data management is facilitating efficient data sharing and access control across multiple EMR clusters. Organizations have multiple Hive data warehouses across EMR clusters, where the metadata gets generated.
Amazon AppFlow automatically encrypts data in motion, and allows you to restrict data from flowing over the public internet for SaaS applications that are integrated with AWS PrivateLink, reducing exposure to security threats. He has been building data warehouses and big data solutions for over 13 years.
Amazon Redshift is a fast, scalable, secure, and fully managed cloud data warehouse that makes it simple and cost-effective to analyze your data using standard SQL and your existing business intelligence (BI) tools. Data ingestion is the process of getting data to Amazon Redshift.
After having rebuilt their data warehouse, I decided to take a little bit more of a pointed role, and I joined Oracle as a database performance engineer. I spent eight years in the real-world performance group, where I specialized in high-visibility and high-impact data warehousing competitive evaluations and benchmarks.
Tens of thousands of customers rely on Amazon Redshift to analyze exabytes of data and run complex analytical queries. You can use your preferred SQL clients to analyze your data in an Amazon Redshift data warehouse. He is passionate about automating and solving customer problems with cloud solutions.
It can generate data integration jobs for extracts and loads to S3 data lakes, including file formats like CSV, JSON, and Parquet, and ingestion into open table formats like Apache Hudi, Delta, and Apache Iceberg. Prerequisites: Before going forward with this tutorial, complete the following prerequisites: Set up AWS Glue Studio.
New feature: Custom AWS service blueprints Previously, Amazon DataZone provided default blueprints that created AWS resources required for data lake, data warehouse, and machine learning use cases. You can build projects and subscribe to both unstructured and structured data assets within the Amazon DataZone portal.
As the queries finish running, an UNLOAD operation is invoked from the Redshift data warehouse to the S3 bucket in Account A. The pipeline then starts running stored procedures and SQL commands on Redshift Serverless.
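An UNLOAD step of that shape might look like the following, here issued through the Redshift Data API; the workgroup, bucket, and role names are placeholders:

```python
import boto3

client = boto3.client("redshift-data")

# UNLOAD writes the query result to the S3 bucket in Account A as Parquet files.
client.execute_statement(
    WorkgroupName="my-workgroup",  # placeholder serverless workgroup
    Database="dev",
    Sql="""
        UNLOAD ('SELECT * FROM daily_sales WHERE sale_date = CURRENT_DATE')
        TO 's3://account-a-bucket/exports/daily_sales_'
        IAM_ROLE 'arn:aws:iam::111122223333:role/redshift-unload-role'
        FORMAT AS PARQUET;
    """,
)
```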
You can learn how to query Delta Lake native tables through UniForm from different data warehouses or engines such as Amazon Redshift as an example of expanding data access to more engines. For those data warehouses, Delta Lake tables need to be converted to manifest tables, which requires additional operational overhead.
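The conversion in question is typically Delta Lake's symlink manifest generation, after which an engine such as Redshift Spectrum can define an external table over the manifest. A PySpark sketch, with the table path as a placeholder:

```python
from pyspark.sql import SparkSession

# Assumes the Delta Lake package is available on the cluster.
spark = (
    SparkSession.builder.appName("delta-manifest")
    .config("spark.sql.extensions", "io.delta.sql.DeltaSparkSessionExtension")
    .getOrCreate()
)

# Generate a symlink manifest so engines that cannot read Delta natively
# can query the table's current snapshot through an external table.
spark.sql(
    "GENERATE symlink_format_manifest FOR TABLE delta.`s3://my-bucket/delta/orders`"
)
```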
Amazon Redshift is a fully managed, petabyte-scale data warehouse service in the cloud. Tens of thousands of customers use Amazon Redshift to process exabytes of data every day to power their analytics workloads. For more information, see ElectricityLoadDiagrams20112014 Data Set (Dua, D. and Karra Taniskidou, E.
Amazon Redshift is a fast, scalable, secure, and fully managed cloud data warehouse that lets you analyze your data at scale. Amazon Redshift Serverless lets you access and analyze data without the usual configurations of a provisioned data warehouse. In her spare time, Blessing loves traveling and adventures.
A CDC-based approach captures data changes and makes them available in data warehouses for further analytics in real time. The target (usually a data warehouse) needs to reflect those changes in near real time. This post showcases how to use streaming ingestion to bring data to Amazon Redshift.
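Streaming ingestion maps a stream into Redshift as an external schema and materializes it with a view. A sketch of the two statements involved, with the stream, role, and workgroup names as placeholders:

```python
import boto3

client = boto3.client("redshift-data")

statements = [
    # Map the Kinesis stream into Redshift via an external schema.
    """
    CREATE EXTERNAL SCHEMA kinesis_schema
    FROM KINESIS
    IAM_ROLE 'arn:aws:iam::111122223333:role/redshift-streaming-role';
    """,
    # A materialized view over the stream makes CDC events queryable
    # in near real time; AUTO REFRESH keeps it current.
    """
    CREATE MATERIALIZED VIEW orders_stream_mv AUTO REFRESH YES AS
    SELECT approximate_arrival_timestamp,
           JSON_PARSE(kinesis_data) AS payload
    FROM kinesis_schema."orders-cdc-stream";
    """,
]

for sql in statements:
    client.execute_statement(WorkgroupName="my-workgroup", Database="dev", Sql=sql)
```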
As the volume and complexity of analytics workloads continue to grow, customers are looking for more efficient and cost-effective ways to ingest and analyze data. OpenSearch Service is used for multiple purposes, such as observability, search analytics, consolidation, cost savings, compliance, and integration.
This integration simplifies the authentication and authorization process for Amazon Redshift users using Query Editor V2 or Amazon QuickSight, making it easier for them to securely access your data warehouse. Note: Your organization’s IAM Identity Center (IdC) instance must be in the same Region as the Amazon Redshift data warehouse you’re connecting to.
Amazon Redshift is a fast, fully managed, petabyte-scale data warehouse service that makes it simple and cost-effective to analyze all your data efficiently and securely. Users such as data analysts, database developers, and data scientists use SQL to analyze their data in Amazon Redshift data warehouses.
December 18, 2012: Dresner’s Point: Will Amazon’s Redshift Become a BI Swiss Army Knife? BIWisdom tweetchat tribe members were facing off in response to the question of whether the EDW (enterprise data warehouse) is dead. December 11, 2012: Dresner’s Point: What’s Innovation Worth in BI? Once upon a time.
At the same time, they need to optimize operational costs to unlock the value of this data for timely insights, and to do so with consistent performance. With this massive data growth, data proliferation across your data stores, data warehouse, and data lakes can become equally challenging.
Amazon Redshift is a fully managed, scalable cloud data warehouse that accelerates your time to insights with fast, straightforward, and secure analytics at scale. Tens of thousands of customers rely on Amazon Redshift to analyze exabytes of data and run complex analytical queries, making it the most widely used cloud data warehouse.
One of the bank’s key challenges related to strict cybersecurity requirements is to implement field-level encryption for personally identifiable information (PII), Payment Card Industry (PCI) data, and data that is classified as high privacy risk (HPR). Only users with the required permissions are allowed to access data in clear text.
In fact, you may have even heard about IDC’s new Global DataSphere Forecast, 2021-2025, which projects that global data production and replication will expand at a compound annual growth rate of 23% during the projection period, reaching 181 zettabytes in 2025. zettabytes of data in 2020, a tenfold increase from 6.5
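Those figures are internally consistent: discounting the 181 ZB forecast back five years at 23% CAGR implies roughly 64 ZB in 2020, which is about a tenfold increase from 6.5 ZB. A quick check:

```python
# Back out the implied 2020 data volume from the 2025 forecast at 23% CAGR.
cagr = 0.23
zb_2025 = 181
years = 5

zb_2020 = zb_2025 / (1 + cagr) ** years
print(f"Implied 2020 volume: {zb_2020:.1f} ZB")  # ~64.3 ZB, about 10x 6.5 ZB
```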
Amazon Redshift Serverless makes it easy to run and scale analytics in seconds without the need to set up and manage data warehouse clusters. Customers use their preferred SQL clients to analyze their data in Redshift Serverless. A Redshift Serverless data warehouse. If you don’t have one, you can sign up for one.
This allows data that exists in cloud object storage to be easily combined with existing data warehouse data without data movement. The advantage to NPS clients is that they can store infrequently used data in a cost-effective manner without having to move that data into a physical data warehouse table.
That was the Science, here comes the Technology… A Brief Hydrology of Data Lakes. Even back then, these were used for activities such as Analytics, Dashboards, Statistical Modelling, Data Mining and Advanced Visualisation. This is the essence of Convergent Evolution. In Closing.
He is passionate about helping customers build modern data architectures on the AWS Cloud. He has helped customers of all sizes implement data management, data warehouse, and data lake solutions. Brian Dolan joined Amazon as a Military Relations Manager in 2012 after his first career as a Naval Aviator.
A write-back is the ability to update a data mart, data warehouse, or any other database backend from within BI dashboards and analyze the updated data in near real time within the dashboard itself. AnyCompany currently uses Amazon Redshift as their enterprise data warehouse platform and QuickSight as their BI solution.
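One common shape for the write-back path is a small handler that the dashboard action invokes, which applies the edit through the Redshift Data API. Everything below (workgroup, table, columns) is a hypothetical sketch, not the post's actual implementation:

```python
import boto3

client = boto3.client("redshift-data")

def write_back(record_id: int, new_status: str) -> str:
    """Persist an edit made in the BI dashboard back to the warehouse table."""
    resp = client.execute_statement(
        WorkgroupName="my-workgroup",  # placeholder
        Database="dev",
        Sql="UPDATE sales_forecast SET status = :status WHERE id = :id",
        Parameters=[
            {"name": "status", "value": new_status},
            {"name": "id", "value": str(record_id)},
        ],
    )
    # The statement id can be polled with describe_statement for completion,
    # after which the dashboard re-queries the updated data.
    return resp["Id"]
```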
Apache Spark enables you to build applications in a variety of languages, such as Java, Scala, and Python, by accessing the data in your Amazon Redshift data warehouse. Amazon Redshift integration for Apache Spark helps developers seamlessly build and run Apache Spark applications on Amazon Redshift data.
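A PySpark read might look like the following; the format string and options follow the open-source spark-redshift community connector that the integration builds on, and the URL, table, bucket, and role are placeholders:

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("redshift-read").getOrCreate()

# Read a Redshift table into a DataFrame; the connector stages data through S3.
df = (
    spark.read.format("io.github.spark_redshift_community.spark.redshift")
    .option("url", "jdbc:redshift://my-cluster.example.us-east-1.redshift.amazonaws.com:5439/dev")
    .option("dbtable", "public.daily_sales")
    .option("tempdir", "s3://my-bucket/spark-redshift-temp/")
    .option("aws_iam_role", "arn:aws:iam::111122223333:role/redshift-spark-role")
    .load()
)

df.groupBy("region").count().show()
```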
With the AWS Glue Salesforce connector, you can ingest and transform your CRM data to any of the AWS Glue supported destinations, including Amazon Simple Storage Service (Amazon S3), in your preferred format, including Apache Iceberg, Apache Hudi, and Linux Foundation Delta Lake; data warehouses such as Amazon Redshift and Snowflake; and many more.
You might be modernizing your data architecture using Amazon Redshift to enable access to your data lake and data in your data warehouse, and are looking for a centralized and scalable way to define and manage the data access based on IdP identities.
Many customers run big data workloads such as extract, transform, and load (ETL) on Apache Hive to create a data warehouse on Hadoop. Sandeep Singh is a Lead Consultant at AWS ProServe, focused on analytics, data lake architecture, and implementation. Amol Guldagad is a Data Analytics Consultant based in India.
These business units have varying landscapes, where a data lake is managed by Amazon Simple Storage Service (Amazon S3) and analytics workloads are run on Amazon Redshift, a fast, scalable, and fully managed cloud data warehouse that allows you to process and run your complex SQL analytics workloads on structured and semi-structured data.
His background is in data warehouse and data lake architecture, development, and administration. He has been in the data and analytics field for over 14 years. Ramesh Raghupathy is a Senior Data Architect with WWCO ProServe at AWS. He specializes in building and modernizing analytical solutions.
December 2012: Alation forms and goes to work creating the first enterprise data catalog. Later, in its inaugural report on data catalogs, Forrester Research recognizes that “Alation started the MLDC trend.” Here’s a timeline view of what the market has said about Alation since our founding: Timeline: 10 Years of Alation.
Amazon Redshift is a fast, petabyte-scale cloud data warehouse that tens of thousands of customers rely on to power their analytics workloads. With its massively parallel processing (MPP) architecture and columnar data storage, Amazon Redshift delivers high price-performance for complex analytical queries against large datasets.
2012: Amazon Redshift, the first cloud-based data warehouse service of its kind, comes into existence. Fact: IBM built the world’s first data warehouse in the 1980s. Google launches BigQuery, its own data warehousing tool, and Microsoft introduces Azure SQL Data Warehouse and Azure Data Lake Store.
Her recent projects include delivering a SQL Server 2012 data warehouse and BI solutions for a number of high-profile clients in the US and Australia. She has over 10 years of experience working with SQL Server for large-scale and multi-million-dollar OLTP and ETL projects as a SQL Server developer and system analyst/designer.