You can now generate data integration jobs for various data sources and destinations, including Amazon Simple Storage Service (Amazon S3) data lakes with popular file formats like CSV, JSON, and Parquet, as well as modern table formats such as Apache Hudi, Delta, and Apache Iceberg.
While customers can perform some basic analysis within their operational or transactional databases, many still need to build custom data pipelines that use batch or streaming jobs to extract, transform, and load (ETL) data into their data warehouse for more comprehensive analysis.
Amazon Redshift is a fast, scalable, and fully managed cloud data warehouse that allows you to process and run your complex SQL analytics workloads on structured and semi-structured data. The system had an integration with legacy backend services that were all hosted on premises. The downside here is over-provisioning.
The world’s an eventful place, isn’t it? When we say ‘eventful’, we mean there are so many things happening around the world, every day, every minute, and many of them take the form of glamorous, lavish, large-scale events, be it a phone launch, a mega concert, a fair, and so on. Who’s coming?
Amazon Redshift is the most widely used data warehouse in the cloud, best suited for analyzing exabytes of data and running complex analytical queries. Amazon QuickSight is a fast business analytics service to build visualizations, perform ad hoc analysis, and quickly get business insights from your data.
A CDC-based approach captures the data changes and makes them available in data warehouses for further analytics in real time; the target (usually a data warehouse) needs to reflect those changes in near real-time. This post showcases how to use streaming ingestion to bring data to Amazon Redshift.
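To make the streaming ingestion idea concrete, here is a minimal sketch (not from the post itself) of how a Kinesis stream is typically mapped into Redshift as an external schema plus a materialized view, submitted through the Data API with boto3; the workgroup, database, stream, and role names are all hypothetical.

```python
import boto3

# Hypothetical serverless workgroup, database, stream, and IAM role.
client = boto3.client("redshift-data")

statements = [
    # Expose the Kinesis stream to Redshift through an external schema.
    """CREATE EXTERNAL SCHEMA kds
       FROM KINESIS
       IAM_ROLE 'arn:aws:iam::123456789012:role/redshift-streaming-role';""",
    # Materialize the stream; each refresh pulls newly arrived change events.
    """CREATE MATERIALIZED VIEW cdc_events AS
       SELECT approximate_arrival_timestamp,
              JSON_PARSE(kinesis_data) AS payload
       FROM kds."cdc-events";""",
    "REFRESH MATERIALIZED VIEW cdc_events;",
]

for sql in statements:
    client.execute_statement(WorkgroupName="analytics-wg", Database="dev", Sql=sql)
```

Refreshing the view (or creating it with auto refresh) is what keeps the warehouse within seconds of the source instead of waiting on batch loads.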
Amazon Redshift is a fully managed, petabyte-scale data warehouse service in the cloud. You can start with just a few hundred gigabytes of data and scale to a petabyte or more. This enables you to use your data to acquire new insights for your business and customers. Document the entire disaster recovery process.
The applications are hosted in dedicated AWS accounts and require a BI dashboard and reporting services based on Tableau. AWS Database Migration Service (AWS DMS) is used to securely transfer the relevant data to a central Amazon Redshift cluster. AWS DMS tasks are orchestrated using AWS Step Functions.
Amazon Redshift is a fast, petabyte-scale, cloud data warehouse that tens of thousands of customers rely on to power their analytics workloads. With its massively parallel processing (MPP) architecture and columnar data storage, Amazon Redshift delivers high price-performance for complex analytical queries against large datasets.
You will load the event data from the SFTP site, join it to the venue data stored on Amazon S3, apply transformations, and store the data in Amazon S3. The event and venue files are from the TICKIT dataset. You will need access to an SFTP server with permissions to upload and download data. Choose Store a new secret.
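As a rough sketch of the first step, the event file can be pulled from the SFTP server with paramiko and staged in Amazon S3 for the join with the venue data; the host, credentials, bucket, and paths below are hypothetical, and in practice the credentials would come from the secret stored above.

```python
import boto3
import paramiko

# Hypothetical SFTP endpoint; real credentials belong in AWS Secrets Manager.
transport = paramiko.Transport(("sftp.example.com", 22))
transport.connect(username="tickit_user", password="change-me")
sftp = paramiko.SFTPClient.from_transport(transport)

# Download the TICKIT event file from the SFTP site...
sftp.get("/uploads/allevents_pipe.txt", "/tmp/allevents_pipe.txt")
sftp.close()
transport.close()

# ...then stage it in S3 next to the venue data for the downstream join.
s3 = boto3.client("s3")
s3.upload_file("/tmp/allevents_pipe.txt", "my-tickit-bucket",
               "raw/events/allevents_pipe.txt")
```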
Zero-ETL will perform an initial full load of your collection by doing a collection scan on the primary instance of your Amazon DocumentDB cluster. This may take several minutes to complete depending on the size of the data, and you may notice elevated resource consumption on your cluster. A collection is referred to by database and collection name, for example inventory.product.
Several loading options are currently available: the Amazon Redshift COPY command can load data from Amazon Simple Storage Service (Amazon S3), Amazon EMR, Amazon DynamoDB, or remote hosts over SSH. This native feature of Amazon Redshift uses massively parallel processing (MPP) to load objects directly from data sources into Redshift tables.
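For illustration, a COPY issued through the Data API might look like the following minimal sketch; the cluster, table, bucket, and role names are assumptions, not from the original post.

```python
import boto3

client = boto3.client("redshift-data")

# COPY fans the load out across compute node slices (MPP), so splitting the
# input into multiple S3 objects lets the cluster ingest them in parallel.
copy_sql = """
    COPY sales
    FROM 's3://my-tickit-bucket/raw/sales/'
    IAM_ROLE 'arn:aws:iam::123456789012:role/redshift-copy-role'
    FORMAT AS CSV
    IGNOREHEADER 1;
"""

client.execute_statement(
    ClusterIdentifier="my-cluster",
    Database="dev",
    DbUser="awsuser",
    Sql=copy_sql,
)
```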
Complex mathematical algorithms are used to segment data and estimate the likelihood of subsequent events. Every data scientist needs to know data mining as well, but we will talk about that a bit later. Where to Use Data Science? It hosts a data analysis competition. Practical experience.
All the logic is still in Java hosted on Amazon’s infrastructure.” Aside from the core cloud services, Choice also uses Amazon Redshift as a front end to its cloud data warehouse, Amazon SageMaker to build machine learning models, and Amazon Kinesis to collect, process, and analyze real-time data.
Another hypothesis: Databricks execs were billion-dollar stoked to stick it to Snowflake by drowning out its event with a buyout its rival reportedly sought. There’s a record of everything – including metadata changes – which paves the way for a host of management and governance capabilities.
We like to call Dave one of our “angels” because he truly does work really hard to connect us with potential customers, is a great ally of ours, and always shows up for events that we host in our office. What has impressed you the most about Juice or its team? Tough question, because there is much to admire, enjoy and soak up.
Amazon Redshift is a fast, fully managed, petabyte-scale data warehouse that provides the flexibility to use provisioned or serverless compute for your analytical workloads. You can get faster insights without spending valuable time managing your data warehouse. Fault tolerance is built in.
A write-back is the ability to update a data mart, data warehouse, or any other database backend from within BI dashboards and analyze the updated data in near real-time within the dashboard itself. AnyCompany currently uses Amazon Redshift as their enterprise data warehouse platform and QuickSight as their BI solution.
Amazon Redshift is a popular cloud data warehouse, offering a fully managed cloud-based service that seamlessly integrates with an organization’s Amazon Simple Storage Service (Amazon S3) data lake, real-time streams, machine learning (ML) workflows, transactional workflows, and much more, all while providing up to 7.9x better price-performance.
It also makes it easier for engineers, data scientists, product managers, analysts, and business users to access data throughout an organization to discover, use, and collaborate to derive data-driven insights. Amazon EventBridge is used as a mechanism to capture Amazon DataZone events and trigger the solution’s corresponding workflow.
This includes supporting Snowflake External OAuth configuration and leveraging Snowpark for exploratory data analysis with DataRobot-hosted Notebooks and model scoring. Exploratory Data Analysis: After we connect to Snowflake, we can start our ML experiment. We recently announced DataRobot’s new Hosted Notebooks capability.
Before we dive in, we recommend reviewing Architectural patterns for real-time analytics using Amazon Kinesis Data Streams, part 1 for the basic functionalities of Kinesis Data Streams. Part 1 also contains architectural examples for building real-time applications for time series data and event-sourcing microservices.
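As a small illustrative sketch (the stream and field names are hypothetical), a time series producer writes to Kinesis Data Streams by choosing a partition key, here the sensor ID, so events from the same source stay ordered within a shard.

```python
import json
import boto3

kinesis = boto3.client("kinesis")

# One reading from a hypothetical sensor fleet.
event = {"sensor_id": "s-42", "ts": "2024-01-01T00:00:00Z", "temp_c": 21.5}

# The partition key routes all of this sensor's events to the same shard,
# preserving per-sensor ordering for downstream consumers.
kinesis.put_record(
    StreamName="sensor-events",
    Data=json.dumps(event).encode("utf-8"),
    PartitionKey=event["sensor_id"],
)
```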
While cloud-native, point-solution data warehouse services may serve your immediate business needs, there are dangers to the corporation as a whole when you do your own IT this way. Cloudera Data Warehouse (CDW) is here to save the day! CDW is an integrated data warehouse service within Cloudera Data Platform (CDP).
In short, CDP Private Cloud is a game-changer for Cloudera partners as it provides opportunities to help their customers modernize their data platform by breaking up monolithic architectures without leaving their data centers! Be on the lookout for events around CDP Private Cloud enablement sessions.
insightsoftware’s Excelapalooza, one of the largest Microsoft Excel learning events in the country, is no exception. Who: insightsoftware, now including Jet Global reporting and analytics, is hosting the event for any and all business professionals who use Excel, Jet, Microsoft Dynamics, Epicor, Sage, or any other ERP system.
Given the prohibitive cost of scaling it, in addition to the new business focus on data science and the need to leverage public cloud services to support future growth and capability roadmap, SMG decided to migrate from the legacy data warehouse to Cloudera’s solution using Hive LLAP. The case for a new data warehouse?
At the same time, they need to optimize operational costs to unlock the value of this data for timely insights and do so with consistent performance. With this massive data growth, data proliferation across your data stores, data warehouse, and data lakes can become equally challenging.
It’s clear today that the data warehouse industry is undergoing a major transformation. We’ll be hosting a live event on January 10, at 10AM Pacific, 1PM Eastern, to share more details on how the new Cloudera will accelerate innovation and deliver the industry’s first Enterprise Data Cloud. We intend to win.
You can subscribe to data products that help enrich customer profiles, for example demographics data, advertising data, and financial markets data. Amazon Kinesis ingests streaming events in real time from point-of-sales systems, clickstream data from mobile apps and websites, and social media data.
Cloud data lakehouses provide significant scaling, agility, and cost advantages compared to cloud data lakes and cloud data warehouses. “They combine the best of both worlds: the flexibility and cost-effectiveness of data lakes, and the performance and reliability of data warehouses.” Host-based security.
The list of challenges is long: cloud attack surface sprawl, complex application environments, information overload from disparate tools, noise from false positives and low-risk events, just to name a few. You get near real-time visibility and insights from your ingested data.
I want to thank you all for joining and attending these events! I received hundreds of questions during these events, and my colleagues and I tried to answer as many as we could. NiFi should be seen as the gateway to move data back and forth between heterogeneous environments or in a hybrid cloud architecture.
With quality data at their disposal, organizations can form data warehouses for the purposes of examining trends and establishing future-facing strategies. Industry-wide, the positive ROI on quality data is well understood. This is due to the technical nature of a data system itself.
Recently, Confluent hosted Current 2023 (formerly Kafka Summit) in San Jose on September 26 and 27. With few conferences curating content specific to streaming developers, Current has historically been an important event for anyone trying to keep a pulse on what’s happening in the streaming space.
People were familiar with the value of a data catalog (and the growing need for data governance ), though many admitted to being somewhat behind on their journeys. In this blog, I’ll share a quick high-level overview of the event, with an eye to core themes. In “The modern data stack is dead, long live the modern data stack!”
Along with a host of new features and capabilities, we are improving the upgrade process to be as painless as possible with the new release and the in-place upgrade from HDP, which completely does away with replacing infrastructure and data migrations. Hive Warehouse Connector (HWC) makes data engineering simpler and faster.
In this post, you will learn how to build a serverless analytics application using Amazon Redshift Data API and Amazon API Gateway WebSocket and REST APIs. The Data API simplifies access to Amazon Redshift because you don’t need to configure drivers and manage database connections. What are WebSockets and why do we need them?
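Before the WebSocket layer, it helps to see the Data API's asynchronous contract on its own; this is a minimal sketch with placeholder cluster and database names: submit a statement, poll its status, then fetch the result set.

```python
import time
import boto3

client = boto3.client("redshift-data")

# Submit the query; the call returns immediately with a statement Id.
resp = client.execute_statement(
    ClusterIdentifier="my-cluster",
    Database="dev",
    DbUser="awsuser",
    Sql="SELECT eventname, COUNT(*) FROM event GROUP BY eventname LIMIT 10;",
)

# Poll until the statement reaches a terminal state.
while True:
    desc = client.describe_statement(Id=resp["Id"])
    if desc["Status"] in ("FINISHED", "FAILED", "ABORTED"):
        break
    time.sleep(1)

# Fetch rows only if the statement produced a result set.
if desc["Status"] == "FINISHED" and desc.get("HasResultSet"):
    print(client.get_statement_result(Id=resp["Id"])["Records"])
```

In an architecture like the one described, a WebSocket connection can replace that polling loop: the client is notified when results are ready instead of asking repeatedly.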
Thousands of customers rely on Amazon Redshift to build data warehouses to accelerate time to insights with fast, simple, and secure analytics at scale, and to analyze data from terabytes to petabytes by running complex analytical queries. Data loading is one of the key aspects of maintaining a data warehouse.
Data warehouses have become intensely important in the modern business world. For many organizations, it’s not uncommon for all their data to be extracted, loaded unchanged into data warehouses, and then transformed via cleaning, merging, aggregation, etc. OLTP systems do not hold historical data, only current data.
Of course, if you use several different data management frameworks within your data science workflows, as just about everybody does these days, much of that RDBMS magic vanishes in a puff of smoke. Some may ask: “Can’t we all just go back to the glory days of business intelligence, OLAP, and enterprise data warehouses?”
You will also want to apply incremental updates with change data capture (CDC) from the source system to the destination. To make data-driven decisions in a timely manner, you need to account for missed records and backpressure, and maintain event ordering and integrity, especially if the reference data also changes rapidly.
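One common way to apply such CDC batches (a sketch under assumed table names, not necessarily the post's exact method) is to land changes in a staging table, then apply a delete-and-insert in a single transaction so readers never see a half-applied batch; the Data API's batch call below runs its statements as one transaction.

```python
import boto3

client = boto3.client("redshift-data")

# batch_execute_statement runs the statements as a single transaction.
client.batch_execute_statement(
    WorkgroupName="analytics-wg",
    Database="dev",
    Sqls=[
        # Remove target rows that have a newer version (or a delete) staged.
        """DELETE FROM customers
           USING customers_staging
           WHERE customers.customer_id = customers_staging.customer_id;""",
        # Re-insert everything except rows whose latest change is a delete.
        """INSERT INTO customers (customer_id, name, email)
           SELECT customer_id, name, email
           FROM customers_staging
           WHERE op <> 'D';""",
        # Clear the staging table for the next batch.
        "TRUNCATE customers_staging;",
    ],
)
```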
Fun fact: I co-founded an e-commerce company (realistically, a mail-order catalog hosted online) in December 1992 using one of those internetworking applications called Gopher, which was vaguely popular at the time. Most of the data management moved to back-end servers, e.g., databases.
Customers often use many SQL scripts to select and transform the data in relational databases hosted either in an on-premises environment or on AWS and use custom workflows to manage their ETL. AWS Glue is a serverless data integration and ETL service with the ability to scale on demand. Choose Submit.
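As a hedged example of what such a Glue job might resemble, here is a self-contained PySpark script (the catalog, table, and bucket names are invented) that does a SQL-style select-filter-write in Glue's idiom.

```python
import sys
from awsglue.context import GlueContext
from awsglue.job import Job
from awsglue.utils import getResolvedOptions
from pyspark.context import SparkContext

args = getResolvedOptions(sys.argv, ["JOB_NAME"])
glue_context = GlueContext(SparkContext.getOrCreate())
job = Job(glue_context)
job.init(args["JOB_NAME"], args)

# Read a cataloged source table (hypothetical names).
orders = glue_context.create_dynamic_frame.from_catalog(
    database="sales_db", table_name="orders"
)

# Equivalent of: SELECT * FROM orders WHERE status = 'shipped'
shipped = orders.filter(lambda row: row["status"] == "shipped")

# Write the transformed data to S3 as Parquet.
glue_context.write_dynamic_frame.from_options(
    frame=shipped,
    connection_type="s3",
    connection_options={"path": "s3://my-etl-bucket/curated/orders/"},
    format="parquet",
)
job.commit()
```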
Pillar #1: Data platform. The data platform pillar comprises tools, frameworks, and processing and hosting technologies that enable an organization to process large volumes of data, both in batch and streaming modes. The choice of vendors should align with the broader cloud or on-premises strategy.