Blog, Data Processing and Data Warehouse

Unlocking near real-time analytics with petabytes of transaction data using Amazon Aurora Zero-ETL integration with Amazon Redshift and dbt Cloud

AWS Big Data

NOVEMBER 27, 2024

While customers can perform some basic analysis within their operational or transactional databases, many still need to build custom data pipelines that use batch or streaming jobs to extract, transform, and load (ETL) data into their data warehouse for more comprehensive analysis.

Data Warehouse

Data Warehouse Analytics Testing Sales

Accelerate Offloading to Cloudera Data Warehouse (CDW) with Procedural SQL Support

Cloudera

JULY 16, 2021

Did you know Cloudera customers, such as SMG and Geisinger, offloaded their legacy DW environment to Cloudera Data Warehouse (CDW) to take advantage of CDW’s modern architecture and best-in-class performance? The Data Warehouse on Cloudera Data Platform provides easy-to-use self-service and advanced analytics use cases at scale.

Data Warehouse

Data Warehouse Data Processing Management Testing

The DataOps Vendor Landscape, 2021

DataKitchen

APRIL 13, 2021

Read the complete blog below for a more detailed description of the vendors and their capabilities. This is not surprising given that DataOps enables enterprise data teams to generate significant business value from their data. QuerySurge – Continuously detect data issues in your delivery pipelines.

Testing

Testing Machine Learning Consulting Data Science

Introduction To The Basic Business Intelligence Concepts

datapine

MAY 9, 2019

Business intelligence concepts refer to the usage of digital computing technologies in the form of data warehouses, analytics and visualization with the aim of identifying and analyzing essential business-based data to generate new, actionable corporate insights. The data warehouse. 1) The raw data.

Business Intelligence

Business Intelligence Dashboards Data Warehouse Visualization

Take Your SQL Skills To The Next Level With These Popular SQL Books

datapine

SEPTEMBER 27, 2022

With a MySQL dashboard builder, for example, you can connect all the data with a few clicks. A host of notable brands and retailers with colossal inventories and multiple site pages use SQL to enhance their site’s structure, functionality, and MySQL reporting processes. It is a must-read for understanding data warehouse design.

Business Intelligence

Business Intelligence Data Warehouse Data Processing Data mining

Scaling RISE with SAP data and AWS Glue

AWS Big Data

NOVEMBER 29, 2024

Customers often want to augment and enrich SAP source data with other non-SAP source data. Such analytic use cases can be enabled by building a data warehouse or data lake. Customers can now use the AWS Glue SAP OData connector to extract data from SAP.

Visualization

Visualization Data Processing Data-driven Cost-Benefit

What Is Ad Hoc Reporting? Your Guide To Definition, Meaning, Examples & Benefits

datapine

JULY 1, 2020

Moreover, a host of ad hoc analysis or reporting platforms boast integrated online data visualization tools to help enhance the data exploration process. Retail: Ad hoc data analysis proves particularly effective in loss prevention in the retail sector. public URL will enable you to send a simple link.

Reporting

Reporting Dashboards Cost-Benefit Visualization

5 misconceptions about cloud data warehouses

IBM Big Data Hub

FEBRUARY 2, 2023

In today’s world, data warehouses are a critical component of any organization’s technology ecosystem. The rise of cloud has allowed data warehouses to provide new capabilities such as cost-effective data storage at petabyte scale, highly scalable compute and storage, pay-as-you-go pricing and fully managed service delivery.

Data Warehouse

Data Warehouse Cost-Benefit Unstructured Data Data Architecture

Unlocking Data Storage: The Traditional Data Warehouse vs. Cloud Data Warehouse

Sisense

NOVEMBER 12, 2020

Data warehouse vs. databases; Traditional vs. Cloud Explained; Cloud data warehouses in your data stack; A data-driven future powered by the cloud. We live in a world of data: There’s more of it than ever before, in a ceaselessly expanding array of forms and locations. Data warehouse vs. databases.

Data Warehouse

Data Warehouse Data Lake OLAP Data-driven

Common Business Intelligence Challenges Facing Entrepreneurs

datapine

MAY 21, 2019

These benefits include cost efficiency, the optimization of inventory levels, the reduction of information waste, enhanced marketing communications, and better internal communication – among a host of other business-boosting improvements. In the past, expensive enterprise BI solutions required huge hardware resources. Welcome to the future.

Business Intelligence

Business Intelligence Cost-Benefit Dashboards ROI

Cloud Flexibility: Examining Three Cloud Hosting Options

Sisense

FEBRUARY 17, 2021

With more people becoming digital citizens, the ability for an application to explode in popularity has all but rendered obsolete the traditional IT hosting mindset of discrete servers performing discrete tasks. Let’s dig into three hosting models that help organizations achieve cloud flexibility.

Data Processing

Data Processing Modeling Technology IT

How to enable Cloudera Data Visualization in CDW

Cloudera

SEPTEMBER 30, 2020

In our previous blog post we introduced Cloudera Data Visualization in Cloudera Data Warehouse (CDW) available in tech preview, in CDP Public Cloud. This blog will help you get started with Cloudera Data Visualization, so you can start building interesting and powerful applications on all types of data.

Visualization

Visualization Data Warehouse Dashboards Data Processing

Build a secure data visualization application using the Amazon Redshift Data API with AWS IAM Identity Center

AWS Big Data

MARCH 6, 2025

Tens of thousands of customers use Amazon Redshift for modern data analytics at scale, delivering up to three times better price-performance and seven times better throughput than other cloud data warehouses. This makes sure that user access and roles are consistently maintained across both AWS services and external tools.

Visualization

Visualization Sales Data Warehouse Management

Create your Private Data Warehousing Environment Using Azure Kubernetes Service

Cloudera

DECEMBER 2, 2021

Cloudera secures your data by providing encryption at rest and in transit, multi-factor authentication, Single Sign On, robust authorization policies, and network security. It is part of the Cloudera Data Platform, or CDP, which runs on Azure and AWS, as well as in the private cloud. Enter “0.0.0.0/0” in the Whitelist IP CIDR(s).

Data Lake

Data Lake Data Warehouse Data Processing Interactive

Extreme data center pressure? Burst to the cloud with CDP!

Cloudera

NOVEMBER 12, 2020

Your sunk costs are minimal and if a workload or project you are supporting becomes irrelevant, you can quickly spin down your cloud data warehouses and not be “stuck” with unused infrastructure. Cloud deployments for suitable workloads give you the agility to keep pace with rapidly changing business and data needs.

Data Warehouse

Data Warehouse Reporting Risk Cost-Benefit

The disruptive potential of open data lakehouse architectures and IBM watsonx.data

IBM Big Data Hub

JUNE 15, 2023

It is comprised of commodity cloud object storage, open data and open table formats, and high-performance open-source query engines. To help organizations scale AI workloads, we recently announced IBM watsonx.data, a data store built on an open data lakehouse architecture and part of the watsonx AI and data platform.

Data Warehouse

Data Warehouse Data Lake Optimization Data-driven

Drinking our own champagne – Cloudera upgrades to CDP Private Cloud

Cloudera

APRIL 21, 2021

This cluster runs workloads for every department – from real-time user interfaces for Support to providing recommendations in the Cloudera Data Platform (CDP) Upgrade Advisor to analyzing our business and closing our books. In this blog, we discuss our journey to CDP for this critical cluster. Lessons Learned. CDP Knowledge Hub.

Testing

Testing Data Processing Interactive Data Warehouse

The Multifaceted Value Proposition of the Cloudera Data Platform

Cloudera

FEBRUARY 22, 2021

That benefit comes from the breadth of CDP’s analytical capabilities that translates into a unique ability to migrate different big data workloads, either from previous versions of CDH / HDP or from other cloud data warehouses and legacy on-premises data warehouses that the acquired entity might be using.

Cost-Benefit

Cost-Benefit Data Warehouse Data Processing Data Governance

Your Effective Roadmap To Implement A Successful Business Intelligence Strategy

datapine

FEBRUARY 22, 2022

This should also include creating a plan for data storage services. Are the data sources going to remain disparate? Or does building a data warehouse make sense for your organization? Then for knowledge transfer choose the repository, best suited for your organization, to host this information. Define a budget.

Business Intelligence

Business Intelligence Strategy Cost-Benefit Dashboards

96 Percent of Businesses Can’t Be Wrong: How Hybrid Cloud Came to Dominate the Data Sector

Cloudera

JANUARY 26, 2022

Network operating systems let computers communicate with each other; and data storage grew—a 5MB hard drive was considered limitless in 1983 (when compared to a magnetic drum with memory capacity of 10 kB from the 1960s). The amount of data being collected grew, and the first data warehouses were developed.

Data Processing

Data Processing IoT Data Warehouse Cost-Benefit

Use AWS Glue to streamline SFTP data processing

AWS Big Data

AUGUST 13, 2024

With AWS Glue, you can discover and connect to hundreds of diverse data sources and manage your data in a centralized data catalog. It enables you to visually create, run, and monitor extract, transform, and load (ETL) pipelines to load data into your data lakes. Choose Store a new secret.

Data Processing

Data Processing Visualization Data Lake

A Guide To Starting A Career In Business Intelligence & The BI Skills You Need

datapine

MARCH 31, 2022

On the flip side, if you enjoy diving deep into the technical side of things, with the right mix of skills for business intelligence you can work on a host of incredibly interesting problems that will keep you in flow for hours on end. This could involve anything from learning SQL to buying some textbooks on data warehouses.

Business Intelligence

Business Intelligence Statistics Visualization Data-driven

Addressing the Three Scalability Challenges in Modern Data Platforms

Cloudera

NOVEMBER 22, 2021

In legacy analytical systems such as enterprise data warehouses, the scalability challenges of a system were primarily associated with computational scalability, i.e., the ability of a data platform to handle larger volumes of data in an agile and cost-efficient way. Introduction. public, private, hybrid cloud)?

Data Processing

Data Processing Data Warehouse Enterprise Visualization

Building and Evaluating GenAI Knowledge Management Systems using Ollama, Trulens and Cloudera

Cloudera

MAY 23, 2024

In modern enterprises, the exponential growth of data means organizational knowledge is distributed across multiple formats, ranging from structured data stores such as data warehouses to multi-format data stores like data lakes. This contextualization is possible thanks to RAG.

Management

Management Metrics Data Processing Data Lake

Setting up and Getting Started with Cloudera’s New SQL AI Assistant

Cloudera

JANUARY 19, 2024

As described in our recent blog post, an SQL AI Assistant has been integrated into Hue with the capability to leverage the power of large language models (LLMs) for a number of SQL tasks. This is a real game-changer for data analysts on all levels and will make SQL development faster, easier, and less error-prone.

Data Warehouse

Data Warehouse Data Processing Optimization Modeling

Generative AI for the Enterprise

Cloudera

MAY 31, 2023

There are many benefits to these new services, but they certainly are not a one-size-fits-all solution, and this is most true for commercial enterprises looking to adopt generative AI for their own unique use cases powered by their data. How can enterprises address these challenges?

Enterprise

Enterprise Data Processing Machine Learning Experimentation

Get Your Analytics Insights Instantly – Without Abandoning Central IT

Cloudera

JANUARY 21, 2021

While cloud-native, point-solution data warehouse services may serve your immediate business needs, there are dangers to the corporation as a whole when you do your own IT this way. Cloudera Data Warehouse (CDW) is here to save the day! CDW is an integrated data warehouse service within Cloudera Data Platform (CDP).

Data Warehouse

Data Warehouse Data Lake IT Analytics

BusinessObjects in the Cloud – No Big Rush and No Big Deal

Paul Blogs on BI

SEPTEMBER 8, 2021

Well firstly, if the main data warehouses, repositories, or application databases that BusinessObjects accesses are on premise, it makes no sense to move BusinessObjects to the cloud until you move its data sources to the cloud. You also have the option of hosting with a third party.

Data Warehouse

Data Warehouse Data Processing Data Lake Testing

When Data Warehousing Met the Events Industry

BizAcuity

FEBRUARY 1, 2019

The solution here is to consolidate all of this data, gathered from different points at different times along the course of the event and store it in one consolidated form in a Data Warehouse. One of the many things that data warehouses allow is the chronological sifting of data.

Data Warehouse

Data Warehouse B2B Business Intelligence Data-driven

Consolidating Patron’s Data – To Increase Casinos’ ROI

BizAcuity

SEPTEMBER 5, 2019

But more importantly, from a business and strategic viewpoint, it means that casinos are capturing consumer data into data warehouses, at different points inside the casino – the same data that is crucial for a host of purposes. These systems are amassing information into independent data warehouses.

ROI

ROI Data Warehouse Advertising Data Processing

Consolidating Patron’s Data – The Next Big Move to Increase Casinos’ ROI

BizAcuity

SEPTEMBER 5, 2019

But more importantly, from a business and strategic viewpoint, it means that casinos are capturing consumer data into data warehouses, at different points inside the casino – the same data that is crucial for a host of purposes. These systems are amassing information into independent data warehouses.

ROI

ROI Data Warehouse Advertising Data Processing

Bringing More AI to Snowflake, the Data Cloud

DataRobot Blog

FEBRUARY 28, 2023

This includes supporting Snowflake External OAuth configuration and leveraging Snowpark for exploratory data analysis with DataRobot-hosted Notebooks and model scoring. Exploratory Data Analysis: After we connect to Snowflake, we can start our ML experiment. We recently announced DataRobot’s new Hosted Notebooks capability.

Data Processing

Data Processing Experimentation Machine Learning Data Warehouse

Integrate Tableau and Okta with Amazon Redshift using AWS IAM Identity Center

AWS Big Data

JUNE 3, 2024

This blog post is co-written with Sid Wray and Jake Koskela from Salesforce, and Adiascar Cisneros from Tableau. Amazon Redshift is a fast, scalable cloud data warehouse built to serve workloads at any scale. For this blog post, we use the default custom authorization server. scopes, claims, and access policies.

Data Warehouse

Data Warehouse Reporting Testing Publishing

CDP Private Cloud is a Game-changer for Partners

Cloudera

SEPTEMBER 2, 2020

In short, CDP Private Cloud is a game-changer for Cloudera partners as it provides opportunities to help their customers modernize their data platform by breaking up monolithic architectures without leaving their data centers!

Cost-Benefit

Cost-Benefit Data Warehouse Data Lake Machine Learning

The New Cloudera

Cloudera

JANUARY 3, 2019

It’s clear today that the data warehouse industry is undergoing a major transformation. Our new Chief Product Officer Arun Murthy has a post up on the Hortonworks blog, explaining what the future holds in product strategy and development. We intend to win.

Machine Learning

Machine Learning IoT Data Warehouse Enterprise

Migration Supporting Real-Time Analytics for Customer Experience Management

Cloudera

AUGUST 31, 2020

Given the prohibitive cost of scaling it, in addition to the new business focus on data science and the need to leverage public cloud services to support future growth and capability roadmap, SMG decided to migrate from the legacy data warehouse to Cloudera’s solution using Hive LLAP. The case for a new Data Warehouse?

Management

Management Slice and Dice Data Warehouse Analytics

Data Engineering Today: All About the Cloud

Sisense

OCTOBER 10, 2019

When “data engineer” first started becoming a vital role for tech companies, the world was a smaller, simpler place. Engineers were primarily concerned with handling data stored in Excel spreadsheets and on local machines. Today, being a data engineer means connecting your company’s business systems to cloud-based data sources.

Data Processing

Data Processing Data Warehouse Digital Transformation Software

The Ultimate Guide to Modern Data Quality Management (DQM) For An Effective Data Quality Control Driven by The Right Metrics

datapine

SEPTEMBER 29, 2022

With quality data at their disposal, organizations can form data warehouses for the purposes of examining trends and establishing future-facing strategies. Industry-wide, the positive ROI on quality data is well understood. This is due to the technical nature of a data system itself.

Data Quality

Data Quality Metrics Data-driven Management

Top 6 data engineering frameworks to learn

Insight

AUGUST 20, 2019

If you want to get started with Spark, check out this blog on how to set up your very own Spark cluster on AWS. Flink: An alternative to Spark, Flink has gotten a lot of traction in the Data Engineering community. Our Fellows have used it in their projects, often in conjunction with Spark, for the exploration of Reddit data.

Data Warehouse

Data Warehouse Big Data Data-driven Data Processing

Attribute Amazon EMR on EC2 costs to your end-users

AWS Big Data

AUGUST 27, 2024

It takes in three arguments: – The Amazon S3 location of the data file that is read in by the Spark job. The input_full_path is s3://aws-blogs-artifacts-public/artifacts/BDB-2997/sample-data/input/part-00000-a0885743-e0cb-48b1-bc2b-05eb748ab898-c000.snappy.parquet. He has been in the data and analytics field for over 14 years.

Metrics

Metrics Dashboards Data Lake Optimization

The Top Three Entangled Trends in Data Architectures: Data Mesh, Data Fabric, and Hybrid Architectures

Cloudera

SEPTEMBER 29, 2022

Note that the actual technologies used to generate, store, and query the actual data may be varied — and are not even prescribed by data mesh. It is also agnostic to where the different domains are hosted. Data fabric defined. Corresponding to the data mesh example in Figure 4, D1, D2 are tables in a data warehouse.

Data Architecture

Data Architecture Data Warehouse Metadata Sales

How to Accelerate Value from Merger and Acquisition Strategies with Cloudera Data Platform (CDP)

Cloudera

MARCH 22, 2022

orchestrated data warehouse offloads with Gluent) that enable successful migration of workloads that previously ran on legacy data platforms or older Hadoop-based distributions.

Strategy

Strategy Cost-Benefit Risk Data Processing

How Data Governance Protects Sensitive Data

erwin

APRIL 2, 2021

And knowing the business purpose translates into actively governing personal data against potential privacy and security violations. Do You Know Where Your Sensitive Data Is? Data is a valuable asset used to operate, manage and grow a business.

Data Governance

Data Governance Cost-Benefit Metadata Risk

Simplify data loading into Type 2 slowly changing dimensions in Amazon Redshift

AWS Big Data

MARCH 9, 2023

Thousands of customers rely on Amazon Redshift to build data warehouses to accelerate time to insights with fast, simple, and secure analytics at scale and analyze data from terabytes to petabytes by running complex analytical queries. Data loading is one of the key aspects of maintaining a data warehouse.

Slice and Dice

Slice and Dice Data Warehouse Metrics Metadata
