Cost-Benefit, Data Lake and Data Science

How EUROGATE established a data mesh architecture using Amazon DataZone

AWS Big Data

JANUARY 15, 2025

For container terminal operators, data-driven decision-making and efficient data sharing are vital to optimizing operations and boosting supply chain efficiency. Two use cases illustrate how this can be applied for business intelligence (BI) and data science applications, using AWS services such as Amazon Redshift and Amazon SageMaker.

IoT, Machine Learning, Metadata, Data-driven

How Cloudinary transformed their petabyte scale streaming data lake with Apache Iceberg and AWS Analytics

AWS Big Data

JUNE 10, 2024

In this blog post, we dive into different data aspects and how Cloudinary addresses the two concerns of vendor lock-in and cost-efficient data analytics by using Apache Iceberg, Amazon Simple Storage Service (Amazon S3), Amazon Athena, Amazon EMR, and AWS Glue.

Data Lake, Metadata, Snapshot, Analytics

Build a serverless transactional data lake with Apache Iceberg, Amazon EMR Serverless, and Amazon Athena

AWS Big Data

MARCH 10, 2023

Since the deluge of big data over a decade ago, many organizations have learned to build applications to process and analyze petabytes of data. Data lakes have served as a central repository to store structured and unstructured data at any scale and in various formats.

Data Lake, Sales, Data Warehouse, Snapshot

2021 Gift Giving Guide for Data Nerds

DataKitchen

DECEMBER 7, 2021

This book is not available until January 2022, but considering all the hype around the data mesh, we expect it to be a best seller. In the book, author Zhamak Dehghani reveals that, despite the time, money, and effort poured into them, data warehouses and data lakes fail when applied at the scale and speed of today’s organizations.

Data-driven, Data Governance, Big Data, Data Science

How Etihad taps data science to optimise airline operations

CIO Business Intelligence

MARCH 9, 2022

Despite the worldwide chaos, UAE national airline Etihad has managed to generate productivity gains and cost savings from insights using data science. Etihad began its data science journey with the Cloudera Data Platform and moved its data to the cloud to set up a data lake. Talal Mufti.

Data Science, Data Lake, Cost-Benefit, Digital Transformation

Accelerate data science feature engineering on transactional data lakes using Amazon Athena with Apache Iceberg

AWS Big Data

JUNE 20, 2023

It manages large collections of files as tables, and it supports modern analytical data lake operations such as record-level insert, update, delete, and time travel queries. About the Authors Vivek Gautam is a Data Architect with specialization in data lakes at AWS Professional Services.

Data Lake, Data Science, Recreation/Entertainment, Data-driven

Your New Cloud for AI May Be Inside a Colo

CIO Business Intelligence

MAY 23, 2022

Enterprises moving their artificial intelligence projects into full scale development are discovering escalating costs based on initial infrastructure choices. Many companies whose AI model training infrastructure is not proximal to their data lake incur steeper costs as the data sets grow larger and AI models become more complex.

Experimentation, Cost-Benefit, Data Lake, Data Science

How to modernize data lakes with a data lakehouse architecture

IBM Big Data Hub

JULY 5, 2023

Data lakes have been around for well over a decade now, supporting the analytic operations of some of the world's largest corporations. Such data volumes are not easy to move, migrate or modernize. The challenges of a monolithic data lake architecture: Data lakes are, at a high level, single repositories of data at scale.

Data Lake, Metadata, Cost-Benefit, Data Warehouse

Carhartt turns to data under new CIO

CIO Business Intelligence

NOVEMBER 25, 2022

As part of that transformation, Agusti has plans to integrate a data lake into the company’s data architecture and expects two AI proofs of concept (POCs) to be ready to move into production within the quarter. Today, we backflush our data lake through our data warehouse.

Data Lake, Data Warehouse, Unstructured Data, Data Architecture

How Gilead used Amazon Redshift to quickly and cost-effectively load third-party medical claims data

AWS Big Data

NOVEMBER 8, 2023

This post was co-written with Rajiv Arora, Director of Data Science Platform at Gilead Life Sciences. Gilead Sciences, Inc. Redshift Serverless measures data warehouse capacity in Redshift Processing Units (RPUs), which are part of the compute resources. It took an additional 1 hour to create.

Data Lake, Data Warehouse, Cost-Benefit, Optimization

Data transformation takes flight at Atlanta’s Hartsfield-Jackson airport

CIO Business Intelligence

AUGUST 9, 2024

The original proof of concept was to have one data repository ingesting data from 11 sources, including flat files and data stored via APIs on premises and in the cloud, Pruitt says. “There are a lot of variables that determine what should go into the data lake and what will probably stay on premise,” Pruitt says.

Data Transformation, Machine Learning, Data Lake, Dashboards

How GamesKraft uses Amazon Redshift data sharing to support growing analytics workloads

AWS Big Data

NOVEMBER 13, 2023

Amazon Redshift is a fully managed data warehousing service that offers both provisioned and serverless options, making it more efficient to run and scale analytics without having to manage your data warehouse. Additionally, data is extracted from vendor APIs that include data related to product, marketing, and customer experience.

Data Warehouse, Analytics, Data Lake, Data Science

The Future of the Data Lakehouse – Open

CIO Business Intelligence

JUNE 23, 2022

Cloudera customers run some of the biggest data lakes on earth. These lakes power mission-critical, large-scale data analytics, business intelligence (BI), and machine learning use cases, including enterprise data warehouses. On data warehouses and data lakes.

Data Lake, Data Warehouse, Machine Learning, Data-driven

Why Purpose-Built Infrastructure is the Best Option for Scaling AI Model Development

CIO Business Intelligence

AUGUST 4, 2022

Many companies that begin their AI projects in the cloud often reach a point when cost and time variables become issues. But as models and datasets grow, there’s a stifling effect associated with the escalating compute cost and time. “You’re paying a lot of money for data-science talent,” Paikeday says.

Modeling, Cost-Benefit, ROI, Data Lake

The Future of the Data Lakehouse – Open

Cloudera

JUNE 18, 2022

Cloudera customers run some of the biggest data lakes on earth. These lakes power mission-critical, large-scale data analytics, business intelligence (BI), and machine learning use cases, including enterprise data warehouses. On data warehouses and data lakes.

Data Lake, Data Warehouse, Machine Learning, Data-driven

Preparing the foundations for Generative AI

CIO Business Intelligence

FEBRUARY 20, 2024

It unifies all data on a single platform, including data integration, engineering, and warehousing, where it can be used for data science, real-time analytics, and business intelligence – and accessed with natural language queries and the power of generative AI. If this all seems challenging, Avanade can help.

Cost-Benefit, Data Lake, Data Warehouse, Data Processing

How data literacy allows gen AI to drive productivity at Dow

CIO Business Intelligence

JULY 31, 2024

We’re now able to provide real-time predictions about our network performance, optimize our inventory, and reduce costs. Several groups are already recognizing cost saving opportunities alongside efficiency gains. What was the foundation you needed to build to benefit from gen AI? But the technical foundation is just one piece.

Manufacturing, Cost-Benefit, Digital Transformation, Forecasting

What you don’t know about data management could kill your business

CIO Business Intelligence

NOVEMBER 28, 2023

Data, of course, has been all the rage the past decade, having been declared the “new oil” of the digital economy. And yes, data has enormous potential to create value for your business, making its accrual and the analysis of it, aka data science, very exciting. And here is the gotcha piece about data.

Management, Data Architecture, Data Lake, Data Strategy

Accelerate Your Data Mesh in the Cloud with Cloudera Data Engineering and Modak Nabu™

Cloudera

OCTOBER 11, 2021

Modak Nabu automates repetitive tasks in the data preparation process and thus accelerates the data preparation by 4x. They will automatically get the benefits of CDP Shared Data Experience (SDX) with enterprise-grade security and governance. Customers using Modak Nabu with CDP today have deployed Data Lakes and.

Data Lake, Cost-Benefit, Data-driven, Dashboards

Lay the groundwork now for advanced analytics and AI

CIO Business Intelligence

AUGUST 3, 2023

When global technology company Lenovo started utilizing data analytics, the analytics helped identify a new market niche for its gaming laptops and powered remote diagnostics so its customers got the most from their servers and other devices. After moving its expensive, on-premises data lake to the cloud, Comcast created a three-tiered architecture.

Analytics, Data Lake, Metadata, Cost-Benefit

P&G turns to AI to create digital manufacturing of the future

CIO Business Intelligence

OCTOBER 1, 2022

The partners say they will create the future of digital manufacturing by leveraging the industrial internet of things (IIoT), digital twin, data, and AI to bring products to consumers faster and increase customer satisfaction, all while improving productivity and reducing costs. The power of people.

Manufacturing, Digital Transformation, IoT, Internet of Things

Achieve your AI goals with an open data lakehouse approach

IBM Big Data Hub

OCTOBER 4, 2023

A data lakehouse architecture combines the performance of data warehouses with the flexibility of data lakes, to address the challenges of today’s complex data landscape and scale AI. With watsonx.data, you can experience the benefits of a data lakehouse to help scale AI workloads for all your data, anywhere.

Data Lake, Metadata, Data Warehouse, Cost-Benefit

Top 15 data management platforms available today

CIO Business Intelligence

SEPTEMBER 22, 2023

The term “data management platform” can be confusing because, while it sounds like a generalized product that works with all forms of data as part of generalized data management strategies, the term has been more narrowly defined of late as one targeted to marketing departments’ needs.

Management, Advertising, Data Lake, Sales

Keys to Ensure that Data isn’t Slowing Down your Innovation Efforts

Cloudera

AUGUST 18, 2021

Data Lifecycle Management: The Key to AI-Driven Innovation. In digital transformation projects, it’s easy to imagine the benefits of cloud, hybrid, artificial intelligence (AI), and machine learning (ML) models. The hard part is to turn aspiration into reality by creating an organization that is truly data-driven. technologies.

Data Lake, Internet of Things, IoT, Data-driven

Celebrating Data Superheroes: The 2021 Data Impact Awards Winners

Cloudera

NOVEMBER 18, 2021

Every one of our 22 finalists is utilizing cloud technology to push next-generation data solutions to benefit the everyday people who need it most – across industries including science, health, financial services and telecommunications. taxpayer details and needs to quickly analyze petabytes of data across hundreds of servers.

Data Lake, Cost-Benefit, Digital Transformation, Risk

Breaking barriers in geospatial: Amazon Redshift, CARTO, and H3

AWS Big Data

MAY 16, 2024

To learn more details about their benefits, see Introduction to Spatial Indexes. Learn more about these differences in CARTO’s free ebook Spatial Indexes. Benefits of H3: One of the flagship examples of spatial indexes is H3, which is a hexagonal spatial index. This ensures robust data representation in all directions.

Data Warehouse, Visualization, Cost-Benefit, Data-driven

Unleashing the power of Presto: The Uber case study

IBM Big Data Hub

SEPTEMBER 25, 2023

Presto is an open source distributed SQL query engine for data analytics and the data lakehouse, designed for running interactive analytic queries against datasets of all sizes, from gigabytes to petabytes. Because of its distributed nature, Presto scales for petabytes and exabytes of data.

OLAP, Data Lake, Data-driven, Online Analytical Processing

Introducing watsonx: The future of AI for business

IBM Big Data Hub

MAY 9, 2023

For AI to be truly transformative, as many people as possible should have access to its benefits. is not just for data scientists and developers — business users can also access it via an easy-to-use interface that responds to natural language prompts for different tasks. Trust is one part of the equation. The second is access.

Data Warehouse, Machine Learning, Cost-Benefit, Metadata

Governing data in relational databases using Amazon DataZone

AWS Big Data

MAY 7, 2024

It also makes it easier for engineers, data scientists, product managers, analysts, and business users to access data throughout an organization to discover, use, and collaborate to derive data-driven insights. Note that a managed data asset is an asset for which Amazon DataZone can manage permissions.

Metadata, Data Lake, Data Processing, Data-driven

A hybrid approach in healthcare data warehousing with Amazon Redshift

AWS Big Data

FEBRUARY 21, 2023

Regardless of the division or use case it is related to, dimensional data models can be used to store data obtained from tracking various processes like patient encounters, provider practice metrics, aftercare surveys, and more. Amazon Redshift RA3 instances and Amazon Redshift Serverless are perfect choices for a data vault.

Data Warehouse, Data Lake, Cost-Benefit, Metadata

Unlock scalable analytics with a secure connectivity pattern in AWS Glue to read from or write to Snowflake

AWS Big Data

AUGUST 19, 2024

We also use Amazon S3 to store AWS Glue scripts, logs, and temporary data generated during the ETL process. This approach offers the following benefits: Enhanced security – By using PrivateLink and VPC endpoints, data transfer between Snowflake and Amazon S3 is secured within the AWS network, reducing exposure to potential security threats.

Analytics, Data-driven, Data Integration, Data Lake

The year’s top 10 enterprise AI trends — so far

CIO Business Intelligence

SEPTEMBER 21, 2023

It doesn’t matter how accurate an AI model is, or how much benefit it’ll bring to a company if the intended users refuse to have anything to do with it. “We’re still in the early phases of this,” says Donncha Carroll, partner in the revenue growth practice and head of the data science team at Lotis Blue Consulting.

Enterprise, Consulting, Modeling, Cost-Benefit

Create an end-to-end data strategy for Customer 360 on AWS

AWS Big Data

MARCH 26, 2024

The following diagram illustrates the different pipelines to ingest data from various source systems using AWS services. Data storage: Structured, semi-structured, or unstructured batch data is stored in object storage because it is cost-efficient and durable.

Data Strategy, Strategy, Data Warehouse, Prescriptive Analytics

Achieving Trusted AI in Manufacturing

Cloudera

JANUARY 30, 2024

But with this data — along with some context about the business and process — manufacturers can leverage AI as a key building block to develop and enhance operations. There are many functional areas within manufacturing where manufacturers will see AI’s massive benefits. Eliminate data silos.

Manufacturing, Contextual Data, IoT, Internet of Things

Dancing with Elephants in 5 Easy Steps

Cloudera

AUGUST 21, 2020

The Corner Office is pressing their direct reports across the company to “Move To The Cloud” to increase agility and reduce costs. a deeper cloud vs. on-prem cost/benefit analysis raises more questions about moving these complex systems to the cloud: Is moving this particular operation to the cloud the right option right now?

Big Data, Cost-Benefit, ROI, Risk

Stitch Fix seamless migration: Transitioning from self-managed Kafka to Amazon MSK

AWS Big Data

SEPTEMBER 22, 2023

At Stitch Fix, we have been powered by data science since its foundation and rely on many modern data lake and data processing technologies. In our infrastructure, Apache Kafka has emerged as a powerful tool for managing event streams and facilitating real-time data processing.

Management, Metrics, Cost-Benefit, Data Lake

How foundation models and data stores unlock the business potential of generative AI

IBM Big Data Hub

AUGUST 1, 2023

Organizations that utilize them correctly can see a myriad of benefits—from increased operational efficiency and improved decision-making to the rapid creation of marketing content. But what makes the generative functionality of these models—and, ultimately, their benefits to the organization—possible? All watsonx.ai

Modeling, Cost-Benefit, Machine Learning, Data Lake

How OLAP and AI can enable better business

IBM Big Data Hub

DECEMBER 7, 2023

They are seamlessly integrated with cloud-based data warehouses, facilitating the collection, storage and analysis of data from various sources. Challenges of adopting cloud-based OLAP solutions: Cloud adoption for OLAP databases has become common due to scalability, elasticity and cost-efficiency advantages.

OLAP, Slice and Dice, Cost-Benefit, Data Warehouse

Data architecture strategy for data quality

IBM Big Data Hub

JANUARY 5, 2023

Ill-timed business decisions and misinformed business processes, missed revenue opportunities, failed business initiatives and complex data systems can all stem from data quality issues. Several factors determine the quality of your enterprise data, such as accuracy, completeness and consistency, to name a few.

Data Architecture, Data Quality, Strategy, Data Lake

New Thinking, Old Thinking and a Fairytale

Peter James Thomas

JUNE 20, 2019

The above chart compares monthly searches for Business Process Reengineering (including its arguable rebranding as Business Transformation) and monthly searches for Data Science between 2004 and 2019. And reduced costs aren’t guaranteed […]. What was not generally accounted for were the associated intangible costs.

Cost-Benefit, Data Warehouse, Data Science, Consulting

Exploring the AI and data capabilities of watsonx

IBM Big Data Hub

JULY 17, 2023

By supporting open-source frameworks and tools for code-based, automated and visual data science capabilities — all in a secure, trusted studio environment — we’re already seeing excitement from companies ready to use both foundation models and machine learning to accomplish key tasks.

Machine Learning, Data Warehouse, Modeling, Cost-Benefit

Machine Learning and AI Underpin Predictive Analytics to Achieve Clinical Breakthroughs

Cloudera

JULY 18, 2018

To arrive at quality data, organizations are spending significant levels of effort on data integration, visualization, and deployment activities. Additionally, organizations are increasingly constrained by tight budgets and limited data science resources.

Machine Learning, Predictive Analytics, Analytics, Prescriptive Analytics

Make Better Data-Driven Decisions with DataRobot AI Platform Single-Tenant SaaS on Microsoft Azure

DataRobot Blog

MARCH 7, 2023

DataRobot is available on Azure as an AI Platform Single-Tenant SaaS, eliminating the time and cost of an on-premises implementation. The DataRobot AI Platform seamlessly integrates with Azure cloud services, including Azure Machine Learning, Azure Data Lake Storage Gen 2 (ADLS), Azure Synapse Analytics, and Azure SQL database.

Data-driven, Machine Learning, Experimentation, Data Lake

How to use foundation models and trusted governance to manage AI workflow risk

IBM Big Data Hub

OCTOBER 16, 2023

How to scale AI and ML with built-in governance: A fit-for-purpose data store built on an open lakehouse architecture allows you to scale AI and ML while providing built-in governance tools. A data store lets a business connect existing data with new data and discover new insights with real-time analytics and business intelligence.

Risk, Modeling, Management, Metadata
