ISG's Market Lens Cloud Study illustrates the extent to which the database market is now dominated by cloud, with 58% of participants deploying more than half of their database and data platform workloads on cloud. Also extended is MongoDB's Queryable Encryption capability, which was introduced in 2023.
Initially, data warehouses were the go-to solution for structured data and analytical workloads, but they were limited by proprietary storage formats and their inability to handle unstructured data. Eventually, transactional data lakes emerged to bring the transactional consistency and performance of a data warehouse to the data lake.
Amazon Redshift enables you to efficiently query and retrieve structured and semi-structured data from open-format files in an Amazon S3 data lake without having to load the data into Amazon Redshift tables. Amazon Redshift extends SQL capabilities to your data lake, enabling you to run analytical queries.
Since the deluge of big data over a decade ago, many organizations have learned to build applications to process and analyze petabytes of data. Data lakes have served as a central repository to store structured and unstructured data at any scale and in various formats.
Data management is the foundation of quantitative research. Our experiments are based on real-world historical full order book data, provided by our partner CryptoStruct, and compare the trade-offs between these choices, focusing on performance, cost, and quant developer productivity. A typical aggregation looks like groupBy("exchange_code", "instrument").count().orderBy("count", ascending=False).
Outdated software applications are creating roadblocks to AI adoption at many organizations, with limited data retention capabilities a central culprit, IT experts say. Moreover, maintaining outdated software can be expensive, with a shrinking number of software engineers familiar with the apps, he says.
Iceberg has become very popular for its support for ACID transactions in data lakes and features like schema and partition evolution, time travel, and rollback. AWS Glue 3.0 and later supports the Apache Iceberg framework for data lakes. The following diagram illustrates the solution architecture.
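To make the named features concrete, here is a minimal sketch of what they look like as Spark SQL in a Glue 3.0+ job. The catalog, database, table, and column names, the timestamp, and the snapshot ID are all hypothetical examples, not taken from the post.

```python
# Sketch of the Iceberg features mentioned above, as Spark SQL strings you
# might submit from a Glue job. All identifiers below are invented examples.

# Enabling Iceberg in an AWS Glue job is done via a job parameter:
GLUE_JOB_ARGS = {"--datalake-formats": "iceberg"}

# Schema evolution: add a column without rewriting data files.
SCHEMA_EVOLUTION = "ALTER TABLE glue_catalog.db.events ADD COLUMN session_id string;"

# Time travel: query the table as of a past point in time.
TIME_TRAVEL = """
SELECT * FROM glue_catalog.db.events
FOR SYSTEM_TIME AS OF TIMESTAMP '2023-01-01 00:00:00';
"""

# Rollback: restore the table to an earlier snapshot (hypothetical snapshot ID).
ROLLBACK = "CALL glue_catalog.system.rollback_to_snapshot('db.events', 12345);"
```

Partition evolution works the same way via `ALTER TABLE ... ADD PARTITION FIELD`; all of these are metadata operations, which is what makes them cheap on Iceberg tables.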
The omnichannel used-car retailer was putting AI to work for business value even before ChatGPT became a household name. That is why it earned a coveted spot on the 2023 CIO 100 Award list: for its early, innovative use of a nascent AI technology that led to a spike in page views as well as higher SEO ranking and placement that drove substantial business growth.
Building a data lake on Amazon Simple Storage Service (Amazon S3) provides numerous benefits for an organization. However, many use cases, like performing change data capture (CDC) from an upstream relational database to an Amazon S3-based data lake, require handling data at a record level.
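One common way to get record-level handling on S3 (not necessarily the approach the post itself takes) is to store the data in an open table format such as Apache Iceberg and apply CDC batches with a MERGE statement. The table, view, and column names below are hypothetical.

```python
# Sketch: applying a batch of CDC changes (inserts, updates, deletes) to an
# Iceberg table at record level. "staged_changes" is a hypothetical staging
# view of CDC rows, with an "op" column marking deletes.

MERGE_CDC = """
MERGE INTO lake.customers AS t
USING staged_changes AS s
ON t.customer_id = s.customer_id
WHEN MATCHED AND s.op = 'D' THEN DELETE
WHEN MATCHED THEN UPDATE SET *
WHEN NOT MATCHED THEN INSERT *;
"""
```

Plain Parquet files on S3 cannot do this, because objects are immutable; the table format tracks which files hold which records and rewrites only what changed.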
My role was to talk about the trends and opportunities for 2023, for customers, SAP, and our partners. IDC calls it the Future Enterprise , Forrester talks about Future Fit organizations, and Gartner explains the benefits of the Composable Enterprise. Innovating Faster. Analysis to Action. It’s all about profits AND purpose.
Save the date: AWS re:Invent 2023 is happening from November 27 to December 1 in Las Vegas, and you cannot miss it. Reserve your seat now! In today’s data-driven landscape, the quality of data is the foundation upon which the success of organizations and innovations stands.
As a result, you gain the benefit of higher availability, better performance, and lower cost for your AWS Glue for Apache Spark workload. Use case: a typical workload for AWS Glue for Apache Spark jobs is to load data from a relational database to a data lake with SQL-based transformations. Check it out!
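The described workload can be sketched in a few lines of PySpark: read a table over JDBC, transform it with SQL, and write Parquet to a data lake path. This is a generic sketch, not the post's actual job; the JDBC URL, table, query, and S3 path are all caller-supplied or invented, and running it requires a Spark environment such as a Glue job.

```python
def load_to_datalake(jdbc_url: str, table: str, s3_path: str) -> None:
    """Sketch of a Glue-for-Spark-style job: relational DB -> SQL transform -> data lake.

    All arguments are hypothetical examples; this only runs inside a Spark
    environment (e.g., an AWS Glue job) with a reachable JDBC source.
    """
    from pyspark.sql import SparkSession  # provided by the Glue/Spark runtime

    spark = SparkSession.builder.getOrCreate()

    # Read the source table over JDBC.
    df = (
        spark.read.format("jdbc")
        .option("url", jdbc_url)
        .option("dbtable", table)
        .load()
    )

    # SQL-based transformation over a temp view (example query).
    df.createOrReplaceTempView("src")
    transformed = spark.sql(
        "SELECT id, UPPER(status) AS status, amount FROM src WHERE amount > 0"
    )

    # Write the result to the data lake as Parquet.
    transformed.write.mode("overwrite").format("parquet").save(s3_path)
```

In a real Glue job you would typically use Glue's own connections and the Data Catalog rather than raw JDBC options, but the read/transform/write shape is the same.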
Data also needs to be sorted, annotated, and labelled in order to meet the requirements of generative AI. No wonder CIO’s 2023 AI Priorities study found that data integration was the number one concern for IT leaders around generative AI integration, above security and privacy and the user experience.
In our previous post Improve operational efficiencies of Apache Iceberg tables built on Amazon S3 data lakes, we discussed how you can implement solutions to improve operational efficiencies of your Amazon Simple Storage Service (Amazon S3) data lake that uses the Apache Iceberg open table format and runs on the Amazon EMR big data platform.
Microsoft itself claims half of Fortune 500 companies use its Copilot tools and that the number of daily users doubled in Q4 2023, although without saying how widely they’re deployed in those organizations. The cost of OpenAI is the same whether you buy it directly or through Azure. Although competitors have similar model gardens, at 13.8%
All of this needs to work cohesively in a real-time ecosystem and support the speed and scale necessary to realize the business benefits of real-time AI. Most current data architectures were designed for batch processing, with analytics and machine learning models running on data warehouses and data lakes.
Currently, we have approximately 120,000 employees worldwide (as of March 2023), including group companies. To achieve data-driven management, we built OneData, a data utilization platform used in the four global AWS Regions, which started operation in April 2022. We use AWS Glue to preprocess, cleanse, and enrich data.
Amazon Redshift integrates with AWS HealthLake and data lakes through Redshift Spectrum and Amazon S3 auto-copy features, enabling you to query data directly from files on Amazon S3. This means you no longer have to create an external schema in Amazon Redshift to use the data lake tables cataloged in the Data Catalog.
Presto is an open source distributed SQL query engine for data analytics and the data lakehouse, designed for running interactive analytic queries against datasets of all sizes, from gigabytes to petabytes. Because of its distributed nature, Presto scales for petabytes and exabytes of data.
For AI to be truly transformative, as many people as possible should have access to its benefits. It is not just for data scientists and developers; business users can also access it via an easy-to-use interface that responds to natural language prompts for different tasks. Trust is one part of the equation. The second is access.
To handle the huge volume of data thus generated, the company is in the process of deploying a data lake, data warehouse, and real-time analytical tools in a hybrid model. The project, expected to cost US$400,000, will initially be piloted at the Bangalore amusement park in 2023.
The CBO (cost-based optimizer) plan shows that only the year_total_mv1 materialized view is scanned, with a filter condition applied, since the range filter in the query is a subset of the range in the materialized view. Furthermore, it is partitioned on the d_year column.
Sun Country Airlines has elevated its customer service since hiring an experienced CIO from United Airlines in early 2023. Digitizing these customer services not only yielded cost savings and greater efficiencies, Stathopoulos says, but the self-service options also free up staff and “deflect” calls away from the contact center and airports.
It doesn’t matter how accurate an AI model is, or how much benefit it will bring to a company, if the intended users refuse to have anything to do with it. To make all this possible, the data had to be collected, processed, and fed into the systems that needed it in a reliable, efficient, scalable, and secure way.
In the era of data, organizations are increasingly using data lakes to store and analyze vast amounts of structured and unstructured data. Data lakes provide a centralized repository for data from various sources, enabling organizations to unlock valuable insights and drive data-driven decision-making.
“Always the gatekeepers of much of the data necessary for ESG reporting, CIOs are finding that companies are even more dependent on them,” says Nancy Mentesana, ESG executive director at Labrador US, a global communications firm focused on corporate disclosure documents.
You can then run enhanced analysis on this DynamoDB data with the rich capabilities of Amazon Redshift, such as high-performance SQL, built-in machine learning (ML) and Spark integrations, materialized views (MV) with automatic and incremental refresh, data sharing, and the ability to join data across multiple data stores and data lakes.
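Of the capabilities listed, materialized views with automatic refresh are the easiest to show in a small sketch. The view, source table, and column names below are hypothetical, and the statement assumes the DynamoDB data has already landed in a Redshift-queryable table.

```python
# Sketch: a Redshift materialized view with automatic refresh over data
# ingested from DynamoDB. "dynamodb_orders" and all columns are invented names.

MV_SQL = """
CREATE MATERIALIZED VIEW orders_by_day
AUTO REFRESH YES
AS
SELECT order_date,
       COUNT(*)    AS orders,
       SUM(amount) AS revenue
FROM dynamodb_orders
GROUP BY order_date;
"""
```

With `AUTO REFRESH YES`, Redshift keeps the aggregate up to date as new rows arrive, so dashboards query the precomputed view instead of rescanning the base table.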
This involves unifying and sharing a single copy of data and metadata across IBM® watsonx.data ™, IBM® Db2 ®, IBM® Db2® Warehouse and IBM® Netezza ®, using native integrations and supporting open formats, all without the need for migration or recataloging.
The rule requires health insurers to provide clear and concise information to consumers about their health plan benefits, including costs and coverage details. To process workloads larger than 20 GB, these machines need to be scaled vertically, thereby significantly increasing hardware costs.
DataRobot is available on Azure as an AI Platform Single-Tenant SaaS, eliminating the time and cost of an on-premises implementation. The DataRobot AI Platform seamlessly integrates with Azure cloud services, including Azure Machine Learning, Azure Data Lake Storage Gen2 (ADLS), Azure Synapse Analytics, and Azure SQL Database.
Ten years ago, we launched Amazon Kinesis Data Streams, the first cloud-native serverless streaming data service, to serve as the backbone for companies to move data across system boundaries and break down data silos. This is why Kinesis Data Streams is a good fit.
Sean Im, CEO, Samsung SDS America: “In the field of generative AI and foundation models, watsonx is a platform that will enable us to meet our customers’ requirements in terms of optimization and security, while allowing them to benefit from the dynamism and innovations of the open-source community.”
In this post, I’ll examine data marketplaces and the related concepts of infonomics, data valuation, data monetization and data value scoring. You’ll see the benefits your organization can derive from its own data and the central role that your data intelligence software plays in the effort.
Showpad also struggled with data quality issues in terms of consistency, ownership, and insufficient data access across its targeted user base, due to a complex BI access process, licensing challenges, and limited education. As of January 2023, Showpad’s QuickSight instance includes 2,433 datasets and 199 dashboards.
Now fully deployed, TCS is seeing the benefits. But Barnett, who started work on a strategy in 2023, wanted to continue using Baptist Memorial’s on-premises data center for financial, security, and continuity reasons, so he and his team explored options that allowed for keeping that data center as part of the mix.
But the constant noise around the topic – from cost-benefit analyses to sales pitches to technical overviews – has led to information overload. What are the best practices for analyzing cloud ERP data? Data Management: How do we create a data warehouse or data lake in the cloud using our cloud ERP?
Amazon S3 Glacier serves several important audit use cases, particularly for organizations that need to retain data for extended periods due to regulatory compliance, legal requirements, or internal policies. Its low-cost storage model makes it economically feasible to store vast amounts of historical data for extended periods of time.
Data lakes were originally designed to store large volumes of raw, unstructured, or semi-structured data at low cost, primarily serving big data and analytics use cases. Enabling automatic compaction on Iceberg tables reduces metadata overhead and improves query performance.
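As a sketch of how automatic compaction can be enabled programmatically, the function below uses the AWS Glue CreateTableOptimizer API via boto3. The database, table, role ARN, and account ID are all placeholder arguments; actually calling this requires AWS credentials and an existing Iceberg table registered in the Glue Data Catalog.

```python
def enable_compaction(database: str, table: str, role_arn: str, account_id: str):
    """Sketch: turn on Glue-managed automatic compaction for an Iceberg table.

    All argument values are hypothetical; invoking this needs real AWS
    credentials and permissions for the Glue CreateTableOptimizer API.
    """
    import boto3  # imported here so the sketch can be defined without the SDK

    glue = boto3.client("glue")
    # Register a "compaction" optimizer that Glue runs on the table's behalf,
    # using the given IAM role to read and rewrite the table's data files.
    return glue.create_table_optimizer(
        CatalogId=account_id,
        DatabaseName=database,
        TableName=table,
        Type="compaction",
        TableOptimizerConfiguration={"roleArn": role_arn, "enabled": True},
    )
```

Compaction merges many small data files into fewer large ones, which is why it shrinks the metadata a query has to plan over.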
Corporate data is gold, and DBAs are its stewards. That’s reflected in employment statistics for database administrators and architects, positions projected to grow nine percent from 2023 to 2033, much faster than the average for all occupations. Data is likewise growing at an exponential rate.
To remain ahead, companies are transitioning away from SAP BPC due to high costs, an unfriendly UI, and heavy dependence on technical teams, which slows down budget and close cycles. It offers the following benefits to modern finance teams, and was recognized for Ease of Use in the latest BPM Pulse Survey 2023.