Reading Time: 5 minutes. For years, organizations have been managing data by consolidating it into a single data repository, such as a cloud data warehouse or data lake, so it can be analyzed and delivered to business users. Unfortunately, organizations struggle to get this right.
Amazon Q data integration, introduced in January 2024, allows you to use natural language to author extract, transform, load (ETL) jobs and operations in AWS Glue's specific data abstraction, DynamicFrame. The DataFrame code generation now extends beyond AWS Glue DynamicFrame to support a broader range of data processing scenarios.
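For readers unfamiliar with the two abstractions, here is a minimal, hypothetical sketch of the kind of PySpark code such a generator might produce: reading a Glue Data Catalog table as a DynamicFrame and converting it to a native Spark DataFrame. The database, table, and column names are placeholders, and the snippet assumes it runs inside an AWS Glue job environment.

```python
# Sketch only: placeholder catalog names, runs inside an AWS Glue job.
from pyspark.context import SparkContext
from awsglue.context import GlueContext

sc = SparkContext.getOrCreate()
glue_context = GlueContext(sc)

# Read a Data Catalog table as an AWS Glue DynamicFrame
dyf = glue_context.create_dynamic_frame.from_catalog(
    database="sales_db",      # placeholder database
    table_name="orders",      # placeholder table
)

# Convert to a native Spark DataFrame to use the broader DataFrame API
df = dyf.toDF()
df = df.filter(df["order_status"] == "COMPLETED")
df.show(10)
```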
The Race For Data Quality In A Medallion Architecture: The Medallion architecture pattern is gaining traction among data teams. It is a layered approach to managing and transforming data. It sounds great, but how do you prove the data is correct at each layer? How do you ensure data quality in every layer?
Data is the most significant asset of any organization. However, enterprises often encounter challenges with data silos, insufficient access controls, poor governance, and quality issues. Embracing data as a product is the key to addressing these challenges and fostering a data-driven culture.
With the growing emphasis on data, organizations are constantly seeking more efficient and agile ways to integrate their data, especially from a wide variety of applications. In addition, as organizations rely on an increasingly diverse array of digital systems, data fragmentation has become a significant challenge.
In modern data architectures, Apache Iceberg has emerged as a popular table format for data lakes, offering key features including ACID transactions and concurrent write support. Consider a common scenario: A streaming pipeline continuously writes data to an Iceberg table while scheduled maintenance jobs perform compaction operations.
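As a rough illustration of that scenario, here is a minimal sketch of a scheduled maintenance job that compacts an Iceberg table with Spark's stored procedure syntax. It assumes a Spark session configured with the Iceberg runtime and a catalog registered as "glue_catalog"; the table name and option values are placeholders.

```python
# Sketch only: assumes Iceberg Spark extensions and a catalog named "glue_catalog".
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("iceberg-compaction").getOrCreate()

# Compact the small files produced by the streaming writer into larger ones.
# Iceberg's optimistic concurrency lets this run while writes continue; a
# conflicting commit is retried or rejected rather than corrupting the table.
spark.sql("""
    CALL glue_catalog.system.rewrite_data_files(
        table => 'db.events',
        options => map('min-input-files', '5')
    )
""")
```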
In our previous article, What You Need to Know About Product Management for AI, we discussed the need for an AI Product Manager. In this article, we shift our focus to the AI Product Manager’s skill set, as it is applied to day-to-day work in the design, development, and maintenance of AI products. The AI Product Pipeline.
Machine learning solutions for data integration, cleaning, and data generation are beginning to emerge. “AI starts with ‘good’ data” is a statement that receives wide agreement from data scientists, analysts, and business owners. The problem is even more magnified in the case of structured enterprise data.
Amazon Kinesis Data Streams is a serverless data streaming service that makes it straightforward to capture and store streaming data at any scale. Thousands of AWS customers use the Kinesis Client Library (KCL) to operate custom stream processing applications with Kinesis Data Streams without worrying about the complexities of distributed systems.
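For context on what putting data onto a stream looks like, here is a minimal producer sketch using boto3. The region, stream name, and record payload are placeholders; real producers would batch records and handle throttling.

```python
# Sketch only: placeholder stream name and payload.
import json
import boto3

kinesis = boto3.client("kinesis", region_name="us-east-1")

record = {"user_id": "u-123", "event": "page_view", "ts": "2024-01-01T00:00:00Z"}

# The partition key determines which shard receives the record
response = kinesis.put_record(
    StreamName="clickstream",
    Data=json.dumps(record).encode("utf-8"),
    PartitionKey=record["user_id"],
)
print(response["SequenceNumber"])
```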
It’s also the data source for our annual usage study, which examines the most-used topics and the top search terms [1]. This year’s growth in Python usage was buoyed by its increasing popularity among data scientists and machine learning (ML) and artificial intelligence (AI) engineers.
Management reporting is a source of business intelligence that helps business leaders make more accurate, data-driven decisions. But, these reports are only as useful as the work that goes into preparing and presenting them. What Is A Management Report?
Experts predict that by 2025, around 175 Zettabytes of data will be generated annually, according to research from Seagate. But with so much data available from an ever-growing range of sources, how do you make sense of this information – and how do you extract value from it? Looking for a bite-sized introduction to reporting?
In recent years, analytical reporting has evolved into one of the world’s most important business intelligence components, compelling companies to adapt their strategies based on powerful data-driven insights. The American Journal of Managed Care even stated in its own research that the total wait time is 121 minutes.
The term ‘big data’ alone has become something of a buzzword in recent times – and for good reason. By implementing the right reporting tools and understanding how to analyze and measure your data accurately, you will be able to make the kind of data-driven decisions that will drive your business forward.
That’s why we have prepared a list of the most prominent business intelligence buzzwords that will dominate in 2020. Predictive analytics is the practice of extracting information from existing data sets in order to forecast future probabilities. The accuracy of the predictions depends on the data used to create the model.
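To make the predictive analytics definition above concrete, here is a minimal sketch: fit a model on historical data, then use it to forecast future values. The data is synthetic and purely illustrative; as the teaser notes, prediction accuracy depends entirely on the data used to train the model.

```python
# Sketch only: synthetic "historical revenue" data, illustrative model.
import numpy as np
from sklearn.linear_model import LinearRegression

# Historical monthly revenue (feature: month index; target: revenue)
months = np.arange(1, 25).reshape(-1, 1)
revenue = 100 + 5 * months.ravel() + np.random.normal(0, 3, size=24)

model = LinearRegression().fit(months, revenue)

# Forecast the next three months
future = np.array([[25], [26], [27]])
print(model.predict(future))
```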
Data is the foundation of innovation, agility and competitive advantage in today’s digital economy. As technology and business leaders, your strategic initiatives, from AI-powered decision-making to predictive insights and personalized experiences, are all fueled by data. Data quality is no longer a back-office concern.
“The goal is to turn data into information, and information into insight.” – Carly Fiorina, former executive, president, HP. Digital data is all around us. We generate quintillions of bytes of data every single day, with 90% of the world’s digital insights generated in the last two years alone, according to Forbes.
Additionally, this forecasting system needs to provide data enrichment steps, including byproducts, serve as the master data for semiconductor management, and enable further use cases at the BMW Group. To enable this use case, we used the BMW Group’s cloud-native data platform called the Cloud Data Hub.
Data organizations don’t always have the budget or schedule required for DataOps when conceived as a top-to-bottom, enterprise-wide transformational change. DataOps can and should be implemented in small steps that complement and build upon existing workflows and data pipelines. Figure 1: The four phases of Lean DataOps.
A comprehensive regulatory reach: DORA addresses a broad range of ICT risks, including incident response, resilience testing, third-party risk management, and information sharing. Proactive preparation with AI-powered solutions: With DORA’s deadline quickly approaching, preparing for DORA is critical.
Amazon DataZone, a fully managed data management service, helps organizations catalog, discover, analyze, share, and govern data between data producers and consumers. We are excited to announce the introduction of advanced search filtering capabilities in the Amazon DataZone business data catalog.
The Ten Standard Tools To Develop Data Pipelines In Microsoft Azure. While working in Azure with our customers, we have noticed several standard Azure tools people use to develop data pipelines and ETL or ELT processes. We counted ten ‘standard’ ways to transform and set up batch data pipelines in Microsoft Azure.
Tidying up your data is part science, part art, and all work. If you’re lucky, you’ll get your hands on some perfectly formatted data (Slack does a nice job, for example). But more often than not, you’ll need to do some data cleaning before it is ready for analysis. Often a data set will have lots of columns.
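A small data-cleaning sketch with pandas illustrates the kind of tidying involved: dropping unneeded columns, normalizing types, and removing duplicates. The file and column names here are hypothetical, not taken from the article.

```python
# Sketch only: hypothetical export file and column names.
import pandas as pd

df = pd.read_csv("export.csv")  # e.g. a raw app or Slack export

# Drop columns that aren't needed for the analysis
df = df.drop(columns=["internal_id", "debug_flags"], errors="ignore")

# Normalize types and handle missing values
df["created_at"] = pd.to_datetime(df["created_at"], errors="coerce")
df["channel"] = df["channel"].fillna("unknown").str.strip().str.lower()

# Remove exact duplicates before analysis
df = df.drop_duplicates()
print(df.info())
```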
It enables faster and more accurate diagnosis through advanced imaging and data analysis, helping doctors identify diseases earlier and more precisely. Beyond patient care, AI is transforming the way healthcare organizations manage their workforce. This year, too, it will be full of exciting solutions and ideas.
If data is the new oil, it’s only useful once it’s been refined. Touted as revolutionary a decade ago, SSBI solutions were intended to take data insights out of the preserve of data scientists and put them within reach for every stakeholder. According to McKinsey, GenAI could bring savings opportunities of up to $2.6
Now you can author data preparation transformations and edit them with the AWS Glue Studio visual editor. The AWS Glue Studio visual editor is a graphical interface that enables you to create, run, and monitor data integration jobs in AWS Glue. In this scenario, you’re a data analyst in this company.
Many customers find the sweet spot in combining them with similar low-code/no-code tools for data integration and management to quickly automate standard tasks and experiment with new services. Customers also report they help business users quickly test new services, tweak user interfaces, and deliver new functionality.
In fact, a survey about management reports performed by Deloitte says that 50% of managers are unsatisfied with the speed of delivery and the quality of the reports they receive. A differentiating characteristic of these reports is their objectivity: they are meant only to inform, not to propose solutions or hypotheses.
Jurgen Mueller, SAP CTO and executive board member, called the innovations, which include an expanded partnership with data governance specialist Collibra, a “quantum leap” in the company’s ability to help customers drive intelligent business transformation through data. With today’s announcements, SAP is building on that vision.
Amazon Redshift is a cloud data warehousing service that provides high-performance analytical processing based on a massively parallel processing (MPP) architecture. Building and maintaining data pipelines is a common challenge for all enterprises. All the connection profiles are configured within the dbt profiles.yml file.
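Since the teaser mentions configuring connection profiles in dbt's profiles.yml, here is a rough sketch of what a Redshift profile typically contains, written out with PyYAML for illustration. The project name, host, credentials, and schema are all placeholders; in practice this YAML lives in the profiles.yml file rather than being generated in code.

```python
# Sketch only: placeholder project, host, and credentials.
import yaml

profile = {
    "my_redshift_project": {
        "target": "dev",
        "outputs": {
            "dev": {
                "type": "redshift",
                "host": "example-cluster.abc123.us-east-1.redshift.amazonaws.com",
                "user": "dbt_user",
                "password": "{{ env_var('DBT_PASSWORD') }}",  # resolved by dbt at runtime
                "port": 5439,
                "dbname": "analytics",
                "schema": "dbt_dev",
                "threads": 4,
            }
        },
    }
}

# Print the YAML that would go into ~/.dbt/profiles.yml
print(yaml.safe_dump(profile, sort_keys=False))
```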
Synthetic data defined: Synthetic data is artificially generated information that can be used in place of real historic data to train AI models when actual data sets are lacking in quality, volume, or variety. Synthetic data use cases: Artificial data has many uses in enterprise AI strategies.
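As a toy illustration of the idea, the sketch below generates artificial tabular records with NumPy and pandas. The distributions and column names are arbitrary assumptions, and this is not a substitute for purpose-built synthetic-data tooling that preserves the statistical properties of real data.

```python
# Sketch only: arbitrary distributions and column names.
import numpy as np
import pandas as pd

rng = np.random.default_rng(seed=42)
n = 1_000

# Fabricated customer-like records for model experimentation
synthetic = pd.DataFrame({
    "age": rng.integers(18, 80, size=n),
    "annual_income": rng.lognormal(mean=10.5, sigma=0.4, size=n).round(2),
    "churned": rng.binomial(1, p=0.15, size=n),
})

print(synthetic.describe())
```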
As companies continue to expand their digital footprint, the importance of real-time data processing and analysis cannot be overstated. The ability to quickly measure and draw insights from data is critical in today’s business landscape, where rapid decision-making is key.
ActionIQ is a leading composable customer data platform (CDP) designed for enterprise brands to grow faster and deliver meaningful experiences for their customers. Organizations are demanding secure, cost-efficient, and time-efficient solutions to power their marketing outcomes.
Seamlessly integrating GTP with custom SAP software, which provides the backbone of Applied Materials’ project, ensures accurate and up-to-date information on inventory levels, stock movements, and order fulfillment, says Hari Lakshminarayanan, who, as managing director of IT solutions management at Applied Materials, led the LCS project.
SikSin confronted two business challenges: Customer engagement – SikSin maintains data on more than 750,000 restaurants and has more than 4,000 restaurant articles (and growing). Data analysis activities – The SikSin Food Service team experienced difficulties with report generation due to scattered data across multiple systems.
The Common Crawl corpus contains petabytes of data, regularly collected since 2008, including raw webpage data, metadata extracts, and text extracts. In addition to determining which dataset should be used, cleansing and processing the data to meet the specific needs of fine-tuning is required.
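The sketch below shows the flavor of such cleansing on raw web text ahead of fine-tuning: stripping markup, normalizing whitespace, and dropping near-empty or duplicate documents. The regex-based HTML stripping and the length threshold are simplifications for illustration, not the pipeline described in the article.

```python
# Sketch only: simplified cleansing; thresholds are arbitrary illustrations.
import re

def clean_document(raw_html: str) -> str:
    text = re.sub(r"<[^>]+>", " ", raw_html)  # crude removal of HTML tags
    text = re.sub(r"\s+", " ", text)          # collapse runs of whitespace
    return text.strip()

def filter_corpus(docs):
    seen = set()
    for doc in map(clean_document, docs):
        if len(doc) < 200:   # too short to be useful for fine-tuning
            continue
        if doc in seen:      # exact duplicate
            continue
        seen.add(doc)
        yield doc

# Usage: cleaned = list(filter_corpus(raw_pages))
```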
It seamlessly consolidates data from various data sources within AWS, including AWS Cost Explorer (and forecasting with Cost Explorer), AWS Trusted Advisor, and AWS Compute Optimizer. Overview of the BMW Cloud Data Hub: At the BMW Group, Cloud Data Hub (CDH) is the central platform for managing company-wide data and data solutions.
Digital data, by its very nature, paints a clear, concise, and panoramic picture of a number of vital areas of business performance, offering a window of insight that often leads to creating an enhanced business intelligence strategy and, ultimately, ongoing commercial success. billion, growing at a CAGR of 26.98% from 2016.
Big data is becoming very important for companies all over the world. They need to make sure that they utilize their data wisely, because it is one of the most important assets at their disposal. There are a lot of things that companies need to take into consideration when managing their data.
Today, SAP and DataRobot announced a joint partnership to enable customers to connect core SAP software, containing mission-critical business data, with the advanced machine learning capabilities of DataRobot to make more intelligent business predictions with advanced analytics. Tune in to learn more. Registration is free for both events.
With this new instance family, OpenSearch Service uses OpenSearch innovation and AWS technologies to reimagine how data is indexed and stored in the cloud. Today, customers widely use OpenSearch Service for operational analytics because of its ability to ingest high volumes of data while also providing rich and interactive analytics.
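For a sense of what ingesting operational data into OpenSearch Service looks like at the API level, here is a minimal indexing sketch using the opensearch-py client. The domain endpoint, credentials, index name, and document fields are placeholders; high-volume pipelines would use the bulk APIs instead of single-document indexing.

```python
# Sketch only: placeholder endpoint, credentials, index, and document.
from opensearchpy import OpenSearch

client = OpenSearch(
    hosts=[{"host": "search-example.us-east-1.es.amazonaws.com", "port": 443}],
    http_auth=("admin", "admin-password"),
    use_ssl=True,
)

doc = {"service": "checkout", "level": "ERROR", "message": "timeout calling payments"}

# Index a single operational log event
response = client.index(index="app-logs-2024.01", body=doc)
print(response["result"])
```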
Power BI is Microsoft’s interactive data visualization and analytics tool for business intelligence (BI). With Power BI, you can pull data from almost any data source and create dashboards that track the metrics you care about the most. Power BI’s rich reports or dashboards can be embedded into reporting portals you already use.
In this post, we delve into the key aspects of using Amazon EMR for modern data management, covering topics such as data governance, data mesh deployment, and streamlined data discovery. Organizations have multiple Hive data warehouses across EMR clusters, where the metadata gets generated.
This is a guest post co-written with Tyler Middleton, Experian Senior Partner Marketing Manager, and Jay Rakhe, Experian Group Product Manager. As the data privacy landscape continues to evolve, companies are increasingly seeking ways to collect and manage data while protecting privacy and intellectual property.