Data Integration, Data Processing and Data Science

The DataOps Vendor Landscape, 2021

DataKitchen

APRIL 13, 2021

Piperr.io — Pre-built data pipelines across enterprise stakeholders, from IT to analytics, tech, data science and LoBs. Prefect Technologies — Open-source data engineering platform that builds, tests, and runs data workflows. Genie — Distributed big data orchestration service by Netflix.

Testing

Testing Machine Learning Consulting Data Science

Artificial intelligence and machine learning adoption in European enterprise

O'Reilly on Data

FEBRUARY 4, 2019

Given the end-to-end nature of many data products and applications, sustaining ML and AI requires a host of tools and processes, ranging from collecting, cleaning, and harmonizing data, understanding what data is available and who has access to it, being able to trace changes made to data as it travels across a pipeline, and many other components.

Machine Learning

Machine Learning Enterprise IoT Big Data

How EUROGATE established a data mesh architecture using Amazon DataZone

AWS Big Data

JANUARY 15, 2025

For container terminal operators, data-driven decision-making and efficient data sharing are vital to optimizing operations and boosting supply chain efficiency. Two use cases illustrate how this can be applied for business intelligence (BI) and data science applications, using AWS services such as Amazon Redshift and Amazon SageMaker.

IoT

IoT Machine Learning Metadata Data-driven

Webinars

Automation, Evolved: Your New Playbook For Smarter Knowledge Work

MORE WEBINARS

Why you should care about debugging machine learning models

O'Reilly on Data

DECEMBER 12, 2019

Security vulnerabilities : adversarial actors can compromise the confidentiality, integrity, or availability of an ML model or the data associated with the model, creating a host of undesirable outcomes. Privacy harms : models can compromise individual privacy in a long (and growing) list of ways. [8]

Machine Learning

Machine Learning Modeling Testing Risk Management

Preparing the foundations for Generative AI

CIO Business Intelligence

FEBRUARY 20, 2024

Data also needs to be sorted, annotated and labelled in order to meet the requirements of generative AI. No wonder CIO’s 2023 AI Priorities study found that data integration was the number one concern for IT leaders around generative AI integration, above security and privacy and the user experience.

Cost-Benefit

Cost-Benefit Data Lake Data Warehouse Data Processing

7 steps for turning shadow IT into a competitive edge

CIO Business Intelligence

NOVEMBER 21, 2023

After all, 41% of employees acquire, modify, or create technology outside of IT’s visibility , and 52% of respondents to EY’s Global Third-Party Risk Management Survey had an outage — and 38% reported a data breach — caused by third parties over the past two years.

IT

IT Risk Cost-Benefit Data Science

Unlock scalable analytics with a secure connectivity pattern in AWS Glue to read from or write to Snowflake

AWS Big Data

AUGUST 19, 2024

As organizations increasingly rely on data stored across various platforms, such as Snowflake , Amazon Simple Storage Service (Amazon S3), and various software as a service (SaaS) applications, the challenge of bringing these disparate data sources together has never been more pressing.

Analytics

Analytics Data-driven Data Integration Data Lake

CDOs: Your AI is smart, but your ESG is dumb. Here’s how to fix it

CIO Business Intelligence

MARCH 19, 2025

However, embedding ESG into an enterprise data strategy doesnt have to start as a C-suite directive. Developers, data architects and data engineers can initiate change at the grassroots level from integrating sustainability metrics into data models to ensuring ESG data integrity and fostering collaboration with sustainability teams.

IT

IT Data Governance Data-driven Metrics

Top 15 data management platforms

CIO Business Intelligence

JUNE 9, 2022

Others aim simply to manage the collection and integration of data, leaving the analysis and presentation work to other tools that specialize in data science and statistics. Lately a cousin of DMP has evolved, called the customer data platform (CDP). Adobe Audience Manager.

Management

Management Advertising Data Lake Sales

How to accelerate your data monetization strategy with data products and AI

IBM Big Data Hub

NOVEMBER 14, 2023

Additionally, by managing the data product as an isolated unit it can have location flexibility and portability — private or public cloud — depending on the established sensitivity and privacy controls for the data. Doing so can increase the quality of data integrated into data products.

Strategy

Strategy Data-driven Cost-Benefit Measurement

Top 15 data management platforms available today

CIO Business Intelligence

SEPTEMBER 22, 2023

Others aim simply to manage the collection and integration of data, leaving the analysis and presentation work to other tools that specialize in data science and statistics. DMP vs. CDP Lately a cousin of DMP has evolved, called the customer data platform (CDP).

Management

Management Advertising Data Lake Sales

Your Effective Roadmap To Implement A Successful Business Intelligence Strategy

datapine

FEBRUARY 22, 2022

Over the past 5 years, big data and BI became more than just data science buzzwords. Without real-time insight into their data, businesses remain reactive, miss strategic growth opportunities, lose their competitive edge, fail to take advantage of cost savings options, don’t ensure customer satisfaction… the list goes on.

Business Intelligence

Business Intelligence Strategy Cost-Benefit Dashboards

The Continuous March Towards Data Democratization

Data Virtualization

AUGUST 18, 2022

Reading Time: 5 minutes Opening the specific data view within Power BI is as simple as clicking on and opening the downloaded connection file. All the server host, ports, and database connection settings are automatically made for you so you can get on with.

Data Processing

Data Processing Data Integration Management Data Science

Themes and Conferences per Pacoid, Episode 8

Domino Data Lab

APRIL 3, 2019

The top three items are essentially “the devil you know” for firms which want to invest in data science: data platform, integration, data prep. Data governance shows up as the fourth-most-popular kind of solution that enterprise teams were adopting or evaluating during 2019. Rinse, lather, repeat.

Machine Learning

Machine Learning Data Governance Metadata Data Science

Introducing the GenAI models you haven’t heard of yet

CIO Business Intelligence

AUGUST 16, 2023

We use Microsoft, Google, Amazon, and also open source models from Hugging Face,” says Alain Biem, head of data science for the global financial information company. Another popular option is Databricks, which is a popular data pipeline platform for enterprise data science teams. Take Gorilla, for example.

Modeling

Modeling Enterprise Cost-Benefit Data Science

Stitch Fix seamless migration: Transitioning from self-managed Kafka to Amazon MSK

AWS Big Data

SEPTEMBER 22, 2023

At Stitch Fix, we have been powered by data science since its foundation and rely on many modern data lake and data processing technologies. In our infrastructure, Apache Kafka has emerged as a powerful tool for managing event streams and facilitating real-time data processing.

Management

Management Metrics Cost-Benefit Data Lake

How to choose the best AI platform

IBM Big Data Hub

OCTOBER 20, 2023

AI platforms offer a wide range of capabilities that can help organizations streamline operations, make data-driven decisions, deploy AI applications effectively and achieve competitive advantages. Visual modeling: Combine visual data science with open source libraries and notebook-based interfaces on a unified data and AI studio.

Machine Learning

Machine Learning Manufacturing Deep Learning Cost-Benefit

How to Take Back 40-60% of Your IT Spend by Fixing Your Data

Ontotext

NOVEMBER 2, 2023

Achieving this advantage is dependent on their ability to capture, connect, integrate, and convert data into insight for business decisions and processes. This is the goal of a “data-driven” organization. We call this the “ Bad Data Tax ”.

IT

IT Cost-Benefit Data-driven Technology

Create an end-to-end data strategy for Customer 360 on AWS

AWS Big Data

MARCH 26, 2024

Data ingestion You have to build ingestion pipelines based on factors like types of data sources (on-premises data stores, files, SaaS applications, third-party data), and flow of data (unbounded streams or batch data). Data exploration Data exploration helps unearth inconsistencies, outliers, or errors.

Data Strategy

Data Strategy Strategy Data Warehouse Prescriptive Analytics

Create an Apache Hudi-based near-real-time transactional data lake using AWS DMS, Amazon Kinesis, AWS Glue streaming ETL, and data visualization using Amazon QuickSight

AWS Big Data

AUGUST 3, 2023

Change data capture (CDC) is one of the most common design patterns to capture the changes made in the source database and reflect them to other data stores. a new version of AWS Glue that accelerates data integration workloads in AWS.

Data Lake

Data Lake Visualization Dashboards Insurance

Customer Experience and Emerging Technologies: My CXChat Summary on Artificial Intelligence, Machine Learning and the Customer

Business Over Broadway

MAY 22, 2019

I was invited as a guest in a weekly tweet chat that is hosted by Annette Franz and Sue Duris. Also, loyalty leaders infuse analytics into CX programs, including machine learning, data science and data integration. So, become data literate. If you have never been involved in a tweet chat, I recommend it.

Machine Learning

Machine Learning Technology Digital Transformation Data Science

Analyze Amazon S3 storage costs using AWS Cost and Usage Reports, Amazon S3 Inventory, and Amazon Athena

AWS Big Data

FEBRUARY 2, 2023

Since its launch in 2006, Amazon Simple Storage Service (Amazon S3) has experienced major growth, supporting multiple use cases such as hosting websites, creating data lakes, serving as object storage for consumer applications, storing logs, and archiving data. For Report path prefix , enter cur-data/account-cur-daily.

Reporting

Reporting Data Lake Management Optimization

The Modern Data Stack Explained: What The Future Holds

Alation

JANUARY 17, 2023

What if, experts asked, you could load raw data into a warehouse, and then empower people to transform it for their own unique needs? Today, data integration platforms like Rivery do just that. By pushing the T to the last step in the process, such products have revolutionized how data is understood and analyzed.

Data Warehouse

Data Warehouse Cost-Benefit Data Science Data Transformation

Best BI Tools For 2024 You Need to Know

FineReport

MARCH 31, 2024

Furthermore, these tools boast customization options, allowing users to tailor data sources to address areas critical to their business success, thereby generating actionable insights and customizable reports. Flexible pricing options, including self-hosted and cloud-based plans, accommodate businesses of all sizes.

Dashboards

Dashboards Visualization Data mining Data-driven

The Gartner 2022 Leadership Vision for Data and Analytics Leaders Questions and Answers

Andrew White

JANUARY 9, 2022

On Thursday January 6th I hosted Gartner’s 2022 Leadership Vision for Data and Analytics webinar. – We did some early work a few years ago that look at the career path of a CDO – see from 2016 Build Your Career Path to the Chief Data Officer Role. We write about data and analytics.

Analytics

Analytics Measurement Data-driven Modeling

How Cargotec uses metadata replication to enable cross-account data sharing

AWS Big Data

JUNE 7, 2023

For this, Cargotec built an Amazon Simple Storage Service (Amazon S3) data lake and cataloged the data assets in AWS Glue Data Catalog. They chose AWS Glue as their preferred data integration tool due to its serverless nature, low maintenance, ability to control compute resources in advance, and scale when needed.

Metadata

Metadata Data Lake Machine Learning Big Data

CIO 100 Award winners drive business results with IT

CIO Business Intelligence

AUGUST 7, 2024

But Barnett, who started work on a strategy in 2023, wanted to continue using Baptist Memorial’s on-premise data center for financial, security, and continuity reasons, so he and his team explored options that allowed for keeping that data center as part of the mix.

IT

IT Insurance Cost-Benefit Testing

Prioritizing AI investments: Balancing short-term gains with long-term vision

CIO Business Intelligence

FEBRUARY 18, 2025

The absence of known authoritative sources for something as fundamental as product data meant data fragmentation and data inaccuracies would be continually at odds with the quality of informed business decisions. A decision made with AI based on bad data is still the same bad decision without it.

Machine Learning

Machine Learning Data Quality Enterprise Sales

Data Leaders Brief

The DataOps Vendor Landscape, 2021

Artificial intelligence and machine learning adoption in European enterprise

Webinars

Trending Sources

How EUROGATE established a data mesh architecture using Amazon DataZone

Webinars

Why you should care about debugging machine learning models

Preparing the foundations for Generative AI

7 steps for turning shadow IT into a competitive edge

Unlock scalable analytics with a secure connectivity pattern in AWS Glue to read from or write to Snowflake

CDOs: Your AI is smart, but your ESG is dumb. Here’s how to fix it

Top 15 data management platforms

How to accelerate your data monetization strategy with data products and AI

Top 15 data management platforms available today

Your Effective Roadmap To Implement A Successful Business Intelligence Strategy

The Continuous March Towards Data Democratization

Themes and Conferences per Pacoid, Episode 8

Introducing the GenAI models you haven’t heard of yet

Stitch Fix seamless migration: Transitioning from self-managed Kafka to Amazon MSK

How to choose the best AI platform

How to Take Back 40-60% of Your IT Spend by Fixing Your Data

Create an end-to-end data strategy for Customer 360 on AWS

Create an Apache Hudi-based near-real-time transactional data lake using AWS DMS, Amazon Kinesis, AWS Glue streaming ETL, and data visualization using Amazon QuickSight

Customer Experience and Emerging Technologies: My CXChat Summary on Artificial Intelligence, Machine Learning and the Customer

Analyze Amazon S3 storage costs using AWS Cost and Usage Reports, Amazon S3 Inventory, and Amazon Athena

The Modern Data Stack Explained: What The Future Holds

Best BI Tools For 2024 You Need to Know

The Gartner 2022 Leadership Vision for Data and Analytics Leaders Questions and Answers

How Cargotec uses metadata replication to enable cross-account data sharing

CIO 100 Award winners drive business results with IT

Prioritizing AI investments: Balancing short-term gains with long-term vision

Stay Connected