Here, CIO Patrick Piccininno provides a roadmap of his journey from data with no integration to meaningful dashboards, insights, and a data-literate culture. You’re building an enterprise data platform for the first time in Sevita’s history. Second, the manual spreadsheet work required significant hand-keyed data entry.
As technology and business leaders, your strategic initiatives, from AI-powered decision-making to predictive insights and personalized experiences, are all fueled by data. Yet, despite growing investments in advanced analytics and AI, organizations continue to grapple with a persistent and often underestimated challenge: poor data quality.
Amazon SageMaker Lakehouse, now generally available, unifies all your data across Amazon Simple Storage Service (Amazon S3) data lakes and Amazon Redshift data warehouses, helping you build powerful analytics and AI/ML applications on a single copy of data. Having confidence in your data is key.
Data lakes are centralized repositories that can store all structured and unstructured data at any desired scale. The power of the data lake lies in the fact that it is often a cost-effective way to store data.
Today, customers are embarking on data modernization programs by migrating on-premises data warehouses and data lakes to the AWS Cloud to take advantage of the scale and advanced analytical capabilities of the cloud. Some customers build custom in-house data parity frameworks to validate data during migration.
The data mesh design pattern breaks giant, monolithic enterprise data architectures into subsystems or domains, each managed by a dedicated team. But first, let’s define the data mesh design pattern. The past decades of enterprise data platform architectures can be summarized in 69 words.
Data governance is the process of ensuring the integrity, availability, usability, and security of an organization’s data. Due to the volume, velocity, and variety of data being ingested in data lakes, it can get challenging to develop and maintain policies and procedures to ensure data governance at scale for your data lake.
There’s no shortage of consultants who will promise to manage the end-to-end lifecycle of data from integration to transformation to visualization. The challenge is that data engineering and analytics are incredibly complex. Ensuring that data is available, secure, correct, and fit for purpose is neither simple nor cheap.
We are excited to announce the General Availability of AWS Glue Data Quality. Our journey started by working backward from our customers who create, manage, and operate data lakes and data warehouses for analytics and machine learning. It takes days for data engineers to identify and implement data quality rules.
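To make the idea of data quality rules concrete, here is a minimal sketch of composing a ruleset in DQDL (AWS Glue Data Quality's rule language) from Python. The table and column names (`orders`, `order_id`, `quantity`) are invented for illustration, and the boto3 registration call is shown only as a comment since it requires a live AWS account.

```python
# Hypothetical example: a small DQDL ruleset for an invented "orders" table.
# Rule types shown (IsComplete, Uniqueness, ColumnValues) are standard DQDL.
ruleset = """
Rules = [
    IsComplete "order_id",
    Uniqueness "order_id" > 0.99,
    ColumnValues "quantity" > 0
]
""".strip()

# In a real account you would register it via the Glue API, e.g.:
# import boto3
# glue = boto3.client("glue")
# glue.create_data_quality_ruleset(
#     Name="orders-ruleset",
#     Ruleset=ruleset,
#     TargetTable={"DatabaseName": "sales", "TableName": "orders"},
# )

print(ruleset)
```

The ruleset string itself is the portable artifact: it can be versioned alongside ETL code and attached to multiple jobs or tables.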
With this new functionality, customers can create up-to-date replicas of their data from applications such as Salesforce, ServiceNow, and Zendesk in an Amazon SageMaker Lakehouse and Amazon Redshift. SageMaker Lakehouse gives you the flexibility to access and query your data in-place with all Apache Iceberg compatible tools and engines.
The sheer scale of data being captured by the modern enterprise has necessitated a monumental shift in how that data is stored. What was at first a data stream has morphed into a data river as enterprise businesses are harvesting reams of data from every conceivable input across every conceivable business function.
We can use foundation models to quickly perform tasks with limited annotated data and minimal effort; in some cases, we need only to describe the task at hand to coax the model into solving it. But these powerful technologies also introduce new risks and challenges for enterprises.
Domain ownership recognizes that the teams generating the data have the deepest understanding of it and are therefore best suited to manage, govern, and share it effectively. This principle ensures data accountability remains close to the source, fostering higher data quality and relevance.
Between building gen AI features into almost every enterprise tool it offers, adding the most popular gen AI developer tool to GitHub — GitHub Copilot is already bigger than GitHub when Microsoft bought it — and running the cloud powering OpenAI, Microsoft has taken a commanding lead in enterprise gen AI.
Applying artificial intelligence (AI) to data analytics for deeper, better insights and automation is a growing enterprise IT priority. But the data repository options that have been around for a while tend to fall short in their ability to serve as the foundation for big data analytics powered by AI.
Just after launching a focused data management platform for retail customers in March, enterprise data management vendor Informatica has now released two more industry-specific versions of its Intelligent Data Management Cloud (IDMC) — one for financial services, and the other for health and life sciences.
On the agribusiness side we source, purchase, and process agricultural commodities and offer a diverse portfolio of products including grains, soybean meal, blended feed ingredients, and top-quality oils for the food industry to add value to the commodities our customers desire. The data can also help us enrich our commodity products.
Starting on a solid data foundation Before choosing a platform for sharing data, an organization needs to understand what data it already has and strip it of errors and duplicates. Data formats and data architectures are often inconsistent, and data might even be incomplete. “They have data swamps,” he says.
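The "strip it of errors and duplicates" step above can be sketched in a few lines of plain Python. This is a minimal illustration, not a production cleaner; the record shape (customer rows keyed by a normalized email) is an assumption made for the example.

```python
# Minimal sketch: drop incomplete records, then deduplicate on a normalized key.
def clean_records(records):
    """Discard rows missing required fields; keep the first copy per email."""
    seen = set()
    cleaned = []
    for rec in records:
        email = (rec.get("email") or "").strip().lower()  # normalize the key
        name = (rec.get("name") or "").strip()
        if not email or not name:   # incomplete record: discard
            continue
        if email in seen:           # duplicate: keep first occurrence only
            continue
        seen.add(email)
        cleaned.append({"email": email, "name": name})
    return cleaned

raw = [
    {"email": "A@x.com", "name": "Ann"},
    {"email": "a@x.com", "name": "Ann"},   # duplicate after normalization
    {"email": "", "name": "Bob"},          # incomplete
]
print(clean_records(raw))  # → [{'email': 'a@x.com', 'name': 'Ann'}]
```

Normalizing before deduplicating matters: without the `.lower()` step, the two "Ann" rows would survive as distinct records.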
Data is your generative AI differentiator, and a successful generative AI implementation depends on a robust data strategy incorporating a comprehensive data governance approach. Data governance is a critical building block across all these approaches, and we see two emerging areas of focus.
Poor data quality is one of the top barriers faced by organizations aspiring to be more data-driven. Ill-timed business decisions and misinformed business processes, missed revenue opportunities, failed business initiatives and complex data systems can all stem from data quality issues.
Q: Is data modeling cool again? In today’s fast-paced digital landscape, data reigns supreme. The data-driven enterprise relies on accurate, accessible, and actionable information to make strategic decisions and drive innovation. The continued federation of data in the enterprise resulted in data silos.
For the past 5 years, BMS has used a custom framework called Enterprise Data Lake Services (EDLS) to create ETL jobs for business users. Data quality check – The data quality module enables you to perform quality checks on a huge amount of data and generate reports that describe and validate the data quality.
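A data quality module of the kind described, one that scans rows and emits a report, can be reduced to a small sketch. The column names and the per-column completeness metric below are assumptions chosen for illustration, not the EDLS implementation.

```python
# Hedged sketch: a per-column completeness report over a list of row dicts.
def quality_report(rows, required_columns):
    """Return {column: {completeness, passed}} for each required column."""
    total = len(rows)
    report = {}
    for col in required_columns:
        filled = sum(1 for r in rows if r.get(col) not in (None, ""))
        report[col] = {
            "completeness": filled / total if total else 0.0,
            "passed": total > 0 and filled == total,  # pass only if fully populated
        }
    return report

rows = [
    {"sku": "A1", "price": 9.99},
    {"sku": "A2", "price": None},   # missing price
]
report = quality_report(rows, ["sku", "price"])
print(report["sku"]["passed"], report["price"]["completeness"])  # True 0.5
```

Real frameworks add many more rule types (ranges, uniqueness, referential checks), but they all follow this shape: evaluate rules over the data, then emit a structured report that downstream jobs can act on.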
It’s stored in corporate data warehouses, data lakes, and a myriad of other locations – and while some of it is put to good use, it’s estimated that around 73% of this data remains unexplored. Improving data quality. Unexamined and unused data is often of poor quality. Data augmentation.
Amazon Redshift is a popular cloud data warehouse, offering a fully managed cloud-based service that seamlessly integrates with an organization’s Amazon Simple Storage Service (Amazon S3) data lake, real-time streams, machine learning (ML) workflows, transactional workflows, and much more—all while providing up to 7.9x
This would be a straightforward task were it not for the fact that, in the digital era, there has been an explosion of data – collected and stored everywhere – much of it poorly governed, ill-understood, and irrelevant. Further, data management activities don’t end once the AI model has been developed. Addressing the Challenge.
Lastly, active data governance simplifies stewardship tasks of all kinds. Technical stewards have the tools to monitor data quality, access, and access control. A compliance steward is empowered to monitor sensitive data and usage sharing policies at scale. The Data Swamp Problem. The Governance Solution.
Part Two of the Digital Transformation Journey … In our last blog on driving digital transformation , we explored how enterprise architecture (EA) and business process (BP) modeling are pivotal factors in a viable digital transformation strategy. With automation, data quality is systemically assured.
And Doug Shannon, automation and AI practitioner, and Gartner peer community ambassador, says the vast majority of enterprises are now focused on two categories of use cases that are most likely to deliver positive ROI. Classifiers are provided in the toolkits to allow enterprises to set thresholds.
One of the bank’s key challenges related to strict cybersecurity requirements is to implement field level encryption for personally identifiable information (PII), Payment Card Industry (PCI), and data that is classified as high privacy risk (HPR). Only users with required permissions are allowed to access data in clear text.
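The field-level protection requirement above can be illustrated with a small sketch. Real field-level encryption would use a KMS-managed key and a reversible cipher; the version below pseudonymizes PII columns with a keyed HMAC (one-way) purely to show the shape of per-field processing. The key, field classification, and record layout are all invented for the example.

```python
import hashlib
import hmac

# Illustrative stand-in for field-level protection of PII columns.
SECRET_KEY = b"demo-key-not-for-production"  # assumption: sourced from a KMS in practice
PII_FIELDS = {"ssn", "email"}                # invented classification of HPR fields

def protect_record(record):
    """Replace classified fields with a stable keyed token; pass others through."""
    out = {}
    for field, value in record.items():
        if field in PII_FIELDS and value is not None:
            digest = hmac.new(SECRET_KEY, str(value).encode(), hashlib.sha256)
            out[field] = digest.hexdigest()[:16]  # stable token, never clear text
        else:
            out[field] = value
    return out

rec = {"ssn": "123-45-6789", "email": "a@x.com", "city": "Boston"}
protected = protect_record(rec)
print(protected["city"], protected["ssn"] != rec["ssn"])  # Boston True
```

Because the token is deterministic for a given key, joins and deduplication still work on protected columns, while only users holding the decryption path (in a real reversible scheme) see clear text.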
Gartner calls it the Composable Enterprise , for example – it’s about having a solid information foundation that enables fast and flexible creation of what they call composable applications that allow you to create new applications and workflows by just bringing together modular components. Business Process. Business Context.
Implementing the right data strategy spurs innovation and outstanding business outcomes by recognizing data as a critical asset that provides insights for better and more informed decision-making. By taking advantage of data, enterprises can shape business decisions, minimize risk for stakeholders, and gain competitive advantage.
After countless open-source innovations ushered in the Big Data era, including the first commercial distribution of HDFS (Apache Hadoop Distributed File System), commonly referred to as Hadoop, the two companies joined forces, giving birth to an entire ecosystem of technology and tech companies.
These stewards monitor the input and output of data integrations and workflows to ensure data quality. Their focus is on master data management, data lakes/warehouses, and ensuring the trackability of data using audit trails and metadata. How to Get Started with Information Stewardship.
Making the most of enterprise data is a top concern for IT leaders today. With organizations seeking to become more data-driven with business decisions, IT leaders must devise data strategies geared toward creating value from data no matter where — or in what form — it resides. Quality is job one.
Data has become an invaluable asset for businesses, offering critical insights to drive strategic decision-making and operational optimization. From establishing an enterprise-wide data inventory and improving data discoverability, to enabling decentralized data sharing and governance, Amazon DataZone has been a game changer for HEMA.
These processes are recurrent and require continuous evolution of reports, online data visualization , dashboards, and new functionalities to adapt current processes and develop new ones. Testing will eliminate many data quality challenges and bring a test-first approach into your agile cycle.
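A test-first approach to data quality can be as simple as writing the validation before wiring up the report it feeds. The sketch below is a minimal example; the dashboard input shape (rows with a `revenue` field) and sample values are invented.

```python
# Minimal "test-first" data check: the assertions below are written against
# the validator's contract before any dashboard code consumes the rows.
def validate_dashboard_input(rows):
    """Return a list of human-readable errors; empty list means clean input."""
    errors = []
    for i, row in enumerate(rows):
        if row.get("revenue") is None:
            errors.append(f"row {i}: missing revenue")
        elif row["revenue"] < 0:
            errors.append(f"row {i}: negative revenue")
    return errors

good = [{"revenue": 100.0}, {"revenue": 0.0}]
bad = [{"revenue": -5.0}, {"revenue": None}]
assert validate_dashboard_input(good) == []
assert len(validate_dashboard_input(bad)) == 2
print("checks passed")
```

Running these checks at the start of each pipeline iteration surfaces data quality regressions in the same cycle that introduced them, instead of in the finished dashboard.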
For state and local agencies, data silos create compounding problems: Inaccessible or hard-to-access data creates barriers to data-driven decision-making. Legacy data sharing involves proliferating copies of data, creating data management and security challenges. (Towards Data Science; Forrester)
Birgit Fridrich, who joined Allianz as sustainability manager responsible for ESG reporting in late 2022, spends many hours validating data in the company’s Microsoft Sustainability Manager tool. Data quality is key, but if we’re doing it manually there’s the potential for mistakes.
Selling the value of data transformation Iyengar and his team are 18 months into a three- to five-year journey that started by building out the data layer — corralling data sources such as ERP, CRM, and legacy databases into data warehouses for structured data and data lakes for unstructured data.
Migrating to Amazon Redshift offers organizations the potential for improved price-performance, enhanced data processing, faster query response times, and better integration with technologies such as machine learning (ML) and artificial intelligence (AI).
It allows users to write data transformation code, run it, and test the output, all within the framework it provides. Use case: The Enterprise Data Analytics group of a large jewelry retailer embarked on their cloud journey with AWS in 2021. AWS Glue – AWS Glue is used to load files into Amazon Redshift through the S3 data lake.
Among the most common challenges to achieving AI adoption at scale were data quality and availability (36%), scalability and deployment (36%), integration with existing systems and processes (35%), and change management and organizational culture (34%).
Data is at the heart of everything we do today, from AI to machine learning or generative AI. We’ve been leveraging predictive technologies, or what I call traditional AI, across our enterprise for nearly two decades with R&D and manufacturing, for example, all partnering with IT. This work is not new to Dow.
Analytics reference architecture for gaming organizations In this section, we discuss how gaming organizations can use a data hub architecture to address the analytical needs of an enterprise, which requires the same data at multiple levels of granularity and different formats, and is standardized for faster consumption.