Data Analytics, Data Architecture, Data Lake and Enterprise

Data Analytics

Data Architecture

Data Lake

Enterprise

Modernize your legacy databases with AWS data lakes, Part 2: Build a data lake using AWS DMS data on Apache Iceberg

AWS Big Data

OCTOBER 30, 2024

This is part two of a three-part series where we show how to build a data lake on AWS using a modern data architecture. This post shows how to load data from a legacy database (SQL Server) into a transactional data lake ( Apache Iceberg ) using AWS Glue. Delete the bucket.

Data Lake

Data Lake Data Processing Optimization Machine Learning

Incremental refresh for Amazon Redshift materialized views on data lake tables

AWS Big Data

NOVEMBER 8, 2024

Amazon Redshift is a fast, fully managed cloud data warehouse that makes it cost-effective to analyze your data using standard SQL and business intelligence tools. Customers use data lake tables to achieve cost effective storage and interoperability with other tools.

Data Lake

Data Lake Data Warehouse Optimization Testing

Join 42,000+

Insiders

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Webinars

How to Streamline Payment Applications & Lien Waivers Through Innovative Construction Technology

Airflow Best Practices for ETL/ELT Pipelines

MORE WEBINARS

Trending Sources

The next generation of Amazon SageMaker: The center for all your data, analytics, and AI

AWS Big Data

DECEMBER 4, 2024

This week on the keynote stages at AWS re:Invent 2024, you heard from Matt Garman, CEO, AWS, and Swami Sivasubramanian, VP of AI and Data, AWS, speak about the next generation of Amazon SageMaker , the center for all of your data, analytics, and AI. The relationship between analytics and AI is rapidly evolving.

Data Analytics

Data Analytics Analytics Data Lake Data Quality

Webinars

How to Streamline Payment Applications & Lien Waivers Through Innovative Construction Technology

Airflow Best Practices for ETL/ELT Pipelines

MORE WEBINARS

Reduce time to access your transactional data for analytical processing using the power of Amazon SageMaker Lakehouse and zero-ETL

AWS Big Data

JUNE 16, 2025

However, the reality of scattered data across various systems—from data lakes to data warehouses and applications—makes it difficult to access and use data efficiently. As data volumes grow, so do the costs associated with ETL, leading to delayed insights and increased operational overhead. Choose Confirm.

Data Lake

Data Lake Analytics Data Warehouse Metadata

Simplify data integration with AWS Glue and zero-ETL to Amazon SageMaker Lakehouse

AWS Big Data

DECEMBER 4, 2024

While traditional extract, transform, and load (ETL) processes have long been a staple of data integration due to its flexibility, for common use cases such as replication and ingestion, they often prove time-consuming, complex, and less adaptable to the fast-changing demands of modern data architectures.

Data Integration

Data Integration Data Lake Statistics Data-driven

Accelerate SQL code migration from Google BigQuery to Amazon Redshift using BladeBridge

AWS Big Data

NOVEMBER 7, 2024

This post explores how you can use BladeBridge , a leading data environment modernization solution, to simplify and accelerate the migration of SQL code from BigQuery to Amazon Redshift. Tens of thousands of customers use Amazon Redshift every day to run analytics, processing exabytes of data for business insights.

Data Warehouse

Data Warehouse Reporting Big Data Data Lake

Ingest data from Google Analytics 4 and Google Sheets to Amazon Redshift using Amazon AppFlow

AWS Big Data

JANUARY 6, 2025

Amazon Redshift is a fast, scalable, and fully managed cloud data warehouse that allows you to process and run your complex SQL analytics workloads on structured and semi-structured data. He specializes in migrating enterprise data warehouses to AWS Modern Data Architecture.

Analytics

Analytics Data Warehouse Big Data Metrics

How Stifel built a modern data platform using AWS Glue and an event-driven domain architecture

AWS Big Data

JULY 7, 2025

Each data domain has the flexibility to create data products that can be published to the centralized catalog, while maintaining the autonomy for teams to develop data products that are exclusively accessible to teams within the domain. Domains have the freedom to decide which data they want to share.

Data-driven

Data-driven Metadata Digital Transformation Data Lake

Realizing ocean data democratization: Furuno Electric’s initiatives using Amazon DataZone

AWS Big Data

JULY 10, 2025

This enables swift data analysis and comprehensive ship management, which means operators can detect potential issues and implement preventive measures before they develop into serious problems. Linkage with each application can also be easily realized from the data lake, providing expandability for future data utilization.

Manufacturing

Manufacturing IoT Data Lake Digital Transformation

Secure Data Sharing and Interoperability Powered by Iceberg REST Catalog

Cloudera

DECEMBER 3, 2024

Many enterprises have heterogeneous data platforms and technology stacks across different business units or data domains. For decades, they have been struggling with scale, speed, and correctness required to derive timely, meaningful, and actionable insights from vast and diverse big data environments.

Metadata

Metadata Data Warehouse ROI Snapshot

Top analytics announcements of AWS re:Invent 2024

AWS Big Data

FEBRUARY 26, 2025

Analytics remained one of the key focus areas this year, with significant updates and innovations aimed at helping businesses harness their data more efficiently and accelerate insights. This zero-ETL integration reduces the complexity and operational burden of data replication to let you focus on deriving insights from your data.

Analytics

Analytics Data Lake Metadata Data Warehouse

What is a Data Mesh?

DataKitchen

AUGUST 3, 2021

The data mesh design pattern breaks giant, monolithic enterprise data architectures into subsystems or domains, each managed by a dedicated team. But first, let’s define the data mesh design pattern. The past decades of enterprise data platform architectures can be summarized in 69 words.

Data Architecture

Data Architecture Data Lake Data Warehouse Cost-Benefit

Eight Top DataOps Trends for 2022

DataKitchen

NOVEMBER 29, 2021

DataOps adoption continues to expand as a perfect storm of social, economic, and technological factors drive enterprises to invest in process-driven innovation. As a result, enterprises will examine their end-to-end data operations and analytics creation workflows. Data Gets Meshier. Rise of the DataOps Engineer.

Testing

Testing Data Lake Data Architecture Manufacturing

Centralize Your Data Processes With a DataOps Process Hub

DataKitchen

NOVEMBER 4, 2021

Data organizations often have a mix of centralized and decentralized activity. DataOps concerns itself with the complex flow of data across teams, data centers and organizational boundaries. It expands beyond tools and data architecture and views the data organization from the perspective of its processes and workflows.

Data Processing

Data Processing Data Lake Cost-Benefit Testing

How Cloudinary transformed their petabyte scale streaming data lake with Apache Iceberg and AWS Analytics

AWS Big Data

JUNE 10, 2024

Enterprises and organizations across the globe want to harness the power of data to make better decisions by putting data at the center of every decision-making process. The open table format accelerates companies’ adoption of a modern data strategy because it allows them to use various tools on top of a single copy of the data.

Data Lake

Data Lake Metadata Snapshot Analytics

Design a data mesh pattern for Amazon EMR-based data lakes using AWS Lake Formation with Hive metastore federation

AWS Big Data

JUNE 10, 2024

Use cases for Hive metastore federation for Amazon EMR Hive metastore federation for Amazon EMR is applicable to the following use cases: Governance of Amazon EMR-based data lakes – Producers generate data within their AWS accounts using an Amazon EMR-based data lake supported by EMRFS on Amazon Simple Storage Service (Amazon S3)and HBase.

Data Lake

Data Lake Metadata Data Warehouse Data Processing

Automate replication of relational sources into a transactional data lake with Apache Iceberg and AWS Glue

AWS Big Data

FEBRUARY 14, 2023

Organizations have chosen to build data lakes on top of Amazon Simple Storage Service (Amazon S3) for many years. A data lake is the most popular choice for organizations to store all their organizational data generated by different teams, across business domains, from all different formats, and even over history.

Data Lake

Data Lake Statistics Data Architecture Finance

Carhartt turns to data under new CIO

CIO Business Intelligence

NOVEMBER 25, 2022

Carhartt’s signature workwear is near ubiquitous, and its continuing presence on factory floors and at skate parks alike is fueled in part thanks to an ongoing digital transformation that is advancing the 133-year-old Midwest company’s operations to make the most of advanced digital technologies, including the cloud, data analytics, and AI.

Data Lake

Data Lake Data Warehouse Unstructured Data Data Architecture

Build a multi-Region and highly resilient modern data architecture using AWS Glue and AWS Lake Formation

AWS Big Data

JANUARY 24, 2023

AWS Lake Formation helps with enterprise data governance and is important for a data mesh architecture. It works with the AWS Glue Data Catalog to enforce data access and governance. This solution only replicates metadata in the Data Catalog, not the actual underlying data.

Data Architecture

Data Architecture Metadata Data Lake Snapshot

DataOps For Business Analytics Teams

DataKitchen

JANUARY 3, 2022

A DataOps process hub offers a way for business analytics teams to cope with fast-paced requirements without expanding staff or sacrificing quality. Analytics Hub and Spoke. The data analytics function in large enterprises is generally distributed across departments and roles. DataOps Process Hub.

Business Analytics

Business Analytics Analytics Testing Dashboards

Building a Beautiful Data Lakehouse

CIO Business Intelligence

MARCH 9, 2022

Applying artificial intelligence (AI) to data analytics for deeper, better insights and automation is a growing enterprise IT priority. But the data repository options that have been around for a while tend to fall short in their ability to serve as the foundation for big data analytics powered by AI.

Data Lake

Data Lake Unstructured Data Data Warehouse Big Data

Build a transactional data lake using Apache Iceberg, AWS Glue, and cross-account data shares using AWS Lake Formation and Amazon Athena

AWS Big Data

APRIL 24, 2023

Building a data lake on Amazon Simple Storage Service (Amazon S3) provides numerous benefits for an organization. However, many use cases, like performing change data capture (CDC) from an upstream relational database to an Amazon S3-based data lake, require handling data at a record level.

Data Lake

Data Lake Data Governance Machine Learning Cost-Benefit

Data architecture strategy for data quality

IBM Big Data Hub

JANUARY 5, 2023

Several factors determine the quality of your enterprise data like accuracy, completeness, consistency, to name a few. But there’s another factor of data quality that doesn’t get the recognition it deserves: your data architecture. How the right data architecture improves data quality.

Data Architecture

Data Architecture Data Quality Strategy Data Lake

Data science vs data analytics: Unpacking the differences

IBM Big Data Hub

SEPTEMBER 19, 2023

Though you may encounter the terms “data science” and “data analytics” being used interchangeably in conversations or online, they refer to two distinctly different concepts. Meanwhile, data analytics is the act of examining datasets to extract value and find answers to specific questions.

Data Science

Data Science Data Analytics Prescriptive Analytics Analytics

The Future of the Data Lakehouse – Open

CIO Business Intelligence

JUNE 23, 2022

Cloudera customers run some of the biggest data lakes on earth. These lakes power mission critical large scale data analytics, business intelligence (BI), and machine learning use cases, including enterprise data warehouses. On data warehouses and data lakes.

Data Lake

Data Lake Data Warehouse Machine Learning Data-driven

Detect, mask, and redact PII data using AWS Glue before loading into Amazon OpenSearch Service

AWS Big Data

JANUARY 12, 2024

Solution overview The following diagram illustrates the high-level solution architecture. We have defined all layers and components of our design in line with the AWS Well-Architected Framework Data Analytics Lens. Amazon AppFlow can be used to transfer data from different SaaS applications to a data lake.

Data Lake

Data Lake Cost-Benefit Visualization Structured Data

The New Normal for FP&A: Data Analytics

Jedox

OCTOBER 22, 2020

The term “data analytics” refers to the process of examining datasets to draw conclusions about the information they contain. Data analysis techniques enhance the ability to take raw data and uncover patterns to extract valuable insights from it. Data analytics is not new. Inability to get data quickly.

Data Analytics

Data Analytics Analytics Unstructured Data Data mining

Deploy and Optimize Your Snowflake Environment Faster With Accelerators

CDW Research Hub

JULY 18, 2022

One modern data platform solution that provides simplicity and flexibility to grow is Snowflake’s data cloud and platform. These Snowflake accelerators reduce the time to analytics for your users at all levels so you can make data-driven decisions faster. Security Data Lake. Snowflake Health Check.

Optimization

Optimization Data Lake Data Warehouse Data Science

The Future of the Data Lakehouse – Open

Cloudera

JUNE 18, 2022

Data Lake

Data Lake Data Warehouse Machine Learning Data-driven

Announcing the 2020 Data Impact Award Winners

Cloudera

NOVEMBER 18, 2020

In fact, each of the 29 finalists represented organizations running cutting-edge use cases that showcase a winning enterprise data cloud strategy. The technological linchpin of its digital transformation has been its Enterprise Data Architecture & Governance platform. Data Champions . Winner: OVO.

Internet Publishing and Broadcasting

Internet Publishing and Broadcasting Data-driven Broadcasting Digital Transformation

Lay the groundwork now for advanced analytics and AI

CIO Business Intelligence

AUGUST 3, 2023

When global technology company Lenovo started utilizing data analytics, they helped identify a new market niche for its gaming laptops, and powered remote diagnostics so their customers got the most from their servers and other devices. We wanted the team to try every idea even if 60% of them failed.”

Analytics

Analytics Data Lake Metadata Cost-Benefit

AI Challenges and How Cloudera Can Help

Cloudera

AUGUST 20, 2024

Whether it’s rapidly rising costs, an inefficient and outdated data infrastructure, or serious gaps in data governance, there are myriad reasons why organizations are struggling to move past adoption and achieve AI at scale in their enterprises. Ensuring data is trustworthy comes with its own complications.

Data Architecture

Data Architecture Data Lake Data Governance Data Warehouse

Addressing the Elephant in the Room – Welcome to Today’s Cloudera

Cloudera

JUNE 13, 2024

After countless open-source innovations ushered in the Big Data era, including the first commercial distribution of HDFS (Apache Hadoop Distributed File System), commonly referred to as Hadoop, the two companies joined forces, giving birth to an entire ecosystem of technology and tech companies. That’s today’s Cloudera.

Big Data

Big Data Machine Learning Contextual Data Data Lake

Modernizing Data Analytics Architecture with the Denodo Platform on Azure

Data Virtualization

JANUARY 19, 2023

Reading Time: 2 minutes Today, many businesses are modernizing their on-premises data warehouses or cloud-based data lakes using Microsoft Azure Synapse Analytics. Unfortunately, with data spread.

Data Analytics

Data Analytics Data Lake Data Warehouse Analytics

How Data Management and Big Data Analytics Speed Up Business Growth

BizAcuity

APRIL 14, 2022

Its effective data analytics that allows personalization in marketing & sales, identifying new opportunities, making important decisions and being sustainable for the long term. Competitive Advantages to using Big Data Analytics. Challenges associated with Data Management and Optimizing Big Data.

Big Data

Big Data Data Analytics Management Analytics

Estimating Scope 1 Carbon Footprint with Amazon Athena

AWS Big Data

AUGUST 2, 2023

In this blog, we will walk through how we can apply existing enterprise data to better understand and estimate Scope 1 carbon footprint using Amazon Simple Storage Service (S3) and Amazon Athena , a serverless interactive analytics service that makes it easy to analyze data using standard SQL.

Data Lake

Data Lake Measurement Visualization Data Architecture

Unstructured data management and governance using AWS AI/ML and analytics services

AWS Big Data

OCTOBER 25, 2023

Most companies produce and consume unstructured data such as documents, emails, web pages, engagement center phone calls, and social media. By some estimates, unstructured data can make up to 80–90% of all new enterprise data and is growing many times faster than structured data.

Unstructured Data

Unstructured Data Metadata Management Analytics

Migrate a petabyte-scale data warehouse from Actian Vectorwise to Amazon Redshift

AWS Big Data

MAY 30, 2024

Amazon Redshift is a fast, scalable, and fully managed cloud data warehouse that allows you to process and run your complex SQL analytics workloads on structured and semi-structured data. Legacy architecture The customer’s platform was the main source for one-time, batch, and content processing.

Data Warehouse

Data Warehouse Data Lake Cost-Benefit Structured Data

Augmented data management: Data fabric versus data mesh

IBM Big Data Hub

APRIL 27, 2022

Data fabric and data mesh are emerging data management concepts that are meant to address the organizational change and complexities of understanding, governing and working with enterprise data in a hybrid multicloud ecosystem. The good news is that both data architecture concepts are complimentary.

Management

Management Metadata Data Architecture Data Lake

Unlock scalability, cost-efficiency, and faster insights with large-scale data migration to Amazon Redshift

AWS Big Data

AUGUST 1, 2024

Success criteria alignment by all stakeholders (producers, consumers, operators, auditors) is key for successful transition to a new Amazon Redshift modern data architecture. The success criteria are the key performance indicators (KPIs) for each component of the data workflow.

Data Warehouse

Data Warehouse KPI Optimization Cost-Benefit

Exploring real-time streaming for generative AI Applications

AWS Big Data

MARCH 25, 2024

Amazon DocumentDB (with MongoDB compatibility) is a fast, scalable, highly available, and fully managed enterprise document database service that supports native JSON workloads. To understand the best ways to make API calls via Apache Flink, refer to Common streaming data enrichment patterns in Amazon Kinesis Data Analytics for Apache Flink.

Data Lake

Data Lake Unstructured Data Snapshot Management

AWS launches Data Lab program in India

CIO Business Intelligence

JANUARY 30, 2023

Amazon Web Services (AWS) on Monday said it was making its AWS Data Lab program available in India. The complimentary program is targeted at accelerating joint engineering efforts between enterprise customers and AWS technical experts to tackle complex data, architecture challenges using AWS products and services, the company said.

Data Lake

Data Lake Data Architecture Machine Learning Data Processing

Create a modern data platform using the Data Build Tool (dbt) in the AWS Cloud

AWS Big Data

NOVEMBER 9, 2023

It allows users to write data transformation code, run it, and test the output, all within the framework it provides. Use case The Enterprise Data Analytics group of a large jewelry retailer embarked on their cloud journey with AWS in 2021. It’s raw, unprocessed data straight from the source.

Data Warehouse

Data Warehouse Testing Data Quality Reporting

2020 Data Impact Award Winner Spotlight: United Overseas Bank

Cloudera

JANUARY 13, 2021

In recognition of its efforts towards driving digital transformation, UOB won the “Business Transformation” and “Data for Enterprise AI” categories over the last two consecutive years. As an organisation, UOB has proven its fundamental understanding that the future is data-driven. Putting data at the heart of the organisation.

Digital Transformation

Digital Transformation Data-driven Data Lake Big Data

Modernize your legacy databases with AWS data lakes, Part 2: Build a data lake using AWS DMS data on Apache Iceberg

Incremental refresh for Amazon Redshift materialized views on data lake tables

Webinars

Trending Sources

The next generation of Amazon SageMaker: The center for all your data, analytics, and AI

Webinars

Reduce time to access your transactional data for analytical processing using the power of Amazon SageMaker Lakehouse and zero-ETL

Simplify data integration with AWS Glue and zero-ETL to Amazon SageMaker Lakehouse

Accelerate SQL code migration from Google BigQuery to Amazon Redshift using BladeBridge

Ingest data from Google Analytics 4 and Google Sheets to Amazon Redshift using Amazon AppFlow

How Stifel built a modern data platform using AWS Glue and an event-driven domain architecture

Realizing ocean data democratization: Furuno Electric’s initiatives using Amazon DataZone

Secure Data Sharing and Interoperability Powered by Iceberg REST Catalog

Top analytics announcements of AWS re:Invent 2024

What is a Data Mesh?

Eight Top DataOps Trends for 2022

Centralize Your Data Processes With a DataOps Process Hub

How Cloudinary transformed their petabyte scale streaming data lake with Apache Iceberg and AWS Analytics

Design a data mesh pattern for Amazon EMR-based data lakes using AWS Lake Formation with Hive metastore federation

Automate replication of relational sources into a transactional data lake with Apache Iceberg and AWS Glue

Carhartt turns to data under new CIO

Build a multi-Region and highly resilient modern data architecture using AWS Glue and AWS Lake Formation

DataOps For Business Analytics Teams

Building a Beautiful Data Lakehouse

Build a transactional data lake using Apache Iceberg, AWS Glue, and cross-account data shares using AWS Lake Formation and Amazon Athena

Data architecture strategy for data quality

Data science vs data analytics: Unpacking the differences

The Future of the Data Lakehouse – Open

Detect, mask, and redact PII data using AWS Glue before loading into Amazon OpenSearch Service

The New Normal for FP&A: Data Analytics

Deploy and Optimize Your Snowflake Environment Faster With Accelerators

The Future of the Data Lakehouse – Open

Announcing the 2020 Data Impact Award Winners

Lay the groundwork now for advanced analytics and AI

AI Challenges and How Cloudera Can Help

Addressing the Elephant in the Room – Welcome to Today’s Cloudera

Modernizing Data Analytics Architecture with the Denodo Platform on Azure

How Data Management and Big Data Analytics Speed Up Business Growth

Estimating Scope 1 Carbon Footprint with Amazon Athena

Unstructured data management and governance using AWS AI/ML and analytics services

Migrate a petabyte-scale data warehouse from Actian Vectorwise to Amazon Redshift

Augmented data management: Data fabric versus data mesh

Unlock scalability, cost-efficiency, and faster insights with large-scale data migration to Amazon Redshift

Exploring real-time streaming for generative AI Applications

AWS launches Data Lab program in India

Create a modern data platform using the Data Build Tool (dbt) in the AWS Cloud

2020 Data Impact Award Winner Spotlight: United Overseas Bank

Stay Connected