2022, Data Architecture and Data Lake

Eight Top DataOps Trends for 2022

DataKitchen

NOVEMBER 29, 2021

Keep an eye on the eight top trends below that we believe will be significant in the year 2022. The data industry realizes that AI bias is simply a quality problem, and AI systems should be subject to this same level of process control as an automobile rolling off an assembly line. Data Gets Meshier. Data Gets Meshier.

Testing

Testing Data Lake Data Architecture Manufacturing

Load data incrementally from transactional data lakes to data warehouses

AWS Big Data

OCTOBER 19, 2023

Data lakes and data warehouses are two of the most important data storage and management technologies in a modern data architecture. Data lakes store all of an organization’s data, regardless of its format or structure.

Data Lake

Data Lake Data Warehouse Visualization Snapshot

Use Apache Iceberg in your data lake with Amazon S3, AWS Glue, and Snowflake

AWS Big Data

APRIL 3, 2024

They understand that a one-size-fits-all approach no longer works, and recognize the value in adopting scalable, flexible tools and open data formats to support interoperability in a modern data architecture to accelerate the delivery of new solutions.

Data Lake

Data Lake Snapshot Metadata Data Architecture

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Data Talks, CFOs Listen: Why Analytics Are Key To Better Spend Management

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

AWS Lake Formation 2022 year in review

AWS Big Data

JANUARY 31, 2023

In this post, we are excited to summarize the features that the AWS Glue Data Catalog, AWS Glue crawler, and Lake Formation teams delivered in 2022. Whether you are a data platform builder, data engineer, data scientist, or any technology leader interested in data lake solutions, this post is for you.

Data Lake

Data Lake Data Governance Data Architecture Machine Learning

Choosing an open table format for your transactional data lake on AWS

AWS Big Data

JUNE 9, 2023

A modern data architecture enables companies to ingest virtually any type of data through automated pipelines into a data lake, which provides highly durable and cost-effective object storage at petabyte or exabyte scale.

Data Lake

Data Lake Metadata Statistics Optimization

Simplify data ingestion from Amazon S3 to Amazon Redshift using auto-copy

AWS Big Data

OCTOBER 30, 2024

The following are the recommended best practices when working with files using the auto-copy job: Use unique file names for each file in a auto-copy job (for example, 2022-10-15-batch-1.csv He specializes in migrating enterprise data warehouses to AWS Modern Data Architecture. Do not overwrite existing files.

Data Warehouse

Data Warehouse Sales Data Lake Recreation/Entertainment

Implement tag-based access control for your data lake and Amazon Redshift data sharing with AWS Lake Formation

AWS Big Data

JULY 21, 2023

This leads to having data across many instances of data warehouses and data lakes using a modern data architecture in separate AWS accounts. We recently announced the integration of Amazon Redshift data sharing with AWS Lake Formation. Take note of this role’s ARN to use later in the steps.

Data Lake

Data Lake Data Warehouse Marketing Management

Build a transactional data lake using Apache Iceberg, AWS Glue, and cross-account data shares using AWS Lake Formation and Amazon Athena

AWS Big Data

APRIL 24, 2023

Building a data lake on Amazon Simple Storage Service (Amazon S3) provides numerous benefits for an organization. However, many use cases, like performing change data capture (CDC) from an upstream relational database to an Amazon S3-based data lake, require handling data at a record level.

Data Lake

Data Lake Data Governance Machine Learning Cost-Benefit

What is a data architect? Skills, salaries, and how to become a data framework master

CIO Business Intelligence

OCTOBER 13, 2023

Data architecture is a complex and varied field and different organizations and industries have unique needs when it comes to their data architects. Solutions data architect: These individuals design and implement data solutions for specific business needs, including data warehouses, data marts, and data lakes.

Data Architecture

Data Architecture Data Warehouse Statistics Visualization

The essential check list for effective data democratization

CIO Business Intelligence

JANUARY 20, 2023

A big part of preparing data to be shared is an exercise in data normalization, says Juan Orlandini, chief architect and distinguished engineer at Insight Enterprises. Data formats and data architectures are often inconsistent, and data might even be incomplete. They have data swamps,” he says.

Data Lake

Data Lake Data-driven Finance Data Architecture

The Future of the Data Lakehouse – Open

CIO Business Intelligence

JUNE 23, 2022

Cloudera customers run some of the biggest data lakes on earth. These lakes power mission critical large scale data analytics, business intelligence (BI), and machine learning use cases, including enterprise data warehouses. On data warehouses and data lakes.

Data Lake

Data Lake Data Warehouse Machine Learning Data-driven

Real estate CIOs drive deals with data

CIO Business Intelligence

JULY 26, 2023

“The only thing we have on premise, I believe, is a data server with a bunch of unstructured data on it for our legal team,” says Grady Ligon, who was named Re/Max’s first CIO in October 2022. billion in 2022, resource industries $82.1 billion in 2022, and personal and consumer services at $82.6 billion in 2022.

Data Lake

Data Lake Digital Transformation Machine Learning Data Architecture

The Future of the Data Lakehouse – Open

Cloudera

JUNE 18, 2022

Cloudera customers run some of the biggest data lakes on earth. These lakes power mission critical large scale data analytics, business intelligence (BI), and machine learning use cases, including enterprise data warehouses. On data warehouses and data lakes.

Data Lake

Data Lake Data Warehouse Machine Learning Data-driven

Supercharge Your Data Lakehouse with Apache Iceberg in Cloudera Data Platform

Cloudera

JUNE 30, 2022

In February 2022, we introduced Apache Iceberg as a technical preview within CDP. Over the past decade, Cloudera has enabled multi-function analytics on data lakes through the introduction of the Hive table format and Hive ACID. We selected change data capture as our first use case on Iceberg.

Data Lake

Data Lake Data Warehouse Data Architecture Metadata

How smava makes loans transparent and affordable using Amazon Redshift Serverless

AWS Big Data

DECEMBER 21, 2023

To bring their customers the best deals and user experience, smava follows the modern data architecture principles with a data lake as a scalable, durable data store and purpose-built data stores for analytical processing and data consumption.

Data Lake

Data Lake Data Warehouse Data-driven B2B

Habib Bank manages data at scale with Cloudera Data Platform

Cloudera

NOVEMBER 17, 2022

The Solution: CDP Private Cloud brings a next-generation hybrid architecture with cloud-native benefits to HBL’s data platform. HBL started their data journey in 2019 when data lake initiative was started to consolidate complex data sources and enable the bank to use single version of truth for decision making.

Management

Management Data Lake Consulting Unstructured Data

Demystifying Modern Data Platforms

Cloudera

SEPTEMBER 15, 2022

July brings summer vacations, holiday gatherings, and for the first time in two years, the return of the Massachusetts Institute of Technology (MIT) Chief Data Officer symposium as an in-person event. A key area of focus for the symposium this year was the design and deployment of modern data platforms. What is a data fabric?

Data Lake

Data Lake Data Architecture Data-driven Data Warehouse

Augmented data management: Data fabric versus data mesh

IBM Big Data Hub

APRIL 27, 2022

Data fabric and data mesh are emerging data management concepts that are meant to address the organizational change and complexities of understanding, governing and working with enterprise data in a hybrid multicloud ecosystem. The good news is that both data architecture concepts are complimentary.

Management

Management Metadata Data Architecture Data Lake

CIOs rise to the ESG reporting challenge

CIO Business Intelligence

JANUARY 30, 2024

As the internal technology provider for parent company Allianz SE with 15,000 employees, the entity employs more than 100 ESG experts who spend several weeks each year heads down collecting and reporting ESG data manually. Karcher has since built a team of 18 and completed an inventory of existing ESG data structures and legal requirements.

Reporting

Reporting Data Quality Strategy Data-driven

Get maximum value out of your cloud data warehouse with Amazon Redshift

AWS Big Data

APRIL 19, 2023

Every day, customers are challenged with how to manage their growing data volumes and operational costs to unlock the value of data for timely insights and innovation, while maintaining consistent performance. As data workloads grow, costs to scale and manage data usage with the right governance typically increase as well.

Data Warehouse

Data Warehouse Data Lake Unstructured Data Optimization

The hidden history of Db2

IBM Big Data Hub

JULY 5, 2022

In today’s world of complex data architectures and emerging technologies, databases can sometimes be undervalued and unrecognized. When we look ahead, that same architectural foundation we have spent decades perfecting and innovating is also bringing Db2 into future. Vektis improves healthcare quality through data .

Data Lake

Data Lake Data Warehouse Publishing Structured Data

Why Invest Now? Three Investors Share the Story Behind Alation’s Series E

Alation

NOVEMBER 2, 2022

And that’s even in the midst of 2022, which has been a tumultuous year from a macro perspective. We had not seen that in the broader intelligence & data governance market.”. Right now, it’s probably not a secret that the amount and the pace of financings – if you compare 2022 to 2021 – is night and day,” he continues.

Data Governance

Data Governance Marketing Finance Data Lake

Build incremental data pipelines to load transactional data changes using AWS DMS, Delta 2.0, and Amazon EMR Serverless

AWS Big Data

MARCH 3, 2023

Building data lakes from continuously changing transactional data of databases and keeping data lakes up to date is a complex task and can be an operational challenge. You can then apply transformations and store data in Delta format for managing inserts, updates, and deletes.

Data Lake

Data Lake Dashboards Metrics Metadata

Create an end-to-end data strategy for Customer 360 on AWS

AWS Big Data

MARCH 26, 2024

This view is used to identify patterns and trends in customer behavior, which can inform data-driven decisions to improve business outcomes. In 2022, AWS commissioned a study conducted by the American Productivity and Quality Center (APQC) to quantify the Business Value of Customer 360.

Data Strategy

Data Strategy Strategy Data Warehouse Prescriptive Analytics

Dive deep into AWS Glue 4.0 for Apache Spark

AWS Big Data

MAY 18, 2023

It’s even harder when your organization is dealing with silos that impede data access across different data stores. Seamless data integration is a key requirement in a modern data architecture to break down data silos. AWS Glue released version 4.0 runtime ( 3.5

Testing

Testing Data Lake Cost-Benefit Data Integration

Has the Data Warehouse Had Its Day?

BI-Survey

JANUARY 15, 2023

Data architecture is a topic that is as relevant today as ever. It is widely regarded as a matter for data engineers, not business domain experts. Statements from countless interviews with our customers reveal that the data warehouse is seen as a “black box” by many and understood by few business users. But is it really?

Data Warehouse

Data Warehouse IT Data Architecture Measurement

The year’s top 10 enterprise AI trends — so far

CIO Business Intelligence

SEPTEMBER 21, 2023

The world has flipped since 2022,” says David McCurdy, chief enterprise architect and CTO at Insight. To make all this possible, the data had to be collected, processed, and fed into the systems that needed it in a reliable, efficient, scalable, and secure way. Then gen AI came out.

Enterprise

Enterprise Consulting Modeling Cost-Benefit

Exploring new ETL and ELT capabilities for Amazon Redshift from the AWS Glue Studio visual editor

AWS Big Data

APRIL 20, 2023

In a modern data architecture, unified analytics enable you to access the data you need, whether it’s stored in a data lake or a data warehouse. AWS Glue provides an extensible architecture that enables users with different data processing use cases, and works well with Amazon Redshift.

Visualization

Visualization Data Warehouse Big Data Data Lake

Showpad accelerates data maturity to unlock innovation using Amazon QuickSight

AWS Big Data

APRIL 5, 2023

Showpad also struggled with data quality issues in terms of consistency, ownership, and insufficient data access across its targeted user base due to a complex BI access process, licensing challenges, and insufficient education. Showpad completed its internal BI migration to QuickSight by the end of 2022.

Dashboards

Dashboards Reporting Cost-Benefit Visualization

Why We Started the Data Intelligence Project

Alation

JULY 7, 2022

In the 2010s, the growing scope of the data landscape gave rise to a new profession: the data scientist. This new role, combined with the creation of data lakes and the increasing use of cloud services, created new employment opportunities in data analytics, data architecture, and data management.

Metadata

Metadata Data-driven Insurance Statistics

Your Data Architecture Holds the Key to Unlocking AI’s Full Potential

CIO Business Intelligence

APRIL 4, 2023

In order to move AI forward, we need to first build and fortify the foundational layer: data architecture. This architecture is important because, to reap the full benefits of AI, it must be built to scale across an enterprise versus individual AI applications. Constructing the right data architecture cannot be bypassed.

Data Architecture

Data Architecture Data Lake Data Warehouse Cost-Benefit

Ingest telemetry messages in near real time with Amazon API Gateway, Amazon Data Firehose, and Amazon Location Service

AWS Big Data

NOVEMBER 14, 2024

APIs act as the entry point for applications to access data, business logic, or functionality from your backend services. Amazon Data Firehose – Data Firehose is an extract, transform, and load (ETL) service that reliably captures, transforms, and delivers streaming data to data lakes, data stores, and analytics services.

Data Lake

Data Lake Metadata Testing Data-driven

Data Leaders Brief

Eight Top DataOps Trends for 2022

Load data incrementally from transactional data lakes to data warehouses

Webinars

Trending Sources

Use Apache Iceberg in your data lake with Amazon S3, AWS Glue, and Snowflake

Webinars

AWS Lake Formation 2022 year in review

Choosing an open table format for your transactional data lake on AWS

Simplify data ingestion from Amazon S3 to Amazon Redshift using auto-copy

Implement tag-based access control for your data lake and Amazon Redshift data sharing with AWS Lake Formation

Build a transactional data lake using Apache Iceberg, AWS Glue, and cross-account data shares using AWS Lake Formation and Amazon Athena

What is a data architect? Skills, salaries, and how to become a data framework master

The essential check list for effective data democratization

The Future of the Data Lakehouse – Open

Real estate CIOs drive deals with data

The Future of the Data Lakehouse – Open

Supercharge Your Data Lakehouse with Apache Iceberg in Cloudera Data Platform

How smava makes loans transparent and affordable using Amazon Redshift Serverless

Habib Bank manages data at scale with Cloudera Data Platform

Demystifying Modern Data Platforms

Augmented data management: Data fabric versus data mesh

CIOs rise to the ESG reporting challenge

Get maximum value out of your cloud data warehouse with Amazon Redshift

The hidden history of Db2

Why Invest Now? Three Investors Share the Story Behind Alation’s Series E

Build incremental data pipelines to load transactional data changes using AWS DMS, Delta 2.0, and Amazon EMR Serverless

Create an end-to-end data strategy for Customer 360 on AWS

Dive deep into AWS Glue 4.0 for Apache Spark

Has the Data Warehouse Had Its Day?

The year’s top 10 enterprise AI trends — so far

Exploring new ETL and ELT capabilities for Amazon Redshift from the AWS Glue Studio visual editor

Showpad accelerates data maturity to unlock innovation using Amazon QuickSight

Why We Started the Data Intelligence Project

Your Data Architecture Holds the Key to Unlocking AI’s Full Potential

Ingest telemetry messages in near real time with Amazon API Gateway, Amazon Data Firehose, and Amazon Location Service

Stay Connected