Data architecture definition: Data architecture describes the structure of an organization's logical and physical data assets and data management resources, according to The Open Group Architecture Framework (TOGAF). An organization's data architecture is the purview of data architects.
This is part two of a three-part series in which we show how to build a data lake on AWS using a modern data architecture. This post shows how to load data from a legacy database (SQL Server) into a transactional data lake (Apache Iceberg) using AWS Glue.
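As a rough illustration of that pattern, here is a minimal AWS Glue (PySpark) sketch that reads a SQL Server table over JDBC and writes it to an Iceberg table, assuming the job is configured for Iceberg (for example, via Glue's Iceberg support and a Glue Data Catalog catalog named glue_catalog). The connection URL, credentials, and table names are placeholders, not values from the post.

```python
# Hypothetical Glue job sketch: SQL Server -> Apache Iceberg.
# Assumes the job enables Iceberg (e.g., the --datalake-formats iceberg
# job parameter on recent Glue versions) and a catalog named glue_catalog.
import sys

from awsglue.context import GlueContext
from awsglue.utils import getResolvedOptions
from pyspark.context import SparkContext

args = getResolvedOptions(sys.argv, ["JOB_NAME"])
glue_context = GlueContext(SparkContext.getOrCreate())
spark = glue_context.spark_session

# Read the legacy table over JDBC. In practice the credentials would come
# from a Glue connection or AWS Secrets Manager, not literals.
source_df = (
    spark.read.format("jdbc")
    .option("url", "jdbc:sqlserver://legacy-host:1433;databaseName=sales")
    .option("dbtable", "dbo.orders")
    .option("user", "glue_reader")
    .option("password", "***")
    .load()
)

# Write to an Iceberg table in the bronze layer of the data lake.
source_df.writeTo("glue_catalog.bronze.orders").createOrReplace()
```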
Data organizations often have a mix of centralized and decentralized activity. DataOps concerns itself with the complex flow of data across teams, data centers, and organizational boundaries. It expands beyond tools and data architecture and views the data organization from the perspective of its processes and workflows.
Over the years, organizations have invested in creating purpose-built, cloud-based data lakes that are siloed from one another. A major challenge is enabling cross-organization discovery and access to data across these multiple data lakes, each built on different technology stacks.
A modern data architecture enables companies to ingest virtually any type of data through automated pipelines into a data lake, which provides highly durable and cost-effective object storage at petabyte or exabyte scale.
The Analytics specialty practice of AWS Professional Services (AWS ProServe) helps customers across the globe with modern data architecture implementations on the AWS Cloud. Of those tables, some are larger than others (in terms of record volume, for example), and some are updated more frequently than others.
We also examine how centralized, hybrid, and decentralized data architectures support scalable, trustworthy ecosystems. As data-centric AI, automated metadata management, and privacy-aware data sharing mature, the opportunity to embed data quality into the enterprise's core has never been more significant.
A big part of preparing data to be shared is an exercise in data normalization, says Juan Orlandini, chief architect and distinguished engineer at Insight Enterprises. Data formats and data architectures are often inconsistent, and data might even be incomplete. "They have data swamps," he says.
Data architecture is a complex and varied field, and different organizations and industries have unique needs when it comes to their data architects. Solutions data architect: These individuals design and implement data solutions for specific business needs, including data warehouses, data marts, and data lakes.
There’s a recent trend toward people creating data lake or data warehouse patterns and calling it data enablement or a data hub. DataOps expands upon this approach by focusing on the processes and workflows that create data enablement and business analytics. DataOps Process Hub.
Noel had already established a relationship with consulting firm Resultant through a smaller data visualization project. Resultant then provided the business operations team with a set of recommendations for going forward, which the Rangers implemented with the consulting firm’s help.
But at the other end of the attention spectrum is data management, which all too frequently is perceived as being boring, tedious, the work of clerks and admins, and ridiculously expensive. Still, to truly create lasting value with data, organizations must develop data management mastery.
HBL started its data journey in 2019, when a data lake initiative was launched to consolidate complex data sources and enable the bank to use a single version of truth for decision making. The deployment was smooth and hassle-free, completed in just six weeks. Prior to the upgrade, HBL's 27-node cluster ran on CDH 6.1
To bring their customers the best deals and user experience, smava follows the modern data architecture principles with a data lake as a scalable, durable data store and purpose-built data stores for analytical processing and data consumption.
People from BI and analytics teams, business units, IT, corporate management and external consultant teams took part. A time-consuming development process and restricted support of self-service BI are the major drivers for modernizing the data warehouse.
Both engines provide native ingestion support from Kinesis Data Streams and Amazon MSK via a separate streaming pipeline to a data lake or data warehouse for analysis. For more details, refer to Create a low-latency source-to-data lake pipeline using Amazon MSK Connect, Apache Flink, and Apache Hudi.
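The referenced pipeline uses MSK Connect and Apache Flink; as an illustrative alternative in the same spirit, the sketch below uses PySpark Structured Streaming to land an MSK (Kafka) topic in an Apache Hudi table on S3. The broker address, topic, and S3 paths are placeholders, and the job assumes the Kafka and Hudi Spark packages are on the classpath.

```python
# Hypothetical streaming-ingestion sketch: Amazon MSK (Kafka) -> Hudi on S3.
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("msk-to-hudi").getOrCreate()

# Read the raw event stream; key/value arrive as bytes and are cast here.
events = (
    spark.readStream.format("kafka")
    .option("kafka.bootstrap.servers", "b-1.example-msk:9092")
    .option("subscribe", "clickstream")
    .option("startingOffsets", "latest")
    .load()
    .selectExpr(
        "CAST(key AS STRING) AS event_id",
        "CAST(value AS STRING) AS payload",
        "timestamp AS ts",
    )
)

# Continuously upsert into a Hudi table; the record key and precombine
# field choices here are illustrative assumptions.
query = (
    events.writeStream.format("hudi")
    .option("hoodie.table.name", "clickstream_raw")
    .option("hoodie.datasource.write.recordkey.field", "event_id")
    .option("hoodie.datasource.write.precombine.field", "ts")
    .option("checkpointLocation", "s3://example-bucket/checkpoints/clickstream/")
    .outputMode("append")
    .start("s3://example-bucket/hudi/clickstream_raw/")
)
query.awaitTermination()
```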
Enterprises still aren’t extracting enough value from unstructured data hidden away in documents, though, says Nick Kramer, VP for applied solutions at management consultancy SSA & Company. Data warehouses then evolved into data lakes, and then data fabrics and other enterprise-wide data architectures.
Success criteria alignment by all stakeholders (producers, consumers, operators, auditors) is key to a successful transition to a new Amazon Redshift modern data architecture. The success criteria are the key performance indicators (KPIs) for each component of the data workflow.
He mainly works with enterprise customers on data lake migration and modernization, and provides guidance and technical assistance on big data projects such as Hadoop, Spark, data warehousing, real-time data processing, and large-scale machine learning. George Zhao is a Senior Data Architect at AWS ProServe.
The biggest challenge for any big enterprise is organizing the data that has organically grown across the organization over the last several years. Everyone has data lakes, data ponds, whatever you want to call them. How do you get your arms around all the data you have? So, real-time data has become air.
Cloud-based solutions are promising, but some organizations are reluctant to migrate from legacy systems because it could result in costly downtime and many unknown data architecture and migration issues. Reduce the total cost of ownership of the data infrastructure. Mitigate risks with a seamless cloud migration.
Tens of thousands of customers run business-critical workloads on Amazon Redshift, AWS’s fast, petabyte-scale cloud data warehouse delivering the best price-performance. With Amazon Redshift, you can query data across your data warehouse, operational data stores, and data lake using standard SQL.
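As a minimal sketch of that cross-store querying, the example below uses the Amazon Redshift Data API via boto3 to run one standard-SQL statement joining a local warehouse table with a data lake table exposed through a Spectrum external schema. The cluster, database, user, and schema names are assumptions for illustration.

```python
# Hypothetical sketch: query warehouse and data lake together with one
# SQL statement via the Redshift Data API.
import boto3

client = boto3.client("redshift-data")

resp = client.execute_statement(
    ClusterIdentifier="example-cluster",
    Database="analytics",
    DbUser="analyst",
    Sql="""
        SELECT w.customer_id, w.lifetime_value, l.last_event_ts
        FROM warehouse.customers AS w
        JOIN spectrum_lake.events AS l   -- external schema over S3 data
          ON w.customer_id = l.customer_id
        LIMIT 100;
    """,
)
# The API is asynchronous; poll describe_statement / get_statement_result
# with this id to fetch rows.
print("Statement id:", resp["Id"])
```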
Dataarchitecture is a topic that is as relevant today as ever. It is widely regarded as a matter for data engineers, not business domain experts. Statements from countless interviews with our customers reveal that the data warehouse is seen as a “black box” by many and understood by few business users. But is it really?
Refactoring coupled compute and storage into a decoupled architecture is a modern data solution. It enables compute, such as EMR instances, and storage, such as Amazon Simple Storage Service (Amazon S3) data lakes, to scale independently. George Zhao is a Senior Data Architect at AWS ProServe.
As part of my consulting business, I end up thinking about Data Capability Frameworks quite a bit. Sometimes this is when I am assessing current Data Capabilities; sometimes it is when I am thinking about how to transition to future Data Capabilities. Data Architecture / Infrastructure. Introduction.
The way that this consistency of figures is achieved is by all elements of the Structured Reporting Framework drawing their data from the same data repositories. Without paying attention to this, your shiny warehouse or data lake will be a technological curiosity, not an indispensable business tool.
This will include how to configure Okta, AWS Lake Formation, and a business intelligence tool to enable SAML-based federated use of Athena for an enterprise BI activity. When building a scalable data architecture on AWS, giving autonomy and ownership to the data domains is crucial for the success of the platform.
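A rough sketch of the federated flow described: exchange an Okta SAML assertion for temporary AWS credentials with STS, then run an Athena query under permissions governed by Lake Formation. The role and provider ARNs, workgroup, and output location are placeholders; the post itself wires this up through Okta and a BI tool rather than raw API calls.

```python
# Hypothetical sketch: SAML-federated Athena access.
import boto3

# Step 1: trade the IdP's SAML assertion for temporary credentials.
sts = boto3.client("sts")
creds = sts.assume_role_with_saml(
    RoleArn="arn:aws:iam::123456789012:role/AthenaBIUsers",
    PrincipalArn="arn:aws:iam::123456789012:saml-provider/Okta",
    SAMLAssertion="<base64-encoded SAML response from Okta>",
)["Credentials"]

# Step 2: query Athena with those credentials; Lake Formation enforces
# what this principal may see.
athena = boto3.client(
    "athena",
    aws_access_key_id=creds["AccessKeyId"],
    aws_secret_access_key=creds["SecretAccessKey"],
    aws_session_token=creds["SessionToken"],
)
athena.start_query_execution(
    QueryString="SELECT * FROM sales.orders LIMIT 10",
    WorkGroup="bi-workgroup",
    ResultConfiguration={"OutputLocation": "s3://example-results-bucket/athena/"},
)
```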
But Barnett, who started work on a strategy in 2023, wanted to continue using Baptist Memorial’s on-premises data center for financial, security, and continuity reasons, so he and his team explored options that allowed for keeping that data center as part of the mix.
This is the final part of a three-part series where we show how to build a data lake on AWS using a modern data architecture. This post shows how to process data with Amazon Redshift Spectrum and create the gold (consumption) layer. The following diagram illustrates the different layers of the data lake.
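One hedged way to picture that gold layer: use the Redshift Data API to register a Spectrum external schema over the lake's curated tables and then materialize a consumption-ready view inside the warehouse. Every identifier and the IAM role ARN below are illustrative assumptions, and the gold schema is assumed to already exist.

```python
# Hypothetical sketch: build a gold (consumption) layer with Spectrum.
import boto3

client = boto3.client("redshift-data")

ddl = [
    # Expose the lake's silver tables through a Spectrum external schema.
    """CREATE EXTERNAL SCHEMA IF NOT EXISTS silver
       FROM DATA CATALOG DATABASE 'silver_db'
       IAM_ROLE 'arn:aws:iam::123456789012:role/SpectrumRole';""",
    # Materialize a curated aggregate for BI consumption.
    """CREATE MATERIALIZED VIEW gold.daily_orders AS
       SELECT order_date, COUNT(*) AS orders, SUM(amount) AS revenue
       FROM silver.orders
       GROUP BY order_date;""",
]

client.batch_execute_statement(
    ClusterIdentifier="example-cluster",
    Database="analytics",
    DbUser="etl_user",
    Sqls=ddl,
)
```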