This is part two of a three-part series where we show how to build a data lake on AWS using a modern data architecture. This post shows how to load data from a legacy database (SQL Server) into a transactional data lake (Apache Iceberg) using AWS Glue. To start the job, choose Run.
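The truncated fragment in the original excerpt (`.config("spark.sql.catalog.glue_catalog.catalog-impl", ...)`) suggests a Spark session configured to use the AWS Glue Data Catalog as an Iceberg catalog. A minimal sketch of that configuration, with hypothetical bucket and database names, might look like this:

```python
from pyspark.sql import SparkSession

dbname = "raw_db"  # hypothetical Glue database name
warehouse = "s3://my-datalake-bucket/warehouse"  # hypothetical S3 warehouse path

spark = (
    SparkSession.builder
    # Enable Iceberg's SQL extensions and register a catalog named glue_catalog
    .config("spark.sql.extensions",
            "org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions")
    .config("spark.sql.catalog.glue_catalog", "org.apache.iceberg.spark.SparkCatalog")
    .config("spark.sql.catalog.glue_catalog.warehouse", warehouse)
    # This is the setting the excerpt's fragment points at: back the catalog with AWS Glue
    .config("spark.sql.catalog.glue_catalog.catalog-impl",
            "org.apache.iceberg.aws.glue.GlueCatalog")
    .config("spark.sql.catalog.glue_catalog.io-impl",
            "org.apache.iceberg.aws.s3.S3FileIO")
    .getOrCreate()
)

# A DataFrame read from SQL Server over JDBC could then be written as an Iceberg table:
# df.writeTo("glue_catalog.{}.customers".format(dbname)).createOrReplace()
```

This assumes the Iceberg Spark runtime and AWS bundle jars are on the Glue job's classpath.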
Together with price-performance, Amazon Redshift offers capabilities such as a serverless architecture, machine learning integration within your data warehouse, and secure data sharing across the organization. dbt Cloud is a hosted service that helps data teams productionize dbt deployments.
But this glittering prize might cause some organizations to overlook something significantly more important: constructing the kind of event-driven data architecture that supports robust real-time analytics. An event-driven architecture is that foundation; the producer side of one is sketched below.
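As an illustration only (the article does not name a technology), an event producer on AWS could publish change events to a stream; here is a sketch using boto3 and a hypothetical Kinesis stream:

```python
import json
import boto3

kinesis = boto3.client("kinesis")

def publish_event(event: dict) -> None:
    """Emit a business change as an event so downstream analytics
    consumers can react to it in near real time."""
    kinesis.put_record(
        StreamName="order-events",            # hypothetical stream name
        Data=json.dumps(event).encode("utf-8"),
        PartitionKey=str(event["order_id"]),  # keeps one order's events ordered
    )

publish_event({"order_id": 42, "status": "placed", "amount": 99.5})
```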
While navigating so many simultaneous data-dependent transformations, they must balance the need to level up their data management practices—accelerating the rate at which they ingest, manage, prepare, and analyze data—with that of governing this data.
Overview of solution: As a data-driven company, smava relies on the AWS Cloud to power its analytics use cases. smava ingests data from various external and internal data sources into a landing stage on the data lake, based on Amazon Simple Storage Service (Amazon S3); a sketch of that landing step follows.
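A minimal sketch of landing one source extract, assuming a hypothetical bucket and a date-partitioned key layout (the post does not specify these details):

```python
import boto3

s3 = boto3.client("s3")

# Hypothetical landing-stage layout: source system / entity / ingestion date
s3.upload_file(
    Filename="exports/loans_2024-01-01.csv",  # local extract from a source system
    Bucket="datalake-landing",                # assumed bucket name
    Key="landing/crm/loans/year=2024/month=01/day=01/loans.csv",
)
```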
Not only do companies have to get all the basics in place to build for analytics and MLOps, but they also need to build new data structures and pipelines specifically for gen AI. And for some use cases, an expensive, high-end commercial LLM might not be required, since a locally hosted open-source model might suffice.
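As a sketch of the locally hosted option: many open-source model servers (vLLM, llama.cpp, and others) expose an OpenAI-compatible endpoint, so assuming such a server is running on localhost (the model name and port here are hypothetical), a call might look like:

```python
from openai import OpenAI

# Point the client at a local, OpenAI-compatible server instead of a commercial API
client = OpenAI(base_url="http://localhost:8000/v1", api_key="not-needed")

response = client.chat.completions.create(
    model="mistral-7b-instruct",  # hypothetical locally served model
    messages=[{"role": "user", "content": "Summarize yesterday's sales anomalies."}],
)
print(response.choices[0].message.content)
```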
Selecting the best BI tools is a critical step in leveraging data effectively, driving success, and maintaining a competitive advantage in modern markets. Data-driven decisions: BI tools empower businesses to make informed decisions by furnishing actionable insights, optimizing operations, and uncovering growth opportunities.
On Thursday, January 6th, I hosted Gartner's 2022 Leadership Vision for Data and Analytics webinar. Have a look at this and see if it helps: Data, Analytics and AI Form the Foundation of Data-Driven Decision Making. Here is the link to the replay, in case you are interested. I hope we can help.
Data environment: First off, the solutions you consider should be compatible with your current data architecture. We have outlined the requirements that most providers ask for. For data sources, the strategic objective is to use native connectivity optimized for the data source; do what you expect your customers to do.
In your project, in the navigation pane, choose Data. Choose the plus sign, and for Add data source, choose Add connection. Select PostgreSQL. For Data source name, enter postgresql_source. For Host, enter the host name of your Aurora PostgreSQL database cluster. For Database, enter your database name.
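Once the connection is created, one way to sanity-check it outside the console is a direct client connection. A minimal sketch with psycopg2 and placeholder credentials (none of these values come from the post):

```python
import psycopg2

# Placeholder values; use the same host and database entered in the connection above
conn = psycopg2.connect(
    host="my-cluster.cluster-abc123.us-east-1.rds.amazonaws.com",
    dbname="salesdb",
    user="postgres",
    password="...",  # in practice, fetch from AWS Secrets Manager
    port=5432,
)
cur = conn.cursor()
cur.execute("SELECT version()")
print(cur.fetchone())
cur.close()
conn.close()
```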
Under Add a data source, choose Add connection, then choose Amazon Redshift. Enter the following parameters in the connection details, and choose Add data. Host: enter the Amazon Redshift managed VPC endpoint. He is also the author of the book Serverless ETL and Analytics with AWS Glue.
This is the final part of a three-part series where we show how to build a data lake on AWS using a modern data architecture. This post shows how to process data with Amazon Redshift Spectrum and create the gold (consumption) layer; a rough sketch of that step follows. His focus areas are MLOps, feature stores, data lakes, model hosting, and generative AI.
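The post itself walks through the console; as a sketch of the underlying SQL, a gold layer over Spectrum might be built like this, run here through the redshift_connector Python driver (the endpoint, schema, role ARN, and table names are all hypothetical):

```python
import redshift_connector

conn = redshift_connector.connect(
    host="my-cluster.abc123.us-east-1.redshift.amazonaws.com",  # placeholder endpoint
    database="dev",
    user="admin",
    password="...",  # fetch from a secrets manager in practice
)
conn.autocommit = True  # Redshift restricts some DDL inside transactions
cur = conn.cursor()

# External schema pointing Redshift Spectrum at the silver layer in the Glue Data Catalog
cur.execute("""
    CREATE EXTERNAL SCHEMA IF NOT EXISTS silver
    FROM DATA CATALOG DATABASE 'silver_db'
    IAM_ROLE 'arn:aws:iam::123456789012:role/RedshiftSpectrumRole'
""")

# Gold (consumption) table materialized inside Redshift from the external data
cur.execute("""
    CREATE TABLE gold_sales_daily AS
    SELECT sale_date, SUM(amount) AS total_amount
    FROM silver.sales
    GROUP BY sale_date
""")
```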