Diving Deeper into the Data Lake

David Menninger's Analyst Perspectives

A data lake is a centralized repository designed to house big data in structured, semi-structured and unstructured form. I have been covering the data lake topic for several years and encourage you to check out an earlier perspective called "Data Lakes: Safe Way to Swim in Big Data?"

Data Lake 352

Modernize your legacy databases with AWS data lakes, Part 2: Build a data lake using AWS DMS data on Apache Iceberg

AWS Big Data

This is part two of a three-part series where we show how to build a data lake on AWS using a modern data architecture. This post shows how to load data from a legacy database (SQL Server) into a transactional data lake (Apache Iceberg) using AWS Glue.

Data Lake 105

Understanding the Differences Between Data Lakes and Data Warehouses

Smart Data Collective

Data lakes and data warehouses are probably the two most widely used structures for storing data. Data Warehouses and Data Lakes in a Nutshell. A data warehouse is used as a central storage space for large amounts of structured data coming from various sources. Data Type and Processing.

Data Lake 140

5 things on our data and AI radar for 2021

O'Reilly on Data

The Right Solution for Your Data: Cloud Data Lakes and Data Lakehouses. Data lakes, specifically cloud data lakes, have experienced a fairly robust resurgence over the last few years. A Wave of Cloud-Native, Distributed Data Frameworks.

Data Lake 362

Building Best-in-Class Enterprise Analytics

Speaker: Anthony Roach, Director of Product Management at Tableau Software, and Jeremiah Morrow, Partner Solution Marketing Director at Dremio

Tableau works with strategic partners like Dremio to build data integrations that bring the two technologies together, creating a seamless and efficient customer experience. Through co-development and co-ownership, partners like Dremio ensure their unique capabilities are exposed and can be leveraged from within Tableau.

Load data incrementally from transactional data lakes to data warehouses

AWS Big Data

Data lakes and data warehouses are two of the most important data storage and management technologies in a modern data architecture. Data lakes store all of an organization’s data, regardless of its format or structure.

Data Lake 137

Choosing an open table format for your transactional data lake on AWS

AWS Big Data

A modern data architecture enables companies to ingest virtually any type of data through automated pipelines into a data lake, which provides highly durable and cost-effective object storage at petabyte or exabyte scale.

Data Lake 130