2021, Data Integration and Data Lake

2021

Data Integration

Data Lake

Query your Iceberg tables in data lake using Amazon Redshift (Preview)

AWS Big Data

AUGUST 31, 2023

Amazon Redshift enables you to directly access data stored in Amazon Simple Storage Service (Amazon S3) using SQL queries and join data across your data warehouse and data lake. With Amazon Redshift, you can query the data in your S3 data lake using a central AWS Glue metastore from your Redshift data warehouse.

Data Lake

Data Lake Data Warehouse Metadata Data Architecture

Recap of Amazon Redshift key product announcements in 2024

AWS Big Data

DECEMBER 17, 2024

Today, Amazon Redshift is used by customers across all industries for a variety of use cases, including data warehouse migration and modernization, near real-time analytics, self-service analytics, data lake analytics, machine learning (ML), and data monetization.

Data Lake

Data Lake Data Warehouse Data-driven Optimization

Join 42,000+

Insiders

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Webinars

Data Talks, CFOs Listen: Why Analytics Are Key To Better Spend Management

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Trending Sources

Introducing a new unified data connection experience with Amazon SageMaker Lakehouse unified data connectivity

AWS Big Data

DECEMBER 16, 2024

Third, some services require you to set up and manage compute resources used for federated connectivity, and capabilities like connection testing and data preview arent available in all services. To solve for these challenges, we launched Amazon SageMaker Lakehouse unified data connectivity.

Visualization

Visualization Data Processing Testing Publishing

Webinars

Data Talks, CFOs Listen: Why Analytics Are Key To Better Spend Management

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Amazon Redshift announcements at AWS re:Invent 2023 to enable analytics on all your data

AWS Big Data

NOVEMBER 29, 2023

Amazon Redshift Serverless, generally available since 2021, allows you to run and scale analytics without having to provision and manage the data warehouse. Since GA, Redshift Serverless executed over a billion queries to power data insights for thousands of customers.

Data Warehouse

Data Warehouse Analytics Data Lake Machine Learning

Automatically detect Personally Identifiable Information in Amazon Redshift using AWS Glue

AWS Big Data

DECEMBER 15, 2023

Even after identification, it’s cumbersome to implement redaction, masking, or encryption of sensitive data at scale. In this post, we provide an automated solution to detect PII data in Amazon Redshift using AWS Glue. For our solution, we use Amazon Redshift to store the data.

Data Lake

Data Lake Data Warehouse Big Data Structured Data

How Ruparupa gained updated insights with an Amazon S3 data lake, AWS Glue, Apache Hudi, and Amazon QuickSight

AWS Big Data

FEBRUARY 22, 2023

In this post, we show how Ruparupa implemented an incrementally updated data lake to get insights into their business using Amazon Simple Storage Service (Amazon S3), AWS Glue , Apache Hudi , and Amazon QuickSight. An AWS Glue ETL job, using the Apache Hudi connector, updates the S3 data lake hourly with incremental data.

Data Lake

Data Lake Dashboards Cost-Benefit Data Warehouse

P&G turns to AI to create digital manufacturing of the future

CIO Business Intelligence

OCTOBER 1, 2022

P&G) has grown to become one of the world’s largest consumer goods manufacturers, with worldwide revenue of more than $76 billion in 2021 and more than 100,000 employees. It requires taking data from equipment sensors, applying advanced analytics to derive descriptive and predictive insights, and automating corrective actions.

Manufacturing

Manufacturing Digital Transformation IoT Internet of Things

Straumann Group is transforming dentistry with data, AI

CIO Business Intelligence

FEBRUARY 16, 2023

Selling the value of data transformation Iyengar and his team are 18 months into a three- to five-year journey that started by building out the data layer — corralling data sources such as ERP, CRM, and legacy databases into data warehouses for structured data and data lakes for unstructured data.

Unstructured Data

Unstructured Data Data Lake Prescriptive Analytics Data Warehouse

Augmented data management: Data fabric versus data mesh

IBM Big Data Hub

APRIL 27, 2022

The data fabric architectural approach can simplify data access in an organization and facilitate self-service data consumption at scale. Read: The first capability of a data fabric is a semantic knowledge data catalog, but what are the other 5 core capabilities of a data fabric? 11 May 2021. .

Management

Management Metadata Data Architecture Data Lake

Week in the Life of an Analyst at Gartner US IT Symposium (virtual) 2021

Andrew White

OCTOBER 22, 2021

Lakehouse (data warehouse and data lake working together) 8. Data Literacy, training, coordination, collaboration 8. Data Management Infrastructure/Data Fabric 5. Data Integration tactics 4. Digital Business connections to D&A/decision modeling 10. Business Innovation with D&A 6.

IT Data Lake Data Science Strategy

Automate the archive and purge data process for Amazon RDS for PostgreSQL using pg_partman, Amazon S3, and AWS Glue

AWS Big Data

AUGUST 22, 2023

This post proposes an automated solution by using AWS Glue for automating the PostgreSQL data archiving and restoration process, thereby streamlining the entire procedure. He joined AWS Professional Services organization in 2021 and helps customers build cloud-native applications on AWS cloud.

Data Processing

Data Processing Testing Data Lake Data Integration

How Knowledge Graphs Power Data Mesh and Data Fabric

Ontotext

APRIL 10, 2024

The data ecosystem today is crowded with dazzling buzzwords, all fighting for investment dollars. A survey in 2021 found that a data company was being funded every 45 minutes. Data ecosystems have become jungles and in spite of all the technology, data teams are struggling to create a modern data experience.

Metadata

Metadata Data Lake Data Warehouse Data Quality

How Amazon Devices scaled and optimized real-time demand and supply forecasts using serverless analytics

AWS Big Data

FEBRUARY 1, 2023

With data volumes exhibiting a double-digit percentage growth rate year on year and the COVID pandemic disrupting global logistics in 2021, it became more critical to scale and generate near-real-time data. You can visually create, run, and monitor extract, transform, and load (ETL) pipelines to load data into your data lakes.

Optimization

Optimization Forecasting Data Lake Metadata

Top Graph Use Cases and Enterprise Applications (with Real World Examples)

Ontotext

MARCH 8, 2023

Gartner predicts that graph technologies will be used in 80% of data and analytics innovations by 2025, up from 10% in 2021. As such, most large financial organizations have moved their data to a data lake or a data warehouse to understand and manage financial risk in one place.

Enterprise

Enterprise Knowledge Discovery Risk Machine Learning

It’s not your data. It’s how you use it. Unlock the power of data & build foundations of a data driven organisation

CIO Business Intelligence

MAY 24, 2022

Australian research and advisory firm Adapt identifies an organisation’s ability to execute a data-driven strategy as one of 12 core competencies , identified from 30,000 conversations spanning three years with leading IT and businesses. analyse the data, using business intelligence, visualisation or data science tools.

Data-driven

Data-driven Data Lake Data Warehouse Machine Learning

The Right Tool to Support Your Microsoft Dynamics Migration

Jet Global

JUNE 13, 2022

According to Gartner’s 2021 Core Financial Magic Quadrant , over 50% of the ERP market is expected to be cloud-based by 2024. When migrating to the cloud, there are a variety of different approaches you can take to maintain your data strategy. It is the fastest growing ERP on the market and can be run on premises or on the cloud.

Reporting

Reporting Data Lake Sales Operational Reporting

How DBAs can take on a more strategic role

CIO Business Intelligence

NOVEMBER 12, 2024

Complicating the issue is the fact that a majority of data (80% to 90%, according to multiple analyst estimates) is unstructured. 3 Modern DBAs must now navigate a landscape where data resides across increasingly diverse environments, including relational databases, NoSQL, and data lakes.

Statistics

Statistics Unstructured Data Cost-Benefit Data Lake

Data Leaders Brief

Query your Iceberg tables in data lake using Amazon Redshift (Preview)

Recap of Amazon Redshift key product announcements in 2024

Webinars

Trending Sources

Introducing a new unified data connection experience with Amazon SageMaker Lakehouse unified data connectivity

Webinars

Amazon Redshift announcements at AWS re:Invent 2023 to enable analytics on all your data

Automatically detect Personally Identifiable Information in Amazon Redshift using AWS Glue

How Ruparupa gained updated insights with an Amazon S3 data lake, AWS Glue, Apache Hudi, and Amazon QuickSight

P&G turns to AI to create digital manufacturing of the future

Straumann Group is transforming dentistry with data, AI

Augmented data management: Data fabric versus data mesh

Week in the Life of an Analyst at Gartner US IT Symposium (virtual) 2021

Automate the archive and purge data process for Amazon RDS for PostgreSQL using pg_partman, Amazon S3, and AWS Glue

How Knowledge Graphs Power Data Mesh and Data Fabric

How Amazon Devices scaled and optimized real-time demand and supply forecasts using serverless analytics

Top Graph Use Cases and Enterprise Applications (with Real World Examples)

It’s not your data. It’s how you use it. Unlock the power of data & build foundations of a data driven organisation

The Right Tool to Support Your Microsoft Dynamics Migration

How DBAs can take on a more strategic role

Stay Connected