Data Lake, Data-driven and Structured Data

Recap of Amazon Redshift key product announcements in 2024

AWS Big Data

DECEMBER 17, 2024

Amazon Redshift , launched in 2013, has undergone significant evolution since its inception, allowing customers to expand the horizons of data warehousing and SQL analytics. Industry-leading price-performance Amazon Redshift offers up to three times better price-performance than alternative cloud data warehouses.

Data Lake

Data Lake Data Warehouse Data-driven Optimization

Data Lakes on Cloud & it’s Usage in Healthcare

BizAcuity

MARCH 29, 2019

Data lakes are centralized repositories that can store all structured and unstructured data at any desired scale. The power of the data lake lies in the fact that it often is a cost-effective way to store data. Deploying Data Lakes in the cloud. Best practices to build a Data Lake.

Data Lake

Data Lake Unstructured Data Cost-Benefit Data Quality

Accelerate Amazon Redshift Data Lake queries with AWS Glue Data Catalog Column Statistics

AWS Big Data

OCTOBER 1, 2024

Amazon Redshift enables you to efficiently query and retrieve structured and semi-structured data from open format files in Amazon S3 data lake without having to load the data into Amazon Redshift tables. Amazon Redshift extends SQL capabilities to your data lake, enabling you to run analytical queries.

Data Lake

Data Lake Statistics Broadcasting Optimization

Webinars

Automation, Evolved: Your New Playbook For Smarter Knowledge Work

MORE WEBINARS

How EUROGATE established a data mesh architecture using Amazon DataZone

AWS Big Data

JANUARY 15, 2025

For container terminal operators, data-driven decision-making and efficient data sharing are vital to optimizing operations and boosting supply chain efficiency. Together, these capabilities enable terminal operators to enhance efficiency and competitiveness in an industry that is increasingly data driven.

IoT

IoT Machine Learning Metadata Data-driven

The Enduring Significance of Data Modeling in the Modern Data-Driven Enterprise

erwin

AUGUST 31, 2023

Q: Is data modeling cool again? In today’s fast-paced digital landscape, data reigns supreme. The data-driven enterprise relies on accurate, accessible, and actionable information to make strategic decisions and drive innovation. A: It always was and is getting cooler!!

Data-driven

Data-driven Modeling Enterprise Structured Data

Top analytics announcements of AWS re:Invent 2024

AWS Big Data

FEBRUARY 26, 2025

Analytics remained one of the key focus areas this year, with significant updates and innovations aimed at helping businesses harness their data more efficiently and accelerate insights. From enhancing data lakes to empowering AI-driven analytics, AWS unveiled new tools and services that are set to shape the future of data and analytics.

Analytics

Analytics Data Lake Metadata Data Warehouse

Rocket Mortgage lays foundation for generative AI success

CIO Business Intelligence

MARCH 29, 2024

Generative AI is becoming the virtual knowledge worker with the ability to connect different data points, summarize and synthesize insights in seconds, allowing us to focus on more high-value-add tasks,” says Ritu Jyoti, group vice president of worldwide AI and automation market research and advisory services at IDC. “It

Data Lake

Data Lake Machine Learning Data Warehouse Unstructured Data

How smava makes loans transparent and affordable using Amazon Redshift Serverless

AWS Big Data

DECEMBER 21, 2023

This is a guest post co-written by Alex Naumov, Principal Data Architect at smava. smava believes in and takes advantage of data-driven decisions in order to become the market leader. smava believes in and takes advantage of data-driven decisions in order to become the market leader.

Data Lake

Data Lake Data Warehouse Data-driven B2B

The Data Journey: From Raw Data to Insights

Sisense

JULY 22, 2020

We live in a world of data: there’s more of it than ever before, in a ceaselessly expanding array of forms and locations. Dealing with Data is your window into the ways organizations tackle the challenges of this new world to help their companies and their customers thrive. Understanding how data becomes insights.

Slice and Dice

Slice and Dice Digital Transformation Data Warehouse Data Lake

Harness Zero Copy data sharing from Salesforce Data Cloud to Amazon Redshift for Unified Analytics – Part 1

AWS Big Data

AUGUST 27, 2024

Director of Product, Salesforce Data Cloud. In today’s ever-evolving business landscape, organizations must harness and act on data to fuel analytics, generate insights, and make informed decisions to deliver exceptional customer experiences. What is Salesforce Data Cloud? What is Amazon Redshift?

Data Lake

Data Lake Analytics Data-driven Management

Generative AI: 5 enterprise predictions for AI and security — for 2023, 2024, and beyond

CIO Business Intelligence

OCTOBER 25, 2023

Our recent data analysis of AI/ML trends and usage confirms this: enterprises across industries have substantially increased their use of generative AI, across many kinds of AI tools. Once shared, this data can be fed into the data lakes used to train large language models (LLMs) and can be discovered by other users.

Enterprise

Enterprise Manufacturing Risk Data-driven

Migrate a petabyte-scale data warehouse from Actian Vectorwise to Amazon Redshift

AWS Big Data

MAY 30, 2024

Amazon Redshift is a fast, scalable, and fully managed cloud data warehouse that allows you to process and run your complex SQL analytics workloads on structured and semi-structured data. Both the API and data store had to support a highly volatile workload pattern.

Data Warehouse

Data Warehouse Data Lake Cost-Benefit Structured Data

Data Visualization and Visual Analytics: Seeing the World of Data

Sisense

JUNE 30, 2020

Everyone wants to get more out of their data, but how exactly to do that can leave you scratching your head. In a world increasingly dominated by data, users of all kinds are gathering, managing, visualizing, and analyzing data in a wide variety of ways. Data visualization: painting a picture of your data.

Visualization

Visualization Analytics Dashboards Data-driven

Enhance query performance using AWS Glue Data Catalog column-level statistics

AWS Big Data

NOVEMBER 22, 2023

Today, we’re making available a new capability of AWS Glue Data Catalog that allows generating column-level statistics for AWS Glue tables. Data lakes are designed for storing vast amounts of raw, unstructured, or semi-structured data at a low cost, and organizations share those datasets across multiple departments and teams.

Statistics

Statistics Data Lake Optimization Data-driven

Quantitative and Qualitative Data: A Vital Combination

Sisense

OCTOBER 6, 2020

Digging into quantitative data Why is quantitative data important What are the problems with quantitative data Exploring qualitative data Qualitative data benefits Getting the most from qualitative data Better together. Almost every modern organization is now a data-generating machine. or “how often?”

Statistics

Statistics Unstructured Data Data-driven Visualization

Exploring real-time streaming for generative AI Applications

AWS Big Data

MARCH 25, 2024

FMs are multimodal; they work with different data types such as text, video, audio, and images. Large language models (LLMs) are a type of FM and are pre-trained on vast amounts of text data and typically have application uses such as text generation, intelligent chatbots, or summarization.

Data Lake

Data Lake Unstructured Data Management Snapshot

Five Strategies to Accelerate Data Product Development

Cloudera

JULY 26, 2021

With this first article of the two-part series on data product strategies, I am presenting some of the emerging themes in data product development and how they inform the prerequisites and foundational capabilities of an Enterprise data platform that would serve as the backbone for developing successful data product strategies.

Strategy

Strategy Data Science Unstructured Data Marketing

Data platform trinity: Competitive or complementary?

IBM Big Data Hub

JANUARY 18, 2023

Data platform architecture has an interesting history. A read-optimized platform that can integrate data from multiple applications emerged. In another decade, the internet and mobile started the generate data of unforeseen volume, variety and velocity. It required a different data platform solution. It is too expensive.

Data Lake

Data Lake Data Warehouse Data-driven Metadata

Chose Both: Data Fabric and Data Lakehouse

Cloudera

SEPTEMBER 12, 2022

It sounds straightforward: you just need data and the means to analyze it. The data is there, in spades. Data volumes have been growing for years and are predicted to reach 175 ZB by 2025. First, organizations have a tough time getting their arms around their data. Unified data fabric. Yes and no.

Unstructured Data

Unstructured Data Data Architecture Data Lake Snapshot

Design a data mesh on AWS that reflects the envisioned organization

AWS Big Data

JANUARY 22, 2024

The company uses AWS Cloud services to build data-driven products and scale engineering best practices. To ensure a sustainable data platform amid growth and profitability phases, their tech teams adopted a decentralized data mesh architecture. The solution Acast implemented is a data mesh, architected on AWS.

Data-driven

Data-driven Advertising Metadata Data Architecture

Build an ETL process for Amazon Redshift using Amazon S3 Event Notifications and AWS Step Functions

AWS Big Data

AUGUST 31, 2023

Data warehousing provides a business with several benefits such as advanced business intelligence and data consistency. Nowadays, more verification steps are applied to source data before processing them which so often add an administration overhead.

Data Warehouse

Data Warehouse Data-driven Testing Business Intelligence

Empower Your Cyber Defenders with Real-Time Analytics

Cloudera

NOVEMBER 15, 2024

In fact, according to the Identity Theft Resource Center (ITRC) Annual Data Breach Report , there were 2,365 cyber attacks in 2023 with more than 300 million victims, and a 72% increase in data breaches since 2021. However, there is a fundamental challenge standing in the way of being successful: data.

Analytics

Analytics Metadata Snapshot Data-driven

Building Better Data Models to Unlock Next-Level Intelligence

Sisense

MAY 11, 2021

You can’t talk about data analytics without talking about data modeling. The reasons for this are simple: Before you can start analyzing data, huge datasets like data lakes must be modeled or transformed to be usable. Building the right data model is an important part of your data strategy.

Modeling

Modeling Big Data IoT Data Warehouse

Non-JSON ingestion using Amazon Kinesis Data Streams, Amazon MSK, and Amazon Redshift Streaming Ingestion

AWS Big Data

OCTOBER 2, 2023

Organizations are grappling with the ever-expanding spectrum of data formats in today’s data-driven landscape. From Avro’s binary serialization to the efficient and compact structure of Protobuf, the landscape of data formats has expanded far beyond the traditional realms of CSV and JSON.

Cost-Benefit

Cost-Benefit Metadata Structured Data Data-driven

Simplify and speed up Apache Spark applications on Amazon Redshift data with Amazon Redshift integration for Apache Spark

AWS Big Data

APRIL 20, 2023

Customers use Amazon Redshift to run their business-critical analytics on petabytes of structured and semi-structured data. Apache Spark enables you to build applications in a variety of languages, such as Java, Scala, and Python, by accessing the data in your Amazon Redshift data warehouse.

Data Lake

Data Lake Data Warehouse Sales Data-driven

How SumUp made digital analytics more accessible using AWS Glue

AWS Big Data

JUNE 6, 2023

SumUp is a leading global financial technology company driven by the purpose of leveling the playing field for small businesses. Unless, of course, the rest of their data also resides in the Google Cloud. AWS Glue gave us a cost-efficient option to migrate the data and we further optimized storage cost by pruning cold data.

Analytics

Analytics Data Lake Testing Optimization

Why optimize your warehouse with a data lakehouse strategy

IBM Big Data Hub

APRIL 25, 2023

In a prior blog , we pointed out that warehouses, known for high-performance data processing for business intelligence, can quickly become expensive for new data and evolving workloads. To do so, Presto and Spark need to readily work with existing and modern data warehouse infrastructures. Some use case examples will help.

Optimization

Optimization Strategy Data Warehouse Cost-Benefit

How Aura from Unity revolutionized their big data pipeline with Amazon Redshift Serverless

AWS Big Data

APRIL 4, 2024

Amazon Redshift is a recommended service for online analytical processing (OLAP) workloads such as cloud data warehouses, data marts, and other analytical data stores. You can use simple SQL to analyze structured and semi-structured data, operational databases, and data lakes to deliver the best price/performance at any scale.

Big Data

Big Data Data Warehouse Advertising OLAP

Advancing AI: The emergence of a modern information lifecycle

CIO Business Intelligence

DECEMBER 4, 2023

Although less complex than the “4 Vs” of big data (velocity, veracity, volume, and variety), orienting to the variety and volume of a challenging puzzle is similar to what CIOs face with information management. Operationalizing data to drive revenue CIOs report that their roles are rising in importance and impact. What’s changed?

Unstructured Data

Unstructured Data Data Lake Business Objectives Metadata

Create an end-to-end data strategy for Customer 360 on AWS

AWS Big Data

MARCH 26, 2024

This view is used to identify patterns and trends in customer behavior, which can inform data-driven decisions to improve business outcomes. In this post, we discuss how you can use purpose-built AWS services to create an end-to-end data strategy for C360 to unify and govern customer data that address these challenges.

Data Strategy

Data Strategy Strategy Data Warehouse Prescriptive Analytics

Using Artificial Intelligence to Make Sense of IoT Data

BizAcuity

MARCH 1, 2019

IoT is basically an exchange of data or information in a connected or interconnected environment. As IoT devices generate large volumes of data, AI is functionally necessary to make sense of this data. Data is only useful when it is actionable for which it needs to be supplemented with context and creativity.

IoT

IoT Internet of Things Big Data Data-driven

Why You Need a Data Catalog & How to Choose One

Octopai

MAY 30, 2019

If the point of Business Intelligence (BI) data governance is to leverage your datasets to support information transparency and decision-making, then it’s fair to say that the data catalog is key for your BI strategy. At least, as far as data analysis is concerned. The Benefits of Structured Data Catalogs.

Metadata

Metadata Data Governance Data Lake IoT

In-depth with CDO Christopher Bannocks

Peter James Thomas

AUGUST 29, 2018

Today I am talking to Christopher Bannocks , who is Group Chief Data Officer at ING. As stressed in other recent In-depth interviews [1] , data is a critical asset in banking and related activities, so Christopher’s role is a pivotal one. 2] I was asked to help solve the data problem.

Data-driven

Data-driven Cost-Benefit Metadata Technology

Top Graph Use Cases and Enterprise Applications (with Real World Examples)

Ontotext

MARCH 8, 2023

Gartner predicts that graph technologies will be used in 80% of data and analytics innovations by 2025, up from 10% in 2021. Use Case #1: Customer 360 / Enterprise 360 Customer data is typically spread across multiple applications, departments, and regions. Several factors are driving the adoption of knowledge graphs. million users.

Enterprise

Enterprise Knowledge Discovery Risk Machine Learning

Data Swamp, Data Lake, Data Lakehouse: What to Know

Alation

OCTOBER 21, 2021

Data Swamp vs Data Lake. When you imagine a lake, it’s likely an idyllic image of a tree-ringed body of reflective water amid singing birds and dabbling ducks. I’ll take the lake, thank you very much. I’ll take the lake, thank you very much. And so will your data. Benefits of a Data Lake.

Data Lake

Data Lake Metadata Data Warehouse Data Governance

Interview with Dominic Sartorio, Senior Vice President for Products & Development, Protegrity

Corinium

APRIL 25, 2019

Ahead of the Chief Data Analytics Officers & Influencers, Insurance event we caught up with Dominic Sartorio, Senior Vice President for Products & Development, Protegrity to discuss how the industry is evolving. The last 10+ years or so have seen Insurance become as data-driven as any vertical industry.

Insurance

Insurance Risk IoT Data-driven

What is a Data Pipeline?

Jet Global

MAY 9, 2024

A data pipeline is a series of processes that move raw data from one or more sources to one or more destinations, often transforming and processing the data along the way. Data pipelines support data science and business intelligence projects by providing data engineers with high-quality, consistent, and easily accessible data.

Data Lake

Data Lake Data Warehouse Business Intelligence Machine Learning

Key takeaways for CIOs from AWS re:Invent 2024

CIO Business Intelligence

DECEMBER 9, 2024

Streamlining workflows and boosting productivity The next set of re:Invent announcements focused on streamlining workflows for enterprises and helping businesses boost the productivity of developers and data professionals. AWS announced that it will unify analytics and AI services under its SageMaker service.

Metadata

Metadata Unstructured Data Data Lake Data-driven

Accelerate queries on Apache Iceberg tables through AWS Glue auto compaction

AWS Big Data

DECEMBER 19, 2024

Data lakes were originally designed to store large volumes of raw, unstructured, or semi-structured data at a low cost, primarily serving big data and analytics use cases. Enabling automatic compaction on Iceberg tables reduces metadata overhead on your Iceberg tables and improves query performance.

Data Lake

Data Lake IoT Metadata Testing

Empower Your Cyber Defenders with Real-Time Analytics Author: Carolyn Duby, Field CTO

Cloudera

NOVEMBER 15, 2024

In fact, according to the Identity Theft Resource Center (ITRC) Annual Data Breach Report , there were 2,365 cyber attacks in 2023 with more than 300 million victims, and a 72% increase in data breaches since 2021. However, there is a fundamental challenge standing in the way of being successful: data.

Analytics

Analytics Metadata Snapshot Data-driven

Is Your Data Catalog Ready for the AI Age?

BI-Survey

FEBRUARY 27, 2025

A Checklist to Challenge Your Vendor At first glance, data catalogs might seem like straightforward tools for organizing information an apparently mundane task. However, a closer look reveals that these systems are far more than simple repositories: Data catalogs are at the forefront of bringing AI into your business for at least two reasons.

Unstructured Data

Unstructured Data Metadata Data Quality Data Governance

Modernize your legacy databases with AWS data lakes, Part 3: Build a data lake processing layer

AWS Big Data

OCTOBER 30, 2024

This is the final part of a three-part series where we show how to build a data lake on AWS using a modern data architecture. This post shows how to process data with Amazon Redshift Spectrum and create the gold (consumption) layer. The following diagram illustrates the different layers of the data lake.

Data Lake

Data Lake Machine Learning Data Architecture Data-driven

Ingest telemetry messages in near real time with Amazon API Gateway, Amazon Data Firehose, and Amazon Location Service

AWS Big Data

NOVEMBER 14, 2024

Key services in the solution include Amazon API Gateway , Amazon Data Firehose , and Amazon Location Service. The challenge In the event of a disaster e.g. water flood, there is usually a lack of terrestrial data connectivity that prevents monitoring stations from taking actionable measures in real time.

Data Lake

Data Lake Metadata Testing Data-driven

Recap of Amazon Redshift key product announcements in 2024

Data Lakes on Cloud & it’s Usage in Healthcare

Webinars

Trending Sources

Accelerate Amazon Redshift Data Lake queries with AWS Glue Data Catalog Column Statistics

Webinars

How EUROGATE established a data mesh architecture using Amazon DataZone

The Enduring Significance of Data Modeling in the Modern Data-Driven Enterprise

Top analytics announcements of AWS re:Invent 2024

Rocket Mortgage lays foundation for generative AI success

How smava makes loans transparent and affordable using Amazon Redshift Serverless

The Data Journey: From Raw Data to Insights

Harness Zero Copy data sharing from Salesforce Data Cloud to Amazon Redshift for Unified Analytics – Part 1

Generative AI: 5 enterprise predictions for AI and security — for 2023, 2024, and beyond

Migrate a petabyte-scale data warehouse from Actian Vectorwise to Amazon Redshift

Data Visualization and Visual Analytics: Seeing the World of Data

Enhance query performance using AWS Glue Data Catalog column-level statistics

Quantitative and Qualitative Data: A Vital Combination

Exploring real-time streaming for generative AI Applications

Five Strategies to Accelerate Data Product Development

Data platform trinity: Competitive or complementary?

Chose Both: Data Fabric and Data Lakehouse

Design a data mesh on AWS that reflects the envisioned organization

Build an ETL process for Amazon Redshift using Amazon S3 Event Notifications and AWS Step Functions

Empower Your Cyber Defenders with Real-Time Analytics

Building Better Data Models to Unlock Next-Level Intelligence

Non-JSON ingestion using Amazon Kinesis Data Streams, Amazon MSK, and Amazon Redshift Streaming Ingestion

Simplify and speed up Apache Spark applications on Amazon Redshift data with Amazon Redshift integration for Apache Spark

How SumUp made digital analytics more accessible using AWS Glue

Why optimize your warehouse with a data lakehouse strategy

How Aura from Unity revolutionized their big data pipeline with Amazon Redshift Serverless

Advancing AI: The emergence of a modern information lifecycle

Create an end-to-end data strategy for Customer 360 on AWS

Using Artificial Intelligence to Make Sense of IoT Data

Why You Need a Data Catalog & How to Choose One

In-depth with CDO Christopher Bannocks

Top Graph Use Cases and Enterprise Applications (with Real World Examples)

Data Swamp, Data Lake, Data Lakehouse: What to Know

Interview with Dominic Sartorio, Senior Vice President for Products & Development, Protegrity

What is a Data Pipeline?

Key takeaways for CIOs from AWS re:Invent 2024

Accelerate queries on Apache Iceberg tables through AWS Glue auto compaction

Empower Your Cyber Defenders with Real-Time Analytics Author: Carolyn Duby, Field CTO

Is Your Data Catalog Ready for the AI Age?

Modernize your legacy databases with AWS data lakes, Part 3: Build a data lake processing layer

Ingest telemetry messages in near real time with Amazon API Gateway, Amazon Data Firehose, and Amazon Location Service

Stay Connected