Data Lake and Seminar - Data Leaders Brief

Data Lake

Seminar

Orca Security’s journey to a petabyte-scale data lake with Apache Iceberg and AWS Analytics

AWS Big Data

JULY 20, 2023

With data becoming the driving force behind many industries today, having a modern data architecture is pivotal for organizations to be successful. In this post, we describe Orca’s journey building a transactional data lake using Amazon Simple Storage Service (Amazon S3), Apache Iceberg, and AWS Analytics.

Data Lake

Data Lake Analytics Snapshot Data Quality

What’s hard about AI? Operations!

Timo Elliott

APRIL 1, 2023

It’s not creating the models or doing the data science—it’s actually making it part of an operational process. Data scientists typically extract data from operational systems and move it to a hyperscalar data lake, then use open source algorithms to create and test their mdoels.

Data Lake

Data Lake Testing Data Science Modeling

Join 42,000+

Insiders

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Data Talks, CFOs Listen: Why Analytics Are Key To Better Spend Management

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Trending Sources

How Novanta’s CIO mobilized its data-driven transformation

CIO Business Intelligence

MAY 10, 2023

On investing in capabilities: We’ve set up something called a BI Center of Excellence where we train and have workshops and seminars on a monthly basis that team members across Novanta can join to learn about how they could leverage data marts or data sources to build their own reporting.

Data-driven

Data-driven IT Digital Transformation Data Governance

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Data Talks, CFOs Listen: Why Analytics Are Key To Better Spend Management

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

HEMA accelerates their data governance journey with Amazon DataZone

AWS Big Data

DECEMBER 19, 2024

Delta tables technical metadata is stored in the Data Catalog, which is a native source for creating assets in the Amazon DataZone business catalog. Access control is enforced using AWS Lake Formation , which manages fine-grained access control and data sharing on data lake data.

Data Governance

Data Governance Publishing Data-driven Metadata

How Aura from Unity revolutionized their big data pipeline with Amazon Redshift Serverless

AWS Big Data

APRIL 4, 2024

Amazon Redshift is a recommended service for online analytical processing (OLAP) workloads such as cloud data warehouses, data marts, and other analytical data stores. You can use simple SQL to analyze structured and semi-structured data, operational databases, and data lakes to deliver the best price/performance at any scale.

Big Data

Big Data Data Warehouse Advertising OLAP

Data Modeling 201 for the cloud: designing databases for data warehouses

erwin

JUNE 7, 2022

The first and most important thing to recognize and understand is the new and radically different target environment that you are now designing a data model for. Star schema: a data modeling and database design paradigm for data warehouses and data lakes. Business Focus. Operational. Operational Tactical.

Data Warehouse

Data Warehouse Modeling Sales Data Lake

Themes and Conferences per Pacoid, Episode 8

Domino Data Lab

APRIL 3, 2019

Fun fact: in the early 1980s I attended a Systems seminar by some of our department’s grad students and professors, plus their colleagues from a nearby university with bears, who presented about a thing called the “Stanford University Network” workstation. Somehow, the gravity of the data has a geological effect that forms data lakes.

Machine Learning

Machine Learning Data Governance Metadata Data Science

Orca Security’s journey to a petabyte-scale data lake with Apache Iceberg and AWS Analytics

What’s hard about AI? Operations!

Webinars

Trending Sources

How Novanta’s CIO mobilized its data-driven transformation

Webinars

HEMA accelerates their data governance journey with Amazon DataZone

How Aura from Unity revolutionized their big data pipeline with Amazon Redshift Serverless

Data Modeling 201 for the cloud: designing databases for data warehouses

Themes and Conferences per Pacoid, Episode 8

Stay Connected