Apache Iceberg addresses many of the shortcomings of traditional data lakes by providing features such as ACID transactions, schema evolution, row-level updates and deletes, and time travel. In this blog post, we’ll discuss how the metadata layer of Apache Iceberg can be used to make data lakes more efficient.
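A minimal sketch of those features, assuming PySpark with the Iceberg Spark runtime on the classpath; the catalog name "demo", the table, and the sample data are illustrative assumptions, not from the original post.

```python
from pyspark.sql import SparkSession

# Assumed local Hadoop-type Iceberg catalog named "demo"; adjust for your environment.
spark = (
    SparkSession.builder
    .appName("iceberg-metadata-demo")
    .config("spark.sql.extensions",
            "org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions")
    .config("spark.sql.catalog.demo", "org.apache.iceberg.spark.SparkCatalog")
    .config("spark.sql.catalog.demo.type", "hadoop")
    .config("spark.sql.catalog.demo.warehouse", "/tmp/iceberg-warehouse")
    .getOrCreate()
)

spark.sql("CREATE NAMESPACE IF NOT EXISTS demo.db")
spark.sql("CREATE TABLE IF NOT EXISTS demo.db.orders (id BIGINT, region STRING) USING iceberg")
spark.sql("INSERT INTO demo.db.orders VALUES (1, 'EU'), (2, 'US')")

# Schema evolution: only table metadata changes, no data files are rewritten.
spark.sql("ALTER TABLE demo.db.orders ADD COLUMN discount DOUBLE")

# Row-level update, committed as a new snapshot in an ACID transaction.
spark.sql("UPDATE demo.db.orders SET discount = 0.1 WHERE region = 'EU'")

# The metadata layer records every snapshot, which is what enables time travel.
spark.sql("SELECT snapshot_id, committed_at FROM demo.db.orders.snapshots").show()
```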
Any interaction between the two (e.g., a financial transaction in a financial database) would be flagged by the authorities, and the interactions would come under great scrutiny. Any node and its relationship to a particular node become a type of contextual metadata for that particular node.
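As a toy illustration (not from the original article), the sketch below uses networkx with made-up account nodes; the neighbors of a node, together with the edge attributes linking them, are exactly the kind of contextual metadata described above.

```python
import networkx as nx

g = nx.MultiDiGraph()
# Hypothetical entities and relationships for illustration only.
g.add_edge("Account_123", "Account_987", relation="wire_transfer", amount=25_000)
g.add_edge("Account_123", "Person_A", relation="owned_by")

# Every adjacent node and its connecting edge describe "Account_123" in context.
for _, neighbor, attrs in g.out_edges("Account_123", data=True):
    print(neighbor, attrs)
```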
“But to us, it’s more than just having a data strategy; it’s also about building a great foundation of a data culture.” That’s where Tableau sees Pulse and Einstein Copilot for Tableau — a generative AI assistant that gives users the ability to interact with Tableau using natural language — coming in.
Advanced analytics and enterprise data empower companies not only to have a completely transparent view of the movement of materials and products within their line of sight, but also to leverage data from their suppliers for a holistic view two to three tiers deep into the supply chain.
Greater visibility of data is also required for businesses to be able to determine the nature of a document in order to understand, for example, whether it is confidential information, a work product, or an HR document. Getting full visibility of data enables businesses to put in place a defensible data management process.
At IBM, we believe it is time to place the power of AI in the hands of all kinds of “AI builders” — from data scientists to developers to everyday users who have never written a single line of code. Watsonx, IBM’s next-generation AI platform, is designed to do just that.
One of the first steps in any digital transformation journey is to understand what data assets exist in the organization. When we began, we had a very technical and archaic tool, an enterprise metadata management platform that cataloged our assets, and it was terribly complex. The people behind the data are key.
But there was a better way: enter the Hive Metastore, one of the sleeper hits of the data platform of the last decade. As use cases matured, we saw the need for both efficient, interactive BI analytics and transactional semantics to modify data. Those needs drove successive iterations of the lakehouse.
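A minimal sketch of that pattern, assuming a Hive Metastore reachable at a placeholder thrift endpoint: the metastore holds the table's schema, location, and partitions, while the data itself stays as files in the lake, visible to any engine that shares the same metastore.

```python
from pyspark.sql import SparkSession

spark = (
    SparkSession.builder
    .appName("hive-metastore-demo")
    .config("hive.metastore.uris", "thrift://metastore-host:9083")  # assumed endpoint
    .enableHiveSupport()
    .getOrCreate()
)

spark.sql("CREATE DATABASE IF NOT EXISTS analytics")

# Only the table definition lives in the metastore; the Parquet files stay in the lake.
spark.sql("""
    CREATE TABLE IF NOT EXISTS analytics.page_views (
        user_id STRING, url STRING, ts TIMESTAMP, dt STRING
    )
    USING parquet
    PARTITIONED BY (dt)
""")
```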
With these techniques, you can enhance the processing speed and accessibility of your XML data, enabling you to derive valuable insights with ease. Process and transform XML data into a format (like Parquet) suitable for Athena using an AWS Glue extract, transform, and load (ETL) job.
The AWS Glue job can transform the raw data in Amazon S3 to Parquet format, which is optimized for analytic queries. The AWS Glue Data Catalog stores the metadata, and Amazon Athena (a serverless query engine) is used to query data in Amazon S3.
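A condensed sketch of such a Glue ETL job is below; the bucket paths and the rowTag value are placeholders, and the walkthrough's crawler and Data Catalog setup are omitted for brevity.

```python
import sys
from awsglue.context import GlueContext
from awsglue.job import Job
from awsglue.utils import getResolvedOptions
from pyspark.context import SparkContext

args = getResolvedOptions(sys.argv, ["JOB_NAME"])
glue_context = GlueContext(SparkContext.getOrCreate())
job = Job(glue_context)
job.init(args["JOB_NAME"], args)

# Read raw XML from S3; "record" stands in for the repeating element in your files.
dyf = glue_context.create_dynamic_frame.from_options(
    connection_type="s3",
    connection_options={"paths": ["s3://my-raw-bucket/xml/"]},
    format="xml",
    format_options={"rowTag": "record"},
)

# Write Parquet, which Athena can query efficiently via the Glue Data Catalog.
glue_context.write_dynamic_frame.from_options(
    frame=dyf,
    connection_type="s3",
    connection_options={"path": "s3://my-curated-bucket/parquet/"},
    format="parquet",
)
job.commit()
```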
It involves specifying individual components, such as objects and their attributes, as well as rules and restrictions governing their interactions. Another capability of knowledge graphs that contributes to improved search and discoverability is that they can integrate and index multiple forms of data and associated metadata.
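As a small, hypothetical illustration of that integration (rdflib, with made-up URIs and predicates), a document, an image, and structured metadata can all be indexed against the same entity and then queried together.

```python
from rdflib import Graph, Literal, Namespace

EX = Namespace("http://example.org/")
g = Graph()

product = EX.Product_42
g.add((product, EX.describedBy, EX.SpecSheet_pdf))           # a document
g.add((product, EX.depictedIn, EX.Photo_jpg))                 # an image
g.add((product, EX.hasAttribute, Literal("weight: 1.2 kg")))  # structured metadata

# Query across the linked assets with SPARQL.
for p, o in g.query("SELECT ?p ?o WHERE { ?s ?p ?o }"):
    print(p, o)
```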
Streaming data facilitates the constant flow of diverse and up-to-date information, enhancing the models’ ability to adapt and generate more accurate, contextually relevant outputs. AWS Glue can interact with streaming data services such as Kinesis Data Streams and Amazon MSK for processing and transforming CDC data.
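A minimal sketch of a Glue streaming job consuming CDC records from a Kinesis data stream registered in the Glue Data Catalog; the database, table, and S3 paths are placeholders.

```python
from awsglue.context import GlueContext
from awsglue.dynamicframe import DynamicFrame
from pyspark.context import SparkContext

glue_context = GlueContext(SparkContext.getOrCreate())

# The catalog table points at a Kinesis data stream carrying CDC events.
stream_df = glue_context.create_data_frame.from_catalog(
    database="cdc_db",
    table_name="orders_stream",
    additional_options={"startingPosition": "TRIM_HORIZON", "inferSchema": "true"},
)

def process_batch(batch_df, batch_id):
    # Land each micro-batch in the lake for downstream models to pick up.
    dyf = DynamicFrame.fromDF(batch_df, glue_context, "cdc_batch")
    glue_context.write_dynamic_frame.from_options(
        frame=dyf,
        connection_type="s3",
        connection_options={"path": "s3://my-lake/cdc/orders/"},
        format="parquet",
    )

glue_context.forEachBatch(
    frame=stream_df,
    batch_function=process_batch,
    options={
        "windowSize": "60 seconds",
        "checkpointLocation": "s3://my-lake/checkpoints/orders/",
    },
)
```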
In our modern data and analytics strategy and operating model, a PM methodology plays a key enabling role in delivering solutions. Do you draw a distinction between a data-driven vision and a data-enabled vision, and if so, what is that distinction? I didn’t mean to imply this.
Amazon EMR has long been the industry-leading big data solution for petabyte-scale data processing, interactive analytics, and machine learning in the cloud, using over 20 open source frameworks such as Apache Hadoop, Hive, and Apache Spark.
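For a concrete taste, here is a minimal boto3 sketch of submitting a Spark step to an existing EMR cluster; the cluster ID, script path, and region are placeholders.

```python
import boto3

emr = boto3.client("emr", region_name="us-east-1")

response = emr.add_job_flow_steps(
    JobFlowId="j-XXXXXXXXXXXXX",  # assumed cluster ID
    Steps=[
        {
            "Name": "nightly-aggregation",
            "ActionOnFailure": "CONTINUE",
            "HadoopJarStep": {
                "Jar": "command-runner.jar",
                "Args": ["spark-submit", "s3://my-bucket/jobs/aggregate.py"],
            },
        }
    ],
)
print(response["StepIds"])
```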