Choosing Your Upgrade or Migration Path to Cloudera Data Platform

Cloudera

In our previous blog, we talked about the four paths to Cloudera Data Platform. If you haven’t read that yet, we invite you to take a moment and run through the scenarios in that blog. As we touched on in the previous blog, the decision to upgrade or migrate may seem difficult to evaluate at first glance. In-place Upgrade.

Real-time inference using deep learning within Amazon Kinesis Data Analytics for Apache Flink

AWS Big Data

Apache Flink is a framework and distributed processing engine for stateful computations over data streams. Amazon Kinesis Data Analytics for Apache Flink is a fully managed service that enables you to use an Apache Flink application to process streaming data. Window the images into a collection of records.

Use Apache Iceberg in a data lake to support incremental data processing

AWS Big Data

Apache Iceberg is an open table format for very large analytic datasets, which captures metadata information on the state of datasets as they evolve and change over time. It adds tables to compute engines including Spark, Trino, PrestoDB, Flink, and Hive using a high-performance table format that works just like a SQL table.

5 Key Takeaways from #Current2023

Cloudera

This blog is for anyone who was interested but unable to attend the conference, or anyone interested in a quick summary of what happened there. Flink is here to stay. It makes perfect sense that Apache Flink has emerged as the standard. I will cover key takeaways from Current 2023 and offer Cloudera’s perspective.

Streaming Market Data with Flink SQL Part II: Intraday Value-at-Risk

Cloudera

Flink SQL is a data processing language that enables rapid prototyping and development of event-driven and streaming applications. Flink SQL combines the performance and scalability of Apache Flink, a popular distributed streaming platform, with the simplicity and accessibility of SQL. You can view the code here.

An Overview of Real Time Data Warehousing on Cloudera

Cloudera

As an example of this, in this post we look at Real Time Data Warehousing (RTDW), a category of use cases that is becoming more and more common amongst our customers building on Cloudera. Deep Dive into General Purpose RTDW, featuring Apache Kudu, Apache Impala, and Apache NiFi.

Melting the ice — How Natural Intelligence simplified a data lake migration to Apache Iceberg

AWS Big Data

However, migrating an existing data lake to a new table format such as Apache Iceberg can bring significant technical and organizational challenges. Natural Intelligence (NI) is a world leader in multi-category marketplaces. Recently, NI embarked on a journey to transition their legacy data lake from Apache Hive to Apache Iceberg.