As an essential part of ETL, as data is consolidated, we notice that data from different sources is structured in different formats. It may be necessary to enhance, sanitize, and prepare the data so that it is fit for consumption by the SQL engine. What is a data transformation?
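To make that concrete, here is a minimal sketch of the kind of sanitize-and-prepare step described above, using Pandas on a hypothetical orders.csv (the file and column names are illustrative assumptions, not from a specific source):

```python
import pandas as pd

# Hypothetical input; the file and column names are illustrative only.
raw = pd.read_csv("orders.csv")

# Sanitize: normalize column names, trim stray whitespace, drop duplicates.
raw.columns = [c.strip().lower().replace(" ", "_") for c in raw.columns]
raw["customer_name"] = raw["customer_name"].str.strip()
raw = raw.drop_duplicates()

# Prepare: coerce types so the SQL engine sees consistent formats.
raw["order_date"] = pd.to_datetime(raw["order_date"], errors="coerce")
raw["amount"] = pd.to_numeric(raw["amount"], errors="coerce").fillna(0.0)

# Write a clean, load-ready dataset.
raw.to_csv("orders_clean.csv", index=False)
```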
Plug-and-play integration: A seamless, plug-and-play integration between data producers and consumers should facilitate rapid use of new data sets and enable quick proofs of concept, such as in data science teams. As part of the required data, CHE data is shared using Amazon DataZone.
For instance, Domain A will have the flexibility to create data products that can be published to the divisional catalog, while also maintaining the autonomy to develop data products that are exclusively accessible to teams within the domain. Consumer feedback and demand drive the creation and maintenance of the data product.
Developers can use the support in Amazon Location Service for publishing device position updates to Amazon EventBridge to build a near-real-time data pipeline that stores locations of tracked assets in Amazon Simple Storage Service (Amazon S3). In this model, the Lambda function is invoked for each incoming event.
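As a rough sketch of what that per-event Lambda function might look like (the bucket name and event detail fields are assumptions for illustration, not the article's actual schema):

```python
import json
import os
import boto3

s3 = boto3.client("s3")
BUCKET = os.environ.get("BUCKET_NAME", "tracked-assets-example")  # assumed name

def handler(event, context):
    # EventBridge invokes this function once per position update in this model.
    # The detail keys below are illustrative; check the Amazon Location Service
    # event schema for the exact field names.
    detail = event.get("detail", {})
    device_id = detail.get("DeviceId", "unknown")
    key = f"positions/{device_id}/{event.get('time', 'unknown')}.json"
    s3.put_object(Bucket=BUCKET, Key=key, Body=json.dumps(detail).encode("utf-8"))
    return {"stored": key}
```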
There are countless examples of big data transforming many different industries. There is no disputing the fact that the collection and analysis of massive amounts of unstructured data has been a huge breakthrough. Data virtualization is ideal in any situation where the following is necessary: information coming from diverse data sources.
We used AWS Step Functions state machines to define, orchestrate, and execute our data pipelines. Amazon EventBridge: We used Amazon EventBridge, the serverless event bus service, to define the event-based rules and schedules that trigger our AWS Step Functions state machines.
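For illustration, a scheduled EventBridge rule that triggers a state machine could be wired up like this with boto3 (the names and ARNs are hypothetical placeholders):

```python
import boto3

events = boto3.client("events")

# Hypothetical names/ARNs for illustration only.
RULE_NAME = "nightly-pipeline-trigger"
STATE_MACHINE_ARN = "arn:aws:states:us-east-1:123456789012:stateMachine:etl-pipeline"
ROLE_ARN = "arn:aws:iam::123456789012:role/eventbridge-invoke-sfn"

# A scheduled rule: run the pipeline every night at 02:00 UTC.
events.put_rule(
    Name=RULE_NAME,
    ScheduleExpression="cron(0 2 * * ? *)",
    State="ENABLED",
)

# Point the rule at the Step Functions state machine.
events.put_targets(
    Rule=RULE_NAME,
    Targets=[{"Id": "etl-target", "Arn": STATE_MACHINE_ARN, "RoleArn": ROLE_ARN}],
)
```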
Traditionally, such a legacy call center analytics platform would be built on a relational database that stores data from streaming sources. Data transformation through stored procedures, and the use of materialized views to curate datasets and generate insights, is a known pattern with relational databases.
Build data validation rules directly into ingestion layers so that bad data is stopped at the gate rather than detected after the damage is done. Use lineage tooling to trace data from source to report. Understanding how data transforms and where it breaks is crucial for auditability and root-cause resolution.
Once a draft has been created or opened, developers use the visual Designer to build their data flow logic and validate it using interactive test sessions. Managing drafts outside the Catalog keeps a clean distinction between phases of the development cycle, leaving only those flows that are ready for deployment published in the Catalog.
It’s because it’s a hard thing to accomplish when there are so many teams, locales, data sources, pipelines, dependencies, data transformations, models, visualizations, tests, internal customers, and external customers. That data then fills several database tables. It’s not just a fear of change.
One of the main challenges when dealing with streaming data comes from performing stateful transformations for individual events. Unlike a batch processing job that runs within an isolated batch with clear start and end times, a stream processing job runs continuously on each event separately.
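A toy example makes the difference clear: the state below lives across events rather than within a batch. Real stream processors such as Apache Flink manage this state durably with checkpoints; this in-memory sketch only illustrates the shape of the problem:

```python
from collections import defaultdict

# Running per-key state that persists across events (no batch boundaries).
state = defaultdict(lambda: {"count": 0, "total": 0.0})

def process(event: dict) -> dict:
    s = state[event["device_id"]]
    s["count"] += 1
    s["total"] += event["value"]
    # Emit an enriched event carrying the running average for its key.
    return {**event, "running_avg": s["total"] / s["count"]}

for evt in [{"device_id": "a", "value": 2.0}, {"device_id": "a", "value": 4.0}]:
    print(process(evt))  # running_avg: 2.0, then 3.0
```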
Developers need to onboard new data sources, chain multiple data transformation steps together, and explore data as it travels through the flow. This allows developers to make changes to their processing logic on the fly while running some test data through their flow and validating that their changes work as intended.
This means there are no unintended data errors, and the data corresponds to its appropriate designation (e.g., date, month, and year). Here, it all comes down to the data transformation error rate. Data time-to-value: evaluates how long it takes you to gain insights from a data set.
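The error-rate metric itself is simple arithmetic: failed records divided by total records processed. A minimal sketch, assuming each record carries a boolean failure flag (the field name is an illustrative assumption):

```python
def transformation_error_rate(records: list[dict]) -> float:
    # Fraction of records that failed transformation; 'failed' is an
    # assumed flag set by the pipeline, not a standard field.
    if not records:
        return 0.0
    failed = sum(1 for r in records if r.get("failed"))
    return failed / len(records)

sample = [{"failed": False}, {"failed": True}, {"failed": False}, {"failed": False}]
print(transformation_error_rate(sample))  # 0.25
```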
On many occasions, they need to apply business logic to the data received from the source SaaS platform before pushing it to the target SaaS platform. Let’s take an example. AnyCompany’s marketing team hosted an event at the Anaheim Convention Center, CA. The marketing team created leads based on the event in Adobe Marketo.
Under the Transparency in Coverage (TCR) rule, hospitals and payors are required to publish their pricing data in a machine-readable format. Due to this low complexity, the solution uses AWS serverless services to ingest the data, transform it, and make it available for analytics.
Kinesis Data Firehose is a fully managed service for delivering near-real-time streaming data to various destinations for storage and performing near-real-time analytics. You can perform analytics on VPC flow logs delivered from your VPC using the Kinesis Data Firehose integration with Datadog as a destination.
DataBrew is a visual data preparation tool that enables you to clean and normalize data without writing any code. The over 200 transformations it provides are now available to be used in an AWS Glue Studio visual job. Now that you have addressed all data quality issues identified on the sample, publish the project as a recipe.
Cloudera users can securely connect Rill to a source of event stream data, such as Cloudera DataFlow, model data into Rill’s cloud-based Druid service, and share live operational dashboards within minutes via Rill’s interactive metrics dashboard or any connected BI solution.
Customers rely on data from different sources such as mobile applications, clickstream events from websites, historical data, and more to deduce meaningful patterns to optimize their products, services, and processes. The transformed data is then made accessible to Snowflake for data analysis. Choose Next.
At this stage, CFM data scientists can perform analytics and extract value from raw data. Resulting datasets are then published to our data mesh service across our organization to allow our scientists to work on prediction models.
It has been well documented since the State of DevOps 2019 DORA metrics were published that with DevOps, companies can deploy software 208 times more often and 106 times faster, recover from incidents 2,604 times faster, and release 7 times fewer defects. Finally, data integrity is of paramount importance.
Few actors in the modern data stack have inspired as much enthusiasm and fervent support as dbt. This data transformation tool enables data analysts and engineers to transform, test, and document data in the cloud data warehouse. But what does this mean from a practitioner's perspective?
Kinesis Data Analytics for Apache Flink: In our example, we perform the following actions on the streaming data: connect to an Amazon Kinesis Data Streams data stream, view the stream data, transform and enrich the data, and manipulate the data with Python. Provide the following SQL statement.
I think artists can relax. Economy.bg: What about journalists? Milena Yankova: What we did for the BBC in the previous Olympics was that we helped journalists publish their reports faster. We minimized the time between the event (and what the journalist wanted to say about it) and the moment the reader or viewer could consume it.
In this article, we discuss how this data is accessed, an example environment and set-up to be used for data processing, sample lines of Python code to show the simplicity of data transformations using Pandas, and how this simple architecture can enable you to unlock new insights from this data yourself.
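In that spirit, a few lines of Pandas are often all a transformation takes. The records below are invented stand-ins, not the article's dataset:

```python
import pandas as pd

# Invented sample records standing in for the real dataset.
df = pd.DataFrame({
    "timestamp": pd.to_datetime(["2024-01-01", "2024-01-01", "2024-01-02"]),
    "region": ["us-east", "us-west", "us-east"],
    "requests": [120, 80, 95],
})

# One idiomatic transformation: daily request totals per region.
daily = (
    df.groupby([df["timestamp"].dt.date, "region"])["requests"]
      .sum()
      .unstack(fill_value=0)
)
print(daily)
```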
You simply configure your data sources to send information to OpenSearch Ingestion, which then automatically delivers the data to your specified destination. Additionally, you can configure OpenSearch Ingestion to apply data transformations before delivery. This allows for easy access and analysis of these events.
This field guide to data mapping will explore how data mapping connects volumes of data for enhanced decision-making. Why Data Mapping is Important: Data mapping is a critical element of any data management initiative, such as data integration, data migration, data transformation, data warehousing, or automation.
Data Extraction: The process of gathering data from disparate sources, each of which may have its own schema defining the structure and format of the data, and making it available for processing. This can include tasks such as data ingestion, cleansing, filtering, aggregation, or standardization.
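A small sketch of that extraction-and-standardization step, assuming two hypothetical sources with different schemas for the same entity:

```python
import json
import pandas as pd

# Source 1: a CSV export with columns id, full_name (hypothetical).
csv_part = pd.read_csv("customers_eu.csv")

# Source 2: a JSON dump, a list of objects with customer_id, name (hypothetical).
with open("customers_us.json") as f:
    json_part = pd.DataFrame(json.load(f))

# Standardize both to one schema before downstream processing.
csv_part = csv_part.rename(columns={"id": "customer_id", "full_name": "name"})
extracted = pd.concat([csv_part, json_part], ignore_index=True)
extracted = extracted.drop_duplicates(subset="customer_id")
```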
These tools excel at data integration, consolidating information from various financial systems (ERP, CRM, legacy) into a central hub. This eliminates data fragmentation, a major obstacle for AI. Additionally, they provide robust data transformation capabilities.
Trino allows users to run ad hoc queries across massive datasets, making real-time decision-making a reality without needing extensive data transformations. This is particularly valuable for teams that require instant answers from their data. Data Lake Analytics: Trino doesn’t just stop at databases.
Complex Data Structures and Integration Processes: Dynamics data structures are already complex; finance teams navigating Dynamics data frequently require IT department support to complete their routine reporting.
Speed time to market with faster data migration and easier data transformation. Wands for SAP: Wands for SAP empowers your finance team to leverage their existing Excel skills to streamline data entry and drive efficiencies in your month-end process. Time-to-value acceleration: quick installation.
It streamlines data integration, ensures real-time access to accurate information, enhances collaboration, and provides the flexibility needed to adapt to evolving ERP systems and business requirements. Data transformation ensures that the data aligns with the requirements of the new cloud ERP system.
The alternative to BICC is BI Publisher (BIP). While BIP reports can be generated in different output formats, including Excel files, BIP is intended not as a data extraction tool but rather as a reporting tool. Quickly combine data from a variety of sources into a singular data warehouse and a set of dimensional cubes or tabular models.
By providing a consistent and stable backend, Apache Iceberg ensures that data remains immutable and query performance is optimized, thus enabling businesses to trust and rely on their BI tools for critical insights. It provides a stable schema, supports complex data transformations, and ensures atomic operations.
Together, CXO and Power BI provide you with access to insights from both EPM and BI data in one tool. You can now elevate the decision-making process by drilling down into more detailed data and enriching EPM figures with non-financial data.
Data Connectivity Enhancements: Data and content authors are the first users in the app-building infrastructure. It is important for our customers to access advanced connectors and data transformation features so they can build a robust data layer.
This approach allows you and your customers to harness the full potential of your data, transforming it into interactive, AI-driven conversations that can significantly enhance user engagement and insight discovery. Unlike competitors who lock you into their pre-built AI solutions, Logi AI empowers you with the freedom to choose.
Data Lineage and Documentation: Jet Analytics simplifies the process of documenting data assets and tracking data lineage in Fabric. It offers a transparent and accurate view of how data flows through the system, ensuring robust compliance.
Strategic Objective: Create a complete, user-friendly view of the data by preparing it for analysis. Requirement: Multi-Source Data Blending: Data from multiple sources is compiled, and the output is a single view, metric, or visualization. Data Transformation and Enrichment: Data can be enriched for analysis.
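As a minimal illustration of blending and enrichment (the sources and figures are invented):

```python
import pandas as pd

# Two illustrative sources: actuals from the ERP, targets from a planning sheet.
bookings = pd.DataFrame({"region": ["EMEA", "APAC"], "actual": [1.2, 0.9]})
targets = pd.DataFrame({"region": ["EMEA", "APAC"], "target": [1.0, 1.1]})

# Blend into a single view, then enrich with a derived attainment metric.
view = bookings.merge(targets, on="region")
view["attainment"] = view["actual"] / view["target"]
print(view)
```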
In our examples, we use the Kinesis Data Generator, a sample application to generate and publish data streams to Firehose. You can also set up Firehose to use other data sources for your real-time streams. We set up Firehose to deliver the stream into Iceberg tables in the Data Catalog. Choose Create Firehose stream.
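If you are generating test events yourself instead of using the Kinesis Data Generator, publishing a record to a Firehose stream with boto3 looks roughly like this (the stream name is a placeholder):

```python
import json
import boto3

firehose = boto3.client("firehose")
STREAM_NAME = "iceberg-demo-stream"  # hypothetical stream name

# Publish one JSON record; Firehose buffers and delivers it to the
# destination configured on the stream (Iceberg tables in this setup).
record = {"sensor_id": "s-42", "temperature": 21.7}
firehose.put_record(
    DeliveryStreamName=STREAM_NAME,
    Record={"Data": (json.dumps(record) + "\n").encode("utf-8")},
)
```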
Key services in the solution include Amazon API Gateway, Amazon Data Firehose, and Amazon Location Service. The challenge: In the event of a disaster, e.g., a flood, there is usually a lack of terrestrial data connectivity that prevents monitoring stations from taking actionable measures in real time.