To achieve this, you need access to sales orders, shipment details, and customer data owned by the retail team. The retail team, acting as the data producer, publishes the necessary data assets to Amazon DataZone, allowing you, as a consumer, to discover and subscribe to these assets.
Build data validation rules directly into ingestion layers so that bad data is stopped at the gate rather than detected after the damage is done. Use lineage tooling to trace data from source to report. Understanding how data transforms and where it breaks is crucial for auditability and root-cause resolution.
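A minimal sketch of what such a gate-style check at the ingestion layer might look like; the required fields and rules below are hypothetical examples, not rules from the article.

```python
from datetime import datetime

REQUIRED_FIELDS = {"order_id", "customer_id", "order_date", "amount"}

def validate(record: dict) -> list[str]:
    """Return a list of rule violations; an empty list means the record may pass the gate."""
    errors = []
    missing = REQUIRED_FIELDS - record.keys()
    if missing:
        errors.append(f"missing fields: {sorted(missing)}")
    if "amount" in record and record["amount"] < 0:
        errors.append("amount must be non-negative")
    if "order_date" in record:
        try:
            datetime.fromisoformat(record["order_date"])
        except ValueError:
            errors.append("order_date is not ISO-8601")
    return errors

def ingest(records):
    """Load only valid records; invalid ones would be routed to a quarantine area for root-cause analysis."""
    return [r for r in records if not validate(r)]
```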
Data processes that depended on the previously defective data will likely need to be re-initiated, especially if their output was compromised by that data. These processes could include reports, campaigns, or financial documentation. Accuracy should be measured through source documentation (i.e.,
dbt is an open source, SQL-first templating engine that allows you to write repeatable and extensible data transforms in Python and SQL. dbt is predominantly used by data warehouse customers (such as Amazon Redshift users) who want to keep their data transformation logic separate from storage and engine.
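As a rough illustration of the Python side of dbt, here is a hedged sketch of a dbt Python model. Python models run on adapters such as Snowflake or Databricks rather than every warehouse, and the model and column names below are invented for illustration.

```python
# models/orders_daily.py -- a dbt Python model (file path is illustrative)
def model(dbt, session):
    dbt.config(materialized="table")

    # dbt.ref() returns the upstream model as a dataframe-like object
    # (a Snowpark DataFrame on Snowflake; Python models are not available
    # on every adapter, including Amazon Redshift).
    orders = dbt.ref("stg_orders")

    # Snowpark-style aggregation: one row per order date.
    # The returned dataframe is materialized as the model's table.
    return orders.group_by("order_date").count()
```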
In recent years, driven by the commoditization of data storage and processing solutions, the industry has seen a growing number of systematic investment management firms switch to alternative data sources to drive their investment decisions. The bulk of our data scientists are heavy users of Jupyter Notebook.
Increased data variety, balancing structured, semi-structured and unstructured data, as well as data originating from a widening array of external sources. Reducing the IT bottleneck that creates barriers to data accessibility. Hybrid on-premises/cloud environments that complicate data integration and preparation.
Data is decompressed and stored in a different S3 bucket (transformed data can be stored in the same S3 bucket where the data was ingested, but for simplicity, we’re using two separate S3 buckets). The transformed data is then made accessible to Snowflake for data analysis. Set the protocol to Email.
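A minimal sketch of the decompress-and-copy step, assuming gzip-compressed objects and illustrative bucket names (raw-ingest-bucket, transformed-data-bucket):

```python
import gzip
import boto3

s3 = boto3.client("s3")

def decompress_object(key: str) -> None:
    """Read a gzipped object from the ingest bucket and write it, decompressed,
    to the bucket that Snowflake reads from."""
    compressed = s3.get_object(Bucket="raw-ingest-bucket", Key=key)["Body"].read()
    s3.put_object(
        Bucket="transformed-data-bucket",
        Key=key.removesuffix(".gz"),
        Body=gzip.decompress(compressed),
    )
```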
Once a draft has been created or opened, developers use the visual Designer to build their data flow logic and validate it using interactive test sessions. Managing drafts outside the Catalog keeps a clean distinction between phases of the development cycle, leaving only those flows that are ready for deployment published in the Catalog.
Developers need to onboard new data sources, chain multiple data transformation steps together, and explore data as it travels through the flow. Developers create draft flows, build them out, and test them with the Designer before they are published to the central DataFlow Catalog.
However, you might face significant challenges when planning for a large-scale data warehouse migration. As part of the success criteria for operational service levels, you need to document the expected service levels for the new Amazon Redshift data warehouse environment. Platform architects define a well-architected platform.
Developers can use the support in Amazon Location Service for publishing device position updates to Amazon EventBridge to build a near-real-time data pipeline that stores locations of tracked assets in Amazon Simple Storage Service (Amazon S3). This solution uses distance-based filtering to reduce costs and jitter.
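A hedged sketch of the two building blocks described here: switching a tracker to distance-based filtering with boto3, and a small Lambda handler that writes EventBridge position events to Amazon S3. The tracker name, bucket name, and event field names are assumptions for illustration.

```python
import json
import boto3

location = boto3.client("location")
s3 = boto3.client("s3")

# One-time setup (outside the Lambda): distance-based filtering drops updates
# from devices that have barely moved, reducing EventBridge traffic and jitter.
def enable_distance_filtering(tracker_name: str = "asset-tracker") -> None:
    location.update_tracker(
        TrackerName=tracker_name,
        PositionFiltering="DistanceBased",
    )

# Lambda handler for an EventBridge rule matching Amazon Location device
# position events; the detail field names are assumptions about the event shape.
def handler(event, context):
    detail = event.get("detail", {})
    device_id = detail.get("DeviceId", "unknown-device")
    key = f"positions/{device_id}/{event.get('time', 'unknown-time')}.json"
    s3.put_object(
        Bucket="tracked-positions-bucket",   # illustrative bucket name
        Key=key,
        Body=json.dumps(detail).encode("utf-8"),
    )
```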
Few actors in the modern data stack have inspired as much enthusiasm and fervent support as dbt. This data transformation tool enables data analysts and engineers to transform, test, and document data in the cloud data warehouse. But what does this mean from a practitioner's perspective?
Within a large enterprise, there is a huge amount of data accumulated over the years – many decisions have been made and different methods have been tested. Some of this knowledge is locked away and the company cannot access it. What exactly do you do for them? We translate their documents, presentations, tables, etc.
These encoder-only architecture models are fast and effective for many enterprise NLP tasks, such as classifying customer feedback and extracting information from large documents. While they require task-specific labeled data for fine-tuning, they also offer clients the best cost-performance trade-off for non-generative use cases.
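As a hedged illustration of that fine-tuning workflow, the sketch below uses the Hugging Face transformers Trainer on an assumed customer_feedback.csv with "text" and "label" columns; the model choice and label count are placeholders, not details from the article.

```python
from datasets import load_dataset
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

model_name = "distilbert-base-uncased"   # a small encoder-only model, placeholder choice
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name, num_labels=3)

# Hypothetical labeled file with "text" and "label" columns.
dataset = load_dataset("csv", data_files="customer_feedback.csv")

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, padding="max_length", max_length=128)

tokenized = dataset.map(tokenize, batched=True)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="feedback-classifier", num_train_epochs=3),
    train_dataset=tokenized["train"],
)
trainer.train()
```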
A well-governed data landscape enables data users in the public sector to better understand the driving forces and needs to support public policy – and measure impact once a change is made. Efficient access to data: citizens, companies, and government employees need access to data and documents.
It has been well documented, since the 2019 State of DevOps report and its DORA metrics were published, that with DevOps companies can deploy software 208 times more often and 106 times faster, recover from incidents 2,604 times faster, and release 7 times fewer defects. Fixed-size data files avoid further latency due to unbounded file sizes.
You simply configure your data sources to send information to OpenSearch Ingestion, which then automatically delivers the data to your specified destination. Additionally, you can configure OpenSearch Ingestion to apply data transformations before delivery. The OpenSearch Ingestion pipeline is named serverless-ingestion.
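A hedged sketch of creating such a pipeline with the boto3 osis client; the pipeline body is a simplified placeholder (a real configuration also needs an IAM role for the sink and, for serverless collections, network settings), and the endpoint and index names are invented.

```python
import boto3

osis = boto3.client("osis")   # OpenSearch Ingestion (OSIS) API

# Simplified Data Prepper-style configuration; illustrative only.
pipeline_body = """
version: "2"
serverless-ingestion:
  source:
    http:
      path: "/logs/ingest"
  sink:
    - opensearch:
        hosts: ["https://example-collection.us-east-1.aoss.amazonaws.com"]
        index: "application-logs"
"""

osis.create_pipeline(
    PipelineName="serverless-ingestion",
    MinUnits=1,
    MaxUnits=4,
    PipelineConfigurationBody=pipeline_body,
)
```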
This field guide to data mapping will explore how data mapping connects volumes of data for enhanced decision-making. Why data mapping is important: data mapping is a critical element of any data management initiative, such as data integration, data migration, data transformation, data warehousing, or automation.
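As a minimal illustration of field-level data mapping, the sketch below renames hypothetical source fields to a target schema; the field names are invented for the example.

```python
# Mapping from source field names to target schema names (hypothetical).
FIELD_MAP = {
    "cust_id": "customer_id",
    "fname": "first_name",
    "lname": "last_name",
    "zip": "postal_code",
}

def map_record(source: dict) -> dict:
    """Rename source fields to the target schema, dropping unmapped fields."""
    return {target: source[src] for src, target in FIELD_MAP.items() if src in source}

# Example:
# map_record({"cust_id": 42, "fname": "Ada", "zip": "10115"})
# -> {"customer_id": 42, "first_name": "Ada", "postal_code": "10115"}
```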
Modern Data Sources: Painlessly connect with modern data sources such as streaming, search, big data, NoSQL, cloud, and document-based sources. Quickly link all your data from Amazon Redshift, MongoDB, Hadoop, Snowflake, Apache Solr, Elasticsearch, Impala, and more.
This straightforward and user-friendly access to source data makes it easier for your business users to examine and extract insights from your core data systems. Data Lineage and Documentation: Jet Analytics simplifies the process of documenting data assets and tracking data lineage in Fabric.
Process Runner GLSU and Wands for SAP provide flexible, intuitive interfaces for SAP data entry and transaction posting directly from Microsoft Excel. Automate financial document posting processes, resulting in a shorter month-end close. Increase data accuracy and improve audit processing while running a stress-free finance operation.
The alternative to BICC is BI Publisher (BIP). While BIP reports can be generated in different output formats, including Excel files, BIP is intended not as a data extraction tool but as a reporting tool. Quickly combine data from a variety of sources into a single data warehouse and a set of dimensional cubes or tabular models.
While enabling organization-wide efficiency, the team also applied these principles to the data architecture, making sure that CLEA itself operates frugally. After evaluating various tools, we built a serverless data transformation pipeline using Amazon Athena and dbt. The Source stage maintains raw data in its original form.
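A hedged sketch of the kind of Athena transformation such a pipeline runs (dbt's Athena adapter compiles models down to statements like this); the database, table, and result-bucket names are placeholders, not the CLEA project's actual objects.

```python
import boto3

athena = boto3.client("athena")

# Create-table-as-select (CTAS): materialize an aggregated table from raw data.
response = athena.start_query_execution(
    QueryString="""
        CREATE TABLE analytics.daily_cost_summary
        WITH (format = 'PARQUET') AS
        SELECT usage_date, account_id, SUM(unblended_cost) AS total_cost
        FROM source.raw_cur
        GROUP BY usage_date, account_id
    """,
    QueryExecutionContext={"Database": "analytics"},
    ResultConfiguration={"OutputLocation": "s3://example-athena-results/"},
)
print(response["QueryExecutionId"])
```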
This approach allows you and your customers to harness the full potential of your data, transforming it into interactive, AI-driven conversations that can significantly enhance user engagement and insight discovery. AI Doc Assist: Finding the right document doesn't have to be complicated.