Data Transformation, Management and Visualization

Expanding data analysis and visualization options: Amazon DataZone now integrates with Tableau, Power BI, and more

AWS Big Data

OCTOBER 30, 2024

Ali Tore, Senior Vice President of Advanced Analytics at Salesforce, highlighting the value of this integration, says “We’re excited to partner with Amazon to bring Tableau’s powerful data exploration and AI-driven analytics capabilities to customers managing data across organizational boundaries with Amazon DataZone.

Visualization

Visualization Data Lake Testing Data Governance

Biggest Trends in Data Visualization Taking Shape in 2022

Smart Data Collective

OCTOBER 13, 2021

There are countless examples of big data transforming many different industries. It can be used for something as visual as reducing traffic jams, to personalizing products and services, to improving the experience in multiplayer video games. We would like to talk about data visualization and its role in the big data movement.

Visualization

Visualization Cost-Benefit Big Data Prescriptive Analytics

The Ultimate Guide to Modern Data Quality Management (DQM) For An Effective Data Quality Control Driven by The Right Metrics

datapine

SEPTEMBER 29, 2022

1) What Is Data Quality Management? 4) Data Quality Best Practices. 5) How Do You Measure Data Quality? 6) Data Quality Metrics Examples. 7) Data Quality Control: Use Case. 8) The Consequences Of Bad Data Quality. 9) 3 Sources Of Low-Quality Data. 10) Data Quality Solutions: Key Attributes.

Data Quality

Data Quality Metrics Data-driven Management

Webinars

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

MORE WEBINARS

Streamline AI-driven analytics with governance: Integrating Tableau with Amazon DataZone

AWS Big Data

OCTOBER 30, 2024

Amazon DataZone is a data management service that makes it faster and easier for customers to catalog, discover, share, and govern data stored across AWS, on premises, and from third-party sources. When you’re connected, you can query, visualize, and share data—governed by Amazon DataZone—within Tableau.

Analytics

Analytics Visualization Data Governance Data-driven

Amazon Q data integration adds DataFrame support and in-prompt context-aware job creation

AWS Big Data

DECEMBER 20, 2024

This means you can refine your ETL jobs through natural follow-up questionsstarting with a basic data pipeline and progressively adding transformations, filters, and business logic through conversation. The DataFrame code generation now extends beyond AWS Glue DynamicFrame to support a broader range of data processing scenarios.

Data Integration

Data Integration Visualization Data Processing Big Data

Texas Rangers data transformation modernizes stadium operations

CIO Business Intelligence

OCTOBER 18, 2022

With the new stadium on the horizon, the team needed to update existing IT systems and manual business and IT processes to handle the massive volumes of new data that would soon be at their fingertips. “In Noel had already established a relationship with consulting firm Resultant through a smaller data visualization project.

Data Transformation

Data Transformation Consulting Data Lake Reporting

Data transformation takes flight at Atlanta’s Hartsfield-Jackson airport

CIO Business Intelligence

AUGUST 9, 2024

At Atlanta’s Hartsfield-Jackson International Airport, an IT pilot has led to a wholesale data journey destined to transform operations at the world’s busiest airport, fueled by machine learning and generative AI. That enables the analytics team using Power BI to create a single visualization for the GM.”

Data Transformation

Data Transformation Machine Learning Data Lake Dashboards

SAP Datasphere Powers Business at the Speed of Data

Rocket-Powered Data Science

MARCH 20, 2023

Content includes reports, documents, articles, presentations, visualizations, video, and audio representations of the insights and knowledge that have been extracted from data. We could further refine our opening statement to say that our business users are too often in a state of being data-rich, but insights-poor, and content-hungry.

Data Warehouse

Data Warehouse Metadata Digital Transformation Machine Learning

What is data analytics? Analyzing and managing data for decisions

CIO Business Intelligence

JUNE 7, 2022

What is data analytics? Data analytics is a discipline focused on extracting insights from data. It comprises the processes, tools and techniques of data analysis and management, including the collection, organization, and storage of data. What are the four types of data analytics?

Data Analytics

Data Analytics Diagnostic Analytics Management Analytics

7 Data Engineering Tools for Beginners

KDnuggets

OCTOBER 3, 2024

Learn the data engineering tools for data orchestration, database management, batch processing, ETL (Extract, Transform, Load), data transformation, data visualization, and data streaming.

Visualization

Visualization Data Transformation Management

Automating Data Pipelines in CDP with CDE Managed Airflow Service

Cloudera

AUGUST 17, 2021

When we announced the GA of Cloudera Data Engineering back in September of last year, a key vision we had was to simplify the automation of data transformation pipelines at scale. Figure 1: Pipeline composed of Spark and Hive jobs deployed to run within CDE’s managed Apache Airflow service. CDP Airflow operators.

Management

Management Cost-Benefit Data Transformation Optimization

Author data integration jobs with an interactive data preparation experience with AWS Glue visual ETL

AWS Big Data

JULY 10, 2024

We are excited to announce a new capability of the AWS Glue Studio visual editor that offers a new visual user experience. Now you can author data preparation transformations and edit them with the AWS Glue Studio visual editor. Within the new experience, you can choose from hundreds of prebuilt transformations.

Interactive

Interactive Visualization Data Integration Statistics

How EUROGATE established a data mesh architecture using Amazon DataZone

AWS Big Data

JANUARY 15, 2025

Their terminal operations rely heavily on seamless data flows and the management of vast volumes of data. With the addition of these technologies alongside existing systems like terminal operating systems (TOS) and SAP, the number of data producers has grown substantially.

IoT

IoT Machine Learning Metadata Data-driven

10 Examples of How Big Data in Logistics Can Transform The Supply Chain

datapine

MAY 2, 2023

Benefits Of Big Data In Logistics Before we look at our selection of practical examples and applications, let’s look at the benefits of big data in logistics – starting with the (not so) small matter of costs. A testament to the rising role of optimization in logistics. Why are logistics companies so interested in optimization?

Big Data

Big Data Internet of Things Cost-Benefit Optimization

Introducing a new unified data connection experience with Amazon SageMaker Lakehouse unified data connectivity

AWS Big Data

DECEMBER 16, 2024

For each service, you need to learn the supported authorization and authentication methods, data access APIs, and framework to onboard and test data sources. This fragmented, repetitive, and error-prone experience for data connectivity is a significant obstacle to data integration, analysis, and machine learning (ML) initiatives.

Visualization

Visualization Data Processing Testing Publishing

Modernize a legacy real-time analytics application with Amazon Managed Service for Apache Flink

AWS Big Data

OCTOBER 11, 2023

Organizations with legacy, on-premises, near-real-time analytics solutions typically rely on self-managed relational databases as their data store for analytics workloads. Near-real-time streaming analytics captures the value of operational data and metrics to provide new insights to create business opportunities.

Management

Management Metadata Analytics Dashboards

Copy and mask PII between Amazon RDS databases using visual ETL jobs in AWS Glue Studio

AWS Big Data

AUGUST 26, 2024

You can use AWS Glue Studio to set up data replication and mask PII with no coding required. AWS Glue Studio visual editor provides a low-code graphic environment to build, run, and monitor extract, transform, and load (ETL) scripts. An AWS Identity and Access Management (IAM) role is used for AWS Glue. 16 10.2.10.0/24

Visualization

Visualization Metadata Data Transformation Testing

Use AWS Glue DataBrew recipes in your AWS Glue Studio visual ETL jobs

AWS Big Data

JULY 27, 2023

AWS Glue Studio is a graphical interface that makes it easy to create, run, and monitor extract, transform, and load (ETL) jobs in AWS Glue. DataBrew is a visual data preparation tool that enables you to clean and normalize data without writing any code. Choose Visual with a blank canvas and create the visual job.

Visualization

Visualization Cost-Benefit Data Quality Publishing

Available Now! Automated Testing for Data Transformations

Wayne Yaddow

FEBRUARY 18, 2025

Selecting the strategies and tools for validating data transformations and data conversions in your data pipelines. Introduction Data transformations and data conversions are crucial to ensure that raw data is organized, processed, and ready for useful analysis.

Testing

Testing Data Transformation Data-driven Data Quality

Modernize your ETL platform with AWS Glue Studio: A case study from BMS

AWS Big Data

DECEMBER 13, 2023

In collaboration with AWS, BMS identified a business need to migrate and modernize their custom extract, transform, and load (ETL) platform to a native AWS solution to reduce complexities, resources, and investment to upgrade when new Spark, Python, or AWS Glue versions are released.

Metadata

Metadata Data Lake Visualization Data Quality

What is business analytics? Using data to improve business outcomes

CIO Business Intelligence

JULY 5, 2022

While quantitative analysis, operational analysis, and data visualizations are key components of business analytics, the goal is to use the insights gained to shape business decisions. What is the difference between business analytics and data analytics? Business analytics is a subset of data analytics.

Business Analytics

Business Analytics Prescriptive Analytics Data mining Diagnostic Analytics

How ANZ Institutional Division built a federated data platform to enable their domain teams to build data products to support business outcomes

AWS Big Data

DECEMBER 4, 2024

Recognizing this paradigm shift, ANZ Institutional Division has embarked on a transformative journey to redefine its approach to data management, utilization, and extracting significant business value from data insights. This enables global discoverability and collaboration without centralizing ownership or operations.

Metadata

Metadata Data Governance Data Quality Data-driven

Tableau further democratizes analytics with AI-fueled features

CIO Business Intelligence

APRIL 30, 2024

Together the technologies aim to help business users and “novice” data analysts explore their data and gain insights without having to resort to data experts. This is really empowering everyone to be a data expert,” Maxon said. “It Shared Dimensions and Composable Data Sources.

Analytics

Analytics Metrics Visualization Dashboards

Unveiling the Top 10 Data Visualization Companies of 2024

FineReport

JUNE 7, 2024

In 2024, data visualization companies play a pivotal role in transforming complex data into captivating narratives. This blog provides an insightful exploration of the leading entities shaping the data visualization landscape.

Visualization

Visualization Predictive Analytics Dashboards Predictive Modeling

The Ten Standard Tools To Develop Data Pipelines In Microsoft Azure

DataKitchen

JULY 27, 2023

Azure Databricks Delta Live Table s: These provide a more straightforward way to build and manage Data Pipelines for the latest, high-quality data in Delta Lake. It provides data prep, management, and enterprise data warehousing tools. It has a data pipeline tool , as well. It does the job.

Machine Learning

Machine Learning Cost-Benefit Data Transformation Testing

Happy Birthday, CDP Public Cloud

Cloudera

OCTOBER 13, 2020

Machine Learning – has grown from a collaborative workbench to an end-to-end Production ML platform that enables data scientists to deploy a model or an application to production in minutes with production-level monitoring, governance and performance tracking. Enrich – Data Engineering (Apache Spark and Apache Hive).

Data Warehouse

Data Warehouse Machine Learning Visualization Data Lake

Introducing blueprint discovery and other UI enhancements for Amazon OpenSearch Ingestion

AWS Big Data

MAY 22, 2024

Amazon OpenSearch Ingestion is a fully managed serverless pipeline that allows you to ingest, filter, transform, enrich, and route data to an Amazon OpenSearch Service domain or Amazon OpenSearch Serverless collection. When you create a new pipeline on the OpenSearch Service console, you’re presented with a new catalog page.

Data Architecture

Data Architecture Visualization Data Transformation Management

Accelerate analytics on Amazon OpenSearch Service with AWS Glue through its native connector

AWS Big Data

DECEMBER 21, 2023

AWS Glue is a serverless data integration service that makes it straightforward to discover, prepare, and combine data for analytics, machine learning (ML), and application development. AWS Glue provides both visual and code-based interfaces to make data integration effortless. Choose Create job and Visual ETL.

Analytics

Analytics IT Data Lake Visualization

Use AWS Glue to streamline SFTP data processing

AWS Big Data

AUGUST 13, 2024

AWS Glue is a serverless data integration service that helps analytics users to discover, prepare, move, and integrate data from multiple sources for analytics, machine learning (ML), and application development. The SFTP connector is used to manage the connection to the SFTP server. Create the gateway endpoint.

Data Processing

Data Processing Visualization Data Lake Data Processing

How healthcare organizations can analyze and create insights using price transparency data

AWS Big Data

OCTOBER 11, 2023

The availability of machine-readable files opens up new possibilities for data analytics, allowing organizations to analyze large amounts of pricing data. Using machine learning (ML) and data visualization tools, these datasets can be transformed into actionable insights that can inform decision-making.

Visualization

Visualization Dashboards Data-driven Gap analysis

Unlock scalable analytics with AWS Glue and Google BigQuery

AWS Big Data

OCTOBER 27, 2023

AWS Glue eliminates complexities and costs, allowing organizations to perform data integration tasks in minutes, boosting efficiency. This blog post explores the newly announced managed connector for Google BigQuery and demonstrates how to build a modern ETL pipeline with AWS Glue Studio without writing code.

Analytics

Analytics Visualization Data Integration Cost-Benefit

Migrate from Apache Solr to OpenSearch

AWS Big Data

JULY 18, 2024

OpenSearch is an open source, distributed search engine suitable for a wide array of use-cases such as ecommerce search, enterprise search (content management search, document search, knowledge management search, and so on), site search, application search, and semantic search. You use the schema API to manage schema.

Dashboards

Dashboards Testing Data-driven Visualization

How to Implement Data Lineage Mapping Techniques

Octopai

MARCH 31, 2021

In other words, kind of like Hansel and Gretel in the forest, your data leaves a trail of breadcrumbs – the metadata – to record where it came from and who it really is. So the first step in any data lineage mapping project is to ensure that all of your data transformation processes do in fact accurately record metadata.

Metadata

Metadata Data Transformation Business Intelligence Reporting

Automating the Automators: Shift Change in the Robot Factory

O'Reilly on Data

JANUARY 17, 2023

Related to the previous point, a company could go from “raw data” to “it’s serving predictions on live data” in a single work day. You need to coordinate with stakeholders and product managers to suss out what kinds of models you need and how to embed them into the company’s processes.

Machine Learning

Machine Learning Predictive Modeling Software Modeling

BMW Cloud Efficiency Analytics powered by Amazon QuickSight and Amazon Athena

AWS Big Data

NOVEMBER 15, 2023

Overview of the BMW Cloud Data Hub At the BMW Group, Cloud Data Hub (CDH) is the central platform for managing company-wide data and data solutions. They can use their own toolsets or rely on provided blueprints to ingest the data from source systems.

Analytics

Analytics Dashboards Metadata Data Warehouse

DataOps Observability: Taming the Chaos (Part 2)

DataKitchen

OCTOBER 25, 2022

It’s because it’s a hard thing to accomplish when there are so many teams, locales, data sources, pipelines, dependencies, data transformations, models, visualizations, tests, internal customers, and external customers. They all have different roles and different relationships with the data.

Testing

Testing Data-driven Visualization Dashboards

12 data science certifications that will pay off

CIO Business Intelligence

JANUARY 19, 2024

Data science certifications give you an opportunity to not only develop skills that are hard to find in your desired industry, but also validate your data science know-how so recruiters and hiring managers know what they get if they hire you.

Data Science

Data Science Machine Learning Predictive Modeling Forecasting

How to Build a Successful Metadata Management Framework

Alation

JUNE 28, 2022

The real challenge lies in getting people to access, manage, and search for it appropriately. This is where metadata, or the data about data, comes into play. Having a data catalog is the cornerstone of your data governance strategy, but what supports your data catalog? Your metadata gives users context.

Metadata

Metadata Management Data Governance Machine Learning

7 key Microsoft Azure analytics services (plus one extra)

CIO Business Intelligence

JUNE 29, 2022

Analytics is the means for discovering those insights, and doing it well requires the right tools for ingesting and preparing data, enriching and tagging it, building and sharing reports, and managing and protecting your data and insights. Azure Data Factory. Everything is visual. Azure Synapse Analytics.

Data Lake

Data Lake Analytics Data Warehouse Machine Learning

Stream VPC Flow Logs to Datadog via Amazon Kinesis Data Firehose

AWS Big Data

JUNE 20, 2023

You can easily deliver data to supported destinations using the Amazon Kinesis Data Firehose integration with VPC flow logs. Kinesis Data Firehose is a fully managed service for delivering near-real-time streaming data to various destinations for storage and performing near-real-time analytics.

Dashboards

Dashboards Visualization Metrics Data Transformation

How HR&A uses Amazon Redshift spatial analytics on Amazon Redshift Serverless to measure digital equity in states across the US

AWS Big Data

DECEMBER 5, 2023

This dynamic tool, powered by AWS and CARTO, provided robust visualizations of which regions and populations were interacting with our survey, enabling us to zoom in quickly and address gaps in coverage. Figure 1: Workflow illustrating data ingesting, transformation, and visualization using Redshift and CARTO.

Measurement

Measurement Dashboards Data Warehouse Analytics

What is Data Lineage? Top 5 Benefits of Data Lineage

erwin

APRIL 29, 2020

These tools range from enterprise service bus (ESB) products, data integration tools; extract, transform and load (ETL) tools, procedural code, application program interfaces (API)s, file transfer protocol (FTP) processes, and even business intelligence (BI) reports that further aggregate and transform data.

Data Governance

Data Governance Key Performance Indicator Metadata Data Quality

7 Things All Successful Data Product Managers Have In Common

Alation

FEBRUARY 2, 2023

Data product managers are in high demand these days. In 2020, Glassdoor rated product manager as the 4th best job in the US. This makes it more important for aspiring data product managers to stay ahead of the competition. So what sets data product managers apart from the pack? Sounds exciting?

Management

Management Data-driven Visualization Strategy

Unlock scalable analytics with a secure connectivity pattern in AWS Glue to read from or write to Snowflake

AWS Big Data

AUGUST 19, 2024

This allows business analysts and decision-makers to gain valuable insights, visualize key metrics, and explore the data in depth, enabling informed decision-making and strategic planning for pricing and promotional strategies. The AWS Glue job uses the secure connection established by the VPC endpoints to access Snowflake data.

Analytics

Analytics Data-driven Data Integration Data Lake

Expanding data analysis and visualization options: Amazon DataZone now integrates with Tableau, Power BI, and more

Biggest Trends in Data Visualization Taking Shape in 2022

Webinars

Trending Sources

The Ultimate Guide to Modern Data Quality Management (DQM) For An Effective Data Quality Control Driven by The Right Metrics

Webinars

Streamline AI-driven analytics with governance: Integrating Tableau with Amazon DataZone

Amazon Q data integration adds DataFrame support and in-prompt context-aware job creation

Texas Rangers data transformation modernizes stadium operations

Data transformation takes flight at Atlanta’s Hartsfield-Jackson airport

SAP Datasphere Powers Business at the Speed of Data

What is data analytics? Analyzing and managing data for decisions

7 Data Engineering Tools for Beginners

Automating Data Pipelines in CDP with CDE Managed Airflow Service

Author data integration jobs with an interactive data preparation experience with AWS Glue visual ETL

How EUROGATE established a data mesh architecture using Amazon DataZone

10 Examples of How Big Data in Logistics Can Transform The Supply Chain

Introducing a new unified data connection experience with Amazon SageMaker Lakehouse unified data connectivity

Modernize a legacy real-time analytics application with Amazon Managed Service for Apache Flink

Copy and mask PII between Amazon RDS databases using visual ETL jobs in AWS Glue Studio

Use AWS Glue DataBrew recipes in your AWS Glue Studio visual ETL jobs

Available Now! Automated Testing for Data Transformations

Modernize your ETL platform with AWS Glue Studio: A case study from BMS

What is business analytics? Using data to improve business outcomes

How ANZ Institutional Division built a federated data platform to enable their domain teams to build data products to support business outcomes

Tableau further democratizes analytics with AI-fueled features

Unveiling the Top 10 Data Visualization Companies of 2024

The Ten Standard Tools To Develop Data Pipelines In Microsoft Azure

Happy Birthday, CDP Public Cloud

Introducing blueprint discovery and other UI enhancements for Amazon OpenSearch Ingestion

Accelerate analytics on Amazon OpenSearch Service with AWS Glue through its native connector

Use AWS Glue to streamline SFTP data processing

How healthcare organizations can analyze and create insights using price transparency data

Unlock scalable analytics with AWS Glue and Google BigQuery

Migrate from Apache Solr to OpenSearch

How to Implement Data Lineage Mapping Techniques

Automating the Automators: Shift Change in the Robot Factory

BMW Cloud Efficiency Analytics powered by Amazon QuickSight and Amazon Athena

DataOps Observability: Taming the Chaos (Part 2)

12 data science certifications that will pay off

How to Build a Successful Metadata Management Framework

7 key Microsoft Azure analytics services (plus one extra)

Stream VPC Flow Logs to Datadog via Amazon Kinesis Data Firehose

How HR&A uses Amazon Redshift spatial analytics on Amazon Redshift Serverless to measure digital equity in states across the US

What is Data Lineage? Top 5 Benefits of Data Lineage

7 Things All Successful Data Product Managers Have In Common

Unlock scalable analytics with a secure connectivity pattern in AWS Glue to read from or write to Snowflake

Stay Connected