Data Transformation, Data-driven and Visualization

The Ultimate Guide to Modern Data Quality Management (DQM) For An Effective Data Quality Control Driven by The Right Metrics

datapine

SEPTEMBER 29, 2022

1) What Is Data Quality Management? 4) Data Quality Best Practices. 5) How Do You Measure Data Quality? 6) Data Quality Metrics Examples. 7) Data Quality Control: Use Case. 8) The Consequences Of Bad Data Quality. 9) 3 Sources Of Low-Quality Data. 10) Data Quality Solutions: Key Attributes.

Data Quality

Data Quality Metrics Data-driven Management

Data transformation takes flight at Atlanta’s Hartsfield-Jackson airport

CIO Business Intelligence

AUGUST 9, 2024

At Atlanta’s Hartsfield-Jackson International Airport, an IT pilot has led to a wholesale data journey destined to transform operations at the world’s busiest airport, fueled by machine learning and generative AI. That enables the analytics team using Power BI to create a single visualization for the GM.

Data Transformation

Data Transformation Machine Learning Data Lake Dashboards

10 Examples of How Big Data in Logistics Can Transform The Supply Chain

datapine

MAY 2, 2023

Big data is revolutionizing many fields of business, and logistics analytics is no exception. The complex and ever-evolving nature of logistics makes it an essential use case for big data applications.

Big Data

Big Data Internet of Things Cost-Benefit Optimization

Modernize your ETL platform with AWS Glue Studio: A case study from BMS

AWS Big Data

DECEMBER 13, 2023

In addition to using native managed AWS services that BMS didn’t need to worry about upgrading, BMS was looking to offer an ETL service to non-technical business users that could visually compose data transformation workflows and seamlessly run them on the AWS Glue Apache Spark-based serverless data integration engine.

Metadata

Metadata Data Lake Visualization Data Quality

Building Data Visualization Expertise for Your Business

InsightOut

MAY 4, 2021

In this data-driven world, building a team of data analysts can be a challenge. Implementing data visualization and analytics dashboards can be the beginning of the data transformation journey.

Visualization

Visualization Dashboards Data-driven Data Transformation

Unveiling the Top 10 Data Visualization Companies of 2024

FineReport

JUNE 7, 2024

In 2024, data visualization companies play a pivotal role in transforming complex data into captivating narratives. This blog provides an insightful exploration of the leading entities shaping the data visualization landscape.

Visualization

Visualization Predictive Analytics Dashboards Predictive Modeling

How healthcare organizations can analyze and create insights using price transparency data

AWS Big Data

OCTOBER 11, 2023

Under the Transparency in Coverage (TCR) rule, hospitals and payors must publish their pricing data in a machine-readable format. The data in the machine-readable files can provide valuable insights to understand the true cost of healthcare services and compare prices and quality across hospitals.

Visualization

Visualization Dashboards Data-driven Gap analysis

How EUROGATE established a data mesh architecture using Amazon DataZone

AWS Big Data

JANUARY 15, 2025

For container terminal operators, data-driven decision-making and efficient data sharing are vital to optimizing operations and boosting supply chain efficiency. Together, these capabilities enable terminal operators to enhance efficiency and competitiveness in an industry that is increasingly data driven.

IoT

IoT Machine Learning Metadata Data-driven

What is Data Lineage? Top 5 Benefits of Data Lineage

erwin

APRIL 29, 2020

Data lineage is the journey data takes from its creation through its transformations over time. Tracing the source of data is an arduous task. With so many diverse data sources and integrated systems, it is difficult to understand the complicated data web they form, much less get a simple visual flow.

Key Performance Indicator

Key Performance Indicator Metadata Data Governance Data Quality

Unlock scalable analytics with AWS Glue and Google BigQuery

AWS Big Data

OCTOBER 27, 2023

Data integration is the foundation of robust data analytics. It encompasses the discovery, preparation, and composition of data from diverse sources. In the modern data landscape, accessing, integrating, and transforming data from diverse sources is a vital process for data-driven decision-making.

Analytics

Analytics Visualization Data Integration Cost-Benefit

Migrate from Apache Solr to OpenSearch

AWS Big Data

JULY 18, 2024

OpenSearch also includes capabilities to ingest and analyze data. API-driven interactions: All interactions in OpenSearch are API-driven, eliminating the need for manual file changes or ZooKeeper configurations. As part of the migration, reconsider your data model. You then set a replica count using number_of_replicas.

Dashboards

Dashboards Testing Data-driven Visualization

Modernize a legacy real-time analytics application with Amazon Managed Service for Apache Flink

AWS Big Data

OCTOBER 11, 2023

Organizations with legacy, on-premises, near-real-time analytics solutions typically rely on self-managed relational databases as their data store for analytics workloads. Near-real-time streaming analytics captures the value of operational data and metrics to provide new insights to create business opportunities.

Management

Management Metadata Analytics Dashboards

The Ten Standard Tools To Develop Data Pipelines In Microsoft Azure

DataKitchen

JULY 27, 2023

While working in Azure with our customers, we have noticed several standard Azure tools people use to develop data pipelines and ETL or ELT processes. We counted ten ‘standard’ ways to transform and set up batch data pipelines in Microsoft Azure.

Machine Learning

Machine Learning Cost-Benefit Data Transformation Testing

DataOps Observability: Taming the Chaos (Part 2)

DataKitchen

OCTOBER 25, 2022

Part 2: Introducing Data Journeys. Observability is a methodology for providing visibility of every journey that data takes from source to customer value across every tool, environment, data store, team, and customer so that problems are detected and addressed immediately.

Testing

Testing Data-driven Visualization Dashboards

Use AWS Glue to streamline SFTP data processing

AWS Big Data

AUGUST 13, 2024

In today’s data-driven world, seamless integration and transformation of data across diverse sources into actionable insights is paramount. With AWS Glue, you can discover and connect to hundreds of diverse data sources and manage your data in a centralized data catalog.

Data Processing

Data Processing Visualization Data Lake

Supercharging Your Digital Transformation with Embedded Analytics

Sisense

FEBRUARY 26, 2020

We all know that data is becoming more and more essential for businesses, as the volume of data keeps growing. Dresner reported that nearly 97% of respondents in their Big Data Analytics Market Study consider Big Data to be either important or critical to their businesses. Become data-driven to succeed.

Digital Transformation

Digital Transformation Analytics Big Data Data-driven

Introducing a new unified data connection experience with Amazon SageMaker Lakehouse unified data connectivity

AWS Big Data

DECEMBER 16, 2024

The need to integrate diverse data sources has grown exponentially, but there are several common challenges when integrating and analyzing data from multiple sources, services, and applications. First, you need to create and maintain independent connections to the same data source for different services.

Visualization

Visualization Data Processing Testing Publishing

Unlock scalable analytics with a secure connectivity pattern in AWS Glue to read from or write to Snowflake

AWS Big Data

AUGUST 19, 2024

In today’s data-driven world, the ability to seamlessly integrate and utilize diverse data sources is critical for gaining actionable insights and driving innovation. Use case: Consider a large ecommerce company that relies heavily on data-driven insights to optimize its operations, marketing strategies, and customer experiences.

Analytics

Analytics Data-driven Data Integration Data Lake

7 key Microsoft Azure analytics services (plus one extra)

CIO Business Intelligence

JUNE 29, 2022

If you can’t make sense of your business data, you’re effectively flying blind. Insights hidden in your data are essential for optimizing business operations, fine-tuning your customer experience, and developing new products or new lines of business, like predictive maintenance. Azure Data Factory.

Data Lake

Data Lake Analytics Data Warehouse Machine Learning

Migrate Amazon Redshift from DC2 to RA3 to accommodate increasing data volumes and analytics demands

AWS Big Data

AUGUST 9, 2024

As businesses strive to make informed decisions, the amount of data being generated and required for analysis is growing exponentially. This trend is no exception for Dafiti, an ecommerce company that recognizes the importance of using data to drive strategic decision-making processes. We started with 115 dc2.large

Data Lake

Data Lake Analytics Data Warehouse Data-driven

BMW Cloud Efficiency Analytics powered by Amazon QuickSight and Amazon Athena

AWS Big Data

NOVEMBER 15, 2023

It seamlessly consolidates data from various data sources within AWS, including AWS Cost Explorer (and forecasting with Cost Explorer), AWS Trusted Advisor, and AWS Compute Optimizer. Overview of the BMW Cloud Data Hub: At the BMW Group, Cloud Data Hub (CDH) is the central platform for managing company-wide data and data solutions.

Dashboards

Dashboards Analytics Metadata Data Warehouse

Set up alerts and orchestrate data quality rules with AWS Glue Data Quality

AWS Big Data

JUNE 6, 2023

Alerts and notifications play a crucial role in maintaining data quality because they facilitate prompt and efficient responses to any data quality issues that may arise within a dataset. It simplifies your experience of monitoring and evaluating the quality of your data.

Data Quality

Data Quality Metrics Data-driven Visualization

Improve power utility operational efficiency using smart sensor data and Amazon QuickSight

AWS Big Data

MAY 16, 2023

Different communication infrastructure types such as mesh network and cellular can be used to send load information on a pre-defined schedule or event data in real time to the backend servers residing in the utility UDN (Utility Data Network).

Dashboards

Dashboards Statistics Data Collection Business Intelligence

Amazon Redshift data ingestion options

AWS Big Data

SEPTEMBER 5, 2024

Amazon Redshift, a warehousing service, offers a variety of options for ingesting data from diverse sources into its high-performance, scalable environment. This native feature of Amazon Redshift uses massively parallel processing (MPP) to load objects directly from data sources into Redshift tables.

IoT

IoT Data Warehouse Cost-Benefit Reporting

How to Include BI in Your 2020 Budget

Sisense

DECEMBER 12, 2019

Building a data-driven business includes choosing the right software and implementing best practices around its use. Every year when budget time rolls around, many organizations find themselves asking the same question: “what are we going to do about our data?” Organizations have too much data. This is a summary article.

Business Intelligence

Business Intelligence Software Data-driven Visualization

Orchestrate Amazon EMR Serverless jobs with AWS Step functions

AWS Big Data

OCTOBER 12, 2023

You can run analytics workloads at any scale with automatic scaling that resizes resources in seconds to meet changing data volumes and processing requirements. AWS Step Functions is a serverless orchestration service that enables developers to build visual workflows for applications as a series of event-driven steps.

Big Data

Big Data Data-driven Management Visualization

End-to-end development lifecycle for data engineers to build a data integration pipeline using AWS Glue

AWS Big Data

JULY 26, 2023

Data is a key enabler for your business. Many AWS customers have integrated their data across multiple data sources using AWS Glue, a serverless data integration service, in order to make data-driven business decisions. Are there recommended approaches to provisioning components for data integration?

Data Integration

Data Integration Snapshot Testing Visualization

The Best Embedded BI Tools For 2024

FineReport

APRIL 21, 2024

In today’s data-driven landscape, businesses are constantly seeking innovative solutions to harness the power of analytics effectively. Embedded BI tools have emerged as a transformative force, seamlessly integrating analytical capabilities directly into existing software applications.

Dashboards

Dashboards Visualization Interactive Business Intelligence

Self-Service Data’s New Frontier: The Data Catalog

Alation

FEBRUARY 20, 2020

The energy at the conference was amazing – over 2,000 attendees and 100 vendors gathered to find our inner data heroes. And the Great BI Bake-Off is a perfect example: Four vendors (selected by their Gartner search popularity) took the stage in a live showdown of data viz expertise. Rita Sallam Introduces the Data Prep Rodeo.

Scorecard

Scorecard ROI Data-driven Visualization

How CFM built a well-governed and scalable data-engineering platform using Amazon EMR for financial features generation

AWS Big Data

SEPTEMBER 13, 2024

In recent years, driven by the commoditization of data storage and processing solutions, the industry has seen a growing number of systematic investment management firms switch to alternative data sources to drive their investment decisions. Each team is the sole owner of its AWS account.

Interactive

Interactive Strategy Cost-Benefit Data Governance

The Modern Data Stack Explained: What The Future Holds

Alation

JANUARY 17, 2023

Chances are, you’ve heard of the term “modern data stack” before. In this article, I will explain the modern data stack in detail, list some benefits, and discuss what the future holds. What Is the Modern Data Stack? It is known to have benefits in handling data due to its robustness, speed, and scalability.

Data Warehouse

Data Warehouse Cost-Benefit Data Science Data Transformation

Building Better Data Models to Unlock Next-Level Intelligence

Sisense

MAY 11, 2021

You can’t talk about data analytics without talking about data modeling. The reasons for this are simple: Before you can start analyzing data, huge datasets like data lakes must be modeled or transformed to be usable. Building the right data model is an important part of your data strategy.

Modeling

Modeling Big Data IoT Data Warehouse

5 best open source data flow lineage tools

Octopai

AUGUST 11, 2024

Data flow lineage is crucial for anyone handling data within organizations. In essence, data flow lineage is indispensable for ensuring transparency, maintaining data quality, achieving compliance, enabling efficient troubleshooting, conducting impact analysis, and enhancing collaboration within organizations.

Metadata

Metadata Visualization Data Quality Data Governance

Cloudera DataFlow Designer: The Key to Agile Data Pipeline Development

Cloudera

MARCH 14, 2023

We just announced the general availability of Cloudera DataFlow Designer, bringing self-service data flow development to all CDP Public Cloud customers. In this blog post we will put these capabilities in context and dive deeper into how the built-in, end-to-end data flow life cycle enables self-service data pipeline development.

Testing

Testing Publishing Metadata Interactive

Transforming Big Data into Actionable Intelligence

Sisense

MARCH 14, 2021

Attempting to learn more about the role of big data (here taken to mean datasets of high volume, velocity, and variety) within business intelligence today can sometimes create more confusion than it alleviates, as vital terms are used interchangeably instead of distinctly. Big data challenges and solutions.

Big Data

Big Data IoT Data Warehouse Data-driven

How to Build a Successful Metadata Management Framework

Alation

JUNE 28, 2022

Collecting and using data to make informed decisions is the new foundation for businesses. The key term here is usable: Anyone can be data rich, and collect vast troves of data. This is where metadata, or the data about data, comes into play. A metadata management framework does the same for your data analysts.

Metadata

Metadata Management Data Governance Machine Learning

7 Things All Successful Data Product Managers Have In Common

Alation

FEBRUARY 2, 2023

Data product managers are in high demand these days. This makes it more important for aspiring data product managers to stay ahead of the competition. So what sets data product managers apart from the pack? This post will unpack the top 7 traits that successful data product managers have in common. Sounds exciting?

Management

Management Data-driven Visualization Strategy

Optimize data layout by bucketing with Amazon Athena and AWS Glue to accelerate downstream queries

AWS Big Data

APRIL 25, 2024

In the era of data, organizations are increasingly using data lakes to store and analyze vast amounts of structured and unstructured data. Data lakes provide a centralized repository for data from various sources, enabling organizations to unlock valuable insights and drive data-driven decision-making.

Optimization

Optimization Data Lake Cost-Benefit Reporting

Manual Feature Engineering

Domino Data Lab

AUGUST 20, 2019

Many thanks to AWP Pearson for the permission to excerpt “Manual Feature Engineering: Manipulating Data for Fun and Profit” from the book, Machine Learning with Python for Everyone by Mark E. Feature engineering is useful for data scientists when assessing tradeoff decisions regarding the impact of their ML models.

Testing

Testing Modeling Interactive Measurement

How Tricentis unlocks insights across the software development lifecycle at speed and scale using Amazon Redshift

AWS Big Data

MARCH 3, 2023

In this post, we share how the AWS Data Lab helped Tricentis to improve their software as a service (SaaS) Tricentis Analytics platform with insights powered by Amazon Redshift. Although Tricentis has amassed such data over a decade, the data remains untapped for valuable insights.

Software

Software Data Lake Testing Cost-Benefit

Expanding data analysis and visualization options: Amazon DataZone now integrates with Tableau, Power BI, and more

AWS Big Data

OCTOBER 30, 2024

Amazon DataZone has launched authentication support through the Amazon Athena JDBC driver, allowing data users to seamlessly query their subscribed data lake assets via popular business intelligence (BI) and analytics tools like Tableau, Power BI, Excel, SQL Workbench, DBeaver, and more.

Visualization

Visualization Data Lake Testing Data Governance

SAP Datasphere Powers Business at the Speed of Data

Rocket-Powered Data Science

MARCH 20, 2023

We live in a data-rich, insights-rich, and content-rich world. Data collections are the ones and zeroes that encode the actionable insights (patterns, trends, relationships) that we seek to extract from our data through machine learning and data science. Plus, AI can also help find key insights encoded in data.

Data Warehouse

Data Warehouse Metadata Digital Transformation Machine Learning

Streamline AI-driven analytics with governance: Integrating Tableau with Amazon DataZone

AWS Big Data

OCTOBER 30, 2024

Amazon DataZone is a data management service that makes it faster and easier for customers to catalog, discover, share, and govern data stored across AWS, on premises, and from third-party sources. Using Amazon DataZone lets us avoid building and maintaining an in-house platform, allowing our developers to focus on tailored solutions.

Analytics

Analytics Visualization Data Governance Data-driven

Empower your Jira data in a data lake with Amazon AppFlow and AWS Glue

AWS Big Data

AUGUST 1, 2023

Although Jira Cloud provides reporting capability, loading this data into a data lake will facilitate enrichment with other business data, as well as support the use of business intelligence (BI) tools and artificial intelligence (AI) and machine learning (ML) applications.

Data Lake

Data Lake Data Transformation Data-driven Cost-Benefit
