Data Analytics, Data Lake and IoT

From data lakes to insights: dbt adapter for Amazon Athena now supported in dbt Cloud

AWS Big Data

NOVEMBER 22, 2024

At AWS, we are committed to empowering organizations with tools that streamline data analytics and transformation processes. This integration enables data teams to efficiently transform and manage data using Athena with dbt Cloud’s robust features, enhancing the overall data workflow experience.

Data Lake

Data Lake Data Warehouse Cost-Benefit Data Transformation

How EUROGATE established a data mesh architecture using Amazon DataZone

AWS Big Data

JANUARY 15, 2025

Their terminal operations rely heavily on seamless data flows and the management of vast volumes of data. Recently, EUROGATE has developed a digital twin for its Container Terminal Hamburg (CTH), generating millions of data points every second from Internet of Things (IoT) devices attached to its container handling equipment (CHE).

IoT

IoT Machine Learning Metadata Data-driven

Migrate Delta tables from Azure Data Lake Storage to Amazon S3 using AWS Glue

AWS Big Data

SEPTEMBER 10, 2024

We often see requests from customers who have started their data journey by building data lakes on Microsoft Azure, to extend access to the data to AWS services. In such scenarios, data engineers face challenges in connecting and extracting data from storage containers on Microsoft Azure.

Data Lake

Data Lake Metadata Management Software

The Future Of The Telco Industry And Impact Of 5G & IoT – Part II

Cloudera

AUGUST 28, 2020

The real opportunity for 5G, however, is going to be on the B2B side; IoT and mission-critical applications will benefit hugely. This creates new revenue opportunities through IoT use cases and new services. 5G and IoT are going to drive an explosion in data.

IoT

IoT Machine Learning B2B Testing

Migrate from Amazon Kinesis Data Analytics for SQL Applications to Amazon Kinesis Data Analytics Studio

AWS Big Data

JUNE 29, 2023

Amazon Kinesis Data Analytics makes it easy to transform and analyze streaming data in real time. In this post, we discuss why AWS recommends moving from Kinesis Data Analytics for SQL Applications to Amazon Kinesis Data Analytics for Apache Flink to take advantage of Apache Flink’s advanced streaming capabilities.

Data Analytics

Data Analytics Analytics IoT Data Lake

Interview with: Sankar Narayanan, Chief Practice Officer at Fractal Analytics

Corinium

JUNE 6, 2019

For instance, for a variety of reasons, CDAOs are challenged in the short term with quantifying the benefits of analytics investments. Some of the work is very foundational, such as building an enterprise data lake and migrating it to the cloud, which enables other more direct value-added activities such as self-service.

Insurance

Insurance Analytics Forecasting Deep Learning

7 key Microsoft Azure analytics services (plus one extra)

CIO Business Intelligence

JUNE 29, 2022

And as businesses contend with increasingly large amounts of data, the cloud is fast becoming the logical place where analytics work gets done. For many enterprises, Microsoft Azure has become a central hub for analytics. Azure Data Explorer. Azure Data Lake Analytics.

Data Lake

Data Lake Analytics Data Warehouse Machine Learning

PepsiCo transforms for the digital era

CIO Business Intelligence

DECEMBER 1, 2022

The company is also refining its data analytics operations, and it is deploying advanced manufacturing using IoT devices, as well as AI-enhanced robotics. One HR employee took some courses in data analytics and found a new job within the company helping to advance digital transformation.

Digital Transformation

Digital Transformation IoT Data-driven KPI

Join a streaming data source with CDC data for real-time serverless data analytics using AWS Glue, AWS DMS, and Amazon DynamoDB

AWS Big Data

MAY 30, 2023

Customers have been using data warehousing solutions to perform their traditional analytics tasks. Traditional batch ingestion and processing pipelines that involve operations such as data cleaning and joining with reference data are straightforward to create and cost-efficient to maintain. options(**additional_options).mode("append").save(s3_output_folder)

Data Lake

Data Lake Data Analytics Analytics Data Processing

Architectural patterns for real-time analytics using Amazon Kinesis Data Streams, part 1

AWS Big Data

JANUARY 8, 2024

This is the first post in a blog series that offers common architectural patterns for building real-time data streaming infrastructures using Kinesis Data Streams for a wide range of use cases. In this post, we will review the common architectural patterns of two use cases: Time Series Data Analysis and Event Driven Microservices.

Analytics

Analytics IoT Data-driven Snapshot

DS Smith sets a single-cloud agenda for sustainability

CIO Business Intelligence

DECEMBER 6, 2023

“We collect lots of sensor data on machine performance, vibration data, temperature data, chemical data, and we like to have performative combinations of those datasets,” Dickson says. Dickson says that DS Smith also plans to use virtual private clouds for some corporate data, giving it flexibility and control.

Manufacturing

Manufacturing Data Lake Digital Transformation Machine Learning

NJ Transit creates ‘data engine’ to fuel transformation

CIO Business Intelligence

SEPTEMBER 12, 2022

Collectively, the agencies also have pilots up and running to test electric buses and IoT sensors scattered throughout the transportation system. IDC analyst Sandeep Mukunda says NJ Transit’s approach to data analytics has been very advanced. Lookman Fazal, chief information and digital officer, NJ Transit.

Data Warehouse

Data Warehouse Predictive Analytics Data Lake IoT

The Ten Standard Tools To Develop Data Pipelines In Microsoft Azure

DataKitchen

JULY 27, 2023

Let’s go through the ten Azure data pipeline tools. Azure Data Factory: This cloud-based data integration service allows you to create data-driven workflows for orchestrating and automating data movement and transformation. You can use it for big data analytics and machine learning workloads.

Machine Learning

Machine Learning Cost-Benefit Data Transformation Testing

Connect the Data Lifecycle: The power of data

Cloudera

AUGUST 27, 2020

With customer-centricity in mind, Manulife set out to find ways of gathering scattered and locked up customer data and bringing it together to provide real-time data insights to the business users. They wanted a holistic view of their customers, in order to provide better services.

Internet of Things

Internet of Things Uncertainty Data Lake IoT

Reference guide to build inventory management and forecasting solutions on AWS

AWS Big Data

APRIL 11, 2023

Such a solution should use the latest technologies, including Internet of Things (IoT) sensors, cloud computing, and machine learning (ML), to provide accurate, timely, and actionable data. To take advantage of this data and build an effective inventory management and forecasting solution, retailers can use a range of AWS services.

Forecasting

Forecasting Management IoT Data-driven

Havmor’s VP IT Dhaval Mankad on ‘melting’ hurdles with a scoop of digital innovation

CIO Business Intelligence

JULY 17, 2023

It’s about possessing meaningful data that helps make decisions around product launches or product discontinuations, because we have information at the product and region level, as well as margins, profitability, transport costs, and so on. How is Havmor leveraging emerging technologies such as cloud, internet of things (IoT), and AI?

IT

IT Digital Transformation IoT Internet of Things

Innovate What’s Next: How Living Labs Brings Ideas to Life

CIO Business Intelligence

APRIL 6, 2022

We are centered around co-creating with customers and promoting a systematic and scalable innovation approach to solve real-world customer problems, similar to Toyota leveraging Infosys Cobalt to modernize its vehicle data warehouse into a next-generation data lake on AWS.

Experimentation

Experimentation Uncertainty Data Lake Enterprise

Building Better Data Models to Unlock Next-Level Intelligence

Sisense

MAY 11, 2021

You can’t talk about data analytics without talking about data modeling. These two functions are nearly inseparable as we move further into a world of analytics that blends sources of varying volume, variety, veracity, and velocity. displaying BI insights for human users).

Modeling

Modeling Big Data IoT Data Warehouse

When will AI usher in a new era of manufacturing?

CIO Business Intelligence

JULY 12, 2023

A massive amount of data is already collected from sensors across all processes and from all supply chain partners. “We created a data lake, so we have access to all that data in a very efficient way,” says Papermaster. That information is now stored in a way that makes it useable to different tools.

Manufacturing

Manufacturing Cost-Benefit Data Lake Optimization

Breaking barriers in geospatial: Amazon Redshift, CARTO, and H3

AWS Big Data

MAY 16, 2024

About Amazon Redshift: Thousands of customers rely on Amazon Redshift to analyze data from terabytes to petabytes and run complex analytical queries. With Amazon Redshift, you can get real-time insights and predictive analytics on all of your data across your operational databases, data lake, data warehouse, and third-party datasets.

Data Warehouse

Data Warehouse Visualization Cost-Benefit Data-driven

Data for All: Empowering Users With AI, ML, and Analytics

Sisense

JUNE 12, 2019

In addition, providing a world-class analytics platform requires a deep understanding of how to best leverage AI/ML to support the needs of all users from the novice to the most technical. Data literacy and data skills, which created the forgotten dark data lakes in the first place, are still scarce.

Analytics

Analytics Data-driven Dashboards IoT

Seeing the Enterprise Data Cloud in Action at DataWorks Summit DC

Cloudera

MAY 15, 2019

Barbara Eckman from Comcast is another keynote speaker, and is also presenting a breakout session about Comcast’s streaming data platform. The platform comprises ingest, transformation, and storage services in the public cloud, and on-prem RDBMSs, EDWs, and a large, ungoverned legacy data lake. American Water.

Enterprise

Enterprise Data Lake Data mining IoT

Announcing the 2021 Data Impact Awards

Cloudera

MAY 12, 2021

This category is open to organizations that have tackled transformative business use cases by connecting multiple parts of the data lifecycle to enrich, report, serve, and predict. DATA FOR ENTERPRISE AI. Industry Transformation: Telkomsel, ingesting 25TB of data daily to provide advanced customer analytics in real time.

Digital Transformation

Digital Transformation Machine Learning Optimization Data Lake

How The Cloud Made ‘Data-Driven Culture’ Possible | Part 1

BizAcuity

MAY 10, 2022

Google launches BigQuery, its own data warehousing tool and Microsoft introduces Azure SQL Data Warehouse and Azure Data Lake Store. 2018: IoT and edge computing open up new opportunities for organizations. Microsoft starts to offer Azure IoT Central and IoT Edge. Google announces Cloud IoT.

Data-driven

Data-driven IoT Unstructured Data Data Lake

Three Trends for Modernizing Analytics and Data Warehousing in 2019

Cloudera

DECEMBER 19, 2018

Data analytics priorities have shifted this year. Don’t blink or you might miss what leading organizations are doing to modernize their analytic and data warehousing environments. Natural language analytics and streaming data analytics are emerging technologies that will impact the market.

Data Warehouse

Data Warehouse Analytics Big Data Data Architecture

A hybrid approach in healthcare data warehousing with Amazon Redshift

AWS Big Data

FEBRUARY 21, 2023

What is a data vault? It is a data modeling methodology designed for large-scale data warehouse platforms. The data vault approach is a method and architectural framework for providing a business with data analytics services to support business intelligence, data warehousing, analytics, and data science needs.

Data Warehouse

Data Warehouse Data Lake Cost-Benefit Metadata

Unlock scalability, cost-efficiency, and faster insights with large-scale data migration to Amazon Redshift

AWS Big Data

AUGUST 1, 2024

We can determine the following are needed: An open data format ingestion architecture processing the source dataset and refining the data in the S3 data lake. This requires a dedicated team of 3–7 members building a serverless data lake for all data sources. Vijay Bagur is a Sr.

Data Warehouse

Data Warehouse KPI Optimization Cost-Benefit

Break data silos and stream your CDC data with Amazon Redshift streaming and Amazon MSK

AWS Big Data

DECEMBER 13, 2023

Apache Kafka is an open-source distributed event streaming platform used by thousands of companies for high-performance data pipelines, streaming analytics, data integration, and mission-critical applications. Internet-of-Things [IoT] devices, system telemetry data, or clickstream data) from a busy website or application.

Data Warehouse

Data Warehouse Snapshot Data Processing Internet of Things

How Cargotec uses metadata replication to enable cross-account data sharing

AWS Big Data

JUNE 7, 2023

Cargotec captures terabytes of IoT telemetry data from their machinery operated by numerous customers across the globe. This data needs to be ingested into a data lake, transformed, and made available for analytics, machine learning (ML), and visualization.

Metadata

Metadata Data Lake Machine Learning Big Data

Introducing the AWS ProServe Hadoop Migration Delivery Kit TCO tool

AWS Big Data

FEBRUARY 6, 2023

Use case overview: Migrating Hadoop workloads to Amazon EMR accelerates big data analytics modernization, increases productivity, and reduces operational cost. Refactoring coupled compute and storage to a decoupled architecture is a modern data solution. Jiseong Kim is a Senior Data Architect at AWS ProServe.

Cost-Benefit

Cost-Benefit Data Lake Dashboards Big Data

A Day in the Life of an Analyst at Gartner IT Symposium XPO 2019 USA – Day 4 Oct 24 2019

Andrew White

OCTOBER 25, 2019

Here is my final analysis of my 1-1s and interactions this week: Topic: Data Governance 28. Vision/Data Driven/Outcomes 28. Data, analytics, or D&A Strategy 21. Modern) Master Data Management 18. Data lake 4. Data Literacy 4. IoT/Streaming data 1. AI/Automation 6.

Recreation/Entertainment

Recreation/Entertainment IT Data Lake Data-driven

Big Data Fabric Weaves Together Automation, Scalability, and Intelligence

Cloudera

JANUARY 22, 2019

Forrester describes Big Data Fabric as, “A unified, trusted, and comprehensive view of business data produced by orchestrating data sources automatically, intelligently, and securely, then preparing and processing them in big data platforms such as Hadoop and Apache Spark, data lakes, in-memory, and NoSQL.”

Big Data

Big Data Data Lake Internet of Things Enterprise

AWS Glue streaming application to process Amazon MSK data using AWS Glue Schema Registry

AWS Big Data

JUNE 12, 2023

Organizations across the world are increasingly relying on streaming data, and there is a growing need for real-time data analytics, considering the growing velocity and volume of data being collected.

Management

Management Metadata Internet of Things Testing

Why We Started the Data Intelligence Project

Alation

JULY 7, 2022

In the 2010s, the growing scope of the data landscape gave rise to a new profession: the data scientist. This new role, combined with the creation of data lakes and the increasing use of cloud services, created new employment opportunities in data analytics, data architecture, and data management.

Metadata

Metadata Data-driven Insurance Statistics

The Cloud Connection: How Governance Supports Security

Alation

APRIL 14, 2022

Data discovery is also critical for data governance, which, when ineffective, can actually hinder organizational growth. And, as organizations progress and grow, “data drift” starts to impact data usage, models, and your business. Pushing data to a data lake and assuming it is ready for use is shortsighted.

Metadata

Metadata Data Governance Data-driven Modeling

How to Build a Customer Centric Business: The Complete Guide

Alation

AUGUST 2, 2022

Customer centricity requires modernized data and IT infrastructures. Too often, companies manage data in spreadsheets or individual databases. This means that you’re likely missing valuable insights that could be gleaned from data lakes and data analytics. Customer Data Privacy And Security.

Cost-Benefit

Cost-Benefit Metrics Strategy Data Lake

Living on the Edge: How to Accelerate Your Business with Real-time Analytics

Cloudera

SEPTEMBER 15, 2021

Leveraging the Internet of Things (IoT) allows you to improve processes and take your business in new directions. That’s where you find the ability to empower IoT devices to respond to events in real time by capturing and analyzing the relevant data. The IoT depends on edge sites for real-time functionality.

IoT

IoT Analytics Internet of Things Data Lake

Building a scalable streaming data platform that enables real-time and batch analytics of electric vehicles on AWS

AWS Big Data

JULY 17, 2024

In this blog post, we delve into the intricacies of building a reliable data analytics pipeline that can scale to accommodate millions of vehicles, each generating hundreds of metrics every second using Amazon OpenSearch Ingestion. OpenSearch Ingestion provides a fully managed serverless integration to tap into these data streams.

Analytics

Analytics IoT Dashboards Data Lake

Interview with Dominic Sartorio, Senior Vice President for Products & Development, Protegrity

Corinium

APRIL 25, 2019

Ahead of the Chief Data Analytics Officers & Influencers, Insurance event, we caught up with Dominic Sartorio, Senior Vice President for Products & Development, Protegrity, to discuss how the industry is evolving. And more recently, we have also seen innovation with IoT (Internet of Things).

Insurance

Insurance Risk IoT Data-driven

It’s not your data. It’s how you use it. Unlock the power of data & build foundations of a data driven organisation

CIO Business Intelligence

MAY 24, 2022

Organisations have to contend with legacy data and increasing volumes of data spread across multiple silos. To meet these demands, many IT teams find themselves acting as systems integrators, having to find ways to access and manipulate large volumes of data for multiple business functions and use cases. zettabytes of data.

Data-driven

Data-driven Data Lake Data Warehouse Machine Learning

What is a Data Pipeline?

Jet Global

MAY 9, 2024

A data pipeline is a series of processes that move raw data from one or more sources to one or more destinations, often transforming and processing the data along the way. Data pipelines support data science and business intelligence projects by providing data engineers with high-quality, consistent, and easily accessible data.

Data Lake

Data Lake Data Warehouse Business Intelligence Machine Learning

Accelerate queries on Apache Iceberg tables through AWS Glue auto compaction

AWS Big Data

DECEMBER 19, 2024

Data lakes were originally designed to store large volumes of raw, unstructured, or semi-structured data at a low cost, primarily serving big data and analytics use cases. Enabling automatic compaction on Iceberg tables reduces metadata overhead on your Iceberg tables and improves query performance.

Data Lake

Data Lake IoT Metadata Testing

Achieve the best price-performance in Amazon Redshift with elastic histograms for selectivity estimation

AWS Big Data

OCTOBER 25, 2024

Amazon Redshift is a fast, scalable, and fully managed cloud data warehouse that allows you to process and run your complex SQL analytics workloads on structured and semi-structured data. Mengchu currently works on query optimization and data lake query performance.

Statistics

Statistics Data Warehouse Metadata Data Lake

Stream real-time data into Apache Iceberg tables in Amazon S3 using Amazon Data Firehose

AWS Big Data

NOVEMBER 6, 2024

Second, because traditional data warehousing approaches are unable to keep up with the volume, velocity, and variety of data, engineering teams are building data lakes and adopting open data formats such as Parquet and Apache Iceberg to store their data.

Metadata

Metadata Data Lake Management Internet of Things
