Data Analytics, Data Lake and Workshop

Data Analytics

Data Lake

Workshop

Orca Security’s journey to a petabyte-scale data lake with Apache Iceberg and AWS Analytics

AWS Big Data

JULY 20, 2023

With data becoming the driving force behind many industries today, having a modern data architecture is pivotal for organizations to be successful. In this post, we describe Orca’s journey building a transactional data lake using Amazon Simple Storage Service (Amazon S3), Apache Iceberg, and AWS Analytics.

Data Lake

Data Lake Analytics Snapshot Data Quality

Create an Apache Hudi-based near-real-time transactional data lake using AWS DMS, Amazon Kinesis, AWS Glue streaming ETL, and data visualization using Amazon QuickSight

AWS Big Data

AUGUST 3, 2023

With the rapid growth of technology, more and more data volume is coming in many different formats—structured, semi-structured, and unstructured. Data analytics on operational data at near-real time is becoming a common need. Then we can query the data with Amazon Athena visualize it in Amazon QuickSight.

Data Lake

Data Lake Visualization Dashboards Insurance

Join 42,000+

Insiders

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Webinars

Automation, Evolved: Your New Playbook For Smarter Knowledge Work

MORE WEBINARS

Trending Sources

How AWS helped Altron Group accelerate their vision for optimized customer engagement

AWS Big Data

JULY 13, 2023

Altron is a pioneer of providing data-driven solutions for their customers by combining technical expertise with in-depth customer understanding to provide highly differentiated technology solutions. Data quality for account and customer data – Altron wanted to enable data quality and data governance best practices.

Optimization

Optimization B2B Data Quality Sales

Webinars

Automation, Evolved: Your New Playbook For Smarter Knowledge Work

MORE WEBINARS

Real-time streaming data top picks you cannot miss at AWS re:Invent 2023

AWS Big Data

NOVEMBER 8, 2023

Putting your data to work with generative AI – Innovation Talk Thursday, November 30 | 12:30 – 1:30 PM PST | The Venetian Join Mai-Lan Tomsen Bukovec, Vice President, Technology at AWS to learn how you can turn your data lake into a business advantage with generative AI. Reserve your seat now! Reserve your seat now!

Data-driven

Data-driven Machine Learning Data Lake Cost-Benefit

Reference guide to build inventory management and forecasting solutions on AWS

AWS Big Data

APRIL 11, 2023

By collecting data from store sensors using AWS IoT Core , ingesting it using AWS Lambda to Amazon Aurora Serverless , and transforming it using AWS Glue from a database to an Amazon Simple Storage Service (Amazon S3) data lake, retailers can gain deep insights into their inventory and customer behavior.

Forecasting

Forecasting Management IoT Data-driven

Innovate What’s Next: How Living Labs Brings Ideas to Life

CIO Business Intelligence

APRIL 6, 2022

We are centered around co-creating with customers and promoting a systematic and scalable innovation approach to solve real-world customers problems—similar to Toyota leveraging Infosys Cobalt to modernize its vehicle data warehouse into a next-generation data lake on AWS. .

Experimentation

Experimentation Data Lake Uncertainty Enterprise

Extend your data mesh with Amazon Athena and federated views

AWS Big Data

JULY 28, 2023

Clean up To clean up the resources created for this post, complete the following steps: On the Amazon S3 console, empty the bucket athena-federation-workshop-. If you’re using the AWS CLI, delete the objects in the athena-federation-workshop- bucket with the following code. Big Data Architect on Amazon Athena.

Big Data

Big Data Data Architecture Data Lake Interactive

Turning Streams Into Data Products

Cloudera

JUNE 16, 2022

Building real-time data analytics pipelines is a complex problem, and we saw customers struggle using processing frameworks such as Apache Storm, Spark Streaming, and Kafka Streams. . Without context, streaming data is useless.” Convergence of batch and streaming made easy.

Data Lake

Data Lake Manufacturing Metadata Dashboards

Build a real-time analytics solution with Apache Pinot on AWS

AWS Big Data

AUGUST 6, 2024

In essence, it’s the foundation for user-centric data analysis in modern apps, because it’s the layer that translates technical assets into business-friendly terms that enable users to extract actionable insights from data. The scope of data analytics has grown, and more user personas are now seeking to extract insights themselves.

OLAP

OLAP Analytics Visualization Dashboards

Automate the archive and purge data process for Amazon RDS for PostgreSQL using pg_partman, Amazon S3, and AWS Glue

AWS Big Data

AUGUST 22, 2023

Gain a high-level understanding of AWS Glue and its components by using the following hands-on workshop. Vivek Shrivastava is a Principal Data Architect, Data Lake in AWS Professional Services. He is a big data enthusiast and holds 14 AWS Certifications.

Data Processing

Data Processing Testing Data Lake Data Integration

Introducing Amazon EMR on EKS job submission with Spark Operator and spark-submit

AWS Big Data

JUNE 6, 2023

In the following code, replace the EKS endpoint as well as the S3 bucket then run the script: /bin/spark-submit --class ValueZones --master k8s://EKS-ENDPOINT --conf spark.kubernetes.namespace=data-team-a --conf spark.kubernetes.container.image=608033475327.dkr.ecr.us-west-1.amazonaws.com/spark/emr-6.10.0:latest amazonaws.com/spark/emr-6.10.0:latest

Optimization

Optimization Data Lake Cost-Benefit Management

The Gartner 2021 Leadership Vision for Data & Analytics Leaders Webinar Q&A

Andrew White

JANUARY 11, 2021

Note: Delivery of data, analytics solutions and the sustainment of technology, data and services is a question. Does Data warehouse as a software tool will play role in future of Data & Analytics strategy? Data lakes don’t offer this nor should they. Governance. Product Management.

Data Analytics

Data Analytics Analytics Data-driven Finance

Week in the Life of an Analyst at Gartner US IT Symposium (virtual) 2021

Andrew White

OCTOBER 22, 2021

Analytics Tactics (known outcome/known data/BI/analytics v unknown outcome/unknown data/data science/ML) 11. Data Hub Strategy 10. Lakehouse (data warehouse and data lake working together) 8. Data Literacy, training, coordination, collaboration 8. Specific Vendor Questions 5.

IT Data Lake Data Science Strategy

Stream real-time data into Apache Iceberg tables in Amazon S3 using Amazon Data Firehose

AWS Big Data

NOVEMBER 6, 2024

Second, because traditional data warehousing approaches are unable to keep up with the volume, velocity, and variety of data, engineering teams are building data lakes and adopting open data formats such as Parquet and Apache Iceberg to store their data.

Metadata

Metadata Data Lake Management Internet of Things

Streamline AWS WAF log analysis with Apache Iceberg and Amazon Data Firehose

AWS Big Data

FEBRUARY 18, 2025

To optimize their security operations, organizations are adopting modern approaches that combine real-time monitoring with scalable data analytics. They are using data lake architectures and Apache Iceberg to efficiently process large volumes of security data while minimizing operational overhead.

Snapshot

Snapshot Optimization Data Lake Metadata

Data Leaders Brief

Orca Security’s journey to a petabyte-scale data lake with Apache Iceberg and AWS Analytics

Create an Apache Hudi-based near-real-time transactional data lake using AWS DMS, Amazon Kinesis, AWS Glue streaming ETL, and data visualization using Amazon QuickSight

Webinars

Trending Sources

How AWS helped Altron Group accelerate their vision for optimized customer engagement

Webinars

Real-time streaming data top picks you cannot miss at AWS re:Invent 2023

Reference guide to build inventory management and forecasting solutions on AWS

Innovate What’s Next: How Living Labs Brings Ideas to Life

Extend your data mesh with Amazon Athena and federated views

Turning Streams Into Data Products

Build a real-time analytics solution with Apache Pinot on AWS

Automate the archive and purge data process for Amazon RDS for PostgreSQL using pg_partman, Amazon S3, and AWS Glue

Introducing Amazon EMR on EKS job submission with Spark Operator and spark-submit

The Gartner 2021 Leadership Vision for Data & Analytics Leaders Webinar Q&A

Week in the Life of an Analyst at Gartner US IT Symposium (virtual) 2021

Stream real-time data into Apache Iceberg tables in Amazon S3 using Amazon Data Firehose

Streamline AWS WAF log analysis with Apache Iceberg and Amazon Data Firehose

Stay Connected