Article and Big Data - Data Leaders Brief

How is Big Data Helping in the Development of Healthcare?

Analytics Vidhya

SEPTEMBER 21, 2022

This article was published as a part of the Data Science Blogathon. Introduction “Big data in healthcare” refers to the large volume of health data collected from many sources, including electronic health records (EHRs), medical imaging, genomic sequencing, wearables, payer records, medical devices, and pharmaceutical research.

Big Data Data Collection Data Science Publishing

Relationship Between Facebook and Big Data

Analytics Vidhya

OCTOBER 30, 2022

This article was published as a part of the Data Science Blogathon. When you look closely at all these birthday notifications, you will find that the name […].

Big Data Data Science Publishing Analytics

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Data Talks, CFOs Listen: Why Analytics Are Key To Better Spend Management

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

An Introduction to Hadoop Ecosystem for Big Data

Analytics Vidhya

MAY 27, 2022

This article was published as a part of the Data Science Blogathon. Introduction Every day the internet generates billions of bytes of data. Every time you put on a dog filter, watch cat videos or order food from your favourite restaurant, you generate data.

Big Data Data Science Publishing Analytics

The Origin of Big Data Analytics

Analytics Vidhya

SEPTEMBER 28, 2022

This article was published as a part of the Data Science Blogathon. Introduction Big data is now an irreplaceable part of tech giants and businesses. Business applications range from customer fraud detection to personalization with extensive data analytics dashboards. They also lead to more efficient operations.

Big Data Data Analytics Analytics Dashboards

An Introductory Guide to Big Data Analytics

Analytics Vidhya

SEPTEMBER 6, 2021

This article was published as a part of the Data Science Blogathon. One thing that comes to mind on hearing "Big Data Analytics" is that this field might be somewhat related to Data Science, right?

Big Data Data Analytics Analytics Data Science

Big Data with Spark and Scala

Analytics Vidhya

DECEMBER 12, 2020

This article was published as a part of the Data Science Blogathon. Introduction Big Data is a new term that is used widely in […].

Big Data Data Science Publishing Analytics

Introduction to Spark MLlib for Big Data and Machine Learning

Analytics Vidhya

NOVEMBER 11, 2020

This article was published as a part of the Data Science Blogathon. Overview With the demand for big data and machine learning, this article […].

Machine Learning Big Data Data Science Publishing

Learn Presto & Starburst for Big Data Analysis

Analytics Vidhya

AUGUST 30, 2022

This article was published as a part of the Data Science Blogathon. […] terabytes of data to manage. Whether you’re a small company or a trillion-dollar giant, data makes the decision. But as data ecosystems become more complex, it’s important to have the right tools for the […].

Big Data Data Science Publishing Management

Getting Started with Big Data & Hadoop

Analytics Vidhya

APRIL 26, 2022

This article was published as a part of the Data Science Blogathon. Introduction on Big Data & Hadoop The amount of data in our world is growing exponentially. […] quintillions of data are being generated every day. No wonder Big Data is a fast-growing field with great opportunities […].

Big Data Data Science Publishing Analytics

All About Big Data File Formats

Analytics Vidhya

MAY 31, 2022

This article was published as a part of the Data Science Blogathon. Introduction to Big Data File Formats In the digital era, every day we generate thousands of terabytes of data. The most challenging task is to store and process this data.

Big Data Data Science Publishing Analytics

Why Big Data needs to become Smart Data?

Analytics Vidhya

AUGUST 31, 2022

This article was published as a part of the Data Science Blogathon. The need to maximize company efficiency and profitability has led the world to leverage data as a powerful tool. Data is reusable, everywhere, replicable, easily transferable, and […].

Big Data Data Science Publishing Optimization

10 Must-Have Big Data Skills to Land a Job in 2023

Analytics Vidhya

JULY 18, 2023

Introduction In the rapidly evolving world of modern business, big data skills have emerged as indispensable for unlocking the true potential of data. This article delves into the core competencies needed to effectively navigate the realm of big data.

Big Data Analytics Data mining IT

What is Big Data? Introduction, Uses, and Applications.

Analytics Vidhya

MAY 20, 2021

This article was published as a part of the Data Science Blogathon. Introduction We produce a massive amount of data each day, whether […].

Big Data Data Science Publishing Analytics

How Big Data Is Shaping HealthCare To Make It Further Affordable, Accurate & Intelligent

Analytics Vidhya

JUNE 28, 2021

This article was published as a part of the Data Science Blogathon. What Is Big Data? Big data is a […].

Big Data IT Data Science Publishing

Big Data to Small Data – Welcome to the World of Reservoir Sampling

Analytics Vidhya

NOVEMBER 6, 2020

This article was published as a part of the Data Science Blogathon. Introduction Big Data refers to a combination of structured and unstructured data.

Big Data Unstructured Data Data Science Publishing

10 Examples of How Big Data in Logistics Can Transform The Supply Chain

datapine

MAY 2, 2023

Table of Contents: 1) Benefits Of Big Data In Logistics; 2) 10 Big Data In Logistics Use Cases. Big data is revolutionizing many fields of business, and logistics analytics is no exception. The complex and ever-evolving nature of logistics makes it an essential use case for big data applications.

Big Data Internet of Things Cost-Benefit Optimization

Core technologies and tools for AI, big data, and cloud computing

O'Reilly on Data

FEBRUARY 11, 2019

Many companies are just beginning to address the interplay between their suite of AI, big data, and cloud technologies. I’ll also highlight some interesting use cases and applications of data, analytics, and machine learning. Foundational data technologies. Data Platforms. Data Integration and Data Pipelines.

Big Data Technology Machine Learning Deep Learning

Top Interview Questions & Answers for Apache Sqoop

Analytics Vidhya

JULY 29, 2022

This article was published as a part of the Data Science Blogathon. Introduction One of the sources of Big Data is the traditional application management system or the interaction of applications with relational databases using RDBMS. Big Data storage and analysis […].

Big Data Data Science Interactive Publishing

Introduction to Apache Sqoop

Analytics Vidhya

JULY 25, 2022

This article was published as a part of the Data Science Blogathon. Introduction Apache Sqoop is a big data engine for transferring data between Hadoop and relational database servers. Big Data Sqoop can also be […].

Big Data Data Science Publishing Management

10 Big Data Examples Showing The Great Value of Smart Analytics In Real Life At Restaurants, Bars, and Casinos

datapine

APRIL 14, 2022

“You can have data without information, but you cannot have information without data.” – Daniel Keys Moran. When you think of big data, you usually think of applications related to banking, healthcare analytics, or manufacturing.

Big Data Recreation/Entertainment Analytics Data-driven

Integration of Python with Hadoop and Spark

Analytics Vidhya

MAY 30, 2021

This article was published as a part of the Data Science Blogathon. Introduction Big data is the collection of data that is vast.

Big Data Data Science Publishing Analytics

A comprehensive guide to Feature Selection using Wrapper methods in Python

Analytics Vidhya

OCTOBER 24, 2020

This article was published as a part of the Data Science Blogathon. Introduction In today’s era of Big data and IoT, we are easily […].

IoT Big Data Data Science Publishing

Three R Libraries for Automated EDA

Analytics Vidhya

OCTOBER 7, 2022

This article was published as a part of the Data Science Blogathon. Introduction With the increasing use of technology, data accumulation is faster than ever due to connected smart devices. These devices continuously collect and transmit data that can be processed, transformed, and stored for later use.

Big Data Data Science Publishing Technology

Introduction to Apache Spark and its Datasets

Analytics Vidhya

AUGUST 17, 2022

This article was published as a part of the Data Science Blogathon. Introduction In this article, we will introduce you to the big data ecosystem and the role of Apache Spark in Big data. We will also cover the Distributed database system, the backbone of big data. Almost […].

IT Big Data Data Science Publishing

An End-to-End Starter Guide on Apache Spark and RDD

Analytics Vidhya

JUNE 2, 2022

This article was published as a part of the Data Science Blogathon. Introduction In this article, we will introduce you to Apache Spark, its role in big data, and the way it shapes the big data ecosystem. We will also explore the Resilient Distributed Dataset (RDD) in Spark.

Big Data Data Science Publishing Analytics

Learn About Apache Spark Using Python

Analytics Vidhya

APRIL 12, 2022

This article was published as a part of the Data Science Blogathon. Introduction In the last article, we discussed Apache Spark, the big data ecosystem, and the role of Apache Spark in big data processing. This article […].

Big Data Data Science Publishing Data Processing

Hive Advance: Performance Tuning Techniques

Analytics Vidhya

JUNE 6, 2022

This article was published as a part of the Data Science Blogathon. Introduction In this article, we will discuss advanced topics in Hive that are required for data engineering. Performance Tuning in […].

Big Data Data Science Publishing Optimization

Everything You Must Know About Koalas!

Analytics Vidhya

OCTOBER 10, 2022

This article was published as a part of the Data Science Blogathon. Introduction A key aspect of big data is data frames. However, Spark is more suited to handling scaled distributed data, whereas Pandas is not. Pandas and Spark are two of the most popular types. What […].

Data Science Big Data Publishing Analytics

Building A Machine Learning Pipeline Using Pyspark

Analytics Vidhya

JUNE 9, 2022

This article was published as a part of the Data Science Blogathon. Introduction to Pyspark Spark is an open-source framework for big data processing. It was originally written in Scala; later, due to increasing demand for machine learning on big data, a Python API was released.

Machine Learning Big Data Data Science Publishing

Data Ingestion Featuring AWS

Analytics Vidhya

JUNE 24, 2022

This article was published as a part of the Data Science Blogathon. Introduction Big Data is everywhere, and it continues to be a topic of growing interest. Data ingestion is a process that helps a team or its management make sense of the ever-increasing volume and complexity of data and provides useful insights.

Big Data Data Science Publishing Management

A Brief Introduction to Apache HBase and its Architecture

Analytics Vidhya

OCTOBER 12, 2022

This article was published as a part of the Data Science Blogathon. Introduction Since the 1970s, relational database management systems have solved the problems of storing and maintaining large volumes of structured data.

Structured Data Big Data Data Science Publishing

Good ETL Practices with Apache Airflow

Analytics Vidhya

NOVEMBER 30, 2021

This article was published as a part of the Data Science Blogathon. Introduction to ETL ETL is a three-step data integration process (Extraction, Transformation, Load) used to combine data from multiple sources. It is commonly used to build Big Data.

Big Data Data Science Publishing Data Integration
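
The three ETL steps named in the teaser above can be illustrated with a minimal sketch. This is plain Python with the standard library, not Apache Airflow and not code from the linked article; the sample data, the `ledger` table, and the function names are all hypothetical and exist only to show how extraction, transformation, and loading might be kept as separate steps.

```python
import csv
import io
import sqlite3

# Hypothetical in-memory sources standing in for two upstream systems.
ORDERS_CSV = "order_id,customer,amount\n1,alice,10.50\n2,bob,n/a\n3,alice,4.25\n"
REFUNDS = [{"order_id": "3", "customer": "alice", "amount": "-4.25"}]


def extract():
    """Extract: read raw rows from every source without changing them."""
    rows = list(csv.DictReader(io.StringIO(ORDERS_CSV)))
    rows.extend(REFUNDS)
    return rows


def transform(rows):
    """Transform: cast types and drop rows whose amount cannot be parsed."""
    clean = []
    for row in rows:
        try:
            amount = float(row["amount"])
        except ValueError:
            continue  # skip malformed amounts such as "n/a"
        clean.append((int(row["order_id"]), row["customer"], amount))
    return clean


def load(records, conn):
    """Load: write the cleaned records into the target table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS ledger "
        "(order_id INTEGER, customer TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO ledger VALUES (?, ?, ?)", records)
    conn.commit()


def run_etl(conn):
    """Run the three steps in order and return per-customer totals."""
    load(transform(extract()), conn)
    query = (
        "SELECT customer, SUM(amount) FROM ledger "
        "GROUP BY customer ORDER BY customer"
    )
    return conn.execute(query).fetchall()


if __name__ == "__main__":
    print(run_etl(sqlite3.connect(":memory:")))
```

Keeping each step in its own function means any one of them can be tested or replaced independently, which is also roughly how an orchestrator would schedule them as separate tasks.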

Getting Started with Azure Synapse Analytics

Analytics Vidhya

MAY 1, 2022

This article was published as a part of the Data Science Blogathon. Introduction Azure Synapse Analytics is a cloud-based service that combines the capabilities of enterprise data warehousing, big data, data integration, data visualization and dashboarding.

Analytics Predictive Analytics Dashboards Big Data

How to Use Data to Drive Your Marketing Strategy

Analytics Vidhya

SEPTEMBER 13, 2022

This article was published as a part of the Data Science Blogathon. Introduction In the era of big data, it’s no surprise that more and more marketers are using data science in marketing to better position their brands, products, and services in today’s hyper-competitive marketplace.

Marketing Strategy Data Science Big Data

Top 10 AI and Data Science Trends in 2022

Analytics Vidhya

FEBRUARY 3, 2022

This article was published as a part of the Data Science Blogathon. In this article, we shall discuss the upcoming innovations in the field of artificial intelligence, big data, machine learning and overall, Data Science Trends in 2022. Times change, technology improves and our lives get better.

Data Science Deep Learning Big Data Machine Learning

AWS Glue for Handling Metadata

Analytics Vidhya

AUGUST 19, 2022

This article was published as a part of the Data Science Blogathon. Introduction AWS Glue helps Data Engineers to prepare data for other data consumers through the Extract, Transform & Load (ETL) Process. It provides organizations with […].

Metadata Data Science Big Data Publishing

End-to-End Beginners Guide on Spark SQL in Python

Analytics Vidhya

APRIL 12, 2022

This article was published as a part of the Data Science Blogathon. Introduction In this article, we are going to cover Spark SQL in Python. In the last article, we introduced Spark, how it works, and its role in big data. If you haven’t read it yet, please go to this link.

Data Science Big Data Publishing Analytics

Apache Kafka Architecture and Use Cases Explained

Analytics Vidhya

JULY 22, 2022

This article was published as a part of the Data Science Blogathon. Introduction The big data industry is growing daily and needs tools to process vast volumes of data. That’s why you need to know about Apache Kafka, a publish-subscribe messaging system you can use to build distributed applications.

Big Data Publishing Data Science Analytics

Top 26 Data Science Tools for Data Scientists in 2024

Analytics Vidhya

DECEMBER 12, 2023

Introduction The field of data science is evolving rapidly, and staying ahead of the curve requires leveraging the latest and most powerful tools available. In 2024, data scientists have a plethora of options to choose from, catering to various aspects of their work, including programming, big data, AI, visualization, and more.

Data Science Big Data Visualization Analytics

Most Frequently Asked Apache HBase Interview Questions

Analytics Vidhya

AUGUST 1, 2022

This article was published as a part of the Data Science Blogathon. HBase provides a fault-tolerant manner of storing sparse data sets, which are prevalent in several big data use cases. It is ideal for real-time data processing or […].

Big Data Data Science Publishing Data Processing

Data Warehouses: Basic Concepts for data enthusiasts

Analytics Vidhya

SEPTEMBER 13, 2022

This article was published as a part of the Data Science Blogathon. Introduction The purpose of a data warehouse is to combine multiple sources to generate different insights that help companies make better decisions and forecasts. It consists of historical and cumulative data from single or multiple sources.

Data Warehouse Forecasting Data Science Big Data

Apache Spark Performance Optimization for Data Engineers

Analytics Vidhya

SEPTEMBER 30, 2021

This article was published as a part of the Data Science Blogathon. Introduction Apache Spark is a big data processing framework that has long been one of the most popular and frequently encountered in all kinds of Big Data projects.

Optimization Big Data Data Science Publishing

Using Docker to Create a Cassandra Cluster

Analytics Vidhya

SEPTEMBER 3, 2022

This article was published as a part of the Data Science Blogathon. Introduction In the Big Data space, companies like Amazon, Twitter, Facebook, Google, etc., collect terabytes and petabytes of user data that must be handled efficiently.

Big Data Data Science Publishing Optimization

What Are the Most Serious Privacy Concerns Regarding Big Data?

Smart Data Collective

SEPTEMBER 30, 2022

Given the growing importance of big data and the rising reliance of businesses on big data analytics to carry out their day-to-day operations, it is safe to say that big data has irrevocably altered the online world for anyone running a digital enterprise or an e-business.

Big Data Data-driven Measurement Risk