Data Leaders Brief

data-science-dictionary feature-selection

The state of data quality in 2020

O'Reilly on Data

FEBRUARY 11, 2020

We suspected that data quality was a topic brimming with interest. The responses show a surfeit of concerns around data quality and some uncertainty about how best to address those concerns. Key survey results: The C-suite is engaged with data quality. Data quality might get worse before it gets better.

Data Quality

Data Quality Metadata Data Governance Publishing

Your Effective Roadmap To Implement A Successful Business Intelligence Strategy

datapine

FEBRUARY 22, 2022

Over the past 5 years, big data and BI became more than just data science buzzwords. Without real-time insight into their data, businesses remain reactive, miss strategic growth opportunities, lose their competitive edge, fail to take advantage of cost savings options, don’t ensure customer satisfaction… the list goes on.

Business Intelligence

Business Intelligence Strategy Cost-Benefit Dashboards

Join 42,000+

Insiders

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Webinars

Data Talks, CFOs Listen: Why Analytics Are Key To Better Spend Management

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Trending Sources

Build efficient, cross-Regional, I/O-intensive workloads with Dask on AWS

AWS Big Data

MAY 4, 2023

Welcome to the era of data. The sheer volume of data captured daily continues to grow, calling for platforms and solutions to evolve. The Amazon Sustainability Data Initiative (ASDI) uses the capabilities of Amazon S3 to provide a no-cost solution for you to store and share climate science workloads across the globe.

Data Processing

Data Processing Metadata Informatics Interactive

Webinars

Data Talks, CFOs Listen: Why Analytics Are Key To Better Spend Management

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

How to supercharge data exploration with Pandas Profiling

Domino Data Lab

JANUARY 21, 2021

Producing insights from raw data is a time-consuming process. The Importance of Exploratory Analytics in the Data Science Lifecycle. Exploratory analysis is a critical component of the data science lifecycle. For one, Python remains the leading language for data science research. ref: [link].

Statistics

Statistics Unstructured Data Data Science Visualization

Switching from CPUs to GPUs for NYC Taxi Fare Predictions with NVIDIA RAPIDS

Cloudera

NOVEMBER 3, 2021

Have you ever asked a data scientist if they wanted their code to run faster? According to a poll in Kaggle’s State of Machine Learning and Data Science 2020 , A Convolutional Neural Network was the most popular deep learning algorithm used amongst polled individuals, but it was not even in the top 3. In fact only 43.2%

Deep Learning

Deep Learning Machine Learning Data Science Cost-Benefit

AWS Professional Services scales by improving performance and democratizing data with Amazon QuickSight

AWS Big Data

JUNE 14, 2023

The AWS Professional Services (ProServe) Insights team builds global operational data products that serve over 8,000 users within Amazon. In this post, we discuss how QuickSight has helped us improve our performance, democratize our data, and provide insights to our internal customers at scale.

Dashboards

Dashboards KPI Scorecard Metadata

Manual Feature Engineering

Domino Data Lab

AUGUST 20, 2019

Many thanks to AWP Pearson for the permission to excerpt “Manual Feature Engineering: Manipulating Data for Fun and Profit” from the book, Machine Learning with Python for Everyone by Mark E. Feature engineering is useful for data scientists when assessing tradeoff decisions regarding the impact of their ML models.

Testing

Testing Modeling Interactive Measurement

Deep Learning Illustrated: Building Natural Language Processing Models

Domino Data Lab

AUGUST 22, 2019

Data scientists and researchers require an extensive array of techniques, packages, and tools to accelerate core work flow tasks including prepping, processing, and analyzing data. Utilizing NLP helps researchers and data scientists complete core tasks faster. Preprocessing Natural Language Data. Example 11.4

Deep Learning

Deep Learning Modeling Metrics Testing

How to Easily Understand Your Python Objects

Insight

JULY 23, 2019

I frequently run into this issue in my data science workflow with complex objects in libraries, like TensorFlow. kwonlydefaults is a dictionary with keyword-only arg default values. annotations is a dictionary specifying any type annotations. args contains the argument names. kwonlyargs lists names of keyword-only args.

Data Science

Data Science Testing IT Machine Learning

Towards Predictive Accuracy: Tuning Hyperparameters and Pipelines

Domino Data Lab

AUGUST 26, 2019

Data scientists, machine learning (ML) researchers, and business stakeholders have a high-stakes investment in the predictive accuracy of models. Data scientists and researchers ascertain predictive accuracy of models using different techniques, methodologies, and settings, including model parameters and hyperparameters. Introduction.

Testing

Testing Modeling Machine Learning Metrics

Building a Named Entity Recognition model using a BiLSTM-CRF network

Domino Data Lab

JULY 1, 2021

The model achieves relatively high accuracy and all data and code is freely available in the article. The drawback with statistical model-based techniques is that the automated extraction of a comprehensive set of rules requires a large amount of labeled training data. Data exploration and preparation.

Modeling

Modeling Statistics Testing Metrics

How to implement access control and auditing on Amazon Redshift using Immuta

AWS Big Data

OCTOBER 24, 2024

Organizations are looking for products that let them spend less time managing data and more time on core business functions. Data security is one of the key functions in managing a data warehouse. With Immuta integration with Amazon Redshift , user and data security operations are managed using an intuitive user interface.

Data Warehouse

Data Warehouse Business Objectives Data-driven Management

The state of data quality in 2020

Your Effective Roadmap To Implement A Successful Business Intelligence Strategy

Webinars

Trending Sources

Build efficient, cross-Regional, I/O-intensive workloads with Dask on AWS

Webinars

How to supercharge data exploration with Pandas Profiling

Switching from CPUs to GPUs for NYC Taxi Fare Predictions with NVIDIA RAPIDS

AWS Professional Services scales by improving performance and democratizing data with Amazon QuickSight

Manual Feature Engineering

Deep Learning Illustrated: Building Natural Language Processing Models

How to Easily Understand Your Python Objects

Towards Predictive Accuracy: Tuning Hyperparameters and Pipelines

Building a Named Entity Recognition model using a BiLSTM-CRF network

How to implement access control and auditing on Amazon Redshift using Immuta

Stay Connected