2018, Data Science and Deep Learning

2018

Data Science

Deep Learning

Bridging the Gap: New Datasets Push Recommender Research Toward Real-World Scale

KDnuggets

JUNE 11, 2025

Spotify Million Playlist Released for RecSys 2018, this dataset helps analyze short-term and sequential listening behavior. By, Avi Chawla - highly passionate about approaching and explaining data science problems with intuition. Yelp Open Dataset Contains 8.6M reviews, but coverage is sparse and city-specific.

Advertising

Advertising Metadata Machine Learning Data Science

Highlights from the Strata Data Conference in New York 2018

O'Reilly on Data

SEPTEMBER 12, 2018

Watch highlights from expert talks covering data science, machine learning, algorithmic accountability, and more. People from across the data world are coming together in New York for the Strata Data Conference. The future of data warehousing. Watch " Managing risk in machine learning.".

IoT

IoT Machine Learning Deep Learning Data Science

Join 42,000+

Insiders

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Webinars

How to Streamline Payment Applications & Lien Waivers Through Innovative Construction Technology

Airflow Best Practices for ETL/ELT Pipelines

MORE WEBINARS

Trending Sources

How companies are building sustainable AI and ML initiatives

O'Reilly on Data

JANUARY 29, 2019

In 2017, we published “ How Companies Are Putting AI to Work Through Deep Learning ,” a report based on a survey we ran aiming to help leaders better understand how organizations are applying AI through deep learning. We found companies were planning to use deep learning over the next 12-18 months.

Deep Learning

Deep Learning Machine Learning Data Science Metadata

Webinars

How to Streamline Payment Applications & Lien Waivers Through Innovative Construction Technology

Airflow Best Practices for ETL/ELT Pipelines

MORE WEBINARS

5 key areas for tech leaders to watch in 2020

O'Reilly on Data

FEBRUARY 18, 2020

Growth is still strong for such a large topic, but usage slowed in 2018 (+13%) and cooled significantly in 2019, growing by just 7%. Within the data topic, however, ML+AI has gone from 22% of all usage to 26%. In 2019, as in 2018, Python was the most popular language on O’Reilly online learning. Security is surging.

Data-driven

Data-driven Software Statistics Marketing

Top Data Science Tools That Will Empower Your Data Exploration Processes

datapine

AUGUST 14, 2019

Data science has become an extremely rewarding career choice for people interested in extracting, manipulating, and generating insights out of large volumes of data. To fully leverage the power of data science, scientists often need to obtain skills in databases, statistical programming tools, and data visualizations.

Data Science

Data Science Statistics Business Intelligence Visualization

Becoming a machine learning company means investing in foundational technologies

O'Reilly on Data

MAY 21, 2019

This is a good time to assess enterprise activities, as there are many indications a number of companies are already beginning to use machine learning. For example, in a July 2018 survey that drew more than 11,000 respondents, we found strong engagement among companies: 51% stated they already had machine learning models in production.

Machine Learning

Machine Learning Technology Deep Learning Data Science

Artificial intelligence and machine learning adoption in European enterprise

O'Reilly on Data

FEBRUARY 4, 2019

Deep Learning. Temporal data and time-series. Automation in data science and big data. At the 2018 Strata Data London, data privacy and GDPR were big topics. In fact, our 2018 conference happened the same week GDPR came online. Text and Language processing and analysis.

Machine Learning

Machine Learning Enterprise IoT Big Data

12 Alternatives of Sora Easing Our Work in 2024

Analytics Vidhya

APRIL 29, 2024

Introduction In 2018, when we were contemplating whether AI would take over our jobs or not, OpenAI put us on the edge of believing that. Our way of working has completely changed after the inception of OpenAI’s ChatGPT in 2022. But is it a threat or a boon?

Software

Software Analytics IT Deep Learning

Free Data Science University Course Notes

Data Science 101

MARCH 6, 2019

University can be a great way to learn data science. Luckily, a few of them are willing to share data science, machine learning and deep learning materials online for everyone. However, many universities are very expensive, difficult to get admitted, or not geographically feasible.

Data Science

Data Science Deep Learning Machine Learning

Deep Learning Illustrated: Building Natural Language Processing Models

Domino Data Lab

AUGUST 22, 2019

Many thanks to Addison-Wesley Professional for providing the permissions to excerpt “Natural Language Processing” from the book, Deep Learning Illustrated by Krohn , Beyleveld , and Bassens. The excerpt covers how to create word vectors and utilize them as an input into a deep learning model. Introduction.

Deep Learning

Deep Learning Modeling Metrics Testing

Meta-Learning For Better Machine Learning

Rocket-Powered Data Science

DECEMBER 16, 2018

In a related post we discussed the Cold Start Problem in Data Science — how do you start to build a model when you have either no training data or no clear choice of model parameters. Workshop on Meta-Learning (MetaLearn 2018).

Machine Learning

Machine Learning Metrics Data Science Optimization

End-to-End Object Detection for Furniture Using Deep Learning

Insight

MAY 16, 2019

It is a high-level, multifaceted field that allows machines to iteratively learn and understand complex representations from images and videos to automate human visual tasks. How Deep Learning scales based on the amount of Data [Copyright: Andrew Ng ]. He currently works as a Data Scientist at Lowe’s Companies, Inc.

Deep Learning

Deep Learning Machine Learning Data Science Metadata

Lessons learned building natural language processing systems in health care

O'Reilly on Data

MARCH 7, 2019

Language understanding benefits from every part of the fast-improving ABC of software: AI (freely available deep learning libraries like PyText and language models like BERT ), big data (Hadoop, Spark, and Spark NLP ), and cloud (GPU's on demand and NLP-as-a-service from all the major cloud providers). IBM Watson NLU.

Deep Learning

Deep Learning Testing Machine Learning Modeling

Highlights from the Maryland Data Science Conference: Deep Learning on Imagery and Text

Domino Data Lab

MARCH 6, 2019

Niels Kasch , cofounder of Miner & Kasch , an AI and Data Science consulting firm, provides insight from a deep learning session that occurred at the Maryland Data Science Conference. You may also remember UMBC from the miracle at the 2018 NCAA Tournament.) Deep Learning on Imagery and Text.

Deep Learning

Deep Learning Data Science Machine Learning Data Processing

Data Scientist’s Dilemma – The Cold Start Problem

Rocket-Powered Data Science

DECEMBER 15, 2018

In deep learning, as in typical neural network models, the method by which those adjustments to the model parameters are estimated ( i.e., for each of the edge weights between the network nodes) is called backpropagation. .”

Machine Learning

Machine Learning Optimization Metrics Deep Learning

Themes and Conferences per Pacoid, Episode 11

Domino Data Lab

JULY 2, 2019

In other words, using metadata about data science work to generate code. In this case, code gets generated for data preparation, where so much of the “time and labor” in data science work is concentrated. Scale the problem to handle complex data structures. BTW, videos for Rev2 are up: [link].

Metadata

Metadata Data Science Machine Learning Data-driven

Proposals for model vulnerability and security

O'Reilly on Data

MARCH 20, 2019

Watermarking is a term borrowed from the deep learning security literature that often refers to putting special pixels into an image to trigger a desired outcome from your model. It seems entirely possible to do the same with customer or transactional data. Machine Learning 81.2 DZone (2018). ACM (2018).

Modeling

Modeling Machine Learning Predictive Modeling Consulting

Advice for aspiring data scientists and other FAQs

Data Science and Beyond

OCTOBER 15, 2017

Do you know what data science is? Do you understand what data scientists do? Just so we’re on the same page, what is data science? Here are my thoughts from 2014 on defining data science as the intersection of software engineering and statistics , and a more recent post on defining data science in 2018.

Data Science

Data Science Deep Learning Machine Learning Data-driven

IBM and Data Science are Helping Save the World through Call for Code

Business Over Broadway

SEPTEMBER 5, 2018

Their first challenge, the 2018 Call for Code Global Challenge , is a competition that asks developers to create solutions to reduce the deleterious impact of natural disasters on human lives, health, and wellbeing by improving the current state of natural disaster preparedness. Data is the Fuel; Data Science is the Engine.

Data Science

Data Science Statistics Advertising Deep Learning

Model Interpretability with TCAV (Testing with Concept Activation Vectors)

Domino Data Lab

JANUARY 20, 2019

What if there was a way to quantitatively measure whether your machine learning (ML) model reflects specific domain expertise or potential bias? TCAV “uses directional derivatives to quantify the degree to which a user-defined concept is important to a classification result” ( Kim et al 2018 ). MLConf 2018. Introduction.

Testing

Testing Modeling Machine Learning Data Science

Trending Toward Concept Building – A Review of Model Interpretability for Deep Neural Networks

Domino Data Lab

MARCH 16, 2021

2018) Simple meaningless data processing steps, may cause saliency methods to result in significant changes (Kindermans et al., DeepLIFT was recently proposed as a recursive prediction explanation method for deep learning [8, 7]. This is an exciting and important area of data science research. Saliency Maps.

Modeling

Modeling Deep Learning Testing Optimization

7 things you need to know about AI and the data center

CIO Business Intelligence

JULY 22, 2025

This wasn’t about teaching deep learning, but about maintaining infrastructure that doesn’t break when an AI tool plugs in.” AI amplifies the need for automation, and teams need to collaborate across domains: data science, DevOps, and IT,” he says. Not just to plan, but to co-own and co-create success.”

Machine Learning

Machine Learning Data Governance Modeling Strategy

Natural Language in Python using spaCy: An Introduction

Domino Data Lab

SEPTEMBER 9, 2019

Data science teams in industry must work with lots of text, one of the top four categories of data used in machine learning. That’s excellent for supporting really interesting workflow integrations in data science work. Usually it’s human-generated text, but not always.

Deep Learning

Deep Learning Machine Learning Data Science Visualization

Themes and Conferences per Pacoid, Episode 8

Domino Data Lab

APRIL 3, 2019

The top three items are essentially “the devil you know” for firms which want to invest in data science: data platform, integration, data prep. Data governance shows up as the fourth-most-popular kind of solution that enterprise teams were adopting or evaluating during 2019. More Policies Emerged” (2010-2018).

Machine Learning

Machine Learning Data Governance Metadata Data Science

Themes and Conferences per Pacoid, Episode 7

Domino Data Lab

MARCH 3, 2019

Paco Nathan covers recent research on data infrastructure as well as adoption of machine learning and AI in the enterprise. Welcome back to our monthly series about data science! This month, the theme is not specifically about conference summaries; rather, it’s about a set of follow-up surveys from Strata Data attendees.

Data Science

Data Science Deep Learning Machine Learning Modeling

Using BERT for state-of-the-art pre-training for natural language processing

Insight

FEBRUARY 13, 2019

He is currently a machine learning engineer at Casetext where he works on natural language processing for the legal industry. In late 2018, Google open-sourced BERT, a powerful deep learning algorithm for natural language processing. Prior to Insight, he was at IBM Watson.

Testing

Testing Machine Learning Deep Learning Cost-Benefit

How foundation models and data stores unlock the business potential of generative AI

IBM Big Data Hub

AUGUST 1, 2023

It’s the underlying engine that gives generative models the enhanced reasoning and deep learning capabilities that traditional machine learning models lack. A specific kind of foundation model known as a large language model (LLM) is trained on vast amounts of text data for NLP tasks.

Modeling

Modeling Cost-Benefit Machine Learning Data Lake

Machine Learning Projects: Challenges and Best Practices

Domino Data Lab

FEBRUARY 10, 2019

Humans likely not even notice the difference but modern deep learning networks suffered a lot. But apparently, models trained on text from 2017 experience degraded performance on text written in 2018. Machine Learning requires lots and lots of relevant training data. We might expect that.

Machine Learning

Machine Learning Modeling Deep Learning Metrics

Using NLP to gain insights from employee review data

Insight

MARCH 28, 2019

For the Fall 2018 session of the Insight Fellows Program in NYC, we launched a new partnership with Thinknum , a company that provides alternative data indexed from the web to institutional investors and corporations? Utilize information hidden away in online employee reviews.

Dashboards

Dashboards Data Science Visualization Machine Learning

And the winners are…. Congratulations to the Sixth Annual Data Impact Awards winners

Cloudera

SEPTEMBER 12, 2018

It was deeply gratifying to see so many organizations deploying the tools and techniques of data science and advanced analytics to solve difficult and important problems. I predict that next year’s competition will be even more amazing as we continue pushing the frontiers of data science forward. Societal Impact:

Machine Learning

Machine Learning Big Data Data Science Data Warehouse

Building a Speaker Recognition Model

Domino Data Lab

JULY 17, 2021

Further, deep learning methods are built on the foundation of signal processing. The post Building a Speaker Recognition Model appeared first on Data Science Blog by Domino. While more advanced models for speaker verification exist, this blog will form a basis of speech signal processing. References: [1] [link]. [2]

Modeling

Modeling Testing Machine Learning Measurement

6 trends framing the state of AI and ML

O'Reilly on Data

MARCH 19, 2020

Our analysis of ML- and AI-related data from the O’Reilly online learning platform indicates: Unsupervised learning surged in 2019, with usage up by 172%. Deep learning cooled slightly in 2019, slipping 10% relative to 2018, but deep learning still accounted for 22% of all AI/ML usage.

Deep Learning

Deep Learning Machine Learning Data-driven Modeling

Hitting the Gym With Neural Networks: Implementing a CNN to Classify Gym Equipment

Insight

JANUARY 14, 2020

Will a network trained with fake data be able to generalize to the real world? Lauren Holzbauer was an Insight Fellow in Summer 2018. Keras is an open source deep learning API that was written in Python and runs on top of Tensorflow, so it’s a little more user-friendly and high-level than Tensorflow.

Metrics

Metrics Optimization Modeling Testing

Data Science, Past & Future

Domino Data Lab

JULY 22, 2019

Paco Nathan presented, “Data Science, Past & Future” , at Rev. At Rev’s “ Data Science, Past & Future” , Paco Nathan covered contextual insight into some common impactful themes over the decades that also provided a “lens” help data scientists, researchers, and leaders consider the future.

Data Science

Data Science Machine Learning Data Governance Modeling

Themes and Conferences per Pacoid, Episode 9

Domino Data Lab

MAY 8, 2019

The lens of reductionism and an overemphasis on engineering becomes an Achilles heel for data science work. Instead, consider a “full stack” tracing from the point of data collection all the way out through inference. Machine learning model interpretability. 2018-06-21). back to the structure of the dataset.

Machine Learning

Machine Learning Data Science Modeling Visualization

Better Preference Predictions: Tunable and Explainable Recommender Systems

Insight

MARCH 5, 2019

As machine learning (ML) algorithms increase in popularity and the “black boxes” of neural networks in deep learning (DL) become the industry standard, pulling out the demographic information and the user history responsible for each undesired rec becomes next to impossible to achieve.

Machine Learning

Machine Learning Internet of Things Recreation/Entertainment Deep Learning

Convolutional Neural Networks Explained…with American Ninja Warrior

Insight

DECEMBER 30, 2019

Lauren Holzbauer was an Insight Fellow in Summer 2018. Since CNNs are no longer built this way, we won’t go into any further detail here (but you can read about them here: "Key Deep Learning Architectures: LeNet-5" and of course in the original paper ). Let’s use our ninja skills to figure out what CNNs are really doing.

Visualization

Visualization Machine Learning Modeling IT

Topics to watch at the Strata Data Conference in New York 2019

O'Reilly on Data

SEPTEMBER 11, 2019

Machine learning, artificial intelligence, data engineering, and architecture are driving the data space. The Strata Data Conferences helped chronicle the birth of big data, as well as the emergence of data science, streaming, and machine learning (ML) as disruptive phenomena. 221) to 2019 (No.

IoT

IoT Big Data Data Warehouse Uncertainty

Bridging the Gap: New Datasets Push Recommender Research Toward Real-World Scale

Highlights from the Strata Data Conference in New York 2018

Webinars

Trending Sources

How companies are building sustainable AI and ML initiatives

Webinars

5 key areas for tech leaders to watch in 2020

Top Data Science Tools That Will Empower Your Data Exploration Processes

Becoming a machine learning company means investing in foundational technologies

Artificial intelligence and machine learning adoption in European enterprise

12 Alternatives of Sora Easing Our Work in 2024

Free Data Science University Course Notes

Top BOB Blog Posts of 2018: Data Science, Machine Learning and the Net Promoter Score

Deep Learning Illustrated: Building Natural Language Processing Models

Meta-Learning For Better Machine Learning

End-to-End Object Detection for Furniture Using Deep Learning

Lessons learned building natural language processing systems in health care

Highlights from the Maryland Data Science Conference: Deep Learning on Imagery and Text

Data Scientist’s Dilemma – The Cold Start Problem

Themes and Conferences per Pacoid, Episode 11

Proposals for model vulnerability and security

Advice for aspiring data scientists and other FAQs

IBM and Data Science are Helping Save the World through Call for Code

Model Interpretability with TCAV (Testing with Concept Activation Vectors)

Trending Toward Concept Building – A Review of Model Interpretability for Deep Neural Networks

7 things you need to know about AI and the data center

Natural Language in Python using spaCy: An Introduction

Themes and Conferences per Pacoid, Episode 8

Themes and Conferences per Pacoid, Episode 7

Using BERT for state-of-the-art pre-training for natural language processing

How foundation models and data stores unlock the business potential of generative AI

Machine Learning Projects: Challenges and Best Practices

Using NLP to gain insights from employee review data

And the winners are…. Congratulations to the Sixth Annual Data Impact Awards winners

Building a Speaker Recognition Model

6 trends framing the state of AI and ML

Hitting the Gym With Neural Networks: Implementing a CNN to Classify Gym Equipment

Data Science, Past & Future

Themes and Conferences per Pacoid, Episode 9

Better Preference Predictions: Tunable and Explainable Recommender Systems

Convolutional Neural Networks Explained…with American Ninja Warrior

Topics to watch at the Strata Data Conference in New York 2019

Stay Connected