Collibra Brings Effective Data Governance to Line-of-Business

David Menninger's Analyst Perspectives

Collibra is a data governance software company that offers tools for metadata management and data cataloging. The software enables organizations to find data quickly, identify its source and assure its integrity.

Deep automation in machine learning

O'Reilly on Data

We need to do more than automate model building with autoML; we need to automate tasks at every stage of the data pipeline. In a previous post, we talked about applications of machine learning (ML) to software development, which included a tour through sample tools in data science and for managing data infrastructure.

Becoming a machine learning company means investing in foundational technologies

O'Reilly on Data

Companies successfully adopt machine learning either by building on existing data products and services, or by modernizing existing models and algorithms. In this post, I share slides and notes from a keynote I gave at the Strata Data Conference in London earlier this year. Use ML to unlock new data types—e.g.,

Specialized tools for machine learning development and model governance are becoming essential

O'Reilly on Data

Why companies are turning to specialized machine learning tools like MLflow. A few years ago, we started publishing articles (see “Related resources” at the end of this post) on the challenges facing data teams as they start taking on more machine learning (ML) projects. The upcoming 0.9.0

How EUROGATE established a data mesh architecture using Amazon DataZone

AWS Big Data

Data landscape in EUROGATE and current challenges faced in data governance: The EUROGATE Group is a conglomerate of container terminals and service providers, providing container handling, intermodal transports, maintenance and repair, and seaworthy packaging services. Eliminate centralized bottlenecks and complex data pipelines.

The state of data quality in 2020

O'Reilly on Data

Just 20% of organizations publish data provenance and data lineage. Adopting AI can help data quality. Almost half (48%) of respondents say they use data analysis, machine learning, or AI tools to address data quality issues. Can AI be a catalyst for improved data quality?

SAP Datasphere Powers Business at the Speed of Data

Rocket-Powered Data Science

We live in a data-rich, insights-rich, and content-rich world. Data collections are the ones and zeroes that encode the actionable insights (patterns, trends, relationships) that we seek to extract from our data through machine learning and data science. I will finish with three quotes.