Blog - Data Leaders Brief

Top 14 Must-Read Data Science Books You Need On Your Desk

datapine

MAY 14, 2019

“Big data is at the foundation of all the megatrends that are happening.” – Chris Lynch, big data expert. We live in a world saturated with data. Zettabytes of data are floating around in our digital universe, just waiting to be analyzed and explored, according to AnalyticsWeek. Wondering which data science book to read?

Data Science

Data Science Machine Learning Big Data Data-driven

10 Technical Blogs for Data Scientists to Advance AI/ML Skills

DataRobot Blog

DECEMBER 6, 2022

Savvy data scientists are already applying artificial intelligence and machine learning to accelerate the scope and scale of data-driven decisions in strategic organizations. These data science teams are seeing tremendous results—millions of dollars saved, new customers acquired, and new innovations that create a competitive advantage.

Experimentation

Experimentation Forecasting Data-driven Machine Learning

Demystify data sharing and collaboration patterns on AWS: Choosing the right tool for the job

AWS Big Data

OCTOBER 21, 2024

Data is the most significant asset of any organization. However, enterprises often encounter challenges with data silos, insufficient access controls, poor governance, and quality issues. Embracing data as a product is the key to address these challenges and foster a data-driven culture.

Sales

Sales Data-driven Data Processing Key Performance Indicator

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Data Talks, CFOs Listen: Why Analytics Are Key To Better Spend Management

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

The unreasonable importance of data preparation

O'Reilly on Data

MARCH 24, 2020

In a world focused on buzzword-driven models and algorithms, you’d be forgiven for forgetting about the unreasonable importance of data preparation and quality: your models are only as good as the data you feed them. Why is high-quality and accessible data foundational? Re-analyzing existing data is often very bad.”

Machine Learning

Machine Learning Statistics Data Quality Data Collection

The DataOps Vendor Landscape, 2021

DataKitchen

APRIL 13, 2021

Read the complete blog below for a more detailed description of the vendors and their capabilities. This is not surprising given that DataOps enables enterprise data teams to generate significant business value from their data. Testing and Data Observability. Reflow — A system for incremental data processing in the cloud.

Testing

Testing Machine Learning Consulting Data Science

Implement a custom subscription workflow for unmanaged Amazon S3 assets published with Amazon DataZone

AWS Big Data

DECEMBER 19, 2024

Organizational data is often fragmented across multiple lines of business, leading to inconsistent and sometimes duplicate datasets. This fragmentation can delay decision-making and erode trust in available data. This solution enhances governance and simplifies access to unstructured data assets across the organization.

Publishing

Publishing Unstructured Data Metadata Data-driven

Simplify data integration with AWS Glue and zero-ETL to Amazon SageMaker Lakehouse

AWS Big Data

DECEMBER 4, 2024

With the growing emphasis on data, organizations are constantly seeking more efficient and agile ways to integrate their data, especially from a wide variety of applications. In addition, organizations rely on an increasingly diverse array of digital systems, data fragmentation has become a significant challenge.

Data Integration

Data Integration Data Lake Statistics Data-driven

Agentic AI design: An architectural case study

CIO Business Intelligence

NOVEMBER 19, 2024

However, they are used as a prominent component of agentic AI. As we look to identify uses for AI Agents, we will find many opportunities. You can use these agents through a process called chaining, where you break down complex tasks into manageable tasks that agents can perform as part of an automated workflow.

Testing

Testing Cost-Benefit Interactive ROI

What is a Data Mesh?

DataKitchen

AUGUST 3, 2021

The data mesh design pattern breaks giant, monolithic enterprise data architectures into subsystems or domains, each managed by a dedicated team. DataOps helps the data mesh deliver greater business agility by enabling decentralized domains to work in concert. . But first, let’s define the data mesh design pattern.

Data Architecture

Data Architecture Data Lake Cost-Benefit Data Warehouse

Generative AI in the Enterprise

O'Reilly on Data

NOVEMBER 28, 2023

In enterprises, we’ve seen everything from wholesale adoption to policies that severely restrict or even forbid the use of generative AI. Our survey focused on how companies use generative AI, what bottlenecks they see in adoption, and what skills gaps need to be addressed. What’s the reality? Only 4% pointed to lower head counts.

Enterprise

Enterprise Testing Modeling Reporting

How to Use ChatGPT as a Data Scientist?

Analytics Vidhya

APRIL 8, 2023

Introduction Are you a data scientist looking for an exciting and informative read? My latest blog post is jam-packed with fun and innovative experiments that I conducted with ChatGPT over the weekend. In this experiment, I put ChatGPT to the test and challenged it to […] The post How to Use ChatGPT as a Data Scientist?

Testing

Testing Analytics IT Data Science

Enhancing Search Relevancy with Cohere Rerank 3.5 and Amazon OpenSearch Service

AWS Big Data

DECEMBER 18, 2024

In this blog post, well dive into the various scenarios for how Cohere Rerank 3.5 OpenSearch Service offers robust search capabilities, including URI searches for simple queries and request body searches using a domain-specific language for complex queries. OpenSearch Service natively supports BM25. See Cohere Rerank 3.5

Metrics

Metrics Modeling Data Processing Machine Learning

2021 Data/AI Salary Survey

O'Reilly on Data

SEPTEMBER 15, 2021

In June 2021, we asked the recipients of our Data & AI Newsletter to respond to a survey about compensation. The results gave us insight into what our subscribers are paid, where they’re located, what industries they work for, what their concerns are, and what sorts of career development opportunities they’re pursuing.

Machine Learning

Machine Learning Statistics Reporting Consulting

AI Product Management After Deployment

O'Reilly on Data

OCTOBER 13, 2020

In contrast, many production AI systems rely on feedback loops that require the same technical skills used during initial development. This distinction assumes a slightly different definition of debugging than is often used in software development. Proper AI product monitoring is essential to this outcome. I/O validation.

Management

Management Machine Learning Metrics Modeling

The next generation of Amazon SageMaker: The center for all your data, analytics, and AI

AWS Big Data

DECEMBER 4, 2024

This week on the keynote stages at AWS re:Invent 2024, you heard from Matt Garman, CEO, AWS, and Swami Sivasubramanian, VP of AI and Data, AWS, speak about the next generation of Amazon SageMaker , the center for all of your data, analytics, and AI. They aren’t using analytics and AI tools in isolation.

Data Analytics

Data Analytics Analytics Data Lake Data Quality

Accelerate your migration to Amazon OpenSearch Service with Reindexing-from-Snapshot

AWS Big Data

NOVEMBER 22, 2024

It is appealing to migrate from self-managed OpenSearch and Elasticsearch clusters in legacy versions to Amazon OpenSearch Service to enjoy the ease of use, native integration with AWS services, and rich features from the open-source environment ( OpenSearch is now part of Linux Foundation ). to OpenSearch 2.x),

Snapshot

Snapshot Metadata Recreation/Entertainment Data Processing

Business Strategies for Deploying Disruptive Tech: Generative AI and ChatGPT

Rocket-Powered Data Science

FEBRUARY 15, 2023

It is important to realize that the usual “hype cycle” rules prevail in such cases as this. Third, any commitment to a disruptive technology (including data-intensive and AI implementations) must start with a business strategy. These changes may include requirements drift, data drift, model drift, or concept drift.

Strategy

Strategy Experimentation Uncertainty Machine Learning

Addressing Data Mesh Technical Challenges with DataOps

DataKitchen

AUGUST 9, 2021

Below is our third post (3 of 5) on combining data mesh with DataOps to foster greater innovation while addressing the challenges of a decentralized architecture. We’ve talked about data mesh in organizational terms (see our first post, “ What is a Data Mesh? ”) and how team structure supports agility. Source: Thoughtworks.

Testing

Testing Data Lake Metadata Publishing

The Top 20 Data Visualization Books That Should Be On Your Bookshelf

datapine

SEPTEMBER 16, 2022

Previously, we discussed the top 19 big data books you need to read, followed by our rundown of the world’s top business intelligence books as well as our list of the best SQL books for beginners and intermediates. Data visualization, or ‘data viz’ as it’s commonly known, is the graphic presentation of data.

Visualization

Visualization Dashboards Data-driven Statistics

Three Emerging Analytics Products Derived from Value-driven Data Innovation and Insights Discovery in the Enterprise

Rocket-Powered Data Science

JULY 19, 2023

I recently saw an informal online survey that asked users which types of data (tabular, text, images, or “other”) are being used in their organization’s analytics applications. The results showed that (among those surveyed) approximately 90% of enterprise analytics applications are being built on tabular data.

Data-driven

Data-driven Enterprise Analytics Machine Learning

6 Case Studies on The Benefits of Business Intelligence And Analytics

datapine

JANUARY 31, 2022

Using business intelligence and analytics effectively is the crucial difference between companies that succeed and companies that fail in the modern environment. The main use of business intelligence is to help business units, managers, top executives, and other operational workers make better-informed decisions backed up with accurate data.

Business Intelligence

Business Intelligence Analytics Cost-Benefit ROI

Introducing a new unified data connection experience with Amazon SageMaker Lakehouse unified data connectivity

AWS Big Data

DECEMBER 16, 2024

The need to integrate diverse data sources has grown exponentially, but there are several common challenges when integrating and analyzing data from multiple sources, services, and applications. First, you need to create and maintain independent connections to the same data source for different services.

Visualization

Visualization Data Processing Testing Publishing

Move Beyond Excel, PowerPoint And Static Business Reporting with Powerful Interactive Dashboards

datapine

OCTOBER 14, 2020

Visualizing the data and interacting on a single screen is no longer a luxury but a business necessity. A professional dashboard maker enables you to access data on a single screen, easily share results, save time, and increase productivity. That’s why we welcome you to the world of interactive dashboards.

Dashboards

Dashboards Interactive Reporting KPI

Forrester – Chart Your Course To Insights-Driven Business Maturity

DataKitchen

MAY 10, 2021

As organizations strive to become more data-driven, Forrester recommends 5 actions to take to move from one stage of insights-driven business maturity to another. . Intermediates: Build on your successes and work to scale your IDB capabilities across the enterprise using agile and adaptive DevOps, DataOps, and ModelOps processes. .

Data-driven

Data-driven Data Architecture Machine Learning Strategy

Pitching a DataOps Project That Matters

DataKitchen

FEBRUARY 1, 2021

DataOps addresses a broad set of use cases because it applies workflow process automation to the end-to-end data-analytics lifecycle. These benefits are hugely important for data professionals, but if you made a pitch like this to a typical executive, you probably wouldn’t generate much enthusiasm.

Business Objectives

Business Objectives Cost-Benefit Measurement Metrics

Accelerating Drug Discovery and Development with DataOps

DataKitchen

AUGUST 13, 2021

If a company can use data to identify compounds more quickly and accelerate the development process, it can monetize its drug pipeline more effectively. DataOps automation provides a way to boost innovation and improve collaboration related to data in pharmaceutical research and development (R&D).

Testing

Testing Dashboards Marketing Measurement

10 Big Data Examples Showing The Great Value of Smart Analytics In Real Life At Restaurants, Bars, and Casinos

datapine

APRIL 14, 2022

“You can have data without information, but you cannot have information without data.” – Daniel Keys Moran. When you think of big data, you usually think of applications related to banking, healthcare analytics , or manufacturing. However, the usage of data analytics isn’t limited to only these fields. Discover 10.

Big Data

Big Data Recreation/Entertainment Analytics Data-driven

The Role of Containers on MLOps and Model Production

Domino Data Lab

FEBRUARY 4, 2021

Container technology has changed the way data science gets done. The original container use case for data science focused on what I call, “environment management”. Container orchestration has the following benefits in data science work: Remove central IT bottlenecks in the MLOps life cycle.

Modeling

Modeling Data Science Data Processing Software

Enriching metadata for accurate text-to-SQL generation for Amazon Athena

AWS Big Data

OCTOBER 14, 2024

Enterprise data is brought into data lakes and data warehouses to carry out analytical, reporting, and data science use cases using AWS analytical services like Amazon Athena , Amazon Redshift , Amazon EMR , and so on. We use Anthropic’s Claude 2.1 We use Anthropic’s Claude 2.1

Metadata

Metadata Data Lake Modeling Data Warehouse

What Are ChatGPT and Its Friends?

O'Reilly on Data

MARCH 23, 2023

What is it, how does it work, what can it do, and what are the risks of using it? Maybe it’s surprising that ChatGPT can write software, maybe it isn’t; we’ve had over a year to get used to GitHub Copilot, which was based on an earlier version of GPT. A quick scan of the web will show you lots of things that ChatGPT can do.

IT

IT Modeling Testing Risk

Implementing a Pharma Data Mesh using DataOps

DataKitchen

AUGUST 19, 2021

Below is our fourth post (4 of 5) on combining data mesh with DataOps to foster innovation while addressing the challenges of a decentralized architecture. We’ve covered the basic ideas behind data mesh and some of the difficulties that must be managed. Below is a discussion of a data mesh implementation in the pharmaceutical space.

Data Warehouse

Data Warehouse Data Lake Manufacturing Testing

A Guide To The Methods, Benefits & Problems of The Interpretation of Data

datapine

JANUARY 6, 2022

1) What Is Data Interpretation? 2) How To Interpret Data? 3) Why Data Interpretation Is Important? 4) Data Analysis & Interpretation Problems. 5) Data Interpretation Techniques & Methods. 6) The Use of Dashboards For Data Interpretation. What Is Data Interpretation? Table of Contents.

Visualization

Visualization Dashboards Cost-Benefit Measurement

CIOs must beware committing ‘AI washing’ themselves

CIO Business Intelligence

JULY 23, 2024

The SEC had charged the two firms with making misleading statements about their use of AI for investment advice, and the companies paid $400,000 in the settlement. Beyond regulatory problems, companies overstating their AI use could expose themselves to shareholder lawsuits and a loss in customer trust, Shargel adds.

Uncertainty

Uncertainty Machine Learning Business Intelligence Marketing

Why Data Driven Decision Making is Your Path To Business Success

datapine

APRIL 16, 2019

The term ‘big data’ alone has become something of a buzzword in recent times – and for good reason. By implementing the right reporting tools and understanding how to analyze as well as to measure your data accurately, you will be able to make the kind of data driven decisions that will drive your business forward.

Data-driven

Data-driven Dashboards Visualization Cost-Benefit

Data Science Tools: Understanding the Multiverse

Domino Data Lab

JULY 15, 2021

In the multiverse of data science, the tool options continue to expand and evolve. While there are certainly engineers and scientists who may be entrenched in one camp or another (the R camp vs. Python, for example, or SAS vs. MATLAB), there has been a growing trend towards dispersion of data science tools. Snowflake ).

Data Science

Data Science Visualization Enterprise Modeling

Microsoft bolsters quantum platform with gen AI, molecular simulation capabilities

CIO Business Intelligence

JUNE 19, 2024

Microsoft has added generative artificial intelligence and other enhanced features to its quantum-computing platform as part of a larger strategy to deliver the game-changing technology to a broader range of users — in this case, the scientific community.

Strategy

Strategy Consulting Optimization Technology

Microsoft partners with industry leaders to offer vertical SLMs

CIO Business Intelligence

NOVEMBER 20, 2024

At its Microsoft Ignite 2024 show in Chicago this week, Microsoft and industry partner experts showed off the power of small language models (SLMs) with a new set of fine-tuned, pre-trained AI models using industry-specific data. The company notes that customers can also use the models to configure agents in Microsoft Copilot Studio.

Manufacturing

Manufacturing Marketing Finance Modeling

Unlock scalable analytics with a secure connectivity pattern in AWS Glue to read from or write to Snowflake

AWS Big Data

AUGUST 19, 2024

In today’s data-driven world, the ability to seamlessly integrate and utilize diverse data sources is critical for gaining actionable insights and driving innovation. Use case Consider a large ecommerce company that relies heavily on data-driven insights to optimize its operations, marketing strategies, and customer experiences.

Analytics

Analytics Data-driven Data Integration Data Lake

Predictive Analytics Supports Citizen Data Scientists!

Smarten

FEBRUARY 19, 2025

Use Predictive Analytics for Fact-Based Decisions! It must be based on historical data, facts and clear insight into trends and patterns in the market, the competition and customer buying behavior. Like every other business, your organization must plan for success. billion USD in 2022 and is expected to reach $38 billion USD by 2028.

Predictive Analytics

Predictive Analytics Analytics Predictive Modeling Forecasting

Enterprise-class NLP with spaCy v3

Domino Data Lab

FEBRUARY 19, 2021

These features combine to make spaCy better than ever at processing large volumes of text and tuning your configuration to match your specific use case in a way that provides better accuracy. That blog can be found here. ", "David joined Domino Data Lab in February 2020." Bench-marking data from [link].

Enterprise

Enterprise Data Science Modeling Visualization

7 famous analytics and AI disasters

CIO Business Intelligence

APRIL 15, 2022

In 2017, The Economist declared that data, rather than oil, had become the world’s most valuable resource. Organizations across every industry have been and continue to invest heavily in data and analytics. But like oil, data and analytics have their dark side. The paper determined the technique not fit for clinical use.

Analytics

Analytics Machine Learning Insurance Testing

Switching from CPUs to GPUs for NYC Taxi Fare Predictions with NVIDIA RAPIDS

Cloudera

NOVEMBER 3, 2021

Have you ever asked a data scientist if they wanted their code to run faster? While deep learning is an excellent use of the processing power of a graphics card, it is not the only use. of respondents reported to use CNN’s. You would probably get a more varied response asking if the earth is flat. In fact only 43.2%

Deep Learning

Deep Learning Machine Learning Data Science Cost-Benefit

Enterprise Data Science Workflows with AMPs and Streamlit

Cloudera

JULY 27, 2021

Only through hands-on experimentation can we discern truly useful new algorithmic capabilities from hype. These fleshed-out web applications are representative end products of data science work. There are many uses for interactive applications in the machine learning development lifecycle.

Data Science

Data Science Enterprise Experimentation Machine Learning

Start DataOps Today with ‘Lean DataOps’

DataKitchen

SEPTEMBER 20, 2021

Data organizations don’t always have the budget or schedule required for DataOps when conceived as a top-to-bottom, enterprise-wide transformational change. DataOps can and should be implemented in small steps that complement and build upon existing workflows and data pipelines. Figure 1: The four phases of Lean DataOps. production).

Testing

Testing Metrics Measurement Dashboards

Top 14 Must-Read Data Science Books You Need On Your Desk

10 Technical Blogs for Data Scientists to Advance AI/ML Skills

Webinars

Trending Sources

Demystify data sharing and collaboration patterns on AWS: Choosing the right tool for the job

Webinars

The unreasonable importance of data preparation

The DataOps Vendor Landscape, 2021

Implement a custom subscription workflow for unmanaged Amazon S3 assets published with Amazon DataZone

Simplify data integration with AWS Glue and zero-ETL to Amazon SageMaker Lakehouse

Agentic AI design: An architectural case study

What is a Data Mesh?

Generative AI in the Enterprise

How to Use ChatGPT as a Data Scientist?

Enhancing Search Relevancy with Cohere Rerank 3.5 and Amazon OpenSearch Service

2021 Data/AI Salary Survey

AI Product Management After Deployment

The next generation of Amazon SageMaker: The center for all your data, analytics, and AI

Accelerate your migration to Amazon OpenSearch Service with Reindexing-from-Snapshot

Business Strategies for Deploying Disruptive Tech: Generative AI and ChatGPT

Addressing Data Mesh Technical Challenges with DataOps

The Top 20 Data Visualization Books That Should Be On Your Bookshelf

Three Emerging Analytics Products Derived from Value-driven Data Innovation and Insights Discovery in the Enterprise

6 Case Studies on The Benefits of Business Intelligence And Analytics

Introducing a new unified data connection experience with Amazon SageMaker Lakehouse unified data connectivity

Move Beyond Excel, PowerPoint And Static Business Reporting with Powerful Interactive Dashboards

Forrester – Chart Your Course To Insights-Driven Business Maturity

Pitching a DataOps Project That Matters

Accelerating Drug Discovery and Development with DataOps

10 Big Data Examples Showing The Great Value of Smart Analytics In Real Life At Restaurants, Bars, and Casinos

The Role of Containers on MLOps and Model Production

Enriching metadata for accurate text-to-SQL generation for Amazon Athena

What Are ChatGPT and Its Friends?

Implementing a Pharma Data Mesh using DataOps

A Guide To The Methods, Benefits & Problems of The Interpretation of Data

CIOs must beware committing ‘AI washing’ themselves

Why Data Driven Decision Making is Your Path To Business Success

Data Science Tools: Understanding the Multiverse

Microsoft bolsters quantum platform with gen AI, molecular simulation capabilities

Microsoft partners with industry leaders to offer vertical SLMs

Unlock scalable analytics with a secure connectivity pattern in AWS Glue to read from or write to Snowflake

Predictive Analytics Supports Citizen Data Scientists!

Enterprise-class NLP with spaCy v3

7 famous analytics and AI disasters

Switching from CPUs to GPUs for NYC Taxi Fare Predictions with NVIDIA RAPIDS

Enterprise Data Science Workflows with AMPs and Streamlit

Start DataOps Today with ‘Lean DataOps’

Stay Connected