Sat.Aug 14, 2021 - Fri.Aug 20, 2021

article thumbnail

What is Data Lineage?

Alation

When you think of lineage, what typically comes to mind is one’s ancestry or pedigree. It’s a family tree that traces a path back past your parents, grandparents, and more, showing from whom you descended and how you’re related. SPOILER ALERT! Lineage traces origin in a “family tree”. The same can be said for data, too. Data lineage shows the history of the data you’re looking at today, detailing where it originated and how it may have changed over time.

article thumbnail

A Day in the Life of a DataOps Engineer

DataKitchen

DataKitchen's DataOps Engineers Priyanjna Sharma & Chip Bloche discuss what DataOps Engineering entails, key skills required & when to add one to your data team. The post A Day in the Life of a DataOps Engineer first appeared on DataKitchen.

130
130
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

The Growing Role of Analytics in Product Development

Smart Data Collective

Data is an important part of any business, including areas that focus on product development. With data, you can analyze it and then incorporate your findings into the development stages of the product’s creation. As such, you end up with a final product that is a lot more effective than it would be without that influence of analysis. The results of data analysis can take some time, hence why it’s important to introduce this as early on as possible to ensure peak performance.

Analytics 132
article thumbnail

Complete guide on how to Use LightGBM in Python

Analytics Vidhya

ArticleVideo Book This article was published as a part of the Data Science Blogathon Introduction A Gradient Boosting Decision tree or a GBDT is a. The post Complete guide on how to Use LightGBM in Python appeared first on Analytics Vidhya.

article thumbnail

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.

article thumbnail

A ‘Fresh Squeeze on Data’ to Help Children Learn about Data, AI and Machine Learning

Cloudera

Dear Parents and Educators and Friends of Cloudera, If you are reading this blog, you know us at Cloudera as a group of self-described data geeks and data analysts. We believe data drives better decisions and moves businesses forward and for us, that’s exciting. We are innovating and helping Fortune 500 transform and grow because they can make better data-driven decisions at the accelerated pace we live and work in today.

article thumbnail

4 Ways Conversational AI Is Improving the Customer Experience

DataKitchen

The post 4 Ways Conversational AI Is Improving the Customer Experience first appeared on DataKitchen.

246
246

More Trending

article thumbnail

Computer Vision and How It is Shaping the World Around Us

Analytics Vidhya

ArticleVideo Book This article was published as a part of the Data Science Blogathon Since the initial breakthrough in Computer Vision achieved by A. Krizhevsky. The post Computer Vision and How It is Shaping the World Around Us appeared first on Analytics Vidhya.

IT 378
article thumbnail

Cloudera DataFlow for the Public Cloud: A technical deep dive

Cloudera

We just announced Cloudera DataFlow for the Public Cloud (CDF-PC), the first cloud-native runtime for Apache NiFi data flows. CDF-PC enables Apache NiFi users to run their existing data flows on a managed, auto-scaling platform with a streamlined way to deploy NiFi data flows and a central monitoring dashboard making it easier than ever before to operate NiFi data flows at scale in the public cloud.

article thumbnail

Implementing a Pharma Data Mesh using DataOps

DataKitchen

Below is our fourth post (4 of 5) on combining data mesh with DataOps to foster innovation while addressing the challenges of a decentralized architecture. We’ve covered the basic ideas behind data mesh and some of the difficulties that must be managed. Below is a discussion of a data mesh implementation in the pharmaceutical space. For those embarking on the data mesh journey, it may be helpful to discuss a real-world example and the lessons learned from an actual data mesh implementation.

article thumbnail

Top eCommerce Metrics for Online Businesses to Study with Analytics

Smart Data Collective

Analytics technology is very important for online businesses. You need to pay close attention to analytics data on various KPIs to determine whether your strategy is working well and what tweaks need to be made. As an eCommerce entrepreneur, you have the benefit of being able to access a plethora of data at any time about multiple areas of your business and how consumers interact with it.

Metrics 135
article thumbnail

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

article thumbnail

Quick Hacks To Save Machine Learning Model using Pickle and Joblib

Analytics Vidhya

ArticleVideo Book This article was published as a part of the Data Science Blogathon TABLE OF CONTENTS Introduction Loading dataset and creating our model Saving. The post Quick Hacks To Save Machine Learning Model using Pickle and Joblib appeared first on Analytics Vidhya.

article thumbnail

Automating Data Pipelines in CDP with CDE Managed Airflow Service

Cloudera

When we announced the GA of Cloudera Data Engineering back in September of last year, a key vision we had was to simplify the automation of data transformation pipelines at scale. By leveraging Spark on Kubernetes as the foundation along with a first class job management API many of our customers have been able to quickly deploy, monitor and manage the life cycle of their spark jobs with ease.

article thumbnail

AIOps Benefits All Aspects of the Enterprise

DataKitchen

The post AIOps Benefits All Aspects of the Enterprise first appeared on DataKitchen.

article thumbnail

Businesses Discover the Importance of Merging Analytics and Content Marketing

Smart Data Collective

Data is the backbone of effective digital marketing, and content is not just king; it is the entire royal family. When you combine both, you get one of the most formidable and effective marketing strategies ever. Businesses worldwide, especially SaaS businesses, have discovered that smart, measurable content marketing is the key to achieving their business goals.

Marketing 134
article thumbnail

8 Steps to Transformation at Speed & Scale – Your Guide to Deploying StratOps

📌Is your Data & AI transformation struggling to really impact the business? Discover the game-changing StratOps approach that: Bridges the Gap : Connect your Data & AI strategy to your operating model, to ensure alignment at every level. Prioritizes Outcomes : Focuses on concrete business outcomes from day one, rather than capabilities in isolation.

article thumbnail

Creating Continuous Action Bot using Deep Reinforcement Learning

Analytics Vidhya

ArticleVideo Book This article was published as a part of the Data Science Blogathon To solve any problem using reinforcement learning we need a. The post Creating Continuous Action Bot using Deep Reinforcement Learning appeared first on Analytics Vidhya.

article thumbnail

CBAP certification: A high-profile credential for business analysts

CIO Business Intelligence

The Certified Business Analysis Professional (CBAP) is a credential for business analysts offered by the International Institute of Business Analysis (IIBA). IIBA is a nonprofit professional association founded in 2003 to promote the field of business analysis. The organization describes CBAP as a credential that “recognizes seasoned BA professionals who have over five years of practical business analysis work experience.

article thumbnail

DataOps engineers run toward error and automate it away

DataKitchen

The post DataOps engineers run toward error and automate it away first appeared on DataKitchen.

IT 205
article thumbnail

Top 5 Data-Driven Lease Accounting Software Solutions

Smart Data Collective

Big data has radically changed the accounting profession. Accountants are using new software with sophisticated machine learning algorithms to better address the nuances of their situations. They are also using more advanced data analytics tools to make more meaningful insights into the nature of their clients’ financial matters. Cloud technology has also helped accountants.

article thumbnail

Marketing Operations in 2025: A New Framework for Success

Speaker: Mike Rizzo, Founder & CEO, MarketingOps.com and Darrell Alfonso, Director of Marketing Strategy and Operations, Indeed.com

Though rarely in the spotlight, marketing operations are the backbone of the efficiency, scalability, and alignment that define top-performing marketing teams. In this exclusive webinar led by industry visionaries Mike Rizzo and Darrell Alfonso, we’re giving marketing operations the recognition they deserve! We will dive into the 7 P Model —a powerful framework designed to assess and optimize your marketing operations function.

article thumbnail

Exploratory Data Analysis and Visualization Techniques in Data Science

Analytics Vidhya

ArticleVideo Book This article was published as a part of the Data Science Blogathon Photo by fauxels from Pexels What is Exploratory Data Analysis? Exploratory. The post Exploratory Data Analysis and Visualization Techniques in Data Science appeared first on Analytics Vidhya.

article thumbnail

Announcing the GA of Cloudera DataFlow for the Public Cloud

Cloudera

Are you ready to turbo-charge your data flows on the cloud for maximum speed and efficiency? We are excited to announce the general availability of Cloudera DataFlow for the Public Cloud (CDF-PC) – a brand new experience on the Cloudera Data Platform (CDP) to address some of the key operational and monitoring challenges of standard Apache NiFi clusters that are overloaded with high-performant flows.

article thumbnail

How Data Cleansing Helps Predictive Modeling Efforts

TDAN

If you are planning on using predictive algorithms, such as machine learning or data mining, in your business, then you should be aware that the amount of data collected can grow exponentially over time. In a world where big data is becoming more popular and the use of predictive modeling is on the rise, there are steps […].

article thumbnail

How Automation Streamlines Data Management

Smart Data Collective

Managing data is a challenge. It’s not hard to collect data, but most companies collect data in disparate locations and across multiple applications that don’t talk to each other. With this model, multiple reports are required to crunch data from multiple sources. That requires manually entering data into yet another application to generate a final report.

article thumbnail

The Cloud Development Environment Adoption Report

Cloud Development Environments (CDEs) are changing how software teams work by moving development to the cloud. Our Cloud Development Environment Adoption Report gathers insights from 223 developers and business leaders, uncovering key trends in CDE adoption. With 66% of large organizations already using CDEs, these platforms are quickly becoming essential to modern development practices.

article thumbnail

Better EDA with 3 Easy Python Libraries for Any Beginner

Analytics Vidhya

ArticleVideo Book This article was published as a part of the Data Science Blogathon Introduction Image Source: Author Data Science enthusiasts know that raw data. The post Better EDA with 3 Easy Python Libraries for Any Beginner appeared first on Analytics Vidhya.

article thumbnail

Data Workbench: Why Your Organization May Need One

Dataiku

This is a guest article from our friends at Dataquest. Dataquest is a data science learning platform with a quickly growing community of learners.

article thumbnail

DAMA International Community Corner: August 2021 – A Wrap for Now

TDAN

It has been an incredible run. I hope it is just “see you soon” rather than “goodbye.” With this column, DAMA International’s streak of quarterly columns since mid-2001 is coming to an end. The columns have featured the activities and incredible work of DAMA International over the past two decades. Thank you, DAMA, and I […].

IT 105
article thumbnail

4 Ways to Use Analytics to Measure and Optimize Business Growth

Smart Data Collective

Data analytics has upended the world of business in countless ways. A growing number of businesses are discovering new applications for data analytics technology as they strive to streamline operations and boost their bottom lines. Many companies have found that analytics technology is ideal for optimizing their business models in a number of ways. This can help them grow considerably.

article thumbnail

Prepare Now: 2025s Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, Terrence Sheflin, and Mahyar Ghasemali

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.

article thumbnail

How to add watermark on images using OpenCV in Python

Analytics Vidhya

ArticleVideo Book This article was published as a part of the Data Science Blogathon Introduction: Watermarks are an important part of businesses and online content. The post How to add watermark on images using OpenCV in Python appeared first on Analytics Vidhya.

article thumbnail

Using AWS, Dataiku, and Tableau to Make Decisions Quickly and Efficiently

Dataiku

This is a guest article from our friends at Interworks. Interworks is a people-focused tech consultancy delivering premier service and expertise in collaboration with strategic partners. We are trusted to empower clients with the right people and solutions aligned to their unique needs. From data management and visualization to server monitoring and maintenance, we can customize the best data and IT solutions for you.

article thumbnail

Business Pace Contributes to Data Challenges

TDAN

The increasing speed and pace of business certainly contributes to several data challenges (quality, timeliness, availability and, most important, usability of the data). As the number of data sources increase and data volumes expand along with the demand from the business to access data on a timelier basis, pressures begin to form on the underlying […].

article thumbnail

Data-Driven Approaches to Better Optimized Enterprise Workflows

Smart Data Collective

Big data has been a very important part of modern human resource solutions. One of the biggest implications of big data in human resources has pertained to enterprise workflow management. Every business, from a small one-person shop to an enterprise level company needs to find ways to be more efficient. This means efficiency in spending, and efficiency in their workflows.

article thumbnail

The Ultimate Guide To Data-Driven Construction: Optimize Projects, Reduce Risks, & Boost Innovation

Speaker: Donna Laquidara-Carr, PhD, LEED AP, Industry Insights Research Director at Dodge Construction Network

In today’s construction market, owners, construction managers, and contractors must navigate increasing challenges, from cost management to project delays. Fortunately, digital tools now offer valuable insights to help mitigate these risks. However, the sheer volume of tools and the complexity of leveraging their data effectively can be daunting. That’s where data-driven construction comes in.