Posts

Showing posts from February, 2020

Measure of dispersion and its python implementation(ML)

Measure of central tendency for grouped and ungrouped data and its python implementation(Machine Learning)

Understanding data type and Level of measurement for Machine Learning data

Image
In this modern world, to be a good data scientist, one should have sound knowledge of statistics and computer programming language (for data extraction, handling big data, data wrangling, data visualization, model creation etc.). For understanding statistics, first we will have to understand the data. In this blog, we will try to understand type of data and their measurement scale. 1.       Type of data/Variable: There are the different types of data that a statistician/data scientist will come across. There are two types of data: a.        Categorical Variable (Qualitative Variable): This kind of variable has qualitative value and not numerical value. For example :                                           ...

DCOVA Frame work of Machine Learning

Image
If we need to study data for a given goal, there are certain steps that need to be followed. These steps collectively called Framework. At the high level, DCOVA framework tells the steps that need to be performed for machine learning. What does DCOVA Framework stands for?  DCOVA stands for: D - Define C - Collect O - Organize V - Visualize A – Analyze Let’s discuss the above point in detail:   Define: First step is to define the problem statement clearly and identify the data requirement that is required to solve the problem.  For Example: Problem statement is to detect the cancer on a patient at an early stage. To solve the problem in example, first it needs to be identified the feature that could be useful for detecting the cancer. Please note that the feature selection should be done with help of domain expert. Note: There is no guarantee that the feature selected is having impact on the outcome i.e. detection of ...

Data Science (DS) Vs Artificial intelligence (AI) Vs Machine Learning (ML) Vs Deep Learning (DL)

Image
Now a day we often hear the buzzwords like Data science (DS), Artificial Intelligence (AI), Machine Learning (ML) and Deep Learning (DL). In this blog, we will try to understand each term to avoid any confusion. What is Data Science?? Data science is “ a field of study of data “.   Here data is being studied, analyzed and processed so as to gain more information from data. As per Wikipedia: “ The term ‘data science’ has appeared in various contexts over the past thirty years but did not become an established term until recently” Why data science became so important lately??                 As you hear very often, data is new oil.   With the Industrial revolution, oil (main source of energy) becomes so important that each and every country is reliant on the oil to run their economy. Even at present, oil is the main source of energy and a small disruption in its flow bring chaos ...