Data Science
datapandasadmin

Difference between Classification and Regression Trees

I was working on a project and looking into decision trees, so I had to get clear on the real meanings of, and distinctions between, Classification and Regression Trees. I searched and came across various articles and guides, but two in particular improved my understanding of the two types of decision tree. The first was

Read More »

Can the sample size be equal to the population size?

What is population size: In simple terms, let us say the population is the WHOLE SET. What is sample size: let us say it is the SUBSET (which lies within the whole set). The norm: Normally, subsets lie within the whole set and are smaller than the whole set. Can they be equal? Yes
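As a rough illustration of the whole-set and subset idea, here is a minimal Python sketch. The 10-unit population and the variable names are made up for the example and do not come from the post; it only shows that drawing a sample as large as the population (in effect, a census) is possible when sampling without replacement.

```python
import random

# Hypothetical population: the WHOLE SET of 10 units (illustrative only)
population = list(range(1, 11))

# The usual case: a sample is a smaller SUBSET drawn without replacement
sample = random.sample(population, k=4)

# The limiting case: the sample size equals the population size,
# so every unit is selected (effectively a census)
census = random.sample(population, k=len(population))

print(len(sample), len(census), len(population))
print(set(sample) < set(population))    # sample is a strict subset
print(set(census) == set(population))   # census covers the whole set
```

Asking `random.sample` for more units than the population holds raises a `ValueError`, which matches the intuition that a sample drawn without replacement cannot exceed the whole set.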

Read More »
Coursera

Coursera Capstone Project – Data Analysis and Interpretation

What is it: This week, I started the Data Analysis and Interpretation Capstone by Wesleyan University on Coursera. This is the final course of the 5-course specialisation. The capstone project is expected to take 4 weeks, with each week tackling a major component of the project work. What is the objective: The objective of

Read More »

Running a Random Forest – Data Analysis and Interpretation

Overview: My research work deals with Ghana, a country from the Gapminder dataset, as discussed from the beginning of this course and throughout it. The variables in my observation dataset are all quantitative. For the purposes of this assignment, I have binned my quantitative target variable, Life Expectancy (lifeexpectancy), into a 2-level binary categorical target variable. I have named

Read More »

Writing About The Data – Data Analysis and Interpretation

OVERVIEW: My research work deals with Ghana, a country from the Gapminder dataset. There are 5 main variables I have been working with, so I will look at the sample, procedure and measures for these 5 variables. The variables are: i. Incomeperperson (Income Per Person) ii. literacyrate (Literacy Rate) iii. lifeexpectancy (Life

Read More »

Machine Learning – Coursera

Today, I got my results from the Machine Learning course on Coursera.org. It was a month of intense digging into the fundamentals and core aspects of machine learning. No wonder machine learning is so widely talked about online. The module is ideal for beginners who want to get hands-on experience with machine learning. It

Read More »

K-Means Cluster Analysis – Data Analysis and Interpretation

Overview: My research work deals with Ghana, a country from the Gapminder dataset, as discussed from the beginning of this course and throughout it. I conducted a k-means cluster analysis to identify underlying subgroups of the population of Ghana based on the similarity of their responses on 22 variables that represent characteristics

Read More »

Running a Lasso Regression Analysis – Data Analysis and Interpretation

Overview: My research work deals with Ghana, a country from the Gapminder dataset, as discussed from the beginning of this course and throughout it. My response variable, lifeexpectancy, is a quantitative variable that measures the life expectancy of the people of Ghana. For the purposes of running the Lasso Regression Analysis, I added more variables

Read More »

Test a Logistic Regression Model – Data Analysis and Interpretation

OVERVIEW: My research work deals with Ghana, a country from the Gapminder dataset. What I found in my logistic regression analysis: a discussion of the results for the associations between all of my explanatory variables and my response variable. The primary quantitative explanatory variable in my regression analysis is the Income Per Person

Read More »