Author: datapandasadmin

Teradata Viewpoint – SQL Scratchpad – Writing Queries Dillard’s Department Store Database
| |

Teradata Viewpoint – SQL Scratchpad – Writing Queries Dillard’s Department Store Database

Analysing ‘Big Data’ from a database with real world data requires retrieving data from the database by writing relevant queries. As part of “Managing Big Data with MySQL” by Duke University on Cousera, I dealt with real world data that comprise hundreds to millions of entries/rows. This is the database of Dillard’s Department stores, specifically, the UA_DILLARDS that contains…

Basic SQL – DQL Command Structure Syntax – Quickest Way to Remember It!
| |

Basic SQL – DQL Command Structure Syntax – Quickest Way to Remember It!

In this quick guide, we will be looking at the simply command structure/syntax of SQL – DQL. SQL – Stands for  Structured Query Language. It is a language that is used to communicate with a database. DQL– Data Query Language aspect of  MySQL. As an Data Scientist or aspiring one,  you will be interacting with lots…

Mean Squared Error – Simple Definition, Explanation and Illustration
|

Mean Squared Error – Simple Definition, Explanation and Illustration

Having started my journey in Data Science. I came across Mean Squared Error several times and realised how important that concept is. Then I started working on project and needed to really understand what Mean Squared Error is ? So what is Mean Squared Error ? Simply,  let’s say you are building your model. And…

|

Coursera Capstone Final Report – Data Analysis and Interpretation

I worked with a World Bank Dataset provided by the Course instructors. The research question was: Prediction of Adjusted Net National Income Per Capita of Countries Brief Introduction to the Research Question The purpose of this project was to identify the best predictors for Adjusted Net National Income Per Capita of countries from multiple World…

Difference between Classification and Regression Trees

Difference between Classification and Regression Trees

  I was working a project and I was looking into Decision Tree. I hence had to get the real meanings and distinctions between Classification and Regression Trees. I did a search and came across various articles and guides. However, I found 2 that enhanced my knowledge between the two types of Decision Tree. First was…

Coursera Capstone Project – Data Analysis and Interpretation
| |

Coursera Capstone Project – Data Analysis and Interpretation

What is it This week, I started the Data Analysis and Interpretation Capstone by Wesleyan University on Coursera. This is the final of 5 course specialisation. The capstone project is expected to take 4 weeks. With each week tackling and doing a major component of the Project work.   What is the objective The objective of…

| |

Running a Random Forest – Data Analysis and Intrepretation

Overview My research work deals with Ghana, a country from the Gapminder dataset as has already been discussed from the beginning and progression through this course. The variables in my observation dataset are all quantitative. For the purposes of this assignment, I have binned my quantitative target variable, Life Expectancy (lifeexpectancy) into a 2-level binary categorical target variable. I have named…

|

Writing About The Data – Data Analysis and Intrepretation

OVERVIEW My research work deals with Ghana, a country from the Gapminder dataset and there are 5 main variables I have been working with so I will be looking at the sample, procedure and measures for these 5 variables. The variables are: i.      Incomeperperson (Income Per Person) ii.    literacyrate (Literacy  Rate) iii.   lifeexpectancy  (Life…