Don’t Forget The Edge Cases!

Imagine a scenario where you are asked to write the following function as part of a larger project. Task: Write a function that returns the type of a triangle based on the lengths of its three sides. Let’s make it a bit easier by assuming that test for […]
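
The task in this excerpt can be sketched as below. This is a minimal illustration, not the post's own solution: the function name `triangle_type` and the `"invalid"` return value are hypothetical. The post's simplifying assumption is cut off in the excerpt, so the sketch checks the edge cases itself instead of assuming them away.

```python
def triangle_type(a: float, b: float, c: float) -> str:
    """Classify a triangle by its three side lengths (hypothetical helper)."""
    shortest, middle, longest = sorted((a, b, c))

    # Edge case: every side length must be strictly positive.
    if shortest <= 0:
        return "invalid"

    # Edge case: triangle inequality. The two shorter sides together
    # must be strictly longer than the longest side.
    if shortest + middle <= longest:
        return "invalid"

    if a == b == c:
        return "equilateral"
    if a == b or b == c or a == c:
        return "isosceles"
    return "scalene"


print(triangle_type(3, 4, 5))
print(triangle_type(1, 2, 3))
```

The degenerate input `(1, 2, 3)` is the kind of case that is easy to miss: the sides are all positive, yet they cannot form a triangle.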

Basics of Matplotlib

Written as a part of ML101 Teaching_ML. Data visualization forms the crux of data modeling. We use data visualization to explore the data before the modeling step, and then again to present the final model in graphical form to a non-technical audience. There are numerous visualization libraries and tools available for Python or […]

ggplot in python-part 5

The main factor affecting price is the carat. In this post we shall evaluate how the two factors fare. PRICE VS CARATS. OBSERVATION: As the value of carat increases, the price goes up. The line of regression is quadratic. Thus, price is affected by carat as well as by other factors. OBSERVATION: Upon further scaling, we […]

ggplot error in legends.

My ggplot data evaluation was going smoothly until I came across an error in my ggplots. When I tried to use the factors of cut, clarity, and color as differentiating factors in price vs volume, the legends did not show up. I had a colored and segmented graph, but it looked vague. […]

ggplot in Python- Part 4

Diamonds are costly, and their value is affected by various qualitative and quantitative factors. In this post we will try to evaluate some of the factors that contribute to making them costly. Check the density of diamond. PRICE VS LENGTH, PRICE VS BREADTH, PRICE VS HEIGHT, PRICE VS VOLUME, PRICE VS DEPTH, PRICE VS TABLE. Observations […]

ggplot in Python- Part 3

Before plotting our data, we must be aware of any null values in the qualitative fields and unnecessary zero values in the numeric fields. Such values can lead to incorrect statistical calculations and, even worse, errors while plotting the values and forming the line of best fit. 1. Check for any null values. 2. Check […]
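
The two kinds of check named in this excerpt can be sketched in plain Python. This is only an illustration under stated assumptions: the helper `find_suspect_rows`, the three sample rows, and the choice of `cut`, `x`, and `y` as fields are all hypothetical, and the post itself most likely works on a data frame rather than a list of dictionaries.

```python
def find_suspect_rows(rows, text_fields, numeric_fields):
    """Return indices of rows with null text fields and with zero numeric fields."""
    # Rows where any qualitative field is missing (None).
    null_rows = [
        i for i, row in enumerate(rows)
        if any(row.get(field) is None for field in text_fields)
    ]
    # Rows where any numeric field is exactly zero.
    zero_rows = [
        i for i, row in enumerate(rows)
        if any(row.get(field) == 0 for field in numeric_fields)
    ]
    return null_rows, zero_rows


# Hypothetical sample rows, made up for illustration only.
diamonds = [
    {"cut": "Ideal", "x": 3.95, "y": 3.98},
    {"cut": None, "x": 4.05, "y": 4.07},
    {"cut": "Good", "x": 0.0, "y": 4.11},
]

null_rows, zero_rows = find_suspect_rows(diamonds, ["cut"], ["x", "y"])
print("rows with null values:", null_rows)
print("rows with zero values:", zero_rows)
```

Reporting the row indices, rather than dropping rows silently, leaves the decision about how to handle each suspect value to the analyst.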

ggplot in Python-Part 2

Continuing with our study of the diamond data, let us employ basic exploration functions and see what information we can draw from them. EXPLORE THE DATA: 1. Length of the data: Use the len() function to see the number of rows of data. 2. Names of the columns: Use the column() function. 3. Analyse the first […]

ggplot in Python- Part 1.

As a part of my internship, I have to study the basic functions in ggplot using the predefined data sets. But before we dive into the analysis, let us first understand what ggplot is all about. What is ggplot? ggplot is a plotting system in R, created by H. Wickham in 2005. Since then it has […]

R or Python…Both?

Well, this seems to be an endless debate: which is better, R or Python? What should be my learning track if I am an aspiring Data Scientist? Well, I don’t have the answer yet. But it looks like the world is already moving to converge the power of R with the simplicity of Python, […]

Libpng15.so error while importing matplotlib

So, here I was trying my hand at the IPython notebook when the following commands generated an error: import matplotlib.pyplot as plt AND import numpy as np. At first I thought it was due to some configuration issue with my notebook, but when I tried the above commands in Python via the terminal, the same error surfaced. […]