Exploratory Data Analysis Project: Laptop Pricing Prediction

Raphael Farodoye

Data Scientist
Business Analyst
Data Analyst

Exploratory Data Analysis Project: Laptop Pricing Analysis

The Dataset is a filtered and modified dataset of the Laptop Price Prediction using specifications.

Tools used

Python

Laptop Pricing Prediction Using Specifications analysis

The dataset has 238 rows and 13 columns
Fig 1: Price Binning
In Fig 1, the 'Price' attribute is divided into three sectors to differentiate between values categorized as Low, Medium, and High.
FIg 2: Screen_Size vs Price
In Fig 2, Screen_size is not a good predictor of price since the regression line is almost horizontal. We call it a weak predictor of price.
Fig 3: CPU_frequency vs Price
In Fig 3, we can see that CPU_frequency is a good predictor of Price, that is because we can see a positive incline on the regression line. The higher the CPU_frequency the higher the price.
Fig 4: Weight_pounds vs Price
In Fig 4, we can see that the weight is also not a good predictor of price, we can see the dots set is spread all over the in between the line.
Fig 5: Correlation of Screen_Size, CPU_frequency, Weight_pounds against Price
In Fig 5, this is showing the correlation between the 3 variables against price. Here is the percentage of the correlation between them to price Screen_Size vs Price = 11% CPU_frequency vs Price = 36% Weight_pounds vs Price = 5%
Fig 6:
In Fig 6, we see that the distribution of price between the different category are quite distinct enough to make category a good predictor
Fig 7:
In Fig 7, we see that the distribution of price between the different GPU is quite distinct to make GPU a good predictor of price.
Fig 8:
In Fig 8, we see that the distribution of price between the different OS is quite distinct to make OS a good predictor of price.
Fig 9:
In Fig 9, we see that the distribution of price between the different CPU_core are quite distinct enough to make CPU_core a good predictor, in such a way that the higher the CPU the higher the price
Fig 10:
In Fig 10,we see that the distribution of price between the different RAM_GB are quite distinct enough to make RAM_GB a good predictor, in such a way that there the higher the RAM_GB the higher the price.
Fig 11:
In Fig 11, we see that the distribution of price between the differnet Storage_GB_SSD is quite distinct enough to make Storage_GB_SSD a good predictor
Fig 12: Using CPU_frequency as a means of prediction
In FIg 12, I predicted the price of the laptop using just the CPU_frequency, the blue line is the predicted price, while the red line is the actual price. we will see they don't fit well, let's continue and check with other specifications
Fig 13:
In Fig 13, I predicted the price of a laptop using this specifications CPU_frequency, Category, GPU, OS, CPU_core, RAM_GB, Storage_GB_SSD and then compared the predicted price with the actual price. We can see multiple specifications as a prediction gives a better prediction than Fig 12. The blue line is the predicted price and the red line is the actual data, but there is still room for improvement.
Fig 14:
Fitting a polymonial model give us a a better fitting,
In Fig 14, we can see that the line touched more dots compared to Fig 3 which is a good improvement
Fig 15:
In Fig 15, we plot a graph to look at the best degree to use to test our data to reduce overfitting and modeling noise

CONCLUSION

We can now say CPU_frequency, RAM_GB, Storage_GB_SSD, CPU_core, OS, GPU, Category is a good predictor of price
Thank you.
Partner With Raphael
View Services

More Projects by Raphael