-------------------------------------------------(b)-------------------------------------------------
Probable reasons are :
- In december supply increase due to which price falls.
- There are many factors behind the increase in prices(Weather, Soil condition).
- If we see quarter wise theres not much difference in 1st and 2nd quarter but theres a substantial difference in 3rd quater i.e
During the months of July, August and September prices are the highest
- We can notice that Fatehpur and samsabad market mostly sell local variety while others sell Desi
- Checking for missing values
- Converting categorical values to numeric
- Scaling values(normalization/standardization) it'll help converge the values faster while building models
- After analysing(refer Price Trends notebook) I figured that theres one data point of variety potato in achmera market and one of other in jagnair so we need to take care of those points either we can remove them or change there variety by the top variety which is Desi.
- Commodity
- Variety because different varieties have different prices
- Time
- Market Name because prices differ market to market
Why Forecasting?
-Prices depends on time and the data is time-series so, it'll be easier to forecast the prices using below methods.
- Classic ARIMA/SARIMA
- Deep Learning(RNN or LSTM)
- Modal Price
- We could use mean squared error(MSE) because it would directly give us the difference between predicted/forecasted and real prices.
------------------------------------------Uploaded file Info-----------------------------------------
- crawler.py - It consists the scrawler but it only scapres table from first page.
- crawler.ipynb- step by step explanation of what the crawler does.
- (b) price trends analysis- It consists of data and visualization from which the observations are derived
- I was not able to scrape the tables from each page(But I am willing to learn how to do it) so, instead I took the data from export to csv option and did the analysis.
PS: I have used references from medium and pluralsight articles to build the crawler.