You’re a growth analyst at MercadoLibre. With over 200 million users, MercadoLibre is the most popular e-commerce site in Latin America. You've been tasked with analyzing the company's financial and user data in clever ways to help the company grow. So, you want to find out if the ability to predict search traffic can translate into the ability to successfully trade the stock.
-
Visual depictions of seasonality (as measured by Google Search traffic) that are of interest to the company.
-
An evaluation of how the company stock price correlates to its Google Search traffic.
-
A Prophet forecast model that can predict hourly user search traffic.
-
Answers to the questions in the instructions that you write in your Jupyter notebook.
-
A plot of a forecast for the company’s future revenue.
-
Step 1: Find unusual patterns in hourly Google Search traffic.
-
Step 2: Mine the search traffic data for seasonality.
-
Step 3: Relate the search traffic to stock price patterns.
-
Step 4: Create a time series model by using Prophet.
-
Step 5 (optional): Forecast the revenue by using time series models.
The following subsections detail these steps.
The data science manager asks if the Google Search traffic for the company links to any financial events at the company. Or, does the search traffic data just present random noise? To answer this question, you’ll pick out any unusual patterns in the Google Search data for the company, and connect them to the corporate financial events.
To do so, complete the following steps:
-
Read the search data into a DataFrame, and then slice the data to just the month of May 2020. (During this month, MercadoLibre released its quarterly financial results.) Use hvPlot to visualize the results. Do any unusual patterns exist?
-
Calculate the total search traffic for the month, and then compare the value to the monthly median across all months. Did the Google Search traffic increase during the month that MercadoLibre released its financial results?
The marketing department realizes that they can use the hourly search data, too. If they can track and predict interest in the company and its platform for any time of day, they can focus their marketing efforts around the times that have the most traffic. This will get a greater return on investment (ROI) from their marketing budget.
To that end, you want to mine the search traffic data for predictable seasonal patterns of interest in the company. To do so, complete the following steps:
-
Group the hourly search data to plot the average traffic by the day of the week (for example, Monday vs. Friday).
-
Using hvPlot, visualize this traffic as a heatmap, referencing
index.hour
for the x-axis andindex.dayofweek
for the y-axis. Does any day-of-week effect that you observe concentrate in just a few hours of that day? -
Group the search data by the week of the year. Does the search traffic tend to increase during the winter holiday period (Weeks 40 through 52)?
During a meeting with people in the finance group at the company, you mention your work on the search traffic data. They want to know if any relationship between the search data and the company stock price exists, and they ask if you can investigate.
To do so, complete the following steps:
-
Read in and plot the stock price data. Concatenate the stock price data to the search data in a single DataFrame.
-
Note that market events emerged during 2020 that many companies found difficult. But after the initial shock to global financial markets, new customers and revenue increased for e-commerce platforms. So, slice the data to just the first half of 2020 (
2020-01
to2020-06
in the DataFrame), and then use hvPlot to plot the data. Do both time series indicate a common trend that’s consistent with this narrative? -
Create a new column in the DataFrame named “Lagged Search Trends” that offsets, or shifts, the search traffic by one hour. Create two additional columns:
-
“Stock Volatility”, which holds an exponentially weighted four-hour rolling average of the company’s stock volatility
-
“Hourly Stock Return”, which holds the percentage of change in the company stock price on an hourly basis
-
-
Review the time series correlation, and then answer the following question: Does a predictable relationship exist between the lagged search traffic and the stock volatility or between the lagged search traffic and the stock price returns?
Now, you need to produce a time series model that analyzes and forecasts patterns in the hourly search data. To do so, complete the following steps:
-
Set up the Google Search data for a Prophet forecasting model.
-
After estimating the model, plot the forecast. How's the near-term forecast for the popularity of MercadoLibre?
-
Plot the individual time series components of the model to answer the following questions:
-
What time of day exhibits the greatest popularity?
-
Which day of the week gets the most search traffic?
-
What's the lowest point for search traffic in the calendar year?
-
A few weeks after your initial analysis, the finance group follows up to find out if you can help them solve a different problem. Your fame as a growth analyst in the company continues to grow!
Specifically, the finance group wants a forecast of the total sales for the next quarter. This will dramatically increase their ability to both plan budgets and help guide expectations for the company investors.
To do so, complete the following steps:
-
Read in the daily historical sales (that is, revenue) figures, and then apply a Prophet model to the data.
-
Interpret the model output to identify any seasonal patterns in the company revenue. For example, what are the peak revenue days? (Mondays? Fridays? Something else?)
-
Produce a sales forecast for the finance group. Give them a number for the expected total sales in the next quarter. Include the best- and worst-case scenarios to help them make better plans.