原文地址: 銷售數據分析
數據分析類,讀取csv文件,對銷售數據進行分析。api
This business case is mainly concerned with the forecasting of sales in different stores in the retail industry. The task involves the analysis of historical sales data collected from a nationwide retailer in the U.S. The aim is to expose you to a realistic business case and to gain understanding and insight about some of the ways in which data analytics can be used to support business decision making.app
Accurately forecasting sales is one of the most difficult challenges faced by retailers worldwide, especially when limited historical data is available. In this coursework project, you are provided with historical sales data for 45 stores located in different regions in the U.S. Each store contains a number of departments, and you are asked to predict the sales for each department at each store. In addition, the retailer runs several promotional marketing activities during holidays, the four largest of which are the Super Bowl, Labor Day, Thanksgiving, and Christmas. The weeks including these holidays are weighted five times higher in the forecasting accuracy evaluation than non-holiday weeks. A challenge in this sales forecasting problem is to take into account the effects of promotional activities on sales given the fact that part of the promotion related data is absent from historical records. The available data are briefly introduced below.ide
This excel file contains the anonymised number, type and size of the 45 stores.ui
Column | Description |
---|---|
Store | the anonymised store number |
Type | store type, A: supercentre, B: superstore, C: supermarket |
Size | store size (in square feet) |
This excel file contains additional data related to the store, department, and regional activity for the given dates. It includes the following fields:this
Column | Description |
---|---|
Store | the anonymised store number |
Date | the week with the dated Friday |
Temperature | average temperature in the region |
Fuel_Price | cost of fuel in the region |
Promotions | anonymised data related to promotions, mainly price reductions that the retailer is running. Promotion data is only available after Nov. 2011, and is not available for all stores all the time. Any missing value is marked with an NA. |
CPI | the consumer price index |
Unemployment | the unemployment rate |
IsHoliday | whether the week is a special holiday week |
The four public holidays included in the data fall within the following weeks.lua
Super Bowl: 12/02/2010, 11/02/2011, 10/02/2012, 08/02/2013 Labor Day: 10/09/2010, 09/09/2011, 07/09/2012, 06/09/2013 Thanksgiving: 26/11/2010, 25/11/2011, 23/11/2012, 29/11/2013 Christmas: 31/12/2010, 30/12/2011, 28/12/2012, 27/12/2013
This file contains the historical training data, which covers sales from 05/02/2010 to 26/10/2012. It includes the following fields:idea
Column | Description |
---|---|
Store | the anonymised store number |
Department | the anonymised department number |
Date | the week with the dated Friday |
Weekly_Sales | sales for the given department in the given store |
IsHoliday | whether the week is a special holiday week |
This file is identical to train.csv, except you need to predict the weekly sales for each triplet of store, department, and date from 02/11/2012 to 26/07/2013.spa
In the coursework report the following weighted mean absolute error (WMAE) or other appropriate errors should be used to evaluate forecasting accuracy.excel
For your assessment for this course you will need to complete two tasks: an essay and a report on a data analytical project. The report should be submitted should be submitted by the 15th March 2018 via Turnitin.code
Length Less than 3000 words (excluding tables, references and appendices - if needed).
You will find a set of datasets on blackboard in the assessed folder. Your task is to describe, pre-process and analyse the datasets so as to lead to the development of accurate predictive models.
Your work should cover (but not be limited to) the following:
Produce a report, which describes the process that you went through and present your analytical solution and any relevant exploratory/supporting analyses.
You can use whatever software you wish to carry out the task.
Imagine that you are writing the report for someone to read not simply to pass the course!
(本文出自csprojectedu.com,轉載請註明出處)