Linear Regression Project
This is the first part of the Linear Regression Project. The first step will be to identify a data set you would like to work with, subject to the requirements
spelled out below. Take a lookat data sources for links to many different data sources.
This assignment has two deliverables:
1. Data Summary
• Format: Word Document
• Instructions: Please see Part 1 – Data Summary for instructions.
2. Cleaned, Labeled Data Set
• Format: Excel
• Instructions:
o Must have between FOUR and SIX total variables organized as columns in the spreadsheet.
o At least one of the independent variables must be CATEGORICAL. Categorical variables can be derived by you or be part of the the original data set. All
categorical variables should have exactly TWO levels (distinct values).
o At least ONE of the independent variables must be CONTINUOUS.
o The dependent variable must be CONTINUOUS.
o Data set has at least 30 observations (rows).
o All variable columns as clearly labeled as either:
Independent or dependent
Categorical or continuous
An example data set is linked Part 1 – Data Example for reference.
NOTE: If your data set does not have a categorical variable in it already, you may need to derive one.