The residual vs fitted values plot tells about it. This means the residuals must have a constant variance across all the observations. The maximum value is 2.20 standard deviations away from the mean with which we can move ahead. Let's check that with a calculation of standardized residuals. The residuals mustn't be more than 3 standard deviations away from the residual mean.
![microsoft excel linear regression microsoft excel linear regression](https://images.saymedia-content.com/.image/ar_4:3%2Cc_fill%2Ccs_srgb%2Cq_auto:eco%2Cw_1200/MTc0MjM1NjgwNzExNzgwMjIw/how-to-create-a-simple-linear-regression-equation.png)
There should not be any outliers present. The first assumption can be validated from the scatter plots and the results of the regression which showcases different coefficients and the fitted values are calculated using the linear equation. There must be a linear relation between independent and dependent variables. It means you shouldn't be able to form a pattern in any part of the plot as far residuals are concerned. The more random the errors are, the better your model. While R2 is one parameter to look out for, the most standard way is to check the residual plotst to evaluate the model. Let's apply the regression technique and discover if the assumptions get validated without which our model doesn't stand a suitable fit for the given data and future predictions. Though it is not an exact linear relation, the whole purpose of modelling is to understand the uncertainty complemented with statistical analysis. Error terms should be normally distributed with mean 0.All independent variables are uncorrelated with the error term.Observations of the error term are uncorrelated with each other.There should not be any outliers present.There must be a linear relation between independent and dependent variables.
![microsoft excel linear regression microsoft excel linear regression](https://www.dummies.com/wp-content/uploads/430349.image0.jpg)
What do the trends say?Īll the scatter plots helps use decide to go for a linear regression modelĪssumptions to apply linear regression model: This is a transit demand data set and the dependent variable under consideration is Number of weekly riders and there are four independent variables whose predictability we want to know. The higher the proportion, the better is the relationship between dependent and independent variable/s. R2 is R-squared value which is defined as the measure of proprortion of variance of dependent variable explained by the independent variable.Microsoft Excel's regression limits to linear regression analysis however one can try to fit with one independent variable or multiple independent variables.
![microsoft excel linear regression microsoft excel linear regression](https://quickexcel.com/wp-content/uploads/2021/07/How-to-perform-linear-regression-in-Excel.png)
This analysis has a small dataset which explores the dataset from application point of view and I hope it would be a precursor to complex analysis using the same or different technique.