Skip to content

Commit 67689a9

Browse files
committed
End of chapter 4
1 parent ac5dd9f commit 67689a9

File tree

1 file changed

+11
-0
lines changed

1 file changed

+11
-0
lines changed

regression-and-prediction.md

+11
Original file line numberDiff line numberDiff line change
@@ -113,3 +113,14 @@ Used by statisticians for analysis of complex surveys. Data scientists find it u
113113

114114

115115
## Regression Diagnostics
116+
- Outliers: An extreme value that is different from most other observations. You can detect an outlier by examining the standardized residual, which is the residual divided by the standard error if the residuals. Standardized residual can be interpreted as the number of standard errors away from the regression line.
117+
- Influential Values: A value whose absence would significantly change the regression equation is termed an influetial observation. Leverage is a common metric to determin influence og a single record on a regression. Leverage is also known as the hat-value.
118+
- Non-normal residuals: Non normally distributed residuals can invalidate some technical requirements of regression but are usually not a concern in data science.
119+
- Heteroskedasticity: When some ranges of the outcome experience residuals with higher variance (may indicate a predictor is missing from the equation.)
120+
- Partial Residual Plots: A way to visualize how well the estimated fit explains the relationship between a predictor and the outcome. It isolates the relationship between a predictor variable and the response, taking into account all of the other predictor variables.
121+
122+
## Polynomial and Spline Regression
123+
- Polynomial Regression: Adds polynomial terms to a regression (e.g. squares, cubes, etc).
124+
- Spline Regression: Fitting a smooth curve wiht a series of polynoimal segments.
125+
- Knots: Values that separate spline segments.
126+
- Generalized Additive Models: Spline models with automated selection of knots.

0 commit comments

Comments
 (0)