Skip to content

Commit

Permalink
Materials for the regression lectures 2 & 3
Browse files Browse the repository at this point in the history
  • Loading branch information
bvasiles committed Jan 11, 2024
1 parent dfff225 commit 1010f01
Show file tree
Hide file tree
Showing 5 changed files with 68 additions and 3 deletions.
4 changes: 2 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -81,8 +81,8 @@ Thu, Oct 13 | [Experimental design (part II)](pages/oct13-counterbalancing.md) |
Tue, Oct 25 | [Experimental design (part III)](pages/oct25-experiment-stats.md) | [slides](slides/13-experiments-3.pdf)[video](https://www.youtube.com/watch?v=Qk8vNy3m3vU)
Thu, Oct 27 | [Intro to regression modeling](pages/oct27-regression-pt1.md) | [slides](slides/14-regression-1.pdf)[video](https://youtu.be/HTC7dPY-F34)
Tue, Nov 1 | [Diagnostics, factors, std coefficients](pages/nov01-regression-pt2.md) | [slides](slides/15-regression-2.pdf)[video](https://www.youtube.com/watch?v=5p8wtSmwkEE)
Thu, Nov 3 | Exemplar regression papers |
Tue, Nov 8 | Simpson’s paradox + Mixed-effects |
Thu, Nov 3 | [Simpson’s paradox, exemplar papers, in-class activity](pages/nov03-regression-pt3.md) | [slides](slides/16-regression-3.pdf)[video](https://youtu.be/icvF44USI_s)
Tue, Nov 8 | Mixed-effects |
Thu, Nov 10 | Interrupted time series design |
Tue, Nov 15 | Social network analysis (part I) |
Thu, Nov 17 | Social network analysis (part II) |
Expand Down
Binary file added assets/images/16-regression-3.jpeg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
2 changes: 1 addition & 1 deletion pages/nov01-regression-pt2.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@

[![Lecture15-Regression-Diagnostics](../assets/images/15-regression-2.jpeg)](../slides/15-regression-2.pdf)

This is the second lecture in a series dedicated to regression modeling. We talked about some of the things that can go wrong when estimating linear models and how to diagnose those.
This is the second lecture in a series dedicated to regression modeling. We talked about some of the things that can go wrong when estimating linear models and how to diagnose those, how to model categorical variables and interpret the corresponding regression coefficients, how to model and interpret interaction effects, and how to work with standardized regression coefficients.

The importance of having a good understanding of linear regression before studying more complex statistical models cannot be overstated.

Expand Down
65 changes: 65 additions & 0 deletions pages/nov03-regression-pt3.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,65 @@
## L16: Linear Regression (Part III) ([pdf](../slides/16-regression-3.pdf), [video](https://youtu.be/icvF44USI_s))

![Lecture16-Simpson](../assets/images/16-regression-3.jpeg)

In this third lecture in the regression series we talked about [Simpson's paradox](https://en.wikipedia.org/wiki/Simpson's_paradox), how to account for within- and between- subject effects in the same model, and briefly about mixed-effects models. We also discussed two exemplar papers. Finally, we applied what we learned about regression so far to the Galton height dataset as part of an in-class hands-on activity (see the R markdown part of the [slides](../slides/16-regression-3.pdf) for details).

### Exemplars

> Woolley, A. W., Chabris, C. F., Pentland, A., Hashmi, N., & Malone, T. W. (2010). Evidence for a collective intelligence factor in the performance of human groups. Science, 330(6004), 686-688.
> Peoples, B. K., Midway, S. R., Sackett, D., Lynch, A., & Cooney, P. B. (2016). Twitter predicts citation rates of ecological research. PloS One, 11(11), e0166570.

### Lecture Readings


> Bates, D., Mächler, M., Bolker, B., & Walker, S. (2015). [Fitting Linear Mixed-Effects Models Using lme4](http:https://mirrors.nics.utk.edu/cran/web/packages/lme4/vignettes/lmer.pdf). Journal of Statistical Software, 67(1), 1 - 48.
Official vignette for the `lme4` package in R, that describes the syntax and how to specify and interpret mixed-effects linear regression models.

---

> Bell, A., & Jones, K. (2015). [Explaining fixed effects: Random effects modeling of time-series cross-sectional and panel data](). Political Science Research and Methods, 3(1), 133-153.
> Bell, A., Fairbrother, M., & Jones, K. (2019). [Fixed and random effects models: making an informed choice](https://core.ac.uk/download/pdf/161126779.pdf). Quality & Quantity, 53(2), 1051-1074.
Both papers assess the options available to researchers analyzing multilevel (including longitudinal) data, with the aim of supporting good methodological decision-making. Capabilities and limitations of fixed and random effects (FE and RE) models are presented, together with the within-between RE model -- the most general of the three, with all the strengths of the other two.

The earlier paper argues that "there are few, if any, occasions in which FE modeling is preferable to RE modeling" and that "a well-specified RE model can be used to achieve everything that FE models achieve, and much more besides." Fun read.

---

> McNeish, D., & Kelley, K. (2019). [Fixed effects models versus mixed effects models for clustered data: Reviewing the approaches, disentangling the differences, and making recommendations](https://www3.nd.edu/~kkelley/publications/articles/McNeish_Kelley_PsychMethods_2019.pdf). Psychological Methods, 24(1), 20.
This article discusses the differences between mixed effects and fixed effects models for clustered data, reviews each approach, and helps to identify when each approach is optimal. It then discusses the within–between specification, which blends advantageous properties of each framework into a single model.

There is overlap in content with the Bell et al papers above.

---

> Hair Jr, J. F., & Fávero, L. P. (2019). [Multilevel modeling for longitudinal data: concepts and applications](https://core.ac.uk/download/pdf/288188778.pdf). RAUSP Management Journal.
Another paper that discusses multilevel modeling for longitudinal data, clarifying the circumstances in which they can be used.

---

> Hox, J. J., Moerbeek, M., & Van de Schoot, R. (2017). [Chapter 2](http:https://joophox.net/mlbook2/Chapter2.pdf) from Multilevel analysis: Techniques and applications. Routledge.
The multilevel regression model is commonly needed and used. Such models assume that there is a hierarchical data set, with one single outcome or response variable that is measured at the lowest level, and explanatory variables at all existing levels. Conceptually, it is useful to view the multilevel regression model as a hierarchical system of regression equations. This chapter explains the multilevel regression model for two-level data.


### Additional Readings


> Bell, A., Jones, K., & Fairbrother, M. (2018). [Understanding and misunderstanding group mean centering: a commentary on Kelley et al.’s dangerous practice](https://d-nb.info/1149996188/34). Quality & Quantity, 52(5), 2031-2036.
"Group-mean-centering is, in short, no more dangerous than any other statistical procedure, and should remain a normal part of multilevel data analyses where it can be judiciously employed to good effect."

---

> Townsend, Z., Buckley, J., Harada, M., & Scott, M. A. (2013). [The choice between fixed and random effects](http:https://agfda.userweb.mwn.de/ALD_2016/downloads/Uebung2/Townsend_et_al_2013.pdf). The SAGE handbook of multilevel modeling, 73-88.
---

> Clark, T. S., & Linzer, D. A. (2015). [Should I use fixed or random effects](https://datajobs.com/data-science-repo/Fixed-Effects-Models-[Clark-and-Linzer].pdf). Political Science Research and Methods, 3(2), 399-408.
Binary file added slides/16-regression-3.pdf
Binary file not shown.

0 comments on commit 1010f01

Please sign in to comment.