rpriam/stratifreg-code

Computer code for the communication "Family of linear regression mixture models stratified along the outcome"

Abstract: Linear regression is one of the most studied models; it rests on an explicit assumption of linearity. Underlying issues arising from the Yule-Simpson paradox or, more generally, from hidden nonlinearities lead to spurious correlations that are difficult to detect in practice and can induce a mistaken linear model. The concern arises when the model for explaining or predicting the outcome cannot be kept the same for the whole sample because it changes with the dependent variable. Hence, a stratification of the outcome is proposed, which leads to a new family of mixture models of regressions. One or more breaks along the outcome turn the linear regression into several components instead of one. A difference from existing mixture models of regressions is that the partitioning now depends mainly on the outcome. The change is double-checked via an additional ordinal model and a discretization of the outcome. To validate the mixture, a decrease in the BIC, the AIC, and an MSE or MAE is required for both the continuous and discretized outcomes. These indicators, plus the coefficient of determination, are also plotted for moving thresholds in order to visualize the change between intervals of the outcome. With a threshold equal to the median, the approach is illustrated on several real datasets in the presented experiments. It is applied to a medical dataset from the Covid-19 lockdown in spring 2020.
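The comparison described in the abstract can be illustrated with a rough sketch. This is not the repository's implementation: it replaces the mixture model with two independent ordinary least-squares fits on either side of a threshold, uses synthetic data, and computes a naive Gaussian log-likelihood that ignores the truncation of the outcome within each stratum. The function names (`ols_fit`, `gaussian_ic`, `compare_at_threshold`) and the parameter counts are assumptions made for illustration only.

```python
import numpy as np

def ols_fit(X, y):
    """Least-squares fit with an intercept; returns coefficients and residuals."""
    A = np.column_stack([np.ones(len(y)), X])
    beta, *_ = np.linalg.lstsq(A, y, rcond=None)
    return beta, y - A @ beta

def gaussian_ic(resid, k):
    """Gaussian log-likelihood at the MLE variance, plus AIC and BIC for k parameters."""
    n = len(resid)
    sigma2 = np.mean(resid ** 2)
    loglik = -0.5 * n * (np.log(2 * np.pi * sigma2) + 1)
    return loglik, 2 * k - 2 * loglik, k * np.log(n) - 2 * loglik

def compare_at_threshold(X, y, threshold):
    """Compare one global regression with two regressions split at `threshold` on y."""
    n, p = X.shape
    # Single model: p slopes + intercept + variance (assumed parameter count).
    _, r_all = ols_fit(X, y)
    _, aic1, bic1 = gaussian_ic(r_all, p + 2)
    # Stratified model: one independent fit per stratum of the outcome.
    low = y <= threshold
    resid = np.empty_like(y)
    loglik2 = 0.0
    for mask in (low, ~low):
        _, r = ols_fit(X[mask], y[mask])
        resid[mask] = r
        loglik2 += gaussian_ic(r, p + 2)[0]
    k2 = 2 * (p + 2)
    return {
        "single": {"mse": float(np.mean(r_all ** 2)), "aic": aic1, "bic": bic1},
        "stratified": {
            "mse": float(np.mean(resid ** 2)),
            "aic": 2 * k2 - 2 * loglik2,
            "bic": k2 * np.log(n) - 2 * loglik2,
        },
    }

# Synthetic data (an assumption, not one of the datasets D1-D5):
# the slope on the first covariate changes with its sign.
rng = np.random.default_rng(0)
X = rng.normal(size=(400, 2))
y = np.where(X[:, 0] > 0, 3.0, 0.5) * X[:, 0] + X[:, 1] + rng.normal(scale=0.3, size=400)

result = compare_at_threshold(X, y, threshold=np.median(y))
for name, scores in result.items():
    print(name, {key: round(float(val), 3) for key, val in scores.items()})
```

Because the split is made on the outcome itself, the in-sample MSE of the two-fit version can never exceed that of the single fit, which is one reason the abstract asks for further checks (the ordinal model and the discretized outcome) before accepting the mixture. Calling `compare_at_threshold` over a grid of thresholds gives the kind of moving-threshold curves mentioned above.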

Datasets:

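| ID | Name | n | p | X | y |
|----|------|---|---|---|---|
| D1 | covid-19 | 4361 | 6 | .csv | .csv |
| D2 | pre-diabet | 3059 | 4 | .csv | .csv |
| D3 | life-expectancy | 2928 | 16 | .csv | .csv |
| D4 | pisa-2009 | 5233 | 20 | .csv | .csv |
| D5 | housing | 20640 | 8 | .csv | .csv |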
