-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
question on missRanger #18
Comments
Hi Benoit Good input, thanks.
Due to this, I don't think I can give a positive answer to 1) and 2) yet.
|
Thanks for your answers! |
Hi @BenoitLondon , Jumping in late here, but it doesn't seem to me like you need to be concerned about past/present values with random forests as it's essentially a non-parametric technique, and thus it will capture time dependence by approximating it as an unknown function. To ensure it captures time features, I think you can just make sure to include a time counter/index. |
Hi,
very nice package. I love ranger package as well so very good idea to use it for imputation!
I have 4 questions actually:
E.g. I have time series data and I would like to impute values using only past data, is there a better way than calling missRanger repetitively at each time point subsetting on the past?
Even in that case, should I impute the next day using the raw past data or using the previously completed data?
Similarly if I have a train and test dataset is there a way to apply the rules of the train dataset imputation to the test one without rerunning the algorithm? (can missRanger return the imputation model?)
Do you have recommendations about when using extratrees splitrule? Is it better and if so in which cases?
Thanks!
The text was updated successfully, but these errors were encountered: