Nan #63

theadityasam · 2019-05-29T18:20:19Z

The NAN issue will be fixed and cv function will be made functional

tdhock · 2019-05-29T18:27:05Z

TODO for the cv function ... please do NOT re-shuffle folds if there is one that has either all left censored outputs or all right censored outputs. in that case please just stop with an informative error that tells the user they need to manually specify the foldid argument. here is some similar code in one of my other packages, https://github.com/tdhock/penaltyLearning/blob/master/R/IntervalRegression.R#L189

tdhock · 2019-05-29T18:28:37Z

also @anujkhare can you please give @theadityasam push access to your repo so he can create future branches/PRs in your repo? (I think it would make it simpler to keep track of the project if everything is in your repo)

theadityasam · 2019-05-29T19:09:57Z

Okay, won't re-shuffle the folds. Also, for the censored data error message, if I implement a method to check in the R code itself, I'll have to go through the dataset twice - once for checking whether the model can be fit and then again while assigning the censorship types in get_censoring_types of iregnet_fit.cpp which might affect the run time of iregnet.
https://github.com/theadityasam/iregnet/blob/nan/src/iregnet_fit.cpp#L401

I believe, it would be better to write a new piece of R code that checks for the censorship condition and assigns the censorship type done similar to what is done in
https://github.com/theadityasam/iregnet/blob/nan/src/iregnet_fit.cpp#L401
The result will be then passed as argument to the C++ code. This way, we don't need to go through the dataset twice.

tdhock · 2019-05-29T19:15:11Z

checking in R code will not result in any significant slowdown if you use vector operations

theadityasam · 2019-05-29T19:25:27Z

Okayy, will write a new R snippet for the checking

tdhock · 2019-06-05T17:51:30Z

try debugging using print statements or gdb https://tdhock.github.io/blog/2019/gdb/

anujkhare · 2019-06-11T06:21:21Z

@tdhock @theadityasam investigated the cases where NaNs are still produced. I think that they're cases where the unregularized solution does not exist. survreg also produces inf values.

The ideal behavior is that we should fit till the lambda value that is well-behaved, throw a warning for the next lambda that produces a NaN, and then return an iregnet object still. There may be better ways to calculate the lambda path, but I'm not sure yet.

@theadityasam - anything to add? We should make the corresponding changes for this.

anujkhare · 2019-06-11T06:37:19Z

There is a reference to this in section 2.3 of this paper about glmnet. For the cases where the number of covariates (p) > number of samples (n), the unregularized solution is undefined (beta shoots to inf).

They ignore solutions for lambda values close to 0 in such cases.

tdhock · 2019-06-16T04:32:34Z

that is interesting. do you have any concrete p>n examples that can be coded as tests? it makes sense to me to only consider large lambda values in that case, and still return an iregnet object, with a warning.

anujkhare · 2019-06-27T12:08:38Z

Closing since this has been merged into the cv branch: #54 .

theadityasam added 2 commits April 19, 2019 21:53

Fix plot method, missing row names error

ce0b6f7

fix Nan error

75c4ee8

theadityasam mentioned this pull request May 29, 2019

NAN Error #62

Closed

Errors for completely left or right censorship

aa96d67

anujkhare mentioned this pull request Jun 24, 2019

cv.iregnet #54

Merged

anujkhare closed this Jun 27, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Nan #63

Nan #63

theadityasam commented May 29, 2019

tdhock commented May 29, 2019

tdhock commented May 29, 2019

theadityasam commented May 29, 2019 •

edited

Loading

tdhock commented May 29, 2019

theadityasam commented May 29, 2019 •

edited

Loading

tdhock commented Jun 5, 2019

anujkhare commented Jun 11, 2019

anujkhare commented Jun 11, 2019

tdhock commented Jun 16, 2019

anujkhare commented Jun 27, 2019

Nan #63

Nan #63

Conversation

theadityasam commented May 29, 2019

tdhock commented May 29, 2019

tdhock commented May 29, 2019

theadityasam commented May 29, 2019 • edited Loading

tdhock commented May 29, 2019

theadityasam commented May 29, 2019 • edited Loading

tdhock commented Jun 5, 2019

anujkhare commented Jun 11, 2019

anujkhare commented Jun 11, 2019

tdhock commented Jun 16, 2019

anujkhare commented Jun 27, 2019

theadityasam commented May 29, 2019 •

edited

Loading

theadityasam commented May 29, 2019 •

edited

Loading