Is there a case for operations like \ for DataArrays? #692
Comments
This is a tough question to answer. It's not clear to me if the output solution would be all missing values if there were a single missing value in the input or not. If we did implement this, it should go in DataArrays.
I have no idea if it's feasible implementation-wise, but a cool way of dealing with missing values for this specific operation would be to discard the rows (i.e. equations) that have missing values, and try to solve the remaining system. So if Rationale: Most of the time when using
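The row-discarding idea in the comment above can be sketched as follows. This is only an illustration under stated assumptions: it uses the `missing` value from current Julia rather than the `NA` of DataArrays, and `solve_complete_rows` is a hypothetical helper name, not a function of DataFrames.jl or DataArrays.jl.

```julia
using LinearAlgebra

# Hypothetical helper (not part of DataFrames.jl or DataArrays.jl).
# Drops every equation (row) in which b or any coefficient of A is missing,
# then solves the remaining system with `\`.
function solve_complete_rows(A::AbstractMatrix, b::AbstractVector)
    size(A, 1) == length(b) ||
        throw(DimensionMismatch("A and b must have the same number of rows"))
    # A row is usable only if neither b[i] nor any entry of A[i, :] is missing.
    keep = [!ismissing(b[i]) && !any(ismissing, view(A, i, :)) for i in axes(A, 1)]
    # Convert the surviving rows to plain Float64 arrays before solving.
    A_complete = Float64.(A[keep, :])
    b_complete = Float64.(b[keep])
    return A_complete \ b_complete
end

# Made-up data: y = 1 + 2x, with one observation unknown.
A = [1.0 1.0; 1.0 2.0; 1.0 3.0; 1.0 4.0]
b = [3.0, missing, 7.0, 9.0]
c = solve_complete_rows(A, b)
```

Keeping this in an explicitly named function makes the assumption visible at the call site: rows with unknown entries are dropped silently only because the caller asked for it.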
I'd rather that something that involves meaningful assumptions be explicit, rather than be an implicit default.
In what context would
That's not what I said. The question is: "are there multiple reasonable ways to propose solving a linear equation where some entries of A or b are unknown?". And the answer is: "yes".
Yikes. Implicitly casting a DataFrame to a matrix for matrix operations seems like a frightening idea. Especially since it's shorthand that users can trivially implement locally if it fits with their workflow. One case where A\data[:V] is likely to do badly is when data[:V] is a PooledDataArray. Let alone when there are missing values.
Is there any strong reason for you not to use GLM for curve fitting? There
On Sunday, 28 September 2014, Gray Calhoun notifications@github.com wrote:
GLM is extreme overkill for the scenario I had on my hands at the moment. Also, GLM.jl seems to be aimed primarily at users coming from R, and I've never used R. On the other hand, I've used MATLAB more than enough, and there (to no one's surprise) linear fitting is easiest done with the backslash operator...
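For reference, the backslash-style linear fit mentioned above looks roughly like this on plain Julia arrays. It is a minimal sketch with made-up data; the names `x`, `y`, `A` and `coeffs` are illustrative and do not come from the issue.

```julia
using LinearAlgebra

# Made-up data for illustration: y = 2 + 3x exactly.
x = [0.0, 1.0, 2.0, 3.0]
y = [2.0, 5.0, 8.0, 11.0]

# Design matrix with an intercept column and a slope column.
A = [ones(length(x)) x]

# For a tall (overdetermined) matrix, `\` returns the least-squares solution.
coeffs = A \ y
intercept, slope = coeffs
```

This only works directly when `y` is an ordinary numeric vector with no missing entries, which is the situation the thread is discussing.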
Just want to remind folks that this issue should be shifted to DataArrays since it doesn't make any sense to try to define matrix ops on DataFrames.
Closing this for housekeeping, since it doesn't seem that it's a feature we want (and definitely not here).
I just stumbled over this (trying to construct a linear regression from my data):
Since c = A\data[:V].data works, I figure this could easily be fixed by adding a method that does just that, but I also noted in #368, #346 and #165 that there might be reasons not to.

If implementing this in DataFrames.jl is a bad idea, what would be the "standard" way to solve an equation system like the one above?