sometimes NA-rows are introduced #13

Closed
giuseppec opened this issue Sep 18, 2015 · 3 comments

@giuseppec
Collaborator

While fixing my bugs, I found another bug here:

library(OpenML)
setOMLConfig(arff.reader = "RWeka")
dRWeka = getOMLDataSet(1418)
setOMLConfig(arff.reader = "farff")
dFarff = getOMLDataSet(1418) # an extra NA row is introduced after each data row

str(dRWeka$data)
#'data.frame':  139 obs. of  9 variables:
# $ A1   : num  0.898 0.866 0.927 0.927 0.917 ...
# $ A2   : num  151 156 149 148 150 ...
# $ A3   : num  37 41 34 27 21 40 17 37 12 39 ...
# $ A4   : num  0.833 0.946 0.952 0.752 0.887 ...
# $ A5   : num  151.3 156.2 114.5 84.2 138.2 ...
# $ A6   : num  34 41 5 3 17 22 14 3 4 10 ...
# $ A7   : num  151 156 145 147 150 ...
# $ A8   : num  36 40 32 26 20 39 16 36 14 38 ...
# $ class: Factor w/ 2 levels "0","1": 2 2 2 2 2 2 2 2 2 2 ...

str(dFarff$data)
#'data.frame':  278 obs. of  9 variables:
# $ A1   : num  0.898 NA 0.866 NA 0.927 ...
# $ A2   : num  151 NA 156 NA 149 ...
# $ A3   : num  37 NA 41 NA 34 NA 27 NA 21 NA ...
# $ A4   : num  0.833 NA 0.946 NA 0.952 ...
# $ A5   : num  151 NA 156 NA 115 ...
# $ A6   : num  34 NA 41 NA 5 NA 3 NA 17 NA ...
# $ A7   : num  151 NA 156 NA 145 ...
# $ A8   : num  36 NA 40 NA 32 NA 26 NA 20 NA ...
# $ class: Factor w/ 2 levels "0","1": 2 NA 2 NA 2 NA 2 NA 2 NA ...
@giuseppec giuseppec added the bug label Oct 7, 2015
@giuseppec
Collaborator Author

Hint: The bug results from the blank lines in the .arff file. The farff package interprets these blank lines as missing values.
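
A possible workaround, sketched in base R only: either strip blank lines from the ARFF text before it is parsed, or drop the all-NA rows from the data frame afterwards. The helpers `strip_blank_arff_lines` and `drop_all_na_rows` and the toy data are hypothetical and made up for illustration; they are not part of farff or OpenML, and this is not how the issue was eventually resolved.

```r
# Hypothetical helper (not part of farff): remove empty or
# whitespace-only lines from the text of an ARFF file.
strip_blank_arff_lines = function(lines) {
  lines[!grepl("^[[:space:]]*$", lines)]
}

# Toy ARFF content with a blank line after each data row,
# mimicking the pattern described above (assumed, not the real file).
arff = c(
  "@RELATION toy",
  "@ATTRIBUTE A1 NUMERIC",
  "@ATTRIBUTE class {0,1}",
  "@DATA",
  "0.898,1",
  "",
  "0.866,1",
  ""
)
cleaned = strip_blank_arff_lines(arff)

# Hypothetical helper: drop rows in which every column is NA
# from an already-parsed data.frame.
drop_all_na_rows = function(df) {
  df[rowSums(!is.na(df)) > 0, , drop = FALSE]
}

# Toy data.frame with the alternating NA rows seen in str(dFarff$data).
d = data.frame(
  A1 = c(0.898, NA, 0.866, NA),
  class = factor(c("1", NA, "1", NA))
)
d2 = drop_all_na_rows(d)
```

Note that dropping all-NA rows would also remove a genuine observation whose values are all missing, so cleaning the file text is the safer of the two.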

@berndbischl
Member

the dataset has become non-public. Joaquin is mailing the structure to me, so we can test this.

@berndbischl
Member

i got the data. it seems the DATA was wrong before and has since been fixed. closing
