Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The output dimension is different from the input. #1

Closed
inoue0426 opened this issue Jun 21, 2022 · 2 comments
Closed

The output dimension is different from the input. #1

inoue0426 opened this issue Jun 21, 2022 · 2 comments

Comments

@inoue0426
Copy link

I tried to use this library for the public dataset, but it returned a different dimension.
Do you know why this occurs?

Dataset

https://www-ncbi-nlm-nih-gov.ezp3.lib.umn.edu/geo/query/acc.cgi?acc=GSE87544
GSE87544_Merged_17samples_14437cells_count.txt.gz

Environment

R.version
             _                           
platform       x86_64-pc-linux-gnu         
arch           x86_64                      
os             linux-gnu                   
system         x86_64, linux-gnu           
status                                     
major          4                           
minor          2.0                         
year           2022                        
month          04                          
day            22                          
svn rev        82229                       
language       R                           
version.string R version 4.2.0 (2022-04-22)
nickname       Vigorous Calisthenics       
packageVersion("scISR")
[1] ‘0.1.1’

Code and output

library(scISR)
data.path <- "../../data/chen/chen_data.csv"

A <- read.csv(
    data.path,
    header = FALSE
)
raw = as.matrix(A)
dim(raw)

14437 ・ 23284

imputed <- scISR(data = raw)
dim(imputed)

14419 ・ 23284

@duct317
Copy link
Owner

duct317 commented Jun 21, 2022

Hi,

Thank you very much for your interest in our package.

It seems like your input matrix is in form samples * genes. The required input for scISR is matrix of genes * samples. scISR has preprocessing step, so the number of rows/genes would be less in the output (in this case, samples). If you don't want scISR to filter anything, you can set preprocessing=F to turn the preprocessing step off.

Let's me know if you have any other problem.

@inoue0426
Copy link
Author

Hi @duct317
Thank you for your advice!
I completely misunderstood it. I'll try, thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants