Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error in .DynamicClusterCall(cl, length(cl), .fun = function(.proc_idx #188

Closed
laiajulianamu opened this issue May 14, 2024 · 23 comments
Closed

Comments

@laiajulianamu
Copy link

data <- radiator::filter_rad(
data = "SNPS1.csv",
strata = "stratayft.tsv"
)

Step 1. Minor Allele Statistics visualization and helper table

depth info not available, switching to count
Error in .DynamicClusterCall(cl, length(cl), .fun = function(.proc_idx, :
One of the nodes produced an error: Can not open file 'C:\Users\User\Desktop\gen\filter_rad_20240513@2143\01_radiator\radiator_20240513@2143.gds.rad'. The process cannot access the file because it is being used by another process.
In addition: There were 50 or more warnings (use warnings() to see the first 50)
✖ Calculating Minor Allele statistics [15.2s]

Computation time, overall: 16 sec
############################# completed filter_ma ##############################

Computation time, overall: 172 sec
############################# completed filter_rad #############################
Session info ───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────
setting value
version R version 4.3.2 (2023-10-31 ucrt)
os Windows 11 x64 (build 22631)
system x86_64, mingw32
ui RStudio
language (EN)
collate English_United States.utf8
ctype English_United States.utf8
tz America/Chicago
date 2024-05-13
rstudio 2023.12.1+402 Ocean Storm (desktop)
pandoc NA

─ Packages ───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────
package * version date (UTC) lib source
backports 1.4.1 2021-12-13 [1] CRAN (R 4.3.1)
BiocGenerics 0.48.1 2023-11-01 [1] Bioconductor
BiocManager 1.30.22 2023-08-08 [1] CRAN (R 4.3.3)
Biostrings 2.70.3 2024-03-13 [1] Bioconductor 3.18 (R 4.3.3)
bit 4.0.5 2022-11-15 [1] CRAN (R 4.3.2)
bit64 4.0.5 2020-08-30 [1] CRAN (R 4.3.2)
bitops 1.0-7 2021-04-24 [1] CRAN (R 4.3.1)
boot 1.3-30 2024-02-26 [1] CRAN (R 4.3.3)
broom 1.0.5 2023-06-09 [1] CRAN (R 4.3.2)
cachem 1.0.8 2023-05-01 [1] CRAN (R 4.3.2)
cli 3.6.2 2023-12-11 [1] CRAN (R 4.3.2)
codetools 0.2-20 2024-03-31 [1] CRAN (R 4.3.3)
colorspace 2.1-0 2023-01-23 [1] CRAN (R 4.3.2)
crayon 1.5.2 2022-09-29 [1] CRAN (R 4.3.2)
data.table 1.15.0 2024-01-30 [1] CRAN (R 4.3.2)
devtools 2.4.5 2022-10-11 [1] CRAN (R 4.3.3)
digest 0.6.35 2024-03-11 [1] CRAN (R 4.3.2)
dplyr 1.1.4 2023-11-17 [1] CRAN (R 4.3.2)
ellipsis 0.3.2 2021-04-29 [1] CRAN (R 4.3.2)
fansi 1.0.6 2023-12-08 [1] CRAN (R 4.3.2)
farver 2.1.1 2022-07-06 [1] CRAN (R 4.3.2)
fastmap 1.1.1 2023-02-24 [1] CRAN (R 4.3.2)
foreach 1.5.2 2022-02-02 [1] CRAN (R 4.3.2)
fs 1.6.3 2023-07-20 [1] CRAN (R 4.3.2)
fst 0.9.8 2022-02-08 [1] CRAN (R 4.3.3)
fstcore * 0.9.18 2023-12-02 [1] CRAN (R 4.3.3)
gdsfmt 1.38.0 2023-10-24 [1] Bioconductor
generics 0.1.3 2022-07-05 [1] CRAN (R 4.3.2)
GenomeInfoDb 1.38.8 2024-03-15 [1] Bioconductor 3.18 (R 4.3.3)
GenomeInfoDbData 1.2.11 2024-04-09 [1] Bioconductor
GenomicRanges 1.54.1 2023-10-29 [1] Bioconductor
ggplot2 3.5.0.9000 2024-04-11 [1] Github (tidyverse/ggplot2@1050f09)
glmnet 4.1-8 2023-08-22 [1] CRAN (R 4.3.3)
glue 1.7.0 2024-01-09 [1] CRAN (R 4.3.2)
gridExtra 2.3 2017-09-09 [1] CRAN (R 4.3.3)
gtable 0.3.5 2024-04-22 [1] CRAN (R 4.3.3)
HardyWeinberg 1.7.8 2024-04-06 [1] CRAN (R 4.3.3)
hms 1.1.3 2023-03-21 [1] CRAN (R 4.3.2)
htmltools 0.5.7 2023-11-03 [1] CRAN (R 4.3.2)
htmlwidgets 1.6.4 2023-12-06 [1] CRAN (R 4.3.2)
httpuv 1.6.14 2024-01-26 [1] CRAN (R 4.3.2)
IRanges 2.36.0 2023-10-24 [1] Bioconductor
iterators 1.0.14 2022-02-05 [1] CRAN (R 4.3.2)
jomo 2.7-6 2023-04-15 [1] CRAN (R 4.3.3)
labeling 0.4.3 2023-08-29 [1] CRAN (R 4.3.1)
later 1.3.2 2023-12-06 [1] CRAN (R 4.3.2)
lattice 0.21-9 2023-10-01 [1] CRAN (R 4.3.2)
lifecycle 1.0.4 2023-11-07 [1] CRAN (R 4.3.2)
lme4 * 1.1-35.3 2024-04-16 [1] CRAN (R 4.3.3)
magrittr 2.0.3 2022-03-30 [1] CRAN (R 4.3.2)
MASS 7.3-60 2023-05-04 [1] CRAN (R 4.3.2)
Matrix * 1.6-1.1 2023-09-18 [1] CRAN (R 4.3.2)
memoise 2.0.1 2021-11-26 [1] CRAN (R 4.3.2)
mice 3.16.0 2023-06-05 [1] CRAN (R 4.3.3)
mime 0.12 2021-09-28 [1] CRAN (R 4.3.1)
miniUI 0.1.1.1 2018-05-18 [1] CRAN (R 4.3.2)
minqa 1.2.6 2023-09-11 [1] CRAN (R 4.3.2)
mitml 0.4-5 2023-03-08 [1] CRAN (R 4.3.3)
munsell 0.5.1 2024-04-01 [1] CRAN (R 4.3.3)
nlme 3.1-163 2023-08-09 [1] CRAN (R 4.3.2)
nloptr 2.0.3 2022-05-26 [1] CRAN (R 4.3.2)
nnet 7.3-19 2023-05-03 [1] CRAN (R 4.3.2)
pan 1.9 2023-08-21 [1] CRAN (R 4.3.3)
pillar 1.9.0 2023-03-22 [1] CRAN (R 4.3.2)
pkgbuild 1.4.4 2024-03-17 [1] CRAN (R 4.3.3)
pkgconfig 2.0.3 2019-09-22 [1] CRAN (R 4.3.2)
pkgload 1.3.4 2024-01-16 [1] CRAN (R 4.3.2)
plyr 1.8.9 2023-10-02 [1] CRAN (R 4.3.2)
profvis 0.3.8 2023-05-02 [1] CRAN (R 4.3.2)
promises 1.2.1 2023-08-10 [1] CRAN (R 4.3.2)
purrr 1.0.2 2023-08-10 [1] CRAN (R 4.3.2)
R6 2.5.1 2021-08-19 [1] CRAN (R 4.3.2)
radiator 1.3.0 2024-04-09 [1] Github (2eb1dfa)
ragg 1.3.0 2024-03-13 [1] CRAN (R 4.3.3)
Rcpp 1.0.12 2024-01-09 [1] CRAN (R 4.3.2)
RCurl 1.98-1.14 2024-01-09 [1] CRAN (R 4.3.2)
readr 2.1.5 2024-01-10 [1] CRAN (R 4.3.2)
remotes 2.5.0 2024-03-17 [1] CRAN (R 4.3.3)
rlang 1.1.3 2024-01-10 [1] CRAN (R 4.3.2)
rpart 4.1.23 2023-12-05 [1] CRAN (R 4.3.3)
Rsolnp 1.16 2015-12-28 [1] CRAN (R 4.3.3)
rstudioapi 0.16.0 2024-03-24 [1] CRAN (R 4.3.3)
S4Vectors 0.40.2 2023-11-23 [1] Bioconductor
scales 1.3.0 2023-11-28 [1] CRAN (R 4.3.2)
SeqArray 1.42.4 2024-04-03 [1] Bioconductor 3.18 (R 4.3.3)
sessioninfo 1.2.2 2021-12-06 [1] CRAN (R 4.3.2)
shape 1.4.6.1 2024-02-23 [1] CRAN (R 4.3.2)
shiny 1.8.1.1 2024-04-02 [1] CRAN (R 4.3.3)
SNPRelate 1.36.1 2024-02-27 [1] Bioconductor 3.18 (R 4.3.2)
stringi 1.8.3 2023-12-11 [1] CRAN (R 4.3.2)
stringr 1.5.1 2023-11-14 [1] CRAN (R 4.3.2)
survival 3.5-8 2024-02-14 [1] CRAN (R 4.3.3)
systemfonts 1.0.6 2024-03-07 [1] CRAN (R 4.3.3)
textshaping 0.3.7 2023-10-09 [1] CRAN (R 4.3.2)
tibble 3.2.1 2023-03-20 [1] CRAN (R 4.3.2)
tidyr 1.3.1 2024-01-24 [1] CRAN (R 4.3.2)
tidyselect 1.2.1 2024-03-11 [1] CRAN (R 4.3.2)
truncnorm 1.0-9 2023-03-20 [1] CRAN (R 4.3.3)
tzdb 0.4.0 2023-05-12 [1] CRAN (R 4.3.2)
UpSetR 1.4.0 2019-05-22 [1] CRAN (R 4.3.3)
urlchecker 1.0.1 2021-11-30 [1] CRAN (R 4.3.2)
usethis 2.2.3 2024-02-19 [1] CRAN (R 4.3.3)
utf8 1.2.4 2023-10-22 [1] CRAN (R 4.3.2)
vctrs 0.6.5 2023-12-01 [1] CRAN (R 4.3.2)
vroom 1.6.5 2023-12-05 [1] CRAN (R 4.3.2)
withr 3.0.0 2024-01-16 [1] CRAN (R 4.3.2)
xtable 1.8-4 2019-04-21 [1] CRAN (R 4.3.2)
XVector 0.42.0 2023-10-24 [1] Bioconductor
zlibbioc 1.48.2 2024-03-13 [1] Bioconductor 3.18 (R 4.3.3)

[1] C:/Program Files/R/R-4.3.2/library

result.csv
rstrataesult.csv

@thierrygosselin
Copy link
Owner

Dear Laia, did you check the troubleshooting section ? I think this error is documented.

I'm unable to run the data you sent (even if the strata csv is converted to tsv)

@laiajulianamu
Copy link
Author

Dear Thierry,
Thank you so much for your response. Yes, I did check the the troubleshooting section and I added to my code the parallel.core =1. However, After that I am having another error

Step 1. Coverage visualization and helper table

Generating statistics
Error in UseMethod("mutate") :
no applicable method for 'mutate' applied to an object of class "list"
In addition: There were 50 or more warnings (use warnings() to see the first 50)
✖ Writing files [61ms]

I already updated tidyverse following some suggestion from google butI am still traped in this point. Sadly I was not able to upload my data base because it has more than 178 MB.
Thank you so muchchecked the troubleshooting section and added the parallel to my code,
Laia

@thierrygosselin
Copy link
Owner

Send the real data by email
I'll have a look

@thierrygosselin
Copy link
Owner

Don't share full dataset to the public

If not possible try making a smaller dataset that generate the same error so that I can reproduce

With my dataset I don't have any...

@thierrygosselin
Copy link
Owner

First test with your data works

test <- radiator::read_dart(data = "SNPS1.csv", strata = "strata1.tsv")
Reading DArT file...                                                
Number of blacklisted samples: 0                                      
DArT SNP format: alleles absence/presence in 2 Rows
fstcore package v0.9.18
(OpenMP detected, using 56 threads)
Generating genotypes and calibrating REF/ALT alleles...
Number of markers recalibrated based on counts of allele: 11707
Generating GDS...
File written: radiator_20240514@1226.gds.rad                        

Number of chrom: 1
Number of locus: 45544
Number of SNPs: 76246
Number of strata: 6
Number of individuals: 479

Number of ind/strata:
PER = 47
MEX = 99
GAL = 216
COL = 32
GM = 11
ITW = 74

Number of duplicate id: 0

Computation time, overall: 102 sec

@thierrygosselin
Copy link
Owner

Using radiator::filter_rad I do get an error, I think it's link to using DArT SNP format other than count (which I no longer use, favouring counts). I'll have a look for a fix.

This should not take long

@thierrygosselin
Copy link
Owner

Ok so I see the problem, it's during the coverage filter, when using DArT data without counts, the coverage information is based on average and the dataset provided by DArT.

I don't filter coverage when using 1 row and 2 rows provided by DArT, for several reasons that were mentioned during the workshop you attended. Simplify:
By the time I am ready to look at the dataset coverage, 100% of the time I have already discarded samples. The statistics provided by DArT becomes obviously biased because it's using the full dataset...

I don't like that so I prefer not to filter it and advertise the idea to 1. get the FQ files for good bookkeeping 2. get the count files...

I will still fix the issue not to throw an error

@laiajulianamu
Copy link
Author

Dear Thierry,

Thank you so much. Yes, I understand that I should avoid the coverage filter, however, when I tried to exclude (using filter_coverage = FALSE), the code did not run exclueded it.

Unknowned arguments identified inside "...":
filter_coverage

Thank you again

@thierrygosselin
Copy link
Owner

In the earlier days I was trying to accommodate all format, now I'm more picky. I had plan to re-integrate the coverage statistics generated by DArT for 1row and 2rows format but I slept on it and I really thing it's almost 100% of the time biased with the dataset that I have seen, for the single reason that samples will eventually get blacklisted (removed) and the stats becomes biased. Request DArT count file the next time. Now the function will bypass the coverage for those files.

Coverage statistics information for 2rows and 1row DArT format is ignored

radiator cannot generate coverage statistics from source
potential blacklisted individuals bias the statistics generated by DArT

request DArT count format from DArT


Coverate information is not available for this dataset, returning GDS...

thierrygosselin added a commit that referenced this issue May 15, 2024
* Fix issue #188 related to coverage in DArT 1row and 2rows format
@thierrygosselin
Copy link
Owner

Latest radiator version should fix your problem

@thierrygosselin
Copy link
Owner

For your info... For your dataset exploration here is the filters thresholds I used:

  1. reproducibility: I'm filtering based on the outlier statistics, it blacklist 5424 SNPs
  2. filter for common markers removes 469 SNPs
  3. blacklist samples based on missingness and outlier statistics, 32 samples are removed. At this step I don't remove samples based on heterozygosity. At the end I re-run monomorphic markers removal, this discard 3428 markers.
  4. Minor allele: I use the MAC stats and use 4 alternate alleles, 17282 SNPs are removed (you could also use MAF, but I prefer the count, MAC).
  5. filter genotyping rate based on 0.2 Max missing removes 16779 SNPs
  6. the SNP position on the read, I keep all (option 1, filter is off).
  7. filter based on SNP number on the locus. Based on your data and figures generated I do want to blacklist some locus, and I use my own threshold here. We will keep only locus with a Max of 2 SNPs. This result in blacklisting 2515 locus and 8604 SNPs
  8. filter based on short distance LD, I say yes, because the locus with 2 SNPs (about half the dataset) carries the same information and it bias stats. To filter those I'm using option 1 (based on Mac). So it keeps the SNP with max Mac which makes sense because more samples carries the info). 5049 SNPs are blacklisted. I don't filter for long distance LD with this de novo dataset.
  9. filter individuals based on heterozygosity. You really want to check the samples here because I think you might have 2 species mixed together or there was some wet lab problem (uneven tissue, too much DNA, mixed DNA, etc)... and the one that have really low heterozygosity is probably another specie if it's not correlated with missing data (small bubbles) and if the missing data is important it's usually poor DNA quality... Based on the thresholds (lower and upper) on the figures, 57 samples are removed. This sounds like a lot but if you re-run filter_rad without removing the samples with weird heterozygosity you end up creating false close-kin (it's the next filter that looks for duplicated samples, but provide the figure that shows distances). Removing those 57 samples really end up with a good dataset and highlight real relationships.

Removing

manhattan plot distance

Not removing
manhattan plot distance

  1. filtering individuals based on duplicated genomes: I'm using the 0.25 threshold here. If you have remove the problematic samples in the previous filtering steps (heterozygosity), the figure shows 3 pairs of samples from the same population that are potential duplicates, check your lab note or naming, etc. Because they are from the same pop, you can keep 3 based on missingness (keeping the samples with less missing data). So this step, will remove an additional 3 samples.
  2. Hardy Weinberg Disequilibrium filtering depends on the analysis you will do after, this is why in the folder it keeps all the different dataset so you can check its impact e.g. in population discovery analysis. I used 3 pops as threshold to discard the markers (3 sampling sites/pops need to be in disequilibrium to discard the SNPs) and the mid p value chosen here is 0.001. This is a light filtering threshold that blacklist 770 SNPS.

At the end of this first filtering exploration you have 18 427 SNPs and 387 samples

Hope this helps, re-open an issue if you are still having problems

@thierrygosselin
Copy link
Owner

radiator and dartR approaches

In case you are wondering some of the differences between radiator and dartR...

radiator

Quickly if you want to test the mixed genomes (individual heterozygosity) or duplicated genomes (duplicate samples and close kin detection) without any filters:

data <- radiator::read_dart(data = "SNPS1.csv", strata = "strata1.tsv")
 #103 sec on my computer
mix <- radiator::detect_mixed_genomes(data = data) 
#54 sec

individual heterozygosity boxplot
individual heterozygosity manhattan plot

dup <- radiator::detect_duplicate_genomes(data = data)
#47 sec

manhattan plot distance
violin plot distance

Using dartR

tictoc::tic()
data.gl <- dartR.base::gl.read.dart(filename = "SNPS1.csv", ind.metafile = "strata1_dartR.csv")
tictoc::toc()
# 167.62 sec elapsed
# longer to process and file is not saved...
tictoc::tic()
test.dartr.het <- dartR.base::gl.report.heterozygosity(x = data.gl, save2tmp = TRUE)
tictoc::toc()
# 104.738 sec elapsed

dartR_hetRplot

Conclusion for:

  • With radiator in less than 5 minutes you have the dataset imported in R and saved in the directory + 2 analysis that gives a pretty good idea of what filters will be required for your dataset and the problem encountered in the field or wet lab

  • dartR

    • A bit longer to run than radiator for similar steps. But ok.
    • After ~ 168 sec the file is still not saved. You have to do that separately.
    • I see that dartR team have upgraded a lot the package in the last 5 years with reporting important stats and new filters that I had developed. I think it's great.
    • The problem again with dartR is the format they are using. Still not using the good DArT format in my opinion.
    • I haven't checked if they have something similar to my duplicated analysis, so above this analysis was not covered with dartR and it's on me. I don't think the gl.report.parent.offspring function covers really what I want to see in this type of analysis. For parent/offspring analysis they are far better approaches. Here, it's all about duplicates and artifact samples.
    • The reporting of individual heterozygosity is welcomed but not as useful as the one made by radiator that is more tailored to highlight problematic samples... It also take x2 the times compared to radiator that as a more elegant solution.
      • With dartR the box plot figure doesn't highlight the population individual's heterozygosity that truly shows the problem.
      • The full range is not really seen
      • The outliers are not shown like the dots in radiator
      • radiator as an overall box plot that melts all the individuals/pop in one groups and this shows the full range of heterozygosity and problems...
      • Overall, I think that this statistics when done on non filtered or filtered data is more interesting when contrasted with missing data that directly affects heterozygosity in RAD datasets, this allows to see if it's a wet lab problem or a problem with 2 species... big bubbles and small bubbles are easy to analysed.
      • with the radiator:: detect_duplicate_genomes you really see the impact of letting individual heterozygosity unchecked. The impact of biased close kin or artefactual close kin is quite impressive (the pair of samples around the 0.5 threshold on the figure...)

Hope this helps highlight the differences in the 2 packages

@laiajulianamu
Copy link
Author

Dear Thierry,

It is awesome, thank you so much; I also ran dartR and am seeing the differences. Sadly, I am struggling with a new error with the radiator righ now; it sounds like it is related with SNPrate.

Computation time, overall: 241 sec
######################## completed detect_mixed_genomes ########################
################################################################################
###################### radiator::detect_duplicate_genomes ######################
################################################################################
Execution date@time: 20240515@1021
Function call and arguments stored in a file
File written: radiator_detect_duplicate_genomes_args_20240515@1021.tsv
File written: random.seed (306779)
Calculating manhattan distances between individuals...
Error in .InitFile2(cmd = "Identity-By-State (IBS) analysis on genotypes:", :
There is no SNP!
In addition: There were 50 or more warnings (use warnings() to see the first 50)

@thierrygosselin
Copy link
Owner

thierrygosselin commented May 15, 2024

Could you post the exact command that you used, I've ran the analysis 4-5 times on your data without problems
first try this:

data <- radiator::read_dart(data = "SNPS1.csv", strata = "strata1.tsv")
mix <- radiator::detect_mixed_genomes(data = data) # don't remove individuals
dup <- radiator::detect_duplicate_genomes(data = data)

does it work?

@laiajulianamu
Copy link
Author

I am running the command below following all your instructions. However, I was not able to finish with the two last filters.

data <- radiator::filter_rad(
data = "SNPS1.csv",
strata = "stratayft.tsv",
output = c("genind", "stockr", "genpop", "genlight"),
parallel.core = 1
)

Below are the lines that you asked me to run:
###################### completed detect_duplicate_genomes ######################

data <- radiator::read_dart(data = "SNPS1.csv", strata = "stratayft.tsv")
Reading DArT file...
Number of blacklisted samples: 0
DArT SNP format: alleles absence/presence in 2 Rows
Generating genotypes and calibrating REF/ALT alleles...
Number of markers recalibrated based on counts of allele: 11707
Generating GDS...
File written: radiator_20240515@1113.gds.rad

Number of chrom: 1
Number of locus: 45544
Number of SNPs: 76246
Number of strata: 6
Number of individuals: 479

Number of ind/strata:
PER = 47
MEX = 99
GAL = 216
COL = 32
GM = 11
ITW = 74

Number of duplicate id: 0

Computation time, overall: 90 sec

mix <- radiator::detect_mixed_genomes(data = data) # don't remove individuals
################################################################################
######################## radiator::detect_mixed_genomes ########################
################################################################################
Execution date@time: 20240515@1114
Folder created: 09_detect_mixed_genomes_20240515@1114
File written: radiator_detect_mixed_genomes_args_20240515@1114.tsv
Filters parameters file generated: filters_parameters_20240515@1114.tsv
Error in dplyr::mutate():
ℹ In argument: MISSING_PROP = round(...).
Caused by error in .DynamicClusterCall():
! One of the nodes produced an error: Can not open file 'C:\Users\User\Desktop\gen\read_dart_20240515@1113\radiator_20240515@1113.gds.rad'. The process cannot access the file because it is being used by another process.
Run rlang::last_trace() to see where the error occurred.

Computation time, overall: 10 sec
######################## completed detect_mixed_genomes ########################

dup <- radiator::detect_duplicate_genomes(data = data)
################################################################################
###################### radiator::detect_duplicate_genomes ######################
################################################################################
Execution date@time: 20240515@1115
Folder created: 10_detect_duplicate_genomes_20240515@1115
Function call and arguments stored in a file
File written: radiator_detect_duplicate_genomes_args_20240515@1115.tsv
File written: random.seed (747612)
Filters parameters file generated: filters_parameters_20240515@1115.tsv
Error in dplyr::mutate():
ℹ In argument: MISSING_PROP = round(...).
Caused by error in .DynamicClusterCall():
! One of the nodes produced an error: Can not open file 'C:\Users\User\Desktop\gen\read_dart_20240515@1113\radiator_20240515@1113.gds.rad'. The process cannot access the file because it is being used by another process.
Run rlang::last_trace() to see where the error occurred.

Computation time, overall: 10 sec
###################### completed detect_duplicate_genomes ######################

@thierrygosselin
Copy link
Owner

  1. Start a new session of R and R studio (if using). The problem stems from using too many times the object I think. Start fresh.
  2. install the latest radiator
  3. try re-running the functions above

@laiajulianamu
Copy link
Author

I restarted everything, but I am still having the same problem

data <- radiator::read_dart(data = "SNPS1.csv", strata = "stratayft.tsv")
Reading DArT file...
Number of blacklisted samples: 0
DArT SNP format: alleles absence/presence in 2 Rows
fstcore package v0.9.18
(OpenMP detected, using 16 threads)
Generating genotypes and calibrating REF/ALT alleles...
Number of markers recalibrated based on counts of allele: 11707
Generating GDS...
File written: radiator_20240515@1528.gds.rad

Number of chrom: 1
Number of locus: 45544
Number of SNPs: 76246
Number of strata: 6
Number of individuals: 479

Number of ind/strata:
PER = 47
MEX = 99
GAL = 216
COL = 32
GM = 11
ITW = 74

Number of duplicate id: 0

Computation time, overall: 118 sec
Warning message:
package ‘fstcore’ was built under R version 4.3.3

mix <- radiator::detect_mixed_genomes(data = data) # don't remove individuals
################################################################################
######################## radiator::detect_mixed_genomes ########################
################################################################################
Execution date@time: 20240515@1530
Folder created: 12_detect_mixed_genomes_20240515@1530
File written: radiator_detect_mixed_genomes_args_20240515@1530.tsv
Filters parameters file generated: filters_parameters_20240515@1530.tsv
Error in dplyr::mutate():
ℹ In argument: MISSING_PROP = round(...).
Caused by error in .DynamicClusterCall():
! One of the nodes produced an error: Can not open file 'C:\Users\User\Desktop\gen\read_dart_20240515@1528\radiator_20240515@1528.gds.rad'. The process cannot access the file because it is being used by another process.
Run rlang::last_trace() to see where the error occurred.

Computation time, overall: 11 sec
######################## completed detect_mixed_genomes ########################

dup <- radiator::detect_duplicate_genomes(data = data)
################################################################################
###################### radiator::detect_duplicate_genomes ######################
################################################################################
Execution date@time: 20240515@1530
Folder created: 13_detect_duplicate_genomes_20240515@1530
Function call and arguments stored in a file
File written: radiator_detect_duplicate_genomes_args_20240515@1530.tsv
File written: random.seed (622500)
Filters parameters file generated: filters_parameters_20240515@1530.tsv
Error in dplyr::mutate():
ℹ In argument: MISSING_PROP = round(...).
Caused by error in .DynamicClusterCall():
! One of the nodes produced an error: Can not open file 'C:\Users\User\Desktop\gen\read_dart_20240515@1528\radiator_20240515@1528.gds.rad'. The process cannot access the file because it is being used by another process.
Run rlang::last_trace() to see where the error occurred.

@laiajulianamu
Copy link
Author

And I am still with the same error :(
Computation time, overall: 200 sec
######################## completed detect_mixed_genomes ########################
################################################################################
###################### radiator::detect_duplicate_genomes ######################
################################################################################
Execution date@time: 20240515@1552
Function call and arguments stored in a file
File written: radiator_detect_duplicate_genomes_args_20240515@1552.tsv
File written: random.seed (891654)
Calculating manhattan distances between individuals...
Error in .InitFile2(cmd = "Identity-By-State (IBS) analysis on genotypes:", :
There is no SNP!
In addition: There were 50 or more warnings (use warnings() to see the first 50)

Computation time, overall: 1 sec
###################### completed detect_duplicate_genomes ######################

@thierrygosselin
Copy link
Owner

Write down below the output of this command: devtools::session_info()

@laiajulianamu
Copy link
Author

devtools::session_info()
─ Session info ───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────
setting value
version R version 4.3.2 (2023-10-31 ucrt)
os Windows 11 x64 (build 22631)
system x86_64, mingw32
ui RStudio
language (EN)
collate English_United States.utf8
ctype English_United States.utf8
tz America/Chicago
date 2024-05-16
rstudio 2023.12.1+402 Ocean Storm (desktop)
pandoc NA

─ Packages ───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────
package * version date (UTC) lib source
ade4 1.7-22 2023-02-06 [1] CRAN (R 4.3.3)
adegenet 2.1.10 2023-01-26 [1] CRAN (R 4.3.3)
ape 5.8 2024-04-11 [1] CRAN (R 4.3.3)
BiocManager 1.30.23 2024-05-04 [1] CRAN (R 4.3.3)
boot 1.3-30 2024-02-26 [1] CRAN (R 4.3.3)
cachem 1.0.8 2023-05-01 [1] CRAN (R 4.3.2)
class 7.3-22 2023-05-03 [1] CRAN (R 4.3.2)
classInt 0.4-10 2023-09-05 [1] CRAN (R 4.3.3)
cli 3.6.2 2023-12-11 [1] CRAN (R 4.3.2)
cluster 2.1.6 2023-12-01 [1] CRAN (R 4.3.3)
codetools 0.2-20 2024-03-31 [1] CRAN (R 4.3.3)
colorspace 2.1-0 2023-01-23 [1] CRAN (R 4.3.2)
crayon 1.5.2 2022-09-29 [1] CRAN (R 4.3.2)
dartR.base 0.88 2024-04-10 [1] Github (green-striped-gecko/dartR.base@c5ebf2a)
dartR.data 1.0.4 2024-04-10 [1] Github (green-striped-gecko/dartR.data@4f93e38)
dartRverse 0.51 2023-12-01 [1] CRAN (R 4.3.3)
data.table 1.15.0 2024-01-30 [1] CRAN (R 4.3.2)
DBI 1.2.2 2024-02-16 [1] CRAN (R 4.3.2)
devtools 2.4.5 2022-10-11 [1] CRAN (R 4.3.3)
digest 0.6.35 2024-03-11 [1] CRAN (R 4.3.2)
doParallel 1.0.17 2022-02-07 [1] CRAN (R 4.3.2)
dplyr 1.1.4 2023-11-17 [1] CRAN (R 4.3.2)
e1071 1.7-14 2023-12-06 [1] CRAN (R 4.3.3)
ellipsis 0.3.2 2021-04-29 [1] CRAN (R 4.3.2)
fansi 1.0.6 2023-12-08 [1] CRAN (R 4.3.2)
fastmap 1.1.1 2023-02-24 [1] CRAN (R 4.3.2)
foreach 1.5.2 2022-02-02 [1] CRAN (R 4.3.2)
fs 1.6.3 2023-07-20 [1] CRAN (R 4.3.2)
gdistance 1.6.4 2023-06-19 [1] CRAN (R 4.3.3)
gdsfmt 1.39.3 2024-05-15 [1] Github (zhengxwen/gdsfmt@c15b23e)
generics 0.1.3 2022-07-05 [1] CRAN (R 4.3.2)
ggplot2 3.5.1.9000 2024-05-14 [1] Github (tidyverse/ggplot2@54cbb98)
glue 1.7.0 2024-01-09 [1] CRAN (R 4.3.2)
gridExtra 2.3 2017-09-09 [1] CRAN (R 4.3.3)
gtable 0.3.5 2024-04-22 [1] CRAN (R 4.3.3)
hierfstat 0.5-11 2022-05-05 [1] CRAN (R 4.3.3)
htmltools 0.5.7 2023-11-03 [1] CRAN (R 4.3.2)
htmlwidgets 1.6.4 2023-12-06 [1] CRAN (R 4.3.2)
httpuv 1.6.14 2024-01-26 [1] CRAN (R 4.3.2)
igraph 2.0.3 2024-03-13 [1] CRAN (R 4.3.3)
iterators 1.0.14 2022-02-05 [1] CRAN (R 4.3.2)
KernSmooth 2.23-22 2023-07-10 [1] CRAN (R 4.3.2)
later 1.3.2 2023-12-06 [1] CRAN (R 4.3.2)
lattice 0.21-9 2023-10-01 [1] CRAN (R 4.3.2)
LEA 3.14.0 2023-10-24 [1] Bioconductor
lifecycle 1.0.4 2023-11-07 [1] CRAN (R 4.3.2)
magrittr 2.0.3 2022-03-30 [1] CRAN (R 4.3.2)
MASS 7.3-60 2023-05-04 [1] CRAN (R 4.3.2)
Matrix 1.6-1.1 2023-09-18 [1] CRAN (R 4.3.2)
melfuR 1.1 2024-04-06 [1] Github (pygmyperch/melfuR@5d6bafa)
memoise 2.0.1 2021-11-26 [1] CRAN (R 4.3.2)
mgcv 1.9-1 2023-12-21 [1] CRAN (R 4.3.3)
mime 0.12 2021-09-28 [1] CRAN (R 4.3.1)
miniUI 0.1.1.1 2018-05-18 [1] CRAN (R 4.3.2)
munsell 0.5.1 2024-04-01 [1] CRAN (R 4.3.3)
nlme 3.1-163 2023-08-09 [1] CRAN (R 4.3.2)
pak 0.7.2 2024-03-17 [1] CRAN (R 4.3.3)
patchwork 1.2.0 2024-01-08 [1] CRAN (R 4.3.3)
pegas 1.3 2023-12-13 [1] CRAN (R 4.3.3)
permute 0.9-7 2022-01-27 [1] CRAN (R 4.3.3)
pillar 1.9.0 2023-03-22 [1] CRAN (R 4.3.2)
pkgbuild 1.4.4 2024-03-17 [1] CRAN (R 4.3.3)
pkgconfig 2.0.3 2019-09-22 [1] CRAN (R 4.3.2)
pkgload 1.3.4 2024-01-16 [1] CRAN (R 4.3.2)
plyr 1.8.9 2023-10-02 [1] CRAN (R 4.3.2)
polysat 1.7-7 2022-08-23 [1] CRAN (R 4.3.3)
poppr 2.9.6 2024-03-15 [1] CRAN (R 4.3.3)
profvis 0.3.8 2023-05-02 [1] CRAN (R 4.3.2)
promises 1.2.1 2023-08-10 [1] CRAN (R 4.3.2)
proxy 0.4-27 2022-06-09 [1] CRAN (R 4.3.3)
purrr 1.0.2 2023-08-10 [1] CRAN (R 4.3.2)
R6 2.5.1 2021-08-19 [1] CRAN (R 4.3.2)
radiator * 1.3.3 2024-05-15 [1] Github (a2d63d5)
raster 3.6-26 2023-10-14 [1] CRAN (R 4.3.3)
Rcpp 1.0.12 2024-01-09 [1] CRAN (R 4.3.2)
remotes 2.5.0 2024-03-17 [1] CRAN (R 4.3.3)
reshape2 1.4.4 2020-04-09 [1] CRAN (R 4.3.2)
rlang 1.1.3 2024-01-10 [1] CRAN (R 4.3.2)
rstudioapi 0.16.0 2024-03-24 [1] CRAN (R 4.3.3)
scales 1.3.0 2023-11-28 [1] CRAN (R 4.3.2)
seqinr 4.2-36 2023-12-08 [1] CRAN (R 4.3.3)
sessioninfo 1.2.2 2021-12-06 [1] CRAN (R 4.3.2)
sf 1.0-16 2024-03-24 [1] CRAN (R 4.3.3)
shiny 1.8.1.1 2024-04-02 [1] CRAN (R 4.3.3)
SNPRelate 1.37.5 2024-05-15 [1] Github (zhengxwen/SNPRelate@0996a58)
sp 2.1-4 2024-04-30 [1] CRAN (R 4.3.3)
StAMPP 1.6.3 2021-08-08 [1] CRAN (R 4.3.3)
stringi 1.8.3 2023-12-11 [1] CRAN (R 4.3.2)
stringr 1.5.1 2023-11-14 [1] CRAN (R 4.3.2)
terra 1.7-71 2024-01-31 [1] CRAN (R 4.3.3)
tibble 3.2.1 2023-03-20 [1] CRAN (R 4.3.2)
tidyr 1.3.1 2024-01-24 [1] CRAN (R 4.3.2)
tidyselect 1.2.1 2024-03-11 [1] CRAN (R 4.3.2)
units 0.8-5 2023-11-28 [1] CRAN (R 4.3.3)
urlchecker 1.0.1 2021-11-30 [1] CRAN (R 4.3.2)
usethis 2.2.3 2024-02-19 [1] CRAN (R 4.3.3)
utf8 1.2.4 2023-10-22 [1] CRAN (R 4.3.2)
vctrs 0.6.5 2023-12-01 [1] CRAN (R 4.3.2)
vegan 2.6-6 2024-05-14 [1] CRAN (R 4.3.2)
xml2 1.3.6 2023-12-04 [1] CRAN (R 4.3.2)
xtable 1.8-4 2019-04-21 [1] CRAN (R 4.3.2)

[1] C:/Program Files/R/R-4.3.2/library

@laiajulianamu
Copy link
Author

Dear Thierry,

I hope you are doing great. I wonder if you have updates about my last bug. I am still having the same problem, and I definitely want to use Radiator for my data analyses.
Thank you so much,
Laia

###################### radiator::detect_duplicate_genomes ######################
################################################################################
Execution date@time: 20240531@1825
Function call and arguments stored in a file
File written: radiator_detect_duplicate_genomes_args_20240531@1825.tsv
File written: random.seed (118851)
Calculating manhattan distances between individuals...
Error in .InitFile2(cmd = "Identity-By-State (IBS) analysis on genotypes:", :
There is no SNP!
In addition: There were 50 or more warnings (use warnings() to see the first 50)

Computation time, overall: 2 sec
###################### completed detect_duplicate_genomes ######################

Computation time, overall: 412 sec
############################# completed filter_rad #############################

thierrygosselin added a commit that referenced this issue Jun 5, 2024
* Open the `parallel.core` argument for some internal functions to help windows users with parallel processing problems in R. #188
* Work around for `.DynamicClusterCall` pushed greavess #189
@thierrygosselin
Copy link
Owner

Try the last version, re-open if problem still persist

@laiajulianamu
Copy link
Author

Dear Thierry,

I am so sorry, but the problem persists. I tried with two different computers.

Computation time, overall: 21 sec
######################## completed detect_mixed_genomes ########################
################################################################################
###################### radiator::detect_duplicate_genomes ######################
################################################################################
Execution date@time: 20240605@1634
Function call and arguments stored in a file
File written: radiator_detect_duplicate_genomes_args_20240605@1634.tsv
File written: random.seed (16040)
Calculating manhattan distances between individuals...
Error in .InitFile2(cmd = "Identity-By-State (IBS) analysis on genotypes:", :
There is no SNP!
In addition: There were 50 or more warnings (use warnings() to see the first 50)

Computation time, overall: 1 sec
###################### completed detect_duplicate_genomes ######################

Computation time, overall: 307 sec
############################# completed filter_rad #############################

devtools::session_info()
─ Session info ───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────
setting value
version R version 4.3.2 (2023-10-31 ucrt)
os Windows 11 x64 (build 22631)
system x86_64, mingw32
ui RStudio
language (EN)
collate English_United States.utf8
ctype English_United States.utf8
tz America/Chicago
date 2024-06-05
rstudio 2024.04.1+748 Chocolate Cosmos (desktop)
pandoc NA

─ Packages ───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────
package * version date (UTC) lib source
ade4 1.7-22 2023-02-06 [1] CRAN (R 4.3.3)
adegenet 2.1.10 2023-01-26 [1] CRAN (R 4.3.3)
ape 5.8 2024-04-11 [1] CRAN (R 4.3.3)
backports 1.5.0 2024-05-23 [1] CRAN (R 4.3.3)
BiocGenerics 0.48.1 2023-11-01 [1] Bioconductor
BiocManager 1.30.23 2024-05-04 [1] CRAN (R 4.3.3)
Biostrings 2.70.3 2024-03-13 [1] Bioconductor 3.18 (R 4.3.3)
bit 4.0.5 2022-11-15 [1] CRAN (R 4.3.2)
bit64 4.0.5 2020-08-30 [1] CRAN (R 4.3.2)
bitops 1.0-7 2021-04-24 [1] CRAN (R 4.3.1)
boot 1.3-30 2024-02-26 [1] CRAN (R 4.3.3)
broom 1.0.6 2024-05-17 [1] CRAN (R 4.3.3)
cachem 1.0.8 2023-05-01 [1] CRAN (R 4.3.2)
class 7.3-22 2023-05-03 [1] CRAN (R 4.3.2)
classInt 0.4-10 2023-09-05 [1] CRAN (R 4.3.3)
cli 3.6.2 2023-12-11 [1] CRAN (R 4.3.2)
cluster 2.1.6 2023-12-01 [1] CRAN (R 4.3.3)
codetools 0.2-20 2024-03-31 [1] CRAN (R 4.3.3)
colorspace 2.1-0 2023-01-23 [1] CRAN (R 4.3.2)
crayon 1.5.2 2022-09-29 [1] CRAN (R 4.3.2)
curl 5.2.1 2024-03-01 [1] CRAN (R 4.3.3)
dartR.base 0.88 2024-04-10 [1] Github (green-striped-gecko/dartR.base@c5ebf2a)
dartR.data 1.0.4 2024-04-10 [1] Github (green-striped-gecko/dartR.data@4f93e38)
dartRverse 0.51 2023-12-01 [1] CRAN (R 4.3.3)
data.table 1.15.0 2024-01-30 [1] CRAN (R 4.3.2)
DBI 1.2.3 2024-06-02 [1] CRAN (R 4.3.3)
devtools 2.4.5 2022-10-11 [1] CRAN (R 4.3.3)
digest 0.6.35 2024-03-11 [1] CRAN (R 4.3.2)
doParallel 1.0.17 2022-02-07 [1] CRAN (R 4.3.2)
dplyr 1.1.4 2023-11-17 [1] CRAN (R 4.3.2)
e1071 1.7-14 2023-12-06 [1] CRAN (R 4.3.3)
ellipsis 0.3.2 2021-04-29 [1] CRAN (R 4.3.2)
fansi 1.0.6 2023-12-08 [1] CRAN (R 4.3.2)
farver 2.1.2 2024-05-13 [1] CRAN (R 4.3.3)
fastmap 1.1.1 2023-02-24 [1] CRAN (R 4.3.2)
foreach 1.5.2 2022-02-02 [1] CRAN (R 4.3.2)
fs 1.6.3 2023-07-20 [1] CRAN (R 4.3.2)
fst 0.9.8 2022-02-08 [1] CRAN (R 4.3.3)
fstcore * 0.9.18 2023-12-02 [1] CRAN (R 4.3.3)
gdistance 1.6.4 2023-06-19 [1] CRAN (R 4.3.3)
gdsfmt 1.39.3 2024-05-15 [1] Github (zhengxwen/gdsfmt@c15b23e)
generics 0.1.3 2022-07-05 [1] CRAN (R 4.3.2)
GenomeInfoDb 1.38.8 2024-03-15 [1] Bioconductor 3.18 (R 4.3.3)
GenomeInfoDbData 1.2.11 2024-04-09 [1] Bioconductor
GenomicRanges 1.54.1 2023-10-29 [1] Bioconductor
ggplot2 3.5.1 2024-04-23 [1] CRAN (R 4.3.3)
glmnet 4.1-8 2023-08-22 [1] CRAN (R 4.3.3)
glue 1.7.0 2024-01-09 [1] CRAN (R 4.3.2)
gridExtra 2.3 2017-09-09 [1] CRAN (R 4.3.3)
gtable 0.3.5 2024-04-22 [1] CRAN (R 4.3.3)
HardyWeinberg 1.7.8 2024-04-06 [1] CRAN (R 4.3.3)
hierfstat 0.5-11 2022-05-05 [1] CRAN (R 4.3.3)
hms 1.1.3 2023-03-21 [1] CRAN (R 4.3.2)
htmltools 0.5.7 2023-11-03 [1] CRAN (R 4.3.2)
htmlwidgets 1.6.4 2023-12-06 [1] CRAN (R 4.3.2)
httpuv 1.6.14 2024-01-26 [1] CRAN (R 4.3.2)
igraph 2.0.3 2024-03-13 [1] CRAN (R 4.3.3)
IRanges 2.36.0 2023-10-24 [1] Bioconductor
iterators 1.0.14 2022-02-05 [1] CRAN (R 4.3.2)
jomo 2.7-6 2023-04-15 [1] CRAN (R 4.3.3)
KernSmooth 2.23-22 2023-07-10 [1] CRAN (R 4.3.2)
labeling 0.4.3 2023-08-29 [1] CRAN (R 4.3.1)
later 1.3.2 2023-12-06 [1] CRAN (R 4.3.2)
lattice 0.21-9 2023-10-01 [1] CRAN (R 4.3.2)
LEA 3.14.0 2023-10-24 [1] Bioconductor
lifecycle 1.0.4 2023-11-07 [1] CRAN (R 4.3.2)
lme4 1.1-35.3 2024-04-16 [1] CRAN (R 4.3.3)
magrittr 2.0.3 2022-03-30 [1] CRAN (R 4.3.2)
MASS 7.3-60 2023-05-04 [1] CRAN (R 4.3.2)
Matrix 1.6-1.1 2023-09-18 [1] CRAN (R 4.3.2)
melfuR 1.1 2024-04-06 [1] Github (pygmyperch/melfuR@5d6bafa)
memoise 2.0.1 2021-11-26 [1] CRAN (R 4.3.2)
mgcv 1.9-1 2023-12-21 [1] CRAN (R 4.3.3)
mice 3.16.0 2023-06-05 [1] CRAN (R 4.3.3)
mime 0.12 2021-09-28 [1] CRAN (R 4.3.1)
miniUI 0.1.1.1 2018-05-18 [1] CRAN (R 4.3.2)
minqa 1.2.7 2024-05-20 [1] CRAN (R 4.3.3)
mitml 0.4-5 2023-03-08 [1] CRAN (R 4.3.3)
munsell 0.5.1 2024-04-01 [1] CRAN (R 4.3.3)
nlme 3.1-163 2023-08-09 [1] CRAN (R 4.3.2)
nloptr 2.0.3 2022-05-26 [1] CRAN (R 4.3.2)
nnet 7.3-19 2023-05-03 [1] CRAN (R 4.3.2)
pak 0.7.2 2024-03-17 [1] CRAN (R 4.3.3)
pan 1.9 2023-08-21 [1] CRAN (R 4.3.3)
patchwork 1.2.0 2024-01-08 [1] CRAN (R 4.3.3)
pegas 1.3 2023-12-13 [1] CRAN (R 4.3.3)
permute 0.9-7 2022-01-27 [1] CRAN (R 4.3.3)
pillar 1.9.0 2023-03-22 [1] CRAN (R 4.3.2)
pkgbuild 1.4.4 2024-03-17 [1] CRAN (R 4.3.3)
pkgconfig 2.0.3 2019-09-22 [1] CRAN (R 4.3.2)
pkgload 1.3.4 2024-01-16 [1] CRAN (R 4.3.2)
plyr 1.8.9 2023-10-02 [1] CRAN (R 4.3.2)
polysat 1.7-7 2022-08-23 [1] CRAN (R 4.3.3)
poppr 2.9.6 2024-03-15 [1] CRAN (R 4.3.3)
profvis 0.3.8 2023-05-02 [1] CRAN (R 4.3.2)
promises 1.2.1 2023-08-10 [1] CRAN (R 4.3.2)
proxy 0.4-27 2022-06-09 [1] CRAN (R 4.3.3)
purrr 1.0.2 2023-08-10 [1] CRAN (R 4.3.2)
R6 2.5.1 2021-08-19 [1] CRAN (R 4.3.2)
radiator * 1.3.4 2024-06-05 [1] Github (3a8cb4f)
ragg 1.3.2 2024-05-15 [1] CRAN (R 4.3.3)
raster 3.6-26 2023-10-14 [1] CRAN (R 4.3.3)
Rcpp 1.0.12 2024-01-09 [1] CRAN (R 4.3.2)
RCurl 1.98-1.14 2024-01-09 [1] CRAN (R 4.3.2)
readr 2.1.5 2024-01-10 [1] CRAN (R 4.3.2)
remotes 2.5.0 2024-03-17 [1] CRAN (R 4.3.3)
reshape2 1.4.4 2020-04-09 [1] CRAN (R 4.3.2)
rlang 1.1.3 2024-01-10 [1] CRAN (R 4.3.2)
rpart 4.1.23 2023-12-05 [1] CRAN (R 4.3.3)
Rsolnp 1.16 2015-12-28 [1] CRAN (R 4.3.3)
rstudioapi 0.16.0 2024-03-24 [1] CRAN (R 4.3.3)
S4Vectors 0.40.2 2023-11-23 [1] Bioconductor
scales 1.3.0 2023-11-28 [1] CRAN (R 4.3.2)
SeqArray 1.42.4 2024-04-03 [1] Bioconductor 3.18 (R 4.3.3)
seqinr 4.2-36 2023-12-08 [1] CRAN (R 4.3.3)
sessioninfo 1.2.2 2021-12-06 [1] CRAN (R 4.3.2)
sf 1.0-16 2024-03-24 [1] CRAN (R 4.3.3)
shape 1.4.6.1 2024-02-23 [1] CRAN (R 4.3.2)
shiny 1.8.1.1 2024-04-02 [1] CRAN (R 4.3.3)
SNPRelate 1.37.5 2024-05-15 [1] Github (zhengxwen/SNPRelate@0996a58)
sp 2.1-4 2024-04-30 [1] CRAN (R 4.3.3)
StAMPP 1.6.3 2021-08-08 [1] CRAN (R 4.3.3)
stringi 1.8.3 2023-12-11 [1] CRAN (R 4.3.2)
stringr 1.5.1 2023-11-14 [1] CRAN (R 4.3.2)
survival 3.6-4 2024-04-24 [1] CRAN (R 4.3.3)
systemfonts 1.1.0 2024-05-15 [1] CRAN (R 4.3.3)
terra 1.7-71 2024-01-31 [1] CRAN (R 4.3.3)
textshaping 0.4.0 2024-05-24 [1] CRAN (R 4.3.3)
tibble 3.2.1 2023-03-20 [1] CRAN (R 4.3.2)
tidyr 1.3.1 2024-01-24 [1] CRAN (R 4.3.2)
tidyselect 1.2.1 2024-03-11 [1] CRAN (R 4.3.2)
truncnorm 1.0-9 2023-03-20 [1] CRAN (R 4.3.3)
tzdb 0.4.0 2023-05-12 [1] CRAN (R 4.3.2)
units 0.8-5 2023-11-28 [1] CRAN (R 4.3.3)
UpSetR 1.4.0 2019-05-22 [1] CRAN (R 4.3.3)
urlchecker 1.0.1 2021-11-30 [1] CRAN (R 4.3.2)
usethis 2.2.3 2024-02-19 [1] CRAN (R 4.3.3)
utf8 1.2.4 2023-10-22 [1] CRAN (R 4.3.2)
vctrs 0.6.5 2023-12-01 [1] CRAN (R 4.3.2)
vegan 2.6-6 2024-05-14 [1] CRAN (R 4.3.2)
viridisLite 0.4.2 2023-05-02 [1] CRAN (R 4.3.2)
vroom 1.6.5 2023-12-05 [1] CRAN (R 4.3.2)
withr 3.0.0 2024-01-16 [1] CRAN (R 4.3.2)
xml2 1.3.6 2023-12-04 [1] CRAN (R 4.3.2)
xtable 1.8-4 2019-04-21 [1] CRAN (R 4.3.2)
XVector 0.42.0 2023-10-24 [1] Bioconductor
zlibbioc 1.48.2 2024-03-13 [1] Bioconductor 3.18 (R 4.3.3)

The code that you sent to me:
data <- radiator::read_dart(data = "SNPS1.csv", strata = "stratayft.tsv")
Reading DArT file...
Number of blacklisted samples: 0
DArT SNP format: alleles absence/presence in 2 Rows
Generating genotypes and calibrating REF/ALT alleles...
Number of markers recalibrated based on counts of allele: 11707
Generating GDS...
File written: radiator_20240605@1636.gds.rad

Number of chrom: 1
Number of locus: 45544
Number of SNPs: 76246
Number of strata: 6
Number of individuals: 479

Number of ind/strata:
PER = 47
MEX = 99
GAL = 216
COL = 32
GM = 11
ITW = 74

Number of duplicate id: 0

Computation time, overall: 112 sec

mix <- radiator::detect_mixed_genomes(data = data) (parallel.core = 1) # don't remove individuals
################################################################################
######################## radiator::detect_mixed_genomes ########################
################################################################################
Execution date@time: 20240605@1639
Folder created: 01_detect_mixed_genomes_20240605@1639
File written: radiator_detect_mixed_genomes_args_20240605@1639.tsv
Filters parameters file generated: filters_parameters_20240605@1639.tsv
Error in dplyr::mutate():
ℹ In argument: MISSING_PROP = round(...).
Caused by error in .DynamicClusterCall():
! One of the nodes produced an error: Can not open file 'C:\Users\User\Desktop\gen\read_dart_20240605@1636\radiator_20240605@1636.gds.rad'. The process cannot access the file because it is being used by another process.
Run rlang::last_trace() to see where the error occurred.

Computation time, overall: 10 sec
######################## completed detect_mixed_genomes ########################

dup <- radiator::detect_duplicate_genomes(data = data)
################################################################################
###################### radiator::detect_duplicate_genomes ######################
################################################################################
Execution date@time: 20240605@1639
Folder created: 02_detect_duplicate_genomes_20240605@1639
Function call and arguments stored in a file
File written: radiator_detect_duplicate_genomes_args_20240605@1639.tsv
File written: random.seed (135945)
Filters parameters file generated: filters_parameters_20240605@1639.tsv
Error in dplyr::mutate():
ℹ In argument: MISSING_PROP = round(...).
Caused by error in .DynamicClusterCall():
! One of the nodes produced an error: Can not open file 'C:\Users\User\Desktop\gen\read_dart_20240605@1636\radiator_20240605@1636.gds.rad'. The process cannot access the file because it is being used by another process.
Run rlang::last_trace() to see where the error occurred.

Computation time, overall: 8 sec

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants