rifiComparative #2718

lyoussar · 2022-07-15T07:57:13Z

Update the following URL to point to the GitHub repository of
the package you wish to submit to Bioconductor

Repository: https://github.com/CyanolabFreiburg/rifiComparative

Confirm the following by editing each check box to '[x]'

I understand that by submitting my package to Bioconductor,
the package source and all review commentary are visible to the
general public.
I have read the Bioconductor Package Submission
instructions. My package is consistent with the Bioconductor
Package Guidelines.
I understand Bioconductor Package Naming Policy and acknowledge
Bioconductor may retain use of package name.
I understand that a minimum requirement for package acceptance
is to pass R CMD check and R CMD BiocCheck with no ERROR or WARNINGS.
Passing these checks does not result in automatic acceptance. The
package will then undergo a formal review and recommendations for
acceptance regarding other Bioconductor standards will be addressed.
My package addresses statistical or bioinformatic issues related
to the analysis and comprehension of high throughput genomic data.
I am committed to the long-term maintenance of my package. This
includes monitoring the support site for issues that users may
have, subscribing to the bioc-devel mailing list to stay aware
of developments in the Bioconductor community, responding promptly
to requests for updates from the Core team in response to changes in
R or underlying software.
I am familiar with the Bioconductor code of conduct and
agree to abide by it.

I am familiar with the essential aspects of Bioconductor software
management, including:

The 'devel' branch for new packages and features.
The stable 'release' branch, made available every six
months, for bug fixes.
Bioconductor version control using Git
(optionally via GitHub).

For questions/help about the submission process, including questions about
the output of the automatic reports generated by the SPB (Single Package
Builder), please use the #package-submission channel of our Community Slack.
Follow the link on the home page of the Bioconductor website to sign up.

bioc-issue-bot · 2022-07-15T07:57:16Z

Hi @lyoussar

Thanks for submitting your package. We are taking a quick
look at it and you will hear back from us soon.

The DESCRIPTION file for this package is:

Package: rifiComparative
Title: 'rifiComparative' compares the output of rifi from 2 different conditions
Version: 0.99.0
Authors@R: c(person("Loubna", "Youssar",
      email = "loubna.youssar@biologie.uni-freiburg.de",
      role = c("aut", "cre")),
      person("Jens", "Georg",
      email = "jens.georg@biologie.uni-freiburg.de",
      role = "aut", "cre"))
Description: 'rifiComparative' is a continuation of rifi package. It compares two conditions output of rifi using half-life and mRNA at time 0 segments. As an input for the segmentation, the difference between half-life of both condtions and log2FC of the mRNA at time 0 are used. The segmentation 'rifiComparative' provides segmentation, statistics, summary table, fragments visualization and some additional useful plots for further anaylsis. 
Depends:
	R (>= 4.1)
Imports:
	cowplot,
	doMC,
	parallel,
	dplyr,
	egg,
	foreach,
	ggplot2,
	ggrepel,
	graphics,
	grDevices,
	grid,
	methods,
	nnet,
	rlang,
	S4Vectors,
	scales,
	stats,
	stringr,
	tibble,
	rtracklayer,
	utils,
	writexl,
	DTA,
	LSD,
	reshape2,
	SummarizedExperiment
Suggests:
	DescTools,
	ggrepel,
	knitr,
	rmarkdown,
	BiocStyle
VignetteBuilder: knitr	
biocViews: RNASeq, DifferentialExpression, GeneRegulation, Transcriptomics, Microarray, Software
BugReports: https://github.com/CyanolabFreiburg/rifiComparative
License: GPL-3 + file LICENSE
Encoding: UTF-8
Roxygen: list(markdown = TRUE)
RoxygenNote: 7.1.12
Language: en-US

vjcitn · 2022-07-15T12:52:40Z

pre-check is passed but think about user effort to do

df_comb_minimal[,"distance_int"] <- df_comb_minimal[,"logFC_int"]

pen_int <- make_pen(
    probe = df_comb_minimal,
    FUN = rifiComparative:::fragment_inty_pen,
    cores = 60,
    logs = as.numeric(rep(NA, 8)),
    dpt = 1,
    smpl_min = 10,
    smpl_max = 50,
    sta_pen = 0.5,
    end_pen = 4.5,
    rez_pen = 9,
    sta_pen_out = 0.5,
    end_pen_out = 3.5,
    rez_pen_out = 7
)

from vignette. are the defaults good enough?

lyoussar · 2022-07-15T13:26:19Z

So far the defaults parameters were used in several organisms without any need for adjustment. Nevertheless I made the vignette simpler and I changed the cores number.

bioc-issue-bot · 2022-07-22T18:12:06Z

A reviewer has been assigned to your package. Learn what to expect
during the review process.

IMPORTANT: Please read this documentation for setting
up remotes to push to git.bioconductor.org. It is required to push a
version bump to git.bioconductor.org to trigger a new build.

Bioconductor utilized your github ssh-keys for git.bioconductor.org
access. To manage keys and future access you may want to active your
Bioconductor Git Credentials Account

bioc-issue-bot · 2022-07-22T18:25:15Z

Dear Package contributor,

This is the automated single package builder at bioconductor.org.

Your package has been built on Linux, Mac, and Windows.

On one or more platforms, the build results were: "ERROR".
This may mean there is a problem with the package that you need to fix.
Or it may mean that there is a problem with the build system itself.

Please see the build report for more details. This link will be active
for 21 days.

Remember: if you submitted your package after July 7th, 2020,
when making changes to your repository push to
git@git.bioconductor.org:packages/rifiComparative to trigger a new build.
A quick tutorial for setting up remotes and pushing to upstream can be found here.

lyoussar · 2022-07-22T18:34:08Z

The package could not be built in Windows. I added a file called .BBSoptions
Thanks

lshep · 2022-07-22T20:05:09Z

Please also correct the ERROR in the build report and trigger a new report Get Outlook for iOS<https://aka.ms/o0ukef>

…

________________________________ From: Loubna Youssar ***@***.***> Sent: Friday, July 22, 2022 2:34:20 PM To: Bioconductor/Contributions ***@***.***> Cc: Subscribed ***@***.***> Subject: Re: [Bioconductor/Contributions] rifiComparative (Issue #2718) The package could not be built in Windows. I added a file called .BBSoptions Thanks — Reply to this email directly, view it on GitHub<https://secure-web.cisco.com/1i5Qb0HBWgnxB6AHqOr6aM7TgXq36TUV-RtcDpdIYkhmJZHfQleHaFFsnvXHPH0ZtN10bl5_oCnC-WoAGBwhuspP0iaXAtIE1E8iKbyeVgpz65VUStMecfon1OtVBJqZk76Xq3aK2tqItT9sBpES1-02Dd2OyFmzJzZIhNN-ys31O3_K1fjHEO4S-ia8mprTXC8dgK297V75MhLTYufnknOZo-NxwoEHZtOaP5fyiFjeoMLObu-19oMfeZpZIvhYBQP4bNUtAfNwhbkeompwjixtGmxe9IZRV2r74lnq4BwlQXGasinchW6hzUsGXMHrZ/https%3A%2F%2Fgithub.com%2FBioconductor%2FContributions%2Fissues%2F2718%23issuecomment-1192838276>, or unsubscribe<https://secure-web.cisco.com/11El2001sxMIcvYarqELvfpZVbSvksaJxsWQwqip3liSQOk4HytFAQ6Qjbgx8qsrB2BXD0EWcyFSFszE026J55PGP_Fw_SyHEk6MFUp_zitkLATBC7Q70yGC2j5_g0fH3SUSzHYg2Iuhm65e1IDRJiK1s9tdsxWU0oIxy0HSz-QVML7Uq-oj7bUt4j5t_2TIY-gVyUYnYUUHfaOYCDRH1RP9x5qBIzeWogVSPk4BWsqjAoZ-lajDoxoo1x2YdRONAATCFXcs55BjQxQLBeBGcjaIxnx_Sf1-P76Wzz9znAyMC9s-1SE5AyN-aWCYsKJbz/https%3A%2F%2Fgithub.com%2Fnotifications%2Funsubscribe-auth%2FAEO3MHHH2MJVORQQYIZTIMTVVLSSZANCNFSM53UXKVJQ>. You are receiving this because you are subscribed to this thread.Message ID: ***@***.***> This email message may contain legally privileged and/or confidential information. If you are not the intended recipient(s), or the employee or agent responsible for the delivery of this message to the intended recipient(s), you are hereby notified that any disclosure, copying, distribution, or use of this email message is prohibited. If you have received this message in error, please notify the sender immediately by e-mail and delete this email message from your computer. Thank you.

lyoussar · 2022-07-22T20:44:05Z

I am subscribed to both lists:
https://support.bioconductor.org/accounts/signup/
https://stat.ethz.ch/mailman/listinfo/bioc-devel
I wonder why do I get these errors!? I tried to subscribe again just in case of and I ve got the email below confirming my subscription.
An attempt was made to subscribe your address to the mailing list
bioc-devel@r-project.org. You are already subscribed to this mailing list.

PS: all package developer should be subscribed? I guess only the maintenair right? My email address as maintenair appears as last line in READ.ME

lyoussar · 2022-07-25T07:39:00Z

Its solved, no errors neither warnings so far.

HelenaLC · 2022-08-02T10:53:46Z

First round of comments below! Note that many are suggestions and not strictly
required (e.g., "consider..."). Please comment back here with what has/not been
addressed how/why not, or any comments regarding any needed clarification or
questions you might have; happy to help/discuss. Cheers!

documentation

Typo in make_pen.r line 47: vetor -> vector
Also in make_pen.r: Documentation says default of dpt is 2, but it's 1.
Generally, @importFrom is encouraged over importing an entire package
with @import. If there are many functions from a single package,
@import can be acceptable. However, you are currently importing all of
dplyr,ggplot2,SummarizedExperiment,scales,writexl,DRA,ggrepel,LSD,
which would definitely be worth reconsidering to avoid namespace conflicts.
The elements and data structure of example data is well described,
which is appreciated! However, please also include details on the
source information, or how the example data was generated.
Regarding function .man (help) pages, I think there's lots of ill formatting
due to wrong usage of roxygen2. To pick just one example ?penalties has
the function description as a header. Instead, this should go in @description
or under @details with a short, descriptive title under @title
(see here for details);
here's one example (but all help pages I checked looked somewhat off):

#' @rdname make_pen
#' @title Penalty assignment
#' @description
#'   `make_pen` automatically assigns a penalties.
#'   `make_pen` calls one of four available penalty functions
#'   to automatically assign penalties for the dynamic programming. 
#'   The two functions to be called are: ...

vignette

Please use a less generic name (i.e., not vignette.Rmd)
to make less ambiguous to locate with, e.g., vignette()
line 55: resultinf -> resulting
Throughout the vignette, I'd suggest
data frame and dataframe -> `data.frame`
To highlight / distinguish code from normal text,
I'd recommend consistently using back-ticks.
Similarly, to direct users to relevant external packages
and highlight/make them distinguishable from general code,
you can hyperlink them via BiocStyle::CRANpkg("package_name")
(or Githubpkg or Biocpkg, depending on where the package sits).
Code chunks "make_pen HL" and "make_pen int" have eval = FALSE
but fail because argument logs does not have a default value (see below).

# make_pen() fails with:
Error in logs[...: 
object of type 'symbol' is not subsettable

# because of this (where logs is undefined):
logs[c(
  paste0(names(tmp)[1], "_penalty"),
  paste0(names(tmp)[1], "_outlier_penalty")
)] <- c(res1[1], res1[2])

code

In plot_histogram.r, avoid sapply() and use vapply() instead.
In empty_data_negative.r, avoid print() and use message() instead,
ideally with a verbose = FALSE/TRUE function argument, so that users have
control over whether or not additional information is printed to the console.
figures_fun() currently writes all output plots to the current
working directory, with fixed files names (specified internally). Instead,
I'd encourage exposing a dir (directory) argument so that users can specify
where plots should be saved to. In addition, it would be nice having an option
dir = NULL where plots are returned as R objects (ggplots) that can be
saved to a variable, modified further, and saved in a custom way.
Related to the above, it would also be nice to export the different
plotting functions (with separate examples) such that users can generate
plots of interest rather than having to generate all of them figure_fun().
In plot_scatter.r, geom_smooth() gives a message
that can be avoided by specifying formula = y ~ x.
In plot_histogram.r, reshape2::melt() gives a message
that can be avoided by specifying id = "category".
Some function examples are quite costly to run (2min+ on my laptop).
Not sure if the example data size could be decreased, but in any case,
it might be mostly due to inefficient implementation.
I'd highly recommend reading up on Robust and Efficient R Code.
E.g., there's a 3-level nested for-loop in make_pen() that seams worth
re-implementing, especially as its called repeatedly by other functions.

other

Consider adding a NEWS file to track package updates;
these will also be included in Bioconductor release announcements.
Consider adding unit tests; we strongly encourage them!
See https://contributions.bioconductor.org/tests.html
Consider adding a package .man page (i.e., ?rifiComparative)
so users would be directed to the main functions of the package.
The raw package directory should not contain unnecessary files, system
files, or hidden files such as *.Rproj. These files may be present in your
local directory but should not be commited to git (e.g., using .gitignore).

lyoussar · 2022-08-03T11:18:51Z

First round of comments below! Note that many are suggestions and not strictly required (e.g., "consider..."). Please comment back here with what has/not been addressed how/why not, or any comments regarding any needed clarification or questions you might have; happy to help/discuss. Cheers!

Thanks for your comments, many of them are very helpful. I tried to address all questions/suggestions including or correcting issues. [x] means solved/corrected.
Best,
Loubna

documentation

Typo in make_pen.r line 47: vetor -> vector

Also in make_pen.r: Documentation says default of dpt is 2, but it's 1.

Generally, @importFrom is encouraged over importing an entire package
with @import. If there are many functions from a single package,
@import can be acceptable. However, you are currently importing all of
dplyr,ggplot2,SummarizedExperiment,scales,writexl,DRA,ggrepel,LSD,
which would definitely be worth reconsidering to avoid namespace conflicts.

You are right, once many functions are from the same package, they are imported as importFrom e.g. “utils write.table read.delim2 capture.output” but the imported packages are individuals.

*[x] The elements and data structure of example data is well described,
which is appreciated! However, please also include details on the
source information, or how the example data was generated.

  **Thanks for this comment. The source information is indicated in ReadMe and on the vignette with the corresponding link of the package where the data was generated 
  “RifiComparative is a workflow for a comparative data, output of Rifi framework (https://www.bioconductor.org/packages/release/bioc/html/rifi.html”).
  Now I changed the link of github to bioconductor.**

Regarding function .man (help) pages, I think there's lots of ill formatting
due to wrong usage of roxygen2. To pick just one example ?penalties has
the function description as a header. Instead, this should go in @description
or under @details with a short, descriptive title under @title
(see here for details);
here's one example (but all help pages I checked looked somewhat off):
#' @rdname make_pen
#' @title Penalty assignment
#' @description
#'   `make_pen` automatically assigns a penalties.
#'   `make_pen` calls one of four available penalty functions
#'   to automatically assign penalties for the dynamic programming. 
#'   The two functions to be called are: ...
Yes, you are right. There were some odd documentations. I revised and set all of them making a convenient structure for the user.

vignette

Please use a less generic name (i.e., not vignette.Rmd)
to make less ambiguous to locate with, e.g., vignette()
I did not get this point, sorry!

line 55: resultinf -> resulting

Throughout the vignette, I'd suggest
data frame and dataframe -> data.frame
I used the ... to indicate functions and it might be confusing if I use it somewhere else. However I make data frame all homogeneous with space in the middle.

To highlight / distinguish code from normal text,
I'd recommend consistently using back-ticks.
Do you refer to the vignette? All codes are between the 3 back-ticks or do you mean something else?

Similarly, to direct users to relevant external packages
and highlight/make them distinguishable from general code,
you can hyperlink them via BiocStyle::CRANpkg("package_name")
(or Githubpkg or Biocpkg, depending on where the package sits).
Could you please specify where the changes should be applied?

Code chunks "make_pen HL" and "make_pen int" have eval = FALSE
but fail because argument logs does not have a default value (see below).
# make_pen() fails with:
Error in logs[...: 
object of type 'symbol' is not subsettable

# because of this (where logs is undefined):
logs[c(
  paste0(names(tmp)[1], "_penalty"),
  paste0(names(tmp)[1], "_outlier_penalty")
)] <- c(res1[1], res1[2])
That true, I added now and it runs like a charm.

code

In plot_histogram.r, avoid sapply() and use vapply() instead.
Vapply does not provide the output I need after many tries, sapply in this case is saver

In empty_data_negative.r, avoid print() and use message() instead,
ideally with a verbose = FALSE/TRUE function argument, so that users have
control over whether or not additional information is printed to the console.

[] figures_fun() currently writes all output plots to the current
working directory, with fixed files names (specified internally). Instead,
I'd encourage exposing a dir (directory) argument so that users can specify
where plots should be saved to. In addition, it would be nice having an option
dir = NULL where plots are returned as R objects (ggplots) that can be
saved to a variable, modified further, and saved in a custom way.

Related to the above, it would also be nice to export the different
plotting functions (with separate examples) such that users can generate
plots of interest rather than having to generate all of them figure_fun().
The user can always run each plot separately just by reading the documentation of figure_fun(). He can still select at the end the plots he need, they are not computationally expensive and generated in less than a minutes independently how big are the data.

In plot_scatter.r, geom_smooth() gives a message
that can be avoided by specifying formula = y ~ x.
They are suppressed by now.

In plot_histogram.r, reshape2::melt() gives a message
that can be avoided by specifying id = "category".
I tried it before submitting and did not work

Some function examples are quite costly to run (2min+ on my laptop).
Not sure if the example data size could be decreased, but in any case,
it might be mostly due to inefficient implementation.
I'd highly recommend reading up on Robust and Efficient R Code.
E.g., there's a 3-level nested for-loop in make_pen() that seams worth
re-implementing, especially as its called repeatedly by other functions.
There are two functions which are computationally expensive. Penalties and fragmentation. These two functions make many iterations to find the best model so it would be impossible to make a smaller example as it will not show a reasonable output.

other

Consider adding a NEWS file to track package updates;
these will also be included in Bioconductor release announcements.

Consider adding unit tests; we strongly encourage them!
See https://contributions.bioconductor.org/tests.html

Consider adding a package .man page (i.e., ?rifiComparative)
so users would be directed to the main functions of the package.
I added this file but so far I can not make it functional!?

The raw package directory should not contain unnecessary files, system
files, or hidden files such as *.Rproj. These files may be present in your
local directory but should not be committed to git (e.g., using .gitignore).

HelenaLC · 2022-08-05T07:23:33Z

Thanks for the speedy action. I can see changes have been made in the package's GH repo, however, in order for these to go through to Bioc, you need to bump the package version in the DESCRIPTION file (i.e., from 0.99.1 to 0.99.2); thanks!

lyoussar · 2022-08-05T07:27:13Z

Thanks and done!

HelenaLC · 2022-08-05T07:48:13Z

Again, I can see this on your GH. However, if a build has been triggered, you'd see a message here from the bioc-issue-bot, which is not the case. You also need to push these changes upstream to the upstream Bioc remote (not only your personal GH remote). See here for a quick tutorial on setting up remotes and pushing to upstream.

lyoussar · 2022-08-05T09:30:08Z

I followed the instructions. I forked from the home repository (CyanolabFreiburg) to mine (lyoussar) and added a remote named upstream to my package.

origin https://github.com/CyanolabFreiburg/rifi_comparative_data.git (fetch)
origin https://github.com/CyanolabFreiburg/rifi_comparative_data.git (push)
upstream git@git.bioconductor.org:packages/git@github.com:lyoussar/rifiComparative.git (fetch)
upstream git@git.bioconductor.org:packages/git@github.com:lyoussar/rifiComparative.git (push)

Please let me know if that is enough/correct.

Thanks!

HelenaLC · 2022-08-06T09:14:59Z

Yes, that all looks correct. Now, when you want to make changes, you should push to both the origin and upstream remote. I.e., commit -m “…” — push origin — push master.

lyoussar · 2022-08-06T09:46:15Z

Great, Thanks!

HelenaLC · 2022-08-12T06:58:10Z

Did you try doing this? The package being built here is still v0.99.0 whereas your Github repo DESCRIPTION has v0.99.2.

lyoussar · 2022-08-12T07:10:51Z

I just built the package again with the last version and I get 0 errors, 0 warnings and few notes. Is that what you referring to?

HelenaLC · 2022-08-12T07:30:38Z

Hm. I can only see a single build report from 21 days ago here. Did you push upstream as explained here? I.e., something like

git add ...
git commit -m "..."
git push origin   # push to your Github repo
git push upstream # push to the Bioc server

lyoussar · 2022-08-12T09:39:36Z

I have being struggling the last 2 hours to find a solution to that:

git fetch --all
Fetching origin
Fetching upstream
git@git.bioconductor.org: Permission denied (publickey).
fatal: Could not read from remote repository.

Please make sure you have the correct access rights
and the repository exists.
error: Could not fetch upstream

This is the remote:

git remote -v
origin https://github.com/CyanolabFreiburg/rifi_comparative_data.git (fetch)
origin https://github.com/CyanolabFreiburg/rifi_comparative_data.git (push)
upstream git@git.bioconductor.org:packages/git@github.com:lyoussar/rifiComparative.git (fetch)
upstream git@git.bioconductor.org:packages/git@github.com:lyoussar/rifiComparative.git (push)

Reading somewhere on the email list, I found out this answer from a reviewer:
@nturaga I think this is because I changed the contents of https://github.com/kozo2.keys after creating this issue. Could you please check the SSH public key registration again?

Could u help me please? is it related to bioconductor? me?

Thanks

lshep · 2022-08-12T11:31:07Z

Keys should update automatically once changed.

It looks like your upstream is set incorrectly.

Yours is currently set:
git@git.bioconductor.org:packages/git@github.com:lyoussar/rifiComparative.git

Where it has an extra git@github.com:lyoussar

Please do

git remote remove upstream

git remote add upstream git@bioconductor.org:packages/rifiComparative.git
git remote fetch --all

and try again.

If you still have trouble please remember to activate your GitCredentials account and perhaps update with a new ssh-key.

lyoussar · 2022-08-17T07:07:49Z

I set a new key and activate the GitCredentials, I built again the package and I am still not getting access, I wonder what I do miss.

git remote -v
origin git@github.com:lyoussar/rifiComparative.git (fetch)
origin git@github.com:lyoussar/rifiComparative.git (push)
upstream git@git.bioconductor.org:packages/rifiComparative.git (fetch)
upstream git@git.bioconductor.org:packages/rifiComparative.git (push)

git fetch --all
Fetching origin
ECDSA host key for IP address '140.82.121.36' not in list of known hosts.
Fetching upstream
git@git.bioconductor.org: Permission denied (publickey).
fatal: Could not read from remote repository.

Please make sure you have the correct access rights
and the repository exists.
error: Could not fetch upstream

HelenaLC · 2022-10-06T10:46:22Z

Ok, I believe all essential changes have been made. Nevertheless, I kindly ask you to consider the comments below. Please bare in mind that throughout development - that is, also after acceptance and the package being available through Bioc - you will need to update both the origin and upstream branch (with a version bump) for changes to go through to Bioconductor and be available to users upon installation using BiocManager::install().

Please remember to update the NEWS file to track package changes and updates. Currently, only version 0.99.1 is listed, though the package is currently at 0.99.4.
Regarding parameter defaults in function documentation: Again, I don't understand your response; could you clarify? My suggestion was not to alter the default parameter values or function definition, but to remove the listing of these from the man page to be more brief and avoid redundancy (since these values can be seen under the usage section). E.g., in make_pen's smpl_min: "integer: the smaller end of the sampling size. (Default is 10. -> drop this)". Secondly, as mentioned previously, I would ask you to elaborate on i) how these defaults were selected; and, ii) what impact lower/higher values might have or on what scale these should be etc., since these values seem fairly arbitrary otherwise and it is not clear for the user what effect changes to the defaults will have.
Regarding unit tests: I don't see how the fact that rifiComparative is a successor of rifi justifies omitting unit tests. However, as mentioned previously, though highly encouraged, these are optional; so okay.
Regarding code highlighting in the vignette: I don't understand what this has to do with rifiComparative being a successor of rifi. I simply suggested to use single back ticks around variable, function, package names etc. to distinguish R code from normal text. At the moment, the code-style is / is not being used inconsistently, and using it throughout would improve overall readability and understandably of the vignette. For example:

current:

Same as rifi workflow, to get the best segmentation we need the optimal penalties. To calculate HL penalty, the difference between half-life from both conditions is calculated and added as distance_HL variable to df_comb_minimal data frame. On other hand the logFC_int is used to assign penalties for intensity values and added as distance_int variable. df_comb_minimal with the additional variables is named penalties_df.

proposition:

Same as `r BiocStyle::Biocpkg("rifi")` workflow, to get the best segmentation we need the optimal penalties. To calculate HL penalty, the difference between half-life from both conditions is calculated and added as `distance_HL` variable to `df_comb_minimal` data frame. On other hand the `logFC_int` is used to assign penalties for intensity values and added as `distance_int` variable. `df_comb_minimal` with the additional variables is named `penalties_df`.

lyoussar · 2022-10-08T19:50:21Z

Please remember to update the NEWS file to track package changes and updates. Currently, only version 0.99.1 is listed, though the package is currently at 0.99.4.
Done

Regarding parameter defaults in function documentation: Again, I don't understand your response; could you clarify? My suggestion was not to alter the default parameter values or function definition, but to remove the listing of these from the man page to be more brief and avoid redundancy (since these values can be seen under the usage section). E.g., in make_pen's smpl_min: "integer: the smaller end of the sampling size. (Default is 10. -> drop this)". Secondly, as mentioned previously, I would ask you to elaborate on i) how these defaults were selected; and, ii) what impact lower/higher values might have or on what scale these should be etc., since these values seem fairly arbitrary otherwise and it is not clear for the user what effect changes to the defaults will have.

The first part is corrected, all defaults values were removed.
The second part: I did answer this question before but seems I did not make it clear. make_pen is a function from rifi package, many values were tested on different organisms and different sequences platform before getting to the default values. The user can use the default values unless if he wishes to have a different result, in this case, genome segmentation then he can read all details on the vignette cited on rifi package.
P.S: rifiComparative could not be used before using rifi as the rifi outputs are the rifiComparative inputs. I assume the user has a clear idea about the rifi steps prior running rifiComparative.

Regarding code highlighting in the vignette: I don't understand what this has to do with rifiComparative being a successor of rifi. I simply suggested to use single back ticks around variable, function, package names etc. to distinguish R code from normal text. At the moment, the code-style is / is not being used inconsistently, and using it throughout would improve overall readability and understandably of the vignette. For example:

current:

Same as rifi workflow, to get the best segmentation we need the optimal penalties. To calculate HL penalty, the difference between half-life from both conditions is calculated and added as distance_HL variable to df_comb_minimal data frame. On other hand the logFC_int is used to assign penalties for intensity values and added as distance_int variable. df_comb_minimal with the additional variables is named penalties_df.

proposition:

Same as r BiocStyle::Biocpkg("rifi") workflow, to get the best segmentation we need the optimal penalties. To calculate HL penalty, the difference between half-life from both conditions is calculated and added as distance_HL variable to df_comb_minimal data frame. On other hand the logFC_int is used to assign penalties for intensity values and added as distance_int variable. df_comb_minimal with the additional variables is named penalties_df.

Thanks for making it clear. All variable, function, package names are now with backticks.

bioc-issue-bot · 2022-10-08T20:20:14Z

Received a valid push on git.bioconductor.org; starting a build for commit id: 6e23d614d010e253de585443f9459d55dfe626a5

bioc-issue-bot · 2022-10-08T20:33:51Z

Dear Package contributor,

This is the automated single package builder at bioconductor.org.

Your package has been built on Linux, Mac, and Windows.

Congratulations! The package built without errors or warnings
on all platforms.

Please see the build report for more details. This link will be active
for 21 days.

Remember: if you submitted your package after July 7th, 2020,
when making changes to your repository push to
git@git.bioconductor.org:packages/rifiComparative to trigger a new build.
A quick tutorial for setting up remotes and pushing to upstream can be found here.

HelenaLC · 2022-10-10T06:48:34Z

Okay, thanks for making a couple adjustments and clarifying the defaults. However, even in rifi's vignette, I couldn't yet find an explanation for how the defaults / parameters used were selected; and,make_pen() is called with smpl_max = 18 (not 100, as in rifiComparative); and, in general, it is sub-optimal for users to have to think of searching another (even if related) package's vignette: the current package's documentation should be self-explanatory (i.e., in itself or including relevant pointers to external sources).

To expand a bit on my current confusion (giving one example):

make_pen() mentions that fragment_HL_pen() and fragment_inty_pen() are called.
There are internal (not exported) functions, only accessible via :::, and without documentation.
Meanwhile, they are described in the rifi vignette (but this isn't obvious from ?rifi::make_pen.
And, considering all this, what is the difference (if any) between rifi::/rifiComparative::make_pen?
If both are the same, why would rifiComparative even define and export a same-named function?

...all that is to point out that it would be appreciated if you could expand on the documentation such that it is made clear how the two packages interact, and (if not in rifiComparative itself) where to find additional information on rifi-specific parts of the package.

lyoussar · 2022-10-16T17:56:30Z

smpl_max = 18 was used only in rifi package and precisely in vignette for the mini sample generated to make the vignette computationally fast. The functions used in rifi are all computationally expensive and if the sample is a bit larger, would take hours. Therefore a mini instance was created and the value 18 was used. The sample can never be a real sample because it does not make any sense.

Regarding your question if make_pen in rifi and rifi_comparative are same. Yes but with a very thinny adjustment, in penalties function where make_pen is used, I changed the parameter smpl_max to 50 since I except a smaller training sample compared to rifi. The values used in rifiComparative to train the data are less noisy compared to rifi. If you check the penalties function, I am not making the parameters flexible to the user since they is no need. Only data and cores number are required. As the user would not need to change the parameters I am not sure if expanding the documentation for make_pen in rifiComparative would bring more help or confusiones.

lyoussar · 2022-11-14T12:24:00Z

Any update?

HelenaLC · 2022-11-15T09:00:03Z

My apologies, I have been quite unsure how to respond/proceed. Perhaps you could help clarify. You stated that rifiComparative is a successor to rifi. Does this mean the latter will be deprecated? If not, I do not understand why there would be a near copy-paste of functions between both packages, with i) some minor differences; ii) documentation scattered across different places; and, iii) unclarity regarding how the packages are designed to interoperate, which package's function to use, if it makes a difference or not etc. Overall, having two packages inter-operate is great. But having repeated functionality (even code) is not. Perhaps this is a misunderstanding on my end; if so, I'd be grateful if you could explain the above points (if rifi is to be continued).

lyoussar · 2022-11-27T10:23:41Z

I will try this time to be explicit and I hope I can clarify the idea.
rifi package was developed with X functions. rifiComparative compares the outputs of rifi using XX functions. Both packages have different functions but share one important approach which is genome segmentation using one algorithm. This algorithm was developed by us spending quite a long time to make it efficient computationally and biologically. This algorithm could be used in any package requiring genome segmentation/fragmentation. The only need is to adapt the input and slightly the code to have the same efficiency. rifiComparative contains many functions developed exclusively for it and at certain step reuses that algorithm to accomplish one crucial step essential to make the package functional and efficient.
I hope I was able to translate better the idea behind rifi and rifiComparative and I am available to provide more details if necessary.

lyoussar · 2022-12-29T06:58:11Z

I did not get any feedback!

lshep · 2023-01-31T16:25:24Z

@lyoussar sorry for the delay. I will be taking over the review. I will look over the package over the next week and get back to you.

vjcitn · 2023-02-07T10:31:30Z

Hi @lyoussar owing to various issues I will take over the review here.

vjcitn · 2023-02-07T10:35:03Z

Note that when I install the latest version of rifi I see

Note: next used in wrong context: no loop is visible 
** help
Warning: apply_Ttest_delay.Rd:55: unexpected '}'
Warning: predict_ps_itss.Rd:57: unexpected '}'
*** installing help indices

lyoussar · 2023-02-07T10:42:05Z

The package is rifiComparative and not rifi. Could you confirm please that you are checking on the right folder?

vjcitn · 2023-02-07T10:54:35Z

Notice the chapter numbering in the vignette. Perhaps you need to remove a "#"

lyoussar · 2023-02-07T10:57:35Z

I am sorry I don t get it. What chapter do you mean? I don t see the "#" on the picture unless if its inside a chapter but I would like to know which one is. Thanks

vjcitn · 2023-02-07T10:57:49Z

In the vignette I see

data(stats_se_cdt1)
data(stats_se_cdt2)
data(differential_expression)
inp_s <-  
    loading_fun(stats_se_cdt1, stats_se_cdt2, differential_expression)[[1]]
head(inp_s, 5)
##   strand position ID FLT intensity probe_TI flag position_segment     delay
## 1      +       67  1   0  1367.080       -1    _              S_1 1.4190839
## 2      +      153  2   0  3316.336       -1    _              S_1 1.9343216
## 3      +      199  3   0  1112.101       -1    _              S_1 0.6442441
## 4      +      259  4   0  2012.294        1    _              S_1 0.0010000
## 5      +      320  5   0  1627.467       -1    _              S_1 1.9506707

Could the output data be represented as a GRanges, to help with interoperability and self-description. The function name loading_fun is very peculiar, too generic -- the function name should help the user understand what is being loaded. This will aid users in remembering how to use the tool.

vjcitn · 2023-02-07T11:00:08Z

When I render your vignette, I see the 0.x.y in the table of contents that is shown in the photograph attached to the comment. Your vignette code has

## 0. Installation

It should not have ## and should not have 0. Let the render() function take care of the numbering for you.

vjcitn · 2023-02-07T11:00:48Z

All the numbers in the section headings should come out (roman numerals or arabic numeral 0).

vjcitn · 2023-02-07T11:04:32Z

<br/><br/>
``` {r, echo = FALSE, fig.cap = "**\\label{fig:figs}Decay rate vs. Synthesis rate**", out.width = '100%'}
knitr::include_graphics("Decay_rate_vs_Synthesis_rate.png")

``` You are including a number of static precomputed images. Are the computations too slow to have them performed in the vignette? A basic purpose of the vignette discipline in Bioconductor is to illustrate the actual computations and to have the build system perform them on a regular basis, so that developers can learn if code has become stale and throws new errors.

lyoussar · 2023-02-07T13:09:18Z

I submitted this package on July 2022 and I made all changes the referee proposed and asked for. RifiComparative is a continuation of rifi package which was published in Bioconductor. To make the package easy and straightforward for users I submitted to Bioconductor making both packages uniforme regarding the installation and format. I followed all instructions of Biocoinductor. I made all changes the referee asked for which were a hell of work trying to keep the same format I used for the vignette in rifi. I am not sure why rifi package was accepted after slights changes in the vignette and the successor is taking a while to be accepted. I am the author of rifi and rifiComparative vignette. I am still getting request over and over even though a huge changes were applied on Aug2, Aug15, Sep1, Oct8 and Oct16.

Your last question for instance:
You are including a number of static precomputed images. Are the computations too slow to have them performed in the vignette? A basic purpose of the vignette discipline in Bioconductor is to illustrate the actual computations and to have the build system perform them on a regular basis, so that developers can learn if code has become stale and throws new errors

In rifi package, I tried to put the code as the vignette discipline in Bioconductor requests but I ve got complains from the referee because the computations were slow and could be replaced by images in some cases. Now I get the opposite complain.

If you check again the rifi and rifiComparative you will find a lot of similarities although excessive changes were included on the last one because they were requested but not on the first package. I wish we can include rifiComprative in Bioconductor otherwise it will be unfortunate for the users to use rifi and not rifiComparative.

bioc-issue-bot · 2023-02-17T12:49:37Z

Your package has been accepted. It will be added to the
Bioconductor nightly builds.

Thank you for contributing to Bioconductor!

Reviewers for Bioconductor packages are volunteers from the Bioconductor
community. If you are interested in becoming a Bioconductor package
reviewer, please see Reviewers Expectations.

lshep · 2023-02-22T16:51:14Z

The master branch of your GitHub repository has been added to Bioconductor's git repository.

To use the git.bioconductor.org repository, we need an 'ssh' key to associate with your github user name. If your GitHub account already has ssh public keys (https://github.com/lyoussar.keys is not empty), then no further steps are required. Otherwise, do the following:

See further instructions at

https://bioconductor.org/developers/how-to/git/

for working with this repository. See especially

https://bioconductor.org/developers/how-to/git/new-package-workflow/
https://bioconductor.org/developers/how-to/git/sync-existing-repositories/

to keep your GitHub and Bioconductor repositories in sync.

Your package will be included in the next nigthly 'devel' build (check-out from git at about 6 pm Eastern; build completion around 2pm Eastern the next day) at

https://bioconductor.org/checkResults/

(Builds sometimes fail, so ensure that the date stamps on the main landing page are consistent with the addition of your package). Once the package builds successfully, you package will be available for download in the 'Devel' version of Bioconductor using BiocManager::install("rifiComparative"). The package 'landing page' will be created at

https://bioconductor.org/packages/rifiComparative

If you have any questions, please contact the bioc-devel mailing list (https://stat.ethz.ch/mailman/listinfo/bioc-devel); this issue will not be monitored further.

bioc-issue-bot added the 1. awaiting moderation submitted and waiting clearance to access resources label Jul 15, 2022

vjcitn added the pre-check passed pre-review performed and ready to be added to git label Jul 15, 2022

bioc-issue-bot assigned HelenaLC Jul 22, 2022

bioc-issue-bot added the ERROR label Jul 22, 2022

lshep assigned lshep and unassigned HelenaLC Jan 31, 2023

lshep assigned vjcitn and unassigned lshep Feb 6, 2023

vjcitn added 3a. accepted will be ingested into Bioconductor daily builder for distribution and removed 2. review in progress assign a reviewer and a more thorough review of package code and documentation taking place labels Feb 17, 2023

lshep closed this as completed Feb 22, 2023

rifiComparative #2718

rifiComparative #2718

Comments

lyoussar commented Jul 15, 2022

bioc-issue-bot commented Jul 15, 2022

vjcitn commented Jul 15, 2022

lyoussar commented Jul 15, 2022

bioc-issue-bot commented Jul 22, 2022

bioc-issue-bot commented Jul 22, 2022

lyoussar commented Jul 22, 2022

lshep commented Jul 22, 2022 via email

lyoussar commented Jul 22, 2022 • edited Loading

lyoussar commented Jul 25, 2022

HelenaLC commented Aug 2, 2022

documentation

vignette

code

other

lyoussar commented Aug 3, 2022 • edited Loading

documentation

vignette

code

other

HelenaLC commented Aug 5, 2022

lyoussar commented Aug 5, 2022

HelenaLC commented Aug 5, 2022 • edited Loading

lyoussar commented Aug 5, 2022

HelenaLC commented Aug 6, 2022

lyoussar commented Aug 6, 2022

HelenaLC commented Aug 12, 2022

lyoussar commented Aug 12, 2022

HelenaLC commented Aug 12, 2022 • edited Loading

lyoussar commented Aug 12, 2022

lshep commented Aug 12, 2022

lyoussar commented Aug 17, 2022

HelenaLC commented Oct 6, 2022 • edited Loading

lyoussar commented Oct 8, 2022

bioc-issue-bot commented Oct 8, 2022

bioc-issue-bot commented Oct 8, 2022

HelenaLC commented Oct 10, 2022

lyoussar commented Oct 16, 2022

lyoussar commented Nov 14, 2022

HelenaLC commented Nov 15, 2022

lyoussar commented Nov 27, 2022 • edited Loading

lyoussar commented Dec 29, 2022

lshep commented Jan 31, 2023

vjcitn commented Feb 7, 2023

vjcitn commented Feb 7, 2023

lyoussar commented Feb 7, 2023

vjcitn commented Feb 7, 2023

lyoussar commented Feb 7, 2023

vjcitn commented Feb 7, 2023

vjcitn commented Feb 7, 2023

vjcitn commented Feb 7, 2023

vjcitn commented Feb 7, 2023

lyoussar commented Feb 7, 2023 • edited Loading

bioc-issue-bot commented Feb 17, 2023

lshep commented Feb 22, 2023

lyoussar commented Jul 22, 2022 •

edited

Loading

lyoussar commented Aug 3, 2022 •

edited

Loading

HelenaLC commented Aug 5, 2022 •

edited

Loading

HelenaLC commented Aug 12, 2022 •

edited

Loading

HelenaLC commented Oct 6, 2022 •

edited

Loading

lyoussar commented Nov 27, 2022 •

edited

Loading

lyoussar commented Feb 7, 2023 •

edited

Loading