CpG informative sites as index in PAMESdata #7

GMFranceschini · 2019-06-10T12:36:57Z

I noticed that PAMES sites in PAMESdata collection (450k sites) are indicated as indexes rather than probe names. That might cause troubles with compute_purity() if those sites are used with a Beta table that is not exactly as expected by the function (indexes matching the proper probe).

Ex. if a Beta table is smaller or larger, indexes will use the wrong sites. I hope this makes sense to you, what scared me the most is that when this problem happens PAMES returns no error at all, but of course, the purity estimation at that point is wrong (using wrong sites).

Please let me know if I can help by any means, this is not urgent but I think you might want to address that in the future.

The text was updated successfully, but these errors were encountered:

romagnolid · 2019-06-12T12:20:11Z

You are right, maybe it's better to include both indexes and probe names or probe names only. The latter case presents a problem if a beta table has no probe names associated but it would avoid a selection of wrong sites

GMFranceschini · 2019-06-13T11:57:41Z

Great! A momentary solution could be to check for the expected dimension of the input beta matrix. This would require minimal effort and return an error if the output doesn't match the expected probe set dimension, probably avoiding the situation in most of the cases

romagnolid · 2019-08-26T09:42:39Z

I won't close the issue for now, let's see how the temporary solution plays out.

jtlow · 2021-06-28T20:58:16Z

Hi, I'm trying out PAMES for the first time to calculate tumor purity for some 450k array data. It seems like the tumor_table needs to be exactly the same number of rows as the ref_table, is that correct? Are there any workarounds for cases where tumor_table may be missing data?

romagnolid · 2021-06-29T11:16:14Z

Hi @jtlow, thanks for using our package!
That is correct, it is a way to ensure that the indexes of the CpG sites have the same correspondence between the tumor table and the reference (I might change that in a future release, maybe using CpG probe names instead of indexes).

First of all, a simple workaround is to pass the same object (a matrix of beta-values coverted to percentage) to both the tumor_table and ref_table args in the function compute_purity but you must be absolutely certain that the CpG sites you are passing have the correct correspondence.

For sparce missing data or even entire rows/CpG sites missing, you don't need to worry, the purity will be computed anyway as long as your table has the classic Illumina 450k format.
Otherwise, if your table has a different format you can provide me other details and I can help you find a solution.

romagnolid · 2022-10-31T15:23:29Z

New version uses probe names instead of indexes

romagnolid added a commit that referenced this issue Aug 26, 2019

Address issue #7

27e3713

romagnolid closed this as completed Oct 31, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CpG informative sites as index in PAMESdata #7

CpG informative sites as index in PAMESdata #7

GMFranceschini commented Jun 10, 2019

romagnolid commented Jun 12, 2019 •

edited

GMFranceschini commented Jun 13, 2019

romagnolid commented Aug 26, 2019

jtlow commented Jun 28, 2021

romagnolid commented Jun 29, 2021

romagnolid commented Oct 31, 2022

CpG informative sites as index in PAMESdata #7

CpG informative sites as index in PAMESdata #7

Comments

GMFranceschini commented Jun 10, 2019

romagnolid commented Jun 12, 2019 • edited

GMFranceschini commented Jun 13, 2019

romagnolid commented Aug 26, 2019

jtlow commented Jun 28, 2021

romagnolid commented Jun 29, 2021

romagnolid commented Oct 31, 2022

romagnolid commented Jun 12, 2019 •

edited