Generate trainings data using context #25

teresa-m · 2021-07-30T12:44:17Z

teresa-m · 2021-07-30T13:11:04Z

See later: Could we add a bias by placing the trusted RRI in the middle of the context and possible negative interaction could be at the border of the context? Shoudl we disallow RRIs in the bignning and and of the sequences?

teresa-m · 2021-08-02T12:04:50Z

Is it a week spot that we have the proteom m-RNA binding data? Are there many proteins binding to ncRNAs?

martin-raden · 2021-08-03T11:53:21Z

concerning

extrect genomec context for trusted RRI's (e.g. 300 on both sides, context lenght depens on length of RRI and RBP interactions sides)

I would always add the same context length left/right independently of the RRI/RBP subsequence length. simplifies the setup and sequence length is of no matter anyway...

martin-raden · 2021-08-03T11:55:36Z

See later: Could we add a bias by placing the trusted RRI in the middle of the context and possible negative interaction could be at the border of the context? Shoudl we disallow RRIs in the bignning and and of the sequences?

constrain seeds to be not at sequence ends

good point! can be solved by constraining the seed to the positions +100 to (length-100), i.e. similar to the positive data set but with the additional "blocking constraints". that way, the accessibilities of the RRIs should be reliable than those around sequence ends!

martin-raden · 2021-08-03T11:56:20Z

Is it a week spot that we have the proteom m-RNA binding data? Are there many proteins binding to ncRNAs?

I would guess RBPs do not distinguish much between lnRNA and mRNA...

teresa-m · 2021-08-04T08:55:33Z

xtrect genomec context for trusted RRI's (e.g. 300 on both sides, context lenght depens on length of RRI and RBP interactions sides)

Ja sorry I wanted to check the length of RRI and RBP binding sides to see if 300 context is long enough. But of course, the added context will be for all the same. Maybe I should have made this point more clear.

teresa-m · 2021-08-04T09:00:01Z

would guess RBPs do not distinguish much between lnRNA and mRNA...

I was just wondering since the data of the paper I found only gives us the proteome binding m-RNA, if I understood it correctly.

martin-raden · 2021-08-05T11:35:06Z

would guess RBPs do not distinguish much between lnRNA and mRNA...

I was just wondering since the data of the paper I found only gives us the proteome binding m-RNA, if I understood it correctly.

most likely they were just interested in direct gene regulation, i.e. mRNA binding

teresa-m · 2021-09-13T12:52:08Z

Two latest attempts to make the positive and negative feature distribution more allike.
(1) Using occupied regions as contain also for the positive instance generation -> We are losing many sequences but the distribution looks a bit more similar than without using it.
(2) not allowing long bulges within the interaction

martin-raden · 2021-09-13T13:04:54Z

bin gespannt... :)

teresa-m added this to the Milestone: generate negative data milestone Jul 30, 2021

teresa-m mentioned this issue Aug 31, 2021

Advance negative data generation RNAs not involve in a interaction #19

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Generate trainings data using context #25

Generate trainings data using context #25

teresa-m commented Jul 30, 2021 •

edited

Loading

teresa-m commented Jul 30, 2021

teresa-m commented Aug 2, 2021

martin-raden commented Aug 3, 2021

martin-raden commented Aug 3, 2021

martin-raden commented Aug 3, 2021

teresa-m commented Aug 4, 2021

teresa-m commented Aug 4, 2021

martin-raden commented Aug 5, 2021

teresa-m commented Sep 13, 2021

martin-raden commented Sep 13, 2021

Generate trainings data using context #25

Generate trainings data using context #25

Comments

teresa-m commented Jul 30, 2021 • edited Loading

teresa-m commented Jul 30, 2021

teresa-m commented Aug 2, 2021

martin-raden commented Aug 3, 2021

martin-raden commented Aug 3, 2021

constrain seeds to be not at sequence ends

martin-raden commented Aug 3, 2021

teresa-m commented Aug 4, 2021

teresa-m commented Aug 4, 2021

martin-raden commented Aug 5, 2021

teresa-m commented Sep 13, 2021

martin-raden commented Sep 13, 2021

teresa-m commented Jul 30, 2021 •

edited

Loading