rna-transcription: Clarify problem description #956

jamesmcm · 2017-10-15T13:55:29Z

Clarify invalid bases problem - see exercism/python#916

Clarify invalid bases problem

rpottsoh

I am not sure how necessary these changes are. However, they are aligned with the canonical data so I think adding these lines is fine. Thanks @jamesmcm for taking the time to help out. 👍

NobbZ · 2017-10-15T17:39:16Z

exercises/rna-transcription/description.md

+
+Note that since we are finding the complement, you should use the mapping of pairs above (and not just swap T for U).
+
+Your function will need to be able to handle invalid inputs, both partial (a few invalid bases) and completely invalid strings. In these cases it should return an empty string.


In these cases it should return an empty string.

No! No! No!

Returning the empty string is a valid complement for the empty sequence as input.

Yes, it is part of this exercise to handle invalid input, because we use a string as input which has a much larger domain than just Nucleotides.

We have choosen to do this even on languages that could completely avoid invalid sequences because of their typesystem, because we treat it as a parsing and string transformation exercise.

But please, lets use the idiomatic error mechanisms of the implementing language. Lets throw something in Java, raise in Ruby, return an error-tuple/atom in elixir, erlang and LFE, lets have a Result- or Either-, or Option-, or Maybe-type in Rust, OCaml, Haskell, Idris, return an error-value in Go, do whatever is idiomatic in Python, but please do not force to use something unidiomatic via the description.

Of course tracks can choose to simply expect the empty string to signal errors, because they do not want to introduce proper error handling in this exercise but it should be the tracks maintainers who choose, not the description. Error handling is a subject that has to be treatened carefully…

Right now, the empty string is what passes the test in Python though, which is how I found this issue (I had no idea we even had to handle invalid cases).

Perhaps this should be changed there, and the descriptions changed in both to reflect that.

Please PR against their HINTS.md in that exercise with an accompanied regenerated README.md to describe the error handling part, but leave it here only as “There may be errors in the input which you need to detect and act accordingly” or something like that.

@NobbZ, you're right regarding the error handling - I've submitted a PR to fix the Python implementation.

NobbZ · 2017-10-15T17:45:48Z

exercises/rna-transcription/description.md

 * `T` -> `A`
 * `A` -> `U`
+
+Note that since we are finding the complement, you should use the mapping of pairs above (and not just swap T for U).


I'm not sure what you actually want to say with this sentence…

It is semantically the same as the sentence in front of the mapping, but pointing to the mapping above, instead of below:

Given a DNA strand, its transcribed RNA strand is formed by replacing each nucleotide with its complement:

Sure, but in the sourced Rosalind exercise the problem is different. It was just to clarify that.

Thats a completely different thing and a big bummer!

This means, that either our exercise or the rosalind exercise in incorrect in terms of biology.

We need to find this out, would you mind open a clean ticket in this repository explaining the circumstances?

@NobbZ, in the Rosalind exercise, they are converting the coding strand to RNA, but I find that a very strange approach. Typically people consider transcription with reference to the template strand (which is the complement of the coding strand), and this is definitely what most people would think of if asked to write an RNA transcription program without being given further detail. Regardless, I think the current exercise is probably clear enough about how we are doing it, since it gives a transcription table, and neither approach is technically wrong.

NobbZ · 2017-10-15T17:49:28Z

exercises/rna-transcription/description.md

+
+Note that since we are finding the complement, you should use the mapping of pairs above (and not just swap T for U).
+
+Your function will need to be able to handle invalid inputs, both partial (a few invalid bases) and completely invalid strings. In these cases it should return an empty string.


Please PR against their HINTS.md in that exercise with an accompanied regenerated README.md to describe the error handling part, but leave it here only as “There may be errors in the input which you need to detect and act accordingly” or something like that.

NobbZ · 2017-10-15T17:51:38Z

Sorry for double creating that post… I seem to have clicked the wrong button…

Insti

Thanks for submitting this PR @jamesmcm

I realise you were following some guidance which suggested making this PR, which is often good advice for changes to track READMEs, but in this case I agree with @NobbZ and don't think it is appropriate to make these changes here.

I can't see any way to change this PR to be more mergeable so I suggest closing it.

rpottsoh · 2017-10-15T22:33:54Z

Thanks @NobbZ and @Insti for weighing in here. I generally like the proposed changes, but I also agree with your assessments of how the changes would impact the tracks that implement this exercise.

I don't see anyway to change this PR to make it suitable for merging.

I now fully realize the negative implications these changes would have on tracks that implement this exercise. I wasn't thinking along those lines when I approved the PR.

Insti · 2017-10-28T19:55:55Z

This appears to have now been handled by the Python track.

Clarify rna-transcription

4fb8a68

Clarify invalid bases problem

rpottsoh changed the title ~~Clarify rna-transcription~~ rna-transcription: Clarify problem description Oct 15, 2017

rpottsoh previously approved these changes Oct 15, 2017

View reviewed changes

NobbZ requested changes Oct 15, 2017

View reviewed changes

NobbZ reviewed Oct 15, 2017

View reviewed changes

NobbZ requested changes Oct 15, 2017

View reviewed changes

Insti suggested changes Oct 15, 2017

View reviewed changes

N-Parsons mentioned this pull request Oct 27, 2017

rna-transcription: Use an exception to handle input errors exercism/python#1067

Merged

Insti closed this Oct 28, 2017


		Note that since we are finding the complement, you should use the mapping of pairs above (and not just swap T for U).

		Your function will need to be able to handle invalid inputs, both partial (a few invalid bases) and completely invalid strings. In these cases it should return an empty string.

Uh oh!

rna-transcription: Clarify problem description #956

rna-transcription: Clarify problem description #956

Uh oh!

Conversation

jamesmcm commented Oct 15, 2017

Uh oh!

rpottsoh left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

NobbZ commented Oct 15, 2017

Uh oh!

Insti left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

rpottsoh commented Oct 15, 2017

Uh oh!

Insti commented Oct 28, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Insti left a comment •

edited

Loading