-
Notifications
You must be signed in to change notification settings - Fork 32
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
intersection
: issue on consecutive duplicate words
#45
Comments
Alternatively, it could behave as it currently does, and instead you could add the |
I think you first solution sounds reasonable. I'll take a look and see what I can do. |
When I went to implement the changes, I realized that my original intention with For example, what would happen for a phone list, where only some of the phones are listed for each word, e.g. I wondered how I could accommodate these two scenarios--parameterize I decided that a simpler solution was to create a different method What do you think? Does |
Here is the method signature: Here is an example usage from the tests, using the example you provided: |
I merged my PR and built a release. I've been sitting on a lot of code since November which I really shouldn't have done. Reviews on the merged PR are still welcome--I can make a follow-up PR. 🙇 |
Thanks for this! I think creating the new method is a great compromise and this helps my use case a lot. I'll take a look at the PR and see if I have any comments to make. Thanks for the quick response! |
Today I encountered an issue with the behavior of
intersection
.Say I have a
WORD
tier that looks like this:And I have a
PHONE
tier that looks like this:Assuming these are time-aligned correctly, when I call
intersection
, I get a list that looks something like this:Because I have two intervals in the
WORD
tier which have the same label, from this intersection I can't really tell if I have two distinct words"A"
that have the respective transcriptions"AH0"
and"EY1"
, or if I have one distinct word"A"
transcribed as"AH0 EY1"
.Obviously, there is no right way to solve this, but I would suggest that since we do know that the word entries are distinct, that perhaps instead the label should be the
WORD
label plus a tuple of all thePHONE
labels that coincide with it. Something like this:This would also mean that the interval boundaries would be the boundaries of the left-hand side tier. So my example would be for
If you instead did
you would get
What do you think?
The text was updated successfully, but these errors were encountered: