-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
publish target definitions #15
Comments
Hi all: To me (probably not just me), the "Open" also means (in addition to Open I'm clearly not close to the end of my personal journey here, but am I was curious to learn last week (only) that DarwinCore defines the term "The category of information pertaining to taxonomic names, taxon name Category of information..? I thought it was more like this: What I am getting at is this: there are two often intersecting contexts Context 1 is I think how we usually talk to each other. There is an Context 2 is how I think we ought to translate the very legitimate but Why all this? Because of the "same clade". And we all understand that I obviously think there is some value in an exercise where one does Accordingly, "same clade in the synthetic tree" (context 1) becomes: "phylogenetically congruent clade hypotheses, published independently Perhaps it is apparent how context 2 may inform annotation practices Hopefully I also managed to express that ascribing a full reality to Sorry if this was TL-DR. Best, Nico On Tue, Oct 28, 2014 at 4:08 AM, Karen Cranston notifications@github.com
|
This is being revised in the round of changes approved by the executive at On 10/28/14, 5:29 PM, Nico Franz wrote:
Hilmar Lapp -:- lappland.io |
Thanks, Hilmar. Incidentally there is a related, upcoming Berkeley "BIGCB" workshop. I http://taxonbytes.org/bigcb-workshop-at-uc-berkeley-tackling-the-taxon-concept-problem/ Nico On Wed, Oct 29, 2014 at 8:14 AM, Hilmar Lapp notifications@github.com
|
Hi Nico, Does that make sense, or am I missing something? @jar398 might also have some input here. |
Can someone provide examples of target definitions that are not defined in the phylocode? It seems to me that this real life use case is precisely that which the phylocode seeks to address. |
Example definition not defined by phylocode: anything that is I know some people don't like to admit claims like this, but they are On Sat, Nov 1, 2014 at 8:58 AM, Matt notifications@github.com wrote:
|
Some clarification- I was assuming (likely incorrectly) that target definitions were those being used in some computable manner, i.e. not asserted as in the nomenclatural pipeline. Since character data are not stored in OT I assumed this wasn't an option. If definitions are simply user asserted annotations then why worry about target definitions, just make a generic "tagging" system and let users come up with sets of attributes (tags on tags) that they find useful? |
I find the annotation enterprise epistemologically troublesome. It I think it's very important to distinguish real-life clades, the To rigorously interpret a node N as a designator for a clade, you
An annotation (a biological claim) expressed in relation to node N The hard part comes when we update the synthetic tree, i.e. we compute The problem case where N 'becomes paraphyletic' when we go from tree1 Re 4(b), you can't just say that the annotation is about the MRCA of Now maybe I'm being rabid, and the annotations ought to be attached to Jonathan |
Thank you, Karen (et al.). I tend to think that the less abstract, the easier to understand and Suppose that OT version 12 includes a section in its topology for So then at time 0, maybe the annotation database ought to be able to
Nico On Sat, Nov 1, 2014 at 2:09 AM, Karen Cranston notifications@github.com
|
" It would help me if I could see a list of use cases, i.e. actual claims that one would want to express and store." <- Yes, this is likely the only way to really resolve this. IMO there are 2 worlds. 1) clades are asserted to exist vs. 2) clades are calculated from data. IMO OT is completely within 1. When you seek to persist clades/nodes across OTs, then you must ask, are you in world 1), or world 2)? For arguments sake I claim that unless you begin to calculate on data (or, more broadly annotations on nodes), you will always be in 1). Regardless of who states what about concept T at time X, if you can't recalculate based on the data, you're stuck with an assertion. This was the basis of my original observation in this thread, i.e. what then can you do that is not doable as defined in the phylocode (or maybe the phylocode doesn't work, but let's assume it does)? If OT agrees to happily exist in world 1) (which is just fine), then there are many things that are easily done without over complicating things. Entomologist don't think to themselves, "I've got this nagging doubt, humans just might be insects!". For many (all?) practical purposes they never, ever have to do this. They do real work, on a daily basis, without ever worrying about the definition of insects expanding to including humans. If they are doing an insect phylogeny they also don't have to worry about birds, lizards, fish, or squirrels. This suggests to me that there are real clades, that can be represented as nodes, in the OT, and that these can persist across versions. Do we need a robust logical framework for this level of assertion/claim? I love the idea, but maybe its over-engineering at some level. M |
Thanks, Matt. I think the point about the right amount of engineering is always well Whether clade concepts are asserted (with data being "somewhere else") I think (but may well be ignorant) the following example is challenging 0.clade1 with three children: Then also 0.clade2 with three children: Suppose that the PhyloCode node-based identity of 0.clade1 is set as the Similarly, the identity of 0.clade2 is set as the 0.clade2_child4 and At time = 1, new evidence/interpretation indicates that the respective 1.clade1 with children: 1.clade2 with children: I believe under the PhyloCode application the clade definitions do not Best, Nico On Sat, Nov 1, 2014 at 1:36 PM, Matt notifications@github.com wrote:
|
On Sat, Nov 1, 2014 at 4:36 PM, Matt notifications@github.com wrote:
In case 2) sometimes there will be a clade to which the hypothesis (such as Of course you don't have a claim that can be judged true or not unless you
I think it's better to have the annotations not expressed in terms of any
Here's another example I'm struggling with: there are currently a couple of The problem of transparency for identifier semantics and annotation Jonathan
|
I think I likely confuse rather than add to this discussion, so I'll tiptoe away after this. My world 1/2 distinction appears to confuse. IMO any proposed species, taxon, clade is a hypothesis, so this does not factor into my distinction b/w one and two. 1/2 is a pragmatic distinction, it's related to how hypotheses burst into existence, then get referenced later on. World 2) is about how species and clade hypothesis are ("originally") defined, it pertains to data derived directly from instances of the (ultimately hypothesized) species/clades. For example I might gather DNA, anatomical, and behavioral data and then run an algorithm on these data, based on those results I hypothesize the existence of taxa/clades. Later, in world 1) someone points to my hypothesis, assumes it's a good one, and does a new study. They do not compute on any of the data I used to define my original hypotheses. An OT in this 2) would necessarily reference specimens, and the data directly tied to those specimens. It would then define classes (clades/taxa) that classify those specimens based on the outcomes of the analysis of the underlying data. ( Supertree methods do not count as world 2, I assert this, rather than back it up). OT could do something similar to what happens world two, but abstracted away from specimen data a layer or two. Annotations (= data that can used to define clades) can be added to OTUs. Clades can be defined as classes that are bound to a quantitative calculation on those annotations, i.e. they classify OTUs. In this scenario identifiers are provided for these clades, and from tree to tree they repopulate based on the data that is available. Their definition remains the same, a calculation/algorithm, therefor there is no need to change identifiers. Transparency is not an issue, when someone asks why X is in Y, you point to the algorithm that placed X in Y. Want to tweak the calculation that defines a clade? Mint a new identifier, it's demonstrably different because it actually calculates on data. A final way of thinking about it. In OT there are publications, topologies, and taxa/clades. Now you want to add annotations to the system. The problem is that the system defines taxa/clades only via reference to a publication and topology. How can you expect a system to persist annotations on clades when the system does not define those clades based on those annotations? |
On Nov 2, 2014, at 11:35 AM, Nico Franz <notifications@github.commailto:notifications@github.com> wrote: 0.clade1 with three children: Then also 0.clade2 with three children: Nico, I like the approach of specifying an example of how relationships change. This discussion would be helped by a set of concrete examples of attributions and changes that reflect the kinds of problems that are likely to arise (maybe the 80% rule could work here). To me, this raises the issue of why we want to traffic in clade concepts that purport to be stable by virtue of referring to an external reality, when in reality they reflect a limited view that is likely to change in the future. SFAIK it is generally agreed in the ontology world that ontological statements may be asserted as true based on the best available knowledge, even when they are hypotheses with some uncertainty (the whole issue of describing a conceptual world of hypotheses or posterior distributions is a separate matter). The problem with clades is just that the uncertainty is high enough, and they are so likely to change, that we are all here having an explicit discussion here about how knowledge can persist through these changes. Is there some more generic way to assign attributes that sticks closer to the evidence? Rather than pinning a label “blue” (for instance) on a clade based on some research publication, let's say that the publication assigns “blue" based on an ordered split, where the ingroup is { child1, child3 } and the outgroup is { child4, child6 }. This rule for assigning “blue" can persist, potentially through multiple tree topologies. In each case, we have to determine whether the topology is consistent with the split, and if so, how to apply the attribute. We could back this up a further step— staying even closer to the evidence— and simply specify the method and evidence used in the research publication that attributes “blue” to clade1. We could say that “blue” is assigned by parsimony based on a particular distribution, e.g., ((((child1:blue,child2:blue),child4:purple),child5:red),child6:green). And again, we need a set of rules to know how to apply this when the topology is updated and when new members are added. I don’t think there is any way to avoid the need to implement some kind of complex rule-based system, where the rules are based on phylogenetic logic. Arlin Suppose that the PhyloCode node-based identity of 0.clade1 is set as the Similarly, the identity of 0.clade2 is set as the 0.clade2_child4 and At time = 1, new evidence/interpretation indicates that the respective 1.clade1 with children: 1.clade2 with children: I believe under the PhyloCode application the clade definitions do not Best, Nico On Sat, Nov 1, 2014 at 1:36 PM, Matt <notifications@github.commailto:notifications@github.com> wrote:
— Arlin Stoltzfus (arlin@umd.edumailto:arlin@umd.edu) |
Thanks, all, I am trying to keep up the momentum (as time permits). I thought Jonathan's crustacean/mollusc example is neat. Possibly neat
Best, Nico |
The annotation DB will contain many different targets, and it would be interesting to be able to easily see / query those definitions. This could also potentially lead to re-use of definitions for the same clade - "you defined clade x as blah blah blah, and we also have these definitions that point to that same clade in the synthetic tree"
The text was updated successfully, but these errors were encountered: