project_phenotype table? #62

ekcannon · 2018-04-11T13:45:59Z

I would like to attach a set of phenotype records to the study (project) that generated them, but don't want to use the nd_experiment table:

project --- nd_experiment_project --- nd_experiment --- nd_experiment_phenotype --- phenotype

This is because it is challenging to maintain data integrity due to both the length of the connecting chain, and the lack of constraints on the nd_experiment table.

CREATE TABLE project_phenotype (
project_phenotype_id BITINT SERIAL NOT NULL,
PRIMARY KEY (project_phenotype_id),
project_id BIGINT NOT NULL,
FOREIGN KEY (project_id) REFERENCES project (project_id) ON DELETE CASCADE INITIALLY DEFERRED,
phenotype_id BIGINT NOT NULL
FOREIGN KEY (phenotype_id) REFERENCES phenotype (phenotype_id) ON DELETE CASCADE INITIALLY DEFERRED,
CONSTRAINT project_phenotype_c1 UNIQUE (project_id, phenotype_id)
);
CREATE INDEX project_phenotype_idx1 ON project_phenotype (project_id);
CREATE INDEX project_phenotype_idx2 ON project_phenotype (project_id);

The text was updated successfully, but these errors were encountered:

scottcain · 2018-12-04T00:56:17Z

This seems fine with me, except that I think project_phenotype_idx2 is probably supposed to be on phenotype_id.

laceysanderson · 2018-12-04T02:19:51Z

Alternatively, we could add a project_id to the phenotype table which is nullable. This would greatly improve queries such as "all phenotypes from a given project" or "all traits measured in a given project", be backwards compatible and still very chado-esque (in my opinion).

The only downside I can think of is that it limits us to a single project per phenotype. However, we can always use dbxref as an example with both a phenotype.project_id and a phenotype_project table.

Full Disclosure: I'm invested in a phenotype.project_id since I already made such a modification for my analyzed phenotypes Tripal module due to serious performance issues observed with the phenotype_project table approach.

ekcannon · 2019-01-09T01:40:15Z

I'm okay with adding a project_id field to the phenotype table. As Lacey suggests, perhaps the project_phenotype table could be added too, in the unlikely event that a phenotype was generated by more than one project.

bradfordcondon · 2019-01-11T00:56:57Z

as someone trying to cruise through issues: is there a resolution/consensus on this?

laceysanderson · 2019-01-16T18:32:44Z

Summary:

Myself and @ekcannon support adding a phenotype.project_id which is nullable to make it backwards compatible.
there is no dissenting voice at this point

@scottcain do you support adding a phenotype.project_id or only the approach of adding a phenotype_project linker table?

scottcain added 2019 PAG Hackathon Chado 1.4 Suggestion labels Dec 4, 2018

scottcain added Hackathon: hard Priority: low labels Dec 4, 2018

colthom mentioned this issue Jun 16, 2019

Traceability of the source analysis or project #109

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

project_phenotype table? #62

project_phenotype table? #62

ekcannon commented Apr 11, 2018

scottcain commented Dec 4, 2018

laceysanderson commented Dec 4, 2018

ekcannon commented Jan 9, 2019

bradfordcondon commented Jan 11, 2019

laceysanderson commented Jan 16, 2019

project_phenotype table? #62

project_phenotype table? #62

Comments

ekcannon commented Apr 11, 2018

scottcain commented Dec 4, 2018

laceysanderson commented Dec 4, 2018

ekcannon commented Jan 9, 2019

bradfordcondon commented Jan 11, 2019

laceysanderson commented Jan 16, 2019