Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Changing from negative-id CommentAnnotations and MapAnnotations to XMLAnnotations for non-OME data model info #74

Merged
merged 25 commits into from
Jan 25, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
25 commits
Select commit Hold shift + click to select a range
20239eb
schemas!
erickmartins Nov 6, 2023
57fa4ae
better schemas!
erickmartins Nov 6, 2023
a3b3139
starting to move towards XML annotations
erickmartins Nov 9, 2023
34896fa
creating XML path references
erickmartins Nov 9, 2023
6f5abb5
annotations correct for prepare
erickmartins Nov 17, 2023
abd7aa2
prepare working, using global count for new annotations
erickmartins Dec 5, 2023
377d14a
packside working
erickmartins Dec 5, 2023
010ee92
annotated places that need XML rework
erickmartins Dec 6, 2023
1b746d5
missing only using provenance XMLs
erickmartins Dec 8, 2023
a998561
changed and used provenance metadata XML
erickmartins Dec 8, 2023
f23afc6
removed renderingsettings schema, fixed func signature
erickmartins Dec 12, 2023
bdd2b16
fixing `get_server_path` to work as intended
erickmartins Dec 15, 2023
4ee5ffc
creating prov metadata, accounting for multiple XML anns
erickmartins Dec 15, 2023
d40703a
debugging, new valid_single_image test data
erickmartins Dec 18, 2023
640af09
new test data, minor test fixes
erickmartins Dec 19, 2023
1da9228
tests passing!
erickmartins Dec 22, 2023
af72e86
using pip from glencoe zeroc-ice, fixing flake8
erickmartins Dec 22, 2023
602d628
reverting to conda - wrong bftools...
erickmartins Dec 22, 2023
b7f39a3
pinning flake and pycodestyle for now
erickmartins Dec 22, 2023
4c237ba
moving to python 3.9
erickmartins Dec 22, 2023
b0ead44
update for 1.0.0, updating dependencies
erickmartins Jan 22, 2024
fad0b5e
Merge branch 'main' into XMLAnnotations
erickmartins Jan 25, 2024
86b7c1e
Merge branch 'main' into XMLAnnotations
erickmartins Jan 25, 2024
3250900
readding submodule
erickmartins Jan 25, 2024
53dffc6
minor readme fix
erickmartins Jan 25, 2024
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion .omeroci/py-setup
Original file line number Diff line number Diff line change
Expand Up @@ -18,7 +18,7 @@ source /tmp/miniconda/bin/activate
conda init
conda create -n omero python=3.9
conda activate omero
conda install -c conda-forge zeroc-ice=3.6.5
pip install https://github.com/glencoesoftware/zeroc-ice-py-linux-x86_64/releases/download/20231130/zeroc_ice-3.6.5-cp39-cp39-manylinux_2_28_x86_64.whl
conda install -c bioconda bftools

cd $TARGET
Expand Down
1 change: 1 addition & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -63,6 +63,7 @@ about the files (name, mimetype).

`--plugin` allows you to export omero data to a desired format by using an external plugin. See for example the [arc plugin](https://github.com/cmohl2013/omero-arc), which exports omero
projects to ARC repositories.

`--binaries` allows to specify whether to archive binary data
(e.g images, ROIs, FileAnnotations) or only create the transfer.xml.
Default is `all` and will create the archive. With `none`, only the `transfer.xml`
Expand Down
19 changes: 19 additions & 0 deletions schemas/provenancemetadata.xsd
Original file line number Diff line number Diff line change
@@ -0,0 +1,19 @@
<xs:schema attributeFormDefault="unqualified" elementFormDefault="qualified"
xmlns:xs="http://www.w3.org/2001/XMLSchema">
<xs:element name="CLITransferMetadata">
<xs:complexType>
<xs:sequence>
<xs:element type="xs:integer" name="origin_plate_id"/>
<xs:element type="xs:integer" name="origin_image_id"/>
<xs:element type="xs:string" name="software"/>
<xs:element type="xs:string" name="version"/>
<xs:element type="xs:string" name="packing_timestamp"/>
<xs:element type="xs:string" name="origin_hostname"/>
<xs:element type="xs:string" name="md5"/>
<xs:element type="xs:string" name="original_user"/>
<xs:element type="xs:string" name="original_group"/>
<xs:element type="xs:string" name="database_id"/>
</xs:sequence>
</xs:complexType>
</xs:element>
</xs:schema>
10 changes: 10 additions & 0 deletions schemas/serverpath.xsd
Original file line number Diff line number Diff line change
@@ -0,0 +1,10 @@
<xs:schema attributeFormDefault="unqualified" elementFormDefault="qualified"
xmlns:xs="http://www.w3.org/2001/XMLSchema">
<xs:element name="CLITransferServerPath">
<xs:complexType>
<xs:sequence>
<xs:element type="xs:string" name="Path" />
</xs:sequence>
</xs:complexType>
</xs:element>
</xs:schema>
6 changes: 3 additions & 3 deletions setup.py
Original file line number Diff line number Diff line change
Expand Up @@ -84,7 +84,7 @@ def read(fname):
packages=['', 'omero.plugins'],
package_dir={"": "src"},
name="omero-cli-transfer",
version='0.8.0',
version='1.0.0',
maintainer="Erick Ratamero",
maintainer_email="erick.ratamero@jax.org",
description=("A set of utilities for exporting a transfer"
Expand All @@ -95,8 +95,8 @@ def read(fname):
long_description_content_type="text/markdown",
url="https://github.com/TheJacksonLaboratory/omero-cli-transfer",
install_requires=[
'ezomero==2.0.0',
'ome-types==0.4.2'
'ezomero>=2.1.0, <3.0.0',
'ome-types>=0.4.5,<0.5.0'
],
extras_require={
"rocrate": ["rocrate==0.7.0"],
Expand Down
175 changes: 115 additions & 60 deletions src/generate_omero_objects.py
Original file line number Diff line number Diff line change
Expand Up @@ -4,14 +4,15 @@
# Use is subject to license terms supplied in LICENSE.

import ezomero
from typing import List, Tuple
from ome_types import to_xml
from typing import List, Tuple, Union
from omero.model import DatasetI, IObject, PlateI, WellI, WellSampleI, ImageI
from omero.gateway import DatasetWrapper
from ome_types.model import TagAnnotation, MapAnnotation, FileAnnotation, ROI
from ome_types.model import CommentAnnotation, LongAnnotation, Annotation
from ome_types.model import Line, Point, Rectangle, Ellipse, Polygon, Shape
from ome_types.model import Polyline, Label, Project, Screen, Dataset, OME
from ome_types.model import Image, Plate
from ome_types.model import Image, Plate, XMLAnnotation, AnnotationRef
from ome_types.model.simple_types import Marker
from omero.gateway import TagAnnotationWrapper, MapAnnotationWrapper
from omero.gateway import CommentAnnotationWrapper, LongAnnotationWrapper
Expand All @@ -21,6 +22,7 @@
from omero.rtypes import rstring, RStringI, rint
from ezomero import rois
from pathlib import Path
import xml.etree.cElementTree as ETree
import os
import copy

Expand Down Expand Up @@ -165,32 +167,7 @@ def create_annotations(ans: List[Annotation], conn: BlitzGateway, hash: str,
map_ann.setNs(namespace)
key_value_data = []
for v in an.value.ms:
if int(an.id.split(":")[-1]) < 0:
if not metadata:
key_value_data.append(['empty_metadata', "True"])
break
if v.k == "md5" and "md5" in metadata:
key_value_data.append(['zip_file_md5', hash])
if v.k == "origin_image_id" and "img_id" in metadata:
key_value_data.append([v.k, v.value])
if v.k == "origin_plate_id" and "plate_id" in metadata:
key_value_data.append([v.k, v.value])
if v.k == "packing_timestamp" and "timestamp" in metadata:
key_value_data.append([v.k, v.value])
if v.k == "software" and "software" in metadata:
key_value_data.append([v.k, v.value])
if v.k == "version" and "version" in metadata:
key_value_data.append([v.k, v.value])
if v.k == "origin_hostname" and "hostname" in metadata:
key_value_data.append([v.k, v.value])
if v.k == "original_user" and "orig_user" in metadata:
key_value_data.append([v.k, v.value])
if v.k == "original_group" and "orig_group" in metadata:
key_value_data.append([v.k, v.value])
if v.k == "database_id" and "db_id" in metadata:
key_value_data.append([v.k, v.value])
else:
key_value_data.append([v.k, v.value])
key_value_data.append([v.k, v.value])
map_ann.setValue(key_value_data)
map_ann.save()
ann_map[an.id] = map_ann.getId()
Expand Down Expand Up @@ -220,44 +197,109 @@ def create_annotations(ans: List[Annotation], conn: BlitzGateway, hash: str,
file_ann.setFile(original_file)
file_ann.save()
ann_map[an.id] = file_ann.getId()
elif isinstance(an, XMLAnnotation):
# pass if path, use if provenance metadata
tree = ETree.fromstring(to_xml(an.value,
canonicalize=True))
is_metadata = False
for el in tree:
if el.tag.rpartition('}')[2] == "CLITransferMetadata":
is_metadata = True
if is_metadata:
map_ann = MapAnnotationWrapper(conn)
namespace = an.namespace
map_ann.setNs(namespace)
key_value_data = []
if not metadata:
key_value_data.append(['empty_metadata', "True"])
else:
key_value_data = parse_xml_metadata(an, metadata, hash)
map_ann.setValue(key_value_data)
map_ann.save()
ann_map[an.id] = map_ann.getId()
return ann_map


def parse_xml_metadata(ann: XMLAnnotation,
metadata: List[str],
hash: str) -> List[List[str]]:
kv_data = []
tree = ETree.fromstring(to_xml(ann.value, canonicalize=True))
for el in tree:
if el.tag.rpartition('}')[2] == "CLITransferMetadata":
for el2 in el:
item = el2.tag.rpartition('}')[2]
val = el2.text
if item == "md5" and "md5" in metadata:
kv_data.append(['md5', hash])
if item == "origin_image_id" and "img_id" in metadata:
kv_data.append([item, val])
if item == "origin_plate_id" and "plate_id" in metadata:
kv_data.append([item, val])
if item == "packing_timestamp" and "timestamp" in metadata:
kv_data.append([item, val])
if item == "software" and "software" in metadata:
kv_data.append([item, val])
if item == "version" and "version" in metadata:
kv_data.append([item, val])
if item == "origin_hostname" and "hostname" in metadata:
kv_data.append([item, val])
if item == "original_user" and "orig_user" in metadata:
kv_data.append([item, val])
if item == "original_group" and "orig_group" in metadata:
kv_data.append([item, val])
if item == "database_id" and "db_id" in metadata:
kv_data.append([item, val])
return kv_data


def get_server_path(anrefs: List[AnnotationRef],
ans: List[Annotation]) -> Union[str, None]:
fpath = None
xml_ids = []
for an in anrefs:
for an_loop in ans:
if an.id == an_loop.id:
if isinstance(an_loop, XMLAnnotation):
xml_ids.append(an_loop.id)
else:
continue
for an_loop in ans:
if an_loop.id in xml_ids:
if not fpath:
tree = ETree.fromstring(to_xml(an_loop.value,
canonicalize=True))
for el in tree:
if el.tag.rpartition('}')[2] == "CLITransferServerPath":
for el2 in el:
if el2.tag.rpartition('}')[2] == "Path":
fpath = el2.text
return fpath


def update_figure_refs(ann: FileAnnotation, ans: List[Annotation],
img_map: dict, folder: str):
curr_folder = str(Path('.').resolve())
for an in ann.annotation_refs:
clean_id = int(an.id.split(":")[-1])
if clean_id < 0:
cmnt_id = an.id
for an_loop in ans:
if an_loop.id == cmnt_id and isinstance(an_loop, CommentAnnotation):
fpath = str(an_loop.value)
dest_path = str(os.path.join(curr_folder, folder, '.', fpath))
with open(dest_path, 'r') as file:
filedata = file.read()
for src_id, dest_id in img_map.items():
clean_id = int(src_id.split(":")[-1])
src_str = f"\"imageId\": {clean_id}"
dest_str = f"\"imageId\": {dest_id}"
print(src_str, dest_str)
filedata = filedata.replace(src_str, dest_str)
with open(dest_path, 'w') as file:
file.write(filedata)
fpath = get_server_path(ann.annotation_refs, ans)
if fpath:
dest_path = str(os.path.join(curr_folder, folder, '.', fpath))
with open(dest_path, 'r') as file:
filedata = file.read()
for src_id, dest_id in img_map.items():
clean_id = int(src_id.split(":")[-1])
src_str = f"\"imageId\": {clean_id}"
dest_str = f"\"imageId\": {dest_id}"
filedata = filedata.replace(src_str, dest_str)
with open(dest_path, 'w') as file:
file.write(filedata)
return


def create_original_file(ann: FileAnnotation, ans: List[Annotation],
conn: BlitzGateway, folder: str
) -> OriginalFileWrapper:
curr_folder = str(Path('.').resolve())
for an in ann.annotation_refs:
clean_id = int(an.id.split(":")[-1])
if clean_id < 0:
cmnt_id = an.id
for an_loop in ans:
if an_loop.id == cmnt_id and isinstance(an_loop, CommentAnnotation):
fpath = str(an_loop.value)
fpath = get_server_path(ann.annotation_refs, ans)
dest_path = str(os.path.join(curr_folder, folder, '.', fpath))
ofile = conn.createOriginalFileFromLocalFile(dest_path)
return ofile
Expand All @@ -270,19 +312,30 @@ def create_plate_map(ome: OME, img_map: dict, conn: BlitzGateway
map_ref_ids = []
for plate in ome.plates:
ann_ids = [i.id for i in plate.annotation_refs]
file_path = ""
for ann in ome.structured_annotations:
if (ann.id in ann_ids and
isinstance(ann, CommentAnnotation) and
int(ann.id.split(":")[-1]) < 0):
newome.structured_annotations.remove(ann)
map_ref_ids.append(ann.id)
file_path = ann.value
isinstance(ann, XMLAnnotation)):
tree = ETree.fromstring(to_xml(ann.value,
canonicalize=True))
is_metadata = False
for el in tree:
if el.tag.rpartition('}')[2] == "CLITransferMetadata":
is_metadata = True
if not is_metadata:
newome.structured_annotations.remove(ann)
map_ref_ids.append(ann.id)
file_path = get_server_path(plate.annotation_refs,
ome.structured_annotations)
annref = next(filter(lambda x: x.id == ann.id,
plate.annotation_refs))
newplate = next(filter(lambda x: x.id == plate.id,
newome.plates))
newplate.annotation_refs.remove(annref)
q = conn.getQueryService()
params = Parameters()
if not file_path:
raise ValueError(f"Plate ID {plate.id} does not have a \
CommentAnnotation with a file path!")
XMLAnnotation with a file path!")
path_query = str(file_path).strip('/')
if path_query.endswith('mock_folder'):
path_query = path_query.rstrip("mock_folder")
Expand Down Expand Up @@ -584,6 +637,8 @@ def link_one_annotation(obj: IObject, ann: Annotation, ann_map: dict,
ann_obj = conn.getObject("LongAnnotation", ann_id)
elif isinstance(ann, FileAnnotation):
ann_obj = conn.getObject("FileAnnotation", ann_id)
elif isinstance(ann, XMLAnnotation):
ann_obj = conn.getObject("MapAnnotation", ann_id)
else:
ann_obj = None
if ann_obj:
Expand Down
Loading