-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Usage decisions #1
Comments
My current thinking for the command line interface:
$ cytominer-transport . experiment.parquet \
--compartment foo.csv \
--compartment bar.csv \
--compartment baz.csv \
--experiment experiment.csv \
--images images.csv If
the command can be shortened to:
|
I like it! tagging @bethac07 |
looks good to me. In cytominer-database, we had an Also, if we have noncanonical compartments, would your compartment-naked option be:
|
Yes. The user can specify any filename they'd like for |
parquet is 100% compatible with pandas, so no need IMO |
I'm very much in favor of scrapping SQLite
I agree |
@gwaygenomics I was thinking something like: to_pandas(experiment: Path, image: Path, compartments: List[Path]) -> pandas.DataFrame |
@gwaygenomics and @shntnu Would you still like a public |
Oh – and this is a big one because
|
Yeah, I think so. I think our lab will mostly use this package via command line (assay devs final step before handoff), but I can imagine a scenario in which someone would want to run an image-based profiling pipeline end-to-end in python |
That's wise to have a public API. One use case I can think of: future profiling recipes can use it directly. |
ditto |
Enabling this will likely result in a 5x performance improvement off the bat because our current approach is to mount the bucket using using |
This comment has been minimized.
This comment has been minimized.
@shntnu Good to know. Where does your n (i.e., 24) come from? It can be arbitrary, but ideally, it would correspond to some structural detail of the experiment. Would you mind putting this in a separate issue for tracking purposes? |
So is the idea that you would call this on a folder of data, or on one subfolder at a time? AKA if our structure is the below, is
|
(I'm fine with either behavior, as long as we're all on the same page as to what it is; in cytominer-database, we'd be calling it on |
Not sure if the q is for @0x00b1 but I think we should call it on a folder of data; I have a related implementation comment here #2 (comment) |
How shall users interact with the codebase? Let's track our thoughts and decide here
The text was updated successfully, but these errors were encountered: