Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Repo size #38

Open
sudomakeflies opened this issue Aug 29, 2022 · 2 comments
Open

Repo size #38

sudomakeflies opened this issue Aug 29, 2022 · 2 comments

Comments

@sudomakeflies
Copy link

sudomakeflies commented Aug 29, 2022

Hi Biosteam community, I'am a new Biosteam user and the first issue I faced up with Biorefineries Bioindustrial-park are the Size of the repo 1.7G, I understand the good practice to share xls files to contrast the ideas, and to validate but its a little awful. I suggest bring all data like .xls, .csv,... on other kind of cloud file mirror services (google drive, dropbox, end so on). Would be fantastic if would exist routines to create or download them assest files on the fly in python execution inicialization even I can help coding some common routings to automate this type of action maybe if someone also think that it is an #enhancement, let me know please. This approch maybe help from the begin of the #Boilerplate-biorefinery. With my best gratitude.

@sudomakeflies
Copy link
Author

Maybe the most easy way it's to gitignore .csv and .xls files, next make a pull request providing additional link to a .zip file (in a README file for example) with all this non-critical statics assets in a suitable directory structure (this suppose advanced users). A little more complex approach implement simple snippets methods to load_data into__init__.py files, some key question is the service/technology to choice for mirror this files, the main benefit is transparent for users.

@yoelcortes
Copy link
Member

yoelcortes commented Aug 30, 2022

Hi @sudomakeflies,

I appreciate your thoughts on this. Yeah, the size is pretty big. Did you know you can clone the repo with:

$ git clone --depth 5 git://github.com/BioSTEAMDevelopmentGroup/Bioindustrial-Park

We can add .csv and .xlsx to gitignore and have users add the files only when they are about to publish the biorefinery in a paper.

Using zip files may not be feasible considering we got many users that need quick access to load/save data through python. Please feel free to suggest an implementation for easy saving and loading of xlsx, csv, npy files from a cloud.

Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants