Allow /src scripts to receive data files as command line arguments #167
Comments
Hi @Irio! I'm working on exactly that. I spoke with @cuducos about issue #67, for which I also need to use fetch_cnpj_info with the amendments' dataset. Actually, I was thinking about also passing the column with the CNPJs as a parameter, so the call would be src/fetch_cnpj_info.py './data/2016-12-06-cnpj-info.xz' 'cnpj'. What do you think?
Hi @Irio, I did the refactoring in fetch_cnpj_info.py. Now we can call it passing the filename and the column with the CNPJs.
Sorry if I missed anything… but I don't think passing the column we need is what I had in mind. When I read OP's description, I think he meant the file to be loaded (for example
@Irio @cuducos I think this one was resolved with PR #185, as in https://github.com/datasciencebr/serenata-de-amor/blob/master/src/fetch_cnpj_info.py#L159.
@marcusrehm PR #185 addresses this issue when it comes to company scripts, but not for all scripts inside
Yes, @cuducos, you're right! =)
I would like to know specifically what the idea in this issue is. I've looked through some files in the source, and some of them just download data. Would you like the script to specify the path where the data is to be saved? Or would you like only the scripts that read files to receive arguments? Anyway, I think it would be nice to post a roadmap with the files that you'd like to change. Thanks.
Hi @martini97,
No, actually what would be interesting is to specify via command line the files they read (as I commented earlier).
Well… files from
Or another way to put every script but:
Hi @martini97! I think what @cuducos suggested was commented here. He's asking to create a sort of mapping like this:
So when the script receives a file as an argument, it can grab the data (CNPJs) from the referenced column and do the job. It is already done in
Is that right, @cuducos?
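The idea discussed in this thread (a file path plus the name of the column holding the CNPJs) could be sketched roughly as follows. This is only an illustration under stated assumptions: it assumes the dataset is an xz-compressed CSV with a header row, and read_column is a hypothetical helper name, not a function from the repository.

```python
import csv
import lzma


def read_column(path, column):
    """Return the unique, non-empty values of `column`, in order of first appearance.

    Assumption: `path` points to an xz-compressed CSV file with a header row.
    """
    seen = set()
    values = []
    with lzma.open(path, mode='rt', encoding='utf-8', newline='') as handle:
        for row in csv.DictReader(handle):
            value = row.get(column)  # None if the column does not exist
            if value and value not in seen:
                seen.add(value)
                values.append(value)
    return values
```

With a helper like this, a script could take both the file and the column name from the command line and stay independent of any particular dataset layout.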
👍
…s-friendly-name Add human friendly name for irregular companies classifier
Closed because this
The majority of scripts located in process datasets in hard-coded locations, like data/cnpj-info.xz in src/fetch_cnpj_info.py#L10. Given a lot has changed since their creation, we expect them to receive data paths as command line arguments, as in:
python src/fetch_cnpj_info.py data/2016-12-06-cnpj-info.xz
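A minimal sketch of what this request could look like, assuming Python's standard argparse module. The names parse_args and DEFAULT_DATASET are hypothetical, and keeping the old hard-coded path as a fallback default is an illustrative choice, not the project's actual code.

```python
import argparse

# Assumption: keep the previously hard-coded location as a fallback default.
DEFAULT_DATASET = 'data/cnpj-info.xz'


def parse_args(argv=None):
    """Parse the dataset path from the command line (illustrative only)."""
    parser = argparse.ArgumentParser(
        description='Process a dataset given as a command line argument.')
    parser.add_argument(
        'dataset',
        nargs='?',  # optional positional argument
        default=DEFAULT_DATASET,
        help='path to the dataset to read (default: %(default)s)')
    return parser.parse_args(argv)
```

A script would then call parse_args() once at startup and read from args.dataset instead of a module-level constant, so existing invocations without arguments would keep working.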