-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Improve docstrings #34
Conversation
And also: * improve a bit docstring for trip_sampler * explicit default source name (EMP-2019)
Codecov Report
@@ Coverage Diff @@
## main #34 +/- ##
==========================================
+ Coverage 94.69% 94.76% +0.07%
==========================================
Files 10 10
Lines 490 497 +7
==========================================
+ Hits 464 471 +7
Misses 26 26
📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more |
@@ -188,7 +227,7 @@ def prepare_entd_2008(proxies={}): | |||
"V2_OLDACPA07", | |||
"V2_OLDACPA08", | |||
"V2_OLDACPA09", | |||
"V2_OLDARCOM_UUCat", | |||
"V2_OLDARCOM_UUCat", # Why UUCAT and not UU2010 like elsewhere? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
On utilise une autre catégorie, alors que UU2010 me semble aussi présente dans les données, je ne comprends pas bien pourquoi.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@FlxPo tu as une idée de pourquoi ? Je ne pense pas que ça ait un énorme impact sur les résultats, mais ça m'étonne
|
||
8 datasets are created and saved: | ||
|
||
* df / short_trips: list of all short trips (<80 km as the crow flies). |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Les noms des dataframes sont différents de ceux des fichiers, ça favorise la confusion. Une fois que les problèmes de fond seront éclaircis, je changerai les variables.
8 datasets are created and saved: | ||
|
||
* df / short_trips: list of all short trips (<80 km as the crow flies). | ||
Trips with unknown length, zero length or a length > 80 km are removed. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Les trajets au-dessus de 80 km ne sont pas considérés, ni dans les trajets quotidiens ni dans les longues distances. Cela peut être un biais, en tout cas cela veut dire qu'on ne considère pas les grands mobiles.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Possible qu'il y ait un biais effectivement. Il me semble qu'on avait introduit cette distinction pour ne pas avoir de double comptage avec la table voyages, mais je ne suis plus sûr de cette stratégie.
Améliorer la doc sur car_ownership_probability
L'échantillonnage de pandas a un paramètre |
Improve docstrings and comments for
Also: