
Logic of the scripts #2

Open
albusdemens opened this issue Jan 5, 2021 · 2 comments

Comments

@albusdemens

Hello again, could you check whether I am running the scripts in the right order? As far as I understand, the recipe is the following:

  1. init.py --> initialise the steamids.txt file, which lists a few IDs of Steam users.
  2. 00_collect_steamid.py --> collect more Steam IDs. The script keeps running until stopped.
  3. 00_collect_private_public_index.py --> collect the status of the IDs (they can be public or private).
  4. 00_collect_data.py --> collect statistics for the listed IDs which are also public.
  5. 01_clean.py --> Clean collected data, combining datasets into a single one.
  6. 02_eda.ipynb --> Do some data analysis.
  7. 03_resample.py --> Resample, to take into account that our dataset is not balanced. Considered approaches: under-sampling and over-sampling.
  8. 04_models.py --> Run and optimise ML methods for cheater detection.
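
To make the proposed order concrete, here is a minimal sketch of a runner that calls the scripts one after another. It is only an illustration: the order is the one listed above and has not been confirmed, the `run_pipeline` and `build_command` helpers and the 600-second budget for the collector are assumptions, and 02_eda.ipynb is left out because a notebook is normally run interactively.

```python
import subprocess
import sys

# Script names come from the list in this issue; the order is the proposed
# one and has NOT been confirmed by the maintainer.
PIPELINE = [
    "init.py",
    "00_collect_steamid.py",
    "00_collect_private_public_index.py",
    "00_collect_data.py",
    "01_clean.py",
    "03_resample.py",
    "04_models.py",
]

# 00_collect_steamid.py keeps running until stopped, so it gets a time
# budget in seconds. The value is an arbitrary assumption.
TIME_LIMITS = {"00_collect_steamid.py": 600}


def build_command(script):
    """Return the command that runs one script with the current interpreter."""
    return [sys.executable, script]


def run_pipeline(scripts=None, dry_run=True):
    """Run (or, with dry_run=True, only print) each step in order."""
    scripts = PIPELINE if scripts is None else scripts
    visited = []
    for script in scripts:
        cmd = build_command(script)
        visited.append(script)
        if dry_run:
            print("would run:", " ".join(cmd))
            continue
        try:
            # check=True stops the pipeline if a step exits with an error.
            subprocess.run(cmd, check=True, timeout=TIME_LIMITS.get(script))
        except subprocess.TimeoutExpired:
            print(f"{script} stopped after its time budget")
    return visited


if __name__ == "__main__":
    run_pipeline(dry_run=True)
```

With `dry_run=True` nothing is executed, so the order can be checked before any data collection starts.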

Am I missing something? Also, when I launch 00_collect_steamid.py I get a NameError (name 'vacbanned_last20' is not defined). Do you know how to fix it? Thanks heaps!
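
For context, a NameError like this often means the name is only assigned inside a loop or branch that never ran, for example when a request returned no records. The sketch below is a hypothetical reconstruction, not the code from 00_collect_steamid.py: the `count_vac_banned` helper, the `"banned"` key and the idea that `vacbanned_last20` is a count over the last 20 records are all assumptions made for illustration.

```python
# Hypothetical reconstruction, NOT the code from 00_collect_steamid.py.
#
# Failing pattern (kept as a comment so this block still runs):
#
#     for record in records:            # records may be empty
#         vacbanned_last20 = ...        # only bound if the loop body runs
#     print(vacbanned_last20)           # NameError when records was empty


def count_vac_banned(records, window=20):
    """Count flagged entries among the last `window` records.

    The "banned" key is an assumed field name used only in this sketch.
    """
    # Bind the name before the loop so it exists even for empty input.
    vacbanned_last20 = 0
    for record in records[-window:]:
        if record.get("banned", False):
            vacbanned_last20 += 1
    return vacbanned_last20


if __name__ == "__main__":
    sample = [{"banned": True}, {"banned": False}, {"banned": True}]
    print(count_vac_banned(sample))
    print(count_vac_banned([]))
```

Giving the variable a default value before the loop is the usual way to avoid the error, whatever the real script stores in it.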

@vh42720
Owner

vh42720 commented Jan 6, 2021 via email

@albusdemens
Author

albusdemens commented Jan 13, 2021 via email


2 participants