Skip to content
This repository

HTTPS clone URL

Subversion checkout URL

You can clone with HTTPS or Subversion.

Download ZIP
Browse code

add util script for data cleaning

  • Loading branch information...
commit 21aeecb72ad4f0dc1d1b24ba73a87b788be771e9 1 parent 7d0c715
Daniel Erenrich authored

Showing 1 changed file with 5 additions and 0 deletions. Show diff stats Hide diff stats

  1. +5 0 frombulate.sh
5 frombulate.sh
... ... @@ -0,0 +1,5 @@
  1 +cat data.csv | grep -v " " | shuf > clean_data.csv
  2 +cat clean_data.csv | head -n -50000 | cut -d" " -f 1,2 --complement > train_features.csv
  3 +cat clean_data.csv | head -n -50000 | cut -d" " -f 1,2 > train_labels.csv
  4 +cat clean_data.csv | tail -n 50000 | cut -d" " -f 1,2 --complement > test_features.csv
  5 +cat clean_data.csv | tail -n 50000 | cut -d" " -f 1,2 > test_labels.csv

0 comments on commit 21aeecb

Please sign in to comment.
Something went wrong with that request. Please try again.