Skip to content

HTTPS clone URL

Subversion checkout URL

You can clone with
or
.
Download ZIP
Browse files

add util script for data cleaning

  • Loading branch information...
commit 21aeecb72ad4f0dc1d1b24ba73a87b788be771e9 1 parent 7d0c715
Daniel Erenrich authored
Showing with 5 additions and 0 deletions.
  1. +5 −0 frombulate.sh
View
5 frombulate.sh
@@ -0,0 +1,5 @@
+cat data.csv | grep -v " " | shuf > clean_data.csv
+cat clean_data.csv | head -n -50000 | cut -d" " -f 1,2 --complement > train_features.csv
+cat clean_data.csv | head -n -50000 | cut -d" " -f 1,2 > train_labels.csv
+cat clean_data.csv | tail -n 50000 | cut -d" " -f 1,2 --complement > test_features.csv
+cat clean_data.csv | tail -n 50000 | cut -d" " -f 1,2 > test_labels.csv
Please sign in to comment.
Something went wrong with that request. Please try again.