-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
regarding preprocessing dataset request #1
Comments
Karthikeyana, what do you mean by posting the program to your blog? Where is your blog? The pre-processing program is a simple script to processing every interview into one line and remove unneeded items. |
import csv directory = raw_input("INPUT Folde:") txt_files = os.path.join(directory, '*.txt') for txt_file in glob.glob(txt_files):
sir i am using this code to convert all txt files to csv but i did not get this format sir plase help me :POS: :41: i disagree with the reviewers who said the movie was predictable and |
sir can you post the script in command box |
15/03/05 02:26:39 INFO input.FileInputFormat: Total input paths to process : 2 15/03/05 02:27:16 INFO mapred.JobClient: Task Id : attempt_201503042232_0030_r_000001_0, Status : FAILED 15/03/05 02:27:26 INFO mapred.JobClient: map 100% reduce 6% 15/03/05 02:27:29 INFO mapred.JobClient: map 100% reduce 0% 15/03/05 02:27:37 INFO mapred.JobClient: map 100% reduce 3% 15/03/05 02:27:40 INFO mapred.JobClient: map 100% reduce 3% |
this is my error message when i am running in single node hadoop |
can you post data preprocessing program in our blog.
The text was updated successfully, but these errors were encountered: