characteristic analysis of protein sequences
training data: positive_training.txt, negative1_training.txt, negative2_training.txt. feature analysis: use k_space.py, N5C5.py and pssm.py to extract characteristics of features and output as libsvm file. model building: use WEKA SMO and LIBSVM package.