PSSP

My graduation project: Protein Secondary Structure Prediction

9 weeks left: 2 weeks for writing the paper, 6 weeks for training models, 1 week for data processing.

!!! NO NEED FOR embedding

0. Choose all of the 200-/500-/1000- data.
1. Process them into .fasta files one by one.
2. Unpack the nr archives with Python, then run psiblast to turn each .fasta into pssm#.pssm:

       import os
       for i in range(0, 65):
           os.system("tar -xzvf nr.%02d.tar.gz" % i)

       cd Data/blast
       psiblast -db pdbaa -query test.fasta -num_iterations 6 -evalue 0.001 -num_threads 16 -out_ascii_pssm pssm1.pssm -out pssm1.txt

3. Process the .pssm files until we get the matrix for each sequence.
4. Send the matrix directly into the network.
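Step 3 (turning a .pssm file into a matrix) could be sketched roughly as follows. This is only an illustration, not the code in this repo: the function name `parse_ascii_pssm` is hypothetical, and it assumes the usual ASCII PSSM layout in which each data row starts with a position number and a one-letter residue, followed by 20 whitespace-separated scores.

```python
import numpy as np

def parse_ascii_pssm(text):
    """Return (sequence, scores) parsed from the text of an ASCII PSSM file.

    scores has shape (n, 20): one row of scores per residue.
    Assumes the score columns are separated by whitespace.
    """
    residues, rows = [], []
    for line in text.splitlines():
        fields = line.split()
        # Data rows start with a position number followed by a one-letter residue;
        # header and footer lines fail this check and are skipped.
        if len(fields) < 22 or not fields[0].isdigit() or len(fields[1]) != 1:
            continue
        residues.append(fields[1])
        rows.append([int(v) for v in fields[2:22]])
    return "".join(residues), np.array(rows, dtype=np.int32).reshape(-1, 20)
```

Typical use would be `seq, pssm = parse_ascii_pssm(open("pssm1.pssm").read())`, after which `pssm` is the n*20 matrix that step 4 feeds to the network.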

Embedding is only for the one-hot input, used with the BiLSTM; we can give it a try :)
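For reference, a minimal sketch of the one-hot encoding, together with the integer ids an embedding layer would take instead. The names `AMINO_ACIDS`, `to_indices` and `to_one_hot` are made up for illustration, and the residue ordering is an assumption (it follows the usual PSSM column header).

```python
import numpy as np

AMINO_ACIDS = "ARNDCQEGHILKMFPSTWYV"  # assumed ordering of the 20 standard residues
AA_INDEX = {aa: i for i, aa in enumerate(AMINO_ACIDS)}

def to_indices(seq):
    """Map residues to integer ids; unknown residues (e.g. X) become -1."""
    return np.array([AA_INDEX.get(aa, -1) for aa in seq.upper()], dtype=np.int64)

def to_one_hot(seq):
    """Return an (n, 20) matrix; rows for unknown residues stay all-zero."""
    idx = to_indices(seq)
    one_hot = np.zeros((len(idx), len(AMINO_ACIDS)), dtype=np.float32)
    known = idx >= 0
    one_hot[np.nonzero(known)[0], idx[known]] = 1.0
    return one_hot
```

An embedding layer needs non-negative ids, so the -1 used here for unknown residues would have to be remapped (for example to a dedicated padding id) before being used that way.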

There are 4 types of prediction:

1*20 input - 1 output, 1D CNN/LR --- use PSSM_1, PSSM_2
7*20 input - 1 output, regular PSSM window, 2D CNN/LR --- use PSSM_1, PSSM_2, one-hot
n*20 input - n output, ResNet??? But the size is not identical; pixel to pixel? --- use PSSM_1, PSSM_2
n*20 input - n output, bidirectional LSTM --- use PSSM_1, PSSM_2, one-hot
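The 7*20 window input can be built from an n*20 matrix roughly like this. It is a sketch under assumptions: the helper name `sliding_windows` is hypothetical, and zero padding at both ends of the sequence is one possible choice, not necessarily the one used in this project.

```python
import numpy as np

def sliding_windows(matrix, window=7):
    """Turn an (n, 20) matrix into (n, window, 20) windows, one centred on each residue.

    Positions that fall outside the sequence are filled with zeros.
    """
    if window % 2 != 1:
        raise ValueError("window must be odd so each residue sits at the centre")
    half = window // 2
    # Pad only along the sequence axis, not along the 20 feature columns.
    padded = np.pad(matrix, ((half, half), (0, 0)), mode="constant")
    return np.stack([padded[i:i + window] for i in range(matrix.shape[0])])
```

Each of the n windows then has a single label (the secondary structure of its centre residue), which matches the "7*20 input - 1 output" setup.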

Train 3 LR models??? This is naive. How to classify 3 labels? Check CS231n for n-label classification: softmax.
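A small sketch of the softmax alternative: one model produces 3 scores per residue, softmax turns them into probabilities, and cross-entropy is the training loss. The function names are illustrative, and treating the 3 labels as class ids 0, 1, 2 is an assumption.

```python
import numpy as np

def softmax(logits):
    """Row-wise softmax over an (n, 3) array of class scores."""
    # Subtracting the row maximum avoids overflow in exp without changing the result.
    shifted = logits - logits.max(axis=1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=1, keepdims=True)

def cross_entropy(probs, labels):
    """Mean negative log-probability of the true class ids (0, 1 or 2)."""
    picked = probs[np.arange(len(labels)), labels]
    return float(-np.log(picked + 1e-12).mean())
```

The predicted label for each residue is then `softmax(logits).argmax(axis=1)`, so a single model covers all 3 classes instead of three separate binary LR models.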

Check the summer project for embedding, PyTorch; learn Ng.

one-hot, PSSM_1, PSSM_2

Next week: process the PSSM matrix and figure out a way to store the encoding. Separately, or in one whole .npy? Replace the original sequence with its encoding and then store it in a matrix, like independent images? Or just store the sequences and replace them while inputting?
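One possible answer to the storage question, sketched under assumptions: since the sequences have different lengths, the per-sequence matrices can go into a single NumPy .npz archive keyed by a sequence id, rather than one padded array. The helper names and the idea of keying by id are illustrative only.

```python
import os
import tempfile
import numpy as np

def save_encoded(path, encoded):
    """Store a dict {sequence_id: (n_i, 20) array} in one compressed .npz archive."""
    np.savez_compressed(path, **encoded)

def load_encoded(path):
    """Load the archive back into a dict of arrays."""
    with np.load(path) as archive:
        return {key: archive[key] for key in archive.files}

# Usage: two sequences of different length stored in the same file.
with tempfile.TemporaryDirectory() as tmp_dir:
    archive_path = os.path.join(tmp_dir, "encoded.npz")
    example = {
        "seq1": np.ones((5, 20), dtype=np.float32),
        "seq2": np.zeros((8, 20), dtype=np.float32),
    }
    save_encoded(archive_path, example)
    restored = load_encoded(archive_path)
```

Storing the encoded matrices up front trades disk space for faster training; keeping only the raw sequences and encoding while inputting does the opposite.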

Repository files:

BiLSTM+CRF.py
BiLSTM.py
LRmodel.py
LRmodel_new.py
LSTM.py
README.md
data_process_LR.py
data_process_new_LR.py
graduate.pdf
new_pssm_to_intput_process.py
pssm_to_intput_process.py
seq_to_pssm_process.py
shell-instruction
ss_to_npy.py
ss_to_singletxt.py
test_without_drop.py
weigjt_BiLSTM.py

EmoN-Fang/PSSP