Using multiple input structures for a single sequence output #57

Open
acarbn opened this issue Jul 6, 2023 · 2 comments

acarbn commented Jul 6, 2023

I am trying to design a sequence for a multi-state protein. It has four available conformations that differ from each other. How can I tie all residues across these four PDBs so that I can design a single sequence optimised against all four structures? I looked through the examples but couldn't find an easy way to do this.

Owner

dauparas commented Jul 6, 2023

You can build a tied-residue dictionary using this helper script: https://github.com/dauparas/ProteinMPNN/blob/main/helper_scripts/make_pos_neg_tied_positions_dict.py (--homooligomer 1 --input_path "my_path_to_input.pdb" --output_path "my_path_to_output.jsonl"). Then pass the tied-positions .jsonl to the main script using the --tied_positions_jsonl flag.
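
To make "tied-residue dictionary" concrete, here is a minimal sketch of what such a dictionary might look like when residue i is tied across every chain. The layout (a structure name mapping to a list of per-position {chain: [residue_index]} objects, one JSON object per line) is an assumption; check the helper script for the exact format it writes. `build_tied_positions` and `"my_protein"` are hypothetical names, and the per-chain weights set with --pos_neg_chain_betas are not modelled here.

```python
import json


def build_tied_positions(name, chains, length):
    """Tie residue i of every chain together, for i = 1..length.

    Assumed layout: {name: [{chain: [i], ...}, ...]} with 1-based indices.
    """
    tied = [{chain: [i] for chain in chains} for i in range(1, length + 1)]
    return {name: tied}


# Hypothetical example: four chains (one per conformation), three residues each.
entry = build_tied_positions("my_protein", ["A", "B", "C", "D"], 3)

# One JSON object per line is what a .jsonl file holds.
line = json.dumps(entry)
print(line)
```

Each list element groups the positions that must share one amino acid, so with four chains every group has four members.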

Author

acarbn commented Jul 7, 2023

Thanks. It is a monomer, not an oligomer. Would I still need --homooligomer 1?
I tried the following:

python $pmpnn/helper_scripts/make_pos_neg_tied_positions_dict.py --input_path=$path_for_parsed_chains --output_path=$path_for_tied_positions --homooligomer 1 --pos_neg_chain_list="A" --pos_neg_chain_betas "1.0"

The script required the --pos_neg_chain_list and --pos_neg_chain_betas flags in order to run, which is why I added them (I also gave all structures the same chain ID, A). Now I get a sequence for each structure, but only one of them is actually redesigned; the rest return the same sequence as the input, with NaN scores. Does this mean it considered all the structures but produced only one sequence?
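
One thing that might be worth trying, sketched below, is to give each conformation its own chain ID and merge the four into a single PDB, so that a homooligomer-style tie can link residue i across chains A to D. Whether this is what the maintainer intended, and whether it resolves the NaN scores, is an assumption and is not confirmed in this thread. The function names are hypothetical; the only fact relied on is the PDB fixed-column layout, where the chain ID sits in column 22. Structures that overlap in space may need to be translated apart first, which this sketch does not do.

```python
def relabel_chain(pdb_text, new_chain):
    """Set the chain ID (column 22) of every ATOM/HETATM/TER record."""
    out = []
    for line in pdb_text.splitlines():
        if line.startswith(("ATOM", "HETATM", "TER")) and len(line) > 21:
            line = line[:21] + new_chain + line[22:]
        out.append(line)
    return "\n".join(out)


def combine_states(pdb_texts, chain_ids="ABCD"):
    """Merge several single-chain PDB texts into one, one chain ID per state."""
    merged = []
    for text, chain in zip(pdb_texts, chain_ids):
        for line in relabel_chain(text, chain).splitlines():
            # Keep coordinate records only; headers and END lines are dropped.
            if line.startswith(("ATOM", "HETATM")):
                merged.append(line)
        merged.append("TER")  # close each chain
    merged.append("END")
    return "\n".join(merged) + "\n"
```

In use, each element of `pdb_texts` would be the contents of one conformation's PDB file, and the returned string would be written out as a single multi-chain PDB before parsing.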
