Code and models are missing #1

ddofer · 2023-09-25T08:07:02Z

Hi, I read the paper - but I see the repo is empty of code or models.
Notably, I want to see if your textual pretraining data filtered out cases that appear (or are similar, by BLAST or the like) to any in the TAPE eval set. (e.g. like we did in ProteinBERT https://github.com/nadavbra/protein_bert )

chao1224 · 2023-12-01T08:04:15Z

Hi @ddofer,

Thank you for the questions.

We will release the codes and models once our manuscript is officially published.
To your second question, we double-checked the SwissProtCLAP and TAPE datasets (train & eval & test), and there are no shared protein sequences.

Amelie-Schreiber · 2024-01-01T23:40:50Z

When will the paper be published and when will the code be subsequently released?

chao1224 · 2024-01-02T00:59:15Z

Hi @Amelie-Schreiber, our manuscript is now in submission. We will release the code once it is accepted. Meanwhile, you can check the latest version here.

chao1224 added a commit that referenced this issue Mar 10, 2024

Initial Commit, #1

06a0189

chao1224 closed this as completed Jul 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Code and models are missing #1

Code and models are missing #1

ddofer commented Sep 25, 2023

chao1224 commented Dec 1, 2023

Amelie-Schreiber commented Jan 1, 2024

chao1224 commented Jan 2, 2024

Code and models are missing #1

Code and models are missing #1

Comments

ddofer commented Sep 25, 2023

chao1224 commented Dec 1, 2023

Amelie-Schreiber commented Jan 1, 2024

chao1224 commented Jan 2, 2024