Hi,
I am running Uni-Fold on antibody-antigen pairs, where the antigen (chain A) is always the same, and the antibody sequences (chain B in each prediction) are very similar to each other (same species).
Since the homo_search.py part of run_unifold.sh multimer takes a long time, but produces very similar hits, I would like to recreate it in a new folder for new predictions, so I can just calculate the second inference.py part of run_unifold.sh on it.
My plan is to aggregate each of the .sto files for a bunch of predictions, and produce a combined version in the new input folder structure to inference.py. The .sto format is a bit cumbersome to recreate, and if the inference.py part is not going to read the alignment structure from it, but rather just the fasta entries, would it be possible to provide the "combined inputs" as multi-fasta files rather than .sto files?
Thanks in advance.
[ 4096 Oct 3 15:59] ./B
[ 231414423 Oct 3 15:59] ./B/uniprot_hits.sto
[ 62590981 Oct 3 15:59] ./B/pdb_hits.sto
[ 516587 Oct 3 15:59] ./B/mgnify_hits.sto
[ 462122 Oct 3 15:59] ./B/bfd_uniclust_hits.a3m
[ 184892903 Oct 3 15:59] ./B/uniref90_hits.sto
[ 1788444 Oct 3 15:59] ./B.uniprot.pkl.gz
[ 81 Oct 3 15:59] ./B.timings.json
[ 3 Oct 3 15:59] ./chains.txt
[ 833 Oct 3 15:59] ./chain_id_map.json
[ 811365 Oct 3 15:59] ./B.feature.pkl.gz
[ 31503 Oct 3 15:59] ./A.uniprot.pkl.gz
[ 80 Oct 3 15:59] ./A.timings.json
[ 282367 Oct 3 15:59] ./A.feature.pkl.gz
[ 4096 Oct 3 15:59] ./A
[ 1139693 Oct 3 15:59] ./A/uniref90_hits.sto
[ 966652 Oct 3 15:59] ./A/uniprot_hits.sto
[ 40494816 Oct 3 15:59] ./A/pdb_hits.sto
[ 3189 Oct 3 15:59] ./A/mgnify_hits.sto
[ 233861 Oct 3 15:59] ./A/bfd_uniclust_hits.a3m
[ 255 Oct 3 15:59] ./1b634d49dfcce4784af7c9bbb7d53496.TRI002.mmer_B.fasta
[ 123 Oct 3 15:59] ./1b634d49dfcce4784af7c9bbb7d53496.TRI002.mmer_A.fasta
Hi,
I am running Uni-Fold on antibody-antigen pairs, where the antigen (chain A) is always the same, and the antibody sequences (chain B in each prediction) are very similar to each other (same species).
Since the
homo_search.pypart of run_unifold.sh multimer takes a long time, but produces very similar hits, I would like to recreate it in a new folder for new predictions, so I can just calculate the secondinference.pypart ofrun_unifold.shon it.My plan is to aggregate each of the
.stofiles for a bunch of predictions, and produce a combined version in the new input folder structure toinference.py. The.stoformat is a bit cumbersome to recreate, and if theinference.pypart is not going to read the alignment structure from it, but rather just the fasta entries, would it be possible to provide the "combined inputs" as multi-fasta files rather than.stofiles?Thanks in advance.