Skip to content

recreating homo_search.py output -- minimal version #135

@avilella

Description

@avilella

Hi,

I am running Uni-Fold on antibody-antigen pairs, where the antigen (chain A) is always the same, and the antibody sequences (chain B in each prediction) are very similar to each other (same species).

Since the homo_search.py part of run_unifold.sh multimer takes a long time, but produces very similar hits, I would like to recreate it in a new folder for new predictions, so I can just calculate the second inference.py part of run_unifold.sh on it.

My plan is to aggregate each of the .sto files for a bunch of predictions, and produce a combined version in the new input folder structure to inference.py. The .sto format is a bit cumbersome to recreate, and if the inference.py part is not going to read the alignment structure from it, but rather just the fasta entries, would it be possible to provide the "combined inputs" as multi-fasta files rather than .sto files?

Thanks in advance.

[       4096 Oct  3 15:59]  ./B
[  231414423 Oct  3 15:59]  ./B/uniprot_hits.sto
[   62590981 Oct  3 15:59]  ./B/pdb_hits.sto
[     516587 Oct  3 15:59]  ./B/mgnify_hits.sto
[     462122 Oct  3 15:59]  ./B/bfd_uniclust_hits.a3m
[  184892903 Oct  3 15:59]  ./B/uniref90_hits.sto
[    1788444 Oct  3 15:59]  ./B.uniprot.pkl.gz
[         81 Oct  3 15:59]  ./B.timings.json
[          3 Oct  3 15:59]  ./chains.txt
[        833 Oct  3 15:59]  ./chain_id_map.json
[     811365 Oct  3 15:59]  ./B.feature.pkl.gz
[      31503 Oct  3 15:59]  ./A.uniprot.pkl.gz
[         80 Oct  3 15:59]  ./A.timings.json
[     282367 Oct  3 15:59]  ./A.feature.pkl.gz
[       4096 Oct  3 15:59]  ./A
[    1139693 Oct  3 15:59]  ./A/uniref90_hits.sto
[     966652 Oct  3 15:59]  ./A/uniprot_hits.sto
[   40494816 Oct  3 15:59]  ./A/pdb_hits.sto
[       3189 Oct  3 15:59]  ./A/mgnify_hits.sto
[     233861 Oct  3 15:59]  ./A/bfd_uniclust_hits.a3m
[        255 Oct  3 15:59]  ./1b634d49dfcce4784af7c9bbb7d53496.TRI002.mmer_B.fasta
[        123 Oct  3 15:59]  ./1b634d49dfcce4784af7c9bbb7d53496.TRI002.mmer_A.fasta

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions