Skip to content

Allow handling of molecules with more than 99999 atoms #1506

@amjjbonvin

Description

@amjjbonvin

Desired feature/enhancement

While the PDB format is limited to a max of 99999 atoms, CNS can in principle handle more than that since it uses for number >99999 the hydrid_36 format (see https://cci.lbl.gov/cctbx_sources/iotbx/pdb/hybrid_36.py)

The current limitation in HADDOCK3 comes from the topoaa module that call pdb_tidy and pdb_tidy has a hard check for the 99999 limit.

Two options:

  1. modify topoaa to bypass the pdb_tidy call and handle PDB files that use the hydrid_36 format (generated by CNS)
  2. modify pdb_tidy in the pdb-tools repo to add an option to handle hydrid_36 format

Motivation

Dealing with very large systems (e.g. ribosome) (some ongoing project)

Description

Option 2 is the simplest to implement and I already have a working version.
The hybrid_36 format can be supported by adding the -cns option as argument to pdb_tidy
This option could thus be used in the topoaa module.

It will mean updating pdb-tools and then defining a minimal version for it in the HADDOCK3 dependencies.

Metadata

Metadata

Assignees

Labels

enhancementImproving something in the codebasem|topoaatopoaa module

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions