Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CIF structure file format error #50

Open
yx0516 opened this issue Jan 11, 2025 · 4 comments
Open

CIF structure file format error #50

yx0516 opened this issue Jan 11, 2025 · 4 comments

Comments

@yx0516
Copy link

yx0516 commented Jan 11, 2025

When i use BioPython to read the CIF structure file predicted by Protenix. The following error happened:

image

However the CIF file of other AF3-like models can be read normally by BioPython.

Please give a hand about this.

Thanks a lot.

@yx0516
Copy link
Author

yx0516 commented Jan 11, 2025

@LanLanBoom @lark @guanjq

@cloverzizi
Copy link
Contributor

Hi @yx0516 ,
we previously hadn't included the occupancy field in the CIF file, but we have just updated the code. You can either pull the latest version of the code to run the inference or resubmit the task on the WebServer.

@yx0516
Copy link
Author

yx0516 commented Jan 20, 2025

Hi @cloverzizi,

I pull the latest version of code and run the inference. And use the 'pip install .' to install the latest version and run the inference.
The upper error is fixed. However a new error happens:

Traceback (most recent call last):
File "/data/PRG/tools/apps/Protenix/run.py", line 814, in
get_results_single_run(args.topN)
File "/data/PRG/tools/apps/Protenix/run.py", line 671, in get_results_single_run
cif_to_pdb(cif_path,f"rank_{index+1}.pdb")
File "/data/PRG/tools/apps/Protenix/run.py", line 52, in cif_to_pdb
io.save(pdb)
File "/home/wemol/.local/lib/python3.10/site-packages/Bio/PDB/PDBIO.py", line 352, in save
raise PDBIOException(e)
Bio.PDB.PDBExceptions.PDBIOException: Chain id ('A0') exceeds PDB format limit.

The Chain name in CIF file is 'A0', i think it should be 'A'.
Please help to check this.
Thanks a lot.

@cloverzizi
Copy link
Contributor

Hi @yx0516,
We updated the code so that in Protenix v0.4.1, the inferred output CIF's label_asym_id is changed from formats like 'A0' to just 'A'. This adjustment ensures that when converting to PDB format, as long as the number of chains does not exceed 26, the requirement for single-character Chain IDs is met.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants