Biopython：是否有一种方法可以从PDB文件中提取特定链的氨基酸序列？

PDB_file_path = '/full/path/to/some/pdb' # Is there a 1-liner for this ? query_seqres = SeqIO.parse(PDB_file_path, 'pdb-seqres') for chain in query_seqres: if chain.id == query_chain_id: query_chain = chain.seq #

2条回答

网友

1楼 · 编辑于 2024-09-24 00:33:29

在@BioGeek answer上展开，下面是使用PDBParser.get_structure（）而不是SeqIO.parse（）时提取序列的等效代码

from Bio.PDB import PDBParser
from Bio.SeqUtils import seq1

pdbparser = PDBParser()

structure = pdbparser.get_structure(PDB_ID, PDB_file_path)
chains = {chain.id:seq1(''.join(residue.resname for residue in chain)) for chain in structure.get_chains()}

query_chain = chains[query_chain_id]

网友

2楼 · 编辑于 2024-09-24 00:33:29

在我看来，它并没有太多的python功能，但您可以使用字典压缩将生成器转换为explictdict：

from Bio import SeqIO
PDB_file_path = '6q62.pdb' 
query_chain_id = '6Q62:A'

chain = {record.id: record.seq for record in SeqIO.parse(PDB_file_path, 'pdb-seqres')}
query_chain = chain[query_chain_id]

相关问题更多 >

编程相关推荐

热门问题

热门文章

Biopython：是否有一种方法可以从PDB文件中提取特定链的氨基酸序列？

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >