Data Item _entity_poly.pdbx_seq_one_letter_code_can


Item name
Category name
Attribute name
Required in PDB entries
Used in current PDB entries
Yes, in about 100.0 % of entries

Item Description

Cannonical chemical sequence expressed as string of one-letter amino acid codes. Modifications are coded as the parent amino acid where possible. A for alanine or adenine B for ambiguous asparagine/aspartic-acid R for arginine N for asparagine D for aspartic-acid C for cysteine or cystine or cytosine Q for glutamine E for glutamic-acid Z for ambiguous glutamine/glutamic acid G for glycine or guanine H for histidine I for isoleucine L for leucine K for lysine M for methionine F for phenylalanine P for proline S for serine T for threonine or thymine W for tryptophan Y for tyrosine V for valine U for uracil

Item Example


Data Type

Data type code
Data type detail
text item types / multi-line text ...
Primitive data type code
Regular expression
[][ \n\t()_,.;:"&<>/\{}'`~!@#$%?+=*A-Za-z0-9|^-]*


Alias Item Name Dictionary Name Dictionary Version
_entity_poly.ndb_seq_one_letter_code_can cif_rcsb.dic 1.1