The reduced Murphy amino acid alphabet.. More...
#include <seqan3/alphabet/aminoacid/aa10murphy.hpp>
Public Member Functions | |
Constructors, destructor and assignment | |
constexpr | aa10murphy () noexcept=default |
Defaulted. | |
constexpr | aa10murphy (aa10murphy const &) noexcept=default |
Defaulted. | |
constexpr | aa10murphy (aa10murphy &&) noexcept=default |
Defaulted. | |
constexpr aa10murphy & | operator= (aa10murphy const &) noexcept=default |
Defaulted. | |
constexpr aa10murphy & | operator= (aa10murphy &&) noexcept=default |
Defaulted. | |
~aa10murphy () noexcept=default | |
Defaulted. | |
![]() | |
constexpr | aminoacid_base (other_aa_type const other) noexcept |
Allow explicit construction from any other aminoacid type and convert via the character representation. More... | |
![]() | |
constexpr | alphabet_base () noexcept=default |
Defaulted. | |
constexpr | alphabet_base (alphabet_base const &) noexcept=default |
Defaulted. | |
constexpr | alphabet_base (alphabet_base &&) noexcept=default |
Defaulted. | |
constexpr alphabet_base & | operator= (alphabet_base const &) noexcept=default |
Defaulted. | |
constexpr alphabet_base & | operator= (alphabet_base &&) noexcept=default |
Defaulted. | |
~alphabet_base () noexcept=default | |
Defaulted. | |
constexpr char_type | to_char () const noexcept |
Return the letter as a character of char_type. More... | |
constexpr rank_type | to_rank () const noexcept |
Return the letter's numeric value (rank in the alphabet). More... | |
constexpr derived_type & | assign_char (char_type const chr) noexcept |
Assign from a character, implicitly converts invalid characters. More... | |
constexpr derived_type & | assign_rank (rank_type const c) noexcept |
Assign from a numeric value. More... | |
Related Functions | |
(Note that these are not member functions.) | |
using | aa10murphy_vector = std::vector< aa10murphy > |
Alias for a std::vector of seqan3::aa10murphy. More... | |
Literals | |
constexpr aa10murphy | operator""_aa10murphy (char const c) noexcept |
The seqan3::aa10murphy char literal. More... | |
aa10murphy_vector | operator""_aa10murphy (char const *const s, size_t const n) |
The seqan3::aa10murphy string literal. More... | |
Additional Inherited Members | |
![]() | |
static constexpr bool | char_is_valid (char_type const c) noexcept |
Validate whether a character value has a one-to-one mapping to an alphabet value. More... | |
![]() | |
static constexpr detail::min_viable_uint_t< size > | alphabet_size = size |
The size of the alphabet, i.e. the number of different values it can take. More... | |
![]() | |
using | char_type = std::conditional_t< std::same_as< char_t, void >, char, char_t > |
The char representation; conditional needed to make semi alphabet definitions legal. More... | |
using | rank_type = detail::min_viable_uint_t< size - 1 > |
The type of the alphabet when represented as a number (e.g. via to_rank()). More... | |
The reduced Murphy amino acid alphabet.
.
The alphabet consists of letters A, B, C, F, G, H, I, K, P, S B represents charged/polar residues (E,D,N,Q). C represents cystein and the species-specific amino acid Selenocysteine. F represents amino acids with large and mainly hydrophobic aromatic side chains (F,W,Y). I represents large hydrophobes (L,V,I,M). K represents long-chain positively charged residues (K,R) and the species-specific amino acid Pyrrolysine. S represents alcohols (S,T) and unknown. A, G, H and P do not represent any other amino acids other than themselves.
This alphabet allows to reduce the aminoacid alphabet size to 10 but is still able to recognize and represent folding of all proteins. Amino acids are grouped together based on similar physical and chemical properties.
Note: Letters which belong in the extended alphabet will be automatically converted. Terminator characters are converted to F, because the most commonly occurring stop codon in higher eukaryotes is UGA 2. This is most similar to a Tryptophan which in this alphabet gets converted to Phenylalanine. Anything unknown is converted to S.
Input Letter | Converts to |
---|---|
D | B1 |
E | B1 |
J | I1 |
L | I1 |
M | I1 |
N | B1 |
O | K1 |
Q | B1 |
R | K1 |
T | S1 |
U | C1 |
V | I1 |
W | F1 |
Y | F1 |
Z | B1 |
X (Unknown) | S1 |
* (Terminator) | F1,2 |
1L. R. Murphy, A. Wallqvist, and R. M. Levy. Simplified amino acid alphabets for protein fold recognition and implications for folding. Protein Eng., 13(3):149–152, Mar 2000.
2Trotta, E. (2016). Selective forces and mutational biases drive stop codon usage in the human genome: a comparison with sense codon usage. BMC Genomics, 17, 366. https://doi.org/10.1186/s12864-016-2692-4
|
stable-apirelated |
|
stable-apirelated |
The seqan3::aa10murphy string literal.
[in] | s | A pointer to the character string to assign. |
[in] | n | The size of the character string to assign. |
You can use this string literal to easily assign to aa10murphy_vector:
|
stable-apirelated |
The seqan3::aa10murphy char literal.
[in] | c | The character to assign. |
You can use this char literal to assign a seqan3::aa10murphy character: