SeqAn3 3.4.0-rc.4
The Modern C++ library for sequence analysis.
|
The canonical amino acid alphabet. More...
#include <seqan3/alphabet/aminoacid/aa20.hpp>
Public Member Functions | |
Constructors, destructor and assignment | |
constexpr | aa20 () noexcept=default |
Defaulted. | |
constexpr | aa20 (aa20 const &) noexcept=default |
Defaulted. | |
constexpr | aa20 (aa20 &&) noexcept=default |
Defaulted. | |
constexpr aa20 & | operator= (aa20 const &) noexcept=default |
Defaulted. | |
constexpr aa20 & | operator= (aa20 &&) noexcept=default |
Defaulted. | |
~aa20 () noexcept=default | |
Defaulted. | |
![]() | |
constexpr | aminoacid_base (other_aa_type const other) noexcept |
Allow explicit construction from any other aminoacid type and convert via the character representation. | |
![]() | |
constexpr | alphabet_base () noexcept=default |
Defaulted. | |
constexpr | alphabet_base (alphabet_base const &) noexcept=default |
Defaulted. | |
constexpr | alphabet_base (alphabet_base &&) noexcept=default |
Defaulted. | |
constexpr alphabet_base & | operator= (alphabet_base const &) noexcept=default |
Defaulted. | |
constexpr alphabet_base & | operator= (alphabet_base &&) noexcept=default |
Defaulted. | |
~alphabet_base () noexcept=default | |
Defaulted. | |
constexpr char_type | to_char () const noexcept |
Return the letter as a character of char_type. | |
constexpr rank_type | to_rank () const noexcept |
Return the letter's numeric value (rank in the alphabet). | |
constexpr derived_type & | assign_char (char_type const chr) noexcept |
Assign from a character, implicitly converts invalid characters. | |
constexpr derived_type & | assign_rank (rank_type const c) noexcept |
Assign from a numeric value. | |
Related Symbols | |
(Note that these are not member symbols.) | |
using | aa20_vector = std::vector< aa20 > |
Alias for a std::vector of seqan3::aa20. | |
Additional Inherited Members | |
![]() | |
static constexpr bool | char_is_valid (char_type const c) noexcept |
Validate whether a character value has a one-to-one mapping to an alphabet value. | |
![]() | |
static constexpr detail::min_viable_uint_t< size > | alphabet_size = size |
The size of the alphabet, i.e. the number of different values it can take. | |
![]() | |
using | char_type = std::conditional_t< std::same_as< char_t, void >, char, char_t > |
The char representation; conditional needed to make semi alphabet definitions legal. | |
using | rank_type = detail::min_viable_uint_t< size - 1 > |
The type of the alphabet when represented as a number (e.g. via to_rank()). | |
The canonical amino acid alphabet.
The alphabet consists of letters A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W, Y
The alphabet may be brace initialized from the static letter members (see above). Note that you cannot assign regular characters, but additional functions for this are available.
Note: Letters which belong in the extended alphabet will be automatically converted based on the frequency of their options.
Terminator characters are converted to W, because the most commonly occurring stop codon in higher eukaryotes is UGA2. Anything unknown is converted to S, because it occurs most frequently across 53 vertebrates1.
Input Letter | Converts to |
---|---|
B | D1 |
J | L1 |
O | L1 |
U | C1 |
Z | E1 |
X (Unknown) | S1 |
* (Terminator) | W2 |
1King, J. L., & Jukes, T. H. (1969). Non-Darwinian Evolution. Science, 164(3881), 788-798. doi:10.1126/science.164.3881.788
2Trotta, E. (2016). Selective forces and mutational biases drive stop codon usage in the human genome: a comparison with sense codon usage. BMC Genomics, 17, 366. https://doi.org/10.1186/s12864-016-2692-4
|
related |