Amino acid dipepetide frequency for Thermacetogenium phaeum (strain ATCC BAA-254 / DSM 26808 / PB)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.734AlaAla: 10.734 ± 0.157
1.186AlaCys: 1.186 ± 0.039
3.976AlaAsp: 3.976 ± 0.065
6.551AlaGlu: 6.551 ± 0.103
3.056AlaPhe: 3.056 ± 0.057
8.878AlaGly: 8.878 ± 0.113
1.263AlaHis: 1.263 ± 0.04
4.586AlaIle: 4.586 ± 0.084
3.367AlaLys: 3.367 ± 0.063
10.164AlaLeu: 10.164 ± 0.118
1.799AlaMet: 1.799 ± 0.055
2.029AlaAsn: 2.029 ± 0.051
3.055AlaPro: 3.055 ± 0.064
2.406AlaGln: 2.406 ± 0.053
6.429AlaArg: 6.429 ± 0.088
4.097AlaSer: 4.097 ± 0.086
3.43AlaThr: 3.43 ± 0.079
8.255AlaVal: 8.255 ± 0.124
0.89AlaTrp: 0.89 ± 0.032
2.292AlaTyr: 2.292 ± 0.053
0.0AlaXaa: 0.0 ± 0.0
Cys
1.031CysAla: 1.031 ± 0.032
0.309CysCys: 0.309 ± 0.022
0.631CysAsp: 0.631 ± 0.029
0.791CysGlu: 0.791 ± 0.031
0.55CysPhe: 0.55 ± 0.027
1.543CysGly: 1.543 ± 0.051
0.321CysHis: 0.321 ± 0.02
0.669CysIle: 0.669 ± 0.03
0.5CysLys: 0.5 ± 0.023
1.513CysLeu: 1.513 ± 0.043
0.281CysMet: 0.281 ± 0.02
0.42CysAsn: 0.42 ± 0.022
0.889CysPro: 0.889 ± 0.04
0.394CysGln: 0.394 ± 0.025
1.248CysArg: 1.248 ± 0.045
0.907CysSer: 0.907 ± 0.03
0.618CysThr: 0.618 ± 0.029
0.759CysVal: 0.759 ± 0.033
0.21CysTrp: 0.21 ± 0.018
0.518CysTyr: 0.518 ± 0.031
0.0CysXaa: 0.0 ± 0.0
Asp
3.81AspAla: 3.81 ± 0.075
0.731AspCys: 0.731 ± 0.029
2.18AspAsp: 2.18 ± 0.054
3.664AspGlu: 3.664 ± 0.068
2.16AspPhe: 2.16 ± 0.054
3.825AspGly: 3.825 ± 0.077
0.777AspHis: 0.777 ± 0.033
3.055AspIle: 3.055 ± 0.063
1.997AspLys: 1.997 ± 0.053
5.726AspLeu: 5.726 ± 0.091
0.93AspMet: 0.93 ± 0.036
1.285AspAsn: 1.285 ± 0.041
2.839AspPro: 2.839 ± 0.06
1.185AspGln: 1.185 ± 0.035
3.29AspArg: 3.29 ± 0.062
2.072AspSer: 2.072 ± 0.05
1.944AspThr: 1.944 ± 0.046
3.872AspVal: 3.872 ± 0.076
0.566AspTrp: 0.566 ± 0.025
1.933AspTyr: 1.933 ± 0.051
0.0AspXaa: 0.0 ± 0.0
Glu
6.857GluAla: 6.857 ± 0.109
0.731GluCys: 0.731 ± 0.034
3.266GluAsp: 3.266 ± 0.077
7.394GluGlu: 7.394 ± 0.125
2.326GluPhe: 2.326 ± 0.057
5.539GluGly: 5.539 ± 0.086
1.193GluHis: 1.193 ± 0.036
5.832GluIle: 5.832 ± 0.091
5.673GluLys: 5.673 ± 0.098
7.74GluLeu: 7.74 ± 0.108
2.124GluMet: 2.124 ± 0.054
2.566GluAsn: 2.566 ± 0.059
2.501GluPro: 2.501 ± 0.055
2.69GluGln: 2.69 ± 0.055
5.833GluArg: 5.833 ± 0.101
2.871GluSer: 2.871 ± 0.062
2.964GluThr: 2.964 ± 0.057
5.865GluVal: 5.865 ± 0.101
0.695GluTrp: 0.695 ± 0.029
2.024GluTyr: 2.024 ± 0.05
0.0GluXaa: 0.0 ± 0.0
Phe
3.042PheAla: 3.042 ± 0.059
0.705PheCys: 0.705 ± 0.029
1.947PheAsp: 1.947 ± 0.047
2.113PheGlu: 2.113 ± 0.049
1.748PhePhe: 1.748 ± 0.057
3.036PheGly: 3.036 ± 0.062
0.71PheHis: 0.71 ± 0.031
2.336PheIle: 2.336 ± 0.057
1.524PheLys: 1.524 ± 0.041
4.608PheLeu: 4.608 ± 0.098
0.748PheMet: 0.748 ± 0.027
1.188PheAsn: 1.188 ± 0.043
1.81PhePro: 1.81 ± 0.051
1.257PheGln: 1.257 ± 0.044
2.484PheArg: 2.484 ± 0.052
2.42PheSer: 2.42 ± 0.052
1.937PheThr: 1.937 ± 0.047
2.445PheVal: 2.445 ± 0.048
0.47PheTrp: 0.47 ± 0.026
1.334PheTyr: 1.334 ± 0.041
0.0PheXaa: 0.0 ± 0.0
Gly
6.542GlyAla: 6.542 ± 0.099
1.442GlyCys: 1.442 ± 0.048
3.811GlyAsp: 3.811 ± 0.08
6.269GlyGlu: 6.269 ± 0.1
3.216GlyPhe: 3.216 ± 0.072
6.576GlyGly: 6.576 ± 0.114
1.361GlyHis: 1.361 ± 0.041
6.001GlyIle: 6.001 ± 0.082
5.104GlyLys: 5.104 ± 0.064
7.979GlyLeu: 7.979 ± 0.11
2.358GlyMet: 2.358 ± 0.052
2.496GlyAsn: 2.496 ± 0.056
2.49GlyPro: 2.49 ± 0.056
2.168GlyGln: 2.168 ± 0.051
5.893GlyArg: 5.893 ± 0.101
4.328GlySer: 4.328 ± 0.083
4.107GlyThr: 4.107 ± 0.074
6.656GlyVal: 6.656 ± 0.102
0.961GlyTrp: 0.961 ± 0.034
2.879GlyTyr: 2.879 ± 0.065
0.0GlyXaa: 0.0 ± 0.0
His
1.122HisAla: 1.122 ± 0.037
0.279HisCys: 0.279 ± 0.02
0.802HisAsp: 0.802 ± 0.034
0.855HisGlu: 0.855 ± 0.031
0.795HisPhe: 0.795 ± 0.028
1.268HisGly: 1.268 ± 0.043
0.48HisHis: 0.48 ± 0.028
0.962HisIle: 0.962 ± 0.039
0.596HisLys: 0.596 ± 0.03
2.266HisLeu: 2.266 ± 0.051
0.288HisMet: 0.288 ± 0.017
0.57HisAsn: 0.57 ± 0.028
1.286HisPro: 1.286 ± 0.042
0.61HisGln: 0.61 ± 0.027
1.209HisArg: 1.209 ± 0.04
0.794HisSer: 0.794 ± 0.033
0.757HisThr: 0.757 ± 0.032
1.034HisVal: 1.034 ± 0.036
0.217HisTrp: 0.217 ± 0.019
0.681HisTyr: 0.681 ± 0.03
0.0HisXaa: 0.0 ± 0.0
Ile
5.926IleAla: 5.926 ± 0.092
0.954IleCys: 0.954 ± 0.035
3.286IleAsp: 3.286 ± 0.071
4.145IleGlu: 4.145 ± 0.084
2.413IlePhe: 2.413 ± 0.058
4.718IleGly: 4.718 ± 0.088
0.95IleHis: 0.95 ± 0.034
3.944IleIle: 3.944 ± 0.075
3.299IleLys: 3.299 ± 0.072
6.103IleLeu: 6.103 ± 0.087
1.239IleMet: 1.239 ± 0.042
2.145IleAsn: 2.145 ± 0.053
3.32IlePro: 3.32 ± 0.066
1.611IleGln: 1.611 ± 0.047
3.974IleArg: 3.974 ± 0.07
3.473IleSer: 3.473 ± 0.067
3.421IleThr: 3.421 ± 0.076
4.373IleVal: 4.373 ± 0.073
0.62IleTrp: 0.62 ± 0.028
1.931IleTyr: 1.931 ± 0.063
0.0IleXaa: 0.0 ± 0.0
Lys
4.354LysAla: 4.354 ± 0.082
0.491LysCys: 0.491 ± 0.03
2.538LysAsp: 2.538 ± 0.062
5.015LysGlu: 5.015 ± 0.092
1.371LysPhe: 1.371 ± 0.046
4.188LysGly: 4.188 ± 0.068
0.801LysHis: 0.801 ± 0.028
3.415LysIle: 3.415 ± 0.075
3.719LysLys: 3.719 ± 0.083
4.454LysLeu: 4.454 ± 0.086
1.338LysMet: 1.338 ± 0.043
2.013LysAsn: 2.013 ± 0.059
2.096LysPro: 2.096 ± 0.055
1.538LysGln: 1.538 ± 0.047
3.596LysArg: 3.596 ± 0.067
2.428LysSer: 2.428 ± 0.052
2.496LysThr: 2.496 ± 0.055
3.749LysVal: 3.749 ± 0.072
0.492LysTrp: 0.492 ± 0.026
1.533LysTyr: 1.533 ± 0.046
0.0LysXaa: 0.0 ± 0.0
Leu
10.626LeuAla: 10.626 ± 0.138
1.402LeuCys: 1.402 ± 0.045
5.17LeuAsp: 5.17 ± 0.083
8.3LeuGlu: 8.3 ± 0.1
4.117LeuPhe: 4.117 ± 0.093
8.139LeuGly: 8.139 ± 0.124
1.717LeuHis: 1.717 ± 0.05
6.114LeuIle: 6.114 ± 0.101
6.153LeuLys: 6.153 ± 0.108
12.104LeuLeu: 12.104 ± 0.17
2.176LeuMet: 2.176 ± 0.058
3.37LeuAsn: 3.37 ± 0.064
5.329LeuPro: 5.329 ± 0.096
3.739LeuGln: 3.739 ± 0.073
7.062LeuArg: 7.062 ± 0.1
6.128LeuSer: 6.128 ± 0.113
5.18LeuThr: 5.18 ± 0.076
8.143LeuVal: 8.143 ± 0.097
0.957LeuTrp: 0.957 ± 0.036
2.767LeuTyr: 2.767 ± 0.059
0.0LeuXaa: 0.0 ± 0.0
Met
2.255MetAla: 2.255 ± 0.05
0.234MetCys: 0.234 ± 0.016
1.052MetAsp: 1.052 ± 0.037
1.716MetGlu: 1.716 ± 0.047
0.569MetPhe: 0.569 ± 0.029
1.88MetGly: 1.88 ± 0.048
0.329MetHis: 0.329 ± 0.022
1.332MetIle: 1.332 ± 0.043
1.389MetLys: 1.389 ± 0.046
2.321MetLeu: 2.321 ± 0.058
0.542MetMet: 0.542 ± 0.028
0.772MetAsn: 0.772 ± 0.029
1.117MetPro: 1.117 ± 0.036
0.765MetGln: 0.765 ± 0.033
1.468MetArg: 1.468 ± 0.048
1.223MetSer: 1.223 ± 0.039
1.023MetThr: 1.023 ± 0.037
1.705MetVal: 1.705 ± 0.047
0.16MetTrp: 0.16 ± 0.013
0.42MetTyr: 0.42 ± 0.022
0.0MetXaa: 0.0 ± 0.0
Asn
2.312AsnAla: 2.312 ± 0.053
0.471AsnCys: 0.471 ± 0.022
1.168AsnAsp: 1.168 ± 0.037
1.706AsnGlu: 1.706 ± 0.043
1.17AsnPhe: 1.17 ± 0.042
2.363AsnGly: 2.363 ± 0.058
0.526AsnHis: 0.526 ± 0.028
2.177AsnIle: 2.177 ± 0.05
1.504AsnLys: 1.504 ± 0.048
3.487AsnLeu: 3.487 ± 0.065
0.649AsnMet: 0.649 ± 0.027
1.078AsnAsn: 1.078 ± 0.046
2.048AsnPro: 2.048 ± 0.052
0.858AsnGln: 0.858 ± 0.036
2.278AsnArg: 2.278 ± 0.062
1.524AsnSer: 1.524 ± 0.045
1.482AsnThr: 1.482 ± 0.044
2.072AsnVal: 2.072 ± 0.047
0.398AsnTrp: 0.398 ± 0.021
1.14AsnTyr: 1.14 ± 0.043
0.0AsnXaa: 0.0 ± 0.0
Pro
3.925ProAla: 3.925 ± 0.079
0.566ProCys: 0.566 ± 0.026
2.776ProAsp: 2.776 ± 0.061
4.656ProGlu: 4.656 ± 0.079
1.932ProPhe: 1.932 ± 0.054
4.663ProGly: 4.663 ± 0.088
0.924ProHis: 0.924 ± 0.035
1.814ProIle: 1.814 ± 0.053
1.533ProLys: 1.533 ± 0.049
4.795ProLeu: 4.795 ± 0.078
0.699ProMet: 0.699 ± 0.031
1.13ProAsn: 1.13 ± 0.045
2.304ProPro: 2.304 ± 0.058
1.625ProGln: 1.625 ± 0.048
2.522ProArg: 2.522 ± 0.056
2.039ProSer: 2.039 ± 0.047
1.713ProThr: 1.713 ± 0.052
4.365ProVal: 4.365 ± 0.079
0.523ProTrp: 0.523 ± 0.027
1.408ProTyr: 1.408 ± 0.044
0.0ProXaa: 0.0 ± 0.0
Gln
2.748GlnAla: 2.748 ± 0.059
0.298GlnCys: 0.298 ± 0.023
1.407GlnAsp: 1.407 ± 0.041
3.007GlnGlu: 3.007 ± 0.063
0.898GlnPhe: 0.898 ± 0.042
2.485GlnGly: 2.485 ± 0.055
0.5GlnHis: 0.5 ± 0.024
1.948GlnIle: 1.948 ± 0.058
2.02GlnLys: 2.02 ± 0.05
3.056GlnLeu: 3.056 ± 0.066
0.806GlnMet: 0.806 ± 0.034
1.012GlnAsn: 1.012 ± 0.036
1.233GlnPro: 1.233 ± 0.043
1.324GlnGln: 1.324 ± 0.044
2.018GlnArg: 2.018 ± 0.05
1.252GlnSer: 1.252 ± 0.04
1.248GlnThr: 1.248 ± 0.04
2.68GlnVal: 2.68 ± 0.054
0.26GlnTrp: 0.26 ± 0.018
0.789GlnTyr: 0.789 ± 0.033
0.0GlnXaa: 0.0 ± 0.0
Arg
5.136ArgAla: 5.136 ± 0.092
1.05ArgCys: 1.05 ± 0.044
3.324ArgAsp: 3.324 ± 0.065
6.492ArgGlu: 6.492 ± 0.12
2.609ArgPhe: 2.609 ± 0.056
4.861ArgGly: 4.861 ± 0.077
1.179ArgHis: 1.179 ± 0.041
4.391ArgIle: 4.391 ± 0.07
3.698ArgLys: 3.698 ± 0.078
7.712ArgLeu: 7.712 ± 0.117
1.809ArgMet: 1.809 ± 0.042
1.978ArgAsn: 1.978 ± 0.051
2.817ArgPro: 2.817 ± 0.063
2.562ArgGln: 2.562 ± 0.064
5.493ArgArg: 5.493 ± 0.092
3.317ArgSer: 3.317 ± 0.069
2.704ArgThr: 2.704 ± 0.057
5.233ArgVal: 5.233 ± 0.081
0.754ArgTrp: 0.754 ± 0.035
2.29ArgTyr: 2.29 ± 0.055
0.0ArgXaa: 0.0 ± 0.0
Ser
3.674SerAla: 3.674 ± 0.08
0.865SerCys: 0.865 ± 0.035
2.263SerAsp: 2.263 ± 0.057
3.058SerGlu: 3.058 ± 0.067
2.461SerPhe: 2.461 ± 0.058
4.851SerGly: 4.851 ± 0.085
0.939SerHis: 0.939 ± 0.03
2.885SerIle: 2.885 ± 0.061
1.947SerLys: 1.947 ± 0.052
6.313SerLeu: 6.313 ± 0.11
1.01SerMet: 1.01 ± 0.036
1.371SerAsn: 1.371 ± 0.043
2.496SerPro: 2.496 ± 0.057
1.462SerGln: 1.462 ± 0.046
3.762SerArg: 3.762 ± 0.072
2.811SerSer: 2.811 ± 0.073
2.191SerThr: 2.191 ± 0.052
3.378SerVal: 3.378 ± 0.068
0.694SerTrp: 0.694 ± 0.03
1.712SerTyr: 1.712 ± 0.049
0.0SerXaa: 0.0 ± 0.0
Thr
4.408ThrAla: 4.408 ± 0.085
0.641ThrCys: 0.641 ± 0.029
2.103ThrAsp: 2.103 ± 0.055
2.678ThrGlu: 2.678 ± 0.068
1.708ThrPhe: 1.708 ± 0.052
5.035ThrGly: 5.035 ± 0.08
0.78ThrHis: 0.78 ± 0.03
2.682ThrIle: 2.682 ± 0.065
1.86ThrLys: 1.86 ± 0.057
4.586ThrLeu: 4.586 ± 0.077
0.912ThrMet: 0.912 ± 0.036
1.215ThrAsn: 1.215 ± 0.042
2.639ThrPro: 2.639 ± 0.067
1.049ThrGln: 1.049 ± 0.04
2.546ThrArg: 2.546 ± 0.052
2.266ThrSer: 2.266 ± 0.052
2.301ThrThr: 2.301 ± 0.059
4.048ThrVal: 4.048 ± 0.08
0.458ThrTrp: 0.458 ± 0.026
1.241ThrTyr: 1.241 ± 0.042
0.0ThrXaa: 0.0 ± 0.0
Val
6.915ValAla: 6.915 ± 0.102
1.051ValCys: 1.051 ± 0.04
4.08ValAsp: 4.08 ± 0.077
5.478ValGlu: 5.478 ± 0.091
3.082ValPhe: 3.082 ± 0.072
5.361ValGly: 5.361 ± 0.103
1.227ValHis: 1.227 ± 0.041
5.502ValIle: 5.502 ± 0.087
4.074ValLys: 4.074 ± 0.084
8.584ValLeu: 8.584 ± 0.126
1.836ValMet: 1.836 ± 0.051
2.404ValAsn: 2.404 ± 0.055
3.581ValPro: 3.581 ± 0.068
2.154ValGln: 2.154 ± 0.054
4.981ValArg: 4.981 ± 0.089
4.109ValSer: 4.109 ± 0.075
3.792ValThr: 3.792 ± 0.064
6.656ValVal: 6.656 ± 0.107
0.715ValTrp: 0.715 ± 0.028
2.2ValTyr: 2.2 ± 0.054
0.0ValXaa: 0.0 ± 0.0
Trp
0.727TrpAla: 0.727 ± 0.03
0.143TrpCys: 0.143 ± 0.012
0.594TrpAsp: 0.594 ± 0.027
0.955TrpGlu: 0.955 ± 0.037
0.398TrpPhe: 0.398 ± 0.023
0.848TrpGly: 0.848 ± 0.037
0.224TrpHis: 0.224 ± 0.016
0.56TrpIle: 0.56 ± 0.029
0.544TrpLys: 0.544 ± 0.026
1.229TrpLeu: 1.229 ± 0.044
0.272TrpMet: 0.272 ± 0.018
0.379TrpAsn: 0.379 ± 0.022
0.433TrpPro: 0.433 ± 0.021
0.438TrpGln: 0.438 ± 0.025
0.751TrpArg: 0.751 ± 0.033
0.506TrpSer: 0.506 ± 0.025
0.396TrpThr: 0.396 ± 0.022
0.669TrpVal: 0.669 ± 0.033
0.164TrpTrp: 0.164 ± 0.016
0.348TrpTyr: 0.348 ± 0.023
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.298TyrAla: 2.298 ± 0.05
0.553TyrCys: 0.553 ± 0.025
1.466TyrAsp: 1.466 ± 0.048
1.845TyrGlu: 1.845 ± 0.047
1.282TyrPhe: 1.282 ± 0.04
2.578TyrGly: 2.578 ± 0.062
0.671TyrHis: 0.671 ± 0.027
1.642TyrIle: 1.642 ± 0.049
1.125TyrLys: 1.125 ± 0.039
3.953TyrLeu: 3.953 ± 0.072
0.453TyrMet: 0.453 ± 0.026
0.991TyrAsn: 0.991 ± 0.038
1.629TyrPro: 1.629 ± 0.049
1.152TyrGln: 1.152 ± 0.038
2.575TyrArg: 2.575 ± 0.06
1.58TyrSer: 1.58 ± 0.049
1.451TyrThr: 1.451 ± 0.048
1.823TyrVal: 1.823 ± 0.045
0.374TyrTrp: 0.374 ± 0.024
1.308TyrTyr: 1.308 ± 0.041
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2803 proteins (812553 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski