Amino acid dipepetide frequency for Nitrososphaera viennensis EN76

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.37AlaAla: 10.37 ± 0.158
0.856AlaCys: 0.856 ± 0.039
4.606AlaAsp: 4.606 ± 0.1
5.41AlaGlu: 5.41 ± 0.102
3.49AlaPhe: 3.49 ± 0.073
8.321AlaGly: 8.321 ± 0.119
1.532AlaHis: 1.532 ± 0.043
6.456AlaIle: 6.456 ± 0.117
6.06AlaLys: 6.06 ± 0.112
8.299AlaLeu: 8.299 ± 0.146
2.641AlaMet: 2.641 ± 0.064
2.939AlaAsn: 2.939 ± 0.068
3.11AlaPro: 3.11 ± 0.081
2.85AlaGln: 2.85 ± 0.066
5.3AlaArg: 5.3 ± 0.095
6.356AlaSer: 6.356 ± 0.103
5.069AlaThr: 5.069 ± 0.13
7.017AlaVal: 7.017 ± 0.123
0.834AlaTrp: 0.834 ± 0.041
2.54AlaTyr: 2.54 ± 0.061
0.0AlaXaa: 0.0 ± 0.0
Cys
0.739CysAla: 0.739 ± 0.041
0.156CysCys: 0.156 ± 0.014
0.594CysAsp: 0.594 ± 0.025
0.528CysGlu: 0.528 ± 0.028
0.311CysPhe: 0.311 ± 0.021
1.031CysGly: 1.031 ± 0.04
0.236CysHis: 0.236 ± 0.017
0.576CysIle: 0.576 ± 0.03
0.616CysLys: 0.616 ± 0.03
0.662CysLeu: 0.662 ± 0.032
0.299CysMet: 0.299 ± 0.021
0.432CysAsn: 0.432 ± 0.024
0.603CysPro: 0.603 ± 0.032
0.255CysGln: 0.255 ± 0.018
0.635CysArg: 0.635 ± 0.028
0.643CysSer: 0.643 ± 0.031
0.55CysThr: 0.55 ± 0.031
0.58CysVal: 0.58 ± 0.029
0.095CysTrp: 0.095 ± 0.011
0.309CysTyr: 0.309 ± 0.019
0.0CysXaa: 0.0 ± 0.0
Asp
4.915AspAla: 4.915 ± 0.103
0.501AspCys: 0.501 ± 0.028
3.47AspAsp: 3.47 ± 0.112
3.56AspGlu: 3.56 ± 0.078
2.237AspPhe: 2.237 ± 0.054
4.339AspGly: 4.339 ± 0.095
0.953AspHis: 0.953 ± 0.037
3.864AspIle: 3.864 ± 0.069
3.149AspLys: 3.149 ± 0.067
4.613AspLeu: 4.613 ± 0.086
1.548AspMet: 1.548 ± 0.054
1.964AspAsn: 1.964 ± 0.061
2.481AspPro: 2.481 ± 0.055
1.233AspGln: 1.233 ± 0.041
3.171AspArg: 3.171 ± 0.077
3.718AspSer: 3.718 ± 0.101
2.644AspThr: 2.644 ± 0.068
3.974AspVal: 3.974 ± 0.073
0.601AspTrp: 0.601 ± 0.031
2.079AspTyr: 2.079 ± 0.059
0.0AspXaa: 0.0 ± 0.0
Glu
4.919GluAla: 4.919 ± 0.097
0.522GluCys: 0.522 ± 0.026
2.798GluAsp: 2.798 ± 0.07
4.704GluGlu: 4.704 ± 0.113
2.352GluPhe: 2.352 ± 0.06
4.096GluGly: 4.096 ± 0.08
1.249GluHis: 1.249 ± 0.042
4.476GluIle: 4.476 ± 0.085
5.473GluLys: 5.473 ± 0.101
5.548GluLeu: 5.548 ± 0.1
2.027GluMet: 2.027 ± 0.053
2.534GluAsn: 2.534 ± 0.048
2.146GluPro: 2.146 ± 0.057
2.322GluGln: 2.322 ± 0.057
3.72GluArg: 3.72 ± 0.084
3.6GluSer: 3.6 ± 0.081
2.829GluThr: 2.829 ± 0.064
4.161GluVal: 4.161 ± 0.08
0.642GluTrp: 0.642 ± 0.032
1.965GluTyr: 1.965 ± 0.052
0.0GluXaa: 0.0 ± 0.0
Phe
3.833PheAla: 3.833 ± 0.09
0.474PheCys: 0.474 ± 0.028
2.412PheAsp: 2.412 ± 0.059
2.29PheGlu: 2.29 ± 0.058
1.688PhePhe: 1.688 ± 0.058
3.293PheGly: 3.293 ± 0.079
0.682PheHis: 0.682 ± 0.032
2.174PheIle: 2.174 ± 0.058
1.875PheLys: 1.875 ± 0.046
3.176PheLeu: 3.176 ± 0.079
0.971PheMet: 0.971 ± 0.041
1.308PheAsn: 1.308 ± 0.046
1.498PhePro: 1.498 ± 0.047
1.041PheGln: 1.041 ± 0.036
1.772PheArg: 1.772 ± 0.051
3.05PheSer: 3.05 ± 0.066
2.049PheThr: 2.049 ± 0.059
3.269PheVal: 3.269 ± 0.068
0.44PheTrp: 0.44 ± 0.026
1.385PheTyr: 1.385 ± 0.044
0.0PheXaa: 0.0 ± 0.0
Gly
6.421GlyAla: 6.421 ± 0.118
0.666GlyCys: 0.666 ± 0.034
3.723GlyAsp: 3.723 ± 0.08
3.912GlyGlu: 3.912 ± 0.081
3.065GlyPhe: 3.065 ± 0.065
6.052GlyGly: 6.052 ± 0.138
1.359GlyHis: 1.359 ± 0.043
5.609GlyIle: 5.609 ± 0.092
5.343GlyLys: 5.343 ± 0.095
6.153GlyLeu: 6.153 ± 0.123
2.396GlyMet: 2.396 ± 0.053
3.032GlyAsn: 3.032 ± 0.095
2.228GlyPro: 2.228 ± 0.068
2.323GlyGln: 2.323 ± 0.061
4.449GlyArg: 4.449 ± 0.081
5.202GlySer: 5.202 ± 0.11
4.328GlyThr: 4.328 ± 0.092
5.229GlyVal: 5.229 ± 0.095
0.824GlyTrp: 0.824 ± 0.036
2.529GlyTyr: 2.529 ± 0.056
0.0GlyXaa: 0.0 ± 0.0
His
1.611HisAla: 1.611 ± 0.047
0.217HisCys: 0.217 ± 0.019
1.192HisAsp: 1.192 ± 0.037
1.196HisGlu: 1.196 ± 0.034
0.801HisPhe: 0.801 ± 0.033
1.518HisGly: 1.518 ± 0.048
0.524HisHis: 0.524 ± 0.033
1.157HisIle: 1.157 ± 0.036
0.867HisLys: 0.867 ± 0.032
1.536HisLeu: 1.536 ± 0.043
0.62HisMet: 0.62 ± 0.032
0.661HisAsn: 0.661 ± 0.03
1.008HisPro: 1.008 ± 0.038
0.494HisGln: 0.494 ± 0.028
1.024HisArg: 1.024 ± 0.045
1.2HisSer: 1.2 ± 0.042
0.891HisThr: 0.891 ± 0.035
1.43HisVal: 1.43 ± 0.038
0.232HisTrp: 0.232 ± 0.019
0.677HisTyr: 0.677 ± 0.029
0.0HisXaa: 0.0 ± 0.0
Ile
6.481IleAla: 6.481 ± 0.104
0.564IleCys: 0.564 ± 0.028
3.997IleAsp: 3.997 ± 0.067
4.409IleGlu: 4.409 ± 0.08
2.265IlePhe: 2.265 ± 0.069
4.654IleGly: 4.654 ± 0.099
1.109IleHis: 1.109 ± 0.04
4.294IleIle: 4.294 ± 0.077
3.803IleLys: 3.803 ± 0.079
5.3IleLeu: 5.3 ± 0.096
1.705IleMet: 1.705 ± 0.049
2.197IleAsn: 2.197 ± 0.058
2.932IlePro: 2.932 ± 0.069
1.639IleGln: 1.639 ± 0.045
3.28IleArg: 3.28 ± 0.066
4.38IleSer: 4.38 ± 0.1
3.683IleThr: 3.683 ± 0.078
5.261IleVal: 5.261 ± 0.089
0.488IleTrp: 0.488 ± 0.029
1.781IleTyr: 1.781 ± 0.047
0.0IleXaa: 0.0 ± 0.0
Lys
5.64LysAla: 5.64 ± 0.095
0.668LysCys: 0.668 ± 0.034
3.509LysAsp: 3.509 ± 0.072
5.081LysGlu: 5.081 ± 0.098
2.135LysPhe: 2.135 ± 0.06
4.29LysGly: 4.29 ± 0.084
1.208LysHis: 1.208 ± 0.039
4.312LysIle: 4.312 ± 0.084
5.738LysLys: 5.738 ± 0.124
4.9LysLeu: 4.9 ± 0.095
2.26LysMet: 2.26 ± 0.059
2.623LysAsn: 2.623 ± 0.064
2.474LysPro: 2.474 ± 0.066
2.075LysGln: 2.075 ± 0.053
3.323LysArg: 3.323 ± 0.073
3.841LysSer: 3.841 ± 0.071
3.201LysThr: 3.201 ± 0.074
4.963LysVal: 4.963 ± 0.091
0.613LysTrp: 0.613 ± 0.027
1.984LysTyr: 1.984 ± 0.052
0.0LysXaa: 0.0 ± 0.0
Leu
9.804LeuAla: 9.804 ± 0.134
0.765LeuCys: 0.765 ± 0.034
5.014LeuAsp: 5.014 ± 0.093
5.547LeuGlu: 5.547 ± 0.104
3.165LeuPhe: 3.165 ± 0.079
6.16LeuGly: 6.16 ± 0.112
1.599LeuHis: 1.599 ± 0.044
4.063LeuIle: 4.063 ± 0.073
5.603LeuLys: 5.603 ± 0.099
7.722LeuLeu: 7.722 ± 0.131
2.224LeuMet: 2.224 ± 0.057
2.656LeuAsn: 2.656 ± 0.062
3.731LeuPro: 3.731 ± 0.066
2.787LeuGln: 2.787 ± 0.059
4.291LeuArg: 4.291 ± 0.081
5.959LeuSer: 5.959 ± 0.092
4.159LeuThr: 4.159 ± 0.091
7.065LeuVal: 7.065 ± 0.106
0.734LeuTrp: 0.734 ± 0.034
2.648LeuTyr: 2.648 ± 0.061
0.0LeuXaa: 0.0 ± 0.0
Met
2.752MetAla: 2.752 ± 0.056
0.252MetCys: 0.252 ± 0.018
1.414MetAsp: 1.414 ± 0.041
1.61MetGlu: 1.61 ± 0.046
0.982MetPhe: 0.982 ± 0.041
1.806MetGly: 1.806 ± 0.045
0.654MetHis: 0.654 ± 0.03
1.766MetIle: 1.766 ± 0.058
1.886MetLys: 1.886 ± 0.049
2.822MetLeu: 2.822 ± 0.059
1.059MetMet: 1.059 ± 0.044
1.012MetAsn: 1.012 ± 0.042
1.585MetPro: 1.585 ± 0.046
1.249MetGln: 1.249 ± 0.046
1.487MetArg: 1.487 ± 0.046
2.134MetSer: 2.134 ± 0.046
1.679MetThr: 1.679 ± 0.05
2.186MetVal: 2.186 ± 0.061
0.266MetTrp: 0.266 ± 0.021
0.809MetTyr: 0.809 ± 0.037
0.0MetXaa: 0.0 ± 0.0
Asn
3.276AsnAla: 3.276 ± 0.081
0.439AsnCys: 0.439 ± 0.025
1.894AsnAsp: 1.894 ± 0.059
2.075AsnGlu: 2.075 ± 0.057
1.447AsnPhe: 1.447 ± 0.049
2.767AsnGly: 2.767 ± 0.081
0.691AsnHis: 0.691 ± 0.029
2.346AsnIle: 2.346 ± 0.058
2.017AsnLys: 2.017 ± 0.059
2.97AsnLeu: 2.97 ± 0.067
1.038AsnMet: 1.038 ± 0.036
2.2AsnAsn: 2.2 ± 0.071
2.108AsnPro: 2.108 ± 0.069
0.945AsnGln: 0.945 ± 0.034
1.883AsnArg: 1.883 ± 0.048
2.449AsnSer: 2.449 ± 0.061
1.901AsnThr: 1.901 ± 0.057
2.85AsnVal: 2.85 ± 0.071
0.325AsnTrp: 0.325 ± 0.021
1.167AsnTyr: 1.167 ± 0.039
0.0AsnXaa: 0.0 ± 0.0
Pro
4.371ProAla: 4.371 ± 0.077
0.347ProCys: 0.347 ± 0.022
2.66ProAsp: 2.66 ± 0.064
2.914ProGlu: 2.914 ± 0.063
1.714ProPhe: 1.714 ± 0.045
2.892ProGly: 2.892 ± 0.073
0.789ProHis: 0.789 ± 0.032
2.065ProIle: 2.065 ± 0.056
2.315ProLys: 2.315 ± 0.059
3.428ProLeu: 3.428 ± 0.075
0.939ProMet: 0.939 ± 0.04
1.262ProAsn: 1.262 ± 0.041
1.901ProPro: 1.901 ± 0.067
1.43ProGln: 1.43 ± 0.049
1.905ProArg: 1.905 ± 0.049
2.92ProSer: 2.92 ± 0.066
2.057ProThr: 2.057 ± 0.063
3.408ProVal: 3.408 ± 0.078
0.447ProTrp: 0.447 ± 0.024
1.362ProTyr: 1.362 ± 0.047
0.0ProXaa: 0.0 ± 0.0
Gln
2.83GlnAla: 2.83 ± 0.069
0.248GlnCys: 0.248 ± 0.017
1.742GlnAsp: 1.742 ± 0.057
2.123GlnGlu: 2.123 ± 0.051
1.185GlnPhe: 1.185 ± 0.041
2.038GlnGly: 2.038 ± 0.051
0.647GlnHis: 0.647 ± 0.028
1.891GlnIle: 1.891 ± 0.045
2.342GlnLys: 2.342 ± 0.063
2.603GlnLeu: 2.603 ± 0.055
0.976GlnMet: 0.976 ± 0.035
1.255GlnAsn: 1.255 ± 0.046
1.16GlnPro: 1.16 ± 0.041
1.881GlnGln: 1.881 ± 0.082
1.57GlnArg: 1.57 ± 0.048
1.894GlnSer: 1.894 ± 0.055
1.546GlnThr: 1.546 ± 0.051
2.544GlnVal: 2.544 ± 0.057
0.347GlnTrp: 0.347 ± 0.023
1.059GlnTyr: 1.059 ± 0.041
0.0GlnXaa: 0.0 ± 0.0
Arg
4.307ArgAla: 4.307 ± 0.079
0.58ArgCys: 0.58 ± 0.034
2.902ArgAsp: 2.902 ± 0.073
3.413ArgGlu: 3.413 ± 0.071
2.163ArgPhe: 2.163 ± 0.053
3.253ArgGly: 3.253 ± 0.068
1.148ArgHis: 1.148 ± 0.039
3.762ArgIle: 3.762 ± 0.071
3.77ArgLys: 3.77 ± 0.082
5.18ArgLeu: 5.18 ± 0.091
1.902ArgMet: 1.902 ± 0.052
2.076ArgAsn: 2.076 ± 0.057
2.036ArgPro: 2.036 ± 0.051
2.086ArgGln: 2.086 ± 0.06
3.386ArgArg: 3.386 ± 0.075
3.017ArgSer: 3.017 ± 0.107
2.628ArgThr: 2.628 ± 0.067
3.745ArgVal: 3.745 ± 0.069
0.586ArgTrp: 0.586 ± 0.031
1.901ArgTyr: 1.901 ± 0.061
0.0ArgXaa: 0.0 ± 0.0
Ser
6.112SerAla: 6.112 ± 0.13
0.683SerCys: 0.683 ± 0.032
3.508SerAsp: 3.508 ± 0.106
3.771SerGlu: 3.771 ± 0.068
2.68SerPhe: 2.68 ± 0.074
5.688SerGly: 5.688 ± 0.111
1.196SerHis: 1.196 ± 0.039
4.399SerIle: 4.399 ± 0.097
4.028SerLys: 4.028 ± 0.081
5.965SerLeu: 5.965 ± 0.098
2.102SerMet: 2.102 ± 0.052
2.377SerAsn: 2.377 ± 0.07
2.724SerPro: 2.724 ± 0.082
2.078SerGln: 2.078 ± 0.05
3.509SerArg: 3.509 ± 0.091
5.761SerSer: 5.761 ± 0.144
3.711SerThr: 3.711 ± 0.092
4.787SerVal: 4.787 ± 0.117
0.706SerTrp: 0.706 ± 0.029
2.116SerTyr: 2.116 ± 0.059
0.0SerXaa: 0.0 ± 0.0
Thr
4.855ThrAla: 4.855 ± 0.106
0.506ThrCys: 0.506 ± 0.027
2.636ThrAsp: 2.636 ± 0.078
2.64ThrGlu: 2.64 ± 0.058
2.257ThrPhe: 2.257 ± 0.059
4.542ThrGly: 4.542 ± 0.089
0.93ThrHis: 0.93 ± 0.035
3.792ThrIle: 3.792 ± 0.078
3.003ThrLys: 3.003 ± 0.064
4.601ThrLeu: 4.601 ± 0.095
1.467ThrMet: 1.467 ± 0.049
1.824ThrAsn: 1.824 ± 0.056
2.452ThrPro: 2.452 ± 0.075
1.366ThrGln: 1.366 ± 0.045
2.471ThrArg: 2.471 ± 0.061
3.461ThrSer: 3.461 ± 0.09
3.463ThrThr: 3.463 ± 0.1
4.381ThrVal: 4.381 ± 0.114
0.512ThrTrp: 0.512 ± 0.029
1.744ThrTyr: 1.744 ± 0.053
0.0ThrXaa: 0.0 ± 0.0
Val
7.516ValAla: 7.516 ± 0.115
0.828ValCys: 0.828 ± 0.034
4.264ValAsp: 4.264 ± 0.093
4.65ValGlu: 4.65 ± 0.083
2.918ValPhe: 2.918 ± 0.072
4.912ValGly: 4.912 ± 0.1
1.395ValHis: 1.395 ± 0.047
4.909ValIle: 4.909 ± 0.087
4.674ValLys: 4.674 ± 0.087
6.515ValLeu: 6.515 ± 0.111
2.097ValMet: 2.097 ± 0.051
2.691ValAsn: 2.691 ± 0.07
3.337ValPro: 3.337 ± 0.075
2.485ValGln: 2.485 ± 0.062
4.11ValArg: 4.11 ± 0.079
5.357ValSer: 5.357 ± 0.125
4.284ValThr: 4.284 ± 0.106
6.122ValVal: 6.122 ± 0.094
0.679ValTrp: 0.679 ± 0.03
2.383ValTyr: 2.383 ± 0.067
0.0ValXaa: 0.0 ± 0.0
Trp
0.675TrpAla: 0.675 ± 0.029
0.11TrpCys: 0.11 ± 0.012
0.539TrpAsp: 0.539 ± 0.035
0.451TrpGlu: 0.451 ± 0.024
0.422TrpPhe: 0.422 ± 0.024
0.647TrpGly: 0.647 ± 0.03
0.269TrpHis: 0.269 ± 0.022
0.654TrpIle: 0.654 ± 0.03
0.731TrpLys: 0.731 ± 0.031
0.938TrpLeu: 0.938 ± 0.039
0.341TrpMet: 0.341 ± 0.023
0.443TrpAsn: 0.443 ± 0.028
0.317TrpPro: 0.317 ± 0.021
0.42TrpGln: 0.42 ± 0.026
0.543TrpArg: 0.543 ± 0.028
0.683TrpSer: 0.683 ± 0.036
0.565TrpThr: 0.565 ± 0.03
0.684TrpVal: 0.684 ± 0.031
0.149TrpTrp: 0.149 ± 0.016
0.314TrpTyr: 0.314 ± 0.02
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.63TyrAla: 2.63 ± 0.058
0.468TyrCys: 0.468 ± 0.027
2.139TyrAsp: 2.139 ± 0.053
1.799TyrGlu: 1.799 ± 0.056
1.402TyrPhe: 1.402 ± 0.043
2.542TyrGly: 2.542 ± 0.073
0.687TyrHis: 0.687 ± 0.032
1.657TyrIle: 1.657 ± 0.059
1.547TyrLys: 1.547 ± 0.045
2.763TyrLeu: 2.763 ± 0.067
0.795TyrMet: 0.795 ± 0.033
1.303TyrAsn: 1.303 ± 0.043
1.295TyrPro: 1.295 ± 0.046
0.924TyrGln: 0.924 ± 0.038
2.01TyrArg: 2.01 ± 0.053
2.278TyrSer: 2.278 ± 0.054
1.668TyrThr: 1.668 ± 0.055
2.468TyrVal: 2.468 ± 0.06
0.377TyrTrp: 0.377 ± 0.025
1.24TyrTyr: 1.24 ± 0.049
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3117 proteins (729208 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski