Amino acid dipepetide frequency for Nitratiruptor sp. (strain SB155-2)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.976AlaAla: 4.976 ± 0.143
0.635AlaCys: 0.635 ± 0.037
2.928AlaAsp: 2.928 ± 0.078
3.115AlaGlu: 3.115 ± 0.087
3.257AlaPhe: 3.257 ± 0.088
4.127AlaGly: 4.127 ± 0.108
1.412AlaHis: 1.412 ± 0.047
6.558AlaIle: 6.558 ± 0.093
7.629AlaLys: 7.629 ± 0.117
8.331AlaLeu: 8.331 ± 0.141
1.971AlaMet: 1.971 ± 0.065
3.003AlaAsn: 3.003 ± 0.087
2.061AlaPro: 2.061 ± 0.066
2.598AlaGln: 2.598 ± 0.07
2.558AlaArg: 2.558 ± 0.061
3.766AlaSer: 3.766 ± 0.095
3.342AlaThr: 3.342 ± 0.1
4.381AlaVal: 4.381 ± 0.089
0.491AlaTrp: 0.491 ± 0.031
2.693AlaTyr: 2.693 ± 0.067
0.0AlaXaa: 0.0 ± 0.0
Cys
0.548CysAla: 0.548 ± 0.03
0.09CysCys: 0.09 ± 0.013
0.625CysAsp: 0.625 ± 0.033
0.743CysGlu: 0.743 ± 0.04
0.325CysPhe: 0.325 ± 0.023
0.734CysGly: 0.734 ± 0.036
0.33CysHis: 0.33 ± 0.034
0.608CysIle: 0.608 ± 0.036
0.702CysLys: 0.702 ± 0.04
0.523CysLeu: 0.523 ± 0.03
0.177CysMet: 0.177 ± 0.018
0.393CysAsn: 0.393 ± 0.025
0.446CysPro: 0.446 ± 0.038
0.296CysGln: 0.296 ± 0.024
0.31CysArg: 0.31 ± 0.026
0.588CysSer: 0.588 ± 0.032
0.438CysThr: 0.438 ± 0.03
0.455CysVal: 0.455 ± 0.024
0.063CysTrp: 0.063 ± 0.01
0.351CysTyr: 0.351 ± 0.023
0.0CysXaa: 0.0 ± 0.0
Asp
3.541AspAla: 3.541 ± 0.078
0.4AspCys: 0.4 ± 0.027
2.213AspAsp: 2.213 ± 0.085
4.599AspGlu: 4.599 ± 0.097
3.199AspPhe: 3.199 ± 0.076
3.144AspGly: 3.144 ± 0.098
0.843AspHis: 0.843 ± 0.037
5.534AspIle: 5.534 ± 0.091
3.899AspLys: 3.899 ± 0.088
5.391AspLeu: 5.391 ± 0.11
1.322AspMet: 1.322 ± 0.045
1.908AspAsn: 1.908 ± 0.07
2.34AspPro: 2.34 ± 0.063
1.506AspGln: 1.506 ± 0.059
2.254AspArg: 2.254 ± 0.057
2.407AspSer: 2.407 ± 0.067
2.816AspThr: 2.816 ± 0.082
3.429AspVal: 3.429 ± 0.079
0.434AspTrp: 0.434 ± 0.024
2.071AspTyr: 2.071 ± 0.064
0.0AspXaa: 0.0 ± 0.0
Glu
5.841GluAla: 5.841 ± 0.121
0.634GluCys: 0.634 ± 0.039
3.58GluAsp: 3.58 ± 0.084
6.788GluGlu: 6.788 ± 0.151
3.259GluPhe: 3.259 ± 0.094
3.846GluGly: 3.846 ± 0.099
1.596GluHis: 1.596 ± 0.052
6.556GluIle: 6.556 ± 0.11
7.5GluLys: 7.5 ± 0.128
7.389GluLeu: 7.389 ± 0.123
1.758GluMet: 1.758 ± 0.057
3.596GluAsn: 3.596 ± 0.082
2.133GluPro: 2.133 ± 0.074
2.339GluGln: 2.339 ± 0.065
3.02GluArg: 3.02 ± 0.079
3.379GluSer: 3.379 ± 0.074
2.863GluThr: 2.863 ± 0.073
4.803GluVal: 4.803 ± 0.096
0.68GluTrp: 0.68 ± 0.037
3.167GluTyr: 3.167 ± 0.081
0.0GluXaa: 0.0 ± 0.0
Phe
3.54PheAla: 3.54 ± 0.096
0.525PheCys: 0.525 ± 0.032
3.506PheAsp: 3.506 ± 0.084
3.896PheGlu: 3.896 ± 0.1
3.151PhePhe: 3.151 ± 0.097
3.63PheGly: 3.63 ± 0.087
1.123PheHis: 1.123 ± 0.046
4.025PheIle: 4.025 ± 0.103
3.192PheLys: 3.192 ± 0.086
5.49PheLeu: 5.49 ± 0.128
1.157PheMet: 1.157 ± 0.044
1.892PheAsn: 1.892 ± 0.059
1.451PhePro: 1.451 ± 0.049
1.499PheGln: 1.499 ± 0.056
1.685PheArg: 1.685 ± 0.06
3.219PheSer: 3.219 ± 0.085
2.615PheThr: 2.615 ± 0.07
3.276PheVal: 3.276 ± 0.075
0.533PheTrp: 0.533 ± 0.032
2.042PheTyr: 2.042 ± 0.07
0.0PheXaa: 0.0 ± 0.0
Gly
4.264GlyAla: 4.264 ± 0.113
0.746GlyCys: 0.746 ± 0.037
3.194GlyAsp: 3.194 ± 0.092
3.746GlyGlu: 3.746 ± 0.083
3.524GlyPhe: 3.524 ± 0.088
4.035GlyGly: 4.035 ± 0.127
1.153GlyHis: 1.153 ± 0.05
5.374GlyIle: 5.374 ± 0.103
5.166GlyLys: 5.166 ± 0.098
5.321GlyLeu: 5.321 ± 0.112
1.669GlyMet: 1.669 ± 0.054
2.247GlyAsn: 2.247 ± 0.068
1.278GlyPro: 1.278 ± 0.047
1.414GlyGln: 1.414 ± 0.059
2.162GlyArg: 2.162 ± 0.068
3.451GlySer: 3.451 ± 0.092
2.86GlyThr: 2.86 ± 0.095
4.539GlyVal: 4.539 ± 0.086
0.673GlyTrp: 0.673 ± 0.034
2.862GlyTyr: 2.862 ± 0.086
0.0GlyXaa: 0.0 ± 0.0
His
1.208HisAla: 1.208 ± 0.049
0.203HisCys: 0.203 ± 0.015
0.935HisAsp: 0.935 ± 0.038
1.262HisGlu: 1.262 ± 0.04
1.344HisPhe: 1.344 ± 0.051
1.228HisGly: 1.228 ± 0.056
0.572HisHis: 0.572 ± 0.032
1.915HisIle: 1.915 ± 0.063
1.496HisLys: 1.496 ± 0.051
2.228HisLeu: 2.228 ± 0.062
0.542HisMet: 0.542 ± 0.034
0.874HisAsn: 0.874 ± 0.04
1.18HisPro: 1.18 ± 0.046
0.681HisGln: 0.681 ± 0.033
0.782HisArg: 0.782 ± 0.039
1.051HisSer: 1.051 ± 0.044
1.116HisThr: 1.116 ± 0.045
0.916HisVal: 0.916 ± 0.043
0.196HisTrp: 0.196 ± 0.018
0.971HisTyr: 0.971 ± 0.038
0.0HisXaa: 0.0 ± 0.0
Ile
6.968IleAla: 6.968 ± 0.131
0.705IleCys: 0.705 ± 0.038
6.13IleAsp: 6.13 ± 0.12
7.401IleGlu: 7.401 ± 0.13
4.102IlePhe: 4.102 ± 0.103
5.199IleGly: 5.199 ± 0.123
1.654IleHis: 1.654 ± 0.057
5.885IleIle: 5.885 ± 0.134
6.95IleLys: 6.95 ± 0.129
7.97IleLeu: 7.97 ± 0.124
1.463IleMet: 1.463 ± 0.057
3.415IleAsn: 3.415 ± 0.091
3.148IlePro: 3.148 ± 0.078
2.834IleGln: 2.834 ± 0.081
3.022IleArg: 3.022 ± 0.076
4.282IleSer: 4.282 ± 0.096
4.056IleThr: 4.056 ± 0.079
6.069IleVal: 6.069 ± 0.097
0.639IleTrp: 0.639 ± 0.035
3.114IleTyr: 3.114 ± 0.076
0.0IleXaa: 0.0 ± 0.0
Lys
5.928LysAla: 5.928 ± 0.107
0.579LysCys: 0.579 ± 0.034
4.681LysAsp: 4.681 ± 0.092
9.747LysGlu: 9.747 ± 0.178
2.656LysPhe: 2.656 ± 0.07
4.364LysGly: 4.364 ± 0.093
1.596LysHis: 1.596 ± 0.049
7.868LysIle: 7.868 ± 0.14
9.696LysLys: 9.696 ± 0.183
7.708LysLeu: 7.708 ± 0.126
2.03LysMet: 2.03 ± 0.059
4.97LysAsn: 4.97 ± 0.111
2.954LysPro: 2.954 ± 0.071
2.519LysGln: 2.519 ± 0.074
4.677LysArg: 4.677 ± 0.102
4.683LysSer: 4.683 ± 0.095
3.89LysThr: 3.89 ± 0.087
4.987LysVal: 4.987 ± 0.099
0.635LysTrp: 0.635 ± 0.035
2.974LysTyr: 2.974 ± 0.067
0.0LysXaa: 0.0 ± 0.0
Leu
6.899LeuAla: 6.899 ± 0.124
0.886LeuCys: 0.886 ± 0.041
5.355LeuAsp: 5.355 ± 0.093
6.974LeuGlu: 6.974 ± 0.121
6.083LeuPhe: 6.083 ± 0.123
5.824LeuGly: 5.824 ± 0.108
2.351LeuHis: 2.351 ± 0.068
7.188LeuIle: 7.188 ± 0.13
8.525LeuLys: 8.525 ± 0.138
10.115LeuLeu: 10.115 ± 0.206
2.173LeuMet: 2.173 ± 0.075
3.698LeuAsn: 3.698 ± 0.086
4.03LeuPro: 4.03 ± 0.099
4.824LeuGln: 4.824 ± 0.11
3.725LeuArg: 3.725 ± 0.09
6.084LeuSer: 6.084 ± 0.106
4.332LeuThr: 4.332 ± 0.105
5.393LeuVal: 5.393 ± 0.097
0.755LeuTrp: 0.755 ± 0.042
4.146LeuTyr: 4.146 ± 0.096
0.0LeuXaa: 0.0 ± 0.0
Met
1.913MetAla: 1.913 ± 0.056
0.14MetCys: 0.14 ± 0.015
1.199MetAsp: 1.199 ± 0.046
1.698MetGlu: 1.698 ± 0.054
0.894MetPhe: 0.894 ± 0.041
1.473MetGly: 1.473 ± 0.056
0.518MetHis: 0.518 ± 0.028
2.005MetIle: 2.005 ± 0.047
2.187MetLys: 2.187 ± 0.054
2.235MetLeu: 2.235 ± 0.069
0.586MetMet: 0.586 ± 0.033
0.918MetAsn: 0.918 ± 0.041
0.944MetPro: 0.944 ± 0.039
1.203MetGln: 1.203 ± 0.05
1.116MetArg: 1.116 ± 0.041
1.298MetSer: 1.298 ± 0.051
0.898MetThr: 0.898 ± 0.04
1.509MetVal: 1.509 ± 0.055
0.172MetTrp: 0.172 ± 0.017
0.657MetTyr: 0.657 ± 0.033
0.0MetXaa: 0.0 ± 0.0
Asn
3.023AsnAla: 3.023 ± 0.088
0.349AsnCys: 0.349 ± 0.027
2.185AsnAsp: 2.185 ± 0.079
3.322AsnGlu: 3.322 ± 0.082
2.286AsnPhe: 2.286 ± 0.059
2.604AsnGly: 2.604 ± 0.085
0.749AsnHis: 0.749 ± 0.042
4.221AsnIle: 4.221 ± 0.081
2.976AsnLys: 2.976 ± 0.08
4.013AsnLeu: 4.013 ± 0.08
0.853AsnMet: 0.853 ± 0.047
1.622AsnAsn: 1.622 ± 0.074
2.211AsnPro: 2.211 ± 0.063
1.233AsnGln: 1.233 ± 0.048
2.262AsnArg: 2.262 ± 0.069
2.145AsnSer: 2.145 ± 0.078
1.77AsnThr: 1.77 ± 0.081
2.63AsnVal: 2.63 ± 0.085
0.317AsnTrp: 0.317 ± 0.021
1.746AsnTyr: 1.746 ± 0.061
0.0AsnXaa: 0.0 ± 0.0
Pro
1.875ProAla: 1.875 ± 0.06
0.244ProCys: 0.244 ± 0.022
1.918ProAsp: 1.918 ± 0.055
2.366ProGlu: 2.366 ± 0.069
1.996ProPhe: 1.996 ± 0.063
1.964ProGly: 1.964 ± 0.062
0.857ProHis: 0.857 ± 0.041
2.913ProIle: 2.913 ± 0.072
3.788ProLys: 3.788 ± 0.099
3.567ProLeu: 3.567 ± 0.093
0.785ProMet: 0.785 ± 0.032
1.782ProAsn: 1.782 ± 0.051
1.208ProPro: 1.208 ± 0.048
1.291ProGln: 1.291 ± 0.042
1.092ProArg: 1.092 ± 0.039
1.964ProSer: 1.964 ± 0.056
1.892ProThr: 1.892 ± 0.064
2.439ProVal: 2.439 ± 0.068
0.332ProTrp: 0.332 ± 0.024
1.616ProTyr: 1.616 ± 0.059
0.0ProXaa: 0.0 ± 0.0
Gln
1.978GlnAla: 1.978 ± 0.063
0.213GlnCys: 0.213 ± 0.018
1.436GlnAsp: 1.436 ± 0.055
2.943GlnGlu: 2.943 ± 0.081
1.381GlnPhe: 1.381 ± 0.046
1.596GlnGly: 1.596 ± 0.047
0.579GlnHis: 0.579 ± 0.032
2.85GlnIle: 2.85 ± 0.067
5.045GlnLys: 5.045 ± 0.11
2.657GlnLeu: 2.657 ± 0.067
0.86GlnMet: 0.86 ± 0.038
2.104GlnAsn: 2.104 ± 0.072
0.901GlnPro: 0.901 ± 0.038
1.145GlnGln: 1.145 ± 0.054
1.502GlnArg: 1.502 ± 0.05
1.814GlnSer: 1.814 ± 0.06
1.836GlnThr: 1.836 ± 0.063
1.553GlnVal: 1.553 ± 0.045
0.283GlnTrp: 0.283 ± 0.021
1.068GlnTyr: 1.068 ± 0.038
0.0GlnXaa: 0.0 ± 0.0
Arg
2.587ArgAla: 2.587 ± 0.063
0.421ArgCys: 0.421 ± 0.027
1.869ArgAsp: 1.869 ± 0.062
2.621ArgGlu: 2.621 ± 0.07
2.705ArgPhe: 2.705 ± 0.072
2.276ArgGly: 2.276 ± 0.075
0.729ArgHis: 0.729 ± 0.04
3.18ArgIle: 3.18 ± 0.083
3.165ArgLys: 3.165 ± 0.078
4.238ArgLeu: 4.238 ± 0.083
1.037ArgMet: 1.037 ± 0.046
1.632ArgAsn: 1.632 ± 0.056
1.317ArgPro: 1.317 ± 0.051
0.986ArgGln: 0.986 ± 0.041
1.678ArgArg: 1.678 ± 0.05
2.238ArgSer: 2.238 ± 0.07
1.482ArgThr: 1.482 ± 0.057
2.754ArgVal: 2.754 ± 0.074
0.438ArgTrp: 0.438 ± 0.028
2.332ArgTyr: 2.332 ± 0.057
0.0ArgXaa: 0.0 ± 0.0
Ser
3.444SerAla: 3.444 ± 0.086
0.562SerCys: 0.562 ± 0.04
2.804SerAsp: 2.804 ± 0.089
3.206SerGlu: 3.206 ± 0.082
3.504SerPhe: 3.504 ± 0.093
3.725SerGly: 3.725 ± 0.089
1.196SerHis: 1.196 ± 0.048
4.878SerIle: 4.878 ± 0.109
4.369SerLys: 4.369 ± 0.088
5.684SerLeu: 5.684 ± 0.102
1.373SerMet: 1.373 ± 0.045
2.206SerAsn: 2.206 ± 0.077
1.736SerPro: 1.736 ± 0.055
1.751SerGln: 1.751 ± 0.056
1.933SerArg: 1.933 ± 0.06
3.223SerSer: 3.223 ± 0.083
2.415SerThr: 2.415 ± 0.07
3.306SerVal: 3.306 ± 0.076
0.535SerTrp: 0.535 ± 0.034
2.409SerTyr: 2.409 ± 0.064
0.0SerXaa: 0.0 ± 0.0
Thr
2.983ThrAla: 2.983 ± 0.091
0.341ThrCys: 0.341 ± 0.024
2.143ThrAsp: 2.143 ± 0.084
1.996ThrGlu: 1.996 ± 0.061
2.376ThrPhe: 2.376 ± 0.069
2.971ThrGly: 2.971 ± 0.091
0.981ThrHis: 0.981 ± 0.044
4.647ThrIle: 4.647 ± 0.103
4.122ThrLys: 4.122 ± 0.088
5.374ThrLeu: 5.374 ± 0.103
0.974ThrMet: 0.974 ± 0.043
1.978ThrAsn: 1.978 ± 0.076
2.376ThrPro: 2.376 ± 0.064
1.569ThrGln: 1.569 ± 0.054
1.472ThrArg: 1.472 ± 0.049
2.463ThrSer: 2.463 ± 0.073
2.502ThrThr: 2.502 ± 0.082
2.826ThrVal: 2.826 ± 0.089
0.339ThrTrp: 0.339 ± 0.027
1.896ThrTyr: 1.896 ± 0.057
0.0ThrXaa: 0.0 ± 0.0
Val
5.073ValAla: 5.073 ± 0.101
0.63ValCys: 0.63 ± 0.036
3.795ValAsp: 3.795 ± 0.088
4.553ValGlu: 4.553 ± 0.097
2.673ValPhe: 2.673 ± 0.07
3.989ValGly: 3.989 ± 0.108
1.318ValHis: 1.318 ± 0.058
4.884ValIle: 4.884 ± 0.096
4.752ValLys: 4.752 ± 0.111
6.044ValLeu: 6.044 ± 0.112
1.637ValMet: 1.637 ± 0.06
2.134ValAsn: 2.134 ± 0.063
2.277ValPro: 2.277 ± 0.072
2.221ValGln: 2.221 ± 0.07
2.335ValArg: 2.335 ± 0.069
3.661ValSer: 3.661 ± 0.075
3.023ValThr: 3.023 ± 0.087
4.866ValVal: 4.866 ± 0.11
0.538ValTrp: 0.538 ± 0.035
2.448ValTyr: 2.448 ± 0.082
0.0ValXaa: 0.0 ± 0.0
Trp
0.535TrpAla: 0.535 ± 0.031
0.092TrpCys: 0.092 ± 0.013
0.455TrpAsp: 0.455 ± 0.029
0.523TrpGlu: 0.523 ± 0.031
0.443TrpPhe: 0.443 ± 0.028
0.531TrpGly: 0.531 ± 0.032
0.213TrpHis: 0.213 ± 0.018
0.838TrpIle: 0.838 ± 0.041
0.567TrpLys: 0.567 ± 0.038
0.939TrpLeu: 0.939 ± 0.047
0.351TrpMet: 0.351 ± 0.023
0.358TrpAsn: 0.358 ± 0.026
0.211TrpPro: 0.211 ± 0.019
0.382TrpGln: 0.382 ± 0.028
0.315TrpArg: 0.315 ± 0.024
0.485TrpSer: 0.485 ± 0.033
0.293TrpThr: 0.293 ± 0.023
0.525TrpVal: 0.525 ± 0.03
0.129TrpTrp: 0.129 ± 0.017
0.399TrpTyr: 0.399 ± 0.027
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.71TyrAla: 2.71 ± 0.068
0.353TyrCys: 0.353 ± 0.027
2.3TyrAsp: 2.3 ± 0.072
3.228TyrGlu: 3.228 ± 0.083
2.288TyrPhe: 2.288 ± 0.061
2.415TyrGly: 2.415 ± 0.073
0.928TyrHis: 0.928 ± 0.041
3.08TyrIle: 3.08 ± 0.072
3.415TyrLys: 3.415 ± 0.079
4.143TyrLeu: 4.143 ± 0.099
0.841TyrMet: 0.841 ± 0.039
1.79TyrAsn: 1.79 ± 0.064
1.688TyrPro: 1.688 ± 0.051
1.455TyrGln: 1.455 ± 0.052
1.785TyrArg: 1.785 ± 0.053
2.058TyrSer: 2.058 ± 0.066
1.889TyrThr: 1.889 ± 0.064
2.194TyrVal: 2.194 ± 0.052
0.402TyrTrp: 0.402 ± 0.032
1.734TyrTyr: 1.734 ± 0.057
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1843 proteins (587077 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski