Amino acid dipeptide frequency for Candidatus Desulfovibrio trichonymphae

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possibilities (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins (with all amino acids equally likely), each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
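As a rough illustration of how such frequencies could be computed, the Python sketch below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name, the toy sequences, the use of one-letter residue codes, and the choice of denominator (the total number of counted dipeptides) are assumptions made for this example, not the Proteome-pI implementation.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} for one-letter protein sequences."""
    counts = Counter()
    for seq in sequences:
        # Overlapping pairs of adjacent residues; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input, not real proteome data.
toy_proteins = ["MAAL", "MALA"]
freqs = dipeptide_frequencies(toy_proteins)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 1))
```

The order of residues matters here: "AL" and "LA" are counted separately, which matches the table, where AlaLeu and LeuAla have distinct entries.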

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
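The unit conversion and the comparison against the uniform expectation can be sketched in a few lines of Python. The two values used are the AlaAla and CysCys entries from the table; the helper names are hypothetical, and the assumption that the red/blue marking uses the uniform 1/400 expectation as its threshold follows the description above rather than any documented rule.

```python
# Uniform expectation for 400 equally likely dipeptides, in per mille.
UNIFORM_EXPECTATION_PER_MILLE = 1000 / 400  # 2.5


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected=UNIFORM_EXPECTATION_PER_MILLE):
    """Label a dipeptide relative to the assumed uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# Values taken from the table: AlaAla = 14.832, CysCys = 0.377 (both in per mille).
for name, value in [("AlaAla", 14.832), ("CysCys", 0.377)]:
    print(name, per_mille_to_fraction(value), classify(value))
```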

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
14.832AlaAla: 14.832 ± 0.272
2.26AlaCys: 2.26 ± 0.091
5.991AlaAsp: 5.991 ± 0.135
6.376AlaGlu: 6.376 ± 0.138
4.064AlaPhe: 4.064 ± 0.12
9.93AlaGly: 9.93 ± 0.192
2.263AlaHis: 2.263 ± 0.08
3.93AlaIle: 3.93 ± 0.117
3.939AlaLys: 3.939 ± 0.109
13.486AlaLeu: 13.486 ± 0.237
2.743AlaMet: 2.743 ± 0.103
2.481AlaAsn: 2.481 ± 0.089
4.746AlaPro: 4.746 ± 0.119
3.6AlaGln: 3.6 ± 0.1
8.027AlaArg: 8.027 ± 0.157
5.085AlaSer: 5.085 ± 0.136
4.217AlaThr: 4.217 ± 0.102
9.228AlaVal: 9.228 ± 0.16
1.294AlaTrp: 1.294 ± 0.064
2.402AlaTyr: 2.402 ± 0.079
0.0AlaXaa: 0.0 ± 0.0
Cys
2.012CysAla: 2.012 ± 0.076
0.377CysCys: 0.377 ± 0.035
0.802CysAsp: 0.802 ± 0.052
0.802CysGlu: 0.802 ± 0.042
0.751CysPhe: 0.751 ± 0.053
1.987CysGly: 1.987 ± 0.083
0.437CysHis: 0.437 ± 0.051
0.822CysIle: 0.822 ± 0.047
0.576CysLys: 0.576 ± 0.047
2.05CysLeu: 2.05 ± 0.081
0.445CysMet: 0.445 ± 0.035
0.53CysAsn: 0.53 ± 0.038
1.152CysPro: 1.152 ± 0.065
0.317CysGln: 0.317 ± 0.03
1.493CysArg: 1.493 ± 0.058
0.879CysSer: 0.879 ± 0.052
0.832CysThr: 0.832 ± 0.054
1.261CysVal: 1.261 ± 0.056
0.18CysTrp: 0.18 ± 0.024
0.388CysTyr: 0.388 ± 0.032
0.0CysXaa: 0.0 ± 0.0
Asp
5.83AspAla: 5.83 ± 0.136
0.846AspCys: 0.846 ± 0.042
2.489AspAsp: 2.489 ± 0.09
3.182AspGlu: 3.182 ± 0.101
2.287AspPhe: 2.287 ± 0.075
4.141AspGly: 4.141 ± 0.114
0.953AspHis: 0.953 ± 0.047
3.614AspIle: 3.614 ± 0.106
2.396AspLys: 2.396 ± 0.085
4.826AspLeu: 4.826 ± 0.125
2.061AspMet: 2.061 ± 0.064
1.741AspAsn: 1.741 ± 0.071
2.26AspPro: 2.26 ± 0.081
1.122AspGln: 1.122 ± 0.053
2.713AspArg: 2.713 ± 0.085
2.399AspSer: 2.399 ± 0.083
2.489AspThr: 2.489 ± 0.093
4.312AspVal: 4.312 ± 0.1
0.701AspTrp: 0.701 ± 0.044
1.411AspTyr: 1.411 ± 0.066
0.0AspXaa: 0.0 ± 0.0
Glu
6.381GluAla: 6.381 ± 0.151
0.734GluCys: 0.734 ± 0.043
2.95GluAsp: 2.95 ± 0.092
3.851GluGlu: 3.851 ± 0.121
1.752GluPhe: 1.752 ± 0.076
3.878GluGly: 3.878 ± 0.106
1.55GluHis: 1.55 ± 0.067
3.505GluIle: 3.505 ± 0.116
3.357GluLys: 3.357 ± 0.108
5.587GluLeu: 5.587 ± 0.126
1.823GluMet: 1.823 ± 0.073
2.451GluAsn: 2.451 ± 0.092
2.102GluPro: 2.102 ± 0.076
2.295GluGln: 2.295 ± 0.085
3.958GluArg: 3.958 ± 0.098
2.65GluSer: 2.65 ± 0.098
3.016GluThr: 3.016 ± 0.09
3.586GluVal: 3.586 ± 0.1
0.557GluTrp: 0.557 ± 0.044
1.414GluTyr: 1.414 ± 0.06
0.0GluXaa: 0.0 ± 0.0
Phe
4.176PheAla: 4.176 ± 0.111
1.013PheCys: 1.013 ± 0.054
2.208PheAsp: 2.208 ± 0.074
1.982PheGlu: 1.982 ± 0.074
2.028PhePhe: 2.028 ± 0.092
2.997PheGly: 2.997 ± 0.088
0.824PheHis: 0.824 ± 0.05
1.793PheIle: 1.793 ± 0.075
1.318PheLys: 1.318 ± 0.067
4.059PheLeu: 4.059 ± 0.119
1.108PheMet: 1.108 ± 0.061
1.157PheAsn: 1.157 ± 0.058
1.769PhePro: 1.769 ± 0.07
0.974PheGln: 0.974 ± 0.052
2.192PheArg: 2.192 ± 0.076
2.721PheSer: 2.721 ± 0.08
2.203PheThr: 2.203 ± 0.086
2.989PheVal: 2.989 ± 0.102
0.701PheTrp: 0.701 ± 0.047
1.103PheTyr: 1.103 ± 0.064
0.0PheXaa: 0.0 ± 0.0
Gly
7.874GlyAla: 7.874 ± 0.167
1.466GlyCys: 1.466 ± 0.061
3.767GlyAsp: 3.767 ± 0.111
4.255GlyGlu: 4.255 ± 0.11
3.297GlyPhe: 3.297 ± 0.113
6.512GlyGly: 6.512 ± 0.168
1.886GlyHis: 1.886 ± 0.075
4.706GlyIle: 4.706 ± 0.11
3.726GlyLys: 3.726 ± 0.097
8.199GlyLeu: 8.199 ± 0.143
2.593GlyMet: 2.593 ± 0.083
2.274GlyAsn: 2.274 ± 0.087
2.691GlyPro: 2.691 ± 0.086
2.901GlyGln: 2.901 ± 0.082
6.158GlyArg: 6.158 ± 0.141
3.939GlySer: 3.939 ± 0.105
3.619GlyThr: 3.619 ± 0.095
6.147GlyVal: 6.147 ± 0.146
0.822GlyTrp: 0.822 ± 0.05
2.151GlyTyr: 2.151 ± 0.086
0.0GlyXaa: 0.0 ± 0.0
His
2.364HisAla: 2.364 ± 0.086
0.486HisCys: 0.486 ± 0.032
1.125HisAsp: 1.125 ± 0.056
1.242HisGlu: 1.242 ± 0.058
1.013HisPhe: 1.013 ± 0.053
1.87HisGly: 1.87 ± 0.066
0.519HisHis: 0.519 ± 0.043
1.46HisIle: 1.46 ± 0.06
1.024HisLys: 1.024 ± 0.053
2.492HisLeu: 2.492 ± 0.092
0.499HisMet: 0.499 ± 0.039
0.92HisAsn: 0.92 ± 0.051
1.335HisPro: 1.335 ± 0.054
0.524HisGln: 0.524 ± 0.035
1.165HisArg: 1.165 ± 0.058
1.168HisSer: 1.168 ± 0.054
1.097HisThr: 1.097 ± 0.056
1.441HisVal: 1.441 ± 0.06
0.292HisTrp: 0.292 ± 0.027
0.587HisTyr: 0.587 ± 0.043
0.0HisXaa: 0.0 ± 0.0
Ile
5.221IleAla: 5.221 ± 0.111
0.969IleCys: 0.969 ± 0.051
2.544IleAsp: 2.544 ± 0.087
2.437IleGlu: 2.437 ± 0.081
2.396IlePhe: 2.396 ± 0.076
3.521IleGly: 3.521 ± 0.123
1.034IleHis: 1.034 ± 0.059
2.557IleIle: 2.557 ± 0.101
2.025IleLys: 2.025 ± 0.086
5.066IleLeu: 5.066 ± 0.138
1.471IleMet: 1.471 ± 0.059
1.599IleAsn: 1.599 ± 0.066
2.334IlePro: 2.334 ± 0.094
1.288IleGln: 1.288 ± 0.053
3.068IleArg: 3.068 ± 0.096
3.27IleSer: 3.27 ± 0.101
2.713IleThr: 2.713 ± 0.102
3.971IleVal: 3.971 ± 0.119
0.54IleTrp: 0.54 ± 0.034
1.296IleTyr: 1.296 ± 0.06
0.0IleXaa: 0.0 ± 0.0
Lys
4.484LysAla: 4.484 ± 0.111
0.448LysCys: 0.448 ± 0.04
2.042LysAsp: 2.042 ± 0.091
2.476LysGlu: 2.476 ± 0.11
1.157LysPhe: 1.157 ± 0.068
2.991LysGly: 2.991 ± 0.106
0.77LysHis: 0.77 ± 0.051
2.533LysIle: 2.533 ± 0.089
2.811LysLys: 2.811 ± 0.119
3.66LysLeu: 3.66 ± 0.135
1.133LysMet: 1.133 ± 0.071
2.102LysAsn: 2.102 ± 0.086
1.905LysPro: 1.905 ± 0.073
1.37LysGln: 1.37 ± 0.067
2.454LysArg: 2.454 ± 0.082
2.364LysSer: 2.364 ± 0.087
2.754LysThr: 2.754 ± 0.096
2.44LysVal: 2.44 ± 0.097
0.292LysTrp: 0.292 ± 0.031
1.004LysTyr: 1.004 ± 0.053
0.0LysXaa: 0.0 ± 0.0
Leu
13.371LeuAla: 13.371 ± 0.201
2.173LeuCys: 2.173 ± 0.076
5.606LeuAsp: 5.606 ± 0.119
6.067LeuGlu: 6.067 ± 0.146
4.124LeuPhe: 4.124 ± 0.112
7.842LeuGly: 7.842 ± 0.138
2.661LeuHis: 2.661 ± 0.092
4.274LeuIle: 4.274 ± 0.111
3.974LeuLys: 3.974 ± 0.123
12.615LeuLeu: 12.615 ± 0.299
2.276LeuMet: 2.276 ± 0.077
3.215LeuAsn: 3.215 ± 0.099
6.728LeuPro: 6.728 ± 0.162
3.335LeuGln: 3.335 ± 0.102
8.011LeuArg: 8.011 ± 0.17
6.354LeuSer: 6.354 ± 0.143
6.209LeuThr: 6.209 ± 0.129
7.277LeuVal: 7.277 ± 0.152
1.291LeuTrp: 1.291 ± 0.071
2.489LeuTyr: 2.489 ± 0.086
0.0LeuXaa: 0.0 ± 0.0
Met
2.85MetAla: 2.85 ± 0.095
0.36MetCys: 0.36 ± 0.034
1.507MetAsp: 1.507 ± 0.064
1.659MetGlu: 1.659 ± 0.064
0.781MetPhe: 0.781 ± 0.043
2.151MetGly: 2.151 ± 0.086
0.628MetHis: 0.628 ± 0.044
1.13MetIle: 1.13 ± 0.063
0.996MetLys: 0.996 ± 0.059
3.169MetLeu: 3.169 ± 0.102
0.565MetMet: 0.565 ± 0.05
0.966MetAsn: 0.966 ± 0.048
1.752MetPro: 1.752 ± 0.076
1.064MetGln: 1.064 ± 0.058
2.178MetArg: 2.178 ± 0.074
1.558MetSer: 1.558 ± 0.06
1.55MetThr: 1.55 ± 0.068
1.498MetVal: 1.498 ± 0.068
0.123MetTrp: 0.123 ± 0.018
0.543MetTyr: 0.543 ± 0.037
0.0MetXaa: 0.0 ± 0.0
Asn
3.739AsnAla: 3.739 ± 0.108
0.535AsnCys: 0.535 ± 0.042
1.526AsnAsp: 1.526 ± 0.071
1.52AsnGlu: 1.52 ± 0.056
1.373AsnPhe: 1.373 ± 0.058
2.298AsnGly: 2.298 ± 0.086
0.483AsnHis: 0.483 ± 0.038
2.028AsnIle: 2.028 ± 0.086
1.275AsnLys: 1.275 ± 0.068
3.447AsnLeu: 3.447 ± 0.101
0.983AsnMet: 0.983 ± 0.049
1.032AsnAsn: 1.032 ± 0.065
2.033AsnPro: 2.033 ± 0.078
0.824AsnGln: 0.824 ± 0.049
1.758AsnArg: 1.758 ± 0.074
1.55AsnSer: 1.55 ± 0.058
1.736AsnThr: 1.736 ± 0.068
2.38AsnVal: 2.38 ± 0.088
0.344AsnTrp: 0.344 ± 0.037
0.816AsnTyr: 0.816 ± 0.054
0.0AsnXaa: 0.0 ± 0.0
Pro
5.396ProAla: 5.396 ± 0.134
0.813ProCys: 0.813 ± 0.047
3.455ProAsp: 3.455 ± 0.103
3.663ProGlu: 3.663 ± 0.1
1.919ProPhe: 1.919 ± 0.076
4.015ProGly: 4.015 ± 0.099
1.215ProHis: 1.215 ± 0.058
1.528ProIle: 1.528 ± 0.067
1.556ProLys: 1.556 ± 0.081
5.418ProLeu: 5.418 ± 0.128
1.018ProMet: 1.018 ± 0.052
1.168ProAsn: 1.168 ± 0.051
2.315ProPro: 2.315 ± 0.098
1.905ProGln: 1.905 ± 0.065
2.819ProArg: 2.819 ± 0.084
2.484ProSer: 2.484 ± 0.087
2.003ProThr: 2.003 ± 0.071
4.266ProVal: 4.266 ± 0.109
0.745ProTrp: 0.745 ± 0.044
1.155ProTyr: 1.155 ± 0.048
0.0ProXaa: 0.0 ± 0.0
Gln
3.513GlnAla: 3.513 ± 0.088
0.535GlnCys: 0.535 ± 0.039
1.646GlnAsp: 1.646 ± 0.076
1.954GlnGlu: 1.954 ± 0.08
0.947GlnPhe: 0.947 ± 0.052
2.544GlnGly: 2.544 ± 0.092
0.778GlnHis: 0.778 ± 0.042
1.668GlnIle: 1.668 ± 0.071
1.782GlnLys: 1.782 ± 0.073
2.536GlnLeu: 2.536 ± 0.087
0.991GlnMet: 0.991 ± 0.055
1.296GlnAsn: 1.296 ± 0.054
1.359GlnPro: 1.359 ± 0.072
1.198GlnGln: 1.198 ± 0.064
2.279GlnArg: 2.279 ± 0.087
1.87GlnSer: 1.87 ± 0.069
1.913GlnThr: 1.913 ± 0.076
1.851GlnVal: 1.851 ± 0.08
0.388GlnTrp: 0.388 ± 0.034
0.781GlnTyr: 0.781 ± 0.044
0.0GlnXaa: 0.0 ± 0.0
Arg
6.815ArgAla: 6.815 ± 0.163
1.215ArgCys: 1.215 ± 0.063
3.466ArgAsp: 3.466 ± 0.093
4.823ArgGlu: 4.823 ± 0.138
2.819ArgPhe: 2.819 ± 0.09
4.211ArgGly: 4.211 ± 0.115
1.886ArgHis: 1.886 ± 0.076
3.788ArgIle: 3.788 ± 0.122
3.03ArgLys: 3.03 ± 0.099
8.456ArgLeu: 8.456 ± 0.17
1.935ArgMet: 1.935 ± 0.071
2.238ArgAsn: 2.238 ± 0.084
3.267ArgPro: 3.267 ± 0.105
2.634ArgGln: 2.634 ± 0.086
5.363ArgArg: 5.363 ± 0.149
3.038ArgSer: 3.038 ± 0.087
2.811ArgThr: 2.811 ± 0.093
4.697ArgVal: 4.697 ± 0.128
0.704ArgTrp: 0.704 ± 0.044
1.872ArgTyr: 1.872 ± 0.068
0.0ArgXaa: 0.0 ± 0.0
Ser
5.568SerAla: 5.568 ± 0.135
0.972SerCys: 0.972 ± 0.046
2.435SerAsp: 2.435 ± 0.09
2.508SerGlu: 2.508 ± 0.079
2.306SerPhe: 2.306 ± 0.076
5.049SerGly: 5.049 ± 0.111
1.187SerHis: 1.187 ± 0.055
2.399SerIle: 2.399 ± 0.089
1.632SerLys: 1.632 ± 0.075
6.461SerLeu: 6.461 ± 0.154
1.392SerMet: 1.392 ± 0.057
1.332SerAsn: 1.332 ± 0.058
2.765SerPro: 2.765 ± 0.093
1.433SerGln: 1.433 ± 0.053
4.08SerArg: 4.08 ± 0.12
2.953SerSer: 2.953 ± 0.114
2.416SerThr: 2.416 ± 0.08
4.282SerVal: 4.282 ± 0.098
0.595SerTrp: 0.595 ± 0.039
1.19SerTyr: 1.19 ± 0.06
0.0SerXaa: 0.0 ± 0.0
Thr
5.994ThrAla: 5.994 ± 0.132
0.781ThrCys: 0.781 ± 0.042
2.697ThrAsp: 2.697 ± 0.095
2.503ThrGlu: 2.503 ± 0.076
1.741ThrPhe: 1.741 ± 0.068
4.834ThrGly: 4.834 ± 0.106
1.094ThrHis: 1.094 ± 0.056
2.11ThrIle: 2.11 ± 0.078
1.569ThrLys: 1.569 ± 0.071
5.797ThrLeu: 5.797 ± 0.12
1.195ThrMet: 1.195 ± 0.06
1.264ThrAsn: 1.264 ± 0.062
3.215ThrPro: 3.215 ± 0.097
1.389ThrGln: 1.389 ± 0.064
3.273ThrArg: 3.273 ± 0.104
2.517ThrSer: 2.517 ± 0.086
2.497ThrThr: 2.497 ± 0.097
4.012ThrVal: 4.012 ± 0.104
0.53ThrTrp: 0.53 ± 0.038
1.015ThrTyr: 1.015 ± 0.057
0.0ThrXaa: 0.0 ± 0.0
Val
7.061ValAla: 7.061 ± 0.165
1.49ValCys: 1.49 ± 0.06
3.698ValAsp: 3.698 ± 0.113
4.233ValGlu: 4.233 ± 0.114
2.989ValPhe: 2.989 ± 0.091
5.27ValGly: 5.27 ± 0.15
1.58ValHis: 1.58 ± 0.075
3.786ValIle: 3.786 ± 0.114
2.746ValLys: 2.746 ± 0.1
8.415ValLeu: 8.415 ± 0.153
1.949ValMet: 1.949 ± 0.077
2.593ValAsn: 2.593 ± 0.08
3.3ValPro: 3.3 ± 0.094
2.402ValGln: 2.402 ± 0.077
5.434ValArg: 5.434 ± 0.134
4.203ValSer: 4.203 ± 0.102
4.067ValThr: 4.067 ± 0.096
5.142ValVal: 5.142 ± 0.167
0.852ValTrp: 0.852 ± 0.051
1.87ValTyr: 1.87 ± 0.074
0.0ValXaa: 0.0 ± 0.0
Trp
0.98TrpAla: 0.98 ± 0.056
0.169TrpCys: 0.169 ± 0.021
0.516TrpAsp: 0.516 ± 0.036
0.568TrpGlu: 0.568 ± 0.048
0.478TrpPhe: 0.478 ± 0.039
0.797TrpGly: 0.797 ± 0.052
0.33TrpHis: 0.33 ± 0.031
0.442TrpIle: 0.442 ± 0.04
0.453TrpLys: 0.453 ± 0.036
1.643TrpLeu: 1.643 ± 0.086
0.246TrpMet: 0.246 ± 0.026
0.407TrpAsn: 0.407 ± 0.033
0.682TrpPro: 0.682 ± 0.051
0.581TrpGln: 0.581 ± 0.04
1.084TrpArg: 1.084 ± 0.057
0.513TrpSer: 0.513 ± 0.039
0.579TrpThr: 0.579 ± 0.043
0.59TrpVal: 0.59 ± 0.044
0.175TrpTrp: 0.175 ± 0.024
0.259TrpTyr: 0.259 ± 0.029
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.432TyrAla: 2.432 ± 0.094
0.45TyrCys: 0.45 ± 0.036
1.305TyrAsp: 1.305 ± 0.06
1.296TyrGlu: 1.296 ± 0.07
1.092TyrPhe: 1.092 ± 0.065
2.145TyrGly: 2.145 ± 0.081
0.543TyrHis: 0.543 ± 0.04
1.25TyrIle: 1.25 ± 0.063
1.004TyrLys: 1.004 ± 0.06
2.514TyrLeu: 2.514 ± 0.087
0.6TyrMet: 0.6 ± 0.044
0.914TyrAsn: 0.914 ± 0.046
1.146TyrPro: 1.146 ± 0.063
0.669TyrGln: 0.669 ± 0.042
1.564TyrArg: 1.564 ± 0.065
1.441TyrSer: 1.441 ± 0.064
1.305TyrThr: 1.305 ± 0.057
1.709TyrVal: 1.709 ± 0.071
0.366TyrTrp: 0.366 ± 0.032
0.753TyrTyr: 0.753 ± 0.051
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1079 proteins (366380 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
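A protein-level bootstrap of this kind could look roughly like the Python sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values is taken as the error. The function names, the fixed seed, the toy sequences, and the use of the sample standard deviation as the error measure are assumptions for illustration; the source states only that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics


def dipeptide_per_mille(sequences, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    total = 0
    hits = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, dipeptide, replicates=100, seed=0):
    """Estimate the error by resampling whole proteins with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Each replicate draws as many proteins as the original set contains.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not real data.
toy_proteins = ["MAAL", "MALA", "AAAA", "MLLK"]
print(bootstrap_error(toy_proteins, "AA"))
```

Resampling proteins rather than individual dipeptides keeps residues from the same protein together, so the estimate reflects variation between proteins.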

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski