Amino acid dipepetide frequency for Klebsiella pneumoniae subsp. ozaenae

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.964AlaAla: 11.964 ± 0.141
1.252AlaCys: 1.252 ± 0.034
5.421AlaAsp: 5.421 ± 0.068
5.97AlaGlu: 5.97 ± 0.078
3.682AlaPhe: 3.682 ± 0.048
8.789AlaGly: 8.789 ± 0.088
2.016AlaHis: 2.016 ± 0.036
5.925AlaIle: 5.925 ± 0.071
3.609AlaLys: 3.609 ± 0.054
12.563AlaLeu: 12.563 ± 0.103
3.208AlaMet: 3.208 ± 0.046
2.888AlaAsn: 2.888 ± 0.045
4.127AlaPro: 4.127 ± 0.068
4.635AlaGln: 4.635 ± 0.065
6.421AlaArg: 6.421 ± 0.074
5.773AlaSer: 5.773 ± 0.074
4.859AlaThr: 4.859 ± 0.065
7.357AlaVal: 7.357 ± 0.077
1.844AlaTrp: 1.844 ± 0.042
1.926AlaTyr: 1.926 ± 0.033
0.0AlaXaa: 0.0 ± 0.0
Cys
1.164CysAla: 1.164 ± 0.029
0.222CysCys: 0.222 ± 0.012
0.573CysAsp: 0.573 ± 0.02
0.571CysGlu: 0.571 ± 0.02
0.463CysPhe: 0.463 ± 0.019
1.116CysGly: 1.116 ± 0.028
0.326CysHis: 0.326 ± 0.016
0.54CysIle: 0.54 ± 0.02
0.318CysLys: 0.318 ± 0.016
1.082CysLeu: 1.082 ± 0.025
0.313CysMet: 0.313 ± 0.015
0.302CysAsn: 0.302 ± 0.015
0.584CysPro: 0.584 ± 0.027
0.459CysGln: 0.459 ± 0.017
0.952CysArg: 0.952 ± 0.031
0.743CysSer: 0.743 ± 0.027
0.477CysThr: 0.477 ± 0.019
0.772CysVal: 0.772 ± 0.025
0.254CysTrp: 0.254 ± 0.012
0.334CysTyr: 0.334 ± 0.014
0.0CysXaa: 0.0 ± 0.0
Asp
5.318AspAla: 5.318 ± 0.064
0.514AspCys: 0.514 ± 0.017
2.933AspAsp: 2.933 ± 0.049
3.363AspGlu: 3.363 ± 0.054
1.996AspPhe: 1.996 ± 0.035
3.836AspGly: 3.836 ± 0.055
1.028AspHis: 1.028 ± 0.029
3.16AspIle: 3.16 ± 0.055
2.315AspLys: 2.315 ± 0.04
4.67AspLeu: 4.67 ± 0.058
1.282AspMet: 1.282 ± 0.028
2.058AspAsn: 2.058 ± 0.041
2.416AspPro: 2.416 ± 0.039
1.577AspGln: 1.577 ± 0.035
2.98AspArg: 2.98 ± 0.048
2.735AspSer: 2.735 ± 0.065
2.325AspThr: 2.325 ± 0.042
3.513AspVal: 3.513 ± 0.055
0.829AspTrp: 0.829 ± 0.023
1.76AspTyr: 1.76 ± 0.038
0.0AspXaa: 0.0 ± 0.0
Glu
5.56GluAla: 5.56 ± 0.072
0.456GluCys: 0.456 ± 0.019
2.166GluAsp: 2.166 ± 0.042
3.179GluGlu: 3.179 ± 0.052
1.672GluPhe: 1.672 ± 0.036
3.499GluGly: 3.499 ± 0.057
1.323GluHis: 1.323 ± 0.03
2.985GluIle: 2.985 ± 0.047
2.914GluLys: 2.914 ± 0.052
5.566GluLeu: 5.566 ± 0.072
1.76GluMet: 1.76 ± 0.035
2.036GluAsn: 2.036 ± 0.039
2.054GluPro: 2.054 ± 0.044
3.193GluGln: 3.193 ± 0.052
3.809GluArg: 3.809 ± 0.057
2.899GluSer: 2.899 ± 0.043
2.778GluThr: 2.778 ± 0.048
3.65GluVal: 3.65 ± 0.052
0.791GluTrp: 0.791 ± 0.024
1.344GluTyr: 1.344 ± 0.031
0.0GluXaa: 0.0 ± 0.0
Phe
3.877PheAla: 3.877 ± 0.053
0.573PheCys: 0.573 ± 0.021
2.179PheAsp: 2.179 ± 0.04
1.543PheGlu: 1.543 ± 0.032
1.669PhePhe: 1.669 ± 0.04
3.065PheGly: 3.065 ± 0.045
0.807PheHis: 0.807 ± 0.021
2.4PheIle: 2.4 ± 0.043
1.12PheLys: 1.12 ± 0.028
3.228PheLeu: 3.228 ± 0.055
0.962PheMet: 0.962 ± 0.028
1.58PheAsn: 1.58 ± 0.03
1.594PhePro: 1.594 ± 0.032
1.127PheGln: 1.127 ± 0.028
1.995PheArg: 1.995 ± 0.037
3.016PheSer: 3.016 ± 0.053
2.396PheThr: 2.396 ± 0.048
2.495PheVal: 2.495 ± 0.042
0.649PheTrp: 0.649 ± 0.024
1.187PheTyr: 1.187 ± 0.03
0.0PheXaa: 0.0 ± 0.0
Gly
6.841GlyAla: 6.841 ± 0.074
1.097GlyCys: 1.097 ± 0.032
3.898GlyAsp: 3.898 ± 0.051
4.715GlyGlu: 4.715 ± 0.059
3.167GlyPhe: 3.167 ± 0.049
6.073GlyGly: 6.073 ± 0.081
1.713GlyHis: 1.713 ± 0.034
4.856GlyIle: 4.856 ± 0.064
3.801GlyLys: 3.801 ± 0.053
7.841GlyLeu: 7.841 ± 0.095
2.557GlyMet: 2.557 ± 0.044
2.546GlyAsn: 2.546 ± 0.041
2.3GlyPro: 2.3 ± 0.034
3.054GlyGln: 3.054 ± 0.049
4.461GlyArg: 4.461 ± 0.064
4.214GlySer: 4.214 ± 0.052
3.526GlyThr: 3.526 ± 0.05
5.797GlyVal: 5.797 ± 0.068
1.392GlyTrp: 1.392 ± 0.031
2.61GlyTyr: 2.61 ± 0.053
0.0GlyXaa: 0.0 ± 0.0
His
1.991HisAla: 1.991 ± 0.032
0.329HisCys: 0.329 ± 0.016
1.215HisAsp: 1.215 ± 0.027
1.027HisGlu: 1.027 ± 0.026
1.038HisPhe: 1.038 ± 0.023
1.843HisGly: 1.843 ± 0.038
0.867HisHis: 0.867 ± 0.028
1.265HisIle: 1.265 ± 0.031
0.708HisLys: 0.708 ± 0.023
2.386HisLeu: 2.386 ± 0.042
0.537HisMet: 0.537 ± 0.018
0.783HisAsn: 0.783 ± 0.023
1.452HisPro: 1.452 ± 0.028
1.253HisGln: 1.253 ± 0.033
1.536HisArg: 1.536 ± 0.036
1.254HisSer: 1.254 ± 0.031
1.033HisThr: 1.033 ± 0.03
1.311HisVal: 1.311 ± 0.031
0.402HisTrp: 0.402 ± 0.017
0.929HisTyr: 0.929 ± 0.026
0.0HisXaa: 0.0 ± 0.0
Ile
6.545IleAla: 6.545 ± 0.066
0.711IleCys: 0.711 ± 0.022
3.315IleAsp: 3.315 ± 0.05
2.948IleGlu: 2.948 ± 0.054
2.003IlePhe: 2.003 ± 0.043
4.461IleGly: 4.461 ± 0.062
1.159IleHis: 1.159 ± 0.028
3.151IleIle: 3.151 ± 0.053
2.089IleLys: 2.089 ± 0.042
4.744IleLeu: 4.744 ± 0.066
1.203IleMet: 1.203 ± 0.03
2.317IleAsn: 2.317 ± 0.04
2.718IlePro: 2.718 ± 0.048
1.656IleGln: 1.656 ± 0.033
2.929IleArg: 2.929 ± 0.047
3.55IleSer: 3.55 ± 0.048
3.169IleThr: 3.169 ± 0.048
3.862IleVal: 3.862 ± 0.05
0.675IleTrp: 0.675 ± 0.019
1.406IleTyr: 1.406 ± 0.04
0.0IleXaa: 0.0 ± 0.0
Lys
4.103LysAla: 4.103 ± 0.055
0.262LysCys: 0.262 ± 0.014
1.727LysAsp: 1.727 ± 0.036
2.118LysGlu: 2.118 ± 0.045
1.0LysPhe: 1.0 ± 0.028
2.73LysGly: 2.73 ± 0.04
0.787LysHis: 0.787 ± 0.021
2.14LysIle: 2.14 ± 0.039
1.982LysLys: 1.982 ± 0.043
3.812LysLeu: 3.812 ± 0.056
1.157LysMet: 1.157 ± 0.029
1.492LysAsn: 1.492 ± 0.038
1.989LysPro: 1.989 ± 0.038
1.772LysGln: 1.772 ± 0.043
2.573LysArg: 2.573 ± 0.041
2.21LysSer: 2.21 ± 0.035
2.384LysThr: 2.384 ± 0.041
2.755LysVal: 2.755 ± 0.041
0.412LysTrp: 0.412 ± 0.018
1.064LysTyr: 1.064 ± 0.031
0.0LysXaa: 0.0 ± 0.0
Leu
12.486LeuAla: 12.486 ± 0.116
1.359LeuCys: 1.359 ± 0.032
5.236LeuAsp: 5.236 ± 0.058
5.256LeuGlu: 5.256 ± 0.063
4.266LeuPhe: 4.266 ± 0.066
7.381LeuGly: 7.381 ± 0.083
2.273LeuHis: 2.273 ± 0.04
5.554LeuIle: 5.554 ± 0.066
4.199LeuLys: 4.199 ± 0.06
12.297LeuLeu: 12.297 ± 0.12
2.95LeuMet: 2.95 ± 0.048
3.918LeuAsn: 3.918 ± 0.048
6.039LeuPro: 6.039 ± 0.073
4.481LeuGln: 4.481 ± 0.062
6.734LeuArg: 6.734 ± 0.066
7.286LeuSer: 7.286 ± 0.084
6.302LeuThr: 6.302 ± 0.064
6.93LeuVal: 6.93 ± 0.07
1.543LeuTrp: 1.543 ± 0.033
2.585LeuTyr: 2.585 ± 0.048
0.0LeuXaa: 0.0 ± 0.0
Met
3.164MetAla: 3.164 ± 0.048
0.232MetCys: 0.232 ± 0.013
1.156MetAsp: 1.156 ± 0.026
1.203MetGlu: 1.203 ± 0.026
0.878MetPhe: 0.878 ± 0.022
1.888MetGly: 1.888 ± 0.04
0.544MetHis: 0.544 ± 0.018
1.461MetIle: 1.461 ± 0.032
1.379MetLys: 1.379 ± 0.03
3.211MetLeu: 3.211 ± 0.053
0.921MetMet: 0.921 ± 0.029
1.101MetAsn: 1.101 ± 0.029
1.485MetPro: 1.485 ± 0.028
1.203MetGln: 1.203 ± 0.025
1.678MetArg: 1.678 ± 0.03
1.881MetSer: 1.881 ± 0.038
1.865MetThr: 1.865 ± 0.034
2.053MetVal: 2.053 ± 0.039
0.286MetTrp: 0.286 ± 0.016
0.515MetTyr: 0.515 ± 0.02
0.0MetXaa: 0.0 ± 0.0
Asn
3.244AsnAla: 3.244 ± 0.051
0.322AsnCys: 0.322 ± 0.014
1.87AsnAsp: 1.87 ± 0.034
1.661AsnGlu: 1.661 ± 0.036
1.161AsnPhe: 1.161 ± 0.029
2.807AsnGly: 2.807 ± 0.044
0.805AsnHis: 0.805 ± 0.024
2.041AsnIle: 2.041 ± 0.035
1.358AsnLys: 1.358 ± 0.036
3.256AsnLeu: 3.256 ± 0.046
0.873AsnMet: 0.873 ± 0.023
1.318AsnAsn: 1.318 ± 0.033
2.024AsnPro: 2.024 ± 0.039
1.379AsnGln: 1.379 ± 0.026
2.016AsnArg: 2.016 ± 0.031
1.777AsnSer: 1.777 ± 0.039
1.732AsnThr: 1.732 ± 0.032
2.356AsnVal: 2.356 ± 0.04
0.511AsnTrp: 0.511 ± 0.018
1.044AsnTyr: 1.044 ± 0.03
0.0AsnXaa: 0.0 ± 0.0
Pro
5.488ProAla: 5.488 ± 0.078
0.475ProCys: 0.475 ± 0.017
2.777ProAsp: 2.777 ± 0.043
3.215ProGlu: 3.215 ± 0.045
1.883ProPhe: 1.883 ± 0.041
4.011ProGly: 4.011 ± 0.064
1.126ProHis: 1.126 ± 0.025
1.761ProIle: 1.761 ± 0.036
1.346ProLys: 1.346 ± 0.033
5.503ProLeu: 5.503 ± 0.074
1.164ProMet: 1.164 ± 0.027
1.174ProAsn: 1.174 ± 0.029
2.075ProPro: 2.075 ± 0.046
2.574ProGln: 2.574 ± 0.048
2.593ProArg: 2.593 ± 0.046
2.291ProSer: 2.291 ± 0.042
2.17ProThr: 2.17 ± 0.038
3.965ProVal: 3.965 ± 0.053
0.864ProTrp: 0.864 ± 0.023
1.178ProTyr: 1.178 ± 0.031
0.0ProXaa: 0.0 ± 0.0
Gln
4.998GlnAla: 4.998 ± 0.068
0.376GlnCys: 0.376 ± 0.016
1.709GlnAsp: 1.709 ± 0.032
2.069GlnGlu: 2.069 ± 0.042
1.424GlnPhe: 1.424 ± 0.034
3.152GlnGly: 3.152 ± 0.046
1.32GlnHis: 1.32 ± 0.03
2.119GlnIle: 2.119 ± 0.042
1.68GlnLys: 1.68 ± 0.033
5.025GlnLeu: 5.025 ± 0.071
1.223GlnMet: 1.223 ± 0.03
1.251GlnAsn: 1.251 ± 0.029
2.457GlnPro: 2.457 ± 0.045
3.454GlnGln: 3.454 ± 0.064
3.671GlnArg: 3.671 ± 0.049
2.23GlnSer: 2.23 ± 0.042
2.164GlnThr: 2.164 ± 0.038
2.936GlnVal: 2.936 ± 0.045
0.701GlnTrp: 0.701 ± 0.022
1.161GlnTyr: 1.161 ± 0.028
0.0GlnXaa: 0.0 ± 0.0
Arg
5.303ArgAla: 5.303 ± 0.057
0.854ArgCys: 0.854 ± 0.028
3.215ArgAsp: 3.215 ± 0.053
3.957ArgGlu: 3.957 ± 0.054
2.762ArgPhe: 2.762 ± 0.047
3.917ArgGly: 3.917 ± 0.061
1.982ArgHis: 1.982 ± 0.035
3.348ArgIle: 3.348 ± 0.049
2.37ArgLys: 2.37 ± 0.049
7.239ArgLeu: 7.239 ± 0.074
1.949ArgMet: 1.949 ± 0.038
1.929ArgAsn: 1.929 ± 0.035
2.868ArgPro: 2.868 ± 0.05
3.765ArgGln: 3.765 ± 0.063
5.028ArgArg: 5.028 ± 0.08
3.206ArgSer: 3.206 ± 0.048
2.652ArgThr: 2.652 ± 0.049
3.926ArgVal: 3.926 ± 0.048
1.233ArgTrp: 1.233 ± 0.029
2.347ArgTyr: 2.347 ± 0.038
0.0ArgXaa: 0.0 ± 0.0
Ser
6.112SerAla: 6.112 ± 0.071
0.635SerCys: 0.635 ± 0.024
3.067SerAsp: 3.067 ± 0.071
3.003SerGlu: 3.003 ± 0.047
2.189SerPhe: 2.189 ± 0.039
5.37SerGly: 5.37 ± 0.065
1.444SerHis: 1.444 ± 0.03
2.774SerIle: 2.774 ± 0.041
1.826SerLys: 1.826 ± 0.034
6.738SerLeu: 6.738 ± 0.073
1.522SerMet: 1.522 ± 0.038
1.658SerAsn: 1.658 ± 0.036
2.864SerPro: 2.864 ± 0.039
2.462SerGln: 2.462 ± 0.038
3.844SerArg: 3.844 ± 0.05
3.404SerSer: 3.404 ± 0.054
2.824SerThr: 2.824 ± 0.046
4.042SerVal: 4.042 ± 0.056
1.06SerTrp: 1.06 ± 0.028
1.51SerTyr: 1.51 ± 0.033
0.0SerXaa: 0.0 ± 0.0
Thr
5.183ThrAla: 5.183 ± 0.066
0.534ThrCys: 0.534 ± 0.022
2.389ThrAsp: 2.389 ± 0.044
2.367ThrGlu: 2.367 ± 0.043
1.929ThrPhe: 1.929 ± 0.037
4.428ThrGly: 4.428 ± 0.058
1.129ThrHis: 1.129 ± 0.025
2.741ThrIle: 2.741 ± 0.042
1.309ThrLys: 1.309 ± 0.033
7.341ThrLeu: 7.341 ± 0.08
1.152ThrMet: 1.152 ± 0.028
1.327ThrAsn: 1.327 ± 0.034
3.253ThrPro: 3.253 ± 0.049
1.895ThrGln: 1.895 ± 0.04
3.234ThrArg: 3.234 ± 0.047
2.781ThrSer: 2.781 ± 0.047
2.773ThrThr: 2.773 ± 0.048
3.779ThrVal: 3.779 ± 0.053
0.834ThrTrp: 0.834 ± 0.026
1.066ThrTyr: 1.066 ± 0.027
0.0ThrXaa: 0.0 ± 0.0
Val
7.339ValAla: 7.339 ± 0.073
0.801ValCys: 0.801 ± 0.026
3.619ValAsp: 3.619 ± 0.057
3.778ValGlu: 3.778 ± 0.048
2.506ValPhe: 2.506 ± 0.041
4.901ValGly: 4.901 ± 0.063
1.257ValHis: 1.257 ± 0.031
4.37ValIle: 4.37 ± 0.059
2.826ValLys: 2.826 ± 0.045
7.094ValLeu: 7.094 ± 0.077
2.27ValMet: 2.27 ± 0.038
2.579ValAsn: 2.579 ± 0.045
3.083ValPro: 3.083 ± 0.043
2.373ValGln: 2.373 ± 0.044
3.867ValArg: 3.867 ± 0.051
4.529ValSer: 4.529 ± 0.062
4.059ValThr: 4.059 ± 0.049
5.545ValVal: 5.545 ± 0.064
1.055ValTrp: 1.055 ± 0.026
1.753ValTyr: 1.753 ± 0.035
0.0ValXaa: 0.0 ± 0.0
Trp
1.064TrpAla: 1.064 ± 0.024
0.2TrpCys: 0.2 ± 0.011
0.642TrpAsp: 0.642 ± 0.024
0.582TrpGlu: 0.582 ± 0.019
0.728TrpPhe: 0.728 ± 0.026
1.0TrpGly: 1.0 ± 0.028
0.491TrpHis: 0.491 ± 0.02
0.727TrpIle: 0.727 ± 0.022
0.503TrpLys: 0.503 ± 0.017
2.49TrpLeu: 2.49 ± 0.038
0.5TrpMet: 0.5 ± 0.019
0.473TrpAsn: 0.473 ± 0.02
0.825TrpPro: 0.825 ± 0.025
1.251TrpGln: 1.251 ± 0.031
1.441TrpArg: 1.441 ± 0.036
0.9TrpSer: 0.9 ± 0.024
0.604TrpThr: 0.604 ± 0.021
0.944TrpVal: 0.944 ± 0.026
0.273TrpTrp: 0.273 ± 0.014
0.406TrpTyr: 0.406 ± 0.018
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.483TyrAla: 2.483 ± 0.046
0.361TyrCys: 0.361 ± 0.015
1.499TyrAsp: 1.499 ± 0.031
1.112TyrGlu: 1.112 ± 0.027
1.047TyrPhe: 1.047 ± 0.029
2.192TyrGly: 2.192 ± 0.04
0.763TyrHis: 0.763 ± 0.023
1.292TyrIle: 1.292 ± 0.027
0.814TyrLys: 0.814 ± 0.026
2.992TyrLeu: 2.992 ± 0.051
0.598TyrMet: 0.598 ± 0.021
0.825TyrAsn: 0.825 ± 0.026
1.427TyrPro: 1.427 ± 0.03
1.556TyrGln: 1.556 ± 0.034
2.09TyrArg: 2.09 ± 0.037
1.67TyrSer: 1.67 ± 0.036
1.353TyrThr: 1.353 ± 0.032
1.614TyrVal: 1.614 ± 0.033
0.436TyrTrp: 0.436 ± 0.018
0.899TyrTyr: 0.899 ± 0.026
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6726 proteins (1452714 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski