Amino acid dipeptide frequency for Sphingobacteriaceae bacterium

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
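A minimal sketch of how such frequencies could be computed is given here. It is not the Proteome-pI implementation: the function name `dipeptide_frequencies`, the use of one-letter codes, and the choice to count overlapping pairs within each protein only are assumptions made for illustration.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille over all sequences.

    Overlapping pairs are counted within each protein only (assumption);
    any non-standard letter is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        cleaned = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # zip pairs each residue with its successor: positions (0,1), (1,2), ...
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Two toy sequences: 5 + 3 = 8 dipeptide positions, "MK" occurs twice.
freqs = dipeptide_frequencies(["MKKLLA", "MKLA"])
print(freqs["MK"])  # 250.0
```

Pairs never span two proteins here, which is why the counting is done per sequence before the totals are combined.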

There are 400 possible dipeptides (441 if we count Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
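The uniform expectation and the over/under comparison can be sketched as follows. The helper names `uniform_expectation` and `classify` are hypothetical, and the exact threshold used for the colouring on the original page is not stated, so a plain comparison against the uniform value is assumed.

```python
def uniform_expectation(n_amino_acids=20):
    """Expected per-mille frequency if all ordered pairs were equally likely."""
    return 1000.0 / (n_amino_acids ** 2)

def classify(observed_permille, expected_permille):
    """Label a dipeptide relative to the expectation (assumed rule)."""
    if observed_permille > expected_permille:
        return "over"   # would be marked red
    if observed_permille < expected_permille:
        return "under"  # would be marked blue
    return "as expected"

expected = uniform_expectation()           # 1000 / 400 = 2.5 per mille
print(classify(8.939, expected))           # LeuLeu from the table: over
print(classify(0.14, expected))            # CysCys from the table: under
```

With Xaa included, `uniform_expectation(21)` gives 1000 / 441, roughly 2.27‰.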

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.057 ± 0.106
AlaCys: 0.805 ± 0.025
AlaAsp: 3.305 ± 0.064
AlaGlu: 3.781 ± 0.073
AlaPhe: 3.407 ± 0.043
AlaGly: 4.998 ± 0.076
AlaHis: 1.034 ± 0.024
AlaIle: 4.965 ± 0.062
AlaLys: 4.606 ± 0.101
AlaLeu: 6.623 ± 0.075
AlaMet: 1.466 ± 0.028
AlaAsn: 3.909 ± 0.06
AlaPro: 2.234 ± 0.046
AlaGln: 2.403 ± 0.037
AlaArg: 2.094 ± 0.038
AlaSer: 5.231 ± 0.092
AlaThr: 4.489 ± 0.096
AlaVal: 4.315 ± 0.055
AlaTrp: 0.625 ± 0.022
AlaTyr: 2.533 ± 0.038
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.82 ± 0.029
CysCys: 0.14 ± 0.009
CysAsp: 0.489 ± 0.017
CysGlu: 0.549 ± 0.018
CysPhe: 0.615 ± 0.023
CysGly: 0.75 ± 0.023
CysHis: 0.207 ± 0.012
CysIle: 0.746 ± 0.025
CysLys: 0.678 ± 0.022
CysLeu: 0.967 ± 0.027
CysMet: 0.183 ± 0.009
CysAsn: 0.597 ± 0.024
CysPro: 0.464 ± 0.023
CysGln: 0.266 ± 0.013
CysArg: 0.311 ± 0.015
CysSer: 0.959 ± 0.037
CysThr: 0.715 ± 0.029
CysVal: 0.714 ± 0.026
CysTrp: 0.096 ± 0.008
CysTyr: 0.445 ± 0.016
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.384 ± 0.062
AspCys: 0.474 ± 0.015
AspAsp: 2.118 ± 0.038
AspGlu: 2.913 ± 0.048
AspPhe: 3.122 ± 0.051
AspGly: 3.172 ± 0.051
AspHis: 0.81 ± 0.023
AspIle: 3.621 ± 0.048
AspLys: 3.882 ± 0.053
AspLeu: 4.972 ± 0.064
AspMet: 1.052 ± 0.024
AspAsn: 2.542 ± 0.043
AspPro: 1.662 ± 0.038
AspGln: 1.339 ± 0.031
AspArg: 1.518 ± 0.035
AspSer: 3.26 ± 0.049
AspThr: 2.555 ± 0.048
AspVal: 2.931 ± 0.045
AspTrp: 0.522 ± 0.017
AspTyr: 2.431 ± 0.039
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.704 ± 0.064
GluCys: 0.386 ± 0.016
GluAsp: 2.64 ± 0.051
GluGlu: 3.926 ± 0.072
GluPhe: 2.737 ± 0.047
GluGly: 3.158 ± 0.046
GluHis: 1.021 ± 0.027
GluIle: 4.665 ± 0.065
GluLys: 5.706 ± 0.09
GluLeu: 5.731 ± 0.078
GluMet: 1.329 ± 0.028
GluAsn: 3.586 ± 0.052
GluPro: 1.413 ± 0.028
GluGln: 1.996 ± 0.041
GluArg: 2.197 ± 0.042
GluSer: 3.051 ± 0.048
GluThr: 3.091 ± 0.046
GluVal: 3.589 ± 0.052
GluTrp: 0.559 ± 0.018
GluTyr: 1.996 ± 0.037
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.131 ± 0.045
PheCys: 0.593 ± 0.018
PheAsp: 2.802 ± 0.046
PheGlu: 2.966 ± 0.05
PhePhe: 3.019 ± 0.069
PheGly: 3.391 ± 0.048
PheHis: 0.838 ± 0.024
PheIle: 3.892 ± 0.062
PheLys: 3.863 ± 0.062
PheLeu: 4.831 ± 0.08
PheMet: 1.113 ± 0.024
PheAsn: 3.21 ± 0.049
PhePro: 1.799 ± 0.03
PheGln: 1.391 ± 0.029
PheArg: 1.684 ± 0.035
PheSer: 4.483 ± 0.063
PheThr: 3.668 ± 0.055
PheVal: 3.169 ± 0.051
PheTrp: 0.535 ± 0.021
PheTyr: 2.473 ± 0.042
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.582 ± 0.076
GlyCys: 1.022 ± 0.042
GlyAsp: 2.845 ± 0.05
GlyGlu: 3.054 ± 0.043
GlyPhe: 3.568 ± 0.049
GlyGly: 4.949 ± 0.12
GlyHis: 1.083 ± 0.025
GlyIle: 5.084 ± 0.069
GlyLys: 4.941 ± 0.066
GlyLeu: 5.797 ± 0.068
GlyMet: 1.491 ± 0.031
GlyAsn: 3.81 ± 0.065
GlyPro: 1.63 ± 0.043
GlyGln: 1.984 ± 0.036
GlyArg: 1.945 ± 0.038
GlySer: 4.973 ± 0.087
GlyThr: 4.854 ± 0.102
GlyVal: 4.177 ± 0.052
GlyTrp: 0.711 ± 0.022
GlyTyr: 2.728 ± 0.048
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.068 ± 0.026
HisCys: 0.223 ± 0.012
HisAsp: 0.779 ± 0.023
HisGlu: 0.967 ± 0.031
HisPhe: 1.146 ± 0.026
HisGly: 1.016 ± 0.026
HisHis: 0.437 ± 0.017
HisIle: 1.198 ± 0.028
HisLys: 1.123 ± 0.027
HisLeu: 1.774 ± 0.036
HisMet: 0.356 ± 0.014
HisAsn: 0.815 ± 0.024
HisPro: 0.863 ± 0.022
HisGln: 0.596 ± 0.022
HisArg: 0.593 ± 0.018
HisSer: 1.182 ± 0.029
HisThr: 0.968 ± 0.026
HisVal: 0.985 ± 0.027
HisTrp: 0.208 ± 0.011
HisTyr: 0.922 ± 0.024
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.059 ± 0.053
IleCys: 0.99 ± 0.032
IleAsp: 3.846 ± 0.055
IleGlu: 4.29 ± 0.06
IlePhe: 3.492 ± 0.058
IleGly: 4.517 ± 0.061
IleHis: 1.376 ± 0.031
IleIle: 5.091 ± 0.069
IleLys: 5.585 ± 0.069
IleLeu: 6.497 ± 0.088
IleMet: 1.292 ± 0.031
IleAsn: 4.366 ± 0.064
IlePro: 2.849 ± 0.046
IleGln: 2.236 ± 0.036
IleArg: 2.464 ± 0.046
IleSer: 5.964 ± 0.063
IleThr: 5.18 ± 0.076
IleVal: 4.319 ± 0.055
IleTrp: 0.615 ± 0.021
IleTyr: 3.047 ± 0.048
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.743 ± 0.104
LysCys: 0.455 ± 0.015
LysAsp: 4.23 ± 0.059
LysGlu: 5.736 ± 0.083
LysPhe: 3.082 ± 0.05
LysGly: 4.412 ± 0.068
LysHis: 1.467 ± 0.034
LysIle: 6.116 ± 0.076
LysLys: 7.016 ± 0.097
LysLeu: 6.661 ± 0.089
LysMet: 1.969 ± 0.037
LysAsn: 5.218 ± 0.07
LysPro: 2.495 ± 0.037
LysGln: 2.788 ± 0.053
LysArg: 2.696 ± 0.041
LysSer: 4.463 ± 0.065
LysThr: 4.977 ± 0.058
LysVal: 4.368 ± 0.054
LysTrp: 0.739 ± 0.022
LysTyr: 2.778 ± 0.042
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.103 ± 0.069
LeuCys: 1.033 ± 0.026
LeuAsp: 4.313 ± 0.064
LeuGlu: 4.997 ± 0.07
LeuPhe: 5.127 ± 0.088
LeuGly: 5.351 ± 0.06
LeuHis: 1.627 ± 0.034
LeuIle: 6.878 ± 0.083
LeuLys: 7.973 ± 0.098
LeuLeu: 8.939 ± 0.117
LeuMet: 2.023 ± 0.036
LeuAsn: 6.056 ± 0.072
LeuPro: 3.659 ± 0.053
LeuGln: 3.159 ± 0.049
LeuArg: 3.287 ± 0.054
LeuSer: 7.562 ± 0.069
LeuThr: 5.753 ± 0.072
LeuVal: 5.721 ± 0.068
LeuTrp: 0.863 ± 0.027
LeuTyr: 3.302 ± 0.049
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.553 ± 0.03
MetCys: 0.18 ± 0.01
MetAsp: 1.141 ± 0.031
MetGlu: 1.275 ± 0.029
MetPhe: 0.848 ± 0.025
MetGly: 1.342 ± 0.025
MetHis: 0.414 ± 0.016
MetIle: 1.429 ± 0.029
MetLys: 1.962 ± 0.037
MetLeu: 1.783 ± 0.035
MetMet: 0.535 ± 0.019
MetAsn: 1.307 ± 0.029
MetPro: 0.895 ± 0.024
MetGln: 0.842 ± 0.024
MetArg: 0.887 ± 0.019
MetSer: 1.478 ± 0.031
MetThr: 1.175 ± 0.03
MetVal: 1.255 ± 0.027
MetTrp: 0.183 ± 0.009
MetTyr: 0.644 ± 0.021
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.266 ± 0.06
AsnCys: 0.673 ± 0.023
AsnAsp: 2.7 ± 0.045
AsnGlu: 3.106 ± 0.047
AsnPhe: 3.212 ± 0.042
AsnGly: 4.508 ± 0.079
AsnHis: 0.917 ± 0.024
AsnIle: 4.053 ± 0.062
AsnLys: 4.227 ± 0.063
AsnLeu: 5.398 ± 0.066
AsnMet: 1.238 ± 0.027
AsnAsn: 3.482 ± 0.075
AsnPro: 2.975 ± 0.056
AsnGln: 1.871 ± 0.041
AsnArg: 1.919 ± 0.043
AsnSer: 4.608 ± 0.076
AsnThr: 4.108 ± 0.084
AsnVal: 3.497 ± 0.052
AsnTrp: 0.681 ± 0.021
AsnTyr: 3.19 ± 0.053
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.799 ± 0.053
ProCys: 0.321 ± 0.015
ProAsp: 2.025 ± 0.038
ProGlu: 2.324 ± 0.038
ProPhe: 1.908 ± 0.037
ProGly: 2.593 ± 0.049
ProHis: 0.581 ± 0.017
ProIle: 2.174 ± 0.042
ProLys: 2.18 ± 0.041
ProLeu: 3.239 ± 0.047
ProMet: 0.661 ± 0.021
ProAsn: 2.162 ± 0.044
ProPro: 1.05 ± 0.036
ProGln: 1.148 ± 0.032
ProArg: 0.854 ± 0.024
ProSer: 2.636 ± 0.055
ProThr: 2.38 ± 0.059
ProVal: 2.939 ± 0.048
ProTrp: 0.286 ± 0.016
ProTyr: 1.363 ± 0.028
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.101 ± 0.036
GlnCys: 0.23 ± 0.013
GlnAsp: 1.364 ± 0.032
GlnGlu: 1.784 ± 0.036
GlnPhe: 1.584 ± 0.035
GlnGly: 1.773 ± 0.034
GlnHis: 0.604 ± 0.018
GlnIle: 2.333 ± 0.038
GlnLys: 2.773 ± 0.049
GlnLeu: 3.226 ± 0.043
GlnMet: 0.765 ± 0.021
GlnAsn: 2.202 ± 0.038
GlnPro: 1.109 ± 0.031
GlnGln: 1.299 ± 0.033
GlnArg: 1.177 ± 0.025
GlnSer: 2.253 ± 0.044
GlnThr: 2.172 ± 0.037
GlnVal: 2.028 ± 0.038
GlnTrp: 0.349 ± 0.017
GlnTyr: 1.125 ± 0.028
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.98 ± 0.036
ArgCys: 0.265 ± 0.013
ArgAsp: 1.639 ± 0.033
ArgGlu: 2.018 ± 0.036
ArgPhe: 1.885 ± 0.039
ArgGly: 1.736 ± 0.03
ArgHis: 0.579 ± 0.019
ArgIle: 2.704 ± 0.041
ArgLys: 2.683 ± 0.048
ArgLeu: 3.259 ± 0.053
ArgMet: 0.9 ± 0.021
ArgAsn: 1.924 ± 0.035
ArgPro: 1.051 ± 0.029
ArgGln: 1.088 ± 0.029
ArgArg: 1.304 ± 0.03
ArgSer: 2.061 ± 0.037
ArgThr: 1.863 ± 0.036
ArgVal: 1.949 ± 0.036
ArgTrp: 0.378 ± 0.016
ArgTyr: 1.489 ± 0.036
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.451 ± 0.088
SerCys: 0.854 ± 0.033
SerAsp: 3.44 ± 0.056
SerGlu: 3.877 ± 0.059
SerPhe: 4.188 ± 0.054
SerGly: 5.892 ± 0.099
SerHis: 1.162 ± 0.028
SerIle: 5.182 ± 0.065
SerLys: 4.726 ± 0.058
SerLeu: 6.994 ± 0.063
SerMet: 1.366 ± 0.027
SerAsn: 4.134 ± 0.075
SerPro: 2.796 ± 0.06
SerGln: 2.326 ± 0.037
SerArg: 2.088 ± 0.041
SerSer: 6.016 ± 0.107
SerThr: 4.913 ± 0.097
SerVal: 5.205 ± 0.076
SerTrp: 0.854 ± 0.026
SerTyr: 3.16 ± 0.046
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.743 ± 0.092
ThrCys: 0.66 ± 0.026
ThrAsp: 3.022 ± 0.045
ThrGlu: 3.222 ± 0.051
ThrPhe: 3.215 ± 0.051
ThrGly: 5.038 ± 0.104
ThrHis: 1.064 ± 0.027
ThrIle: 4.787 ± 0.075
ThrLys: 3.928 ± 0.046
ThrLeu: 6.493 ± 0.079
ThrMet: 0.992 ± 0.027
ThrAsn: 3.915 ± 0.071
ThrPro: 2.697 ± 0.058
ThrGln: 1.969 ± 0.038
ThrArg: 1.812 ± 0.035
ThrSer: 4.986 ± 0.098
ThrThr: 4.603 ± 0.107
ThrVal: 5.018 ± 0.111
ThrTrp: 0.792 ± 0.032
ThrTyr: 2.982 ± 0.059
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.147 ± 0.057
ValCys: 0.826 ± 0.028
ValAsp: 2.959 ± 0.05
ValGlu: 3.241 ± 0.053
ValPhe: 3.484 ± 0.057
ValGly: 3.543 ± 0.059
ValHis: 0.978 ± 0.026
ValIle: 4.923 ± 0.053
ValLys: 4.689 ± 0.062
ValLeu: 5.804 ± 0.072
ValMet: 1.347 ± 0.03
ValAsn: 4.145 ± 0.06
ValPro: 2.207 ± 0.032
ValGln: 1.724 ± 0.036
ValArg: 1.989 ± 0.035
ValSer: 5.249 ± 0.074
ValThr: 4.427 ± 0.096
ValVal: 4.24 ± 0.059
ValTrp: 0.625 ± 0.019
ValTyr: 2.666 ± 0.04
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.58 ± 0.02
TrpCys: 0.115 ± 0.008
TrpAsp: 0.521 ± 0.018
TrpGlu: 0.506 ± 0.016
TrpPhe: 0.554 ± 0.021
TrpGly: 0.61 ± 0.021
TrpHis: 0.246 ± 0.013
TrpIle: 0.741 ± 0.021
TrpLys: 0.752 ± 0.022
TrpLeu: 0.976 ± 0.027
TrpMet: 0.287 ± 0.013
TrpAsn: 0.692 ± 0.022
TrpPro: 0.283 ± 0.013
TrpGln: 0.449 ± 0.017
TrpArg: 0.373 ± 0.016
TrpSer: 0.757 ± 0.026
TrpThr: 0.69 ± 0.031
TrpVal: 0.55 ± 0.018
TrpTrp: 0.128 ± 0.009
TrpTyr: 0.421 ± 0.016
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.609 ± 0.042
TyrCys: 0.465 ± 0.017
TyrAsp: 2.104 ± 0.037
TyrGlu: 1.946 ± 0.039
TyrPhe: 2.658 ± 0.043
TyrGly: 2.478 ± 0.048
TyrHis: 0.771 ± 0.022
TyrIle: 2.497 ± 0.038
TyrLys: 3.115 ± 0.049
TyrLeu: 3.76 ± 0.056
TyrMet: 0.764 ± 0.022
TyrAsn: 2.555 ± 0.046
TyrPro: 1.494 ± 0.036
TyrGln: 1.38 ± 0.03
TyrArg: 1.566 ± 0.032
TyrSer: 3.448 ± 0.053
TyrThr: 3.356 ± 0.075
TyrVal: 2.245 ± 0.036
TyrTrp: 0.482 ± 0.018
TyrTyr: 2.05 ± 0.049
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4775 proteins (1684237 amino acids)
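As a worked example of the per mille scale, the totals in the line above can be used to turn a table value into an approximate dipeptide count. This is only a sketch: it assumes that dipeptides are counted as overlapping pairs within each protein, so that a protein of length n contributes n − 1 positions, and the helper names are hypothetical.

```python
N_PROTEINS = 4775
N_AMINO_ACIDS = 1_684_237

def dipeptide_positions(n_amino_acids, n_proteins):
    """Number of adjacent pairs if each protein of length n gives n - 1 pairs."""
    return n_amino_acids - n_proteins

def permille_to_count(permille, positions):
    """Convert a per-mille frequency into an approximate absolute count."""
    return permille * 1e-3 * positions

positions = dipeptide_positions(N_AMINO_ACIDS, N_PROTEINS)  # 1679462
leuleu = permille_to_count(8.939, positions)                # roughly 15 thousand
print(positions, round(leuleu))
```

Under this assumption the most frequent dipeptide, LeuLeu, corresponds to about fifteen thousand occurrences in the proteome.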

Note: The error was estimated by bootstrapping (×100) at the protein level.
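Bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates serves as the error. The function `bootstrap_dipeptide` is hypothetical, and reporting the standard deviation of the resamples as the ± value is an assumption; the source does not say which statistic is shown.

```python
import random
from collections import Counter
from statistics import mean, stdev

def pair_counts(seq):
    """Count overlapping adjacent pairs in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))

def bootstrap_dipeptide(sequences, pair, n_resamples=100, seed=0):
    """Mean and standard deviation (per mille) of one dipeptide's frequency
    over bootstrap resamples of whole proteins."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return mean(estimates), stdev(estimates)

m, s = bootstrap_dipeptide(["MKKLLA", "MKLA", "AKKA"], "KK")
print(f"KK: {m:.1f} ± {s:.1f} per mille")
```

Resampling proteins rather than individual residues keeps the pairs inside each protein together, so correlations within a sequence are preserved in every resample.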

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski