Amino acid dipeptide frequency for Acidobacteria bacterium AB60

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
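
As a rough illustration of the calculation, the sketch below counts overlapping residue pairs in a set of protein sequences and reports each dipeptide's share in per mille. The function name `dipeptide_frequencies` and the toy sequences are hypothetical, and mapping every non-standard letter to X (Xaa) is an assumption, not necessarily how Proteome-pI treats such residues.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # one-letter codes of the 20 standard amino acids


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard letters is treated as X (Xaa).
        cleaned = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Pairs never span two proteins: a sequence of length n yields n - 1 pairs.
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real proteome data.
freqs = dipeptide_frequencies(["MAAL", "MAL"])
```

Here the two toy sequences contribute five pairs in total, so the frequencies of all observed dipeptides sum to 1000‰.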

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each one would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
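
The comparison against the uniform expectation can be sketched as follows. The helper names are hypothetical, the three observed values are copied from the table, and using a strict greater-than or less-than test against 1/400 is an assumption about how the red and blue marking is decided.

```python
def expected_permille(n_residues=20):
    """Expected frequency of each dipeptide, in per mille, if all were equally likely."""
    return 1000.0 / (n_residues ** 2)


def classify(freq_permille, n_residues=20):
    """Label a dipeptide relative to the uniform expectation."""
    expected = expected_permille(n_residues)
    if freq_permille > expected:
        return "over"   # marked in red in the table
    if freq_permille < expected:
        return "under"  # marked in blue in the table
    return "expected"


# Values taken from the table (per mille).
observed = {"AlaAla": 14.727, "LysVal": 2.366, "CysCys": 0.133}
labels = {pair: classify(value) for pair, value in observed.items()}
```

With 20 residues the baseline is 2.5‰; passing `n_residues=21` gives the slightly lower baseline that applies when Xaa is counted as well.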

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
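
The unit conversion is plain arithmetic, sketched here with hypothetical helper names; the AlaAla value is taken from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 14.727  # AlaAla frequency from the table, in per mille
fraction = permille_to_fraction(ala_ala)
percent = permille_to_percent(ala_ala)
```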

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 14.727 ± 0.122
AlaCys: 1.109 ± 0.026
AlaAsp: 5.266 ± 0.055
AlaGlu: 6.358 ± 0.093
AlaPhe: 3.937 ± 0.044
AlaGly: 9.818 ± 0.094
AlaHis: 2.392 ± 0.04
AlaIle: 5.637 ± 0.059
AlaLys: 3.589 ± 0.053
AlaLeu: 11.494 ± 0.104
AlaMet: 2.732 ± 0.047
AlaAsn: 3.47 ± 0.059
AlaPro: 5.727 ± 0.074
AlaGln: 4.664 ± 0.056
AlaArg: 7.282 ± 0.086
AlaSer: 6.832 ± 0.074
AlaThr: 6.154 ± 0.08
AlaVal: 8.136 ± 0.076
AlaTrp: 1.58 ± 0.031
AlaTyr: 2.527 ± 0.041
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.067 ± 0.028
CysCys: 0.133 ± 0.009
CysAsp: 0.471 ± 0.015
CysGlu: 0.401 ± 0.017
CysPhe: 0.341 ± 0.012
CysGly: 0.992 ± 0.026
CysHis: 0.324 ± 0.031
CysIle: 0.362 ± 0.015
CysLys: 0.228 ± 0.011
CysLeu: 0.817 ± 0.021
CysMet: 0.189 ± 0.01
CysAsn: 0.289 ± 0.014
CysPro: 0.502 ± 0.017
CysGln: 0.272 ± 0.012
CysArg: 0.571 ± 0.019
CysSer: 0.653 ± 0.019
CysThr: 0.557 ± 0.022
CysVal: 0.655 ± 0.021
CysTrp: 0.141 ± 0.01
CysTyr: 0.266 ± 0.013
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.607 ± 0.058
AspCys: 0.475 ± 0.018
AspAsp: 2.333 ± 0.042
AspGlu: 2.888 ± 0.047
AspPhe: 2.136 ± 0.037
AspGly: 4.466 ± 0.064
AspHis: 1.214 ± 0.027
AspIle: 2.119 ± 0.035
AspLys: 1.549 ± 0.033
AspLeu: 5.26 ± 0.065
AspMet: 0.899 ± 0.023
AspAsn: 1.331 ± 0.027
AspPro: 3.677 ± 0.043
AspGln: 1.974 ± 0.033
AspArg: 3.411 ± 0.049
AspSer: 2.691 ± 0.041
AspThr: 2.459 ± 0.036
AspVal: 3.504 ± 0.047
AspTrp: 0.917 ± 0.022
AspTyr: 1.516 ± 0.03
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.958 ± 0.078
GluCys: 0.41 ± 0.016
GluAsp: 2.515 ± 0.041
GluGlu: 3.344 ± 0.062
GluPhe: 2.056 ± 0.039
GluGly: 3.822 ± 0.055
GluHis: 1.365 ± 0.029
GluIle: 3.115 ± 0.047
GluLys: 2.308 ± 0.044
GluLeu: 5.413 ± 0.078
GluMet: 1.46 ± 0.03
GluAsn: 1.663 ± 0.035
GluPro: 2.568 ± 0.043
GluGln: 2.437 ± 0.043
GluArg: 4.274 ± 0.068
GluSer: 3.149 ± 0.042
GluThr: 2.937 ± 0.041
GluVal: 3.662 ± 0.052
GluTrp: 0.782 ± 0.023
GluTyr: 1.369 ± 0.029
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.348 ± 0.049
PheCys: 0.398 ± 0.013
PheAsp: 2.385 ± 0.038
PheGlu: 2.05 ± 0.036
PhePhe: 1.61 ± 0.029
PheGly: 3.415 ± 0.046
PheHis: 0.98 ± 0.024
PheIle: 1.313 ± 0.03
PheLys: 1.035 ± 0.024
PheLeu: 3.888 ± 0.056
PheMet: 0.635 ± 0.019
PheAsn: 1.4 ± 0.034
PhePro: 1.898 ± 0.032
PheGln: 1.318 ± 0.027
PheArg: 2.38 ± 0.041
PheSer: 2.626 ± 0.04
PheThr: 2.418 ± 0.044
PheVal: 2.67 ± 0.039
PheTrp: 0.569 ± 0.019
PheTyr: 1.106 ± 0.023
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.329 ± 0.088
GlyCys: 0.879 ± 0.023
GlyAsp: 4.075 ± 0.057
GlyGlu: 4.14 ± 0.05
GlyPhe: 3.533 ± 0.047
GlyGly: 6.727 ± 0.113
GlyHis: 1.817 ± 0.032
GlyIle: 4.419 ± 0.049
GlyLys: 3.514 ± 0.048
GlyLeu: 7.65 ± 0.062
GlyMet: 2.069 ± 0.037
GlyAsn: 2.883 ± 0.063
GlyPro: 3.589 ± 0.053
GlyGln: 3.017 ± 0.042
GlyArg: 5.017 ± 0.061
GlySer: 5.536 ± 0.075
GlyThr: 5.222 ± 0.08
GlyVal: 6.036 ± 0.065
GlyTrp: 1.48 ± 0.03
GlyTyr: 2.465 ± 0.044
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.418 ± 0.041
HisCys: 0.224 ± 0.012
HisAsp: 1.198 ± 0.026
HisGlu: 1.173 ± 0.028
HisPhe: 1.002 ± 0.025
HisGly: 2.025 ± 0.036
HisHis: 0.608 ± 0.017
HisIle: 1.03 ± 0.028
HisLys: 0.591 ± 0.015
HisLeu: 2.328 ± 0.04
HisMet: 0.471 ± 0.018
HisAsn: 0.698 ± 0.023
HisPro: 1.561 ± 0.029
HisGln: 0.792 ± 0.017
HisArg: 1.502 ± 0.03
HisSer: 1.223 ± 0.028
HisThr: 1.143 ± 0.028
HisVal: 1.634 ± 0.031
HisTrp: 0.422 ± 0.015
HisTyr: 0.697 ± 0.019
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.215 ± 0.06
IleCys: 0.452 ± 0.015
IleAsp: 2.87 ± 0.04
IleGlu: 2.975 ± 0.052
IlePhe: 1.733 ± 0.03
IleGly: 3.953 ± 0.05
IleHis: 1.124 ± 0.024
IleIle: 1.593 ± 0.032
IleLys: 1.291 ± 0.027
IleLeu: 4.497 ± 0.058
IleMet: 0.661 ± 0.017
IleAsn: 1.474 ± 0.029
IlePro: 2.812 ± 0.042
IleGln: 1.652 ± 0.036
IleArg: 3.012 ± 0.041
IleSer: 3.008 ± 0.048
IleThr: 2.902 ± 0.045
IleVal: 3.517 ± 0.044
IleTrp: 0.555 ± 0.018
IleTyr: 1.251 ± 0.027
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.508 ± 0.053
LysCys: 0.214 ± 0.011
LysAsp: 1.699 ± 0.039
LysGlu: 1.766 ± 0.04
LysPhe: 1.043 ± 0.023
LysGly: 2.428 ± 0.035
LysHis: 0.751 ± 0.021
LysIle: 1.691 ± 0.031
LysLys: 1.435 ± 0.037
LysLeu: 3.399 ± 0.049
LysMet: 0.87 ± 0.023
LysAsn: 1.113 ± 0.026
LysPro: 2.159 ± 0.039
LysGln: 1.332 ± 0.027
LysArg: 2.11 ± 0.042
LysSer: 1.968 ± 0.033
LysThr: 2.044 ± 0.036
LysVal: 2.366 ± 0.041
LysTrp: 0.42 ± 0.017
LysTyr: 0.842 ± 0.023
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.966 ± 0.106
LeuCys: 0.937 ± 0.022
LeuAsp: 5.243 ± 0.064
LeuGlu: 5.537 ± 0.08
LeuPhe: 3.618 ± 0.051
LeuGly: 7.754 ± 0.071
LeuHis: 2.274 ± 0.039
LeuIle: 4.391 ± 0.052
LeuLys: 3.533 ± 0.048
LeuLeu: 10.439 ± 0.103
LeuMet: 1.924 ± 0.038
LeuAsn: 3.266 ± 0.049
LeuPro: 5.686 ± 0.059
LeuGln: 3.505 ± 0.048
LeuArg: 7.339 ± 0.088
LeuSer: 6.213 ± 0.057
LeuThr: 5.87 ± 0.066
LeuVal: 6.911 ± 0.065
LeuTrp: 1.307 ± 0.033
LeuTyr: 2.427 ± 0.032
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.51 ± 0.039
MetCys: 0.161 ± 0.009
MetAsp: 1.007 ± 0.024
MetGlu: 1.118 ± 0.027
MetPhe: 0.622 ± 0.022
MetGly: 1.659 ± 0.03
MetHis: 0.513 ± 0.016
MetIle: 1.01 ± 0.024
MetLys: 0.95 ± 0.02
MetLeu: 2.101 ± 0.043
MetMet: 0.544 ± 0.017
MetAsn: 0.814 ± 0.021
MetPro: 1.405 ± 0.027
MetGln: 0.917 ± 0.023
MetArg: 1.729 ± 0.037
MetSer: 1.345 ± 0.028
MetThr: 1.394 ± 0.029
MetVal: 1.461 ± 0.033
MetTrp: 0.219 ± 0.013
MetTyr: 0.389 ± 0.014
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.47 ± 0.056
AsnCys: 0.34 ± 0.015
AsnAsp: 1.535 ± 0.034
AsnGlu: 1.493 ± 0.03
AsnPhe: 1.294 ± 0.032
AsnGly: 3.19 ± 0.069
AsnHis: 0.708 ± 0.021
AsnIle: 1.436 ± 0.034
AsnLys: 0.805 ± 0.022
AsnLeu: 3.347 ± 0.05
AsnMet: 0.563 ± 0.015
AsnAsn: 1.155 ± 0.04
AsnPro: 2.475 ± 0.045
AsnGln: 1.269 ± 0.027
AsnArg: 1.977 ± 0.03
AsnSer: 1.983 ± 0.047
AsnThr: 1.822 ± 0.049
AsnVal: 2.258 ± 0.036
AsnTrp: 0.551 ± 0.02
AsnTyr: 1.037 ± 0.034
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.095 ± 0.073
ProCys: 0.408 ± 0.015
ProAsp: 3.326 ± 0.043
ProGlu: 3.697 ± 0.057
ProPhe: 1.985 ± 0.03
ProGly: 5.262 ± 0.067
ProHis: 1.197 ± 0.024
ProIle: 2.358 ± 0.04
ProLys: 1.661 ± 0.033
ProLeu: 4.932 ± 0.061
ProMet: 1.123 ± 0.024
ProAsn: 1.896 ± 0.041
ProPro: 3.11 ± 0.054
ProGln: 2.265 ± 0.044
ProArg: 3.019 ± 0.042
ProSer: 3.502 ± 0.049
ProThr: 2.821 ± 0.057
ProVal: 4.301 ± 0.043
ProTrp: 0.748 ± 0.019
ProTyr: 1.358 ± 0.028
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.248 ± 0.054
GlnCys: 0.293 ± 0.016
GlnAsp: 1.575 ± 0.03
GlnGlu: 1.902 ± 0.036
GlnPhe: 1.468 ± 0.028
GlnGly: 2.694 ± 0.044
GlnHis: 0.87 ± 0.023
GlnIle: 2.102 ± 0.034
GlnLys: 1.357 ± 0.029
GlnLeu: 3.67 ± 0.048
GlnMet: 1.059 ± 0.023
GlnAsn: 1.286 ± 0.032
GlnPro: 2.258 ± 0.035
GlnGln: 1.966 ± 0.047
GlnArg: 2.626 ± 0.038
GlnSer: 2.411 ± 0.039
GlnThr: 2.212 ± 0.04
GlnVal: 2.76 ± 0.042
GlnTrp: 0.538 ± 0.017
GlnTyr: 0.971 ± 0.024
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.633 ± 0.086
ArgCys: 0.537 ± 0.019
ArgAsp: 3.346 ± 0.049
ArgGlu: 4.138 ± 0.065
ArgPhe: 2.711 ± 0.04
ArgGly: 4.336 ± 0.055
ArgHis: 1.482 ± 0.029
ArgIle: 3.763 ± 0.055
ArgLys: 2.385 ± 0.05
ArgLeu: 6.867 ± 0.086
ArgMet: 1.875 ± 0.034
ArgAsn: 2.143 ± 0.037
ArgPro: 3.165 ± 0.043
ArgGln: 2.558 ± 0.041
ArgArg: 5.067 ± 0.064
ArgSer: 3.943 ± 0.052
ArgThr: 3.535 ± 0.044
ArgVal: 4.726 ± 0.064
ArgTrp: 1.113 ± 0.028
ArgTyr: 1.909 ± 0.037
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.93 ± 0.076
SerCys: 0.628 ± 0.024
SerAsp: 2.961 ± 0.038
SerGlu: 2.846 ± 0.05
SerPhe: 2.522 ± 0.039
SerGly: 5.978 ± 0.086
SerHis: 1.304 ± 0.027
SerIle: 3.079 ± 0.04
SerLys: 1.855 ± 0.033
SerLeu: 6.283 ± 0.056
SerMet: 1.336 ± 0.03
SerAsn: 1.99 ± 0.046
SerPro: 3.611 ± 0.052
SerGln: 2.211 ± 0.036
SerArg: 3.831 ± 0.052
SerSer: 4.374 ± 0.075
SerThr: 3.546 ± 0.067
SerVal: 4.387 ± 0.056
SerTrp: 0.929 ± 0.024
SerTyr: 1.654 ± 0.035
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.486 ± 0.081
ThrCys: 0.526 ± 0.021
ThrAsp: 2.647 ± 0.041
ThrGlu: 2.653 ± 0.038
ThrPhe: 2.267 ± 0.046
ThrGly: 5.461 ± 0.083
ThrHis: 1.209 ± 0.026
ThrIle: 2.977 ± 0.044
ThrLys: 1.411 ± 0.026
ThrLeu: 5.935 ± 0.066
ThrMet: 1.116 ± 0.025
ThrAsn: 1.811 ± 0.053
ThrPro: 3.771 ± 0.052
ThrGln: 1.945 ± 0.037
ThrArg: 3.272 ± 0.046
ThrSer: 3.44 ± 0.062
ThrThr: 3.431 ± 0.074
ThrVal: 4.677 ± 0.064
ThrTrp: 0.875 ± 0.024
ThrTyr: 1.539 ± 0.038
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.976 ± 0.075
ValCys: 0.717 ± 0.019
ValAsp: 3.777 ± 0.048
ValGlu: 4.068 ± 0.063
ValPhe: 2.678 ± 0.046
ValGly: 5.066 ± 0.058
ValHis: 1.573 ± 0.03
ValIle: 3.324 ± 0.041
ValLys: 2.234 ± 0.038
ValLeu: 7.68 ± 0.083
ValMet: 1.484 ± 0.029
ValAsn: 2.498 ± 0.044
ValPro: 4.018 ± 0.05
ValGln: 2.507 ± 0.036
ValArg: 4.808 ± 0.064
ValSer: 4.685 ± 0.056
ValThr: 4.545 ± 0.073
ValVal: 5.475 ± 0.06
ValTrp: 0.947 ± 0.024
ValTyr: 1.811 ± 0.033
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.256 ± 0.027
TrpCys: 0.137 ± 0.009
TrpAsp: 0.693 ± 0.023
TrpGlu: 0.694 ± 0.02
TrpPhe: 0.625 ± 0.021
TrpGly: 1.068 ± 0.024
TrpHis: 0.393 ± 0.017
TrpIle: 0.844 ± 0.024
TrpLys: 0.638 ± 0.02
TrpLeu: 1.528 ± 0.033
TrpMet: 0.425 ± 0.015
TrpAsn: 0.626 ± 0.023
TrpPro: 0.687 ± 0.02
TrpGln: 0.653 ± 0.02
TrpArg: 1.071 ± 0.024
TrpSer: 0.977 ± 0.024
TrpThr: 0.9 ± 0.021
TrpVal: 0.945 ± 0.025
TrpTrp: 0.254 ± 0.01
TrpTyr: 0.389 ± 0.014
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.684 ± 0.039
TyrCys: 0.256 ± 0.013
TyrAsp: 1.504 ± 0.033
TyrGlu: 1.364 ± 0.025
TyrPhe: 1.261 ± 0.029
TyrGly: 2.281 ± 0.043
TyrHis: 0.642 ± 0.019
TyrIle: 0.982 ± 0.024
TyrLys: 0.79 ± 0.022
TyrLeu: 2.612 ± 0.044
TyrMet: 0.43 ± 0.016
TyrAsn: 0.902 ± 0.031
TyrPro: 1.386 ± 0.03
TyrGln: 1.011 ± 0.023
TyrArg: 1.931 ± 0.034
TyrSer: 1.695 ± 0.032
TyrThr: 1.54 ± 0.036
TyrVal: 1.792 ± 0.033
TyrTrp: 0.456 ± 0.016
TyrTyr: 0.835 ± 0.026
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5367 proteins (1893627 amino acids)
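
To get a feel for the absolute numbers behind the per mille values, one can estimate occurrence counts from these totals. This is only a back-of-the-envelope sketch: it assumes that every protein of length n contributes n - 1 overlapping dipeptides and that the reported frequencies refer to that total, neither of which is stated on this page. The names used are hypothetical.

```python
N_PROTEINS = 5367        # from the statistics line above
N_AMINO_ACIDS = 1893627  # from the statistics line above

# Assumption: each protein of length n contributes n - 1 overlapping dipeptides.
N_DIPEPTIDES = N_AMINO_ACIDS - N_PROTEINS


def approx_count(freq_permille, total=N_DIPEPTIDES):
    """Approximate number of occurrences behind a per mille frequency."""
    return round(freq_permille * 1e-3 * total)


ala_ala_count = approx_count(14.727)  # AlaAla, per mille value from the table
```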

Note: The error has been estimated by bootstrapping (×100) at the protein level.
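
A protein-level bootstrap can be sketched roughly as below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. The function names, the toy sequences, and the use of the standard deviation as the reported error are assumptions; the exact procedure used by Proteome-pI is not described on this page.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(sequences, pair):
    """Frequency of one dipeptide (one-letter codes, e.g. "AA") in per mille."""
    counts = Counter()
    for seq in sequences:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome, not real data.
toy = ["MAAL", "MAL", "MKAAV", "MGG", "MAAAK"]
error = bootstrap_error(toy, "AA")
```

Resampling proteins rather than individual dipeptides keeps the pairs within one protein together, so the estimate reflects variation between proteins.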

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski