Amino acid dipepetide frequency for Variovorax sp. PAMC 28711

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
18.38AlaAla: 18.38 ± 0.173
1.182AlaCys: 1.182 ± 0.03
6.681AlaAsp: 6.681 ± 0.081
6.389AlaGlu: 6.389 ± 0.077
4.43AlaPhe: 4.43 ± 0.07
11.098AlaGly: 11.098 ± 0.126
2.643AlaHis: 2.643 ± 0.055
5.679AlaIle: 5.679 ± 0.087
3.983AlaLys: 3.983 ± 0.066
15.295AlaLeu: 15.295 ± 0.15
3.794AlaMet: 3.794 ± 0.055
3.066AlaAsn: 3.066 ± 0.077
6.671AlaPro: 6.671 ± 0.098
5.512AlaGln: 5.512 ± 0.076
8.908AlaArg: 8.908 ± 0.109
7.122AlaSer: 7.122 ± 0.089
6.877AlaThr: 6.877 ± 0.079
9.772AlaVal: 9.772 ± 0.104
1.861AlaTrp: 1.861 ± 0.036
2.379AlaTyr: 2.379 ± 0.048
0.0AlaXaa: 0.0 ± 0.0
Cys
1.139CysAla: 1.139 ± 0.033
0.106CysCys: 0.106 ± 0.009
0.515CysAsp: 0.515 ± 0.022
0.51CysGlu: 0.51 ± 0.021
0.28CysPhe: 0.28 ± 0.016
0.903CysGly: 0.903 ± 0.032
0.218CysHis: 0.218 ± 0.014
0.42CysIle: 0.42 ± 0.019
0.228CysLys: 0.228 ± 0.014
0.703CysLeu: 0.703 ± 0.024
0.213CysMet: 0.213 ± 0.014
0.219CysAsn: 0.219 ± 0.011
0.413CysPro: 0.413 ± 0.022
0.215CysGln: 0.215 ± 0.013
0.503CysArg: 0.503 ± 0.021
0.439CysSer: 0.439 ± 0.021
0.458CysThr: 0.458 ± 0.02
0.705CysVal: 0.705 ± 0.025
0.111CysTrp: 0.111 ± 0.009
0.156CysTyr: 0.156 ± 0.012
0.0CysXaa: 0.0 ± 0.0
Asp
7.94AspAla: 7.94 ± 0.089
0.427AspCys: 0.427 ± 0.02
2.975AspAsp: 2.975 ± 0.078
3.097AspGlu: 3.097 ± 0.055
2.046AspPhe: 2.046 ± 0.04
4.864AspGly: 4.864 ± 0.074
1.135AspHis: 1.135 ± 0.034
2.498AspIle: 2.498 ± 0.05
1.798AspLys: 1.798 ± 0.043
5.179AspLeu: 5.179 ± 0.063
1.28AspMet: 1.28 ± 0.032
1.326AspAsn: 1.326 ± 0.033
2.803AspPro: 2.803 ± 0.05
1.516AspGln: 1.516 ± 0.041
3.463AspArg: 3.463 ± 0.066
2.278AspSer: 2.278 ± 0.048
2.719AspThr: 2.719 ± 0.055
4.229AspVal: 4.229 ± 0.063
0.901AspTrp: 0.901 ± 0.032
1.367AspTyr: 1.367 ± 0.03
0.0AspXaa: 0.0 ± 0.0
Glu
6.767GluAla: 6.767 ± 0.081
0.355GluCys: 0.355 ± 0.017
2.044GluAsp: 2.044 ± 0.039
2.243GluGlu: 2.243 ± 0.053
1.717GluPhe: 1.717 ± 0.035
3.888GluGly: 3.888 ± 0.062
1.136GluHis: 1.136 ± 0.032
2.571GluIle: 2.571 ± 0.045
1.896GluLys: 1.896 ± 0.049
5.291GluLeu: 5.291 ± 0.072
1.259GluMet: 1.259 ± 0.037
1.157GluAsn: 1.157 ± 0.031
2.438GluPro: 2.438 ± 0.045
2.046GluGln: 2.046 ± 0.049
4.389GluArg: 4.389 ± 0.067
2.582GluSer: 2.582 ± 0.04
2.545GluThr: 2.545 ± 0.044
3.955GluVal: 3.955 ± 0.064
0.701GluTrp: 0.701 ± 0.024
0.913GluTyr: 0.913 ± 0.032
0.0GluXaa: 0.0 ± 0.0
Phe
4.552PheAla: 4.552 ± 0.067
0.364PheCys: 0.364 ± 0.018
2.655PheAsp: 2.655 ± 0.046
2.048PheGlu: 2.048 ± 0.046
1.473PhePhe: 1.473 ± 0.042
3.576PheGly: 3.576 ± 0.057
0.712PheHis: 0.712 ± 0.025
1.539PheIle: 1.539 ± 0.041
1.276PheLys: 1.276 ± 0.035
3.004PheLeu: 3.004 ± 0.059
0.84PheMet: 0.84 ± 0.03
1.131PheAsn: 1.131 ± 0.032
1.488PhePro: 1.488 ± 0.033
1.018PheGln: 1.018 ± 0.027
1.916PheArg: 1.916 ± 0.043
2.186PheSer: 2.186 ± 0.044
1.987PheThr: 1.987 ± 0.041
3.019PheVal: 3.019 ± 0.054
0.524PheTrp: 0.524 ± 0.022
0.809PheTyr: 0.809 ± 0.026
0.0PheXaa: 0.0 ± 0.0
Gly
10.329GlyAla: 10.329 ± 0.123
0.854GlyCys: 0.854 ± 0.027
4.213GlyAsp: 4.213 ± 0.073
4.344GlyGlu: 4.344 ± 0.063
3.457GlyPhe: 3.457 ± 0.055
7.506GlyGly: 7.506 ± 0.154
1.882GlyHis: 1.882 ± 0.042
4.156GlyIle: 4.156 ± 0.066
3.516GlyLys: 3.516 ± 0.06
8.658GlyLeu: 8.658 ± 0.099
2.284GlyMet: 2.284 ± 0.043
2.354GlyAsn: 2.354 ± 0.065
3.124GlyPro: 3.124 ± 0.049
3.202GlyGln: 3.202 ± 0.053
5.221GlyArg: 5.221 ± 0.066
4.562GlySer: 4.562 ± 0.065
4.832GlyThr: 4.832 ± 0.093
6.883GlyVal: 6.883 ± 0.086
1.507GlyTrp: 1.507 ± 0.034
2.281GlyTyr: 2.281 ± 0.043
0.0GlyXaa: 0.0 ± 0.0
His
2.857HisAla: 2.857 ± 0.053
0.268HisCys: 0.268 ± 0.014
1.262HisAsp: 1.262 ± 0.034
1.036HisGlu: 1.036 ± 0.031
0.84HisPhe: 0.84 ± 0.025
2.054HisGly: 2.054 ± 0.04
0.58HisHis: 0.58 ± 0.02
0.949HisIle: 0.949 ± 0.027
0.583HisLys: 0.583 ± 0.022
2.065HisLeu: 2.065 ± 0.042
0.477HisMet: 0.477 ± 0.02
0.495HisAsn: 0.495 ± 0.018
1.337HisPro: 1.337 ± 0.039
0.614HisGln: 0.614 ± 0.021
1.465HisArg: 1.465 ± 0.036
0.975HisSer: 0.975 ± 0.029
1.052HisThr: 1.052 ± 0.029
1.564HisVal: 1.564 ± 0.033
0.389HisTrp: 0.389 ± 0.017
0.579HisTyr: 0.579 ± 0.021
0.0HisXaa: 0.0 ± 0.0
Ile
6.909IleAla: 6.909 ± 0.078
0.392IleCys: 0.392 ± 0.017
3.477IleAsp: 3.477 ± 0.057
3.041IleGlu: 3.041 ± 0.053
1.271IlePhe: 1.271 ± 0.037
4.643IleGly: 4.643 ± 0.075
0.848IleHis: 0.848 ± 0.028
1.405IleIle: 1.405 ± 0.042
1.483IleLys: 1.483 ± 0.041
3.101IleLeu: 3.101 ± 0.058
0.683IleMet: 0.683 ± 0.026
1.317IleAsn: 1.317 ± 0.036
2.008IlePro: 2.008 ± 0.039
1.221IleGln: 1.221 ± 0.029
2.49IleArg: 2.49 ± 0.051
2.271IleSer: 2.271 ± 0.044
2.543IleThr: 2.543 ± 0.045
3.917IleVal: 3.917 ± 0.059
0.492IleTrp: 0.492 ± 0.023
0.949IleTyr: 0.949 ± 0.028
0.0IleXaa: 0.0 ± 0.0
Lys
4.151LysAla: 4.151 ± 0.075
0.138LysCys: 0.138 ± 0.009
1.739LysAsp: 1.739 ± 0.038
1.498LysGlu: 1.498 ± 0.04
1.023LysPhe: 1.023 ± 0.032
2.546LysGly: 2.546 ± 0.052
0.668LysHis: 0.668 ± 0.026
1.587LysIle: 1.587 ± 0.042
1.591LysLys: 1.591 ± 0.049
3.663LysLeu: 3.663 ± 0.062
0.847LysMet: 0.847 ± 0.032
0.974LysAsn: 0.974 ± 0.032
2.138LysPro: 2.138 ± 0.046
1.209LysGln: 1.209 ± 0.037
2.269LysArg: 2.269 ± 0.049
1.846LysSer: 1.846 ± 0.038
2.171LysThr: 2.171 ± 0.043
2.694LysVal: 2.694 ± 0.056
0.382LysTrp: 0.382 ± 0.018
0.699LysTyr: 0.699 ± 0.026
0.0LysXaa: 0.0 ± 0.0
Leu
14.647LeuAla: 14.647 ± 0.145
0.948LeuCys: 0.948 ± 0.026
5.917LeuAsp: 5.917 ± 0.081
4.746LeuGlu: 4.746 ± 0.058
3.514LeuPhe: 3.514 ± 0.06
8.629LeuGly: 8.629 ± 0.108
2.271LeuHis: 2.271 ± 0.044
4.388LeuIle: 4.388 ± 0.067
3.633LeuLys: 3.633 ± 0.064
10.817LeuLeu: 10.817 ± 0.126
2.604LeuMet: 2.604 ± 0.05
2.671LeuAsn: 2.671 ± 0.049
5.951LeuPro: 5.951 ± 0.075
4.039LeuGln: 4.039 ± 0.063
7.289LeuArg: 7.289 ± 0.092
5.897LeuSer: 5.897 ± 0.075
5.598LeuThr: 5.598 ± 0.065
7.903LeuVal: 7.903 ± 0.087
1.3LeuTrp: 1.3 ± 0.034
1.936LeuTyr: 1.936 ± 0.035
0.0LeuXaa: 0.0 ± 0.0
Met
3.168MetAla: 3.168 ± 0.057
0.18MetCys: 0.18 ± 0.013
1.044MetAsp: 1.044 ± 0.028
1.012MetGlu: 1.012 ± 0.028
0.771MetPhe: 0.771 ± 0.026
1.922MetGly: 1.922 ± 0.045
0.556MetHis: 0.556 ± 0.021
0.986MetIle: 0.986 ± 0.029
1.142MetLys: 1.142 ± 0.034
2.73MetLeu: 2.73 ± 0.047
0.514MetMet: 0.514 ± 0.023
0.913MetAsn: 0.913 ± 0.023
1.561MetPro: 1.561 ± 0.035
1.058MetGln: 1.058 ± 0.032
1.762MetArg: 1.762 ± 0.04
1.592MetSer: 1.592 ± 0.036
1.714MetThr: 1.714 ± 0.039
1.867MetVal: 1.867 ± 0.038
0.209MetTrp: 0.209 ± 0.013
0.366MetTyr: 0.366 ± 0.019
0.0MetXaa: 0.0 ± 0.0
Asn
3.521AsnAla: 3.521 ± 0.058
0.234AsnCys: 0.234 ± 0.014
1.452AsnAsp: 1.452 ± 0.05
1.235AsnGlu: 1.235 ± 0.033
1.015AsnPhe: 1.015 ± 0.031
2.355AsnGly: 2.355 ± 0.067
0.467AsnHis: 0.467 ± 0.021
1.224AsnIle: 1.224 ± 0.03
0.885AsnLys: 0.885 ± 0.033
2.57AsnLeu: 2.57 ± 0.048
0.537AsnMet: 0.537 ± 0.021
0.754AsnAsn: 0.754 ± 0.028
1.791AsnPro: 1.791 ± 0.036
0.826AsnGln: 0.826 ± 0.023
1.527AsnArg: 1.527 ± 0.035
1.241AsnSer: 1.241 ± 0.042
1.496AsnThr: 1.496 ± 0.04
2.08AsnVal: 2.08 ± 0.048
0.381AsnTrp: 0.381 ± 0.018
0.67AsnTyr: 0.67 ± 0.022
0.0AsnXaa: 0.0 ± 0.0
Pro
7.072ProAla: 7.072 ± 0.086
0.323ProCys: 0.323 ± 0.016
3.164ProAsp: 3.164 ± 0.052
3.092ProGlu: 3.092 ± 0.062
1.806ProPhe: 1.806 ± 0.039
4.484ProGly: 4.484 ± 0.067
1.125ProHis: 1.125 ± 0.031
2.046ProIle: 2.046 ± 0.038
1.605ProLys: 1.605 ± 0.038
5.219ProLeu: 5.219 ± 0.07
1.444ProMet: 1.444 ± 0.036
1.285ProAsn: 1.285 ± 0.034
2.73ProPro: 2.73 ± 0.064
1.845ProGln: 1.845 ± 0.04
2.885ProArg: 2.885 ± 0.047
2.801ProSer: 2.801 ± 0.054
2.783ProThr: 2.783 ± 0.043
4.461ProVal: 4.461 ± 0.062
0.799ProTrp: 0.799 ± 0.028
1.103ProTyr: 1.103 ± 0.025
0.0ProXaa: 0.0 ± 0.0
Gln
4.699GlnAla: 4.699 ± 0.078
0.293GlnCys: 0.293 ± 0.014
1.446GlnAsp: 1.446 ± 0.037
1.335GlnGlu: 1.335 ± 0.034
1.273GlnPhe: 1.273 ± 0.031
2.752GlnGly: 2.752 ± 0.045
0.84GlnHis: 0.84 ± 0.025
1.747GlnIle: 1.747 ± 0.038
1.219GlnLys: 1.219 ± 0.035
3.994GlnLeu: 3.994 ± 0.061
0.92GlnMet: 0.92 ± 0.025
0.847GlnAsn: 0.847 ± 0.028
2.129GlnPro: 2.129 ± 0.041
1.832GlnGln: 1.832 ± 0.08
3.288GlnArg: 3.288 ± 0.057
2.025GlnSer: 2.025 ± 0.047
1.967GlnThr: 1.967 ± 0.043
2.759GlnVal: 2.759 ± 0.052
0.624GlnTrp: 0.624 ± 0.024
0.701GlnTyr: 0.701 ± 0.026
0.0GlnXaa: 0.0 ± 0.0
Arg
7.974ArgAla: 7.974 ± 0.09
0.538ArgCys: 0.538 ± 0.02
3.692ArgAsp: 3.692 ± 0.061
3.86ArgGlu: 3.86 ± 0.065
2.753ArgPhe: 2.753 ± 0.052
4.681ArgGly: 4.681 ± 0.067
1.652ArgHis: 1.652 ± 0.042
3.499ArgIle: 3.499 ± 0.051
2.113ArgLys: 2.113 ± 0.042
7.186ArgLeu: 7.186 ± 0.088
1.9ArgMet: 1.9 ± 0.042
1.838ArgAsn: 1.838 ± 0.038
3.029ArgPro: 3.029 ± 0.054
2.535ArgGln: 2.535 ± 0.051
4.79ArgArg: 4.79 ± 0.08
3.493ArgSer: 3.493 ± 0.06
3.389ArgThr: 3.389 ± 0.055
5.06ArgVal: 5.06 ± 0.064
1.219ArgTrp: 1.219 ± 0.031
1.773ArgTyr: 1.773 ± 0.045
0.0ArgXaa: 0.0 ± 0.0
Ser
6.76SerAla: 6.76 ± 0.091
0.372SerCys: 0.372 ± 0.017
2.658SerAsp: 2.658 ± 0.046
2.371SerGlu: 2.371 ± 0.05
2.067SerPhe: 2.067 ± 0.042
5.229SerGly: 5.229 ± 0.069
1.13SerHis: 1.13 ± 0.031
2.363SerIle: 2.363 ± 0.048
1.603SerLys: 1.603 ± 0.04
5.547SerLeu: 5.547 ± 0.067
1.372SerMet: 1.372 ± 0.033
1.448SerAsn: 1.448 ± 0.035
2.831SerPro: 2.831 ± 0.046
1.778SerGln: 1.778 ± 0.043
3.431SerArg: 3.431 ± 0.061
2.973SerSer: 2.973 ± 0.052
3.117SerThr: 3.117 ± 0.059
4.26SerVal: 4.26 ± 0.065
0.715SerTrp: 0.715 ± 0.024
1.226SerTyr: 1.226 ± 0.034
0.0SerXaa: 0.0 ± 0.0
Thr
6.766ThrAla: 6.766 ± 0.085
0.365ThrCys: 0.365 ± 0.019
2.783ThrAsp: 2.783 ± 0.049
2.413ThrGlu: 2.413 ± 0.041
1.885ThrPhe: 1.885 ± 0.04
5.07ThrGly: 5.07 ± 0.084
1.197ThrHis: 1.197 ± 0.031
2.224ThrIle: 2.224 ± 0.049
1.408ThrLys: 1.408 ± 0.036
6.734ThrLeu: 6.734 ± 0.087
1.169ThrMet: 1.169 ± 0.036
1.284ThrAsn: 1.284 ± 0.037
3.753ThrPro: 3.753 ± 0.058
1.947ThrGln: 1.947 ± 0.039
3.477ThrArg: 3.477 ± 0.05
2.783ThrSer: 2.783 ± 0.053
3.178ThrThr: 3.178 ± 0.055
4.759ThrVal: 4.759 ± 0.07
0.647ThrTrp: 0.647 ± 0.022
1.071ThrTyr: 1.071 ± 0.037
0.0ThrXaa: 0.0 ± 0.0
Val
10.112ValAla: 10.112 ± 0.096
0.722ValCys: 0.722 ± 0.024
4.321ValAsp: 4.321 ± 0.057
4.025ValGlu: 4.025 ± 0.065
2.963ValPhe: 2.963 ± 0.056
6.016ValGly: 6.016 ± 0.079
1.59ValHis: 1.59 ± 0.038
3.718ValIle: 3.718 ± 0.059
2.697ValLys: 2.697 ± 0.049
8.664ValLeu: 8.664 ± 0.094
2.071ValMet: 2.071 ± 0.045
2.126ValAsn: 2.126 ± 0.048
4.175ValPro: 4.175 ± 0.057
2.82ValGln: 2.82 ± 0.047
5.132ValArg: 5.132 ± 0.066
4.208ValSer: 4.208 ± 0.068
4.471ValThr: 4.471 ± 0.082
6.789ValVal: 6.789 ± 0.093
1.092ValTrp: 1.092 ± 0.029
1.568ValTyr: 1.568 ± 0.033
0.0ValXaa: 0.0 ± 0.0
Trp
1.366TrpAla: 1.366 ± 0.033
0.154TrpCys: 0.154 ± 0.012
0.607TrpAsp: 0.607 ± 0.022
0.5TrpGlu: 0.5 ± 0.021
0.607TrpPhe: 0.607 ± 0.023
0.956TrpGly: 0.956 ± 0.03
0.395TrpHis: 0.395 ± 0.017
0.696TrpIle: 0.696 ± 0.022
0.474TrpLys: 0.474 ± 0.018
2.011TrpLeu: 2.011 ± 0.043
0.439TrpMet: 0.439 ± 0.019
0.451TrpAsn: 0.451 ± 0.021
0.705TrpPro: 0.705 ± 0.025
0.709TrpGln: 0.709 ± 0.025
1.226TrpArg: 1.226 ± 0.032
0.82TrpSer: 0.82 ± 0.025
0.823TrpThr: 0.823 ± 0.027
1.013TrpVal: 1.013 ± 0.029
0.287TrpTrp: 0.287 ± 0.017
0.268TrpTyr: 0.268 ± 0.015
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.615TyrAla: 2.615 ± 0.047
0.242TyrCys: 0.242 ± 0.014
1.197TyrAsp: 1.197 ± 0.031
1.097TyrGlu: 1.097 ± 0.026
0.926TyrPhe: 0.926 ± 0.025
1.991TyrGly: 1.991 ± 0.037
0.395TyrHis: 0.395 ± 0.015
0.763TyrIle: 0.763 ± 0.024
0.705TyrLys: 0.705 ± 0.028
2.267TyrLeu: 2.267 ± 0.047
0.421TyrMet: 0.421 ± 0.02
0.606TyrAsn: 0.606 ± 0.022
1.001TyrPro: 1.001 ± 0.028
0.746TyrGln: 0.746 ± 0.026
1.545TyrArg: 1.545 ± 0.038
1.157TyrSer: 1.157 ± 0.032
1.191TyrThr: 1.191 ± 0.036
1.588TyrVal: 1.588 ± 0.037
0.366TyrTrp: 0.366 ± 0.018
0.537TyrTyr: 0.537 ± 0.02
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3960 proteins (1274502 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski