Amino acid dipeptide frequency for Planctomycetes bacterium K23_9

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
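As an illustration of the idea, here is a minimal sketch of how dipeptide frequencies could be counted from a set of protein sequences. It assumes one-letter amino acid codes and an overlapping window of width 2 that never spans two proteins; the function name `dipeptide_frequencies` and the toy sequences are hypothetical, and the exact counting procedure used for this table is not described on this page.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return per-mille frequencies of overlapping dipeptides over all sequences."""
    counts = Counter()
    for seq in sequences:
        # Slide a window of width 2 along each protein separately,
        # so that no pair is formed across a protein boundary.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Express each count as a share of all observed dipeptides, in per mille.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from this proteome).
toy = ["MAAK", "AAL"]
freqs = dipeptide_frequencies(toy)
```

With the toy input, five dipeptides are observed in total (MA, AA, AK, AA, AL), so the sketch reports the ordered pair AA at two fifths of all pairs.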

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
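The uniform expectation can be checked with a few lines of arithmetic. The sketch below computes the expected per-mille value for a given alphabet size and labels a few values from the table as over- or underrepresented. The helper names are hypothetical, and it assumes the comparison threshold is exactly the uniform expectation, which this page does not state precisely.

```python
def expected_per_mille(alphabet_size=20):
    """Expected frequency (in per mille) of each dipeptide if all are equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed, expected):
    """Label an observed frequency relative to the uniform expectation."""
    if observed > expected:
        return "over"
    if observed < expected:
        return "under"
    return "as expected"


expected = expected_per_mille(20)        # 20 standard amino acids, 400 dipeptides
expected_with_xaa = expected_per_mille(21)  # 441 dipeptides if Xaa is included

# A few observed values (in per mille) copied from the table.
observed = {"AlaAla": 11.104, "CysCys": 0.232, "ProPro": 2.532}
labels = {pair: classify(value, expected) for pair, value in observed.items()}
```

Under these assumptions AlaAla and ProPro fall above the 2.5‰ expectation and CysCys falls below it.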

All values are presented in per mille (‰); to obtain fractions, multiply them by 10⁻³.
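For example, the unit conversion for the AlaAla value from the table can be sketched as follows (the function name is hypothetical):

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction by multiplying by 10^-3."""
    return value * 1e-3


ala_ala = 11.104                            # AlaAla frequency from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)   # about 0.011104
percent = fraction * 100                    # about 1.1104 percent
```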

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
11.104AlaAla: 11.104 ± 0.094
1.08AlaCys: 1.08 ± 0.027
6.762AlaAsp: 6.762 ± 0.064
5.722AlaGlu: 5.722 ± 0.054
3.306AlaPhe: 3.306 ± 0.036
7.54AlaGly: 7.54 ± 0.072
1.492AlaHis: 1.492 ± 0.027
6.013AlaIle: 6.013 ± 0.056
4.739AlaLys: 4.739 ± 0.064
7.61AlaLeu: 7.61 ± 0.062
2.724AlaMet: 2.724 ± 0.045
3.673AlaAsn: 3.673 ± 0.054
3.807AlaPro: 3.807 ± 0.054
3.323AlaGln: 3.323 ± 0.045
4.651AlaArg: 4.651 ± 0.051
6.915AlaSer: 6.915 ± 0.061
6.237AlaThr: 6.237 ± 0.105
6.828AlaVal: 6.828 ± 0.063
1.239AlaTrp: 1.239 ± 0.026
1.916AlaTyr: 1.916 ± 0.029
0.0AlaXaa: 0.0 ± 0.0
Cys
0.778CysAla: 0.778 ± 0.026
0.232CysCys: 0.232 ± 0.012
0.816CysAsp: 0.816 ± 0.024
0.66CysGlu: 0.66 ± 0.021
0.467CysPhe: 0.467 ± 0.017
1.02CysGly: 1.02 ± 0.034
0.434CysHis: 0.434 ± 0.019
0.454CysIle: 0.454 ± 0.013
0.337CysLys: 0.337 ± 0.014
1.09CysLeu: 1.09 ± 0.025
0.211CysMet: 0.211 ± 0.012
0.338CysAsn: 0.338 ± 0.014
0.491CysPro: 0.491 ± 0.016
0.461CysGln: 0.461 ± 0.016
0.72CysArg: 0.72 ± 0.02
0.689CysSer: 0.689 ± 0.019
0.511CysThr: 0.511 ± 0.015
0.857CysVal: 0.857 ± 0.024
0.176CysTrp: 0.176 ± 0.008
0.306CysTyr: 0.306 ± 0.011
0.0CysXaa: 0.0 ± 0.0
Asp
7.077AspAla: 7.077 ± 0.083
0.632AspCys: 0.632 ± 0.019
5.111AspAsp: 5.111 ± 0.081
4.328AspGlu: 4.328 ± 0.05
2.565AspPhe: 2.565 ± 0.038
5.934AspGly: 5.934 ± 0.11
1.531AspHis: 1.531 ± 0.032
2.654AspIle: 2.654 ± 0.055
2.181AspLys: 2.181 ± 0.04
6.067AspLeu: 6.067 ± 0.063
1.045AspMet: 1.045 ± 0.022
2.039AspAsn: 2.039 ± 0.045
3.612AspPro: 3.612 ± 0.06
3.178AspGln: 3.178 ± 0.037
4.391AspArg: 4.391 ± 0.047
4.505AspSer: 4.505 ± 0.075
3.102AspThr: 3.102 ± 0.079
4.741AspVal: 4.741 ± 0.072
1.087AspTrp: 1.087 ± 0.021
1.628AspTyr: 1.628 ± 0.025
0.0AspXaa: 0.0 ± 0.0
Glu
4.94GluAla: 4.94 ± 0.062
0.477GluCys: 0.477 ± 0.016
2.603GluAsp: 2.603 ± 0.041
2.752GluGlu: 2.752 ± 0.055
2.158GluPhe: 2.158 ± 0.031
3.176GluGly: 3.176 ± 0.047
1.22GluHis: 1.22 ± 0.024
3.526GluIle: 3.526 ± 0.044
2.522GluLys: 2.522 ± 0.04
6.239GluLeu: 6.239 ± 0.066
1.544GluMet: 1.544 ± 0.033
2.043GluAsn: 2.043 ± 0.033
2.47GluPro: 2.47 ± 0.037
2.667GluGln: 2.667 ± 0.039
3.302GluArg: 3.302 ± 0.044
4.744GluSer: 4.744 ± 0.055
3.667GluThr: 3.667 ± 0.043
3.759GluVal: 3.759 ± 0.043
0.6GluTrp: 0.6 ± 0.017
1.24GluTyr: 1.24 ± 0.029
0.0GluXaa: 0.0 ± 0.0
Phe
4.047PheAla: 4.047 ± 0.047
0.509PheCys: 0.509 ± 0.017
3.189PheAsp: 3.189 ± 0.051
1.935PheGlu: 1.935 ± 0.03
1.473PhePhe: 1.473 ± 0.029
3.298PheGly: 3.298 ± 0.039
0.786PheHis: 0.786 ± 0.02
1.453PheIle: 1.453 ± 0.027
1.105PheLys: 1.105 ± 0.026
3.227PheLeu: 3.227 ± 0.049
0.634PheMet: 0.634 ± 0.018
1.188PheAsn: 1.188 ± 0.031
1.49PhePro: 1.49 ± 0.026
1.414PheGln: 1.414 ± 0.026
2.297PheArg: 2.297 ± 0.038
2.463PheSer: 2.463 ± 0.038
2.004PheThr: 2.004 ± 0.046
2.997PheVal: 2.997 ± 0.036
0.518PheTrp: 0.518 ± 0.018
0.938PheTyr: 0.938 ± 0.02
0.0PheXaa: 0.0 ± 0.0
Gly
5.764GlyAla: 5.764 ± 0.069
0.993GlyCys: 0.993 ± 0.041
5.362GlyAsp: 5.362 ± 0.094
4.331GlyGlu: 4.331 ± 0.046
3.025GlyPhe: 3.025 ± 0.042
6.629GlyGly: 6.629 ± 0.136
1.644GlyHis: 1.644 ± 0.03
3.96GlyIle: 3.96 ± 0.05
3.834GlyLys: 3.834 ± 0.05
6.435GlyLeu: 6.435 ± 0.054
1.867GlyMet: 1.867 ± 0.037
2.981GlyAsn: 2.981 ± 0.076
2.92GlyPro: 2.92 ± 0.044
3.132GlyGln: 3.132 ± 0.045
4.638GlyArg: 4.638 ± 0.053
5.196GlySer: 5.196 ± 0.085
4.687GlyThr: 4.687 ± 0.116
5.397GlyVal: 5.397 ± 0.071
1.178GlyTrp: 1.178 ± 0.028
1.999GlyTyr: 1.999 ± 0.03
0.0GlyXaa: 0.0 ± 0.0
His
1.989HisAla: 1.989 ± 0.032
0.338HisCys: 0.338 ± 0.012
1.34HisAsp: 1.34 ± 0.026
1.137HisGlu: 1.137 ± 0.025
0.954HisPhe: 0.954 ± 0.022
1.712HisGly: 1.712 ± 0.032
0.666HisHis: 0.666 ± 0.019
0.805HisIle: 0.805 ± 0.019
0.582HisLys: 0.582 ± 0.018
2.075HisLeu: 2.075 ± 0.038
0.369HisMet: 0.369 ± 0.014
0.648HisAsn: 0.648 ± 0.016
1.269HisPro: 1.269 ± 0.027
0.977HisGln: 0.977 ± 0.022
1.624HisArg: 1.624 ± 0.033
1.389HisSer: 1.389 ± 0.027
0.951HisThr: 0.951 ± 0.024
1.523HisVal: 1.523 ± 0.026
0.414HisTrp: 0.414 ± 0.013
0.619HisTyr: 0.619 ± 0.017
0.0HisXaa: 0.0 ± 0.0
Ile
6.351IleAla: 6.351 ± 0.058
0.628IleCys: 0.628 ± 0.02
4.667IleAsp: 4.667 ± 0.058
3.678IleGlu: 3.678 ± 0.041
1.423IlePhe: 1.423 ± 0.023
4.377IleGly: 4.377 ± 0.055
1.065IleHis: 1.065 ± 0.021
1.926IleIle: 1.926 ± 0.033
1.772IleLys: 1.772 ± 0.03
3.861IleLeu: 3.861 ± 0.044
0.742IleMet: 0.742 ± 0.022
1.809IleAsn: 1.809 ± 0.037
2.371IlePro: 2.371 ± 0.035
1.989IleGln: 1.989 ± 0.033
3.407IleArg: 3.407 ± 0.041
3.277IleSer: 3.277 ± 0.043
2.891IleThr: 2.891 ± 0.055
4.255IleVal: 4.255 ± 0.044
0.594IleTrp: 0.594 ± 0.016
1.18IleTyr: 1.18 ± 0.022
0.0IleXaa: 0.0 ± 0.0
Lys
3.336LysAla: 3.336 ± 0.051
0.309LysCys: 0.309 ± 0.012
2.034LysAsp: 2.034 ± 0.036
2.064LysGlu: 2.064 ± 0.041
1.284LysPhe: 1.284 ± 0.027
2.175LysGly: 2.175 ± 0.036
0.945LysHis: 0.945 ± 0.023
2.126LysIle: 2.126 ± 0.039
2.078LysLys: 2.078 ± 0.053
4.032LysLeu: 4.032 ± 0.05
1.06LysMet: 1.06 ± 0.023
1.378LysAsn: 1.378 ± 0.03
2.427LysPro: 2.427 ± 0.041
1.879LysGln: 1.879 ± 0.033
2.741LysArg: 2.741 ± 0.04
2.805LysSer: 2.805 ± 0.044
2.577LysThr: 2.577 ± 0.042
2.531LysVal: 2.531 ± 0.045
0.594LysTrp: 0.594 ± 0.018
0.9LysTyr: 0.9 ± 0.02
0.0LysXaa: 0.0 ± 0.0
Leu
10.033LeuAla: 10.033 ± 0.089
1.119LeuCys: 1.119 ± 0.027
5.751LeuAsp: 5.751 ± 0.05
4.674LeuGlu: 4.674 ± 0.056
3.28LeuPhe: 3.28 ± 0.043
6.61LeuGly: 6.61 ± 0.063
1.857LeuHis: 1.857 ± 0.033
4.938LeuIle: 4.938 ± 0.056
3.512LeuLys: 3.512 ± 0.048
8.887LeuLeu: 8.887 ± 0.105
2.095LeuMet: 2.095 ± 0.037
2.968LeuAsn: 2.968 ± 0.042
5.115LeuPro: 5.115 ± 0.054
3.845LeuGln: 3.845 ± 0.043
5.898LeuArg: 5.898 ± 0.068
6.973LeuSer: 6.973 ± 0.061
5.52LeuThr: 5.52 ± 0.072
6.686LeuVal: 6.686 ± 0.063
1.116LeuTrp: 1.116 ± 0.024
1.887LeuTyr: 1.887 ± 0.034
0.0LeuXaa: 0.0 ± 0.0
Met
2.091MetAla: 2.091 ± 0.04
0.2MetCys: 0.2 ± 0.009
1.14MetAsp: 1.14 ± 0.026
0.998MetGlu: 0.998 ± 0.024
0.732MetPhe: 0.732 ± 0.021
1.531MetGly: 1.531 ± 0.034
0.507MetHis: 0.507 ± 0.015
1.329MetIle: 1.329 ± 0.029
1.076MetLys: 1.076 ± 0.026
2.278MetLeu: 2.278 ± 0.037
0.632MetMet: 0.632 ± 0.02
0.963MetAsn: 0.963 ± 0.024
1.413MetPro: 1.413 ± 0.026
1.063MetGln: 1.063 ± 0.025
1.47MetArg: 1.47 ± 0.03
1.64MetSer: 1.64 ± 0.028
1.554MetThr: 1.554 ± 0.028
1.482MetVal: 1.482 ± 0.032
0.22MetTrp: 0.22 ± 0.011
0.407MetTyr: 0.407 ± 0.015
0.0MetXaa: 0.0 ± 0.0
Asn
3.355AsnAla: 3.355 ± 0.046
0.389AsnCys: 0.389 ± 0.014
2.451AsnAsp: 2.451 ± 0.059
2.02AsnGlu: 2.02 ± 0.032
1.185AsnPhe: 1.185 ± 0.026
2.936AsnGly: 2.936 ± 0.065
0.845AsnHis: 0.845 ± 0.021
1.433AsnIle: 1.433 ± 0.031
1.028AsnLys: 1.028 ± 0.025
3.157AsnLeu: 3.157 ± 0.036
0.597AsnMet: 0.597 ± 0.017
1.196AsnAsn: 1.196 ± 0.038
2.038AsnPro: 2.038 ± 0.033
1.69AsnGln: 1.69 ± 0.028
2.453AsnArg: 2.453 ± 0.035
2.198AsnSer: 2.198 ± 0.049
1.761AsnThr: 1.761 ± 0.046
2.604AsnVal: 2.604 ± 0.048
0.578AsnTrp: 0.578 ± 0.017
0.867AsnTyr: 0.867 ± 0.019
0.0AsnXaa: 0.0 ± 0.0
Pro
4.816ProAla: 4.816 ± 0.058
0.353ProCys: 0.353 ± 0.015
3.42ProAsp: 3.42 ± 0.049
3.143ProGlu: 3.143 ± 0.042
1.713ProPhe: 1.713 ± 0.033
3.601ProGly: 3.601 ± 0.044
0.991ProHis: 0.991 ± 0.02
2.742ProIle: 2.742 ± 0.045
2.006ProLys: 2.006 ± 0.035
4.158ProLeu: 4.158 ± 0.049
1.183ProMet: 1.183 ± 0.024
1.96ProAsn: 1.96 ± 0.037
2.532ProPro: 2.532 ± 0.052
1.845ProGln: 1.845 ± 0.032
2.485ProArg: 2.485 ± 0.038
3.722ProSer: 3.722 ± 0.045
3.304ProThr: 3.304 ± 0.052
3.618ProVal: 3.618 ± 0.052
0.645ProTrp: 0.645 ± 0.016
1.049ProTyr: 1.049 ± 0.023
0.0ProXaa: 0.0 ± 0.0
Gln
3.806GlnAla: 3.806 ± 0.048
0.403GlnCys: 0.403 ± 0.015
1.905GlnAsp: 1.905 ± 0.031
1.812GlnGlu: 1.812 ± 0.031
1.656GlnPhe: 1.656 ± 0.027
2.45GlnGly: 2.45 ± 0.034
0.989GlnHis: 0.989 ± 0.024
2.541GlnIle: 2.541 ± 0.033
1.573GlnLys: 1.573 ± 0.032
4.487GlnLeu: 4.487 ± 0.048
1.038GlnMet: 1.038 ± 0.022
1.413GlnAsn: 1.413 ± 0.026
2.318GlnPro: 2.318 ± 0.044
2.362GlnGln: 2.362 ± 0.043
3.089GlnArg: 3.089 ± 0.043
3.568GlnSer: 3.568 ± 0.049
2.7GlnThr: 2.7 ± 0.037
2.721GlnVal: 2.721 ± 0.039
0.768GlnTrp: 0.768 ± 0.02
1.028GlnTyr: 1.028 ± 0.02
0.0GlnXaa: 0.0 ± 0.0
Arg
4.42ArgAla: 4.42 ± 0.051
0.719ArgCys: 0.719 ± 0.02
3.942ArgAsp: 3.942 ± 0.046
3.511ArgGlu: 3.511 ± 0.058
2.758ArgPhe: 2.758 ± 0.039
4.103ArgGly: 4.103 ± 0.047
1.363ArgHis: 1.363 ± 0.03
3.588ArgIle: 3.588 ± 0.045
2.497ArgLys: 2.497 ± 0.042
6.519ArgLeu: 6.519 ± 0.067
1.726ArgMet: 1.726 ± 0.029
2.075ArgAsn: 2.075 ± 0.034
2.888ArgPro: 2.888 ± 0.039
2.831ArgGln: 2.831 ± 0.042
4.77ArgArg: 4.77 ± 0.067
4.434ArgSer: 4.434 ± 0.057
3.11ArgThr: 3.11 ± 0.04
4.328ArgVal: 4.328 ± 0.051
1.113ArgTrp: 1.113 ± 0.026
1.74ArgTyr: 1.74 ± 0.032
0.0ArgXaa: 0.0 ± 0.0
Ser
6.295SerAla: 6.295 ± 0.064
0.676SerCys: 0.676 ± 0.022
5.079SerAsp: 5.079 ± 0.071
4.036SerGlu: 4.036 ± 0.042
2.508SerPhe: 2.508 ± 0.037
6.032SerGly: 6.032 ± 0.082
1.448SerHis: 1.448 ± 0.031
3.83SerIle: 3.83 ± 0.038
2.562SerLys: 2.562 ± 0.044
6.704SerLeu: 6.704 ± 0.062
1.631SerMet: 1.631 ± 0.033
2.427SerAsn: 2.427 ± 0.041
3.664SerPro: 3.664 ± 0.046
3.093SerGln: 3.093 ± 0.039
4.166SerArg: 4.166 ± 0.05
4.825SerSer: 4.825 ± 0.064
3.968SerThr: 3.968 ± 0.057
5.305SerVal: 5.305 ± 0.059
0.884SerTrp: 0.884 ± 0.02
1.476SerTyr: 1.476 ± 0.032
0.0SerXaa: 0.0 ± 0.0
Thr
5.636ThrAla: 5.636 ± 0.08
0.555ThrCys: 0.555 ± 0.018
3.986ThrAsp: 3.986 ± 0.083
2.99ThrGlu: 2.99 ± 0.044
2.187ThrPhe: 2.187 ± 0.054
4.795ThrGly: 4.795 ± 0.105
1.183ThrHis: 1.183 ± 0.031
3.546ThrIle: 3.546 ± 0.086
2.128ThrLys: 2.128 ± 0.031
5.634ThrLeu: 5.634 ± 0.067
1.237ThrMet: 1.237 ± 0.027
2.079ThrAsn: 2.079 ± 0.056
3.285ThrPro: 3.285 ± 0.049
2.249ThrGln: 2.249 ± 0.031
3.015ThrArg: 3.015 ± 0.042
3.838ThrSer: 3.838 ± 0.061
3.632ThrThr: 3.632 ± 0.075
4.437ThrVal: 4.437 ± 0.105
0.766ThrTrp: 0.766 ± 0.021
1.368ThrTyr: 1.368 ± 0.044
0.0ThrXaa: 0.0 ± 0.0
Val
7.693ValAla: 7.693 ± 0.066
0.935ValCys: 0.935 ± 0.022
5.43ValAsp: 5.43 ± 0.082
3.854ValGlu: 3.854 ± 0.049
2.722ValPhe: 2.722 ± 0.04
5.341ValGly: 5.341 ± 0.059
1.425ValHis: 1.425 ± 0.026
3.739ValIle: 3.739 ± 0.041
2.37ValLys: 2.37 ± 0.036
6.53ValLeu: 6.53 ± 0.072
1.612ValMet: 1.612 ± 0.027
2.343ValAsn: 2.343 ± 0.046
3.427ValPro: 3.427 ± 0.048
2.709ValGln: 2.709 ± 0.042
4.429ValArg: 4.429 ± 0.047
4.976ValSer: 4.976 ± 0.062
4.421ValThr: 4.421 ± 0.108
5.898ValVal: 5.898 ± 0.061
0.919ValTrp: 0.919 ± 0.021
1.638ValTyr: 1.638 ± 0.03
0.0ValXaa: 0.0 ± 0.0
Trp
1.029TrpAla: 1.029 ± 0.024
0.186TrpCys: 0.186 ± 0.008
0.766TrpAsp: 0.766 ± 0.02
0.61TrpGlu: 0.61 ± 0.018
0.547TrpPhe: 0.547 ± 0.017
0.837TrpGly: 0.837 ± 0.021
0.414TrpHis: 0.414 ± 0.014
0.862TrpIle: 0.862 ± 0.02
0.692TrpLys: 0.692 ± 0.02
1.512TrpLeu: 1.512 ± 0.029
0.425TrpMet: 0.425 ± 0.015
0.566TrpAsn: 0.566 ± 0.016
0.66TrpPro: 0.66 ± 0.018
0.744TrpGln: 0.744 ± 0.021
0.925TrpArg: 0.925 ± 0.02
0.98TrpSer: 0.98 ± 0.021
0.808TrpThr: 0.808 ± 0.022
0.871TrpVal: 0.871 ± 0.019
0.233TrpTrp: 0.233 ± 0.01
0.34TrpTyr: 0.34 ± 0.014
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.119TyrAla: 2.119 ± 0.034
0.316TyrCys: 0.316 ± 0.013
1.653TyrAsp: 1.653 ± 0.048
1.393TyrGlu: 1.393 ± 0.027
1.02TyrPhe: 1.02 ± 0.024
1.873TyrGly: 1.873 ± 0.033
0.577TyrHis: 0.577 ± 0.018
0.795TyrIle: 0.795 ± 0.019
0.681TyrLys: 0.681 ± 0.019
2.262TyrLeu: 2.262 ± 0.034
0.352TyrMet: 0.352 ± 0.012
0.697TyrAsn: 0.697 ± 0.023
1.091TyrPro: 1.091 ± 0.024
1.176TyrGln: 1.176 ± 0.024
1.965TyrArg: 1.965 ± 0.031
1.472TyrSer: 1.472 ± 0.028
1.141TyrThr: 1.141 ± 0.04
1.576TyrVal: 1.576 ± 0.026
0.368TyrTrp: 0.368 ± 0.014
0.643TyrTyr: 0.643 ± 0.018
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5858 proteins (2345483 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
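A minimal sketch of what a protein-level bootstrap could look like is given below. It assumes that whole proteins are resampled with replacement, that the statistic is recomputed for each of 100 resamples, and that the reported error is the standard deviation across resamples. These details, the function names, and the toy sequences are assumptions for illustration; the page only states that bootstrapping (×100) at the protein level was used.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(sequences, pair):
    """Per-mille frequency of one dipeptide over a list of protein sequences."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Standard deviation of the dipeptide frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from this proteome).
toy = ["MAAKL", "AALAA", "MKKLA", "GAAGA"]
err = bootstrap_error(toy, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the spread of the estimates reflects variation between proteins.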

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski