Amino acid dipepetide frequency for Microvirga tunisiensis

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.845AlaAla: 14.845 ± 0.118
1.091AlaCys: 1.091 ± 0.027
6.006AlaAsp: 6.006 ± 0.058
7.229AlaGlu: 7.229 ± 0.069
4.317AlaPhe: 4.317 ± 0.047
9.391AlaGly: 9.391 ± 0.075
2.255AlaHis: 2.255 ± 0.033
6.191AlaIle: 6.191 ± 0.055
4.11AlaLys: 4.11 ± 0.05
12.722AlaLeu: 12.722 ± 0.085
3.164AlaMet: 3.164 ± 0.041
2.718AlaAsn: 2.718 ± 0.034
5.16AlaPro: 5.16 ± 0.06
4.239AlaGln: 4.239 ± 0.048
8.072AlaArg: 8.072 ± 0.061
6.666AlaSer: 6.666 ± 0.061
5.695AlaThr: 5.695 ± 0.059
8.607AlaVal: 8.607 ± 0.072
1.491AlaTrp: 1.491 ± 0.027
2.539AlaTyr: 2.539 ± 0.035
0.0AlaXaa: 0.0 ± 0.0
Cys
0.899CysAla: 0.899 ± 0.022
0.124CysCys: 0.124 ± 0.008
0.493CysAsp: 0.493 ± 0.017
0.479CysGlu: 0.479 ± 0.014
0.302CysPhe: 0.302 ± 0.013
0.855CysGly: 0.855 ± 0.021
0.224CysHis: 0.224 ± 0.011
0.406CysIle: 0.406 ± 0.014
0.199CysLys: 0.199 ± 0.009
0.86CysLeu: 0.86 ± 0.02
0.158CysMet: 0.158 ± 0.009
0.219CysAsn: 0.219 ± 0.01
0.471CysPro: 0.471 ± 0.017
0.257CysGln: 0.257 ± 0.011
0.768CysArg: 0.768 ± 0.022
0.523CysSer: 0.523 ± 0.017
0.435CysThr: 0.435 ± 0.014
0.553CysVal: 0.553 ± 0.016
0.121CysTrp: 0.121 ± 0.008
0.195CysTyr: 0.195 ± 0.01
0.0CysXaa: 0.0 ± 0.0
Asp
6.129AspAla: 6.129 ± 0.056
0.449AspCys: 0.449 ± 0.013
2.891AspAsp: 2.891 ± 0.046
3.826AspGlu: 3.826 ± 0.044
2.018AspPhe: 2.018 ± 0.031
4.665AspGly: 4.665 ± 0.062
1.249AspHis: 1.249 ± 0.027
2.8AspIle: 2.8 ± 0.037
1.676AspLys: 1.676 ± 0.028
6.254AspLeu: 6.254 ± 0.057
1.139AspMet: 1.139 ± 0.023
1.273AspAsn: 1.273 ± 0.026
3.562AspPro: 3.562 ± 0.044
1.931AspGln: 1.931 ± 0.031
4.445AspArg: 4.445 ± 0.052
2.029AspSer: 2.029 ± 0.03
2.511AspThr: 2.511 ± 0.042
4.108AspVal: 4.108 ± 0.053
0.899AspTrp: 0.899 ± 0.022
1.337AspTyr: 1.337 ± 0.027
0.0AspXaa: 0.0 ± 0.0
Glu
7.932GluAla: 7.932 ± 0.072
0.415GluCys: 0.415 ± 0.016
2.847GluAsp: 2.847 ± 0.036
3.515GluGlu: 3.515 ± 0.043
1.801GluPhe: 1.801 ± 0.026
4.259GluGly: 4.259 ± 0.05
1.362GluHis: 1.362 ± 0.027
3.601GluIle: 3.601 ± 0.044
2.201GluLys: 2.201 ± 0.035
5.415GluLeu: 5.415 ± 0.052
1.456GluMet: 1.456 ± 0.024
1.549GluAsn: 1.549 ± 0.025
2.961GluPro: 2.961 ± 0.043
2.329GluGln: 2.329 ± 0.031
5.717GluArg: 5.717 ± 0.062
2.517GluSer: 2.517 ± 0.036
3.573GluThr: 3.573 ± 0.043
4.082GluVal: 4.082 ± 0.042
0.747GluTrp: 0.747 ± 0.02
0.987GluTyr: 0.987 ± 0.024
0.0GluXaa: 0.0 ± 0.0
Phe
4.07PheAla: 4.07 ± 0.051
0.38PheCys: 0.38 ± 0.012
2.433PheAsp: 2.433 ± 0.035
2.153PheGlu: 2.153 ± 0.031
1.337PhePhe: 1.337 ± 0.024
3.48PheGly: 3.48 ± 0.04
0.717PheHis: 0.717 ± 0.019
1.697PheIle: 1.697 ± 0.029
1.142PheLys: 1.142 ± 0.024
3.396PheLeu: 3.396 ± 0.046
0.803PheMet: 0.803 ± 0.019
1.024PheAsn: 1.024 ± 0.023
1.577PhePro: 1.577 ± 0.028
1.145PheGln: 1.145 ± 0.02
2.257PheArg: 2.257 ± 0.036
2.216PheSer: 2.216 ± 0.031
2.007PheThr: 2.007 ± 0.032
2.809PheVal: 2.809 ± 0.035
0.579PheTrp: 0.579 ± 0.016
0.899PheTyr: 0.899 ± 0.022
0.0PheXaa: 0.0 ± 0.0
Gly
8.261GlyAla: 8.261 ± 0.066
0.812GlyCys: 0.812 ± 0.018
4.017GlyAsp: 4.017 ± 0.054
4.561GlyGlu: 4.561 ± 0.052
3.507GlyPhe: 3.507 ± 0.044
6.606GlyGly: 6.606 ± 0.079
1.924GlyHis: 1.924 ± 0.034
4.49GlyIle: 4.49 ± 0.046
2.978GlyLys: 2.978 ± 0.041
8.746GlyLeu: 8.746 ± 0.072
2.068GlyMet: 2.068 ± 0.029
2.177GlyAsn: 2.177 ± 0.052
3.545GlyPro: 3.545 ± 0.049
3.041GlyGln: 3.041 ± 0.041
6.294GlyArg: 6.294 ± 0.057
5.085GlySer: 5.085 ± 0.062
4.66GlyThr: 4.66 ± 0.055
5.62GlyVal: 5.62 ± 0.057
1.314GlyTrp: 1.314 ± 0.025
2.206GlyTyr: 2.206 ± 0.032
0.0GlyXaa: 0.0 ± 0.0
His
2.341HisAla: 2.341 ± 0.032
0.231HisCys: 0.231 ± 0.01
1.256HisAsp: 1.256 ± 0.025
1.239HisGlu: 1.239 ± 0.025
0.807HisPhe: 0.807 ± 0.02
1.861HisGly: 1.861 ± 0.028
0.63HisHis: 0.63 ± 0.019
0.891HisIle: 0.891 ± 0.018
0.563HisLys: 0.563 ± 0.017
2.385HisLeu: 2.385 ± 0.039
0.505HisMet: 0.505 ± 0.018
0.488HisAsn: 0.488 ± 0.014
1.48HisPro: 1.48 ± 0.028
0.698HisGln: 0.698 ± 0.018
1.708HisArg: 1.708 ± 0.03
1.064HisSer: 1.064 ± 0.024
0.856HisThr: 0.856 ± 0.02
1.6HisVal: 1.6 ± 0.029
0.348HisTrp: 0.348 ± 0.014
0.516HisTyr: 0.516 ± 0.015
0.0HisXaa: 0.0 ± 0.0
Ile
6.798IleAla: 6.798 ± 0.059
0.47IleCys: 0.47 ± 0.015
3.472IleAsp: 3.472 ± 0.044
3.593IleGlu: 3.593 ± 0.042
1.603IlePhe: 1.603 ± 0.03
4.915IleGly: 4.915 ± 0.05
1.011IleHis: 1.011 ± 0.022
2.251IleIle: 2.251 ± 0.035
1.545IleLys: 1.545 ± 0.03
4.904IleLeu: 4.904 ± 0.05
1.013IleMet: 1.013 ± 0.022
1.318IleAsn: 1.318 ± 0.027
2.545IlePro: 2.545 ± 0.033
1.489IleGln: 1.489 ± 0.031
3.588IleArg: 3.588 ± 0.04
2.795IleSer: 2.795 ± 0.035
2.706IleThr: 2.706 ± 0.036
4.389IleVal: 4.389 ± 0.051
0.613IleTrp: 0.613 ± 0.016
1.113IleTyr: 1.113 ± 0.025
0.0IleXaa: 0.0 ± 0.0
Lys
4.475LysAla: 4.475 ± 0.049
0.167LysCys: 0.167 ± 0.009
1.937LysAsp: 1.937 ± 0.038
1.797LysGlu: 1.797 ± 0.029
0.843LysPhe: 0.843 ± 0.022
2.787LysGly: 2.787 ± 0.04
0.701LysHis: 0.701 ± 0.018
1.728LysIle: 1.728 ± 0.028
1.277LysLys: 1.277 ± 0.029
3.284LysLeu: 3.284 ± 0.042
0.69LysMet: 0.69 ± 0.016
0.906LysAsn: 0.906 ± 0.025
2.286LysPro: 2.286 ± 0.041
1.098LysGln: 1.098 ± 0.021
2.641LysArg: 2.641 ± 0.038
1.943LysSer: 1.943 ± 0.029
1.966LysThr: 1.966 ± 0.031
2.535LysVal: 2.535 ± 0.036
0.341LysTrp: 0.341 ± 0.012
0.571LysTyr: 0.571 ± 0.016
0.0LysXaa: 0.0 ± 0.0
Leu
12.69LeuAla: 12.69 ± 0.091
0.896LeuCys: 0.896 ± 0.022
6.0LeuAsp: 6.0 ± 0.059
5.621LeuGlu: 5.621 ± 0.063
3.47LeuPhe: 3.47 ± 0.046
8.113LeuGly: 8.113 ± 0.068
2.009LeuHis: 2.009 ± 0.029
5.175LeuIle: 5.175 ± 0.052
3.988LeuLys: 3.988 ± 0.042
9.457LeuLeu: 9.457 ± 0.087
2.27LeuMet: 2.27 ± 0.034
2.75LeuAsn: 2.75 ± 0.035
5.538LeuPro: 5.538 ± 0.05
3.187LeuGln: 3.187 ± 0.039
7.124LeuArg: 7.124 ± 0.071
6.786LeuSer: 6.786 ± 0.061
5.756LeuThr: 5.756 ± 0.062
7.741LeuVal: 7.741 ± 0.068
1.156LeuTrp: 1.156 ± 0.022
2.068LeuTyr: 2.068 ± 0.03
0.0LeuXaa: 0.0 ± 0.0
Met
2.948MetAla: 2.948 ± 0.038
0.13MetCys: 0.13 ± 0.009
1.061MetAsp: 1.061 ± 0.025
1.059MetGlu: 1.059 ± 0.022
0.618MetPhe: 0.618 ± 0.017
1.626MetGly: 1.626 ± 0.03
0.434MetHis: 0.434 ± 0.013
1.294MetIle: 1.294 ± 0.028
0.986MetLys: 0.986 ± 0.017
2.254MetLeu: 2.254 ± 0.032
0.605MetMet: 0.605 ± 0.021
0.812MetAsn: 0.812 ± 0.019
1.448MetPro: 1.448 ± 0.028
0.781MetGln: 0.781 ± 0.02
1.838MetArg: 1.838 ± 0.027
1.642MetSer: 1.642 ± 0.029
1.811MetThr: 1.811 ± 0.029
1.558MetVal: 1.558 ± 0.028
0.188MetTrp: 0.188 ± 0.01
0.284MetTyr: 0.284 ± 0.013
0.0MetXaa: 0.0 ± 0.0
Asn
2.963AsnAla: 2.963 ± 0.042
0.203AsnCys: 0.203 ± 0.009
1.474AsnAsp: 1.474 ± 0.031
1.426AsnGlu: 1.426 ± 0.029
0.923AsnPhe: 0.923 ± 0.026
2.459AsnGly: 2.459 ± 0.04
0.528AsnHis: 0.528 ± 0.015
1.248AsnIle: 1.248 ± 0.026
0.743AsnLys: 0.743 ± 0.02
2.727AsnLeu: 2.727 ± 0.041
0.547AsnMet: 0.547 ± 0.017
0.712AsnAsn: 0.712 ± 0.017
1.932AsnPro: 1.932 ± 0.03
0.885AsnGln: 0.885 ± 0.02
1.927AsnArg: 1.927 ± 0.029
1.263AsnSer: 1.263 ± 0.024
1.29AsnThr: 1.29 ± 0.026
1.916AsnVal: 1.916 ± 0.033
0.391AsnTrp: 0.391 ± 0.013
0.628AsnTyr: 0.628 ± 0.017
0.0AsnXaa: 0.0 ± 0.0
Pro
5.875ProAla: 5.875 ± 0.064
0.349ProCys: 0.349 ± 0.013
3.674ProAsp: 3.674 ± 0.044
3.744ProGlu: 3.744 ± 0.039
1.966ProPhe: 1.966 ± 0.033
4.42ProGly: 4.42 ± 0.045
1.181ProHis: 1.181 ± 0.028
2.432ProIle: 2.432 ± 0.036
1.947ProLys: 1.947 ± 0.035
4.649ProLeu: 4.649 ± 0.053
1.213ProMet: 1.213 ± 0.025
1.502ProAsn: 1.502 ± 0.027
2.743ProPro: 2.743 ± 0.048
1.883ProGln: 1.883 ± 0.036
3.11ProArg: 3.11 ± 0.041
3.357ProSer: 3.357 ± 0.044
2.743ProThr: 2.743 ± 0.033
4.207ProVal: 4.207 ± 0.048
0.754ProTrp: 0.754 ± 0.019
1.21ProTyr: 1.21 ± 0.023
0.0ProXaa: 0.0 ± 0.0
Gln
4.574GlnAla: 4.574 ± 0.05
0.229GlnCys: 0.229 ± 0.011
1.786GlnAsp: 1.786 ± 0.025
1.97GlnGlu: 1.97 ± 0.032
1.052GlnPhe: 1.052 ± 0.022
2.648GlnGly: 2.648 ± 0.04
0.736GlnHis: 0.736 ± 0.016
1.911GlnIle: 1.911 ± 0.039
1.172GlnLys: 1.172 ± 0.023
2.868GlnLeu: 2.868 ± 0.037
0.85GlnMet: 0.85 ± 0.02
0.945GlnAsn: 0.945 ± 0.02
1.963GlnPro: 1.963 ± 0.034
1.411GlnGln: 1.411 ± 0.03
2.766GlnArg: 2.766 ± 0.037
1.971GlnSer: 1.971 ± 0.034
1.9GlnThr: 1.9 ± 0.031
2.583GlnVal: 2.583 ± 0.039
0.422GlnTrp: 0.422 ± 0.014
0.642GlnTyr: 0.642 ± 0.017
0.0GlnXaa: 0.0 ± 0.0
Arg
7.455ArgAla: 7.455 ± 0.068
0.637ArgCys: 0.637 ± 0.02
3.997ArgAsp: 3.997 ± 0.043
4.595ArgGlu: 4.595 ± 0.048
3.076ArgPhe: 3.076 ± 0.038
4.774ArgGly: 4.774 ± 0.048
1.834ArgHis: 1.834 ± 0.026
4.351ArgIle: 4.351 ± 0.042
2.484ArgLys: 2.484 ± 0.039
8.334ArgLeu: 8.334 ± 0.073
1.888ArgMet: 1.888 ± 0.034
2.006ArgAsn: 2.006 ± 0.036
3.739ArgPro: 3.739 ± 0.05
3.01ArgGln: 3.01 ± 0.041
6.407ArgArg: 6.407 ± 0.064
4.511ArgSer: 4.511 ± 0.055
3.777ArgThr: 3.777 ± 0.039
5.013ArgVal: 5.013 ± 0.044
1.141ArgTrp: 1.141 ± 0.022
1.845ArgTyr: 1.845 ± 0.029
0.0ArgXaa: 0.0 ± 0.0
Ser
6.077SerAla: 6.077 ± 0.062
0.479SerCys: 0.479 ± 0.016
3.102SerAsp: 3.102 ± 0.043
3.114SerGlu: 3.114 ± 0.043
2.343SerPhe: 2.343 ± 0.033
5.587SerGly: 5.587 ± 0.062
1.266SerHis: 1.266 ± 0.026
2.906SerIle: 2.906 ± 0.038
1.816SerLys: 1.816 ± 0.033
6.072SerLeu: 6.072 ± 0.054
1.343SerMet: 1.343 ± 0.025
1.451SerAsn: 1.451 ± 0.03
3.165SerPro: 3.165 ± 0.034
1.88SerGln: 1.88 ± 0.029
4.218SerArg: 4.218 ± 0.049
3.56SerSer: 3.56 ± 0.052
2.954SerThr: 2.954 ± 0.041
4.091SerVal: 4.091 ± 0.044
0.835SerTrp: 0.835 ± 0.021
1.391SerTyr: 1.391 ± 0.029
0.0SerXaa: 0.0 ± 0.0
Thr
5.846ThrAla: 5.846 ± 0.053
0.462ThrCys: 0.462 ± 0.016
2.759ThrAsp: 2.759 ± 0.038
2.743ThrGlu: 2.743 ± 0.034
2.11ThrPhe: 2.11 ± 0.035
5.001ThrGly: 5.001 ± 0.061
1.1ThrHis: 1.1 ± 0.023
3.13ThrIle: 3.13 ± 0.035
1.671ThrLys: 1.671 ± 0.027
5.886ThrLeu: 5.886 ± 0.055
1.179ThrMet: 1.179 ± 0.026
1.348ThrAsn: 1.348 ± 0.029
3.24ThrPro: 3.24 ± 0.039
1.565ThrGln: 1.565 ± 0.03
3.479ThrArg: 3.479 ± 0.043
3.105ThrSer: 3.105 ± 0.043
2.892ThrThr: 2.892 ± 0.04
4.254ThrVal: 4.254 ± 0.047
0.741ThrTrp: 0.741 ± 0.018
1.375ThrTyr: 1.375 ± 0.029
0.0ThrXaa: 0.0 ± 0.0
Val
8.651ValAla: 8.651 ± 0.07
0.649ValCys: 0.649 ± 0.018
3.946ValAsp: 3.946 ± 0.045
4.723ValGlu: 4.723 ± 0.052
2.62ValPhe: 2.62 ± 0.035
5.483ValGly: 5.483 ± 0.053
1.477ValHis: 1.477 ± 0.026
4.005ValIle: 4.005 ± 0.044
2.347ValLys: 2.347 ± 0.04
7.657ValLeu: 7.657 ± 0.064
1.766ValMet: 1.766 ± 0.033
1.932ValAsn: 1.932 ± 0.03
3.789ValPro: 3.789 ± 0.043
2.307ValGln: 2.307 ± 0.031
5.357ValArg: 5.357 ± 0.051
4.557ValSer: 4.557 ± 0.043
4.415ValThr: 4.415 ± 0.052
5.866ValVal: 5.866 ± 0.063
0.926ValTrp: 0.926 ± 0.021
1.497ValTyr: 1.497 ± 0.029
0.0ValXaa: 0.0 ± 0.0
Trp
1.239TrpAla: 1.239 ± 0.023
0.141TrpCys: 0.141 ± 0.008
0.639TrpAsp: 0.639 ± 0.019
0.585TrpGlu: 0.585 ± 0.016
0.544TrpPhe: 0.544 ± 0.016
0.888TrpGly: 0.888 ± 0.021
0.356TrpHis: 0.356 ± 0.012
0.748TrpIle: 0.748 ± 0.017
0.459TrpLys: 0.459 ± 0.016
1.651TrpLeu: 1.651 ± 0.033
0.338TrpMet: 0.338 ± 0.015
0.449TrpAsn: 0.449 ± 0.013
0.683TrpPro: 0.683 ± 0.016
0.545TrpGln: 0.545 ± 0.015
1.26TrpArg: 1.26 ± 0.024
0.926TrpSer: 0.926 ± 0.019
0.829TrpThr: 0.829 ± 0.02
0.823TrpVal: 0.823 ± 0.017
0.217TrpTrp: 0.217 ± 0.011
0.282TrpTyr: 0.282 ± 0.012
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.441TyrAla: 2.441 ± 0.036
0.23TyrCys: 0.23 ± 0.01
1.401TyrAsp: 1.401 ± 0.028
1.294TyrGlu: 1.294 ± 0.026
0.868TyrPhe: 0.868 ± 0.025
2.092TyrGly: 2.092 ± 0.034
0.479TyrHis: 0.479 ± 0.017
0.875TyrIle: 0.875 ± 0.022
0.669TyrLys: 0.669 ± 0.018
2.273TyrLeu: 2.273 ± 0.033
0.379TyrMet: 0.379 ± 0.013
0.605TyrAsn: 0.605 ± 0.018
1.131TyrPro: 1.131 ± 0.024
0.715TyrGln: 0.715 ± 0.021
1.9TyrArg: 1.9 ± 0.026
1.134TyrSer: 1.134 ± 0.022
1.111TyrThr: 1.111 ± 0.02
1.612TyrVal: 1.612 ± 0.029
0.377TyrTrp: 0.377 ± 0.014
0.61TyrTyr: 0.61 ± 0.019
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7944 proteins (2184762 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski