Amino acid dipeptide frequency for Eubacteriaceae bacterium

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones are marked in blue in the table.
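As a rough illustration of the calculation described above, the following minimal sketch counts overlapping residue pairs within each protein and divides by the total number of pairs. The counting scheme (overlapping windows, no pairs spanning two proteins, non-standard residues mapped to Xaa) is an assumption, since the page does not spell it out; the function name and the toy sequences are hypothetical.

```python
from collections import Counter

# The 20 standard residues in one-letter code; anything else is treated as Xaa ("X").
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Expected frequency of each dipeptide if all 400 were equally likely.
UNIFORM_EXPECTATION = 1 / (len(AMINO_ACIDS) ** 2)  # 1/400 = 0.25%


def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction} over all overlapping pairs in the sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        for first, second in zip(seq, seq[1:]):
            first = first if first in AMINO_ACIDS else "X"
            second = second if second in AMINO_ACIDS else "X"
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Hypothetical toy input, not taken from the proteome on this page.
toy_proteome = ["MKAAL", "AAK"]
freqs = dipeptide_frequencies(toy_proteome)
for pair, fraction in sorted(freqs.items()):
    print(f"{pair}: {fraction:.4f} (uniform expectation {UNIFORM_EXPECTATION:.4f})")
```

In this toy input there are six pairs in total and two of them are AlaAla, so its fraction is one third, far above the uniform expectation.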

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
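The unit conversion can be sketched as follows. The two example values are the AlaAla and CysCys entries from the table; the comparison against the uniform expectation of 2.5‰ is an assumption about how "more than expected" is decided, and the function name is hypothetical.

```python
PER_MILLE = 1e-3                 # one per mille as a fraction
UNIFORM_PER_MILLE = 1000 / 400   # 2.5 per mille, i.e. 0.25%


def classify(value_per_mille, expected_per_mille=UNIFORM_PER_MILLE):
    """Convert a per mille value to a fraction and compare it with the expectation."""
    fraction = value_per_mille * PER_MILLE
    if value_per_mille > expected_per_mille:
        label = "overrepresented"
    elif value_per_mille < expected_per_mille:
        label = "underrepresented"
    else:
        label = "as expected"
    return fraction, label


# Values copied from the table: AlaAla and CysCys.
for name, value in {"AlaAla": 9.973, "CysCys": 0.232}.items():
    fraction, label = classify(value)
    print(f"{name}: {value} per mille = {fraction:.6f} ({label})")
```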

For more information see sequence space article on Wikipedia.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 9.973 ± 0.215 | 1.313 ± 0.047 | 6.017 ± 0.131 | 5.913 ± 0.132 | 3.615 ± 0.078 | 6.654 ± 0.142 | 1.876 ± 0.064 | 5.691 ± 0.094 | 5.487 ± 0.122 | 8.649 ± 0.138 | 2.805 ± 0.071 | 2.897 ± 0.083 | 2.901 ± 0.096 | 3.178 ± 0.078 | 3.687 ± 0.082 | 4.432 ± 0.109 | 3.375 ± 0.095 | 7.543 ± 0.139 | 0.708 ± 0.038 | 3.005 ± 0.078 | 0.0 ± 0.0 |
| Cys | 1.129 ± 0.037 | 0.232 ± 0.019 | 0.822 ± 0.037 | 0.812 ± 0.037 | 0.513 ± 0.028 | 1.541 ± 0.066 | 0.346 ± 0.023 | 0.835 ± 0.039 | 0.486 ± 0.028 | 1.091 ± 0.044 | 0.321 ± 0.023 | 0.378 ± 0.026 | 0.745 ± 0.035 | 0.484 ± 0.028 | 0.677 ± 0.033 | 0.723 ± 0.033 | 0.696 ± 0.036 | 1.097 ± 0.049 | 0.104 ± 0.013 | 0.392 ± 0.027 | 0.0 ± 0.0 |
| Asp | 5.835 ± 0.132 | 0.842 ± 0.032 | 4.185 ± 0.118 | 4.607 ± 0.106 | 2.812 ± 0.067 | 4.844 ± 0.135 | 1.597 ± 0.056 | 4.437 ± 0.095 | 3.334 ± 0.076 | 5.719 ± 0.103 | 1.728 ± 0.061 | 2.215 ± 0.065 | 2.792 ± 0.081 | 2.706 ± 0.078 | 3.197 ± 0.088 | 2.881 ± 0.074 | 3.192 ± 0.083 | 4.531 ± 0.091 | 0.65 ± 0.039 | 2.634 ± 0.073 | 0.0 ± 0.0 |
| Glu | 6.262 ± 0.139 | 0.57 ± 0.032 | 3.997 ± 0.101 | 4.23 ± 0.112 | 1.684 ± 0.055 | 3.928 ± 0.093 | 1.24 ± 0.056 | 4.556 ± 0.082 | 5.581 ± 0.111 | 5.139 ± 0.102 | 2.338 ± 0.067 | 3.094 ± 0.079 | 1.966 ± 0.069 | 2.331 ± 0.062 | 3.01 ± 0.074 | 2.837 ± 0.068 | 3.598 ± 0.078 | 3.733 ± 0.088 | 0.449 ± 0.028 | 1.689 ± 0.057 | 0.0 ± 0.0 |
| Phe | 3.082 ± 0.086 | 0.566 ± 0.032 | 2.944 ± 0.086 | 2.541 ± 0.071 | 1.728 ± 0.066 | 3.144 ± 0.08 | 0.792 ± 0.039 | 2.597 ± 0.068 | 2.183 ± 0.069 | 3.276 ± 0.082 | 0.985 ± 0.041 | 1.686 ± 0.053 | 1.42 ± 0.051 | 1.005 ± 0.049 | 1.556 ± 0.057 | 2.45 ± 0.068 | 2.123 ± 0.066 | 2.738 ± 0.072 | 0.41 ± 0.027 | 1.447 ± 0.047 | 0.0 ± 0.0 |
| Gly | 6.062 ± 0.13 | 1.212 ± 0.055 | 4.17 ± 0.099 | 4.173 ± 0.094 | 2.985 ± 0.074 | 5.358 ± 0.129 | 1.642 ± 0.058 | 5.744 ± 0.113 | 4.8 ± 0.087 | 6.084 ± 0.116 | 2.257 ± 0.053 | 2.44 ± 0.064 | 1.892 ± 0.055 | 2.496 ± 0.062 | 3.877 ± 0.091 | 3.965 ± 0.103 | 4.62 ± 0.119 | 5.521 ± 0.113 | 0.709 ± 0.043 | 2.882 ± 0.06 | 0.0 ± 0.0 |
| His | 1.41 ± 0.048 | 0.336 ± 0.026 | 1.26 ± 0.045 | 0.981 ± 0.036 | 1.084 ± 0.046 | 1.499 ± 0.049 | 0.739 ± 0.043 | 1.637 ± 0.054 | 0.955 ± 0.038 | 2.087 ± 0.057 | 0.575 ± 0.035 | 0.773 ± 0.037 | 1.215 ± 0.048 | 0.928 ± 0.035 | 1.119 ± 0.044 | 1.114 ± 0.036 | 1.175 ± 0.049 | 1.482 ± 0.05 | 0.203 ± 0.018 | 0.966 ± 0.034 | 0.0 ± 0.0 |
| Ile | 6.586 ± 0.125 | 1.049 ± 0.041 | 5.313 ± 0.109 | 4.491 ± 0.092 | 2.818 ± 0.08 | 5.397 ± 0.119 | 1.514 ± 0.052 | 4.363 ± 0.108 | 3.844 ± 0.095 | 6.153 ± 0.144 | 1.513 ± 0.057 | 2.736 ± 0.075 | 3.101 ± 0.071 | 2.519 ± 0.069 | 3.328 ± 0.081 | 4.017 ± 0.082 | 3.808 ± 0.108 | 5.309 ± 0.108 | 0.534 ± 0.03 | 2.269 ± 0.065 | 0.0 ± 0.0 |
| Lys | 6.149 ± 0.129 | 0.558 ± 0.036 | 3.872 ± 0.095 | 4.166 ± 0.1 | 1.61 ± 0.058 | 4.158 ± 0.075 | 1.129 ± 0.044 | 4.686 ± 0.098 | 5.418 ± 0.115 | 4.739 ± 0.091 | 2.158 ± 0.061 | 3.108 ± 0.079 | 2.26 ± 0.06 | 2.412 ± 0.073 | 3.544 ± 0.074 | 3.267 ± 0.084 | 4.091 ± 0.091 | 4.072 ± 0.091 | 0.62 ± 0.032 | 2.197 ± 0.071 | 0.0 ± 0.0 |
| Leu | 7.348 ± 0.117 | 1.171 ± 0.049 | 5.674 ± 0.104 | 4.827 ± 0.099 | 3.381 ± 0.095 | 5.797 ± 0.125 | 1.585 ± 0.058 | 6.351 ± 0.128 | 6.234 ± 0.12 | 6.891 ± 0.152 | 2.543 ± 0.066 | 3.783 ± 0.077 | 3.795 ± 0.101 | 2.44 ± 0.064 | 3.837 ± 0.099 | 6.22 ± 0.122 | 5.319 ± 0.115 | 5.207 ± 0.108 | 0.676 ± 0.041 | 2.899 ± 0.077 | 0.0 ± 0.0 |
| Met | 2.99 ± 0.074 | 0.309 ± 0.018 | 1.77 ± 0.057 | 1.514 ± 0.051 | 0.862 ± 0.04 | 1.918 ± 0.06 | 0.558 ± 0.031 | 2.207 ± 0.069 | 2.104 ± 0.058 | 2.413 ± 0.065 | 0.95 ± 0.045 | 1.424 ± 0.052 | 1.331 ± 0.043 | 1.007 ± 0.041 | 1.545 ± 0.054 | 1.681 ± 0.054 | 1.941 ± 0.056 | 1.785 ± 0.051 | 0.168 ± 0.017 | 0.709 ± 0.036 | 0.0 ± 0.0 |
| Asn | 3.538 ± 0.083 | 0.541 ± 0.033 | 2.242 ± 0.056 | 2.066 ± 0.056 | 1.449 ± 0.052 | 3.012 ± 0.076 | 0.948 ± 0.041 | 2.902 ± 0.085 | 2.296 ± 0.078 | 3.462 ± 0.08 | 1.158 ± 0.049 | 1.597 ± 0.063 | 1.926 ± 0.061 | 1.797 ± 0.057 | 2.133 ± 0.064 | 1.892 ± 0.061 | 2.155 ± 0.069 | 2.746 ± 0.075 | 0.403 ± 0.028 | 1.482 ± 0.055 | 0.0 ± 0.0 |
| Pro | 3.195 ± 0.085 | 0.506 ± 0.029 | 2.914 ± 0.079 | 3.62 ± 0.1 | 1.634 ± 0.052 | 2.934 ± 0.071 | 0.755 ± 0.031 | 2.6 ± 0.062 | 2.546 ± 0.063 | 2.956 ± 0.069 | 1.071 ± 0.046 | 1.412 ± 0.057 | 0.901 ± 0.045 | 1.116 ± 0.044 | 1.306 ± 0.05 | 1.993 ± 0.058 | 1.909 ± 0.053 | 3.178 ± 0.065 | 0.334 ± 0.025 | 1.492 ± 0.055 | 0.0 ± 0.0 |
| Gln | 3.301 ± 0.083 | 0.417 ± 0.026 | 1.988 ± 0.063 | 1.877 ± 0.06 | 1.291 ± 0.054 | 1.981 ± 0.057 | 0.743 ± 0.035 | 3.072 ± 0.08 | 2.787 ± 0.074 | 3.023 ± 0.084 | 1.234 ± 0.046 | 1.928 ± 0.057 | 1.207 ± 0.051 | 1.269 ± 0.049 | 1.709 ± 0.061 | 1.879 ± 0.057 | 2.109 ± 0.068 | 2.313 ± 0.064 | 0.311 ± 0.023 | 1.324 ± 0.048 | 0.0 ± 0.0 |
| Arg | 3.709 ± 0.094 | 0.667 ± 0.036 | 2.844 ± 0.079 | 3.282 ± 0.084 | 1.908 ± 0.056 | 2.97 ± 0.075 | 1.165 ± 0.052 | 3.366 ± 0.072 | 3.336 ± 0.085 | 4.528 ± 0.094 | 1.534 ± 0.044 | 1.815 ± 0.057 | 1.802 ± 0.057 | 2.146 ± 0.066 | 3.022 ± 0.079 | 2.269 ± 0.056 | 2.539 ± 0.065 | 3.15 ± 0.078 | 0.474 ± 0.033 | 1.966 ± 0.051 | 0.0 ± 0.0 |
| Ser | 4.738 ± 0.105 | 0.679 ± 0.038 | 3.592 ± 0.09 | 3.608 ± 0.089 | 2.207 ± 0.061 | 4.847 ± 0.093 | 1.176 ± 0.045 | 3.652 ± 0.087 | 3.257 ± 0.079 | 4.392 ± 0.1 | 1.422 ± 0.049 | 1.911 ± 0.065 | 1.795 ± 0.058 | 1.882 ± 0.057 | 2.874 ± 0.078 | 3.311 ± 0.123 | 2.86 ± 0.085 | 3.948 ± 0.086 | 0.496 ± 0.03 | 1.696 ± 0.056 | 0.0 ± 0.0 |
| Thr | 5.427 ± 0.177 | 0.672 ± 0.038 | 3.39 ± 0.076 | 3.188 ± 0.09 | 2.126 ± 0.065 | 4.771 ± 0.097 | 1.116 ± 0.045 | 3.953 ± 0.094 | 2.934 ± 0.103 | 5.008 ± 0.105 | 1.476 ± 0.052 | 1.955 ± 0.066 | 2.637 ± 0.065 | 1.766 ± 0.052 | 2.286 ± 0.061 | 2.872 ± 0.088 | 3.067 ± 0.084 | 4.454 ± 0.126 | 0.434 ± 0.027 | 1.825 ± 0.067 | 0.0 ± 0.0 |
| Val | 5.521 ± 0.104 | 1.15 ± 0.049 | 4.618 ± 0.096 | 4.037 ± 0.093 | 2.884 ± 0.07 | 4.815 ± 0.107 | 1.455 ± 0.057 | 5.313 ± 0.103 | 4.348 ± 0.096 | 6.339 ± 0.128 | 2.032 ± 0.065 | 2.822 ± 0.067 | 2.921 ± 0.066 | 2.287 ± 0.06 | 3.282 ± 0.087 | 4.244 ± 0.093 | 4.313 ± 0.157 | 5.504 ± 0.112 | 0.523 ± 0.028 | 2.514 ± 0.07 | 0.0 ± 0.0 |
| Trp | 0.617 ± 0.032 | 0.131 ± 0.017 | 0.55 ± 0.031 | 0.435 ± 0.027 | 0.395 ± 0.026 | 0.553 ± 0.031 | 0.267 ± 0.023 | 0.624 ± 0.035 | 0.492 ± 0.031 | 0.862 ± 0.041 | 0.318 ± 0.021 | 0.328 ± 0.022 | 0.286 ± 0.023 | 0.491 ± 0.029 | 0.491 ± 0.029 | 0.4 ± 0.025 | 0.477 ± 0.03 | 0.494 ± 0.029 | 0.108 ± 0.013 | 0.292 ± 0.024 | 0.0 ± 0.0 |
| Tyr | 2.844 ± 0.063 | 0.503 ± 0.027 | 2.575 ± 0.077 | 1.864 ± 0.059 | 1.689 ± 0.061 | 2.771 ± 0.066 | 0.897 ± 0.041 | 2.076 ± 0.064 | 1.793 ± 0.067 | 3.165 ± 0.081 | 0.768 ± 0.037 | 1.444 ± 0.055 | 1.481 ± 0.049 | 1.503 ± 0.051 | 2.012 ± 0.074 | 1.896 ± 0.062 | 1.99 ± 0.064 | 2.118 ± 0.064 | 0.296 ± 0.021 | 1.466 ± 0.055 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 1822 proteins (595009 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
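A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported as the error. Using the sample standard deviation as the error measure, the fixed random seed, the function names, and the toy sequences are all assumptions; the page only states that 100 replicates were drawn at the protein level.

```python
import random
import statistics


def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille of all pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # seeded only to make the sketch reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy input, not taken from the proteome on this page.
toy_proteome = ["MKAAL", "AAK", "MLLAAG", "MKK"]
point = dipeptide_per_mille(toy_proteome, "AA")
error = bootstrap_error(toy_proteome, "AA")
print(f"AlaAla: {point:.3f} ± {error:.3f} per mille")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.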

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski