Amino acid dipeptide frequency for Veillonella sp. AS16

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 400 equally likely, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility, dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.
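As a rough illustration of how such frequencies could be computed, the sketch below counts overlapping residue pairs within each protein and normalises by the total number of pairs. The function name `dipeptide_permille`, the one-letter input alphabet, the mapping of every non-standard letter to `X` (Xaa), and the normalisation itself are assumptions made for this example; the page does not state how Proteome-pI performs the calculation.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Frequency each dipeptide would have if all 400 were equally likely.
UNIFORM_PERMILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are overlapping and never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        for a, b in zip(seq, seq[1:]):
            # Assumption: anything outside the 20 standard letters is Xaa.
            a = a if a in STANDARD else "X"
            b = b if b in STANDARD else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real proteome data.
toy = ["MKVLAA", "AAGX"]
freqs = dipeptide_permille(toy)
for pair, value in sorted(freqs.items()):
    flag = "over" if value > UNIFORM_PERMILLE else "under"
    print(f"{pair}: {value:.1f} per mille ({flag} the uniform expectation)")
```

On a real proteome the input would be the full list of protein sequences; the toy list here only shows the mechanics.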

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
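The unit conversion and the comparison with the uniform expectation can be sketched as follows. The helper names are hypothetical, and using exactly 2.5‰ (1/400) as the threshold between over- and underrepresented is an assumption; the page does not say which cutoff drives the red and blue colouring.

```python
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5 per mille if all 400 dipeptides were equally likely


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def classify(value_permille, expected=EXPECTED_PERMILLE):
    """Label a value relative to the assumed uniform expectation."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


# Two values taken from the table: AlaAla and TrpCys.
ala_ala = 7.3
trp_cys = 0.072

print(permille_to_fraction(ala_ala), classify(ala_ala))
print(permille_to_fraction(trp_cys), classify(trp_cys))
```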

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.3 ± 0.167
AlaCys: 0.851 ± 0.044
AlaAsp: 4.65 ± 0.099
AlaGlu: 4.785 ± 0.136
AlaPhe: 3.335 ± 0.089
AlaGly: 6.289 ± 0.131
AlaHis: 1.731 ± 0.053
AlaIle: 6.343 ± 0.125
AlaLys: 5.305 ± 0.105
AlaLeu: 8.385 ± 0.152
AlaMet: 2.687 ± 0.076
AlaAsn: 3.078 ± 0.102
AlaPro: 2.51 ± 0.072
AlaGln: 2.868 ± 0.091
AlaArg: 3.394 ± 0.098
AlaSer: 3.993 ± 0.084
AlaThr: 4.268 ± 0.101
AlaVal: 7.035 ± 0.12
AlaTrp: 0.653 ± 0.036
AlaTyr: 2.835 ± 0.072
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.777 ± 0.044
CysCys: 0.14 ± 0.016
CysAsp: 0.578 ± 0.032
CysGlu: 0.611 ± 0.034
CysPhe: 0.365 ± 0.026
CysGly: 1.073 ± 0.057
CysHis: 0.285 ± 0.022
CysIle: 0.894 ± 0.04
CysLys: 0.58 ± 0.031
CysLeu: 0.819 ± 0.04
CysMet: 0.285 ± 0.021
CysAsn: 0.421 ± 0.027
CysPro: 0.501 ± 0.032
CysGln: 0.264 ± 0.023
CysArg: 0.461 ± 0.027
CysSer: 0.559 ± 0.03
CysThr: 0.613 ± 0.03
CysVal: 0.699 ± 0.039
CysTrp: 0.08 ± 0.013
CysTyr: 0.342 ± 0.027
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.778 ± 0.122
AspCys: 0.611 ± 0.033
AspAsp: 3.279 ± 0.084
AspGlu: 4.456 ± 0.092
AspPhe: 2.32 ± 0.068
AspGly: 4.528 ± 0.142
AspHis: 1.148 ± 0.045
AspIle: 4.888 ± 0.103
AspLys: 3.406 ± 0.085
AspLeu: 4.544 ± 0.103
AspMet: 2.056 ± 0.059
AspAsn: 2.386 ± 0.085
AspPro: 1.964 ± 0.066
AspGln: 1.516 ± 0.051
AspArg: 2.428 ± 0.069
AspSer: 2.98 ± 0.074
AspThr: 3.55 ± 0.092
AspVal: 4.966 ± 0.105
AspTrp: 0.571 ± 0.037
AspTyr: 2.386 ± 0.069
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.693 ± 0.137
GluCys: 0.521 ± 0.03
GluAsp: 3.501 ± 0.085
GluGlu: 4.6 ± 0.117
GluPhe: 2.212 ± 0.056
GluGly: 4.168 ± 0.097
GluHis: 1.534 ± 0.048
GluIle: 4.055 ± 0.098
GluLys: 3.738 ± 0.092
GluLeu: 6.271 ± 0.125
GluMet: 1.736 ± 0.059
GluAsn: 2.634 ± 0.066
GluPro: 2.061 ± 0.065
GluGln: 2.688 ± 0.071
GluArg: 3.684 ± 0.093
GluSer: 3.775 ± 0.087
GluThr: 3.097 ± 0.067
GluVal: 4.343 ± 0.097
GluTrp: 0.599 ± 0.031
GluTyr: 2.25 ± 0.06
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.661 ± 0.081
PheCys: 0.461 ± 0.028
PheAsp: 2.552 ± 0.068
PheGlu: 1.998 ± 0.055
PhePhe: 1.633 ± 0.051
PheGly: 3.286 ± 0.085
PheHis: 0.753 ± 0.035
PheIle: 3.003 ± 0.087
PheLys: 2.089 ± 0.057
PheLeu: 3.254 ± 0.087
PheMet: 1.202 ± 0.046
PheAsn: 1.885 ± 0.06
PhePro: 1.26 ± 0.046
PheGln: 0.945 ± 0.038
PheArg: 1.35 ± 0.06
PheSer: 2.447 ± 0.075
PheThr: 2.516 ± 0.075
PheVal: 2.802 ± 0.089
PheTrp: 0.386 ± 0.025
PheTyr: 1.307 ± 0.049
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.455 ± 0.126
GlyCys: 0.893 ± 0.051
GlyAsp: 4.14 ± 0.105
GlyGlu: 4.011 ± 0.089
GlyPhe: 3.144 ± 0.076
GlyGly: 5.604 ± 0.136
GlyHis: 1.85 ± 0.063
GlyIle: 6.284 ± 0.109
GlyLys: 4.601 ± 0.104
GlyLeu: 7.017 ± 0.129
GlyMet: 2.259 ± 0.076
GlyAsn: 3.04 ± 0.113
GlyPro: 2.147 ± 0.059
GlyGln: 2.379 ± 0.07
GlyArg: 3.509 ± 0.089
GlySer: 4.229 ± 0.09
GlyThr: 4.928 ± 0.117
GlyVal: 5.733 ± 0.105
GlyTrp: 0.618 ± 0.032
GlyTyr: 3.048 ± 0.066
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.342 ± 0.051
HisCys: 0.306 ± 0.023
HisAsp: 1.284 ± 0.054
HisGlu: 1.303 ± 0.047
HisPhe: 0.842 ± 0.041
HisGly: 1.653 ± 0.055
HisHis: 0.68 ± 0.039
HisIle: 2.086 ± 0.066
HisLys: 1.338 ± 0.048
HisLeu: 1.756 ± 0.058
HisMet: 0.781 ± 0.037
HisAsn: 1.053 ± 0.044
HisPro: 1.022 ± 0.045
HisGln: 0.695 ± 0.035
HisArg: 1.022 ± 0.043
HisSer: 1.191 ± 0.052
HisThr: 1.27 ± 0.049
HisVal: 1.747 ± 0.053
HisTrp: 0.211 ± 0.021
HisTyr: 0.767 ± 0.041
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.247 ± 0.115
IleCys: 0.859 ± 0.044
IleAsp: 4.795 ± 0.101
IleGlu: 4.68 ± 0.104
IlePhe: 2.617 ± 0.086
IleGly: 6.186 ± 0.123
IleHis: 1.543 ± 0.054
IleIle: 5.756 ± 0.128
IleLys: 3.883 ± 0.078
IleLeu: 6.593 ± 0.125
IleMet: 2.0 ± 0.067
IleAsn: 3.233 ± 0.077
IlePro: 3.284 ± 0.074
IleGln: 2.252 ± 0.063
IleArg: 3.165 ± 0.087
IleSer: 4.439 ± 0.092
IleThr: 4.598 ± 0.095
IleVal: 5.84 ± 0.116
IleTrp: 0.554 ± 0.034
IleTyr: 2.341 ± 0.071
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.417 ± 0.128
LysCys: 0.342 ± 0.021
LysAsp: 3.864 ± 0.13
LysGlu: 4.687 ± 0.103
LysPhe: 1.563 ± 0.052
LysGly: 4.149 ± 0.084
LysHis: 1.246 ± 0.048
LysIle: 3.371 ± 0.081
LysLys: 3.805 ± 0.112
LysLeu: 4.43 ± 0.083
LysMet: 1.853 ± 0.064
LysAsn: 2.809 ± 0.095
LysPro: 2.271 ± 0.066
LysGln: 2.301 ± 0.059
LysArg: 3.125 ± 0.077
LysSer: 3.363 ± 0.085
LysThr: 3.373 ± 0.09
LysVal: 4.163 ± 0.086
LysTrp: 0.507 ± 0.035
LysTyr: 1.871 ± 0.068
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.782 ± 0.139
LeuCys: 1.066 ± 0.042
LeuAsp: 5.281 ± 0.105
LeuGlu: 5.538 ± 0.11
LeuPhe: 3.548 ± 0.108
LeuGly: 7.302 ± 0.143
LeuHis: 2.047 ± 0.061
LeuIle: 5.85 ± 0.131
LeuLys: 4.965 ± 0.112
LeuLeu: 8.221 ± 0.186
LeuMet: 2.51 ± 0.07
LeuAsn: 3.586 ± 0.082
LeuPro: 3.724 ± 0.081
LeuGln: 3.422 ± 0.081
LeuArg: 3.99 ± 0.099
LeuSer: 5.903 ± 0.108
LeuThr: 5.11 ± 0.095
LeuVal: 6.336 ± 0.129
LeuTrp: 0.858 ± 0.042
LeuTyr: 2.977 ± 0.077
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.889 ± 0.071
MetCys: 0.246 ± 0.02
MetAsp: 1.754 ± 0.052
MetGlu: 1.822 ± 0.069
MetPhe: 0.845 ± 0.039
MetGly: 2.292 ± 0.06
MetHis: 0.589 ± 0.037
MetIle: 2.021 ± 0.067
MetLys: 2.11 ± 0.062
MetLeu: 2.535 ± 0.072
MetMet: 0.928 ± 0.044
MetAsn: 1.584 ± 0.047
MetPro: 1.275 ± 0.047
MetGln: 0.88 ± 0.042
MetArg: 1.326 ± 0.051
MetSer: 1.911 ± 0.064
MetThr: 1.825 ± 0.058
MetVal: 2.102 ± 0.065
MetTrp: 0.213 ± 0.018
MetTyr: 0.935 ± 0.041
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.209 ± 0.084
AsnCys: 0.376 ± 0.029
AsnAsp: 2.318 ± 0.074
AsnGlu: 2.495 ± 0.076
AsnPhe: 1.488 ± 0.053
AsnGly: 3.466 ± 0.124
AsnHis: 1.069 ± 0.05
AsnIle: 3.352 ± 0.076
AsnLys: 2.493 ± 0.096
AsnLeu: 3.654 ± 0.088
AsnMet: 1.155 ± 0.042
AsnAsn: 1.977 ± 0.094
AsnPro: 2.24 ± 0.064
AsnGln: 1.441 ± 0.046
AsnArg: 2.294 ± 0.069
AsnSer: 2.234 ± 0.064
AsnThr: 2.481 ± 0.085
AsnVal: 3.16 ± 0.086
AsnTrp: 0.447 ± 0.025
AsnTyr: 1.577 ± 0.052
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.744 ± 0.075
ProCys: 0.342 ± 0.028
ProAsp: 2.276 ± 0.071
ProGlu: 2.905 ± 0.078
ProPhe: 1.511 ± 0.052
ProGly: 2.425 ± 0.065
ProHis: 0.95 ± 0.04
ProIle: 2.589 ± 0.065
ProLys: 2.098 ± 0.069
ProLeu: 3.391 ± 0.085
ProMet: 1.087 ± 0.042
ProAsn: 1.668 ± 0.053
ProPro: 0.943 ± 0.046
ProGln: 1.322 ± 0.049
ProArg: 1.289 ± 0.058
ProSer: 1.934 ± 0.063
ProThr: 2.29 ± 0.066
ProVal: 3.309 ± 0.081
ProTrp: 0.344 ± 0.024
ProTyr: 1.567 ± 0.047
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.975 ± 0.073
GlnCys: 0.341 ± 0.029
GlnAsp: 1.871 ± 0.054
GlnGlu: 2.269 ± 0.068
GlnPhe: 1.284 ± 0.054
GlnGly: 2.246 ± 0.069
GlnHis: 0.781 ± 0.04
GlnIle: 2.292 ± 0.063
GlnLys: 1.728 ± 0.053
GlnLeu: 3.192 ± 0.081
GlnMet: 1.02 ± 0.046
GlnAsn: 1.399 ± 0.055
GlnPro: 1.095 ± 0.043
GlnGln: 1.574 ± 0.074
GlnArg: 1.66 ± 0.062
GlnSer: 1.988 ± 0.061
GlnThr: 1.668 ± 0.055
GlnVal: 2.426 ± 0.062
GlnTrp: 0.344 ± 0.024
GlnTyr: 1.382 ± 0.052
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.162 ± 0.089
ArgCys: 0.442 ± 0.03
ArgAsp: 2.542 ± 0.081
ArgGlu: 3.101 ± 0.087
ArgPhe: 1.778 ± 0.061
ArgGly: 2.702 ± 0.071
ArgHis: 1.08 ± 0.045
ArgIle: 3.89 ± 0.088
ArgLys: 2.634 ± 0.073
ArgLeu: 4.502 ± 0.097
ArgMet: 1.485 ± 0.049
ArgAsn: 2.138 ± 0.059
ArgPro: 1.598 ± 0.056
ArgGln: 1.728 ± 0.071
ArgArg: 2.393 ± 0.08
ArgSer: 2.325 ± 0.056
ArgThr: 2.4 ± 0.067
ArgVal: 3.115 ± 0.081
ArgTrp: 0.421 ± 0.027
ArgTyr: 1.825 ± 0.061
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.264 ± 0.097
SerCys: 0.615 ± 0.034
SerAsp: 3.237 ± 0.086
SerGlu: 3.029 ± 0.07
SerPhe: 2.409 ± 0.07
SerGly: 4.75 ± 0.108
SerHis: 1.319 ± 0.049
SerIle: 4.645 ± 0.108
SerLys: 3.104 ± 0.077
SerLeu: 5.478 ± 0.117
SerMet: 1.794 ± 0.054
SerAsn: 2.302 ± 0.077
SerPro: 1.824 ± 0.053
SerGln: 1.775 ± 0.055
SerArg: 2.461 ± 0.077
SerSer: 3.13 ± 0.09
SerThr: 3.214 ± 0.085
SerVal: 4.795 ± 0.101
SerTrp: 0.557 ± 0.035
SerTyr: 2.126 ± 0.062
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.931 ± 0.105
ThrCys: 0.564 ± 0.031
ThrAsp: 3.338 ± 0.086
ThrGlu: 3.122 ± 0.078
ThrPhe: 2.259 ± 0.074
ThrGly: 4.643 ± 0.116
ThrHis: 1.188 ± 0.045
ThrIle: 4.586 ± 0.099
ThrLys: 3.186 ± 0.079
ThrLeu: 5.214 ± 0.103
ThrMet: 1.665 ± 0.054
ThrAsn: 2.325 ± 0.088
ThrPro: 2.491 ± 0.073
ThrGln: 1.579 ± 0.053
ThrArg: 2.177 ± 0.069
ThrSer: 3.249 ± 0.086
ThrThr: 3.366 ± 0.095
ThrVal: 5.4 ± 0.111
ThrTrp: 0.552 ± 0.03
ThrTyr: 2.149 ± 0.065
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.453 ± 0.125
ValCys: 0.88 ± 0.046
ValAsp: 4.61 ± 0.107
ValGlu: 4.849 ± 0.097
ValPhe: 2.762 ± 0.081
ValGly: 5.665 ± 0.104
ValHis: 1.536 ± 0.05
ValIle: 5.463 ± 0.101
ValLys: 4.739 ± 0.126
ValLeu: 6.963 ± 0.115
ValMet: 2.22 ± 0.071
ValAsn: 3.295 ± 0.094
ValPro: 3.195 ± 0.075
ValGln: 2.519 ± 0.054
ValArg: 3.316 ± 0.075
ValSer: 4.856 ± 0.111
ValThr: 4.82 ± 0.128
ValVal: 6.168 ± 0.121
ValTrp: 0.603 ± 0.032
ValTyr: 2.557 ± 0.062
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.683 ± 0.036
TrpCys: 0.072 ± 0.01
TrpAsp: 0.543 ± 0.031
TrpGlu: 0.449 ± 0.027
TrpPhe: 0.451 ± 0.032
TrpGly: 0.624 ± 0.032
TrpHis: 0.253 ± 0.022
TrpIle: 0.652 ± 0.035
TrpLys: 0.501 ± 0.029
TrpLeu: 0.891 ± 0.042
TrpMet: 0.283 ± 0.024
TrpAsn: 0.493 ± 0.032
TrpPro: 0.3 ± 0.025
TrpGln: 0.363 ± 0.026
TrpArg: 0.445 ± 0.027
TrpSer: 0.484 ± 0.028
TrpThr: 0.484 ± 0.029
TrpVal: 0.561 ± 0.032
TrpTrp: 0.121 ± 0.016
TrpTyr: 0.344 ± 0.03
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.533 ± 0.066
TyrCys: 0.421 ± 0.028
TyrAsp: 2.348 ± 0.062
TyrGlu: 2.451 ± 0.062
TyrPhe: 1.424 ± 0.051
TyrGly: 2.84 ± 0.072
TyrHis: 0.769 ± 0.036
TyrIle: 2.846 ± 0.083
TyrLys: 2.205 ± 0.068
TyrLeu: 2.905 ± 0.071
TyrMet: 1.078 ± 0.04
TyrAsn: 1.618 ± 0.059
TyrPro: 1.279 ± 0.052
TyrGln: 1.022 ± 0.042
TyrArg: 1.771 ± 0.056
TyrSer: 1.838 ± 0.056
TyrThr: 2.114 ± 0.066
TyrVal: 2.771 ± 0.071
TyrTrp: 0.356 ± 0.025
TyrTyr: 1.357 ± 0.057
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 1736 proteins (572448 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
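Protein-level bootstrapping can be sketched roughly as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates serves as the error. The function names, the fixed seed, and the choice to report the mean and sample standard deviation across replicates are assumptions for illustration; the note above only states that 100 bootstrap replicates were drawn at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs within one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_permille(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each replicate resamples whole proteins with replacement, so the
    resampling unit is the protein, not the individual dipeptide.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter gives 0 for absent pairs
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input, not real proteome data.
toy = ["MKVLAA", "AAGX", "GGAA", "MLLKAA"]
mean, sd = bootstrap_permille(toy, "AA")
print(f"AA: {mean:.1f} ± {sd:.1f} per mille")
```

Resampling at the protein level keeps the dipeptides of one protein together, which respects the fact that neighbouring pairs within a sequence are not independent observations.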

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski