Amino acid dipeptide frequency for Mycoplasmataceae bacterium RV_VA103A

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
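
As an illustration of the calculation described above, the following is a minimal sketch of how dipeptide frequencies in per mille could be computed from a set of protein sequences and compared with the uniform expectation of 2.5‰. It is not the code used to build this table. The function name `dipeptide_per_mille`, the toy sequences, and the choice to count overlapping pairs within each protein (never across protein boundaries) are assumptions made for the example. The sketch uses one-letter codes, whereas the table uses three-letter codes.

```python
from collections import Counter

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: pairs are counted inside each protein only, so the last
    residue of one protein never pairs with the first residue of the next.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation over the 400 standard dipeptides: 0.25% = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400

# Hypothetical toy sequences (one-letter codes), not taken from this proteome.
toy_proteins = ["MKKLE", "KKEL"]
freqs = dipeptide_per_mille(toy_proteins)

# Dipeptides above the uniform expectation (these would be "red" in the table).
over = sorted(p for p, f in freqs.items() if f > UNIFORM_EXPECTATION)
```

With real data, `proteins` would be the full list of sequences in the proteome, and a value such as 15.188 for LysLys would correspond to a fraction of 0.015188.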

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
2.521AlaAla: 2.521 ± 0.137
0.54AlaCys: 0.54 ± 0.045
2.23AlaAsp: 2.23 ± 0.097
4.602AlaGlu: 4.602 ± 0.128
1.62AlaPhe: 1.62 ± 0.087
2.492AlaGly: 2.492 ± 0.084
0.669AlaHis: 0.669 ± 0.04
3.102AlaIle: 3.102 ± 0.1
5.183AlaLys: 5.183 ± 0.154
3.78AlaLeu: 3.78 ± 0.121
0.669AlaMet: 0.669 ± 0.044
2.906AlaAsn: 2.906 ± 0.1
1.253AlaPro: 1.253 ± 0.069
2.057AlaGln: 2.057 ± 0.084
2.213AlaArg: 2.213 ± 0.085
2.454AlaSer: 2.454 ± 0.091
2.477AlaThr: 2.477 ± 0.091
2.277AlaVal: 2.277 ± 0.083
0.945AlaTrp: 0.945 ± 0.054
1.614AlaTyr: 1.614 ± 0.065
0.0AlaXaa: 0.0 ± 0.0
Cys
0.478CysAla: 0.478 ± 0.037
0.273CysCys: 0.273 ± 0.027
0.622CysAsp: 0.622 ± 0.047
0.828CysGlu: 0.828 ± 0.05
0.704CysPhe: 0.704 ± 0.046
0.916CysGly: 0.916 ± 0.058
0.364CysHis: 0.364 ± 0.031
0.452CysIle: 0.452 ± 0.036
0.913CysLys: 0.913 ± 0.057
1.561CysLeu: 1.561 ± 0.088
0.135CysMet: 0.135 ± 0.022
0.493CysAsn: 0.493 ± 0.04
0.657CysPro: 0.657 ± 0.051
1.441CysGln: 1.441 ± 0.07
0.666CysArg: 0.666 ± 0.048
1.118CysSer: 1.118 ± 0.072
0.185CysThr: 0.185 ± 0.023
0.405CysVal: 0.405 ± 0.03
0.487CysTrp: 0.487 ± 0.04
0.596CysTyr: 0.596 ± 0.048
0.0CysXaa: 0.0 ± 0.0
Asp
1.555AspAla: 1.555 ± 0.084
0.854AspCys: 0.854 ± 0.06
1.899AspAsp: 1.899 ± 0.087
3.481AspGlu: 3.481 ± 0.125
2.791AspPhe: 2.791 ± 0.106
1.779AspGly: 1.779 ± 0.075
0.54AspHis: 0.54 ± 0.043
3.413AspIle: 3.413 ± 0.129
5.67AspLys: 5.67 ± 0.155
5.318AspLeu: 5.318 ± 0.156
0.575AspMet: 0.575 ± 0.045
3.507AspAsn: 3.507 ± 0.114
1.732AspPro: 1.732 ± 0.087
1.749AspGln: 1.749 ± 0.076
1.726AspArg: 1.726 ± 0.071
2.025AspSer: 2.025 ± 0.072
1.735AspThr: 1.735 ± 0.074
1.139AspVal: 1.139 ± 0.065
1.209AspTrp: 1.209 ± 0.052
2.812AspTyr: 2.812 ± 0.109
0.0AspXaa: 0.0 ± 0.0
Glu
3.909GluAla: 3.909 ± 0.128
0.792GluCys: 0.792 ± 0.052
3.073GluAsp: 3.073 ± 0.103
9.306GluGlu: 9.306 ± 0.284
2.885GluPhe: 2.885 ± 0.108
3.44GluGly: 3.44 ± 0.104
1.007GluHis: 1.007 ± 0.048
8.341GluIle: 8.341 ± 0.168
12.878GluLys: 12.878 ± 0.26
9.77GluLeu: 9.77 ± 0.198
1.517GluMet: 1.517 ± 0.065
5.873GluAsn: 5.873 ± 0.144
1.875GluPro: 1.875 ± 0.094
4.264GluGln: 4.264 ± 0.167
4.649GluArg: 4.649 ± 0.135
2.929GluSer: 2.929 ± 0.093
3.639GluThr: 3.639 ± 0.09
4.426GluVal: 4.426 ± 0.105
1.62GluTrp: 1.62 ± 0.079
2.838GluTyr: 2.838 ± 0.097
0.0GluXaa: 0.0 ± 0.0
Phe
2.316PheAla: 2.316 ± 0.078
1.018PheCys: 1.018 ± 0.064
1.867PheAsp: 1.867 ± 0.084
2.122PheGlu: 2.122 ± 0.076
2.923PhePhe: 2.923 ± 0.118
1.993PheGly: 1.993 ± 0.079
0.775PheHis: 0.775 ± 0.049
3.07PheIle: 3.07 ± 0.105
3.099PheLys: 3.099 ± 0.104
5.174PheLeu: 5.174 ± 0.129
0.549PheMet: 0.549 ± 0.04
2.377PheAsn: 2.377 ± 0.088
1.773PhePro: 1.773 ± 0.065
1.676PheGln: 1.676 ± 0.074
1.867PheArg: 1.867 ± 0.066
3.689PheSer: 3.689 ± 0.141
2.586PheThr: 2.586 ± 0.096
1.867PheVal: 1.867 ± 0.074
1.112PheTrp: 1.112 ± 0.065
2.06PheTyr: 2.06 ± 0.096
0.0PheXaa: 0.0 ± 0.0
Gly
1.978GlyAla: 1.978 ± 0.082
0.569GlyCys: 0.569 ± 0.04
2.075GlyAsp: 2.075 ± 0.084
4.303GlyGlu: 4.303 ± 0.112
2.005GlyPhe: 2.005 ± 0.095
3.228GlyGly: 3.228 ± 0.111
0.836GlyHis: 0.836 ± 0.049
3.583GlyIle: 3.583 ± 0.104
5.453GlyLys: 5.453 ± 0.158
4.408GlyLeu: 4.408 ± 0.118
0.751GlyMet: 0.751 ± 0.055
2.677GlyAsn: 2.677 ± 0.092
0.93GlyPro: 0.93 ± 0.058
2.195GlyGln: 2.195 ± 0.082
2.298GlyArg: 2.298 ± 0.085
2.952GlySer: 2.952 ± 0.099
2.157GlyThr: 2.157 ± 0.08
2.571GlyVal: 2.571 ± 0.099
1.08GlyTrp: 1.08 ± 0.056
1.858GlyTyr: 1.858 ± 0.08
0.0GlyXaa: 0.0 ± 0.0
His
0.643HisAla: 0.643 ± 0.047
0.329HisCys: 0.329 ± 0.029
0.649HisAsp: 0.649 ± 0.047
0.983HisGlu: 0.983 ± 0.058
0.963HisPhe: 0.963 ± 0.049
0.775HisGly: 0.775 ± 0.045
0.376HisHis: 0.376 ± 0.036
0.913HisIle: 0.913 ± 0.059
1.101HisLys: 1.101 ± 0.055
1.773HisLeu: 1.773 ± 0.073
0.167HisMet: 0.167 ± 0.018
0.922HisAsn: 0.922 ± 0.054
0.789HisPro: 0.789 ± 0.058
1.057HisGln: 1.057 ± 0.059
0.619HisArg: 0.619 ± 0.047
1.077HisSer: 1.077 ± 0.057
0.616HisThr: 0.616 ± 0.044
0.517HisVal: 0.517 ± 0.042
0.379HisTrp: 0.379 ± 0.032
0.901HisTyr: 0.901 ± 0.056
0.0HisXaa: 0.0 ± 0.0
Ile
3.733IleAla: 3.733 ± 0.1
1.051IleCys: 1.051 ± 0.055
3.727IleAsp: 3.727 ± 0.104
5.324IleGlu: 5.324 ± 0.125
3.789IlePhe: 3.789 ± 0.124
3.269IleGly: 3.269 ± 0.084
1.112IleHis: 1.112 ± 0.071
6.847IleIle: 6.847 ± 0.158
8.32IleLys: 8.32 ± 0.174
6.377IleLeu: 6.377 ± 0.146
1.101IleMet: 1.101 ± 0.054
5.242IleAsn: 5.242 ± 0.126
2.509IlePro: 2.509 ± 0.077
2.724IleGln: 2.724 ± 0.084
3.155IleArg: 3.155 ± 0.098
5.083IleSer: 5.083 ± 0.133
4.528IleThr: 4.528 ± 0.115
3.399IleVal: 3.399 ± 0.108
1.189IleTrp: 1.189 ± 0.058
2.771IleTyr: 2.771 ± 0.089
0.0IleXaa: 0.0 ± 0.0
Lys
4.185LysAla: 4.185 ± 0.134
1.106LysCys: 1.106 ± 0.069
5.162LysAsp: 5.162 ± 0.132
12.908LysGlu: 12.908 ± 0.257
3.592LysPhe: 3.592 ± 0.108
4.825LysGly: 4.825 ± 0.13
1.429LysHis: 1.429 ± 0.073
9.4LysIle: 9.4 ± 0.176
15.188LysLys: 15.188 ± 0.287
10.501LysLeu: 10.501 ± 0.188
2.324LysMet: 2.324 ± 0.08
8.153LysAsn: 8.153 ± 0.168
2.809LysPro: 2.809 ± 0.117
4.796LysGln: 4.796 ± 0.151
4.675LysArg: 4.675 ± 0.129
5.623LysSer: 5.623 ± 0.143
4.901LysThr: 4.901 ± 0.127
4.916LysVal: 4.916 ± 0.133
1.937LysTrp: 1.937 ± 0.078
3.918LysTyr: 3.918 ± 0.117
0.0LysXaa: 0.0 ± 0.0
Leu
5.782LeuAla: 5.782 ± 0.151
0.957LeuCys: 0.957 ± 0.052
4.775LeuAsp: 4.775 ± 0.137
9.01LeuGlu: 9.01 ± 0.216
4.138LeuPhe: 4.138 ± 0.139
4.869LeuGly: 4.869 ± 0.125
1.303LeuHis: 1.303 ± 0.063
8.027LeuIle: 8.027 ± 0.199
10.639LeuLys: 10.639 ± 0.202
9.5LeuLeu: 9.5 ± 0.242
1.197LeuMet: 1.197 ± 0.06
6.034LeuAsn: 6.034 ± 0.146
3.531LeuPro: 3.531 ± 0.103
4.115LeuGln: 4.115 ± 0.139
4.053LeuArg: 4.053 ± 0.106
6.075LeuSer: 6.075 ± 0.138
6.278LeuThr: 6.278 ± 0.184
5.438LeuVal: 5.438 ± 0.128
1.391LeuTrp: 1.391 ± 0.066
2.465LeuTyr: 2.465 ± 0.093
0.0LeuXaa: 0.0 ± 0.0
Met
0.927MetAla: 0.927 ± 0.049
0.097MetCys: 0.097 ± 0.02
0.54MetAsp: 0.54 ± 0.036
1.183MetGlu: 1.183 ± 0.057
0.522MetPhe: 0.522 ± 0.042
0.743MetGly: 0.743 ± 0.046
0.12MetHis: 0.12 ± 0.02
0.963MetIle: 0.963 ± 0.058
1.509MetLys: 1.509 ± 0.067
1.303MetLeu: 1.303 ± 0.057
0.205MetMet: 0.205 ± 0.024
1.101MetAsn: 1.101 ± 0.056
0.698MetPro: 0.698 ± 0.045
0.293MetGln: 0.293 ± 0.028
0.696MetArg: 0.696 ± 0.046
0.942MetSer: 0.942 ± 0.049
1.112MetThr: 1.112 ± 0.057
0.916MetVal: 0.916 ± 0.056
0.185MetTrp: 0.185 ± 0.023
0.279MetTyr: 0.279 ± 0.029
0.0MetXaa: 0.0 ± 0.0
Asn
2.101AsnAla: 2.101 ± 0.083
1.356AsnCys: 1.356 ± 0.081
2.841AsnAsp: 2.841 ± 0.113
4.291AsnGlu: 4.291 ± 0.132
3.46AsnPhe: 3.46 ± 0.115
2.404AsnGly: 2.404 ± 0.091
1.192AsnHis: 1.192 ± 0.056
4.517AsnIle: 4.517 ± 0.14
6.979AsnLys: 6.979 ± 0.143
7.437AsnLeu: 7.437 ± 0.189
0.945AsnMet: 0.945 ± 0.05
5.67AsnAsn: 5.67 ± 0.146
2.697AsnPro: 2.697 ± 0.086
4.068AsnGln: 4.068 ± 0.137
2.451AsnArg: 2.451 ± 0.084
4.323AsnSer: 4.323 ± 0.116
2.489AsnThr: 2.489 ± 0.093
1.823AsnVal: 1.823 ± 0.071
1.52AsnTrp: 1.52 ± 0.072
3.431AsnTyr: 3.431 ± 0.104
0.0AsnXaa: 0.0 ± 0.0
Pro
1.585ProAla: 1.585 ± 0.064
0.487ProCys: 0.487 ± 0.045
1.74ProAsp: 1.74 ± 0.068
2.812ProGlu: 2.812 ± 0.096
1.476ProPhe: 1.476 ± 0.07
1.412ProGly: 1.412 ± 0.064
0.851ProHis: 0.851 ± 0.052
1.878ProIle: 1.878 ± 0.071
2.926ProLys: 2.926 ± 0.108
3.011ProLeu: 3.011 ± 0.096
0.291ProMet: 0.291 ± 0.032
2.559ProAsn: 2.559 ± 0.09
1.676ProPro: 1.676 ± 0.096
1.784ProGln: 1.784 ± 0.078
1.303ProArg: 1.303 ± 0.06
2.213ProSer: 2.213 ± 0.084
2.201ProThr: 2.201 ± 0.091
1.503ProVal: 1.503 ± 0.073
0.625ProTrp: 0.625 ± 0.044
1.25ProTyr: 1.25 ± 0.067
0.0ProXaa: 0.0 ± 0.0
Gln
2.897GlnAla: 2.897 ± 0.113
0.317GlnCys: 0.317 ± 0.037
1.984GlnAsp: 1.984 ± 0.086
5.676GlnGlu: 5.676 ± 0.172
1.394GlnPhe: 1.394 ± 0.064
2.324GlnGly: 2.324 ± 0.09
0.646GlnHis: 0.646 ± 0.041
3.883GlnIle: 3.883 ± 0.104
6.598GlnLys: 6.598 ± 0.17
4.851GlnLeu: 4.851 ± 0.151
0.64GlnMet: 0.64 ± 0.042
3.032GlnAsn: 3.032 ± 0.094
1.476GlnPro: 1.476 ± 0.062
3.625GlnGln: 3.625 ± 0.164
2.025GlnArg: 2.025 ± 0.079
2.007GlnSer: 2.007 ± 0.073
2.427GlnThr: 2.427 ± 0.092
2.26GlnVal: 2.26 ± 0.08
0.652GlnTrp: 0.652 ± 0.043
0.939GlnTyr: 0.939 ± 0.049
0.0GlnXaa: 0.0 ± 0.0
Arg
1.746ArgAla: 1.746 ± 0.071
0.399ArgCys: 0.399 ± 0.036
2.051ArgAsp: 2.051 ± 0.074
5.171ArgGlu: 5.171 ± 0.149
1.523ArgPhe: 1.523 ± 0.066
2.283ArgGly: 2.283 ± 0.089
0.707ArgHis: 0.707 ± 0.047
3.076ArgIle: 3.076 ± 0.094
5.142ArgLys: 5.142 ± 0.122
4.015ArgLeu: 4.015 ± 0.111
0.787ArgMet: 0.787 ± 0.05
2.43ArgAsn: 2.43 ± 0.077
1.374ArgPro: 1.374 ± 0.076
2.495ArgGln: 2.495 ± 0.073
2.025ArgArg: 2.025 ± 0.09
2.049ArgSer: 2.049 ± 0.083
1.919ArgThr: 1.919 ± 0.077
2.051ArgVal: 2.051 ± 0.069
0.649ArgTrp: 0.649 ± 0.04
1.379ArgTyr: 1.379 ± 0.062
0.0ArgXaa: 0.0 ± 0.0
Ser
2.336SerAla: 2.336 ± 0.077
0.88SerCys: 0.88 ± 0.049
2.879SerAsp: 2.879 ± 0.111
4.766SerGlu: 4.766 ± 0.116
2.952SerPhe: 2.952 ± 0.111
3.178SerGly: 3.178 ± 0.105
1.071SerHis: 1.071 ± 0.064
3.264SerIle: 3.264 ± 0.104
5.465SerLys: 5.465 ± 0.136
6.269SerLeu: 6.269 ± 0.144
0.698SerMet: 0.698 ± 0.046
3.428SerAsn: 3.428 ± 0.114
2.386SerPro: 2.386 ± 0.087
3.798SerGln: 3.798 ± 0.114
2.404SerArg: 2.404 ± 0.095
4.094SerSer: 4.094 ± 0.178
2.198SerThr: 2.198 ± 0.079
2.345SerVal: 2.345 ± 0.084
1.215SerTrp: 1.215 ± 0.062
2.063SerTyr: 2.063 ± 0.083
0.0SerXaa: 0.0 ± 0.0
Thr
2.289ThrAla: 2.289 ± 0.096
0.631ThrCys: 0.631 ± 0.048
2.433ThrAsp: 2.433 ± 0.107
3.883ThrGlu: 3.883 ± 0.112
1.952ThrPhe: 1.952 ± 0.082
2.665ThrGly: 2.665 ± 0.1
0.784ThrHis: 0.784 ± 0.05
3.519ThrIle: 3.519 ± 0.104
5.069ThrLys: 5.069 ± 0.128
4.003ThrLeu: 4.003 ± 0.116
0.382ThrMet: 0.382 ± 0.03
3.525ThrAsn: 3.525 ± 0.112
2.157ThrPro: 2.157 ± 0.086
2.184ThrGln: 2.184 ± 0.087
1.872ThrArg: 1.872 ± 0.07
3.105ThrSer: 3.105 ± 0.099
2.671ThrThr: 2.671 ± 0.101
1.57ThrVal: 1.57 ± 0.068
0.716ThrTrp: 0.716 ± 0.046
1.773ThrTyr: 1.773 ± 0.07
0.0ThrXaa: 0.0 ± 0.0
Val
2.53ValAla: 2.53 ± 0.102
0.575ValCys: 0.575 ± 0.044
2.327ValAsp: 2.327 ± 0.076
3.751ValGlu: 3.751 ± 0.11
2.116ValPhe: 2.116 ± 0.08
2.568ValGly: 2.568 ± 0.087
0.534ValHis: 0.534 ± 0.042
3.522ValIle: 3.522 ± 0.097
4.963ValLys: 4.963 ± 0.11
4.024ValLeu: 4.024 ± 0.106
0.643ValMet: 0.643 ± 0.039
2.999ValAsn: 2.999 ± 0.091
1.526ValPro: 1.526 ± 0.066
1.297ValGln: 1.297 ± 0.062
1.955ValArg: 1.955 ± 0.078
2.773ValSer: 2.773 ± 0.103
0.866ValThr: 0.866 ± 0.057
2.565ValVal: 2.565 ± 0.106
0.757ValTrp: 0.757 ± 0.051
1.746ValTyr: 1.746 ± 0.077
0.0ValXaa: 0.0 ± 0.0
Trp
0.719TrpAla: 0.719 ± 0.046
0.258TrpCys: 0.258 ± 0.03
0.88TrpAsp: 0.88 ± 0.046
2.051TrpGlu: 2.051 ± 0.083
0.725TrpPhe: 0.725 ± 0.049
1.057TrpGly: 1.057 ± 0.06
0.288TrpHis: 0.288 ± 0.03
1.418TrpIle: 1.418 ± 0.07
2.41TrpLys: 2.41 ± 0.094
2.013TrpLeu: 2.013 ± 0.079
0.32TrpMet: 0.32 ± 0.032
1.039TrpAsn: 1.039 ± 0.058
0.396TrpPro: 0.396 ± 0.033
1.054TrpGln: 1.054 ± 0.056
0.822TrpArg: 0.822 ± 0.055
0.737TrpSer: 0.737 ± 0.05
0.977TrpThr: 0.977 ± 0.059
0.91TrpVal: 0.91 ± 0.056
0.461TrpTrp: 0.461 ± 0.044
0.581TrpTyr: 0.581 ± 0.043
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.374TyrAla: 1.374 ± 0.066
0.804TyrCys: 0.804 ± 0.057
1.752TyrAsp: 1.752 ± 0.066
2.586TyrGlu: 2.586 ± 0.09
2.38TyrPhe: 2.38 ± 0.092
1.69TyrGly: 1.69 ± 0.079
0.942TyrHis: 0.942 ± 0.057
2.063TyrIle: 2.063 ± 0.078
2.847TyrLys: 2.847 ± 0.102
4.487TyrLeu: 4.487 ± 0.122
0.434TyrMet: 0.434 ± 0.033
2.028TyrAsn: 2.028 ± 0.077
1.338TyrPro: 1.338 ± 0.081
2.982TyrGln: 2.982 ± 0.111
1.799TyrArg: 1.799 ± 0.079
2.421TyrSer: 2.421 ± 0.074
1.218TyrThr: 1.218 ± 0.057
1.142TyrVal: 1.142 ± 0.062
0.986TyrTrp: 0.986 ± 0.053
1.884TyrTyr: 1.884 ± 0.081
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1718 proteins (340732 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
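
The note above can be illustrated with a short sketch of a protein-level bootstrap: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates serves as the error. This is only a sketch under stated assumptions, not the database's actual procedure. In particular, the function names, the fixed seed, and the use of the standard deviation of the replicates as the reported error are assumptions.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Resampling is done at the protein level: each replicate draws
    len(proteins) whole proteins with replacement.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # a Counter returns 0 for missing pairs
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Hypothetical toy sequences (one-letter codes), not taken from this proteome.
mean_kk, err_kk = bootstrap_error(["MKKLE", "KKEL", "AKKA"], "KK")
```

Resampling whole proteins rather than individual residue pairs keeps the pairs from one protein together, so the error reflects variation between proteins.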

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski