Amino acid dipeptide frequency for Mycoplasma synoviae (strain 53)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
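As a rough illustration of how such a dipeptide frequency can be computed, here is a minimal Python sketch. It is not the database's own code: the function name `dipeptide_frequencies`, the use of one-letter codes, the mapping of every non-standard symbol to `X` (Xaa), and the choice to count overlapping pairs within each protein only are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

# 20 standard residues (one-letter codes); anything else is mapped to "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Pairs overlap (positions 1-2, 2-3, ...) and never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Normalise: upper-case, non-standard symbols become "X".
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Toy input (hypothetical sequences, not taken from this proteome).
freqs = dipeptide_frequencies(["MKKLA", "KKB"])
print(len(freqs))              # number of possible dipeptides, Xaa included
print(round(freqs["KK"], 1))   # KK makes up 2 of the 6 pairs in the toy input
```

Counting per protein keeps the last residue of one sequence from being paired with the first residue of the next, which would otherwise create dipeptides that do not exist in any protein.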

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
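The arithmetic behind the 0.25% figure, and one possible way to decide which dipeptides count as over- or underrepresented, can be sketched as follows. The page does not state the exact rule used for the red and blue marking, so the plain comparison against the uniform expectation in `classify` is an assumption, as are all names in the code. The observed values are copied from the table.

```python
# Uniform expectation: every dipeptide equally likely (values in per mille).
EXPECTED_20 = 1000.0 / (20 * 20)   # 2.5 per mille, i.e. 0.25 %
EXPECTED_21 = 1000.0 / (21 * 21)   # slightly lower when Xaa is included


def classify(freq_per_mille, expected=EXPECTED_20):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Observed frequencies (per mille) copied from the table.
observed = {"LysLys": 10.408, "AlaAla": 4.281, "CysCys": 0.038, "ProPro": 0.507}
for name, value in observed.items():
    print(name, value, classify(value))
```

A uniform expectation ignores the fact that single amino acids are themselves unequally frequent; comparing against the product of the two single-residue frequencies would be an alternative baseline, but that is outside what the page describes.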

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
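For readers who prefer fractions or percentages, the unit conversion is a one-liner. The helper names below are hypothetical and exist only for this sketch; the input value is the LysLys entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


lyslys = 10.408  # LysLys frequency in per mille, from the table
print(round(per_mille_to_fraction(lyslys), 6))
print(round(per_mille_to_percent(lyslys), 4))
```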

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.281 ± 0.254
AlaCys: 0.22 ± 0.029
AlaAsp: 2.573 ± 0.114
AlaGlu: 2.823 ± 0.122
AlaPhe: 3.064 ± 0.131
AlaGly: 2.692 ± 0.121
AlaHis: 0.718 ± 0.061
AlaIle: 4.838 ± 0.186
AlaLys: 6.208 ± 0.187
AlaLeu: 6.06 ± 0.2
AlaMet: 1.09 ± 0.074
AlaAsn: 3.993 ± 0.147
AlaPro: 1.745 ± 0.102
AlaGln: 2.421 ± 0.106
AlaArg: 2.045 ± 0.093
AlaSer: 4.521 ± 0.175
AlaThr: 3.486 ± 0.17
AlaVal: 3.148 ± 0.149
AlaTrp: 0.537 ± 0.053
AlaTyr: 2.045 ± 0.096
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.216 ± 0.033
CysCys: 0.038 ± 0.014
CysAsp: 0.262 ± 0.039
CysGlu: 0.245 ± 0.033
CysPhe: 0.237 ± 0.032
CysGly: 0.308 ± 0.032
CysHis: 0.101 ± 0.023
CysIle: 0.292 ± 0.035
CysLys: 0.385 ± 0.041
CysLeu: 0.334 ± 0.042
CysMet: 0.051 ± 0.012
CysAsn: 0.237 ± 0.038
CysPro: 0.135 ± 0.023
CysGln: 0.135 ± 0.028
CysArg: 0.076 ± 0.02
CysSer: 0.283 ± 0.036
CysThr: 0.186 ± 0.031
CysVal: 0.27 ± 0.035
CysTrp: 0.03 ± 0.01
CysTyr: 0.173 ± 0.026
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.165 ± 0.135
AspCys: 0.182 ± 0.027
AspAsp: 2.51 ± 0.102
AspGlu: 3.605 ± 0.156
AspPhe: 4.2 ± 0.16
AspGly: 2.573 ± 0.135
AspHis: 0.554 ± 0.05
AspIle: 3.638 ± 0.144
AspLys: 5.354 ± 0.137
AspLeu: 6.233 ± 0.175
AspMet: 0.655 ± 0.052
AspAsn: 3.478 ± 0.135
AspPro: 1.589 ± 0.087
AspGln: 1.847 ± 0.095
AspArg: 1.272 ± 0.082
AspSer: 4.019 ± 0.157
AspThr: 2.193 ± 0.095
AspVal: 2.709 ± 0.121
AspTrp: 0.372 ± 0.036
AspTyr: 2.366 ± 0.102
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.626 ± 0.154
GluCys: 0.093 ± 0.021
GluAsp: 2.878 ± 0.13
GluGlu: 4.412 ± 0.201
GluPhe: 4.036 ± 0.147
GluGly: 2.21 ± 0.112
GluHis: 0.63 ± 0.051
GluIle: 6.423 ± 0.229
GluLys: 7.678 ± 0.242
GluLeu: 6.284 ± 0.186
GluMet: 1.078 ± 0.064
GluAsn: 6.229 ± 0.177
GluPro: 1.065 ± 0.071
GluGln: 1.8 ± 0.083
GluArg: 1.454 ± 0.092
GluSer: 3.719 ± 0.144
GluThr: 3.072 ± 0.111
GluVal: 3.985 ± 0.158
GluTrp: 0.347 ± 0.037
GluTyr: 2.142 ± 0.104
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.199 ± 0.116
PheCys: 0.376 ± 0.049
PheAsp: 4.15 ± 0.132
PheGlu: 3.482 ± 0.153
PhePhe: 3.237 ± 0.165
PheGly: 2.54 ± 0.12
PheHis: 0.604 ± 0.055
PheIle: 4.382 ± 0.181
PheLys: 5.785 ± 0.167
PheLeu: 6.601 ± 0.214
PheMet: 0.879 ± 0.061
PheAsn: 4.382 ± 0.175
PhePro: 1.327 ± 0.073
PheGln: 1.754 ± 0.088
PheArg: 1.492 ± 0.085
PheSer: 4.792 ± 0.185
PheThr: 3.013 ± 0.129
PheVal: 3.985 ± 0.143
PheTrp: 0.604 ± 0.049
PheTyr: 2.404 ± 0.122
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.806 ± 0.117
GlyCys: 0.173 ± 0.027
GlyAsp: 2.214 ± 0.129
GlyGlu: 2.4 ± 0.104
GlyPhe: 2.861 ± 0.106
GlyGly: 2.717 ± 0.15
GlyHis: 0.794 ± 0.065
GlyIle: 3.803 ± 0.164
GlyLys: 3.964 ± 0.156
GlyLeu: 3.782 ± 0.15
GlyMet: 0.782 ± 0.063
GlyAsn: 2.476 ± 0.119
GlyPro: 1.094 ± 0.067
GlyGln: 1.551 ± 0.077
GlyArg: 1.411 ± 0.085
GlySer: 3.474 ± 0.142
GlyThr: 2.675 ± 0.125
GlyVal: 3.051 ± 0.121
GlyTrp: 0.321 ± 0.036
GlyTyr: 2.244 ± 0.101
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.731 ± 0.059
HisCys: 0.042 ± 0.012
HisAsp: 0.625 ± 0.056
HisGlu: 0.811 ± 0.059
HisPhe: 0.816 ± 0.062
HisGly: 0.697 ± 0.056
HisHis: 0.237 ± 0.029
HisIle: 0.925 ± 0.062
HisLys: 1.272 ± 0.077
HisLeu: 1.449 ± 0.087
HisMet: 0.182 ± 0.029
HisAsn: 0.761 ± 0.067
HisPro: 0.465 ± 0.048
HisGln: 0.473 ± 0.049
HisArg: 0.292 ± 0.04
HisSer: 0.87 ± 0.058
HisThr: 0.435 ± 0.037
HisVal: 0.676 ± 0.057
HisTrp: 0.114 ± 0.022
HisTyr: 0.499 ± 0.048
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.91 ± 0.184
IleCys: 0.486 ± 0.045
IleAsp: 4.221 ± 0.163
IleGlu: 4.674 ± 0.194
IlePhe: 4.745 ± 0.213
IleGly: 3.457 ± 0.15
IleHis: 0.938 ± 0.072
IleIle: 6.14 ± 0.205
IleLys: 8.029 ± 0.276
IleLeu: 7.256 ± 0.26
IleMet: 1.242 ± 0.075
IleAsn: 6.377 ± 0.243
IlePro: 2.497 ± 0.129
IleGln: 2.421 ± 0.118
IleArg: 2.282 ± 0.081
IleSer: 6.267 ± 0.184
IleThr: 3.985 ± 0.17
IleVal: 5.02 ± 0.168
IleTrp: 0.634 ± 0.052
IleTyr: 3.478 ± 0.15
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.102 ± 0.185
LysCys: 0.3 ± 0.045
LysAsp: 5.798 ± 0.19
LysGlu: 8.143 ± 0.248
LysPhe: 5.616 ± 0.176
LysGly: 3.275 ± 0.144
LysHis: 1.454 ± 0.094
LysIle: 8.895 ± 0.257
LysLys: 10.408 ± 0.306
LysLeu: 10.213 ± 0.198
LysMet: 2.168 ± 0.093
LysAsn: 9.711 ± 0.268
LysPro: 2.464 ± 0.12
LysGln: 3.245 ± 0.131
LysArg: 2.54 ± 0.113
LysSer: 5.798 ± 0.165
LysThr: 5.751 ± 0.18
LysVal: 6.355 ± 0.198
LysTrp: 0.989 ± 0.072
LysTyr: 4.682 ± 0.16
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.722 ± 0.174
LeuCys: 0.401 ± 0.04
LeuAsp: 6.123 ± 0.16
LeuGlu: 7.006 ± 0.215
LeuPhe: 5.413 ± 0.203
LeuGly: 4.45 ± 0.159
LeuHis: 1.234 ± 0.065
LeuIle: 8.388 ± 0.251
LeuLys: 10.932 ± 0.29
LeuLeu: 9.106 ± 0.28
LeuMet: 1.487 ± 0.085
LeuAsn: 9.098 ± 0.265
LeuPro: 2.637 ± 0.091
LeuGln: 3.076 ± 0.115
LeuArg: 2.552 ± 0.104
LeuSer: 7.632 ± 0.232
LeuThr: 5.81 ± 0.218
LeuVal: 6.385 ± 0.22
LeuTrp: 0.748 ± 0.063
LeuTyr: 3.723 ± 0.134
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.141 ± 0.069
MetCys: 0.055 ± 0.016
MetAsp: 0.693 ± 0.054
MetGlu: 0.765 ± 0.069
MetPhe: 0.879 ± 0.068
MetGly: 0.701 ± 0.054
MetHis: 0.313 ± 0.034
MetIle: 1.247 ± 0.083
MetLys: 1.538 ± 0.084
MetLeu: 1.787 ± 0.088
MetMet: 0.308 ± 0.045
MetAsn: 1.048 ± 0.069
MetPro: 0.93 ± 0.063
MetGln: 0.989 ± 0.062
MetArg: 0.406 ± 0.043
MetSer: 0.951 ± 0.068
MetThr: 0.672 ± 0.05
MetVal: 0.989 ± 0.071
MetTrp: 0.169 ± 0.023
MetTyr: 0.461 ± 0.049
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.782 ± 0.148
AsnCys: 0.3 ± 0.04
AsnAsp: 3.664 ± 0.158
AsnGlu: 4.437 ± 0.154
AsnPhe: 4.843 ± 0.198
AsnGly: 3.19 ± 0.14
AsnHis: 0.896 ± 0.061
AsnIle: 5.557 ± 0.224
AsnLys: 8.468 ± 0.218
AsnLeu: 9.732 ± 0.341
AsnMet: 0.997 ± 0.062
AsnAsn: 6.275 ± 0.242
AsnPro: 2.776 ± 0.139
AsnGln: 2.848 ± 0.136
AsnArg: 1.847 ± 0.099
AsnSer: 6.182 ± 0.217
AsnThr: 3.757 ± 0.164
AsnVal: 4.479 ± 0.167
AsnTrp: 0.904 ± 0.064
AsnTyr: 3.955 ± 0.139
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.847 ± 0.123
ProCys: 0.093 ± 0.021
ProAsp: 1.166 ± 0.07
ProGlu: 2.016 ± 0.111
ProPhe: 1.28 ± 0.075
ProGly: 1.437 ± 0.082
ProHis: 0.389 ± 0.04
ProIle: 1.94 ± 0.093
ProLys: 2.869 ± 0.131
ProLeu: 2.307 ± 0.088
ProMet: 0.334 ± 0.035
ProAsn: 1.978 ± 0.101
ProPro: 0.507 ± 0.057
ProGln: 1.078 ± 0.074
ProArg: 0.782 ± 0.058
ProSer: 2.164 ± 0.104
ProThr: 2.041 ± 0.111
ProVal: 1.593 ± 0.1
ProTrp: 0.22 ± 0.029
ProTyr: 1.031 ± 0.075
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.151 ± 0.118
GlnCys: 0.03 ± 0.011
GlnAsp: 1.787 ± 0.095
GlnGlu: 2.455 ± 0.118
GlnPhe: 1.297 ± 0.077
GlnGly: 1.58 ± 0.09
GlnHis: 0.342 ± 0.038
GlnIle: 2.941 ± 0.101
GlnLys: 4.074 ± 0.15
GlnLeu: 2.802 ± 0.091
GlnMet: 0.744 ± 0.058
GlnAsn: 3.25 ± 0.127
GlnPro: 0.685 ± 0.055
GlnGln: 1.128 ± 0.064
GlnArg: 1.306 ± 0.073
GlnSer: 2.328 ± 0.11
GlnThr: 1.825 ± 0.104
GlnVal: 1.935 ± 0.095
GlnTrp: 0.406 ± 0.04
GlnTyr: 1.145 ± 0.079
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.509 ± 0.081
ArgCys: 0.08 ± 0.018
ArgAsp: 1.365 ± 0.077
ArgGlu: 1.792 ± 0.086
ArgPhe: 1.403 ± 0.075
ArgGly: 1.302 ± 0.089
ArgHis: 0.355 ± 0.046
ArgIle: 2.371 ± 0.109
ArgLys: 2.717 ± 0.132
ArgLeu: 2.324 ± 0.111
ArgMet: 0.558 ± 0.05
ArgAsn: 2.189 ± 0.102
ArgPro: 0.985 ± 0.07
ArgGln: 0.963 ± 0.061
ArgArg: 0.799 ± 0.063
ArgSer: 1.783 ± 0.087
ArgThr: 1.213 ± 0.071
ArgVal: 1.754 ± 0.087
ArgTrp: 0.283 ± 0.041
ArgTyr: 1.31 ± 0.075
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.959 ± 0.134
SerCys: 0.389 ± 0.04
SerAsp: 4.103 ± 0.132
SerGlu: 4.686 ± 0.148
SerPhe: 4.281 ± 0.179
SerGly: 3.634 ± 0.159
SerHis: 0.913 ± 0.062
SerIle: 5.299 ± 0.178
SerLys: 7.412 ± 0.187
SerLeu: 8.172 ± 0.228
SerMet: 1.078 ± 0.074
SerAsn: 5.384 ± 0.174
SerPro: 1.644 ± 0.094
SerGln: 2.823 ± 0.124
SerArg: 2.003 ± 0.089
SerSer: 5.789 ± 0.175
SerThr: 3.288 ± 0.122
SerVal: 4.221 ± 0.116
SerTrp: 0.778 ± 0.056
SerTyr: 3.309 ± 0.129
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.009 ± 0.16
ThrCys: 0.19 ± 0.034
ThrAsp: 2.354 ± 0.106
ThrGlu: 2.65 ± 0.123
ThrPhe: 3.317 ± 0.141
ThrGly: 2.688 ± 0.119
ThrHis: 0.672 ± 0.056
ThrIle: 3.947 ± 0.149
ThrLys: 5.696 ± 0.174
ThrLeu: 5.464 ± 0.183
ThrMet: 0.718 ± 0.055
ThrAsn: 3.951 ± 0.178
ThrPro: 1.694 ± 0.1
ThrGln: 2.206 ± 0.127
ThrArg: 1.42 ± 0.064
ThrSer: 4.188 ± 0.168
ThrThr: 3.212 ± 0.152
ThrVal: 3.076 ± 0.136
ThrTrp: 0.604 ± 0.058
ThrTyr: 2.016 ± 0.106
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.913 ± 0.173
ValCys: 0.275 ± 0.04
ValAsp: 3.393 ± 0.143
ValGlu: 3.537 ± 0.148
ValPhe: 3.985 ± 0.155
ValGly: 2.966 ± 0.137
ValHis: 0.655 ± 0.066
ValIle: 4.509 ± 0.174
ValLys: 5.527 ± 0.182
ValLeu: 6.14 ± 0.159
ValMet: 0.93 ± 0.073
ValAsn: 4.564 ± 0.171
ValPro: 1.559 ± 0.088
ValGln: 1.551 ± 0.072
ValArg: 1.648 ± 0.083
ValSer: 4.678 ± 0.14
ValThr: 3.85 ± 0.163
ValVal: 4.095 ± 0.179
ValTrp: 0.431 ± 0.038
ValTyr: 2.472 ± 0.102
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.503 ± 0.046
TrpCys: 0.021 ± 0.008
TrpAsp: 0.575 ± 0.053
TrpGlu: 0.608 ± 0.051
TrpPhe: 0.748 ± 0.065
TrpGly: 0.359 ± 0.039
TrpHis: 0.11 ± 0.021
TrpIle: 0.706 ± 0.063
TrpLys: 1.001 ± 0.069
TrpLeu: 0.765 ± 0.061
TrpMet: 0.203 ± 0.029
TrpAsn: 0.744 ± 0.062
TrpPro: 0.194 ± 0.029
TrpGln: 0.22 ± 0.029
TrpArg: 0.262 ± 0.028
TrpSer: 0.587 ± 0.052
TrpThr: 0.57 ± 0.055
TrpVal: 0.562 ± 0.045
TrpTrp: 0.106 ± 0.02
TrpTyr: 0.372 ± 0.042
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.94 ± 0.096
TyrCys: 0.254 ± 0.036
TyrAsp: 2.058 ± 0.084
TyrGlu: 2.611 ± 0.115
TyrPhe: 2.73 ± 0.131
TyrGly: 1.817 ± 0.088
TyrHis: 0.465 ± 0.044
TyrIle: 2.819 ± 0.122
TyrLys: 4.589 ± 0.178
TyrLeu: 5.113 ± 0.163
TyrMet: 0.554 ± 0.048
TyrAsn: 2.768 ± 0.127
TyrPro: 1.078 ± 0.074
TyrGln: 1.682 ± 0.077
TyrArg: 1.166 ± 0.071
TyrSer: 3.14 ± 0.128
TyrThr: 2.037 ± 0.09
TyrVal: 2.388 ± 0.104
TyrTrp: 0.621 ± 0.046
TyrTyr: 1.771 ± 0.099
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 679 proteins (236649 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
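A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates serves as the error. The page does not describe the exact procedure, so taking the standard deviation as the reported error, the function names, and the fixed random seed are all assumptions of this example.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Standard deviation (per mille) of one dipeptide's frequency across
    n_boot resamples of whole proteins, drawn with replacement."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]  # count once, reuse
    estimates = []
    for _ in range(n_boot):
        # Resample proteins, not individual dipeptides.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences, not taken from this proteome).
toy = ["MKKLAKK", "MSTNKKL", "MAAAGLV", "MKLLNNS"]
print(bootstrap_error(toy, "KK"))
```

Resampling at the protein level rather than the dipeptide level keeps each protein's internal composition intact, so the error reflects variation between proteins.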

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski