Amino acid dipeptide frequency for Mycoplasmopsis columbinum SF7

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and underrepresented ones are marked in blue in the table.
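The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's own code: the function names `dipeptide_per_mille` and `classify` are hypothetical, sequences are assumed to be in one-letter code, any non-standard letter is mapped to `X` (Xaa), and dipeptides are counted as overlapping pairs that never span two proteins.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code; anything else becomes "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs."""
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2, taken within one protein at a time.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = STANDARD + "X"
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"    # marked in red in the table
    if freq_per_mille < expected:
        return "under"   # marked in blue in the table
    return "expected"


# Made-up toy sequences, not taken from the proteome.
toy = ["MKKLLE", "MKXA"]
freqs = dipeptide_per_mille(toy)
```

With the values from the table, `classify(3.763)` (AlaAla) falls on the overrepresented side and `classify(0.254)` (AlaCys) on the underrepresented side of the 2.5‰ threshold.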

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
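As a quick worked example of the unit conversion, the snippet below turns a per mille value into a plain fraction and into percent. The helper names are hypothetical; the input value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 3.763                              # AlaAla frequency in per mille
fraction = per_mille_to_fraction(ala_ala)    # about 0.003763
percent = per_mille_to_percent(ala_ala)      # about 0.3763 %
```

On the same scale, the uniform expectation of 0.25% corresponds to 2.5‰.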

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.763 ± 0.194
AlaCys: 0.254 ± 0.034
AlaAsp: 2.666 ± 0.147
AlaGlu: 3.372 ± 0.177
AlaPhe: 2.789 ± 0.126
AlaGly: 2.64 ± 0.149
AlaHis: 0.811 ± 0.075
AlaIle: 5.692 ± 0.186
AlaLys: 6.49 ± 0.227
AlaLeu: 6.074 ± 0.176
AlaMet: 1.026 ± 0.07
AlaAsn: 4.262 ± 0.279
AlaPro: 1.399 ± 0.079
AlaGln: 2.631 ± 0.171
AlaArg: 2.044 ± 0.107
AlaSer: 3.504 ± 0.136
AlaThr: 3.521 ± 0.245
AlaVal: 2.942 ± 0.128
AlaTrp: 0.373 ± 0.041
AlaTyr: 2.236 ± 0.098
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.237 ± 0.035
CysCys: 0.057 ± 0.021
CysAsp: 0.21 ± 0.034
CysGlu: 0.215 ± 0.031
CysPhe: 0.215 ± 0.035
CysGly: 0.246 ± 0.039
CysHis: 0.075 ± 0.018
CysIle: 0.228 ± 0.034
CysLys: 0.276 ± 0.043
CysLeu: 0.386 ± 0.042
CysMet: 0.053 ± 0.015
CysAsn: 0.219 ± 0.034
CysPro: 0.145 ± 0.025
CysGln: 0.228 ± 0.036
CysArg: 0.123 ± 0.025
CysSer: 0.263 ± 0.039
CysThr: 0.184 ± 0.031
CysVal: 0.197 ± 0.031
CysTrp: 0.044 ± 0.013
CysTyr: 0.162 ± 0.029
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.486 ± 0.264
AspCys: 0.162 ± 0.029
AspAsp: 2.644 ± 0.135
AspGlu: 4.815 ± 0.163
AspPhe: 3.008 ± 0.115
AspGly: 2.39 ± 0.106
AspHis: 0.684 ± 0.051
AspIle: 3.898 ± 0.134
AspLys: 5.473 ± 0.176
AspLeu: 6.074 ± 0.208
AspMet: 0.579 ± 0.047
AspAsn: 3.999 ± 0.245
AspPro: 1.675 ± 0.097
AspGln: 2.131 ± 0.17
AspArg: 1.368 ± 0.094
AspSer: 3.46 ± 0.171
AspThr: 2.276 ± 0.116
AspVal: 3.1 ± 0.117
AspTrp: 0.412 ± 0.048
AspTyr: 2.596 ± 0.112
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.925 ± 0.203
GluCys: 0.193 ± 0.029
GluAsp: 3.039 ± 0.128
GluGlu: 5.565 ± 0.19
GluPhe: 3.289 ± 0.142
GluGly: 2.416 ± 0.132
GluHis: 1.013 ± 0.075
GluIle: 7.372 ± 0.191
GluLys: 8.262 ± 0.216
GluLeu: 7.648 ± 0.235
GluMet: 1.272 ± 0.093
GluAsn: 6.959 ± 0.24
GluPro: 1.254 ± 0.09
GluGln: 2.679 ± 0.123
GluArg: 2.087 ± 0.095
GluSer: 3.144 ± 0.113
GluThr: 4.012 ± 0.138
GluVal: 3.67 ± 0.16
GluTrp: 0.539 ± 0.052
GluTyr: 3.013 ± 0.158
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.21 ± 0.135
PheCys: 0.246 ± 0.034
PheAsp: 3.451 ± 0.14
PheGlu: 3.381 ± 0.134
PhePhe: 2.763 ± 0.173
PheGly: 2.618 ± 0.126
PheHis: 0.539 ± 0.063
PheIle: 4.903 ± 0.238
PheLys: 5.004 ± 0.154
PheLeu: 5.46 ± 0.227
PheMet: 0.702 ± 0.057
PheAsn: 3.999 ± 0.185
PhePro: 1.123 ± 0.075
PheGln: 1.399 ± 0.087
PheArg: 1.373 ± 0.074
PheSer: 3.6 ± 0.156
PheThr: 2.714 ± 0.122
PheVal: 3.052 ± 0.141
PheTrp: 0.583 ± 0.064
PheTyr: 2.223 ± 0.124
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.636 ± 0.113
GlyCys: 0.21 ± 0.034
GlyAsp: 2.206 ± 0.139
GlyGlu: 2.864 ± 0.13
GlyPhe: 2.754 ± 0.13
GlyGly: 2.539 ± 0.174
GlyHis: 0.829 ± 0.072
GlyIle: 4.096 ± 0.163
GlyLys: 3.824 ± 0.185
GlyLeu: 4.078 ± 0.191
GlyMet: 0.868 ± 0.077
GlyAsn: 2.508 ± 0.134
GlyPro: 0.956 ± 0.086
GlyGln: 1.649 ± 0.085
GlyArg: 1.509 ± 0.11
GlySer: 2.903 ± 0.126
GlyThr: 2.636 ± 0.126
GlyVal: 2.697 ± 0.14
GlyTrp: 0.421 ± 0.044
GlyTyr: 1.899 ± 0.111
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.833 ± 0.069
HisCys: 0.092 ± 0.019
HisAsp: 0.697 ± 0.056
HisGlu: 0.952 ± 0.07
HisPhe: 0.956 ± 0.067
HisGly: 0.763 ± 0.077
HisHis: 0.241 ± 0.035
HisIle: 1.153 ± 0.073
HisLys: 1.109 ± 0.083
HisLeu: 1.544 ± 0.086
HisMet: 0.184 ± 0.028
HisAsn: 0.974 ± 0.074
HisPro: 0.482 ± 0.055
HisGln: 0.465 ± 0.046
HisArg: 0.403 ± 0.04
HisSer: 0.86 ± 0.06
HisThr: 0.667 ± 0.053
HisVal: 0.649 ± 0.053
HisTrp: 0.127 ± 0.022
HisTyr: 0.574 ± 0.05
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.319 ± 0.175
IleCys: 0.434 ± 0.058
IleAsp: 5.499 ± 0.167
IleGlu: 5.828 ± 0.208
IlePhe: 4.982 ± 0.254
IleGly: 3.934 ± 0.193
IleHis: 1.004 ± 0.07
IleIle: 7.297 ± 0.309
IleLys: 8.893 ± 0.217
IleLeu: 8.31 ± 0.315
IleMet: 1.232 ± 0.086
IleAsn: 7.705 ± 0.239
IlePro: 2.491 ± 0.118
IleGln: 2.903 ± 0.1
IleArg: 2.131 ± 0.096
IleSer: 6.008 ± 0.183
IleThr: 4.736 ± 0.211
IleVal: 5.565 ± 0.219
IleTrp: 0.627 ± 0.061
IleTyr: 3.381 ± 0.154
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.609 ± 0.287
LysCys: 0.311 ± 0.045
LysAsp: 5.824 ± 0.186
LysGlu: 8.941 ± 0.218
LysPhe: 4.153 ± 0.168
LysGly: 3.469 ± 0.177
LysHis: 1.438 ± 0.099
LysIle: 9.643 ± 0.373
LysLys: 9.341 ± 0.35
LysLeu: 9.279 ± 0.235
LysMet: 2.285 ± 0.162
LysAsn: 9.599 ± 0.247
LysPro: 2.272 ± 0.113
LysGln: 3.197 ± 0.107
LysArg: 2.78 ± 0.146
LysSer: 5.082 ± 0.142
LysThr: 5.784 ± 0.186
LysVal: 5.363 ± 0.164
LysTrp: 0.978 ± 0.074
LysTyr: 4.613 ± 0.171
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.06 ± 0.171
LeuCys: 0.355 ± 0.04
LeuAsp: 6.078 ± 0.215
LeuGlu: 7.179 ± 0.229
LeuPhe: 4.964 ± 0.225
LeuGly: 4.276 ± 0.184
LeuHis: 1.311 ± 0.079
LeuIle: 9.419 ± 0.301
LeuLys: 10.411 ± 0.329
LeuLeu: 9.025 ± 0.299
LeuMet: 1.644 ± 0.101
LeuAsn: 9.762 ± 0.378
LeuPro: 2.828 ± 0.105
LeuGln: 2.526 ± 0.105
LeuArg: 2.846 ± 0.126
LeuSer: 6.604 ± 0.169
LeuThr: 6.565 ± 0.289
LeuVal: 5.889 ± 0.188
LeuTrp: 0.763 ± 0.064
LeuTyr: 3.087 ± 0.125
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.017 ± 0.075
MetCys: 0.061 ± 0.017
MetAsp: 0.645 ± 0.053
MetGlu: 0.789 ± 0.065
MetPhe: 0.899 ± 0.071
MetGly: 0.811 ± 0.061
MetHis: 0.285 ± 0.035
MetIle: 1.359 ± 0.102
MetLys: 1.745 ± 0.098
MetLeu: 1.561 ± 0.116
MetMet: 0.329 ± 0.047
MetAsn: 1.215 ± 0.079
MetPro: 0.605 ± 0.054
MetGln: 0.627 ± 0.06
MetArg: 0.452 ± 0.045
MetSer: 1.184 ± 0.091
MetThr: 0.75 ± 0.058
MetVal: 0.864 ± 0.078
MetTrp: 0.132 ± 0.022
MetTyr: 0.513 ± 0.054
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.376 ± 0.258
AsnCys: 0.237 ± 0.033
AsnAsp: 4.762 ± 0.356
AsnGlu: 6.179 ± 0.248
AsnPhe: 4.78 ± 0.18
AsnGly: 3.521 ± 0.146
AsnHis: 1.14 ± 0.063
AsnIle: 6.438 ± 0.202
AsnLys: 8.635 ± 0.236
AsnLeu: 9.183 ± 0.298
AsnMet: 1.166 ± 0.081
AsnAsn: 7.464 ± 0.377
AsnPro: 2.513 ± 0.104
AsnGln: 3.363 ± 0.21
AsnArg: 1.908 ± 0.079
AsnSer: 5.604 ± 0.278
AsnThr: 3.714 ± 0.191
AsnVal: 4.245 ± 0.183
AsnTrp: 0.763 ± 0.062
AsnTyr: 3.89 ± 0.155
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.434 ± 0.092
ProCys: 0.066 ± 0.015
ProAsp: 1.171 ± 0.079
ProGlu: 2.087 ± 0.118
ProPhe: 1.395 ± 0.089
ProGly: 1.386 ± 0.094
ProHis: 0.517 ± 0.048
ProIle: 2.228 ± 0.109
ProLys: 2.162 ± 0.105
ProLeu: 2.64 ± 0.104
ProMet: 0.39 ± 0.042
ProAsn: 2.035 ± 0.109
ProPro: 0.452 ± 0.046
ProGln: 0.855 ± 0.059
ProArg: 0.671 ± 0.065
ProSer: 1.859 ± 0.093
ProThr: 1.807 ± 0.105
ProVal: 1.539 ± 0.083
ProTrp: 0.219 ± 0.032
ProTyr: 1.0 ± 0.071
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.105 ± 0.134
GlnCys: 0.083 ± 0.021
GlnAsp: 1.715 ± 0.097
GlnGlu: 2.627 ± 0.098
GlnPhe: 1.333 ± 0.096
GlnGly: 1.364 ± 0.082
GlnHis: 0.443 ± 0.048
GlnIle: 3.464 ± 0.131
GlnLys: 3.767 ± 0.131
GlnLeu: 3.526 ± 0.156
GlnMet: 0.658 ± 0.053
GlnAsn: 3.473 ± 0.181
GlnPro: 0.794 ± 0.063
GlnGln: 1.25 ± 0.076
GlnArg: 1.298 ± 0.078
GlnSer: 1.833 ± 0.117
GlnThr: 1.969 ± 0.12
GlnVal: 1.855 ± 0.124
GlnTrp: 0.316 ± 0.041
GlnTyr: 1.298 ± 0.087
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.548 ± 0.084
ArgCys: 0.11 ± 0.021
ArgAsp: 1.688 ± 0.103
ArgGlu: 2.241 ± 0.11
ArgPhe: 1.329 ± 0.091
ArgGly: 1.416 ± 0.096
ArgHis: 0.417 ± 0.052
ArgIle: 2.517 ± 0.127
ArgLys: 2.701 ± 0.144
ArgLeu: 2.934 ± 0.133
ArgMet: 0.579 ± 0.062
ArgAsn: 2.153 ± 0.106
ArgPro: 0.737 ± 0.064
ArgGln: 1.052 ± 0.07
ArgArg: 1.07 ± 0.079
ArgSer: 1.623 ± 0.08
ArgThr: 1.504 ± 0.091
ArgVal: 1.636 ± 0.095
ArgTrp: 0.237 ± 0.033
ArgTyr: 1.149 ± 0.071
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.355 ± 0.139
SerCys: 0.281 ± 0.038
SerAsp: 3.219 ± 0.204
SerGlu: 3.881 ± 0.151
SerPhe: 3.714 ± 0.174
SerGly: 3.065 ± 0.115
SerHis: 0.846 ± 0.068
SerIle: 5.232 ± 0.175
SerLys: 6.174 ± 0.157
SerLeu: 6.78 ± 0.223
SerMet: 0.781 ± 0.063
SerAsn: 4.999 ± 0.224
SerPro: 1.513 ± 0.09
SerGln: 2.587 ± 0.121
SerArg: 1.71 ± 0.096
SerSer: 4.666 ± 0.191
SerThr: 3.657 ± 0.256
SerVal: 3.114 ± 0.114
SerTrp: 0.526 ± 0.048
SerTyr: 2.719 ± 0.125
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.995 ± 0.161
ThrCys: 0.219 ± 0.033
ThrAsp: 2.473 ± 0.141
ThrGlu: 2.815 ± 0.143
ThrPhe: 3.271 ± 0.169
ThrGly: 2.429 ± 0.124
ThrHis: 0.838 ± 0.06
ThrIle: 4.977 ± 0.2
ThrLys: 5.254 ± 0.175
ThrLeu: 6.266 ± 0.251
ThrMet: 0.649 ± 0.06
ThrAsn: 4.846 ± 0.336
ThrPro: 1.859 ± 0.103
ThrGln: 1.767 ± 0.113
ThrArg: 1.719 ± 0.097
ThrSer: 3.758 ± 0.178
ThrThr: 3.491 ± 0.201
ThrVal: 2.776 ± 0.129
ThrTrp: 0.434 ± 0.043
ThrTyr: 2.561 ± 0.135
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.771 ± 0.179
ValCys: 0.219 ± 0.031
ValAsp: 3.447 ± 0.157
ValGlu: 4.113 ± 0.126
ValPhe: 2.85 ± 0.133
ValGly: 2.579 ± 0.138
ValHis: 0.702 ± 0.055
ValIle: 4.771 ± 0.182
ValLys: 5.46 ± 0.16
ValLeu: 5.341 ± 0.18
ValMet: 0.763 ± 0.071
ValAsn: 4.267 ± 0.141
ValPro: 1.43 ± 0.088
ValGln: 1.846 ± 0.085
ValArg: 1.381 ± 0.087
ValSer: 3.464 ± 0.119
ValThr: 2.986 ± 0.185
ValVal: 3.548 ± 0.158
ValTrp: 0.39 ± 0.046
ValTyr: 1.965 ± 0.086
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.474 ± 0.056
TrpCys: 0.026 ± 0.011
TrpAsp: 0.408 ± 0.041
TrpGlu: 0.579 ± 0.056
TrpPhe: 0.5 ± 0.048
TrpGly: 0.303 ± 0.04
TrpHis: 0.053 ± 0.015
TrpIle: 0.794 ± 0.068
TrpLys: 0.824 ± 0.065
TrpLeu: 0.811 ± 0.072
TrpMet: 0.132 ± 0.025
TrpAsn: 0.798 ± 0.056
TrpPro: 0.272 ± 0.038
TrpGln: 0.263 ± 0.036
TrpArg: 0.285 ± 0.038
TrpSer: 0.509 ± 0.05
TrpThr: 0.544 ± 0.062
TrpVal: 0.447 ± 0.049
TrpTrp: 0.105 ± 0.024
TrpTyr: 0.342 ± 0.036
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.337 ± 0.092
TyrCys: 0.145 ± 0.025
TyrAsp: 2.39 ± 0.126
TyrGlu: 2.929 ± 0.117
TyrPhe: 2.399 ± 0.123
TyrGly: 1.916 ± 0.112
TyrHis: 0.539 ± 0.049
TyrIle: 2.798 ± 0.118
TyrLys: 4.157 ± 0.139
TyrLeu: 4.675 ± 0.167
TyrMet: 0.548 ± 0.058
TyrAsn: 2.644 ± 0.11
TyrPro: 1.114 ± 0.069
TyrGln: 1.75 ± 0.091
TyrArg: 1.482 ± 0.081
TyrSer: 2.89 ± 0.104
TyrThr: 1.881 ± 0.088
TyrVal: 2.149 ± 0.097
TyrTrp: 0.478 ± 0.053
TyrTyr: 1.535 ± 0.097
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 614 proteins (228039 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
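The page does not spell out the bootstrap procedure, so the following is only a minimal sketch of one common reading of "bootstrapping at the protein level": whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled proteome, and the spread of those estimates is reported. The function names, the use of the standard deviation as the error measure, and the fixed seed are assumptions made for illustration.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Std. dev. of the per mille frequency of `dipeptide` over resampled proteomes.

    Assumption: the error is the standard deviation across bootstrap rounds.
    """
    rng = random.Random(seed)
    # Count each protein once and reuse the counts in every round.
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[dipeptide] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Made-up toy sequences, not taken from the proteome.
toy = ["MKKLLE", "MKAA", "AKKLMK", "MLLE"]
error_kk = bootstrap_error(toy, "KK")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so the estimate reflects variation between proteins.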

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski