Amino acid dipepetide frequency for Stenotrophomonas phage Mendera

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.807AlaAla: 6.807 ± 0.454
0.719AlaCys: 0.719 ± 0.121
4.791AlaAsp: 4.791 ± 0.331
4.91AlaGlu: 4.91 ± 0.391
3.094AlaPhe: 3.094 ± 0.237
5.35AlaGly: 5.35 ± 0.363
1.138AlaHis: 1.138 ± 0.148
5.05AlaIle: 5.05 ± 0.337
4.032AlaLys: 4.032 ± 0.268
6.807AlaLeu: 6.807 ± 0.365
2.196AlaMet: 2.196 ± 0.229
3.413AlaAsn: 3.413 ± 0.315
3.094AlaPro: 3.094 ± 0.319
2.814AlaGln: 2.814 ± 0.225
4.371AlaArg: 4.371 ± 0.28
4.85AlaSer: 4.85 ± 0.359
4.491AlaThr: 4.491 ± 0.402
5.03AlaVal: 5.03 ± 0.38
1.377AlaTrp: 1.377 ± 0.182
2.994AlaTyr: 2.994 ± 0.294
0.0AlaXaa: 0.0 ± 0.0
Cys
0.719CysAla: 0.719 ± 0.102
0.2CysCys: 0.2 ± 0.072
0.639CysAsp: 0.639 ± 0.103
0.559CysGlu: 0.559 ± 0.098
0.399CysPhe: 0.399 ± 0.089
0.739CysGly: 0.739 ± 0.117
0.339CysHis: 0.339 ± 0.075
0.519CysIle: 0.519 ± 0.106
0.579CysLys: 0.579 ± 0.099
0.679CysLeu: 0.679 ± 0.135
0.619CysMet: 0.619 ± 0.125
0.539CysAsn: 0.539 ± 0.096
0.699CysPro: 0.699 ± 0.116
0.419CysGln: 0.419 ± 0.083
0.559CysArg: 0.559 ± 0.103
0.778CysSer: 0.778 ± 0.146
0.659CysThr: 0.659 ± 0.134
0.739CysVal: 0.739 ± 0.116
0.22CysTrp: 0.22 ± 0.062
0.399CysTyr: 0.399 ± 0.104
0.0CysXaa: 0.0 ± 0.0
Asp
5.369AspAla: 5.369 ± 0.359
0.519AspCys: 0.519 ± 0.089
3.852AspAsp: 3.852 ± 0.308
4.751AspGlu: 4.751 ± 0.358
2.954AspPhe: 2.954 ± 0.268
4.99AspGly: 4.99 ± 0.335
1.098AspHis: 1.098 ± 0.166
3.174AspIle: 3.174 ± 0.251
3.673AspLys: 3.673 ± 0.321
6.148AspLeu: 6.148 ± 0.369
1.956AspMet: 1.956 ± 0.209
2.395AspAsn: 2.395 ± 0.199
3.513AspPro: 3.513 ± 0.277
2.415AspGln: 2.415 ± 0.231
3.333AspArg: 3.333 ± 0.239
4.012AspSer: 4.012 ± 0.339
3.234AspThr: 3.234 ± 0.31
4.531AspVal: 4.531 ± 0.288
1.218AspTrp: 1.218 ± 0.149
2.495AspTyr: 2.495 ± 0.211
0.0AspXaa: 0.0 ± 0.0
Glu
5.609GluAla: 5.609 ± 0.388
0.699GluCys: 0.699 ± 0.115
4.332GluAsp: 4.332 ± 0.388
5.21GluGlu: 5.21 ± 0.477
2.934GluPhe: 2.934 ± 0.257
4.252GluGly: 4.252 ± 0.321
1.377GluHis: 1.377 ± 0.17
4.591GluIle: 4.591 ± 0.348
4.052GluLys: 4.052 ± 0.306
6.587GluLeu: 6.587 ± 0.427
2.036GluMet: 2.036 ± 0.23
2.994GluAsn: 2.994 ± 0.258
2.156GluPro: 2.156 ± 0.202
2.715GluGln: 2.715 ± 0.247
3.353GluArg: 3.353 ± 0.289
3.553GluSer: 3.553 ± 0.302
3.314GluThr: 3.314 ± 0.253
4.591GluVal: 4.591 ± 0.271
1.537GluTrp: 1.537 ± 0.18
2.834GluTyr: 2.834 ± 0.205
0.0GluXaa: 0.0 ± 0.0
Phe
2.715PheAla: 2.715 ± 0.215
0.539PheCys: 0.539 ± 0.111
3.613PheAsp: 3.613 ± 0.311
2.415PheGlu: 2.415 ± 0.211
1.597PhePhe: 1.597 ± 0.201
2.974PheGly: 2.974 ± 0.298
0.858PheHis: 0.858 ± 0.147
2.555PheIle: 2.555 ± 0.26
2.914PheLys: 2.914 ± 0.238
2.775PheLeu: 2.775 ± 0.269
1.238PheMet: 1.238 ± 0.138
2.096PheAsn: 2.096 ± 0.209
1.238PhePro: 1.238 ± 0.188
1.856PheGln: 1.856 ± 0.214
2.555PheArg: 2.555 ± 0.271
2.256PheSer: 2.256 ± 0.215
2.595PheThr: 2.595 ± 0.237
2.775PheVal: 2.775 ± 0.233
0.719PheTrp: 0.719 ± 0.141
1.617PheTyr: 1.617 ± 0.165
0.0PheXaa: 0.0 ± 0.0
Gly
5.01GlyAla: 5.01 ± 0.382
0.818GlyCys: 0.818 ± 0.13
4.152GlyAsp: 4.152 ± 0.273
4.292GlyGlu: 4.292 ± 0.344
2.755GlyPhe: 2.755 ± 0.23
4.711GlyGly: 4.711 ± 0.31
1.138GlyHis: 1.138 ± 0.179
3.613GlyIle: 3.613 ± 0.296
4.052GlyLys: 4.052 ± 0.307
5.29GlyLeu: 5.29 ± 0.323
1.896GlyMet: 1.896 ± 0.191
3.254GlyAsn: 3.254 ± 0.298
1.657GlyPro: 1.657 ± 0.198
2.415GlyGln: 2.415 ± 0.21
2.834GlyArg: 2.834 ± 0.24
4.052GlySer: 4.052 ± 0.343
4.751GlyThr: 4.751 ± 0.447
4.731GlyVal: 4.731 ± 0.291
1.617GlyTrp: 1.617 ± 0.174
3.194GlyTyr: 3.194 ± 0.303
0.0GlyXaa: 0.0 ± 0.0
His
1.337HisAla: 1.337 ± 0.177
0.24HisCys: 0.24 ± 0.07
1.118HisAsp: 1.118 ± 0.168
1.098HisGlu: 1.098 ± 0.181
1.018HisPhe: 1.018 ± 0.177
1.377HisGly: 1.377 ± 0.191
0.359HisHis: 0.359 ± 0.102
0.838HisIle: 0.838 ± 0.134
1.218HisLys: 1.218 ± 0.171
1.637HisLeu: 1.637 ± 0.211
0.559HisMet: 0.559 ± 0.133
0.739HisAsn: 0.739 ± 0.11
0.818HisPro: 0.818 ± 0.141
0.539HisGln: 0.539 ± 0.105
1.158HisArg: 1.158 ± 0.162
0.838HisSer: 0.838 ± 0.118
0.838HisThr: 0.838 ± 0.124
1.218HisVal: 1.218 ± 0.167
0.2HisTrp: 0.2 ± 0.067
0.539HisTyr: 0.539 ± 0.109
0.0HisXaa: 0.0 ± 0.0
Ile
4.531IleAla: 4.531 ± 0.347
0.579IleCys: 0.579 ± 0.122
4.132IleAsp: 4.132 ± 0.306
4.791IleGlu: 4.791 ± 0.324
1.796IlePhe: 1.796 ± 0.187
3.753IleGly: 3.753 ± 0.323
0.998IleHis: 0.998 ± 0.137
3.094IleIle: 3.094 ± 0.239
4.471IleLys: 4.471 ± 0.331
4.611IleLeu: 4.611 ± 0.376
1.357IleMet: 1.357 ± 0.162
2.974IleAsn: 2.974 ± 0.281
3.074IlePro: 3.074 ± 0.325
2.196IleGln: 2.196 ± 0.211
3.813IleArg: 3.813 ± 0.288
3.333IleSer: 3.333 ± 0.28
3.713IleThr: 3.713 ± 0.337
3.793IleVal: 3.793 ± 0.296
0.679IleTrp: 0.679 ± 0.094
1.916IleTyr: 1.916 ± 0.204
0.0IleXaa: 0.0 ± 0.0
Lys
5.09LysAla: 5.09 ± 0.306
0.599LysCys: 0.599 ± 0.112
3.373LysAsp: 3.373 ± 0.253
3.992LysGlu: 3.992 ± 0.327
2.335LysPhe: 2.335 ± 0.21
3.234LysGly: 3.234 ± 0.294
1.078LysHis: 1.078 ± 0.172
4.491LysIle: 4.491 ± 0.309
4.312LysLys: 4.312 ± 0.347
4.99LysLeu: 4.99 ± 0.372
2.276LysMet: 2.276 ± 0.212
2.635LysAsn: 2.635 ± 0.215
2.814LysPro: 2.814 ± 0.279
2.355LysGln: 2.355 ± 0.235
3.513LysArg: 3.513 ± 0.277
2.974LysSer: 2.974 ± 0.22
3.254LysThr: 3.254 ± 0.257
4.132LysVal: 4.132 ± 0.255
1.098LysTrp: 1.098 ± 0.162
2.375LysTyr: 2.375 ± 0.256
0.0LysXaa: 0.0 ± 0.0
Leu
5.948LeuAla: 5.948 ± 0.311
0.838LeuCys: 0.838 ± 0.142
6.108LeuAsp: 6.108 ± 0.313
6.707LeuGlu: 6.707 ± 0.458
2.795LeuPhe: 2.795 ± 0.24
4.691LeuGly: 4.691 ± 0.284
1.777LeuHis: 1.777 ± 0.21
4.831LeuIle: 4.831 ± 0.353
5.17LeuLys: 5.17 ± 0.387
5.529LeuLeu: 5.529 ± 0.362
2.216LeuMet: 2.216 ± 0.271
4.451LeuAsn: 4.451 ± 0.294
3.114LeuPro: 3.114 ± 0.251
3.154LeuGln: 3.154 ± 0.264
4.272LeuArg: 4.272 ± 0.306
5.01LeuSer: 5.01 ± 0.358
5.21LeuThr: 5.21 ± 0.32
5.01LeuVal: 5.01 ± 0.295
0.998LeuTrp: 0.998 ± 0.135
2.635LeuTyr: 2.635 ± 0.215
0.0LeuXaa: 0.0 ± 0.0
Met
2.555MetAla: 2.555 ± 0.2
0.24MetCys: 0.24 ± 0.065
1.737MetAsp: 1.737 ± 0.183
1.637MetGlu: 1.637 ± 0.182
1.198MetPhe: 1.198 ± 0.158
1.317MetGly: 1.317 ± 0.152
0.339MetHis: 0.339 ± 0.082
1.657MetIle: 1.657 ± 0.178
1.737MetLys: 1.737 ± 0.237
1.936MetLeu: 1.936 ± 0.229
0.898MetMet: 0.898 ± 0.151
1.936MetAsn: 1.936 ± 0.202
1.078MetPro: 1.078 ± 0.114
1.118MetGln: 1.118 ± 0.165
1.397MetArg: 1.397 ± 0.183
2.355MetSer: 2.355 ± 0.189
2.296MetThr: 2.296 ± 0.223
1.577MetVal: 1.577 ± 0.186
0.339MetTrp: 0.339 ± 0.093
1.078MetTyr: 1.078 ± 0.141
0.0MetXaa: 0.0 ± 0.0
Asn
3.234AsnAla: 3.234 ± 0.283
0.479AsnCys: 0.479 ± 0.098
2.715AsnAsp: 2.715 ± 0.224
2.735AsnGlu: 2.735 ± 0.261
2.156AsnPhe: 2.156 ± 0.233
4.272AsnGly: 4.272 ± 0.382
0.599AsnHis: 0.599 ± 0.111
2.894AsnIle: 2.894 ± 0.232
3.054AsnLys: 3.054 ± 0.3
3.832AsnLeu: 3.832 ± 0.274
1.337AsnMet: 1.337 ± 0.162
2.236AsnAsn: 2.236 ± 0.266
2.755AsnPro: 2.755 ± 0.227
1.697AsnGln: 1.697 ± 0.215
2.595AsnArg: 2.595 ± 0.201
2.715AsnSer: 2.715 ± 0.245
2.535AsnThr: 2.535 ± 0.238
3.114AsnVal: 3.114 ± 0.253
0.778AsnTrp: 0.778 ± 0.124
1.537AsnTyr: 1.537 ± 0.17
0.0AsnXaa: 0.0 ± 0.0
Pro
3.174ProAla: 3.174 ± 0.285
0.359ProCys: 0.359 ± 0.091
3.134ProAsp: 3.134 ± 0.271
2.934ProGlu: 2.934 ± 0.267
1.976ProPhe: 1.976 ± 0.173
3.054ProGly: 3.054 ± 0.281
0.399ProHis: 0.399 ± 0.087
2.355ProIle: 2.355 ± 0.211
2.335ProLys: 2.335 ± 0.223
2.595ProLeu: 2.595 ± 0.216
0.918ProMet: 0.918 ± 0.129
2.076ProAsn: 2.076 ± 0.207
1.437ProPro: 1.437 ± 0.18
1.417ProGln: 1.417 ± 0.157
1.737ProArg: 1.737 ± 0.198
2.395ProSer: 2.395 ± 0.199
2.894ProThr: 2.894 ± 0.332
3.274ProVal: 3.274 ± 0.283
0.579ProTrp: 0.579 ± 0.108
1.836ProTyr: 1.836 ± 0.186
0.0ProXaa: 0.0 ± 0.0
Gln
3.174GlnAla: 3.174 ± 0.264
0.419GlnCys: 0.419 ± 0.089
2.276GlnAsp: 2.276 ± 0.194
2.535GlnGlu: 2.535 ± 0.231
2.076GlnPhe: 2.076 ± 0.188
1.816GlnGly: 1.816 ± 0.196
0.599GlnHis: 0.599 ± 0.106
2.775GlnIle: 2.775 ± 0.231
2.076GlnLys: 2.076 ± 0.219
3.214GlnLeu: 3.214 ± 0.251
1.238GlnMet: 1.238 ± 0.175
1.697GlnAsn: 1.697 ± 0.221
1.377GlnPro: 1.377 ± 0.147
1.537GlnGln: 1.537 ± 0.186
2.176GlnArg: 2.176 ± 0.214
1.996GlnSer: 1.996 ± 0.221
2.176GlnThr: 2.176 ± 0.19
2.914GlnVal: 2.914 ± 0.251
0.759GlnTrp: 0.759 ± 0.147
1.457GlnTyr: 1.457 ± 0.178
0.0GlnXaa: 0.0 ± 0.0
Arg
3.852ArgAla: 3.852 ± 0.239
0.759ArgCys: 0.759 ± 0.152
3.952ArgAsp: 3.952 ± 0.324
3.433ArgGlu: 3.433 ± 0.281
2.415ArgPhe: 2.415 ± 0.257
3.433ArgGly: 3.433 ± 0.27
0.778ArgHis: 0.778 ± 0.134
3.294ArgIle: 3.294 ± 0.233
3.054ArgLys: 3.054 ± 0.279
4.611ArgLeu: 4.611 ± 0.281
1.258ArgMet: 1.258 ± 0.178
2.176ArgAsn: 2.176 ± 0.248
1.856ArgPro: 1.856 ± 0.201
2.276ArgGln: 2.276 ± 0.223
3.254ArgArg: 3.254 ± 0.28
3.054ArgSer: 3.054 ± 0.257
2.735ArgThr: 2.735 ± 0.25
3.793ArgVal: 3.793 ± 0.256
0.778ArgTrp: 0.778 ± 0.145
2.256ArgTyr: 2.256 ± 0.224
0.0ArgXaa: 0.0 ± 0.0
Ser
4.451SerAla: 4.451 ± 0.291
0.858SerCys: 0.858 ± 0.144
3.653SerAsp: 3.653 ± 0.258
4.032SerGlu: 4.032 ± 0.315
2.575SerPhe: 2.575 ± 0.246
4.531SerGly: 4.531 ± 0.324
0.998SerHis: 0.998 ± 0.149
3.274SerIle: 3.274 ± 0.27
3.373SerLys: 3.373 ± 0.297
4.87SerLeu: 4.87 ± 0.32
1.477SerMet: 1.477 ± 0.143
2.595SerAsn: 2.595 ± 0.251
2.415SerPro: 2.415 ± 0.217
2.495SerGln: 2.495 ± 0.269
2.755SerArg: 2.755 ± 0.264
3.733SerSer: 3.733 ± 0.356
3.693SerThr: 3.693 ± 0.337
3.932SerVal: 3.932 ± 0.291
1.138SerTrp: 1.138 ± 0.163
2.256SerTyr: 2.256 ± 0.213
0.0SerXaa: 0.0 ± 0.0
Thr
5.15ThrAla: 5.15 ± 0.473
0.639ThrCys: 0.639 ± 0.116
3.553ThrAsp: 3.553 ± 0.245
3.932ThrGlu: 3.932 ± 0.3
3.134ThrPhe: 3.134 ± 0.284
4.431ThrGly: 4.431 ± 0.498
1.018ThrHis: 1.018 ± 0.145
3.353ThrIle: 3.353 ± 0.214
3.174ThrLys: 3.174 ± 0.278
5.09ThrLeu: 5.09 ± 0.345
1.497ThrMet: 1.497 ± 0.237
2.675ThrAsn: 2.675 ± 0.273
3.214ThrPro: 3.214 ± 0.305
1.936ThrGln: 1.936 ± 0.216
2.775ThrArg: 2.775 ± 0.209
3.633ThrSer: 3.633 ± 0.362
3.473ThrThr: 3.473 ± 0.37
4.332ThrVal: 4.332 ± 0.294
1.158ThrTrp: 1.158 ± 0.144
2.315ThrTyr: 2.315 ± 0.282
0.0ThrXaa: 0.0 ± 0.0
Val
4.591ValAla: 4.591 ± 0.299
0.898ValCys: 0.898 ± 0.133
4.91ValAsp: 4.91 ± 0.342
4.85ValGlu: 4.85 ± 0.292
2.395ValPhe: 2.395 ± 0.214
4.351ValGly: 4.351 ± 0.394
1.377ValHis: 1.377 ± 0.157
4.232ValIle: 4.232 ± 0.292
4.391ValLys: 4.391 ± 0.307
4.711ValLeu: 4.711 ± 0.321
1.796ValMet: 1.796 ± 0.212
3.274ValAsn: 3.274 ± 0.283
2.814ValPro: 2.814 ± 0.271
2.635ValGln: 2.635 ± 0.226
3.373ValArg: 3.373 ± 0.257
4.312ValSer: 4.312 ± 0.343
4.711ValThr: 4.711 ± 0.328
4.651ValVal: 4.651 ± 0.328
0.838ValTrp: 0.838 ± 0.12
2.415ValTyr: 2.415 ± 0.259
0.0ValXaa: 0.0 ± 0.0
Trp
1.218TrpAla: 1.218 ± 0.186
0.18TrpCys: 0.18 ± 0.062
0.858TrpAsp: 0.858 ± 0.138
1.258TrpGlu: 1.258 ± 0.161
0.838TrpPhe: 0.838 ± 0.146
0.918TrpGly: 0.918 ± 0.151
0.459TrpHis: 0.459 ± 0.103
0.938TrpIle: 0.938 ± 0.123
0.898TrpLys: 0.898 ± 0.138
1.477TrpLeu: 1.477 ± 0.161
0.459TrpMet: 0.459 ± 0.103
0.978TrpAsn: 0.978 ± 0.129
0.24TrpPro: 0.24 ± 0.068
0.539TrpGln: 0.539 ± 0.096
1.198TrpArg: 1.198 ± 0.161
1.058TrpSer: 1.058 ± 0.141
1.238TrpThr: 1.238 ± 0.166
0.958TrpVal: 0.958 ± 0.134
0.339TrpTrp: 0.339 ± 0.072
0.938TrpTyr: 0.938 ± 0.132
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.795TyrAla: 2.795 ± 0.266
0.419TyrCys: 0.419 ± 0.084
2.735TyrAsp: 2.735 ± 0.23
2.655TyrGlu: 2.655 ± 0.263
1.457TyrPhe: 1.457 ± 0.169
2.036TyrGly: 2.036 ± 0.204
0.958TyrHis: 0.958 ± 0.158
2.076TyrIle: 2.076 ± 0.198
2.355TyrLys: 2.355 ± 0.217
3.074TyrLeu: 3.074 ± 0.255
1.178TyrMet: 1.178 ± 0.153
2.136TyrAsn: 2.136 ± 0.218
1.537TyrPro: 1.537 ± 0.166
1.757TyrGln: 1.757 ± 0.187
1.996TyrArg: 1.996 ± 0.231
2.196TyrSer: 2.196 ± 0.209
2.675TyrThr: 2.675 ± 0.291
2.435TyrVal: 2.435 ± 0.256
0.619TyrTrp: 0.619 ± 0.127
1.457TyrTyr: 1.457 ± 0.146
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 286 proteins (50099 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski