Amino acid dipepetide frequency for Microcystis phage MaeS

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.455AlaAla: 4.455 ± 0.429
0.294AlaCys: 0.294 ± 0.117
3.362AlaAsp: 3.362 ± 0.34
4.833AlaGlu: 4.833 ± 0.707
2.311AlaPhe: 2.311 ± 0.456
4.244AlaGly: 4.244 ± 0.616
0.967AlaHis: 0.967 ± 0.195
4.791AlaIle: 4.791 ± 0.392
5.001AlaLys: 5.001 ± 0.439
4.917AlaLeu: 4.917 ± 0.498
1.261AlaMet: 1.261 ± 0.339
3.656AlaAsn: 3.656 ± 0.439
1.933AlaPro: 1.933 ± 0.347
3.488AlaGln: 3.488 ± 0.627
2.648AlaArg: 2.648 ± 0.348
3.362AlaSer: 3.362 ± 0.506
4.917AlaThr: 4.917 ± 0.749
3.362AlaVal: 3.362 ± 0.359
0.798AlaTrp: 0.798 ± 0.186
2.101AlaTyr: 2.101 ± 0.32
0.0AlaXaa: 0.0 ± 0.0
Cys
0.378CysAla: 0.378 ± 0.137
0.084CysCys: 0.084 ± 0.063
0.84CysAsp: 0.84 ± 0.256
0.714CysGlu: 0.714 ± 0.174
0.378CysPhe: 0.378 ± 0.13
1.093CysGly: 1.093 ± 0.327
0.21CysHis: 0.21 ± 0.104
0.294CysIle: 0.294 ± 0.108
0.42CysLys: 0.42 ± 0.144
0.42CysLeu: 0.42 ± 0.13
0.126CysMet: 0.126 ± 0.077
0.714CysAsn: 0.714 ± 0.191
0.546CysPro: 0.546 ± 0.173
0.378CysGln: 0.378 ± 0.141
0.378CysArg: 0.378 ± 0.123
1.051CysSer: 1.051 ± 0.191
0.294CysThr: 0.294 ± 0.108
0.378CysVal: 0.378 ± 0.156
0.084CysTrp: 0.084 ± 0.056
0.546CysTyr: 0.546 ± 0.187
0.0CysXaa: 0.0 ± 0.0
Asp
3.362AspAla: 3.362 ± 0.312
0.42AspCys: 0.42 ± 0.155
3.278AspAsp: 3.278 ± 0.485
5.001AspGlu: 5.001 ± 0.473
2.984AspPhe: 2.984 ± 0.44
4.244AspGly: 4.244 ± 0.57
1.009AspHis: 1.009 ± 0.188
4.581AspIle: 4.581 ± 0.44
4.665AspLys: 4.665 ± 0.485
5.127AspLeu: 5.127 ± 0.547
1.051AspMet: 1.051 ± 0.213
3.362AspAsn: 3.362 ± 0.372
2.227AspPro: 2.227 ± 0.364
1.471AspGln: 1.471 ± 0.229
2.69AspArg: 2.69 ± 0.428
3.278AspSer: 3.278 ± 0.504
2.942AspThr: 2.942 ± 0.404
4.16AspVal: 4.16 ± 0.485
0.63AspTrp: 0.63 ± 0.176
3.068AspTyr: 3.068 ± 0.418
0.0AspXaa: 0.0 ± 0.0
Glu
4.413GluAla: 4.413 ± 0.469
0.84GluCys: 0.84 ± 0.196
3.614GluAsp: 3.614 ± 0.496
8.153GluGlu: 8.153 ± 0.83
3.404GluPhe: 3.404 ± 0.434
4.244GluGly: 4.244 ± 0.485
1.093GluHis: 1.093 ± 0.234
6.093GluIle: 6.093 ± 0.649
7.312GluLys: 7.312 ± 0.732
7.943GluLeu: 7.943 ± 0.698
2.9GluMet: 2.9 ± 0.32
4.244GluAsn: 4.244 ± 0.511
1.513GluPro: 1.513 ± 0.259
3.572GluGln: 3.572 ± 0.331
3.362GluArg: 3.362 ± 0.356
3.866GluSer: 3.866 ± 0.423
3.824GluThr: 3.824 ± 0.399
6.093GluVal: 6.093 ± 0.528
0.63GluTrp: 0.63 ± 0.194
3.278GluTyr: 3.278 ± 0.471
0.0GluXaa: 0.0 ± 0.0
Phe
2.143PheAla: 2.143 ± 0.326
0.462PheCys: 0.462 ± 0.179
2.563PheAsp: 2.563 ± 0.328
3.824PheGlu: 3.824 ± 0.416
1.219PhePhe: 1.219 ± 0.255
2.605PheGly: 2.605 ± 0.483
0.546PheHis: 0.546 ± 0.147
2.9PheIle: 2.9 ± 0.419
2.984PheLys: 2.984 ± 0.295
2.774PheLeu: 2.774 ± 0.418
0.967PheMet: 0.967 ± 0.211
2.648PheAsn: 2.648 ± 0.316
1.429PhePro: 1.429 ± 0.282
1.261PheGln: 1.261 ± 0.225
1.933PheArg: 1.933 ± 0.305
2.353PheSer: 2.353 ± 0.221
3.026PheThr: 3.026 ± 0.525
2.395PheVal: 2.395 ± 0.323
0.126PheTrp: 0.126 ± 0.08
1.975PheTyr: 1.975 ± 0.251
0.0PheXaa: 0.0 ± 0.0
Gly
3.278GlyAla: 3.278 ± 0.501
0.714GlyCys: 0.714 ± 0.188
3.698GlyAsp: 3.698 ± 0.434
3.614GlyGlu: 3.614 ± 0.348
2.353GlyPhe: 2.353 ± 0.406
4.455GlyGly: 4.455 ± 0.449
1.345GlyHis: 1.345 ± 0.269
4.413GlyIle: 4.413 ± 0.413
5.337GlyLys: 5.337 ± 0.403
4.833GlyLeu: 4.833 ± 0.38
1.387GlyMet: 1.387 ± 0.214
3.74GlyAsn: 3.74 ± 0.563
0.63GlyPro: 0.63 ± 0.217
2.648GlyGln: 2.648 ± 0.389
2.858GlyArg: 2.858 ± 0.277
5.001GlySer: 5.001 ± 0.705
5.337GlyThr: 5.337 ± 0.922
3.95GlyVal: 3.95 ± 0.405
0.672GlyTrp: 0.672 ± 0.166
3.53GlyTyr: 3.53 ± 0.364
0.0GlyXaa: 0.0 ± 0.0
His
0.967HisAla: 0.967 ± 0.202
0.294HisCys: 0.294 ± 0.118
1.303HisAsp: 1.303 ± 0.236
1.303HisGlu: 1.303 ± 0.272
1.009HisPhe: 1.009 ± 0.23
0.883HisGly: 0.883 ± 0.192
0.42HisHis: 0.42 ± 0.127
1.345HisIle: 1.345 ± 0.269
1.219HisLys: 1.219 ± 0.258
1.555HisLeu: 1.555 ± 0.308
0.21HisMet: 0.21 ± 0.081
0.925HisAsn: 0.925 ± 0.208
0.63HisPro: 0.63 ± 0.155
0.84HisGln: 0.84 ± 0.173
0.672HisArg: 0.672 ± 0.168
1.009HisSer: 1.009 ± 0.177
0.84HisThr: 0.84 ± 0.192
1.135HisVal: 1.135 ± 0.226
0.168HisTrp: 0.168 ± 0.097
0.798HisTyr: 0.798 ± 0.188
0.0HisXaa: 0.0 ± 0.0
Ile
4.791IleAla: 4.791 ± 0.422
0.546IleCys: 0.546 ± 0.169
5.001IleAsp: 5.001 ± 0.489
5.379IleGlu: 5.379 ± 0.629
2.269IlePhe: 2.269 ± 0.387
4.244IleGly: 4.244 ± 0.44
1.429IleHis: 1.429 ± 0.269
4.497IleIle: 4.497 ± 0.564
6.43IleLys: 6.43 ± 0.589
4.37IleLeu: 4.37 ± 0.48
1.513IleMet: 1.513 ± 0.24
4.286IleAsn: 4.286 ± 0.398
2.059IlePro: 2.059 ± 0.325
2.774IleGln: 2.774 ± 0.3
3.152IleArg: 3.152 ± 0.394
4.202IleSer: 4.202 ± 0.491
5.127IleThr: 5.127 ± 0.648
3.908IleVal: 3.908 ± 0.451
0.84IleTrp: 0.84 ± 0.164
2.227IleTyr: 2.227 ± 0.263
0.0IleXaa: 0.0 ± 0.0
Lys
5.463LysAla: 5.463 ± 0.619
0.883LysCys: 0.883 ± 0.293
4.833LysAsp: 4.833 ± 0.509
7.943LysGlu: 7.943 ± 0.881
3.992LysPhe: 3.992 ± 0.42
5.001LysGly: 5.001 ± 0.556
1.891LysHis: 1.891 ± 0.376
5.463LysIle: 5.463 ± 0.485
7.564LysLys: 7.564 ± 0.738
6.43LysLeu: 6.43 ± 0.602
2.521LysMet: 2.521 ± 0.353
3.866LysAsn: 3.866 ± 0.388
2.227LysPro: 2.227 ± 0.353
3.572LysGln: 3.572 ± 0.381
4.539LysArg: 4.539 ± 0.467
4.581LysSer: 4.581 ± 0.442
3.782LysThr: 3.782 ± 0.667
5.757LysVal: 5.757 ± 0.509
1.009LysTrp: 1.009 ± 0.199
3.404LysTyr: 3.404 ± 0.425
0.0LysXaa: 0.0 ± 0.0
Leu
5.673LeuAla: 5.673 ± 0.71
0.546LeuCys: 0.546 ± 0.175
5.043LeuAsp: 5.043 ± 0.398
6.556LeuGlu: 6.556 ± 0.735
3.194LeuPhe: 3.194 ± 0.351
4.37LeuGly: 4.37 ± 0.438
1.093LeuHis: 1.093 ± 0.204
5.085LeuIle: 5.085 ± 0.492
7.018LeuLys: 7.018 ± 0.538
5.505LeuLeu: 5.505 ± 0.583
1.681LeuMet: 1.681 ± 0.268
4.707LeuAsn: 4.707 ± 0.416
2.227LeuPro: 2.227 ± 0.333
3.026LeuGln: 3.026 ± 0.31
3.278LeuArg: 3.278 ± 0.399
5.085LeuSer: 5.085 ± 0.404
5.211LeuThr: 5.211 ± 0.518
4.875LeuVal: 4.875 ± 0.459
0.883LeuTrp: 0.883 ± 0.221
2.395LeuTyr: 2.395 ± 0.384
0.0LeuXaa: 0.0 ± 0.0
Met
2.227MetAla: 2.227 ± 0.346
0.252MetCys: 0.252 ± 0.101
1.429MetAsp: 1.429 ± 0.281
2.395MetGlu: 2.395 ± 0.387
1.135MetPhe: 1.135 ± 0.239
1.261MetGly: 1.261 ± 0.442
0.378MetHis: 0.378 ± 0.119
2.017MetIle: 2.017 ± 0.318
2.984MetLys: 2.984 ± 0.388
1.639MetLeu: 1.639 ± 0.294
0.462MetMet: 0.462 ± 0.143
1.597MetAsn: 1.597 ± 0.282
0.588MetPro: 0.588 ± 0.182
0.925MetGln: 0.925 ± 0.236
0.756MetArg: 0.756 ± 0.185
1.849MetSer: 1.849 ± 0.285
1.219MetThr: 1.219 ± 0.17
1.051MetVal: 1.051 ± 0.206
0.042MetTrp: 0.042 ± 0.04
0.588MetTyr: 0.588 ± 0.162
0.0MetXaa: 0.0 ± 0.0
Asn
3.824AsnAla: 3.824 ± 0.578
0.798AsnCys: 0.798 ± 0.205
3.404AsnAsp: 3.404 ± 0.341
5.001AsnGlu: 5.001 ± 0.534
2.059AsnPhe: 2.059 ± 0.324
4.413AsnGly: 4.413 ± 0.536
1.471AsnHis: 1.471 ± 0.246
3.278AsnIle: 3.278 ± 0.559
5.379AsnLys: 5.379 ± 0.545
4.118AsnLeu: 4.118 ± 0.418
1.303AsnMet: 1.303 ± 0.231
4.118AsnAsn: 4.118 ± 0.516
2.227AsnPro: 2.227 ± 0.376
2.648AsnGln: 2.648 ± 0.417
2.984AsnArg: 2.984 ± 0.443
3.11AsnSer: 3.11 ± 0.356
2.563AsnThr: 2.563 ± 0.359
3.236AsnVal: 3.236 ± 0.523
0.756AsnTrp: 0.756 ± 0.231
2.395AsnTyr: 2.395 ± 0.32
0.0AsnXaa: 0.0 ± 0.0
Pro
1.933ProAla: 1.933 ± 0.322
0.294ProCys: 0.294 ± 0.117
2.143ProAsp: 2.143 ± 0.298
2.563ProGlu: 2.563 ± 0.35
1.597ProPhe: 1.597 ± 0.365
0.084ProGly: 0.084 ± 0.055
0.336ProHis: 0.336 ± 0.146
1.807ProIle: 1.807 ± 0.268
2.143ProLys: 2.143 ± 0.338
2.648ProLeu: 2.648 ± 0.302
1.093ProMet: 1.093 ± 0.239
1.555ProAsn: 1.555 ± 0.28
0.504ProPro: 0.504 ± 0.162
1.597ProGln: 1.597 ± 0.314
1.009ProArg: 1.009 ± 0.204
2.227ProSer: 2.227 ± 0.348
1.849ProThr: 1.849 ± 0.308
1.975ProVal: 1.975 ± 0.297
0.294ProTrp: 0.294 ± 0.103
1.261ProTyr: 1.261 ± 0.225
0.0ProXaa: 0.0 ± 0.0
Gln
2.185GlnAla: 2.185 ± 0.408
0.462GlnCys: 0.462 ± 0.134
1.891GlnAsp: 1.891 ± 0.301
3.236GlnGlu: 3.236 ± 0.395
1.345GlnPhe: 1.345 ± 0.242
3.11GlnGly: 3.11 ± 0.414
0.63GlnHis: 0.63 ± 0.187
3.152GlnIle: 3.152 ± 0.357
3.026GlnLys: 3.026 ± 0.406
3.824GlnLeu: 3.824 ± 0.552
1.639GlnMet: 1.639 ± 0.293
2.185GlnAsn: 2.185 ± 0.387
0.925GlnPro: 0.925 ± 0.211
1.597GlnGln: 1.597 ± 0.429
2.521GlnArg: 2.521 ± 0.33
1.681GlnSer: 1.681 ± 0.236
2.269GlnThr: 2.269 ± 0.396
2.648GlnVal: 2.648 ± 0.308
0.21GlnTrp: 0.21 ± 0.093
1.555GlnTyr: 1.555 ± 0.201
0.0GlnXaa: 0.0 ± 0.0
Arg
2.311ArgAla: 2.311 ± 0.341
0.336ArgCys: 0.336 ± 0.139
3.026ArgAsp: 3.026 ± 0.359
3.152ArgGlu: 3.152 ± 0.398
1.891ArgPhe: 1.891 ± 0.309
2.479ArgGly: 2.479 ± 0.307
0.588ArgHis: 0.588 ± 0.169
3.32ArgIle: 3.32 ± 0.447
5.253ArgLys: 5.253 ± 0.703
3.404ArgLeu: 3.404 ± 0.409
1.429ArgMet: 1.429 ± 0.252
2.816ArgAsn: 2.816 ± 0.451
1.303ArgPro: 1.303 ± 0.261
1.387ArgGln: 1.387 ± 0.273
1.975ArgArg: 1.975 ± 0.308
1.849ArgSer: 1.849 ± 0.253
3.026ArgThr: 3.026 ± 0.332
3.236ArgVal: 3.236 ± 0.355
0.462ArgTrp: 0.462 ± 0.132
1.765ArgTyr: 1.765 ± 0.234
0.0ArgXaa: 0.0 ± 0.0
Ser
3.866SerAla: 3.866 ± 0.647
0.42SerCys: 0.42 ± 0.116
3.152SerAsp: 3.152 ± 0.466
3.908SerGlu: 3.908 ± 0.398
1.807SerPhe: 1.807 ± 0.282
4.623SerGly: 4.623 ± 0.811
0.883SerHis: 0.883 ± 0.222
3.782SerIle: 3.782 ± 0.323
4.917SerLys: 4.917 ± 0.479
4.791SerLeu: 4.791 ± 0.517
1.345SerMet: 1.345 ± 0.229
3.614SerAsn: 3.614 ± 0.395
2.185SerPro: 2.185 ± 0.329
2.269SerGln: 2.269 ± 0.294
2.605SerArg: 2.605 ± 0.377
3.824SerSer: 3.824 ± 0.544
3.572SerThr: 3.572 ± 0.506
4.034SerVal: 4.034 ± 0.427
1.219SerTrp: 1.219 ± 0.339
2.521SerTyr: 2.521 ± 0.306
0.0SerXaa: 0.0 ± 0.0
Thr
4.076ThrAla: 4.076 ± 0.942
0.546ThrCys: 0.546 ± 0.163
3.74ThrAsp: 3.74 ± 0.67
3.866ThrGlu: 3.866 ± 0.443
2.395ThrPhe: 2.395 ± 0.347
5.127ThrGly: 5.127 ± 1.024
1.051ThrHis: 1.051 ± 0.218
4.37ThrIle: 4.37 ± 0.528
4.244ThrLys: 4.244 ± 0.422
5.631ThrLeu: 5.631 ± 0.618
1.513ThrMet: 1.513 ± 0.277
3.572ThrAsn: 3.572 ± 0.513
2.521ThrPro: 2.521 ± 0.39
1.933ThrGln: 1.933 ± 0.297
2.353ThrArg: 2.353 ± 0.334
3.362ThrSer: 3.362 ± 0.724
4.286ThrThr: 4.286 ± 0.744
4.37ThrVal: 4.37 ± 0.652
0.84ThrTrp: 0.84 ± 0.221
2.648ThrTyr: 2.648 ± 0.442
0.0ThrXaa: 0.0 ± 0.0
Val
4.286ValAla: 4.286 ± 0.44
0.63ValCys: 0.63 ± 0.187
4.875ValAsp: 4.875 ± 0.5
4.833ValGlu: 4.833 ± 0.477
2.269ValPhe: 2.269 ± 0.261
3.95ValGly: 3.95 ± 0.454
1.177ValHis: 1.177 ± 0.259
4.581ValIle: 4.581 ± 0.397
5.295ValLys: 5.295 ± 0.669
3.53ValLeu: 3.53 ± 0.341
1.513ValMet: 1.513 ± 0.279
3.824ValAsn: 3.824 ± 0.364
2.017ValPro: 2.017 ± 0.289
2.269ValGln: 2.269 ± 0.302
3.068ValArg: 3.068 ± 0.275
3.992ValSer: 3.992 ± 0.409
4.707ValThr: 4.707 ± 0.547
3.95ValVal: 3.95 ± 0.415
0.714ValTrp: 0.714 ± 0.179
2.311ValTyr: 2.311 ± 0.226
0.0ValXaa: 0.0 ± 0.0
Trp
0.462TrpAla: 0.462 ± 0.139
0.084TrpCys: 0.084 ± 0.053
0.378TrpAsp: 0.378 ± 0.158
0.714TrpGlu: 0.714 ± 0.177
0.588TrpPhe: 0.588 ± 0.144
0.546TrpGly: 0.546 ± 0.161
0.42TrpHis: 0.42 ± 0.142
0.967TrpIle: 0.967 ± 0.238
0.714TrpLys: 0.714 ± 0.138
1.135TrpLeu: 1.135 ± 0.223
0.252TrpMet: 0.252 ± 0.106
0.756TrpAsn: 0.756 ± 0.169
0.126TrpPro: 0.126 ± 0.062
0.546TrpGln: 0.546 ± 0.138
0.294TrpArg: 0.294 ± 0.109
0.588TrpSer: 0.588 ± 0.148
1.135TrpThr: 1.135 ± 0.267
0.714TrpVal: 0.714 ± 0.271
0.084TrpTrp: 0.084 ± 0.069
0.63TrpTyr: 0.63 ± 0.147
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.605TyrAla: 2.605 ± 0.327
0.42TyrCys: 0.42 ± 0.117
1.975TyrAsp: 1.975 ± 0.296
3.404TyrGlu: 3.404 ± 0.474
1.933TyrPhe: 1.933 ± 0.349
2.816TyrGly: 2.816 ± 0.399
0.588TyrHis: 0.588 ± 0.174
2.311TyrIle: 2.311 ± 0.428
2.816TyrLys: 2.816 ± 0.449
2.858TyrLeu: 2.858 ± 0.483
0.588TyrMet: 0.588 ± 0.164
2.9TyrAsn: 2.9 ± 0.381
1.093TyrPro: 1.093 ± 0.229
1.891TyrGln: 1.891 ± 0.29
1.933TyrArg: 1.933 ± 0.264
3.068TyrSer: 3.068 ± 0.315
2.521TyrThr: 2.521 ± 0.355
2.563TyrVal: 2.563 ± 0.286
0.672TyrTrp: 0.672 ± 0.184
1.807TyrTyr: 1.807 ± 0.396
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 113 proteins (23797 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski