Amino acid dipepetide frequency for Microbacterium phage Floof

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
21.777AlaAla: 21.777 ± 1.945
0.334AlaCys: 0.334 ± 0.119
8.35AlaAsp: 8.35 ± 0.734
9.085AlaGlu: 9.085 ± 1.008
3.674AlaPhe: 3.674 ± 0.543
9.018AlaGly: 9.018 ± 0.906
2.806AlaHis: 2.806 ± 0.359
5.878AlaIle: 5.878 ± 0.899
5.01AlaLys: 5.01 ± 0.59
12.826AlaLeu: 12.826 ± 1.331
3.34AlaMet: 3.34 ± 0.593
3.34AlaAsn: 3.34 ± 0.588
6.346AlaPro: 6.346 ± 0.745
3.34AlaGln: 3.34 ± 0.506
10.354AlaArg: 10.354 ± 1.138
8.283AlaSer: 8.283 ± 0.851
8.484AlaThr: 8.484 ± 0.946
8.283AlaVal: 8.283 ± 0.611
2.338AlaTrp: 2.338 ± 0.425
3.14AlaTyr: 3.14 ± 0.516
0.0AlaXaa: 0.0 ± 0.0
Cys
0.601CysAla: 0.601 ± 0.193
0.0CysCys: 0.0 ± 0.0
0.134CysAsp: 0.134 ± 0.092
0.267CysGlu: 0.267 ± 0.155
0.134CysPhe: 0.134 ± 0.105
0.735CysGly: 0.735 ± 0.227
0.2CysHis: 0.2 ± 0.121
0.334CysIle: 0.334 ± 0.151
0.134CysLys: 0.134 ± 0.109
0.267CysLeu: 0.267 ± 0.192
0.067CysMet: 0.067 ± 0.077
0.067CysAsn: 0.067 ± 0.062
0.468CysPro: 0.468 ± 0.184
0.468CysGln: 0.468 ± 0.16
0.267CysArg: 0.267 ± 0.141
0.334CysSer: 0.334 ± 0.163
0.134CysThr: 0.134 ± 0.083
0.267CysVal: 0.267 ± 0.128
0.0CysTrp: 0.0 ± 0.0
0.134CysTyr: 0.134 ± 0.084
0.0CysXaa: 0.0 ± 0.0
Asp
9.285AspAla: 9.285 ± 0.729
0.134AspCys: 0.134 ± 0.101
3.54AspAsp: 3.54 ± 0.628
3.741AspGlu: 3.741 ± 0.592
1.536AspPhe: 1.536 ± 0.281
5.745AspGly: 5.745 ± 0.685
1.202AspHis: 1.202 ± 0.315
2.538AspIle: 2.538 ± 0.387
1.804AspLys: 1.804 ± 0.352
6.346AspLeu: 6.346 ± 0.742
1.336AspMet: 1.336 ± 0.319
1.136AspAsn: 1.136 ± 0.229
4.342AspPro: 4.342 ± 0.66
1.536AspGln: 1.536 ± 0.263
5.21AspArg: 5.21 ± 0.644
2.939AspSer: 2.939 ± 0.416
4.075AspThr: 4.075 ± 0.596
3.073AspVal: 3.073 ± 0.521
1.136AspTrp: 1.136 ± 0.236
1.737AspTyr: 1.737 ± 0.353
0.0AspXaa: 0.0 ± 0.0
Glu
8.684GluAla: 8.684 ± 0.841
0.534GluCys: 0.534 ± 0.247
4.075GluAsp: 4.075 ± 0.52
4.075GluGlu: 4.075 ± 0.7
1.536GluPhe: 1.536 ± 0.42
5.21GluGly: 5.21 ± 0.63
1.202GluHis: 1.202 ± 0.272
1.87GluIle: 1.87 ± 0.347
2.004GluLys: 2.004 ± 0.356
6.68GluLeu: 6.68 ± 0.572
0.935GluMet: 0.935 ± 0.256
1.202GluAsn: 1.202 ± 0.218
1.87GluPro: 1.87 ± 0.39
1.603GluGln: 1.603 ± 0.311
4.743GluArg: 4.743 ± 0.58
1.67GluSer: 1.67 ± 0.374
4.676GluThr: 4.676 ± 0.658
4.743GluVal: 4.743 ± 0.524
0.868GluTrp: 0.868 ± 0.258
0.868GluTyr: 0.868 ± 0.216
0.0GluXaa: 0.0 ± 0.0
Phe
3.674PheAla: 3.674 ± 0.503
0.134PheCys: 0.134 ± 0.092
2.004PheAsp: 2.004 ± 0.409
1.603PheGlu: 1.603 ± 0.298
1.069PhePhe: 1.069 ± 0.403
3.34PheGly: 3.34 ± 0.539
0.735PheHis: 0.735 ± 0.213
1.202PheIle: 1.202 ± 0.252
0.2PheLys: 0.2 ± 0.099
2.672PheLeu: 2.672 ± 0.514
0.735PheMet: 0.735 ± 0.265
0.868PheAsn: 0.868 ± 0.249
2.004PhePro: 2.004 ± 0.423
1.136PheGln: 1.136 ± 0.259
2.138PheArg: 2.138 ± 0.375
1.403PheSer: 1.403 ± 0.306
2.605PheThr: 2.605 ± 0.536
1.87PheVal: 1.87 ± 0.298
0.134PheTrp: 0.134 ± 0.091
0.802PheTyr: 0.802 ± 0.185
0.0PheXaa: 0.0 ± 0.0
Gly
7.949GlyAla: 7.949 ± 0.554
0.401GlyCys: 0.401 ± 0.147
5.21GlyAsp: 5.21 ± 0.587
3.941GlyGlu: 3.941 ± 0.517
3.073GlyPhe: 3.073 ± 0.495
5.678GlyGly: 5.678 ± 0.829
1.403GlyHis: 1.403 ± 0.358
4.943GlyIle: 4.943 ± 0.976
3.741GlyLys: 3.741 ± 0.585
7.415GlyLeu: 7.415 ± 0.605
1.202GlyMet: 1.202 ± 0.265
1.937GlyAsn: 1.937 ± 0.431
3.407GlyPro: 3.407 ± 0.496
2.872GlyGln: 2.872 ± 0.515
5.077GlyArg: 5.077 ± 0.708
4.075GlySer: 4.075 ± 0.638
6.546GlyThr: 6.546 ± 1.013
6.48GlyVal: 6.48 ± 0.55
1.937GlyTrp: 1.937 ± 0.371
2.605GlyTyr: 2.605 ± 0.433
0.0GlyXaa: 0.0 ± 0.0
His
2.004HisAla: 2.004 ± 0.312
0.134HisCys: 0.134 ± 0.086
1.603HisAsp: 1.603 ± 0.34
1.737HisGlu: 1.737 ± 0.377
0.868HisPhe: 0.868 ± 0.284
1.603HisGly: 1.603 ± 0.333
0.668HisHis: 0.668 ± 0.206
0.802HisIle: 0.802 ± 0.258
0.534HisLys: 0.534 ± 0.214
2.405HisLeu: 2.405 ± 0.382
0.534HisMet: 0.534 ± 0.181
0.534HisAsn: 0.534 ± 0.196
1.47HisPro: 1.47 ± 0.334
0.534HisGln: 0.534 ± 0.2
0.935HisArg: 0.935 ± 0.222
1.136HisSer: 1.136 ± 0.323
1.136HisThr: 1.136 ± 0.251
1.136HisVal: 1.136 ± 0.213
0.468HisTrp: 0.468 ± 0.158
0.401HisTyr: 0.401 ± 0.151
0.0HisXaa: 0.0 ± 0.0
Ile
4.943IleAla: 4.943 ± 0.647
0.334IleCys: 0.334 ± 0.204
2.271IleAsp: 2.271 ± 0.443
2.472IleGlu: 2.472 ± 0.49
1.269IlePhe: 1.269 ± 0.298
3.407IleGly: 3.407 ± 0.474
0.935IleHis: 0.935 ± 0.195
1.737IleIle: 1.737 ± 0.42
1.536IleLys: 1.536 ± 0.326
3.206IleLeu: 3.206 ± 0.572
0.601IleMet: 0.601 ± 0.207
0.868IleAsn: 0.868 ± 0.241
2.138IlePro: 2.138 ± 0.339
1.403IleGln: 1.403 ± 0.301
3.607IleArg: 3.607 ± 0.484
2.605IleSer: 2.605 ± 0.422
3.941IleThr: 3.941 ± 0.473
3.407IleVal: 3.407 ± 0.705
0.935IleTrp: 0.935 ± 0.379
0.802IleTyr: 0.802 ± 0.244
0.0IleXaa: 0.0 ± 0.0
Lys
4.275LysAla: 4.275 ± 0.611
0.267LysCys: 0.267 ± 0.127
2.071LysAsp: 2.071 ± 0.35
2.338LysGlu: 2.338 ± 0.386
0.401LysPhe: 0.401 ± 0.163
2.605LysGly: 2.605 ± 0.438
0.601LysHis: 0.601 ± 0.158
1.002LysIle: 1.002 ± 0.236
1.603LysLys: 1.603 ± 0.328
3.54LysLeu: 3.54 ± 0.534
0.534LysMet: 0.534 ± 0.188
0.601LysAsn: 0.601 ± 0.225
2.338LysPro: 2.338 ± 0.382
0.735LysGln: 0.735 ± 0.244
3.407LysArg: 3.407 ± 0.503
2.004LysSer: 2.004 ± 0.356
2.271LysThr: 2.271 ± 0.408
2.405LysVal: 2.405 ± 0.469
0.267LysTrp: 0.267 ± 0.138
0.668LysTyr: 0.668 ± 0.19
0.0LysXaa: 0.0 ± 0.0
Leu
14.162LeuAla: 14.162 ± 1.196
0.601LeuCys: 0.601 ± 0.259
4.81LeuAsp: 4.81 ± 0.566
4.008LeuGlu: 4.008 ± 0.539
1.536LeuPhe: 1.536 ± 0.336
6.68LeuGly: 6.68 ± 0.791
2.004LeuHis: 2.004 ± 0.481
3.941LeuIle: 3.941 ± 0.475
3.206LeuLys: 3.206 ± 0.454
6.68LeuLeu: 6.68 ± 0.812
0.868LeuMet: 0.868 ± 0.202
2.806LeuAsn: 2.806 ± 0.487
6.079LeuPro: 6.079 ± 0.638
3.14LeuGln: 3.14 ± 0.377
7.081LeuArg: 7.081 ± 0.703
5.878LeuSer: 5.878 ± 0.614
7.281LeuThr: 7.281 ± 0.685
5.678LeuVal: 5.678 ± 0.503
1.536LeuTrp: 1.536 ± 0.389
1.603LeuTyr: 1.603 ± 0.274
0.0LeuXaa: 0.0 ± 0.0
Met
3.206MetAla: 3.206 ± 0.419
0.0MetCys: 0.0 ± 0.0
0.935MetAsp: 0.935 ± 0.256
0.668MetGlu: 0.668 ± 0.228
0.334MetPhe: 0.334 ± 0.144
1.069MetGly: 1.069 ± 0.267
0.334MetHis: 0.334 ± 0.15
0.802MetIle: 0.802 ± 0.211
0.401MetLys: 0.401 ± 0.165
1.002MetLeu: 1.002 ± 0.253
0.267MetMet: 0.267 ± 0.124
0.334MetAsn: 0.334 ± 0.164
1.269MetPro: 1.269 ± 0.273
0.534MetGln: 0.534 ± 0.198
1.403MetArg: 1.403 ± 0.3
1.87MetSer: 1.87 ± 0.348
2.605MetThr: 2.605 ± 0.511
0.802MetVal: 0.802 ± 0.224
0.134MetTrp: 0.134 ± 0.096
0.2MetTyr: 0.2 ± 0.118
0.0MetXaa: 0.0 ± 0.0
Asn
3.607AsnAla: 3.607 ± 0.465
0.067AsnCys: 0.067 ± 0.06
1.336AsnAsp: 1.336 ± 0.337
0.935AsnGlu: 0.935 ± 0.248
0.601AsnPhe: 0.601 ± 0.157
3.474AsnGly: 3.474 ± 0.593
0.468AsnHis: 0.468 ± 0.191
1.136AsnIle: 1.136 ± 0.414
0.534AsnLys: 0.534 ± 0.219
2.338AsnLeu: 2.338 ± 0.488
0.267AsnMet: 0.267 ± 0.113
0.468AsnAsn: 0.468 ± 0.183
1.336AsnPro: 1.336 ± 0.237
0.735AsnGln: 0.735 ± 0.202
1.136AsnArg: 1.136 ± 0.298
1.136AsnSer: 1.136 ± 0.256
1.336AsnThr: 1.336 ± 0.293
1.804AsnVal: 1.804 ± 0.478
0.334AsnTrp: 0.334 ± 0.119
0.668AsnTyr: 0.668 ± 0.224
0.0AsnXaa: 0.0 ± 0.0
Pro
7.482ProAla: 7.482 ± 0.908
0.267ProCys: 0.267 ± 0.14
3.474ProAsp: 3.474 ± 0.55
4.008ProGlu: 4.008 ± 0.579
1.737ProPhe: 1.737 ± 0.409
4.743ProGly: 4.743 ± 0.439
1.603ProHis: 1.603 ± 0.285
2.538ProIle: 2.538 ± 0.448
1.737ProLys: 1.737 ± 0.451
4.81ProLeu: 4.81 ± 0.637
1.136ProMet: 1.136 ± 0.231
1.269ProAsn: 1.269 ± 0.234
2.138ProPro: 2.138 ± 0.474
1.536ProGln: 1.536 ± 0.397
2.939ProArg: 2.939 ± 0.655
4.008ProSer: 4.008 ± 0.644
3.674ProThr: 3.674 ± 0.71
4.275ProVal: 4.275 ± 0.69
0.735ProTrp: 0.735 ± 0.287
1.269ProTyr: 1.269 ± 0.312
0.0ProXaa: 0.0 ± 0.0
Gln
4.609GlnAla: 4.609 ± 0.636
0.334GlnCys: 0.334 ± 0.129
1.603GlnAsp: 1.603 ± 0.303
1.804GlnGlu: 1.804 ± 0.273
0.2GlnPhe: 0.2 ± 0.108
2.538GlnGly: 2.538 ± 0.43
0.802GlnHis: 0.802 ± 0.229
0.668GlnIle: 0.668 ± 0.223
0.868GlnLys: 0.868 ± 0.213
3.34GlnLeu: 3.34 ± 0.533
0.468GlnMet: 0.468 ± 0.135
0.334GlnAsn: 0.334 ± 0.147
1.737GlnPro: 1.737 ± 0.405
0.802GlnGln: 0.802 ± 0.258
3.407GlnArg: 3.407 ± 0.597
1.202GlnSer: 1.202 ± 0.288
1.937GlnThr: 1.937 ± 0.276
2.004GlnVal: 2.004 ± 0.329
0.401GlnTrp: 0.401 ± 0.188
0.601GlnTyr: 0.601 ± 0.177
0.0GlnXaa: 0.0 ± 0.0
Arg
9.085ArgAla: 9.085 ± 1.147
0.334ArgCys: 0.334 ± 0.147
5.745ArgAsp: 5.745 ± 0.735
4.609ArgGlu: 4.609 ± 0.756
3.54ArgPhe: 3.54 ± 0.555
4.075ArgGly: 4.075 ± 0.527
1.069ArgHis: 1.069 ± 0.257
2.939ArgIle: 2.939 ± 0.472
4.342ArgLys: 4.342 ± 0.603
6.814ArgLeu: 6.814 ± 0.723
1.804ArgMet: 1.804 ± 0.313
1.603ArgAsn: 1.603 ± 0.225
4.208ArgPro: 4.208 ± 0.712
2.405ArgGln: 2.405 ± 0.459
6.68ArgArg: 6.68 ± 0.822
3.874ArgSer: 3.874 ± 0.665
4.342ArgThr: 4.342 ± 0.502
4.476ArgVal: 4.476 ± 0.634
1.136ArgTrp: 1.136 ± 0.312
2.138ArgTyr: 2.138 ± 0.373
0.0ArgXaa: 0.0 ± 0.0
Ser
7.749SerAla: 7.749 ± 0.781
0.067SerCys: 0.067 ± 0.077
3.607SerAsp: 3.607 ± 0.704
3.474SerGlu: 3.474 ± 0.449
2.405SerPhe: 2.405 ± 0.477
5.077SerGly: 5.077 ± 0.659
1.47SerHis: 1.47 ± 0.425
1.937SerIle: 1.937 ± 0.421
1.47SerLys: 1.47 ± 0.34
3.741SerLeu: 3.741 ± 0.532
1.603SerMet: 1.603 ± 0.295
1.269SerAsn: 1.269 ± 0.31
3.474SerPro: 3.474 ± 0.465
1.336SerGln: 1.336 ± 0.379
4.542SerArg: 4.542 ± 0.584
3.206SerSer: 3.206 ± 0.592
4.275SerThr: 4.275 ± 0.661
4.542SerVal: 4.542 ± 0.588
1.47SerTrp: 1.47 ± 0.312
1.269SerTyr: 1.269 ± 0.243
0.0SerXaa: 0.0 ± 0.0
Thr
8.617ThrAla: 8.617 ± 0.794
0.401ThrCys: 0.401 ± 0.173
5.21ThrAsp: 5.21 ± 0.633
3.34ThrGlu: 3.34 ± 0.602
3.14ThrPhe: 3.14 ± 0.617
5.611ThrGly: 5.611 ± 1.05
0.935ThrHis: 0.935 ± 0.324
3.006ThrIle: 3.006 ± 0.517
1.937ThrLys: 1.937 ± 0.3
5.678ThrLeu: 5.678 ± 0.573
0.668ThrMet: 0.668 ± 0.197
1.87ThrAsn: 1.87 ± 0.449
6.012ThrPro: 6.012 ± 0.77
1.87ThrGln: 1.87 ± 0.294
4.542ThrArg: 4.542 ± 0.743
4.943ThrSer: 4.943 ± 0.557
5.945ThrThr: 5.945 ± 0.791
5.945ThrVal: 5.945 ± 0.772
1.136ThrTrp: 1.136 ± 0.294
1.87ThrTyr: 1.87 ± 0.333
0.0ThrXaa: 0.0 ± 0.0
Val
9.285ValAla: 9.285 ± 1.147
0.401ValCys: 0.401 ± 0.175
3.607ValAsp: 3.607 ± 0.414
4.075ValGlu: 4.075 ± 0.609
2.204ValPhe: 2.204 ± 0.402
4.743ValGly: 4.743 ± 0.751
1.47ValHis: 1.47 ± 0.396
3.874ValIle: 3.874 ± 0.429
2.271ValLys: 2.271 ± 0.461
6.212ValLeu: 6.212 ± 0.73
1.202ValMet: 1.202 ± 0.254
1.937ValAsn: 1.937 ± 0.315
3.474ValPro: 3.474 ± 0.431
1.937ValGln: 1.937 ± 0.351
4.542ValArg: 4.542 ± 0.563
5.077ValSer: 5.077 ± 0.691
4.476ValThr: 4.476 ± 0.618
4.208ValVal: 4.208 ± 0.601
1.202ValTrp: 1.202 ± 0.376
1.937ValTyr: 1.937 ± 0.362
0.0ValXaa: 0.0 ± 0.0
Trp
2.338TrpAla: 2.338 ± 0.367
0.134TrpCys: 0.134 ± 0.089
1.47TrpAsp: 1.47 ± 0.229
1.536TrpGlu: 1.536 ± 0.342
0.134TrpPhe: 0.134 ± 0.084
1.87TrpGly: 1.87 ± 0.389
0.334TrpHis: 0.334 ± 0.143
0.334TrpIle: 0.334 ± 0.14
0.134TrpLys: 0.134 ± 0.102
1.403TrpLeu: 1.403 ± 0.334
0.067TrpMet: 0.067 ± 0.066
0.868TrpAsn: 0.868 ± 0.482
0.534TrpPro: 0.534 ± 0.195
0.802TrpGln: 0.802 ± 0.207
1.202TrpArg: 1.202 ± 0.306
0.935TrpSer: 0.935 ± 0.263
1.269TrpThr: 1.269 ± 0.267
1.002TrpVal: 1.002 ± 0.248
0.134TrpTrp: 0.134 ± 0.1
0.067TrpTyr: 0.067 ± 0.061
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.739TyrAla: 2.739 ± 0.409
0.134TyrCys: 0.134 ± 0.099
1.336TyrAsp: 1.336 ± 0.311
1.336TyrGlu: 1.336 ± 0.313
0.935TyrPhe: 0.935 ± 0.227
2.672TyrGly: 2.672 ± 0.497
0.401TyrHis: 0.401 ± 0.162
0.935TyrIle: 0.935 ± 0.234
0.468TyrLys: 0.468 ± 0.174
2.204TyrLeu: 2.204 ± 0.382
0.468TyrMet: 0.468 ± 0.191
0.401TyrAsn: 0.401 ± 0.154
0.735TyrPro: 0.735 ± 0.238
0.935TyrGln: 0.935 ± 0.228
2.004TyrArg: 2.004 ± 0.396
1.403TyrSer: 1.403 ± 0.308
1.67TyrThr: 1.67 ± 0.297
1.737TyrVal: 1.737 ± 0.392
0.267TyrTrp: 0.267 ± 0.123
0.468TyrTyr: 0.468 ± 0.169
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 80 proteins (14971 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski