Amino acid dipeptide frequency for Pseudomonas phage VW-6B

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
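As a minimal sketch of such a calculation, the snippet below counts overlapping adjacent residue pairs and reports them in per mille. It assumes pairs are counted within each protein only; the page does not state how pairs at protein boundaries are handled. The function name and the toy sequences are illustrative and are not taken from Proteome-pI.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: overlapping adjacent pairs are counted within each
    protein, never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input in one-letter code (hypothetical, not the phage proteome).
freqs = dipeptide_per_mille(["MAAL", "AAG"])
```

Here the two toy sequences contain five pairs in total, two of which are "AA", so "AA" accounts for two fifths of all pairs.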

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
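The uniform expectation and the red/blue labelling can be sketched as follows. This assumes a plain comparison against 1/400 (or 1/441 when Xaa is included); the exact threshold or tolerance used for colouring on the page is not stated, so the `classify` helper is hypothetical.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (alphabet_size ** 2)


def classify(observed_per_mille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"    # would be marked in red
    if observed_per_mille < expected:
        return "under"   # would be marked in blue
    return "expected"


ala_ala = classify(13.635)  # AlaAla value from the table
ala_cys = classify(0.176)   # AlaCys value from the table
```

With 20 amino acids the expectation is 2.5‰, so AlaAla (13.635‰) comes out as overrepresented and AlaCys (0.176‰) as underrepresented.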

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
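For readers who prefer fractions or percentages, the unit conversion is a one-liner. The helper names below are illustrative only.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala_fraction = per_mille_to_fraction(13.635)  # roughly 0.0136
ala_ala_percent = per_mille_to_percent(13.635)    # roughly 1.36 %
```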

For more information see sequence space article on Wikipedia.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 13.635 ± 2.584 | 0.176 ± 0.114 | 7.037 ± 0.763 | 7.125 ± 0.933 | 2.991 ± 0.446 | 10.38 ± 1.268 | 1.495 ± 0.427 | 6.597 ± 0.605 | 4.486 ± 0.608 | 11.436 ± 1.451 | 2.551 ± 0.444 | 3.607 ± 0.682 | 3.519 ± 0.44 | 5.19 ± 0.802 | 5.894 ± 0.659 | 6.422 ± 1.103 | 6.949 ± 1.16 | 8.181 ± 0.892 | 1.583 ± 0.324 | 2.463 ± 0.486 | 0.0 ± 0.0 |
| Cys | 0.704 ± 0.218 | 0.088 ± 0.081 | 0.616 ± 0.234 | 0.352 ± 0.156 | 0.088 ± 0.078 | 0.616 ± 0.246 | 0.088 ± 0.094 | 0.264 ± 0.15 | 0.264 ± 0.146 | 0.704 ± 0.208 | 0.088 ± 0.093 | 0.264 ± 0.182 | 0.616 ± 0.253 | 0.352 ± 0.162 | 0.616 ± 0.229 | 0.176 ± 0.122 | 0.352 ± 0.168 | 0.528 ± 0.204 | 0.176 ± 0.108 | 0.616 ± 0.243 | 0.0 ± 0.0 |
| Asp | 7.565 ± 0.924 | 0.528 ± 0.238 | 4.398 ± 0.67 | 3.343 ± 0.415 | 2.199 ± 0.485 | 5.894 ± 0.669 | 0.44 ± 0.166 | 2.727 ± 0.603 | 3.871 ± 0.458 | 5.102 ± 0.571 | 1.319 ± 0.397 | 2.023 ± 0.354 | 2.375 ± 0.358 | 1.319 ± 0.32 | 2.639 ± 0.431 | 3.255 ± 0.453 | 3.519 ± 0.697 | 3.343 ± 0.384 | 1.232 ± 0.355 | 1.583 ± 0.368 | 0.0 ± 0.0 |
| Glu | 5.806 ± 0.645 | 0.44 ± 0.173 | 2.287 ± 0.369 | 2.375 ± 0.429 | 2.287 ± 0.451 | 3.255 ± 0.488 | 1.319 ± 0.294 | 3.255 ± 0.555 | 2.903 ± 0.461 | 6.07 ± 0.691 | 1.759 ± 0.358 | 1.935 ± 0.399 | 3.079 ± 0.648 | 2.903 ± 0.487 | 2.815 ± 0.57 | 3.431 ± 0.65 | 3.255 ± 0.435 | 3.695 ± 0.608 | 1.319 ± 0.314 | 2.551 ± 0.395 | 0.0 ± 0.0 |
| Phe | 2.903 ± 0.449 | 0.44 ± 0.159 | 1.495 ± 0.342 | 1.319 ± 0.299 | 1.144 ± 0.283 | 3.343 ± 0.441 | 0.616 ± 0.219 | 1.407 ± 0.397 | 1.495 ± 0.326 | 1.935 ± 0.394 | 0.792 ± 0.195 | 1.232 ± 0.293 | 1.583 ± 0.389 | 0.704 ± 0.309 | 2.111 ± 0.43 | 1.495 ± 0.267 | 2.287 ± 0.399 | 2.023 ± 0.512 | 0.528 ± 0.207 | 0.704 ± 0.275 | 0.0 ± 0.0 |
| Gly | 6.773 ± 0.977 | 0.528 ± 0.234 | 4.926 ± 0.857 | 4.222 ± 0.548 | 2.727 ± 0.397 | 6.422 ± 0.883 | 1.407 ± 0.32 | 4.046 ± 0.791 | 4.486 ± 0.537 | 10.028 ± 1.172 | 2.023 ± 0.364 | 2.815 ± 0.413 | 2.199 ± 0.436 | 3.431 ± 0.48 | 5.19 ± 0.538 | 3.871 ± 0.512 | 5.366 ± 0.853 | 6.158 ± 0.666 | 1.232 ± 0.342 | 2.903 ± 0.719 | 0.0 ± 0.0 |
| His | 1.319 ± 0.363 | 0.0 ± 0.0 | 0.528 ± 0.165 | 0.968 ± 0.238 | 0.44 ± 0.184 | 1.232 ± 0.466 | 0.88 ± 0.359 | 0.792 ± 0.229 | 0.704 ± 0.299 | 1.847 ± 0.426 | 0.528 ± 0.181 | 0.704 ± 0.228 | 0.616 ± 0.205 | 1.144 ± 0.26 | 0.704 ± 0.22 | 1.495 ± 0.385 | 0.616 ± 0.248 | 1.144 ± 0.304 | 0.176 ± 0.12 | 0.704 ± 0.253 | 0.0 ± 0.0 |
| Ile | 5.278 ± 0.63 | 0.352 ± 0.183 | 3.958 ± 0.534 | 2.991 ± 0.485 | 0.88 ± 0.282 | 2.991 ± 0.466 | 0.704 ± 0.236 | 2.111 ± 0.382 | 2.815 ± 0.617 | 2.727 ± 0.476 | 0.528 ± 0.212 | 3.343 ± 0.72 | 1.759 ± 0.382 | 2.111 ± 0.39 | 3.255 ± 0.605 | 4.134 ± 0.644 | 4.134 ± 0.577 | 2.727 ± 0.523 | 0.616 ± 0.262 | 1.232 ± 0.292 | 0.0 ± 0.0 |
| Lys | 5.894 ± 0.827 | 0.176 ± 0.099 | 2.727 ± 0.448 | 2.903 ± 0.566 | 1.319 ± 0.281 | 3.079 ± 0.546 | 0.528 ± 0.212 | 2.639 ± 0.436 | 2.551 ± 0.473 | 4.222 ± 0.589 | 0.968 ± 0.326 | 1.759 ± 0.34 | 2.815 ± 0.595 | 1.495 ± 0.457 | 3.167 ± 0.614 | 3.167 ± 0.676 | 3.607 ± 0.428 | 2.463 ± 0.4 | 0.352 ± 0.196 | 1.232 ± 0.239 | 0.0 ± 0.0 |
| Leu | 10.996 ± 1.323 | 1.056 ± 0.315 | 5.542 ± 0.558 | 5.542 ± 0.745 | 2.639 ± 0.455 | 7.037 ± 0.896 | 1.056 ± 0.214 | 4.486 ± 0.519 | 4.75 ± 0.794 | 8.709 ± 0.929 | 1.759 ± 0.346 | 3.871 ± 0.47 | 4.662 ± 0.716 | 5.454 ± 0.747 | 5.806 ± 0.711 | 7.125 ± 0.786 | 7.829 ± 1.111 | 5.366 ± 0.585 | 1.319 ± 0.458 | 2.375 ± 0.391 | 0.0 ± 0.0 |
| Met | 2.991 ± 0.468 | 0.176 ± 0.128 | 1.319 ± 0.311 | 1.232 ± 0.322 | 0.88 ± 0.325 | 1.583 ± 0.458 | 0.352 ± 0.149 | 0.528 ± 0.234 | 1.232 ± 0.34 | 2.463 ± 0.444 | 0.528 ± 0.16 | 0.88 ± 0.28 | 1.232 ± 0.218 | 0.88 ± 0.262 | 0.88 ± 0.188 | 1.319 ± 0.265 | 1.847 ± 0.354 | 0.88 ± 0.264 | 0.176 ± 0.119 | 0.44 ± 0.191 | 0.0 ± 0.0 |
| Asn | 4.134 ± 0.634 | 0.528 ± 0.204 | 1.935 ± 0.381 | 1.671 ± 0.372 | 0.88 ± 0.24 | 3.255 ± 0.668 | 0.616 ± 0.193 | 2.023 ± 0.452 | 2.287 ± 0.432 | 3.958 ± 0.651 | 0.88 ± 0.327 | 1.495 ± 0.416 | 2.199 ± 0.335 | 1.671 ± 0.378 | 2.023 ± 0.402 | 1.583 ± 0.38 | 2.727 ± 0.506 | 1.759 ± 0.434 | 0.616 ± 0.217 | 1.144 ± 0.304 | 0.0 ± 0.0 |
| Pro | 6.158 ± 1.183 | 0.352 ± 0.162 | 2.727 ± 0.455 | 4.486 ± 0.789 | 0.968 ± 0.307 | 2.639 ± 0.519 | 0.352 ± 0.198 | 1.847 ± 0.422 | 1.847 ± 0.344 | 3.431 ± 0.619 | 0.528 ± 0.224 | 1.583 ± 0.276 | 1.583 ± 0.404 | 1.935 ± 0.489 | 2.287 ± 0.432 | 3.079 ± 0.488 | 2.639 ± 0.418 | 3.958 ± 0.568 | 0.352 ± 0.169 | 1.671 ± 0.405 | 0.0 ± 0.0 |
| Gln | 5.894 ± 0.717 | 0.352 ± 0.177 | 2.287 ± 0.404 | 2.199 ± 0.409 | 1.232 ± 0.341 | 2.991 ± 0.397 | 1.407 ± 0.277 | 1.407 ± 0.235 | 1.583 ± 0.511 | 5.63 ± 0.631 | 0.968 ± 0.369 | 1.319 ± 0.385 | 1.935 ± 0.451 | 3.255 ± 0.557 | 3.079 ± 0.421 | 2.727 ± 0.566 | 2.023 ± 0.494 | 3.958 ± 0.468 | 1.056 ± 0.36 | 0.792 ± 0.24 | 0.0 ± 0.0 |
| Arg | 5.806 ± 0.593 | 0.528 ± 0.174 | 2.903 ± 0.534 | 2.375 ± 0.406 | 2.023 ± 0.494 | 4.662 ± 0.785 | 1.056 ± 0.347 | 3.783 ± 0.52 | 2.551 ± 0.584 | 6.597 ± 0.681 | 1.056 ± 0.329 | 1.319 ± 0.3 | 3.079 ± 0.51 | 3.695 ± 0.633 | 4.486 ± 0.728 | 3.431 ± 0.496 | 3.519 ± 0.566 | 2.903 ± 0.38 | 0.88 ± 0.252 | 2.199 ± 0.373 | 0.0 ± 0.0 |
| Ser | 7.213 ± 0.939 | 0.44 ± 0.162 | 3.607 ± 0.548 | 3.519 ± 0.544 | 1.495 ± 0.313 | 6.422 ± 1.084 | 1.056 ± 0.345 | 2.199 ± 0.467 | 2.287 ± 0.486 | 6.51 ± 0.687 | 1.232 ± 0.281 | 2.023 ± 0.422 | 2.991 ± 0.49 | 2.815 ± 0.498 | 3.871 ± 0.77 | 3.519 ± 0.492 | 4.574 ± 0.819 | 3.695 ± 0.645 | 0.704 ± 0.241 | 1.407 ± 0.361 | 0.0 ± 0.0 |
| Thr | 7.829 ± 1.555 | 0.176 ± 0.12 | 5.014 ± 0.756 | 4.046 ± 0.544 | 2.111 ± 0.411 | 6.422 ± 0.681 | 1.232 ± 0.408 | 2.551 ± 0.403 | 2.287 ± 0.445 | 5.63 ± 0.92 | 1.407 ± 0.316 | 2.727 ± 0.519 | 3.343 ± 0.487 | 3.167 ± 0.534 | 3.519 ± 0.453 | 3.783 ± 0.688 | 4.134 ± 0.633 | 5.014 ± 0.673 | 1.232 ± 0.352 | 0.968 ± 0.325 | 0.0 ± 0.0 |
| Val | 7.389 ± 0.794 | 0.528 ± 0.183 | 2.727 ± 0.566 | 3.871 ± 0.648 | 1.847 ± 0.371 | 5.014 ± 0.627 | 0.704 ± 0.245 | 3.958 ± 0.534 | 2.991 ± 0.515 | 5.278 ± 0.59 | 1.935 ± 0.353 | 3.431 ± 0.633 | 2.727 ± 0.454 | 2.639 ± 0.43 | 3.343 ± 0.56 | 4.574 ± 0.609 | 5.014 ± 0.71 | 4.574 ± 0.606 | 1.495 ± 0.323 | 1.232 ± 0.36 | 0.0 ± 0.0 |
| Trp | 1.319 ± 0.326 | 0.44 ± 0.191 | 1.056 ± 0.284 | 0.88 ± 0.305 | 0.704 ± 0.225 | 0.88 ± 0.301 | 0.792 ± 0.268 | 0.616 ± 0.216 | 0.616 ± 0.239 | 1.847 ± 0.363 | 0.352 ± 0.165 | 0.264 ± 0.14 | 0.704 ± 0.288 | 0.528 ± 0.22 | 0.704 ± 0.192 | 1.319 ± 0.316 | 0.704 ± 0.292 | 1.144 ± 0.316 | 0.704 ± 0.222 | 0.264 ± 0.137 | 0.0 ± 0.0 |
| Tyr | 2.111 ± 0.44 | 0.264 ± 0.164 | 1.583 ± 0.329 | 1.319 ± 0.323 | 0.704 ± 0.246 | 2.639 ± 0.574 | 0.44 ± 0.237 | 1.407 ± 0.338 | 1.056 ± 0.277 | 3.255 ± 0.593 | 0.528 ± 0.169 | 0.792 ± 0.308 | 1.407 ± 0.292 | 1.407 ± 0.486 | 2.463 ± 0.509 | 1.759 ± 0.466 | 1.495 ± 0.337 | 1.759 ± 0.38 | 0.088 ± 0.07 | 1.056 ± 0.338 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 46 proteins (11369 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
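A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those values is reported. This is only an illustration under stated assumptions: the standard deviation of the replicates is used as the error, pairs are counted within proteins, and the function names, seed, and toy sequences are hypothetical rather than the database's actual procedure.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(proteins):
    """Count overlapping adjacent residue pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    return counts


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the error of one dipeptide frequency (per mille).

    Assumption: the error is the sample standard deviation over
    n_rounds resamples of whole proteins drawn with replacement.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        counts = pair_counts(sample)
        total = sum(counts.values())
        estimates.append(1000.0 * counts[pair] / total if total else 0.0)
    return stdev(estimates)


# Toy proteome in one-letter code (hypothetical data).
toy = ["MAAL", "AAG", "GLLK", "MKT"]
aa_error = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much a frequency depends on which proteins happen to be in the proteome.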

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski