Amino acid dipepetide frequency for Vibrio phage NF

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.376AlaAla: 4.376 ± 1.308
0.461AlaCys: 0.461 ± 0.18
3.762AlaAsp: 3.762 ± 0.575
3.838AlaGlu: 3.838 ± 0.515
1.996AlaPhe: 1.996 ± 0.318
5.144AlaGly: 5.144 ± 0.767
1.075AlaHis: 1.075 ± 0.311
6.065AlaIle: 6.065 ± 0.638
4.99AlaLys: 4.99 ± 0.736
5.067AlaLeu: 5.067 ± 0.581
2.917AlaMet: 2.917 ± 0.644
4.836AlaAsn: 4.836 ± 0.788
1.919AlaPro: 1.919 ± 0.409
2.61AlaGln: 2.61 ± 0.414
3.071AlaArg: 3.071 ± 0.486
4.913AlaSer: 4.913 ± 0.652
4.376AlaThr: 4.376 ± 0.841
4.606AlaVal: 4.606 ± 0.595
1.842AlaTrp: 1.842 ± 0.44
2.38AlaTyr: 2.38 ± 0.427
0.0AlaXaa: 0.0 ± 0.0
Cys
0.921CysAla: 0.921 ± 0.251
0.307CysCys: 0.307 ± 0.182
0.844CysAsp: 0.844 ± 0.237
1.535CysGlu: 1.535 ± 0.411
0.307CysPhe: 0.307 ± 0.159
2.457CysGly: 2.457 ± 0.463
0.307CysHis: 0.307 ± 0.153
0.844CysIle: 0.844 ± 0.259
1.075CysLys: 1.075 ± 0.353
1.228CysLeu: 1.228 ± 0.368
0.307CysMet: 0.307 ± 0.166
0.768CysAsn: 0.768 ± 0.244
0.154CysPro: 0.154 ± 0.107
0.384CysGln: 0.384 ± 0.182
0.384CysArg: 0.384 ± 0.16
0.384CysSer: 0.384 ± 0.168
0.691CysThr: 0.691 ± 0.223
0.307CysVal: 0.307 ± 0.171
0.384CysTrp: 0.384 ± 0.159
0.691CysTyr: 0.691 ± 0.205
0.0CysXaa: 0.0 ± 0.0
Asp
4.913AspAla: 4.913 ± 0.786
0.768AspCys: 0.768 ± 0.202
4.299AspAsp: 4.299 ± 0.639
4.529AspGlu: 4.529 ± 0.61
3.301AspPhe: 3.301 ± 0.538
6.602AspGly: 6.602 ± 0.998
1.075AspHis: 1.075 ± 0.275
4.069AspIle: 4.069 ± 0.51
4.299AspLys: 4.299 ± 0.655
5.681AspLeu: 5.681 ± 0.565
2.073AspMet: 2.073 ± 0.499
2.687AspAsn: 2.687 ± 0.361
2.994AspPro: 2.994 ± 0.523
1.919AspGln: 1.919 ± 0.325
2.61AspArg: 2.61 ± 0.519
4.069AspSer: 4.069 ± 0.802
2.533AspThr: 2.533 ± 0.443
3.685AspVal: 3.685 ± 0.43
1.152AspTrp: 1.152 ± 0.309
1.766AspTyr: 1.766 ± 0.322
0.0AspXaa: 0.0 ± 0.0
Glu
4.836GluAla: 4.836 ± 0.718
0.921GluCys: 0.921 ± 0.399
3.838GluAsp: 3.838 ± 0.494
2.917GluGlu: 2.917 ± 0.699
3.301GluPhe: 3.301 ± 0.416
3.148GluGly: 3.148 ± 0.442
1.459GluHis: 1.459 ± 0.347
5.681GluIle: 5.681 ± 0.594
5.067GluLys: 5.067 ± 0.705
6.602GluLeu: 6.602 ± 0.522
1.996GluMet: 1.996 ± 0.478
3.301GluAsn: 3.301 ± 0.437
2.303GluPro: 2.303 ± 0.519
3.378GluGln: 3.378 ± 0.484
3.915GluArg: 3.915 ± 0.65
4.453GluSer: 4.453 ± 0.695
3.378GluThr: 3.378 ± 0.489
4.376GluVal: 4.376 ± 0.445
1.919GluTrp: 1.919 ± 0.379
3.224GluTyr: 3.224 ± 0.497
0.0GluXaa: 0.0 ± 0.0
Phe
2.38PheAla: 2.38 ± 0.472
0.384PheCys: 0.384 ± 0.163
3.071PheAsp: 3.071 ± 0.515
2.687PheGlu: 2.687 ± 0.42
1.152PhePhe: 1.152 ± 0.313
3.071PheGly: 3.071 ± 0.531
0.921PheHis: 0.921 ± 0.313
2.687PheIle: 2.687 ± 0.477
3.455PheLys: 3.455 ± 0.404
1.305PheLeu: 1.305 ± 0.33
0.537PheMet: 0.537 ± 0.25
2.533PheAsn: 2.533 ± 0.398
1.152PhePro: 1.152 ± 0.257
1.535PheGln: 1.535 ± 0.323
1.766PheArg: 1.766 ± 0.329
2.687PheSer: 2.687 ± 0.44
1.919PheThr: 1.919 ± 0.43
2.61PheVal: 2.61 ± 0.409
0.768PheTrp: 0.768 ± 0.211
1.535PheTyr: 1.535 ± 0.365
0.0PheXaa: 0.0 ± 0.0
Gly
5.144GlyAla: 5.144 ± 0.746
1.075GlyCys: 1.075 ± 0.275
5.451GlyAsp: 5.451 ± 0.565
5.067GlyGlu: 5.067 ± 0.725
3.608GlyPhe: 3.608 ± 0.442
6.602GlyGly: 6.602 ± 1.0
1.305GlyHis: 1.305 ± 0.354
4.299GlyIle: 4.299 ± 0.595
5.834GlyLys: 5.834 ± 0.753
4.222GlyLeu: 4.222 ± 0.509
2.073GlyMet: 2.073 ± 0.298
3.992GlyAsn: 3.992 ± 0.624
0.844GlyPro: 0.844 ± 0.29
2.533GlyGln: 2.533 ± 0.574
3.992GlyArg: 3.992 ± 0.487
4.76GlySer: 4.76 ± 0.894
3.071GlyThr: 3.071 ± 0.933
6.679GlyVal: 6.679 ± 0.53
1.766GlyTrp: 1.766 ± 0.358
3.378GlyTyr: 3.378 ± 0.444
0.0GlyXaa: 0.0 ± 0.0
His
1.382HisAla: 1.382 ± 0.231
0.307HisCys: 0.307 ± 0.168
1.152HisAsp: 1.152 ± 0.34
1.152HisGlu: 1.152 ± 0.349
1.075HisPhe: 1.075 ± 0.327
1.459HisGly: 1.459 ± 0.413
0.23HisHis: 0.23 ± 0.139
1.382HisIle: 1.382 ± 0.375
1.459HisLys: 1.459 ± 0.379
1.305HisLeu: 1.305 ± 0.337
0.384HisMet: 0.384 ± 0.16
1.152HisAsn: 1.152 ± 0.244
1.228HisPro: 1.228 ± 0.304
0.614HisGln: 0.614 ± 0.244
1.075HisArg: 1.075 ± 0.34
0.921HisSer: 0.921 ± 0.314
0.768HisThr: 0.768 ± 0.23
0.614HisVal: 0.614 ± 0.189
0.537HisTrp: 0.537 ± 0.172
0.307HisTyr: 0.307 ± 0.145
0.0HisXaa: 0.0 ± 0.0
Ile
4.683IleAla: 4.683 ± 0.616
0.691IleCys: 0.691 ± 0.238
5.681IleAsp: 5.681 ± 0.681
6.449IleGlu: 6.449 ± 0.662
1.766IlePhe: 1.766 ± 0.369
5.681IleGly: 5.681 ± 0.538
1.228IleHis: 1.228 ± 0.328
3.148IleIle: 3.148 ± 0.563
5.144IleLys: 5.144 ± 0.597
3.838IleLeu: 3.838 ± 0.511
0.844IleMet: 0.844 ± 0.226
3.838IleAsn: 3.838 ± 0.442
2.764IlePro: 2.764 ± 0.379
2.457IleGln: 2.457 ± 0.446
3.531IleArg: 3.531 ± 0.437
3.608IleSer: 3.608 ± 0.536
3.608IleThr: 3.608 ± 0.498
3.455IleVal: 3.455 ± 0.485
1.075IleTrp: 1.075 ± 0.285
2.38IleTyr: 2.38 ± 0.387
0.0IleXaa: 0.0 ± 0.0
Lys
4.683LysAla: 4.683 ± 0.802
0.768LysCys: 0.768 ± 0.233
2.994LysAsp: 2.994 ± 0.556
5.681LysGlu: 5.681 ± 0.975
2.073LysPhe: 2.073 ± 0.353
5.374LysGly: 5.374 ± 0.648
1.612LysHis: 1.612 ± 0.388
4.299LysIle: 4.299 ± 0.666
5.297LysLys: 5.297 ± 0.822
6.756LysLeu: 6.756 ± 0.875
2.073LysMet: 2.073 ± 0.458
3.301LysAsn: 3.301 ± 0.488
2.533LysPro: 2.533 ± 0.482
2.073LysGln: 2.073 ± 0.401
4.299LysArg: 4.299 ± 0.695
5.144LysSer: 5.144 ± 0.58
3.301LysThr: 3.301 ± 0.448
4.606LysVal: 4.606 ± 0.57
1.382LysTrp: 1.382 ± 0.385
3.301LysTyr: 3.301 ± 0.519
0.0LysXaa: 0.0 ± 0.0
Leu
4.913LeuAla: 4.913 ± 0.78
0.921LeuCys: 0.921 ± 0.304
4.913LeuAsp: 4.913 ± 0.535
4.606LeuGlu: 4.606 ± 0.664
2.61LeuPhe: 2.61 ± 0.439
4.299LeuGly: 4.299 ± 0.615
1.382LeuHis: 1.382 ± 0.424
5.451LeuIle: 5.451 ± 0.772
4.836LeuLys: 4.836 ± 0.566
5.527LeuLeu: 5.527 ± 0.623
2.303LeuMet: 2.303 ± 0.449
4.453LeuAsn: 4.453 ± 0.74
2.84LeuPro: 2.84 ± 0.34
2.687LeuGln: 2.687 ± 0.36
2.073LeuArg: 2.073 ± 0.422
5.988LeuSer: 5.988 ± 0.688
3.608LeuThr: 3.608 ± 0.521
4.836LeuVal: 4.836 ± 0.478
0.768LeuTrp: 0.768 ± 0.201
2.303LeuTyr: 2.303 ± 0.425
0.0LeuXaa: 0.0 ± 0.0
Met
2.303MetAla: 2.303 ± 0.354
0.537MetCys: 0.537 ± 0.194
1.535MetAsp: 1.535 ± 0.376
1.305MetGlu: 1.305 ± 0.337
0.921MetPhe: 0.921 ± 0.27
1.228MetGly: 1.228 ± 0.286
0.384MetHis: 0.384 ± 0.167
1.689MetIle: 1.689 ± 0.411
1.919MetLys: 1.919 ± 0.452
1.459MetLeu: 1.459 ± 0.327
0.844MetMet: 0.844 ± 0.261
1.459MetAsn: 1.459 ± 0.338
1.382MetPro: 1.382 ± 0.379
1.075MetGln: 1.075 ± 0.444
1.689MetArg: 1.689 ± 0.444
2.84MetSer: 2.84 ± 0.486
1.535MetThr: 1.535 ± 0.367
1.305MetVal: 1.305 ± 0.321
0.691MetTrp: 0.691 ± 0.234
0.768MetTyr: 0.768 ± 0.22
0.0MetXaa: 0.0 ± 0.0
Asn
3.378AsnAla: 3.378 ± 0.587
1.305AsnCys: 1.305 ± 0.333
2.84AsnAsp: 2.84 ± 0.542
4.376AsnGlu: 4.376 ± 0.611
2.073AsnPhe: 2.073 ± 0.365
5.911AsnGly: 5.911 ± 0.677
1.075AsnHis: 1.075 ± 0.309
2.687AsnIle: 2.687 ± 0.422
3.378AsnLys: 3.378 ± 0.605
3.992AsnLeu: 3.992 ± 0.681
0.614AsnMet: 0.614 ± 0.228
2.303AsnAsn: 2.303 ± 0.463
2.226AsnPro: 2.226 ± 0.398
1.842AsnGln: 1.842 ± 0.395
1.919AsnArg: 1.919 ± 0.377
3.685AsnSer: 3.685 ± 0.551
2.917AsnThr: 2.917 ± 0.386
2.226AsnVal: 2.226 ± 0.436
0.998AsnTrp: 0.998 ± 0.292
2.303AsnTyr: 2.303 ± 0.378
0.0AsnXaa: 0.0 ± 0.0
Pro
2.457ProAla: 2.457 ± 0.462
0.691ProCys: 0.691 ± 0.235
3.071ProAsp: 3.071 ± 0.392
2.226ProGlu: 2.226 ± 0.45
0.998ProPhe: 0.998 ± 0.268
1.305ProGly: 1.305 ± 0.263
0.307ProHis: 0.307 ± 0.15
2.38ProIle: 2.38 ± 0.598
2.073ProLys: 2.073 ± 0.474
3.148ProLeu: 3.148 ± 0.456
0.921ProMet: 0.921 ± 0.227
2.15ProAsn: 2.15 ± 0.399
0.691ProPro: 0.691 ± 0.282
1.228ProGln: 1.228 ± 0.28
2.073ProArg: 2.073 ± 0.37
2.84ProSer: 2.84 ± 0.64
1.919ProThr: 1.919 ± 0.341
2.61ProVal: 2.61 ± 0.386
0.614ProTrp: 0.614 ± 0.201
0.844ProTyr: 0.844 ± 0.221
0.0ProXaa: 0.0 ± 0.0
Gln
3.685GlnAla: 3.685 ± 0.541
0.461GlnCys: 0.461 ± 0.198
1.382GlnAsp: 1.382 ± 0.28
2.84GlnGlu: 2.84 ± 0.512
1.919GlnPhe: 1.919 ± 0.278
3.071GlnGly: 3.071 ± 0.433
0.614GlnHis: 0.614 ± 0.266
2.15GlnIle: 2.15 ± 0.373
2.15GlnLys: 2.15 ± 0.47
2.917GlnLeu: 2.917 ± 0.469
1.152GlnMet: 1.152 ± 0.357
2.073GlnAsn: 2.073 ± 0.375
1.689GlnPro: 1.689 ± 0.557
3.378GlnGln: 3.378 ± 0.927
1.919GlnArg: 1.919 ± 0.39
3.378GlnSer: 3.378 ± 0.545
1.842GlnThr: 1.842 ± 0.35
2.15GlnVal: 2.15 ± 0.391
0.691GlnTrp: 0.691 ± 0.211
1.842GlnTyr: 1.842 ± 0.403
0.0GlnXaa: 0.0 ± 0.0
Arg
2.917ArgAla: 2.917 ± 0.479
0.461ArgCys: 0.461 ± 0.182
3.531ArgAsp: 3.531 ± 0.44
3.455ArgGlu: 3.455 ± 0.484
1.996ArgPhe: 1.996 ± 0.454
2.533ArgGly: 2.533 ± 0.387
1.075ArgHis: 1.075 ± 0.321
2.38ArgIle: 2.38 ± 0.396
4.069ArgLys: 4.069 ± 0.661
2.764ArgLeu: 2.764 ± 0.455
1.842ArgMet: 1.842 ± 0.362
1.766ArgAsn: 1.766 ± 0.408
1.535ArgPro: 1.535 ± 0.345
2.764ArgGln: 2.764 ± 0.486
2.226ArgArg: 2.226 ± 0.445
2.687ArgSer: 2.687 ± 0.479
1.766ArgThr: 1.766 ± 0.449
4.069ArgVal: 4.069 ± 0.738
0.921ArgTrp: 0.921 ± 0.292
1.535ArgTyr: 1.535 ± 0.365
0.0ArgXaa: 0.0 ± 0.0
Ser
5.22SerAla: 5.22 ± 0.74
0.844SerCys: 0.844 ± 0.239
3.915SerAsp: 3.915 ± 0.437
5.451SerGlu: 5.451 ± 0.661
3.071SerPhe: 3.071 ± 0.502
5.527SerGly: 5.527 ± 0.993
0.691SerHis: 0.691 ± 0.265
4.99SerIle: 4.99 ± 0.606
5.758SerLys: 5.758 ± 0.558
5.067SerLeu: 5.067 ± 0.557
1.612SerMet: 1.612 ± 0.298
2.84SerAsn: 2.84 ± 0.561
1.996SerPro: 1.996 ± 0.299
3.455SerGln: 3.455 ± 0.493
2.687SerArg: 2.687 ± 0.366
4.606SerSer: 4.606 ± 0.751
3.224SerThr: 3.224 ± 0.492
5.144SerVal: 5.144 ± 0.672
0.461SerTrp: 0.461 ± 0.199
2.073SerTyr: 2.073 ± 0.385
0.0SerXaa: 0.0 ± 0.0
Thr
4.299ThrAla: 4.299 ± 0.66
0.614ThrCys: 0.614 ± 0.222
3.301ThrAsp: 3.301 ± 0.508
3.531ThrGlu: 3.531 ± 0.511
2.303ThrPhe: 2.303 ± 0.396
4.069ThrGly: 4.069 ± 0.593
0.844ThrHis: 0.844 ± 0.244
3.455ThrIle: 3.455 ± 0.496
2.994ThrLys: 2.994 ± 0.509
3.685ThrLeu: 3.685 ± 0.528
1.459ThrMet: 1.459 ± 0.365
2.15ThrAsn: 2.15 ± 0.405
2.687ThrPro: 2.687 ± 0.461
1.996ThrGln: 1.996 ± 0.411
2.303ThrArg: 2.303 ± 0.512
2.61ThrSer: 2.61 ± 0.406
2.38ThrThr: 2.38 ± 0.368
3.838ThrVal: 3.838 ± 0.764
0.384ThrTrp: 0.384 ± 0.148
1.382ThrTyr: 1.382 ± 0.312
0.0ThrXaa: 0.0 ± 0.0
Val
3.838ValAla: 3.838 ± 0.657
1.305ValCys: 1.305 ± 0.339
5.451ValAsp: 5.451 ± 0.707
5.067ValGlu: 5.067 ± 0.616
1.766ValPhe: 1.766 ± 0.402
4.683ValGly: 4.683 ± 0.551
1.305ValHis: 1.305 ± 0.3
5.22ValIle: 5.22 ± 0.514
4.453ValLys: 4.453 ± 0.583
2.764ValLeu: 2.764 ± 0.45
1.842ValMet: 1.842 ± 0.319
3.762ValAsn: 3.762 ± 0.619
1.919ValPro: 1.919 ± 0.419
2.61ValGln: 2.61 ± 0.388
1.996ValArg: 1.996 ± 0.345
5.374ValSer: 5.374 ± 0.79
4.683ValThr: 4.683 ± 0.684
4.913ValVal: 4.913 ± 0.579
1.075ValTrp: 1.075 ± 0.315
1.612ValTyr: 1.612 ± 0.399
0.0ValXaa: 0.0 ± 0.0
Trp
1.535TrpAla: 1.535 ± 0.334
0.614TrpCys: 0.614 ± 0.239
1.305TrpAsp: 1.305 ± 0.3
1.152TrpGlu: 1.152 ± 0.262
0.998TrpPhe: 0.998 ± 0.284
1.075TrpGly: 1.075 ± 0.28
0.691TrpHis: 0.691 ± 0.266
1.152TrpIle: 1.152 ± 0.302
0.998TrpLys: 0.998 ± 0.306
1.459TrpLeu: 1.459 ± 0.42
0.384TrpMet: 0.384 ± 0.167
0.768TrpAsn: 0.768 ± 0.253
0.23TrpPro: 0.23 ± 0.129
1.305TrpGln: 1.305 ± 0.282
0.998TrpArg: 0.998 ± 0.212
0.768TrpSer: 0.768 ± 0.254
0.844TrpThr: 0.844 ± 0.234
1.152TrpVal: 1.152 ± 0.274
0.307TrpTrp: 0.307 ± 0.153
0.691TrpTyr: 0.691 ± 0.219
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.38TyrAla: 2.38 ± 0.487
0.844TyrCys: 0.844 ± 0.219
2.687TyrAsp: 2.687 ± 0.432
2.15TyrGlu: 2.15 ± 0.486
0.921TyrPhe: 0.921 ± 0.248
2.457TyrGly: 2.457 ± 0.43
0.921TyrHis: 0.921 ± 0.201
1.842TyrIle: 1.842 ± 0.355
2.303TyrLys: 2.303 ± 0.452
2.533TyrLeu: 2.533 ± 0.388
0.614TyrMet: 0.614 ± 0.183
1.919TyrAsn: 1.919 ± 0.403
1.459TyrPro: 1.459 ± 0.335
1.459TyrGln: 1.459 ± 0.291
1.766TyrArg: 1.766 ± 0.321
2.994TyrSer: 2.994 ± 0.569
1.919TyrThr: 1.919 ± 0.42
2.457TyrVal: 2.457 ± 0.467
0.691TyrTrp: 0.691 ± 0.237
0.998TyrTyr: 0.998 ± 0.215
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 74 proteins (13027 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski