Amino acid dipepetide frequency for Vibrio phage Valm-yong1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.834AlaAla: 7.834 ± 1.301
1.19AlaCys: 1.19 ± 0.359
4.661AlaAsp: 4.661 ± 0.748
6.545AlaGlu: 6.545 ± 0.79
3.57AlaPhe: 3.57 ± 0.664
5.653AlaGly: 5.653 ± 0.875
1.785AlaHis: 1.785 ± 0.415
5.95AlaIle: 5.95 ± 0.908
5.653AlaLys: 5.653 ± 0.734
7.338AlaLeu: 7.338 ± 0.783
3.273AlaMet: 3.273 ± 0.627
3.967AlaAsn: 3.967 ± 0.573
1.587AlaPro: 1.587 ± 0.396
2.777AlaGln: 2.777 ± 0.461
3.074AlaArg: 3.074 ± 0.594
4.066AlaSer: 4.066 ± 0.666
5.752AlaThr: 5.752 ± 0.882
4.76AlaVal: 4.76 ± 0.742
0.992AlaTrp: 0.992 ± 0.292
2.182AlaTyr: 2.182 ± 0.439
0.0AlaXaa: 0.0 ± 0.0
Cys
0.793CysAla: 0.793 ± 0.296
0.198CysCys: 0.198 ± 0.139
0.595CysAsp: 0.595 ± 0.214
0.793CysGlu: 0.793 ± 0.29
0.496CysPhe: 0.496 ± 0.213
0.397CysGly: 0.397 ± 0.357
0.0CysHis: 0.0 ± 0.0
0.298CysIle: 0.298 ± 0.183
0.595CysLys: 0.595 ± 0.265
1.091CysLeu: 1.091 ± 0.269
0.198CysMet: 0.198 ± 0.141
0.099CysAsn: 0.099 ± 0.096
0.397CysPro: 0.397 ± 0.209
0.397CysGln: 0.397 ± 0.154
0.694CysArg: 0.694 ± 0.294
0.595CysSer: 0.595 ± 0.221
0.992CysThr: 0.992 ± 0.263
1.091CysVal: 1.091 ± 0.324
0.099CysTrp: 0.099 ± 0.097
0.496CysTyr: 0.496 ± 0.214
0.0CysXaa: 0.0 ± 0.0
Asp
3.868AspAla: 3.868 ± 0.62
0.793AspCys: 0.793 ± 0.322
3.669AspAsp: 3.669 ± 0.661
5.454AspGlu: 5.454 ± 0.743
2.479AspPhe: 2.479 ± 0.458
3.57AspGly: 3.57 ± 0.704
0.893AspHis: 0.893 ± 0.296
4.165AspIle: 4.165 ± 0.699
2.975AspLys: 2.975 ± 0.578
5.058AspLeu: 5.058 ± 0.588
1.587AspMet: 1.587 ± 0.427
2.578AspAsn: 2.578 ± 0.523
2.281AspPro: 2.281 ± 0.554
1.289AspGln: 1.289 ± 0.304
2.38AspArg: 2.38 ± 0.379
3.074AspSer: 3.074 ± 0.633
2.578AspThr: 2.578 ± 0.413
4.76AspVal: 4.76 ± 0.706
1.091AspTrp: 1.091 ± 0.383
2.578AspTyr: 2.578 ± 0.574
0.0AspXaa: 0.0 ± 0.0
Glu
5.851GluAla: 5.851 ± 0.797
0.595GluCys: 0.595 ± 0.299
3.471GluAsp: 3.471 ± 0.425
4.463GluGlu: 4.463 ± 0.859
3.57GluPhe: 3.57 ± 0.523
3.372GluGly: 3.372 ± 0.72
1.785GluHis: 1.785 ± 0.44
3.273GluIle: 3.273 ± 0.585
3.868GluLys: 3.868 ± 0.699
9.223GluLeu: 9.223 ± 0.92
2.876GluMet: 2.876 ± 0.475
2.777GluAsn: 2.777 ± 0.774
2.777GluPro: 2.777 ± 0.625
4.066GluGln: 4.066 ± 0.63
3.768GluArg: 3.768 ± 0.663
4.066GluSer: 4.066 ± 0.685
2.777GluThr: 2.777 ± 0.651
4.562GluVal: 4.562 ± 0.513
1.19GluTrp: 1.19 ± 0.329
2.38GluTyr: 2.38 ± 0.495
0.0GluXaa: 0.0 ± 0.0
Phe
4.165PheAla: 4.165 ± 0.725
0.893PheCys: 0.893 ± 0.35
2.777PheAsp: 2.777 ± 0.578
2.777PheGlu: 2.777 ± 0.402
1.587PhePhe: 1.587 ± 0.485
2.578PheGly: 2.578 ± 0.564
0.198PheHis: 0.198 ± 0.124
2.083PheIle: 2.083 ± 0.467
2.182PheLys: 2.182 ± 0.469
3.074PheLeu: 3.074 ± 0.698
1.289PheMet: 1.289 ± 0.247
2.182PheAsn: 2.182 ± 0.476
1.488PhePro: 1.488 ± 0.268
1.488PheGln: 1.488 ± 0.312
1.587PheArg: 1.587 ± 0.363
3.273PheSer: 3.273 ± 0.689
2.876PheThr: 2.876 ± 0.618
3.173PheVal: 3.173 ± 0.565
0.694PheTrp: 0.694 ± 0.324
0.992PheTyr: 0.992 ± 0.339
0.0PheXaa: 0.0 ± 0.0
Gly
4.859GlyAla: 4.859 ± 0.964
0.793GlyCys: 0.793 ± 0.268
3.372GlyAsp: 3.372 ± 0.605
4.363GlyGlu: 4.363 ± 0.696
2.876GlyPhe: 2.876 ± 0.519
3.372GlyGly: 3.372 ± 0.596
1.091GlyHis: 1.091 ± 0.317
2.975GlyIle: 2.975 ± 0.598
5.058GlyLys: 5.058 ± 0.83
5.157GlyLeu: 5.157 ± 0.848
1.686GlyMet: 1.686 ± 0.485
4.264GlyAsn: 4.264 ± 0.612
1.289GlyPro: 1.289 ± 0.474
3.471GlyGln: 3.471 ± 0.713
2.876GlyArg: 2.876 ± 0.579
3.669GlySer: 3.669 ± 0.464
3.273GlyThr: 3.273 ± 0.65
4.363GlyVal: 4.363 ± 0.83
1.587GlyTrp: 1.587 ± 0.366
2.281GlyTyr: 2.281 ± 0.448
0.0GlyXaa: 0.0 ± 0.0
His
1.686HisAla: 1.686 ± 0.488
0.0HisCys: 0.0 ± 0.0
0.893HisAsp: 0.893 ± 0.293
0.793HisGlu: 0.793 ± 0.293
0.694HisPhe: 0.694 ± 0.218
1.289HisGly: 1.289 ± 0.351
0.595HisHis: 0.595 ± 0.187
1.289HisIle: 1.289 ± 0.361
1.686HisLys: 1.686 ± 0.467
2.083HisLeu: 2.083 ± 0.403
0.298HisMet: 0.298 ± 0.206
0.595HisAsn: 0.595 ± 0.239
0.496HisPro: 0.496 ± 0.243
0.694HisGln: 0.694 ± 0.224
0.793HisArg: 0.793 ± 0.251
0.793HisSer: 0.793 ± 0.228
0.992HisThr: 0.992 ± 0.286
1.884HisVal: 1.884 ± 0.546
0.694HisTrp: 0.694 ± 0.243
0.496HisTyr: 0.496 ± 0.24
0.0HisXaa: 0.0 ± 0.0
Ile
4.859IleAla: 4.859 ± 0.771
0.397IleCys: 0.397 ± 0.188
3.471IleAsp: 3.471 ± 0.716
5.653IleGlu: 5.653 ± 0.876
1.587IlePhe: 1.587 ± 0.349
2.876IleGly: 2.876 ± 0.386
0.198IleHis: 0.198 ± 0.124
2.678IleIle: 2.678 ± 0.541
3.967IleLys: 3.967 ± 0.631
4.463IleLeu: 4.463 ± 0.526
1.686IleMet: 1.686 ± 0.467
3.074IleAsn: 3.074 ± 0.682
3.273IlePro: 3.273 ± 0.786
2.38IleGln: 2.38 ± 0.475
3.372IleArg: 3.372 ± 0.636
4.859IleSer: 4.859 ± 0.63
3.967IleThr: 3.967 ± 0.746
2.876IleVal: 2.876 ± 0.452
0.397IleTrp: 0.397 ± 0.205
1.983IleTyr: 1.983 ± 0.524
0.0IleXaa: 0.0 ± 0.0
Lys
5.454LysAla: 5.454 ± 0.675
0.397LysCys: 0.397 ± 0.22
3.967LysAsp: 3.967 ± 0.764
4.661LysGlu: 4.661 ± 0.736
1.884LysPhe: 1.884 ± 0.432
4.661LysGly: 4.661 ± 0.732
1.19LysHis: 1.19 ± 0.35
4.165LysIle: 4.165 ± 0.648
4.76LysLys: 4.76 ± 0.824
6.049LysLeu: 6.049 ± 0.667
1.686LysMet: 1.686 ± 0.466
2.876LysAsn: 2.876 ± 0.602
3.074LysPro: 3.074 ± 0.49
2.975LysGln: 2.975 ± 0.378
2.281LysArg: 2.281 ± 0.694
4.264LysSer: 4.264 ± 0.755
3.471LysThr: 3.471 ± 0.547
4.363LysVal: 4.363 ± 0.759
0.496LysTrp: 0.496 ± 0.248
2.479LysTyr: 2.479 ± 0.562
0.0LysXaa: 0.0 ± 0.0
Leu
7.239LeuAla: 7.239 ± 0.775
0.893LeuCys: 0.893 ± 0.37
5.653LeuAsp: 5.653 ± 0.631
6.148LeuGlu: 6.148 ± 0.937
3.868LeuPhe: 3.868 ± 0.631
6.446LeuGly: 6.446 ± 0.991
1.19LeuHis: 1.19 ± 0.394
4.859LeuIle: 4.859 ± 0.808
5.454LeuLys: 5.454 ± 0.703
6.248LeuLeu: 6.248 ± 1.098
2.479LeuMet: 2.479 ± 0.425
5.256LeuAsn: 5.256 ± 0.879
3.57LeuPro: 3.57 ± 0.76
3.768LeuGln: 3.768 ± 0.616
5.256LeuArg: 5.256 ± 0.668
7.041LeuSer: 7.041 ± 1.243
6.049LeuThr: 6.049 ± 0.806
6.347LeuVal: 6.347 ± 0.832
0.793LeuTrp: 0.793 ± 0.186
1.091LeuTyr: 1.091 ± 0.337
0.0LeuXaa: 0.0 ± 0.0
Met
3.471MetAla: 3.471 ± 0.691
0.099MetCys: 0.099 ± 0.082
1.488MetAsp: 1.488 ± 0.247
1.289MetGlu: 1.289 ± 0.309
1.19MetPhe: 1.19 ± 0.33
1.091MetGly: 1.091 ± 0.354
0.496MetHis: 0.496 ± 0.194
1.388MetIle: 1.388 ± 0.396
1.686MetLys: 1.686 ± 0.369
2.678MetLeu: 2.678 ± 0.42
0.595MetMet: 0.595 ± 0.236
1.19MetAsn: 1.19 ± 0.409
1.19MetPro: 1.19 ± 0.35
1.884MetGln: 1.884 ± 0.369
1.19MetArg: 1.19 ± 0.3
3.273MetSer: 3.273 ± 0.586
1.884MetThr: 1.884 ± 0.393
1.587MetVal: 1.587 ± 0.386
0.397MetTrp: 0.397 ± 0.194
0.397MetTyr: 0.397 ± 0.173
0.0MetXaa: 0.0 ± 0.0
Asn
2.876AsnAla: 2.876 ± 0.583
0.595AsnCys: 0.595 ± 0.19
2.38AsnAsp: 2.38 ± 0.535
3.074AsnGlu: 3.074 ± 0.653
1.785AsnPhe: 1.785 ± 0.324
4.363AsnGly: 4.363 ± 0.701
0.893AsnHis: 0.893 ± 0.316
3.273AsnIle: 3.273 ± 0.539
3.173AsnLys: 3.173 ± 0.606
3.868AsnLeu: 3.868 ± 0.905
1.289AsnMet: 1.289 ± 0.296
2.38AsnAsn: 2.38 ± 0.464
2.281AsnPro: 2.281 ± 0.492
1.785AsnGln: 1.785 ± 0.428
2.38AsnArg: 2.38 ± 0.586
2.678AsnSer: 2.678 ± 0.388
3.669AsnThr: 3.669 ± 0.545
3.173AsnVal: 3.173 ± 0.643
0.893AsnTrp: 0.893 ± 0.284
1.587AsnTyr: 1.587 ± 0.355
0.0AsnXaa: 0.0 ± 0.0
Pro
2.876ProAla: 2.876 ± 0.669
0.198ProCys: 0.198 ± 0.143
3.57ProAsp: 3.57 ± 0.666
4.463ProGlu: 4.463 ± 0.838
1.884ProPhe: 1.884 ± 0.466
1.983ProGly: 1.983 ± 0.388
1.289ProHis: 1.289 ± 0.465
2.182ProIle: 2.182 ± 0.395
2.578ProLys: 2.578 ± 0.46
2.678ProLeu: 2.678 ± 0.505
0.595ProMet: 0.595 ± 0.206
1.686ProAsn: 1.686 ± 0.285
1.19ProPro: 1.19 ± 0.313
1.388ProGln: 1.388 ± 0.417
1.488ProArg: 1.488 ± 0.405
1.488ProSer: 1.488 ± 0.322
2.876ProThr: 2.876 ± 0.484
3.273ProVal: 3.273 ± 0.529
0.198ProTrp: 0.198 ± 0.184
0.893ProTyr: 0.893 ± 0.373
0.0ProXaa: 0.0 ± 0.0
Gln
3.57GlnAla: 3.57 ± 0.647
0.198GlnCys: 0.198 ± 0.166
2.083GlnAsp: 2.083 ± 0.569
2.678GlnGlu: 2.678 ± 0.672
1.587GlnPhe: 1.587 ± 0.444
3.173GlnGly: 3.173 ± 0.617
0.992GlnHis: 0.992 ± 0.298
2.578GlnIle: 2.578 ± 0.523
2.578GlnLys: 2.578 ± 0.599
3.967GlnLeu: 3.967 ± 0.674
1.587GlnMet: 1.587 ± 0.461
1.983GlnAsn: 1.983 ± 0.496
1.884GlnPro: 1.884 ± 0.494
2.578GlnGln: 2.578 ± 0.606
2.38GlnArg: 2.38 ± 0.413
2.876GlnSer: 2.876 ± 0.384
1.289GlnThr: 1.289 ± 0.295
2.777GlnVal: 2.777 ± 0.484
1.289GlnTrp: 1.289 ± 0.331
1.289GlnTyr: 1.289 ± 0.403
0.0GlnXaa: 0.0 ± 0.0
Arg
3.669ArgAla: 3.669 ± 0.614
0.992ArgCys: 0.992 ± 0.403
2.479ArgAsp: 2.479 ± 0.562
3.372ArgGlu: 3.372 ± 0.717
2.479ArgPhe: 2.479 ± 0.408
1.587ArgGly: 1.587 ± 0.391
1.19ArgHis: 1.19 ± 0.32
3.669ArgIle: 3.669 ± 0.63
3.669ArgLys: 3.669 ± 0.904
4.661ArgLeu: 4.661 ± 0.693
0.893ArgMet: 0.893 ± 0.34
1.884ArgAsn: 1.884 ± 0.358
1.388ArgPro: 1.388 ± 0.444
2.281ArgGln: 2.281 ± 0.67
2.678ArgArg: 2.678 ± 0.824
2.876ArgSer: 2.876 ± 0.554
2.083ArgThr: 2.083 ± 0.534
2.876ArgVal: 2.876 ± 0.441
0.992ArgTrp: 0.992 ± 0.358
1.686ArgTyr: 1.686 ± 0.405
0.0ArgXaa: 0.0 ± 0.0
Ser
5.553SerAla: 5.553 ± 0.804
0.496SerCys: 0.496 ± 0.228
3.768SerAsp: 3.768 ± 0.753
4.363SerGlu: 4.363 ± 0.63
2.083SerPhe: 2.083 ± 0.508
4.363SerGly: 4.363 ± 0.724
0.595SerHis: 0.595 ± 0.225
4.463SerIle: 4.463 ± 0.834
5.553SerLys: 5.553 ± 0.793
4.859SerLeu: 4.859 ± 0.777
0.992SerMet: 0.992 ± 0.391
2.38SerAsn: 2.38 ± 0.546
2.876SerPro: 2.876 ± 0.495
1.388SerGln: 1.388 ± 0.398
3.868SerArg: 3.868 ± 0.78
3.768SerSer: 3.768 ± 0.645
4.066SerThr: 4.066 ± 0.728
4.958SerVal: 4.958 ± 0.823
0.992SerTrp: 0.992 ± 0.328
2.182SerTyr: 2.182 ± 0.598
0.0SerXaa: 0.0 ± 0.0
Thr
4.562ThrAla: 4.562 ± 0.559
0.099ThrCys: 0.099 ± 0.103
3.372ThrAsp: 3.372 ± 0.674
2.975ThrGlu: 2.975 ± 0.481
1.884ThrPhe: 1.884 ± 0.389
4.661ThrGly: 4.661 ± 0.799
1.785ThrHis: 1.785 ± 0.467
3.173ThrIle: 3.173 ± 0.493
3.074ThrLys: 3.074 ± 0.598
5.256ThrLeu: 5.256 ± 0.669
2.975ThrMet: 2.975 ± 0.607
2.777ThrAsn: 2.777 ± 0.508
2.975ThrPro: 2.975 ± 0.533
2.975ThrGln: 2.975 ± 0.739
2.281ThrArg: 2.281 ± 0.432
4.463ThrSer: 4.463 ± 1.002
3.967ThrThr: 3.967 ± 0.559
2.578ThrVal: 2.578 ± 0.463
1.488ThrTrp: 1.488 ± 0.399
2.281ThrTyr: 2.281 ± 0.369
0.0ThrXaa: 0.0 ± 0.0
Val
5.95ValAla: 5.95 ± 0.938
0.793ValCys: 0.793 ± 0.28
3.868ValAsp: 3.868 ± 0.566
4.76ValGlu: 4.76 ± 0.726
3.57ValPhe: 3.57 ± 0.812
4.363ValGly: 4.363 ± 0.787
1.091ValHis: 1.091 ± 0.335
2.678ValIle: 2.678 ± 0.538
3.57ValLys: 3.57 ± 0.665
6.545ValLeu: 6.545 ± 0.757
1.587ValMet: 1.587 ± 0.423
3.967ValAsn: 3.967 ± 0.7
3.074ValPro: 3.074 ± 0.481
2.38ValGln: 2.38 ± 0.44
2.479ValArg: 2.479 ± 0.521
3.669ValSer: 3.669 ± 0.788
4.958ValThr: 4.958 ± 0.703
4.562ValVal: 4.562 ± 0.667
1.19ValTrp: 1.19 ± 0.329
1.587ValTyr: 1.587 ± 0.278
0.0ValXaa: 0.0 ± 0.0
Trp
1.488TrpAla: 1.488 ± 0.449
0.198TrpCys: 0.198 ± 0.141
0.595TrpAsp: 0.595 ± 0.178
1.091TrpGlu: 1.091 ± 0.341
0.496TrpPhe: 0.496 ± 0.237
0.793TrpGly: 0.793 ± 0.25
0.694TrpHis: 0.694 ± 0.226
0.595TrpIle: 0.595 ± 0.229
0.893TrpLys: 0.893 ± 0.284
1.884TrpLeu: 1.884 ± 0.561
0.397TrpMet: 0.397 ± 0.192
0.992TrpAsn: 0.992 ± 0.321
0.694TrpPro: 0.694 ± 0.286
1.091TrpGln: 1.091 ± 0.311
1.388TrpArg: 1.388 ± 0.365
1.091TrpSer: 1.091 ± 0.293
0.793TrpThr: 0.793 ± 0.183
1.091TrpVal: 1.091 ± 0.245
0.595TrpTrp: 0.595 ± 0.241
0.298TrpTyr: 0.298 ± 0.164
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.983TyrAla: 1.983 ± 0.443
0.496TyrCys: 0.496 ± 0.256
0.992TyrAsp: 0.992 ± 0.271
0.992TyrGlu: 0.992 ± 0.196
1.785TyrPhe: 1.785 ± 0.616
2.083TyrGly: 2.083 ± 0.496
0.793TyrHis: 0.793 ± 0.243
2.281TyrIle: 2.281 ± 0.542
2.182TyrLys: 2.182 ± 0.455
3.471TyrLeu: 3.471 ± 0.767
0.397TyrMet: 0.397 ± 0.205
1.488TyrAsn: 1.488 ± 0.702
0.992TyrPro: 0.992 ± 0.249
2.281TyrGln: 2.281 ± 0.563
1.19TyrArg: 1.19 ± 0.325
1.785TyrSer: 1.785 ± 0.431
1.289TyrThr: 1.289 ± 0.377
1.587TyrVal: 1.587 ± 0.268
1.091TyrTrp: 1.091 ± 0.286
1.19TyrTyr: 1.19 ± 0.368
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 48 proteins (10085 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski