Amino acid dipeptide frequency for Aeromonas phage 4_L372X

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those that are underrepresented are marked in blue in the table.
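A minimal sketch of how such frequencies could be computed is given here. It assumes overlapping residue pairs counted within each protein and normalized by the total number of pairs; the database's exact counting and normalization are not stated on this page. The names `dipeptide_per_mille` and `classify` are hypothetical, and the 2.5‰ threshold simply restates the uniform expectation above.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
UNIFORM_EXPECTATION = 1000.0 / 400    # 0.25% expressed in per mille (2.5)


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: pairs overlap and are counted inside each protein only,
    never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        # Anything outside the 20 standard residues is mapped to X (Xaa).
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = AMINO_ACIDS + "X"
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(alphabet, repeat=2)
    }


def classify(freq_per_mille, expected=UNIFORM_EXPECTATION):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"
```

With the values in the table, `classify(10.227)` (AlaAla) gives "over" and `classify(0.224)` (CysCys) gives "under".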

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
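As a worked conversion, take the AlaAla entry from the table (10.227‰) and compare it with the uniform expectation of one in 400:

```latex
\[
10.227\,\text{\textperthousand} = 10.227 \times 10^{-3} = 0.010227 \approx 1.02\%,
\qquad
\frac{1}{400} = 0.0025 = 2.5\,\text{\textperthousand}
\]
```

AlaAla is therefore roughly four times more frequent than a uniform distribution would predict.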

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.227 ± 1.721
AlaCys: 1.493 ± 0.367
AlaAsp: 5.599 ± 0.682
AlaGlu: 6.868 ± 0.806
AlaPhe: 2.538 ± 0.459
AlaGly: 6.121 ± 0.831
AlaHis: 1.418 ± 0.315
AlaIle: 7.838 ± 0.973
AlaLys: 6.942 ± 0.943
AlaLeu: 6.793 ± 0.765
AlaMet: 3.434 ± 0.514
AlaAsn: 4.404 ± 0.707
AlaPro: 1.941 ± 0.41
AlaGln: 3.732 ± 0.617
AlaArg: 4.852 ± 0.519
AlaSer: 7.166 ± 1.041
AlaThr: 5.897 ± 0.753
AlaVal: 5.3 ± 0.71
AlaTrp: 1.045 ± 0.304
AlaTyr: 3.583 ± 0.571
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.97 ± 0.258
CysCys: 0.224 ± 0.137
CysAsp: 1.045 ± 0.277
CysGlu: 1.194 ± 0.361
CysPhe: 0.373 ± 0.166
CysGly: 1.269 ± 0.343
CysHis: 0.448 ± 0.208
CysIle: 1.12 ± 0.263
CysLys: 1.269 ± 0.374
CysLeu: 0.672 ± 0.262
CysMet: 0.373 ± 0.161
CysAsn: 0.373 ± 0.191
CysPro: 0.448 ± 0.185
CysGln: 0.523 ± 0.202
CysArg: 0.672 ± 0.196
CysSer: 0.97 ± 0.255
CysThr: 0.597 ± 0.225
CysVal: 0.597 ± 0.268
CysTrp: 0.448 ± 0.175
CysTyr: 0.597 ± 0.212
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.271 ± 0.716
AspCys: 1.045 ± 0.261
AspAsp: 3.882 ± 0.682
AspGlu: 5.225 ± 0.78
AspPhe: 2.911 ± 0.498
AspGly: 5.599 ± 0.669
AspHis: 0.97 ± 0.233
AspIle: 2.762 ± 0.466
AspLys: 3.135 ± 0.442
AspLeu: 5.001 ± 0.618
AspMet: 1.717 ± 0.304
AspAsn: 1.642 ± 0.334
AspPro: 1.493 ± 0.309
AspGln: 1.568 ± 0.434
AspArg: 2.314 ± 0.401
AspSer: 3.807 ± 0.558
AspThr: 2.911 ± 0.499
AspVal: 3.732 ± 0.534
AspTrp: 1.045 ± 0.308
AspTyr: 1.717 ± 0.355
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.569 ± 0.806
GluCys: 1.269 ± 0.291
GluAsp: 2.837 ± 0.476
GluGlu: 3.061 ± 0.508
GluPhe: 3.509 ± 0.457
GluGly: 3.434 ± 0.452
GluHis: 0.97 ± 0.317
GluIle: 4.852 ± 0.577
GluLys: 3.658 ± 0.535
GluLeu: 7.017 ± 0.741
GluMet: 3.658 ± 0.581
GluAsn: 2.762 ± 0.451
GluPro: 1.941 ± 0.472
GluGln: 3.21 ± 0.671
GluArg: 4.927 ± 0.592
GluSer: 5.673 ± 0.575
GluThr: 1.717 ± 0.339
GluVal: 5.3 ± 0.621
GluTrp: 1.194 ± 0.321
GluTyr: 3.061 ± 0.55
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.538 ± 0.46
PheCys: 0.821 ± 0.265
PheAsp: 2.538 ± 0.454
PheGlu: 2.463 ± 0.476
PhePhe: 1.045 ± 0.294
PheGly: 3.434 ± 0.464
PheHis: 0.597 ± 0.229
PheIle: 2.687 ± 0.356
PheLys: 2.463 ± 0.454
PheLeu: 1.194 ± 0.256
PheMet: 1.12 ± 0.258
PheAsn: 1.941 ± 0.479
PhePro: 0.821 ± 0.253
PheGln: 1.269 ± 0.34
PheArg: 1.269 ± 0.331
PheSer: 2.911 ± 0.507
PheThr: 2.538 ± 0.505
PheVal: 2.016 ± 0.4
PheTrp: 0.299 ± 0.12
PheTyr: 1.418 ± 0.289
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.823 ± 0.824
GlyCys: 0.97 ± 0.255
GlyAsp: 4.18 ± 0.468
GlyGlu: 5.449 ± 0.558
GlyPhe: 2.986 ± 0.582
GlyGly: 6.718 ± 1.033
GlyHis: 1.12 ± 0.341
GlyIle: 4.554 ± 0.47
GlyLys: 4.778 ± 0.582
GlyLeu: 4.927 ± 0.551
GlyMet: 2.538 ± 0.417
GlyAsn: 2.911 ± 0.596
GlyPro: 0.97 ± 0.294
GlyGln: 2.538 ± 0.346
GlyArg: 3.509 ± 0.579
GlySer: 5.076 ± 0.565
GlyThr: 3.135 ± 0.505
GlyVal: 5.823 ± 0.643
GlyTrp: 1.12 ± 0.255
GlyTyr: 2.463 ± 0.392
GlyXaa: 0.075 ± 0.073
His
HisAla: 2.165 ± 0.456
HisCys: 0.149 ± 0.099
HisAsp: 1.194 ± 0.291
HisGlu: 1.194 ± 0.309
HisPhe: 0.299 ± 0.15
HisGly: 1.194 ± 0.394
HisHis: 0.97 ± 0.326
HisIle: 0.97 ± 0.332
HisLys: 1.344 ± 0.336
HisLeu: 1.194 ± 0.36
HisMet: 0.373 ± 0.156
HisAsn: 0.821 ± 0.219
HisPro: 0.672 ± 0.278
HisGln: 0.746 ± 0.224
HisArg: 0.821 ± 0.226
HisSer: 0.672 ± 0.22
HisThr: 0.821 ± 0.268
HisVal: 0.672 ± 0.223
HisTrp: 0.373 ± 0.164
HisTyr: 0.597 ± 0.208
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.569 ± 0.816
IleCys: 0.597 ± 0.187
IleAsp: 4.927 ± 0.504
IleGlu: 5.449 ± 0.544
IlePhe: 1.269 ± 0.312
IleGly: 4.628 ± 0.547
IleHis: 1.344 ± 0.305
IleIle: 3.658 ± 0.549
IleLys: 4.554 ± 0.579
IleLeu: 3.434 ± 0.422
IleMet: 1.568 ± 0.31
IleAsn: 3.135 ± 0.461
IlePro: 2.239 ± 0.328
IleGln: 1.717 ± 0.347
IleArg: 2.837 ± 0.488
IleSer: 3.807 ± 0.488
IleThr: 3.135 ± 0.518
IleVal: 4.255 ± 0.703
IleTrp: 0.672 ± 0.249
IleTyr: 1.418 ± 0.252
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.778 ± 0.676
LysCys: 0.821 ± 0.233
LysAsp: 2.687 ± 0.457
LysGlu: 4.554 ± 0.631
LysPhe: 1.792 ± 0.361
LysGly: 3.583 ± 0.42
LysHis: 0.597 ± 0.255
LysIle: 3.583 ± 0.59
LysLys: 3.882 ± 0.505
LysLeu: 6.196 ± 0.701
LysMet: 2.389 ± 0.396
LysAsn: 2.687 ± 0.329
LysPro: 3.135 ± 0.468
LysGln: 3.658 ± 0.625
LysArg: 4.404 ± 0.758
LysSer: 3.658 ± 0.612
LysThr: 3.21 ± 0.486
LysVal: 4.628 ± 0.551
LysTrp: 1.045 ± 0.275
LysTyr: 2.09 ± 0.417
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.211 ± 0.868
LeuCys: 0.597 ± 0.208
LeuAsp: 4.255 ± 0.44
LeuGlu: 5.673 ± 0.758
LeuPhe: 2.314 ± 0.398
LeuGly: 4.703 ± 0.61
LeuHis: 1.493 ± 0.336
LeuIle: 3.882 ± 0.458
LeuLys: 4.554 ± 0.763
LeuLeu: 4.628 ± 0.601
LeuMet: 2.314 ± 0.489
LeuAsn: 2.837 ± 0.479
LeuPro: 2.538 ± 0.402
LeuGln: 2.687 ± 0.5
LeuArg: 3.21 ± 0.472
LeuSer: 5.748 ± 0.685
LeuThr: 4.33 ± 0.47
LeuVal: 4.479 ± 0.584
LeuTrp: 0.746 ± 0.18
LeuTyr: 2.09 ± 0.396
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.658 ± 0.477
MetCys: 0.448 ± 0.148
MetAsp: 1.866 ± 0.341
MetGlu: 2.016 ± 0.527
MetPhe: 1.045 ± 0.367
MetGly: 1.493 ± 0.353
MetHis: 0.448 ± 0.217
MetIle: 1.493 ± 0.27
MetLys: 2.762 ± 0.471
MetLeu: 2.09 ± 0.445
MetMet: 0.672 ± 0.199
MetAsn: 1.344 ± 0.333
MetPro: 1.642 ± 0.42
MetGln: 1.493 ± 0.385
MetArg: 1.717 ± 0.355
MetSer: 2.837 ± 0.523
MetThr: 2.837 ± 0.511
MetVal: 1.568 ± 0.354
MetTrp: 0.523 ± 0.176
MetTyr: 0.97 ± 0.243
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.479 ± 0.781
AsnCys: 0.597 ± 0.194
AsnAsp: 1.493 ± 0.355
AsnGlu: 2.389 ± 0.427
AsnPhe: 1.493 ± 0.298
AsnGly: 4.404 ± 0.65
AsnHis: 0.896 ± 0.227
AsnIle: 2.165 ± 0.393
AsnLys: 2.911 ± 0.45
AsnLeu: 2.986 ± 0.452
AsnMet: 0.672 ± 0.212
AsnAsn: 1.866 ± 0.396
AsnPro: 2.09 ± 0.419
AsnGln: 2.016 ± 0.479
AsnArg: 1.493 ± 0.327
AsnSer: 2.837 ± 0.447
AsnThr: 1.866 ± 0.382
AsnVal: 2.463 ± 0.442
AsnTrp: 0.597 ± 0.228
AsnTyr: 1.344 ± 0.255
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.061 ± 0.49
ProCys: 0.746 ± 0.241
ProAsp: 2.165 ± 0.471
ProGlu: 3.583 ± 0.468
ProPhe: 1.194 ± 0.306
ProGly: 1.194 ± 0.347
ProHis: 0.224 ± 0.118
ProIle: 1.045 ± 0.285
ProLys: 1.344 ± 0.32
ProLeu: 2.016 ± 0.391
ProMet: 1.642 ± 0.332
ProAsn: 1.344 ± 0.265
ProPro: 1.194 ± 0.239
ProGln: 1.344 ± 0.451
ProArg: 1.792 ± 0.353
ProSer: 2.538 ± 0.561
ProThr: 2.239 ± 0.362
ProVal: 2.762 ± 0.492
ProTrp: 0.224 ± 0.112
ProTyr: 0.896 ± 0.279
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.404 ± 0.623
GlnCys: 0.746 ± 0.228
GlnAsp: 1.269 ± 0.308
GlnGlu: 2.837 ± 0.505
GlnPhe: 1.717 ± 0.376
GlnGly: 2.687 ± 0.43
GlnHis: 0.896 ± 0.279
GlnIle: 1.493 ± 0.343
GlnLys: 2.538 ± 0.594
GlnLeu: 3.21 ± 0.596
GlnMet: 0.896 ± 0.239
GlnAsn: 1.493 ± 0.299
GlnPro: 1.866 ± 0.421
GlnGln: 2.538 ± 0.92
GlnArg: 2.613 ± 0.621
GlnSer: 2.762 ± 0.552
GlnThr: 1.941 ± 0.369
GlnVal: 2.762 ± 0.427
GlnTrp: 1.045 ± 0.288
GlnTyr: 1.194 ± 0.31
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.076 ± 0.643
ArgCys: 0.597 ± 0.202
ArgAsp: 3.434 ± 0.655
ArgGlu: 3.061 ± 0.512
ArgPhe: 2.314 ± 0.443
ArgGly: 2.837 ± 0.358
ArgHis: 1.194 ± 0.384
ArgIle: 3.732 ± 0.558
ArgLys: 3.21 ± 0.51
ArgLeu: 4.106 ± 0.508
ArgMet: 1.792 ± 0.454
ArgAsn: 1.941 ± 0.366
ArgPro: 2.09 ± 0.35
ArgGln: 2.239 ± 0.452
ArgArg: 3.509 ± 0.65
ArgSer: 2.687 ± 0.462
ArgThr: 2.538 ± 0.379
ArgVal: 3.359 ± 0.565
ArgTrp: 0.821 ± 0.229
ArgTyr: 2.165 ± 0.47
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.42 ± 0.981
SerCys: 0.672 ± 0.207
SerAsp: 4.33 ± 0.515
SerGlu: 4.031 ± 0.524
SerPhe: 2.613 ± 0.713
SerGly: 6.494 ± 0.716
SerHis: 0.97 ± 0.277
SerIle: 5.225 ± 0.734
SerLys: 3.807 ± 0.523
SerLeu: 4.852 ± 0.599
SerMet: 2.239 ± 0.364
SerAsn: 2.389 ± 0.438
SerPro: 2.239 ± 0.428
SerGln: 3.135 ± 0.473
SerArg: 3.509 ± 0.427
SerSer: 3.732 ± 0.586
SerThr: 3.583 ± 0.646
SerVal: 4.18 ± 0.431
SerTrp: 0.896 ± 0.274
SerTyr: 2.389 ± 0.416
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.225 ± 0.641
ThrCys: 0.523 ± 0.174
ThrAsp: 3.285 ± 0.496
ThrGlu: 3.135 ± 0.491
ThrPhe: 2.762 ± 0.478
ThrGly: 4.106 ± 0.67
ThrHis: 0.523 ± 0.2
ThrIle: 3.434 ± 0.592
ThrLys: 3.434 ± 0.483
ThrLeu: 3.21 ± 0.509
ThrMet: 1.045 ± 0.303
ThrAsn: 2.165 ± 0.353
ThrPro: 2.389 ± 0.367
ThrGln: 1.866 ± 0.353
ThrArg: 3.061 ± 0.383
ThrSer: 3.583 ± 0.462
ThrThr: 2.538 ± 0.476
ThrVal: 3.956 ± 0.453
ThrTrp: 0.746 ± 0.26
ThrTyr: 2.239 ± 0.623
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.569 ± 0.659
ValCys: 1.12 ± 0.291
ValAsp: 4.554 ± 0.559
ValGlu: 5.225 ± 0.913
ValPhe: 1.941 ± 0.382
ValGly: 5.001 ± 0.595
ValHis: 0.821 ± 0.222
ValIle: 4.031 ± 0.817
ValLys: 4.479 ± 0.61
ValLeu: 2.911 ± 0.411
ValMet: 3.21 ± 0.448
ValAsn: 3.135 ± 0.528
ValPro: 1.568 ± 0.315
ValGln: 2.165 ± 0.39
ValArg: 3.135 ± 0.673
ValSer: 4.031 ± 0.585
ValThr: 4.703 ± 0.535
ValVal: 6.718 ± 0.762
ValTrp: 0.672 ± 0.233
ValTyr: 2.538 ± 0.458
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.045 ± 0.208
TrpCys: 0.224 ± 0.112
TrpAsp: 0.672 ± 0.225
TrpGlu: 0.672 ± 0.207
TrpPhe: 0.746 ± 0.27
TrpGly: 0.597 ± 0.211
TrpHis: 0.523 ± 0.231
TrpIle: 0.896 ± 0.262
TrpLys: 0.746 ± 0.236
TrpLeu: 2.314 ± 0.414
TrpMet: 0.448 ± 0.179
TrpAsn: 0.373 ± 0.131
TrpPro: 0.224 ± 0.12
TrpGln: 0.149 ± 0.098
TrpArg: 1.194 ± 0.29
TrpSer: 0.523 ± 0.211
TrpThr: 0.746 ± 0.258
TrpVal: 1.792 ± 0.382
TrpTrp: 0.523 ± 0.171
TrpTyr: 0.448 ± 0.18
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.135 ± 0.522
TyrCys: 0.523 ± 0.173
TyrAsp: 2.538 ± 0.438
TyrGlu: 2.538 ± 0.364
TyrPhe: 0.523 ± 0.192
TyrGly: 2.239 ± 0.425
TyrHis: 0.821 ± 0.29
TyrIle: 1.792 ± 0.309
TyrLys: 1.568 ± 0.356
TyrLeu: 2.538 ± 0.516
TyrMet: 0.746 ± 0.196
TyrAsn: 1.493 ± 0.266
TyrPro: 1.045 ± 0.295
TyrGln: 2.314 ± 0.408
TyrArg: 1.866 ± 0.523
TyrSer: 2.687 ± 0.509
TyrThr: 1.941 ± 0.378
TyrVal: 2.09 ± 0.471
TyrTrp: 0.821 ± 0.264
TyrTyr: 1.344 ± 0.309
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.075 ± 0.073
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 82 proteins (13397 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
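Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is taken as the error. This is only an illustration under stated assumptions: the function names are hypothetical, the standard deviation is assumed as the error measure, and normalizing by the number of overlapping pairs is assumed as well. The database's actual procedure may differ.

```python
import random
import statistics


def pair_frequency_per_mille(proteins, pair):
    """Frequency of one dipeptide (overlapping pairs) in per mille, or None."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else None


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        freq = pair_frequency_per_mille(sample, pair)
        if freq is not None:
            estimates.append(freq)
    return statistics.stdev(estimates) if len(estimates) > 1 else 0.0
```

Resampling proteins rather than individual residues keeps each sequence intact, so a dipeptide concentrated in a few proteins yields a larger error than one spread evenly across the proteome.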

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski