Amino acid dipepetide frequency for Proteus phage vB_PmiP_RS1pmA

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.074AlaAla: 5.074 ± 0.819
0.546AlaCys: 0.546 ± 0.261
4.528AlaAsp: 4.528 ± 0.633
4.215AlaGlu: 4.215 ± 0.511
3.044AlaPhe: 3.044 ± 0.526
6.948AlaGly: 6.948 ± 0.735
1.249AlaHis: 1.249 ± 0.277
2.888AlaIle: 2.888 ± 0.574
3.903AlaLys: 3.903 ± 0.514
7.416AlaLeu: 7.416 ± 1.006
2.108AlaMet: 2.108 ± 0.406
2.732AlaAsn: 2.732 ± 0.436
3.357AlaPro: 3.357 ± 0.915
3.669AlaGln: 3.669 ± 0.499
4.137AlaArg: 4.137 ± 0.798
4.996AlaSer: 4.996 ± 0.563
3.123AlaThr: 3.123 ± 0.529
5.699AlaVal: 5.699 ± 0.735
0.546AlaTrp: 0.546 ± 0.218
2.576AlaTyr: 2.576 ± 0.366
0.0AlaXaa: 0.0 ± 0.0
Cys
0.312CysAla: 0.312 ± 0.155
0.234CysCys: 0.234 ± 0.133
0.625CysAsp: 0.625 ± 0.266
0.234CysGlu: 0.234 ± 0.142
0.234CysPhe: 0.234 ± 0.14
0.468CysGly: 0.468 ± 0.192
0.468CysHis: 0.468 ± 0.228
1.093CysIle: 1.093 ± 0.311
0.468CysLys: 0.468 ± 0.175
1.015CysLeu: 1.015 ± 0.357
0.859CysMet: 0.859 ± 0.307
0.312CysAsn: 0.312 ± 0.179
0.468CysPro: 0.468 ± 0.191
0.234CysGln: 0.234 ± 0.138
0.234CysArg: 0.234 ± 0.17
0.859CysSer: 0.859 ± 0.301
1.639CysThr: 1.639 ± 0.346
0.703CysVal: 0.703 ± 0.284
0.234CysTrp: 0.234 ± 0.172
0.625CysTyr: 0.625 ± 0.265
0.0CysXaa: 0.0 ± 0.0
Asp
5.386AspAla: 5.386 ± 0.811
0.781AspCys: 0.781 ± 0.276
2.888AspAsp: 2.888 ± 0.462
3.435AspGlu: 3.435 ± 0.527
2.108AspPhe: 2.108 ± 0.428
3.903AspGly: 3.903 ± 0.582
0.39AspHis: 0.39 ± 0.133
3.357AspIle: 3.357 ± 0.402
3.435AspLys: 3.435 ± 0.419
5.699AspLeu: 5.699 ± 0.445
1.405AspMet: 1.405 ± 0.451
1.717AspAsn: 1.717 ± 0.442
2.42AspPro: 2.42 ± 0.39
1.015AspGln: 1.015 ± 0.293
1.327AspArg: 1.327 ± 0.33
4.684AspSer: 4.684 ± 0.613
6.323AspThr: 6.323 ± 0.667
3.044AspVal: 3.044 ± 0.587
0.703AspTrp: 0.703 ± 0.207
2.498AspTyr: 2.498 ± 0.469
0.0AspXaa: 0.0 ± 0.0
Glu
5.543GluAla: 5.543 ± 0.805
0.859GluCys: 0.859 ± 0.274
2.732GluAsp: 2.732 ± 0.446
2.81GluGlu: 2.81 ± 0.61
3.044GluPhe: 3.044 ± 0.566
3.747GluGly: 3.747 ± 0.478
1.874GluHis: 1.874 ± 0.456
2.888GluIle: 2.888 ± 0.39
3.669GluLys: 3.669 ± 0.625
6.167GluLeu: 6.167 ± 0.671
1.952GluMet: 1.952 ± 0.284
1.717GluAsn: 1.717 ± 0.318
1.405GluPro: 1.405 ± 0.339
4.215GluGln: 4.215 ± 0.722
3.669GluArg: 3.669 ± 0.555
3.513GluSer: 3.513 ± 0.495
2.732GluThr: 2.732 ± 0.567
4.215GluVal: 4.215 ± 0.548
0.937GluTrp: 0.937 ± 0.295
3.279GluTyr: 3.279 ± 0.369
0.0GluXaa: 0.0 ± 0.0
Phe
1.639PheAla: 1.639 ± 0.395
0.39PheCys: 0.39 ± 0.175
2.264PheAsp: 2.264 ± 0.39
1.561PheGlu: 1.561 ± 0.387
1.327PhePhe: 1.327 ± 0.339
2.732PheGly: 2.732 ± 0.424
0.546PheHis: 0.546 ± 0.216
2.186PheIle: 2.186 ± 0.385
3.279PheLys: 3.279 ± 0.492
2.732PheLeu: 2.732 ± 0.418
0.781PheMet: 0.781 ± 0.204
2.498PheAsn: 2.498 ± 0.387
1.327PhePro: 1.327 ± 0.439
1.327PheGln: 1.327 ± 0.324
1.405PheArg: 1.405 ± 0.351
2.966PheSer: 2.966 ± 0.487
2.342PheThr: 2.342 ± 0.377
1.483PheVal: 1.483 ± 0.348
0.39PheTrp: 0.39 ± 0.201
1.093PheTyr: 1.093 ± 0.242
0.0PheXaa: 0.0 ± 0.0
Gly
5.699GlyAla: 5.699 ± 0.609
1.171GlyCys: 1.171 ± 0.357
4.137GlyAsp: 4.137 ± 0.544
3.435GlyGlu: 3.435 ± 0.566
2.42GlyPhe: 2.42 ± 0.356
3.669GlyGly: 3.669 ± 0.626
0.781GlyHis: 0.781 ± 0.276
5.074GlyIle: 5.074 ± 0.678
3.513GlyLys: 3.513 ± 0.506
5.777GlyLeu: 5.777 ± 0.801
2.81GlyMet: 2.81 ± 0.484
2.888GlyAsn: 2.888 ± 0.391
0.078GlyPro: 0.078 ± 0.067
1.874GlyGln: 1.874 ± 0.343
3.044GlyArg: 3.044 ± 0.42
6.167GlySer: 6.167 ± 0.795
6.948GlyThr: 6.948 ± 0.695
4.996GlyVal: 4.996 ± 0.567
0.859GlyTrp: 0.859 ± 0.213
3.123GlyTyr: 3.123 ± 0.437
0.0GlyXaa: 0.0 ± 0.0
His
0.937HisAla: 0.937 ± 0.209
0.312HisCys: 0.312 ± 0.153
1.171HisAsp: 1.171 ± 0.273
1.171HisGlu: 1.171 ± 0.294
0.859HisPhe: 0.859 ± 0.25
1.249HisGly: 1.249 ± 0.403
0.234HisHis: 0.234 ± 0.132
0.625HisIle: 0.625 ± 0.143
1.795HisLys: 1.795 ± 0.436
2.42HisLeu: 2.42 ± 0.311
0.39HisMet: 0.39 ± 0.173
0.937HisAsn: 0.937 ± 0.294
0.703HisPro: 0.703 ± 0.238
0.859HisGln: 0.859 ± 0.264
1.093HisArg: 1.093 ± 0.257
1.171HisSer: 1.171 ± 0.29
1.249HisThr: 1.249 ± 0.312
0.781HisVal: 0.781 ± 0.252
0.156HisTrp: 0.156 ± 0.116
0.859HisTyr: 0.859 ± 0.2
0.0HisXaa: 0.0 ± 0.0
Ile
3.201IleAla: 3.201 ± 0.472
0.312IleCys: 0.312 ± 0.161
4.294IleAsp: 4.294 ± 0.526
2.03IleGlu: 2.03 ± 0.361
0.937IlePhe: 0.937 ± 0.261
3.357IleGly: 3.357 ± 0.677
1.171IleHis: 1.171 ± 0.359
2.966IleIle: 2.966 ± 0.679
4.918IleLys: 4.918 ± 0.622
5.074IleLeu: 5.074 ± 0.597
1.249IleMet: 1.249 ± 0.271
3.279IleAsn: 3.279 ± 0.616
2.42IlePro: 2.42 ± 0.4
3.513IleGln: 3.513 ± 0.597
2.732IleArg: 2.732 ± 0.554
2.888IleSer: 2.888 ± 0.48
4.45IleThr: 4.45 ± 0.64
3.279IleVal: 3.279 ± 0.435
0.39IleTrp: 0.39 ± 0.178
0.859IleTyr: 0.859 ± 0.309
0.0IleXaa: 0.0 ± 0.0
Lys
5.152LysAla: 5.152 ± 0.746
0.546LysCys: 0.546 ± 0.237
4.45LysAsp: 4.45 ± 0.562
5.699LysGlu: 5.699 ± 0.762
2.576LysPhe: 2.576 ± 0.522
4.059LysGly: 4.059 ± 0.469
1.171LysHis: 1.171 ± 0.335
2.732LysIle: 2.732 ± 0.365
2.108LysLys: 2.108 ± 0.512
5.543LysLeu: 5.543 ± 0.75
1.171LysMet: 1.171 ± 0.286
2.264LysAsn: 2.264 ± 0.398
2.966LysPro: 2.966 ± 0.47
3.357LysGln: 3.357 ± 0.47
2.966LysArg: 2.966 ± 0.513
3.903LysSer: 3.903 ± 0.519
2.342LysThr: 2.342 ± 0.431
3.981LysVal: 3.981 ± 0.678
0.937LysTrp: 0.937 ± 0.226
3.903LysTyr: 3.903 ± 0.465
0.0LysXaa: 0.0 ± 0.0
Leu
5.464LeuAla: 5.464 ± 0.614
1.639LeuCys: 1.639 ± 0.461
5.464LeuAsp: 5.464 ± 0.542
7.416LeuGlu: 7.416 ± 0.81
2.966LeuPhe: 2.966 ± 0.493
6.167LeuGly: 6.167 ± 0.666
1.483LeuHis: 1.483 ± 0.377
5.23LeuIle: 5.23 ± 0.763
6.635LeuLys: 6.635 ± 0.724
7.26LeuLeu: 7.26 ± 0.814
2.81LeuMet: 2.81 ± 0.483
4.137LeuAsn: 4.137 ± 0.517
3.201LeuPro: 3.201 ± 0.55
5.308LeuGln: 5.308 ± 0.753
5.074LeuArg: 5.074 ± 0.78
6.714LeuSer: 6.714 ± 0.639
4.684LeuThr: 4.684 ± 0.5
5.933LeuVal: 5.933 ± 0.883
0.859LeuTrp: 0.859 ± 0.353
3.591LeuTyr: 3.591 ± 0.554
0.0LeuXaa: 0.0 ± 0.0
Met
2.108MetAla: 2.108 ± 0.337
0.156MetCys: 0.156 ± 0.111
1.171MetAsp: 1.171 ± 0.313
0.937MetGlu: 0.937 ± 0.25
0.859MetPhe: 0.859 ± 0.229
1.405MetGly: 1.405 ± 0.317
0.781MetHis: 0.781 ± 0.226
1.171MetIle: 1.171 ± 0.318
1.249MetLys: 1.249 ± 0.305
2.966MetLeu: 2.966 ± 0.498
0.703MetMet: 0.703 ± 0.234
1.639MetAsn: 1.639 ± 0.39
0.625MetPro: 0.625 ± 0.29
2.498MetGln: 2.498 ± 0.638
1.327MetArg: 1.327 ± 0.302
2.888MetSer: 2.888 ± 0.449
1.015MetThr: 1.015 ± 0.318
1.952MetVal: 1.952 ± 0.371
0.234MetTrp: 0.234 ± 0.128
1.483MetTyr: 1.483 ± 0.292
0.0MetXaa: 0.0 ± 0.0
Asn
2.654AsnAla: 2.654 ± 0.377
0.625AsnCys: 0.625 ± 0.275
1.874AsnAsp: 1.874 ± 0.379
2.186AsnGlu: 2.186 ± 0.454
1.171AsnPhe: 1.171 ± 0.311
2.654AsnGly: 2.654 ± 0.498
0.703AsnHis: 0.703 ± 0.288
2.654AsnIle: 2.654 ± 0.501
2.732AsnLys: 2.732 ± 0.452
4.372AsnLeu: 4.372 ± 0.578
0.937AsnMet: 0.937 ± 0.266
2.81AsnAsn: 2.81 ± 0.54
2.342AsnPro: 2.342 ± 0.429
1.171AsnGln: 1.171 ± 0.264
1.795AsnArg: 1.795 ± 0.298
3.044AsnSer: 3.044 ± 0.483
3.825AsnThr: 3.825 ± 0.51
3.981AsnVal: 3.981 ± 0.541
0.546AsnTrp: 0.546 ± 0.205
1.405AsnTyr: 1.405 ± 0.374
0.0AsnXaa: 0.0 ± 0.0
Pro
2.654ProAla: 2.654 ± 0.402
0.312ProCys: 0.312 ± 0.183
2.732ProAsp: 2.732 ± 0.486
2.966ProGlu: 2.966 ± 0.55
0.937ProPhe: 0.937 ± 0.247
0.078ProGly: 0.078 ± 0.073
1.015ProHis: 1.015 ± 0.249
1.561ProIle: 1.561 ± 0.343
3.044ProLys: 3.044 ± 0.428
2.498ProLeu: 2.498 ± 0.374
1.483ProMet: 1.483 ± 0.325
1.717ProAsn: 1.717 ± 0.352
0.468ProPro: 0.468 ± 0.186
1.405ProGln: 1.405 ± 0.288
1.327ProArg: 1.327 ± 0.252
3.903ProSer: 3.903 ± 0.533
3.123ProThr: 3.123 ± 0.46
2.81ProVal: 2.81 ± 0.451
0.39ProTrp: 0.39 ± 0.2
1.015ProTyr: 1.015 ± 0.256
0.0ProXaa: 0.0 ± 0.0
Gln
5.23GlnAla: 5.23 ± 0.927
0.312GlnCys: 0.312 ± 0.241
3.201GlnAsp: 3.201 ± 0.623
4.918GlnGlu: 4.918 ± 0.69
1.952GlnPhe: 1.952 ± 0.432
4.137GlnGly: 4.137 ± 0.514
1.015GlnHis: 1.015 ± 0.339
1.171GlnIle: 1.171 ± 0.242
2.264GlnLys: 2.264 ± 0.437
3.981GlnLeu: 3.981 ± 0.668
1.171GlnMet: 1.171 ± 0.236
2.03GlnAsn: 2.03 ± 0.43
1.093GlnPro: 1.093 ± 0.353
2.888GlnGln: 2.888 ± 0.641
2.03GlnArg: 2.03 ± 0.479
3.201GlnSer: 3.201 ± 0.443
2.03GlnThr: 2.03 ± 0.371
3.591GlnVal: 3.591 ± 0.508
0.625GlnTrp: 0.625 ± 0.195
3.279GlnTyr: 3.279 ± 0.539
0.0GlnXaa: 0.0 ± 0.0
Arg
3.981ArgAla: 3.981 ± 0.815
0.546ArgCys: 0.546 ± 0.266
2.108ArgAsp: 2.108 ± 0.437
3.591ArgGlu: 3.591 ± 0.532
2.108ArgPhe: 2.108 ± 0.373
3.279ArgGly: 3.279 ± 0.593
0.703ArgHis: 0.703 ± 0.191
2.342ArgIle: 2.342 ± 0.416
3.044ArgLys: 3.044 ± 0.5
5.308ArgLeu: 5.308 ± 0.687
0.859ArgMet: 0.859 ± 0.286
1.717ArgAsn: 1.717 ± 0.437
0.937ArgPro: 0.937 ± 0.274
1.405ArgGln: 1.405 ± 0.358
2.732ArgArg: 2.732 ± 0.558
3.279ArgSer: 3.279 ± 0.439
2.576ArgThr: 2.576 ± 0.369
3.669ArgVal: 3.669 ± 0.509
0.781ArgTrp: 0.781 ± 0.18
2.03ArgTyr: 2.03 ± 0.381
0.0ArgXaa: 0.0 ± 0.0
Ser
4.996SerAla: 4.996 ± 0.521
0.468SerCys: 0.468 ± 0.19
3.435SerAsp: 3.435 ± 0.475
3.669SerGlu: 3.669 ± 0.449
1.561SerPhe: 1.561 ± 0.337
5.699SerGly: 5.699 ± 0.893
1.171SerHis: 1.171 ± 0.326
4.762SerIle: 4.762 ± 0.495
4.606SerLys: 4.606 ± 0.677
6.089SerLeu: 6.089 ± 0.754
2.42SerMet: 2.42 ± 0.566
3.201SerAsn: 3.201 ± 0.492
3.279SerPro: 3.279 ± 0.388
2.576SerGln: 2.576 ± 0.49
3.123SerArg: 3.123 ± 0.347
5.543SerSer: 5.543 ± 0.724
8.899SerThr: 8.899 ± 0.953
6.479SerVal: 6.479 ± 0.873
0.703SerTrp: 0.703 ± 0.277
2.03SerTyr: 2.03 ± 0.406
0.0SerXaa: 0.0 ± 0.0
Thr
5.621ThrAla: 5.621 ± 0.724
0.546ThrCys: 0.546 ± 0.193
3.747ThrAsp: 3.747 ± 0.613
4.45ThrGlu: 4.45 ± 0.633
2.186ThrPhe: 2.186 ± 0.347
6.167ThrGly: 6.167 ± 0.642
1.171ThrHis: 1.171 ± 0.274
3.123ThrIle: 3.123 ± 0.568
3.669ThrLys: 3.669 ± 0.572
5.464ThrLeu: 5.464 ± 0.635
1.249ThrMet: 1.249 ± 0.294
3.044ThrAsn: 3.044 ± 0.335
3.747ThrPro: 3.747 ± 0.42
3.279ThrGln: 3.279 ± 0.504
2.498ThrArg: 2.498 ± 0.381
5.308ThrSer: 5.308 ± 0.746
2.654ThrThr: 2.654 ± 0.605
6.245ThrVal: 6.245 ± 0.849
0.781ThrTrp: 0.781 ± 0.225
2.81ThrTyr: 2.81 ± 0.447
0.0ThrXaa: 0.0 ± 0.0
Val
4.996ValAla: 4.996 ± 0.689
0.625ValCys: 0.625 ± 0.204
2.732ValAsp: 2.732 ± 0.396
2.81ValGlu: 2.81 ± 0.583
2.186ValPhe: 2.186 ± 0.418
5.23ValGly: 5.23 ± 0.893
1.874ValHis: 1.874 ± 0.484
3.357ValIle: 3.357 ± 0.428
4.059ValLys: 4.059 ± 0.547
7.026ValLeu: 7.026 ± 0.717
1.327ValMet: 1.327 ± 0.315
2.186ValAsn: 2.186 ± 0.336
3.435ValPro: 3.435 ± 0.566
7.104ValGln: 7.104 ± 0.873
3.747ValArg: 3.747 ± 0.545
4.996ValSer: 4.996 ± 0.73
4.294ValThr: 4.294 ± 0.56
4.762ValVal: 4.762 ± 0.758
0.937ValTrp: 0.937 ± 0.253
2.42ValTyr: 2.42 ± 0.534
0.0ValXaa: 0.0 ± 0.0
Trp
0.312TrpAla: 0.312 ± 0.164
0.234TrpCys: 0.234 ± 0.185
1.093TrpAsp: 1.093 ± 0.287
0.937TrpGlu: 0.937 ± 0.306
0.781TrpPhe: 0.781 ± 0.196
0.781TrpGly: 0.781 ± 0.288
0.39TrpHis: 0.39 ± 0.194
0.468TrpIle: 0.468 ± 0.18
0.625TrpLys: 0.625 ± 0.225
1.327TrpLeu: 1.327 ± 0.309
0.234TrpMet: 0.234 ± 0.141
0.546TrpAsn: 0.546 ± 0.153
0.0TrpPro: 0.0 ± 0.0
0.546TrpGln: 0.546 ± 0.197
0.625TrpArg: 0.625 ± 0.188
0.937TrpSer: 0.937 ± 0.2
0.312TrpThr: 0.312 ± 0.13
1.015TrpVal: 1.015 ± 0.282
0.468TrpTrp: 0.468 ± 0.206
1.015TrpTyr: 1.015 ± 0.229
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.795TyrAla: 1.795 ± 0.268
0.703TyrCys: 0.703 ± 0.222
1.874TyrAsp: 1.874 ± 0.402
2.498TyrGlu: 2.498 ± 0.483
1.015TyrPhe: 1.015 ± 0.325
2.732TyrGly: 2.732 ± 0.47
0.781TyrHis: 0.781 ± 0.328
3.669TyrIle: 3.669 ± 0.612
2.81TyrLys: 2.81 ± 0.445
3.903TyrLeu: 3.903 ± 0.683
1.171TyrMet: 1.171 ± 0.275
2.108TyrAsn: 2.108 ± 0.375
1.171TyrPro: 1.171 ± 0.252
1.874TyrGln: 1.874 ± 0.367
2.186TyrArg: 2.186 ± 0.383
3.747TyrSer: 3.747 ± 0.478
3.201TyrThr: 3.201 ± 0.475
1.483TyrVal: 1.483 ± 0.386
1.171TyrTrp: 1.171 ± 0.364
1.249TyrTyr: 1.249 ± 0.341
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (12811 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski