Amino acid dipeptide frequency for Propionibacterium phage PHL092M00

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
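As a rough illustration of what such a calculation involves, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. It is only a minimal sketch: the function name, the one-letter sequence encoding, the toy sequences, and the choice to normalize by the total number of counted pairs are assumptions made here, not details taken from the Proteome-pI pipeline.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: pairs are counted with overlap inside each protein and
    never across protein boundaries; frequencies are normalized by the
    total number of counted pairs.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences, not taken from the phage proteome.
toy_proteins = ["MAAGV", "MAG"]
freqs = dipeptide_frequencies(toy_proteins)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The table on this page uses three-letter codes (AlaAla, AlaCys, ...); the one-letter codes in the sketch are just a shorter way to write the same pairs.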

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all amino acids used equally often, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
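The arithmetic behind the expected value, and the over/under comparison, can be sketched as follows. The three observed values are copied from the table on this page; the `classify` helper and the plain comparison against 2.5‰ are assumptions for illustration, since the page does not state the exact rule used for the red and blue marking.

```python
# Expected frequency if all 20 x 20 dipeptides were equally likely.
N_STANDARD = 20
expected_permille = 1000.0 / (N_STANDARD ** 2)  # 2.5 per mille = 0.25 %


def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# Values in per mille, taken from the table on this page.
observed = {"AlaAla": 10.686, "GluGlu": 2.207, "CysCys": 0.116}
labels = {pair: classify(value) for pair, value in observed.items()}
fold = {pair: value / expected_permille for pair, value in observed.items()}

for pair in observed:
    print(pair, labels[pair], round(fold[pair], 2))
```

Under this assumed rule, AlaAla is roughly four times more frequent than the uniform expectation, while CysCys is far below it.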

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
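For readers who prefer fractions or percentages, the conversion is a single multiplication. The helper names below are hypothetical; the example value is the AlaAla entry from the table.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 10.686  # per mille, from the table
fraction = permille_to_fraction(ala_ala)
percent = permille_to_percent(ala_ala)
print(fraction, percent)
```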

For more information, see the sequence space article on Wikipedia.

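Entries are grouped by the first residue; within each group the second residue follows the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa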
Ala
AlaAla: 10.686 ± 1.739
AlaCys: 1.278 ± 0.406
AlaAsp: 5.692 ± 0.835
AlaGlu: 5.459 ± 0.978
AlaPhe: 2.788 ± 0.622
AlaGly: 10.803 ± 1.523
AlaHis: 2.207 ± 0.611
AlaIle: 4.298 ± 0.743
AlaLys: 3.833 ± 0.541
AlaLeu: 7.783 ± 1.21
AlaMet: 3.485 ± 1.309
AlaAsn: 2.555 ± 0.53
AlaPro: 2.439 ± 0.459
AlaGln: 3.949 ± 0.626
AlaArg: 6.621 ± 1.073
AlaSer: 5.808 ± 0.853
AlaThr: 6.156 ± 0.973
AlaVal: 9.525 ± 1.51
AlaTrp: 1.51 ± 0.333
AlaTyr: 2.788 ± 0.41
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.813 ± 0.277
CysCys: 0.116 ± 0.116
CysAsp: 1.162 ± 0.361
CysGlu: 0.697 ± 0.268
CysPhe: 0.116 ± 0.126
CysGly: 1.51 ± 0.395
CysHis: 0.465 ± 0.262
CysIle: 0.232 ± 0.155
CysLys: 0.581 ± 0.269
CysLeu: 0.348 ± 0.195
CysMet: 0.0 ± 0.0
CysAsn: 0.232 ± 0.163
CysPro: 0.929 ± 0.313
CysGln: 0.697 ± 0.244
CysArg: 1.626 ± 0.456
CysSer: 0.697 ± 0.319
CysThr: 0.697 ± 0.285
CysVal: 0.697 ± 0.291
CysTrp: 0.232 ± 0.119
CysTyr: 0.232 ± 0.167
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.227 ± 0.753
AspCys: 1.045 ± 0.349
AspAsp: 5.459 ± 0.84
AspGlu: 3.369 ± 0.766
AspPhe: 1.394 ± 0.38
AspGly: 6.737 ± 1.613
AspHis: 1.859 ± 0.497
AspIle: 2.439 ± 0.669
AspLys: 1.975 ± 0.393
AspLeu: 4.762 ± 0.925
AspMet: 1.626 ± 0.477
AspAsn: 3.136 ± 0.714
AspPro: 3.949 ± 0.815
AspGln: 2.091 ± 0.577
AspArg: 3.833 ± 0.811
AspSer: 3.136 ± 0.588
AspThr: 5.111 ± 0.672
AspVal: 5.692 ± 0.742
AspTrp: 1.742 ± 0.473
AspTyr: 1.975 ± 0.504
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.692 ± 1.055
GluCys: 0.697 ± 0.236
GluAsp: 2.091 ± 0.661
GluGlu: 2.207 ± 0.555
GluPhe: 1.278 ± 0.339
GluGly: 3.252 ± 0.533
GluHis: 0.929 ± 0.369
GluIle: 2.555 ± 0.424
GluLys: 1.975 ± 0.524
GluLeu: 3.717 ± 0.637
GluMet: 1.162 ± 0.315
GluAsn: 1.394 ± 0.417
GluPro: 1.394 ± 0.582
GluGln: 2.207 ± 0.442
GluArg: 3.485 ± 0.761
GluSer: 3.717 ± 0.749
GluThr: 3.369 ± 0.783
GluVal: 3.136 ± 0.695
GluTrp: 2.091 ± 0.579
GluTyr: 1.278 ± 0.472
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.136 ± 1.059
PheCys: 0.348 ± 0.179
PheAsp: 2.091 ± 0.574
PheGlu: 1.394 ± 0.381
PhePhe: 1.045 ± 0.348
PheGly: 2.207 ± 0.536
PheHis: 0.697 ± 0.29
PheIle: 1.162 ± 0.276
PheLys: 1.278 ± 0.382
PheLeu: 2.091 ± 0.45
PheMet: 0.697 ± 0.295
PheAsn: 0.813 ± 0.334
PhePro: 1.51 ± 0.408
PheGln: 0.813 ± 0.23
PheArg: 2.091 ± 0.459
PheSer: 1.742 ± 0.505
PheThr: 1.626 ± 0.347
PheVal: 2.091 ± 0.527
PheTrp: 0.697 ± 0.253
PheTyr: 0.232 ± 0.16
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.086 ± 1.459
GlyCys: 0.581 ± 0.261
GlyAsp: 6.273 ± 0.965
GlyGlu: 4.995 ± 0.737
GlyPhe: 3.252 ± 0.701
GlyGly: 6.737 ± 0.916
GlyHis: 1.859 ± 0.517
GlyIle: 2.904 ± 0.656
GlyLys: 4.646 ± 0.787
GlyLeu: 7.666 ± 1.138
GlyMet: 2.323 ± 0.592
GlyAsn: 2.788 ± 0.484
GlyPro: 3.252 ± 1.203
GlyGln: 3.369 ± 0.577
GlyArg: 5.111 ± 0.831
GlySer: 6.389 ± 1.008
GlyThr: 4.414 ± 0.704
GlyVal: 9.06 ± 1.664
GlyTrp: 1.975 ± 0.449
GlyTyr: 2.672 ± 0.642
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.091 ± 0.614
HisCys: 0.697 ± 0.3
HisAsp: 1.742 ± 0.479
HisGlu: 0.581 ± 0.275
HisPhe: 0.697 ± 0.385
HisGly: 1.742 ± 0.391
HisHis: 1.742 ± 0.571
HisIle: 1.859 ± 0.511
HisLys: 1.859 ± 0.601
HisLeu: 2.439 ± 0.531
HisMet: 0.348 ± 0.206
HisAsn: 0.697 ± 0.279
HisPro: 1.394 ± 0.414
HisGln: 1.51 ± 0.538
HisArg: 1.742 ± 0.434
HisSer: 1.626 ± 0.371
HisThr: 1.51 ± 0.522
HisVal: 1.742 ± 0.515
HisTrp: 0.348 ± 0.205
HisTyr: 0.697 ± 0.255
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.53 ± 0.868
IleCys: 0.929 ± 0.349
IleAsp: 4.879 ± 0.721
IleGlu: 2.788 ± 0.565
IlePhe: 1.278 ± 0.368
IleGly: 2.788 ± 0.751
IleHis: 1.045 ± 0.331
IleIle: 2.904 ± 0.734
IleLys: 1.51 ± 0.417
IleLeu: 3.252 ± 0.47
IleMet: 1.51 ± 0.502
IleAsn: 2.091 ± 0.456
IlePro: 3.02 ± 0.589
IleGln: 1.626 ± 0.383
IleArg: 2.555 ± 0.576
IleSer: 2.555 ± 0.501
IleThr: 3.717 ± 0.833
IleVal: 3.369 ± 0.461
IleTrp: 0.348 ± 0.198
IleTyr: 0.929 ± 0.366
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.414 ± 0.909
LysCys: 0.232 ± 0.153
LysAsp: 2.555 ± 0.695
LysGlu: 1.162 ± 0.352
LysPhe: 1.045 ± 0.303
LysGly: 3.949 ± 0.57
LysHis: 1.278 ± 0.351
LysIle: 1.51 ± 0.533
LysLys: 1.51 ± 0.491
LysLeu: 3.252 ± 0.549
LysMet: 0.929 ± 0.295
LysAsn: 2.091 ± 0.442
LysPro: 2.555 ± 0.575
LysGln: 2.439 ± 0.587
LysArg: 1.975 ± 0.466
LysSer: 2.323 ± 0.684
LysThr: 3.252 ± 0.604
LysVal: 1.975 ± 0.444
LysTrp: 0.697 ± 0.263
LysTyr: 1.045 ± 0.39
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.06 ± 1.185
LeuCys: 1.278 ± 0.378
LeuAsp: 5.111 ± 1.006
LeuGlu: 3.252 ± 0.64
LeuPhe: 1.859 ± 0.52
LeuGly: 6.389 ± 1.48
LeuHis: 2.904 ± 0.57
LeuIle: 3.369 ± 0.747
LeuLys: 4.182 ± 0.73
LeuLeu: 4.53 ± 0.713
LeuMet: 1.51 ± 0.358
LeuAsn: 2.788 ± 0.514
LeuPro: 4.298 ± 0.625
LeuGln: 3.252 ± 0.452
LeuArg: 3.02 ± 0.644
LeuSer: 5.459 ± 0.652
LeuThr: 4.762 ± 0.84
LeuVal: 4.762 ± 0.946
LeuTrp: 0.929 ± 0.396
LeuTyr: 1.859 ± 0.561
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.136 ± 0.671
MetCys: 0.697 ± 0.283
MetAsp: 0.929 ± 0.345
MetGlu: 0.697 ± 0.332
MetPhe: 0.813 ± 0.321
MetGly: 1.859 ± 0.58
MetHis: 0.813 ± 0.326
MetIle: 1.975 ± 0.453
MetLys: 0.929 ± 0.318
MetLeu: 2.672 ± 0.468
MetMet: 0.465 ± 0.228
MetAsn: 1.045 ± 0.401
MetPro: 1.51 ± 0.701
MetGln: 0.929 ± 0.33
MetArg: 1.278 ± 0.351
MetSer: 1.859 ± 0.513
MetThr: 1.045 ± 0.342
MetVal: 1.742 ± 0.355
MetTrp: 0.697 ± 0.263
MetTyr: 1.394 ± 0.363
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.788 ± 0.808
AsnCys: 0.116 ± 0.112
AsnAsp: 1.859 ± 0.435
AsnGlu: 1.162 ± 0.298
AsnPhe: 0.581 ± 0.258
AsnGly: 5.111 ± 1.002
AsnHis: 1.394 ± 0.347
AsnIle: 1.975 ± 0.44
AsnLys: 1.859 ± 0.487
AsnLeu: 2.091 ± 0.508
AsnMet: 1.278 ± 0.314
AsnAsn: 1.975 ± 0.611
AsnPro: 2.904 ± 0.656
AsnGln: 1.975 ± 0.626
AsnArg: 1.51 ± 0.572
AsnSer: 2.091 ± 0.581
AsnThr: 2.439 ± 0.718
AsnVal: 2.323 ± 0.486
AsnTrp: 0.348 ± 0.179
AsnTyr: 0.929 ± 0.294
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.808 ± 0.964
ProCys: 0.348 ± 0.181
ProAsp: 4.414 ± 0.687
ProGlu: 2.439 ± 0.603
ProPhe: 1.162 ± 0.41
ProGly: 4.995 ± 0.771
ProHis: 1.394 ± 0.414
ProIle: 2.091 ± 0.386
ProLys: 1.742 ± 0.361
ProLeu: 3.252 ± 0.65
ProMet: 0.813 ± 0.316
ProAsn: 2.439 ± 0.54
ProPro: 3.02 ± 0.719
ProGln: 1.742 ± 0.493
ProArg: 1.742 ± 0.454
ProSer: 3.252 ± 0.503
ProThr: 2.672 ± 0.534
ProVal: 4.762 ± 1.048
ProTrp: 1.51 ± 0.477
ProTyr: 0.697 ± 0.285
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.646 ± 0.946
GlnCys: 0.465 ± 0.28
GlnAsp: 1.742 ± 0.427
GlnGlu: 1.394 ± 0.49
GlnPhe: 0.697 ± 0.243
GlnGly: 2.555 ± 0.807
GlnHis: 1.626 ± 0.578
GlnIle: 2.323 ± 0.625
GlnLys: 1.162 ± 0.347
GlnLeu: 3.717 ± 0.632
GlnMet: 1.394 ± 0.377
GlnAsn: 1.045 ± 0.271
GlnPro: 3.02 ± 0.665
GlnGln: 3.601 ± 0.794
GlnArg: 3.252 ± 0.582
GlnSer: 2.091 ± 0.433
GlnThr: 2.788 ± 0.785
GlnVal: 3.136 ± 0.621
GlnTrp: 0.929 ± 0.305
GlnTyr: 1.51 ± 0.427
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.156 ± 0.859
ArgCys: 0.232 ± 0.169
ArgAsp: 3.717 ± 0.656
ArgGlu: 2.788 ± 0.55
ArgPhe: 2.091 ± 0.529
ArgGly: 4.182 ± 0.819
ArgHis: 1.278 ± 0.354
ArgIle: 3.485 ± 0.642
ArgLys: 2.555 ± 0.614
ArgLeu: 6.273 ± 0.858
ArgMet: 2.091 ± 0.516
ArgAsn: 2.439 ± 0.562
ArgPro: 1.975 ± 0.551
ArgGln: 2.439 ± 0.559
ArgArg: 5.808 ± 0.906
ArgSer: 4.646 ± 0.838
ArgThr: 2.672 ± 0.787
ArgVal: 4.879 ± 0.853
ArgTrp: 1.045 ± 0.337
ArgTyr: 1.859 ± 0.443
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.621 ± 1.195
SerCys: 0.465 ± 0.255
SerAsp: 4.414 ± 0.633
SerGlu: 3.02 ± 0.492
SerPhe: 2.904 ± 0.616
SerGly: 8.015 ± 1.264
SerHis: 1.394 ± 0.298
SerIle: 3.02 ± 0.415
SerLys: 2.091 ± 0.625
SerLeu: 3.949 ± 0.567
SerMet: 2.091 ± 0.363
SerAsn: 2.323 ± 0.544
SerPro: 3.02 ± 0.624
SerGln: 2.091 ± 0.433
SerArg: 4.066 ± 0.755
SerSer: 3.949 ± 0.713
SerThr: 2.904 ± 0.517
SerVal: 6.273 ± 1.093
SerTrp: 1.626 ± 0.368
SerTyr: 0.581 ± 0.26
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.156 ± 0.864
ThrCys: 0.813 ± 0.375
ThrAsp: 3.601 ± 0.557
ThrGlu: 2.323 ± 0.599
ThrPhe: 1.859 ± 0.515
ThrGly: 4.995 ± 0.657
ThrHis: 1.162 ± 0.458
ThrIle: 3.833 ± 0.67
ThrLys: 1.975 ± 0.454
ThrLeu: 4.298 ± 0.901
ThrMet: 0.697 ± 0.255
ThrAsn: 1.975 ± 0.526
ThrPro: 4.066 ± 0.762
ThrGln: 3.601 ± 0.777
ThrArg: 3.252 ± 0.543
ThrSer: 3.833 ± 0.665
ThrThr: 3.833 ± 1.05
ThrVal: 5.576 ± 0.819
ThrTrp: 0.813 ± 0.275
ThrTyr: 2.091 ± 0.492
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.783 ± 1.695
ValCys: 1.045 ± 0.327
ValAsp: 5.459 ± 0.87
ValGlu: 5.227 ± 0.779
ValPhe: 2.091 ± 0.582
ValGly: 5.808 ± 0.914
ValHis: 1.278 ± 0.307
ValIle: 3.369 ± 0.931
ValLys: 3.717 ± 0.971
ValLeu: 6.156 ± 0.928
ValMet: 2.788 ± 0.592
ValAsn: 3.136 ± 0.725
ValPro: 3.369 ± 0.484
ValGln: 2.672 ± 0.565
ValArg: 5.111 ± 0.753
ValSer: 6.969 ± 0.845
ValThr: 4.414 ± 0.675
ValVal: 6.273 ± 1.246
ValTrp: 1.162 ± 0.417
ValTyr: 1.742 ± 0.539
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.859 ± 0.528
TrpCys: 0.232 ± 0.184
TrpAsp: 0.813 ± 0.335
TrpGlu: 1.278 ± 0.345
TrpPhe: 0.465 ± 0.198
TrpGly: 1.394 ± 0.457
TrpHis: 0.697 ± 0.3
TrpIle: 0.813 ± 0.261
TrpLys: 0.465 ± 0.217
TrpLeu: 1.278 ± 0.428
TrpMet: 0.813 ± 0.303
TrpAsn: 0.813 ± 0.358
TrpPro: 0.581 ± 0.254
TrpGln: 0.929 ± 0.318
TrpArg: 2.091 ± 0.525
TrpSer: 1.394 ± 0.546
TrpThr: 1.51 ± 0.37
TrpVal: 0.813 ± 0.352
TrpTrp: 0.465 ± 0.216
TrpTyr: 0.581 ± 0.294
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.975 ± 0.452
TyrCys: 0.465 ± 0.219
TyrAsp: 2.555 ± 0.49
TyrGlu: 1.394 ± 0.336
TyrPhe: 0.348 ± 0.164
TyrGly: 2.323 ± 0.475
TyrHis: 1.045 ± 0.34
TyrIle: 1.51 ± 0.419
TyrLys: 0.348 ± 0.19
TyrLeu: 1.045 ± 0.323
TyrMet: 0.348 ± 0.185
TyrAsn: 0.929 ± 0.378
TyrPro: 2.091 ± 0.52
TyrGln: 1.162 ± 0.362
TyrArg: 2.439 ± 0.569
TyrSer: 1.278 ± 0.345
TyrThr: 1.859 ± 0.373
TyrVal: 1.742 ± 0.359
TyrTrp: 0.232 ± 0.178
TyrTyr: 0.813 ± 0.277
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 45 proteins (8610 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
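One way such a protein-level bootstrap could work is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported. This is a minimal sketch under stated assumptions; the function names, the fixed seed, the one-letter toy sequences, and the use of the sample standard deviation as the error measure are all choices made here, not details confirmed by the source.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Estimate the error of one dipeptide frequency (per mille).

    Assumption: proteins, not residues, are the resampling unit, and the
    error is the standard deviation over n_boot replicates.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not the 45 phage proteins.
toy_proteins = ["MAAGV", "MAG", "AAAA", "MGVLA"]
error_aa = bootstrap_error(toy_proteins, "AA")
print(round(error_aa, 3))
```

Resampling whole proteins keeps each protein's internal composition intact, so the estimated error reflects variation between proteins rather than between individual residue pairs.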

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski