Amino acid dipepetide frequency for Paenibacillus phage HB10c2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.908AlaAla: 4.908 ± 1.047
0.463AlaCys: 0.463 ± 0.18
4.445AlaAsp: 4.445 ± 0.72
6.02AlaGlu: 6.02 ± 0.683
2.037AlaPhe: 2.037 ± 0.37
4.816AlaGly: 4.816 ± 0.719
0.926AlaHis: 0.926 ± 0.324
5.001AlaIle: 5.001 ± 0.885
5.094AlaLys: 5.094 ± 0.738
6.205AlaLeu: 6.205 ± 0.686
1.019AlaMet: 1.019 ± 0.303
3.056AlaAsn: 3.056 ± 0.525
1.111AlaPro: 1.111 ± 0.363
1.667AlaGln: 1.667 ± 0.344
2.593AlaArg: 2.593 ± 0.437
4.63AlaSer: 4.63 ± 0.709
3.982AlaThr: 3.982 ± 0.824
4.075AlaVal: 4.075 ± 0.517
1.204AlaTrp: 1.204 ± 0.332
2.778AlaTyr: 2.778 ± 0.61
0.0AlaXaa: 0.0 ± 0.0
Cys
0.741CysAla: 0.741 ± 0.245
0.278CysCys: 0.278 ± 0.151
0.37CysAsp: 0.37 ± 0.166
0.648CysGlu: 0.648 ± 0.268
0.185CysPhe: 0.185 ± 0.128
0.741CysGly: 0.741 ± 0.248
0.278CysHis: 0.278 ± 0.128
0.556CysIle: 0.556 ± 0.194
1.204CysLys: 1.204 ± 0.424
0.741CysLeu: 0.741 ± 0.233
0.093CysMet: 0.093 ± 0.093
0.278CysAsn: 0.278 ± 0.165
0.648CysPro: 0.648 ± 0.263
0.278CysGln: 0.278 ± 0.159
0.37CysArg: 0.37 ± 0.187
0.556CysSer: 0.556 ± 0.19
0.278CysThr: 0.278 ± 0.16
0.556CysVal: 0.556 ± 0.214
0.093CysTrp: 0.093 ± 0.094
0.185CysTyr: 0.185 ± 0.121
0.0CysXaa: 0.0 ± 0.0
Asp
3.704AspAla: 3.704 ± 0.724
0.37AspCys: 0.37 ± 0.189
4.816AspAsp: 4.816 ± 0.853
4.075AspGlu: 4.075 ± 0.705
2.778AspPhe: 2.778 ± 0.613
3.797AspGly: 3.797 ± 0.621
0.648AspHis: 0.648 ± 0.235
3.612AspIle: 3.612 ± 0.498
3.982AspLys: 3.982 ± 0.716
5.094AspLeu: 5.094 ± 0.621
2.037AspMet: 2.037 ± 0.403
1.389AspAsn: 1.389 ± 0.289
2.037AspPro: 2.037 ± 0.495
2.223AspGln: 2.223 ± 0.494
3.241AspArg: 3.241 ± 0.52
2.593AspSer: 2.593 ± 0.43
3.241AspThr: 3.241 ± 0.432
3.89AspVal: 3.89 ± 0.568
0.926AspTrp: 0.926 ± 0.331
2.408AspTyr: 2.408 ± 0.482
0.0AspXaa: 0.0 ± 0.0
Glu
4.816GluAla: 4.816 ± 0.769
0.278GluCys: 0.278 ± 0.16
4.816GluAsp: 4.816 ± 0.918
5.557GluGlu: 5.557 ± 0.908
2.408GluPhe: 2.408 ± 0.446
3.704GluGly: 3.704 ± 0.423
1.111GluHis: 1.111 ± 0.463
6.39GluIle: 6.39 ± 0.919
7.964GluLys: 7.964 ± 0.914
6.575GluLeu: 6.575 ± 0.822
4.63GluMet: 4.63 ± 0.596
3.519GluAsn: 3.519 ± 0.703
2.5GluPro: 2.5 ± 0.45
4.167GluGln: 4.167 ± 0.684
3.89GluArg: 3.89 ± 0.911
3.982GluSer: 3.982 ± 0.546
4.723GluThr: 4.723 ± 0.469
5.464GluVal: 5.464 ± 0.852
1.019GluTrp: 1.019 ± 0.335
3.056GluTyr: 3.056 ± 0.611
0.0GluXaa: 0.0 ± 0.0
Phe
1.852PheAla: 1.852 ± 0.463
0.741PheCys: 0.741 ± 0.253
2.315PheAsp: 2.315 ± 0.501
2.964PheGlu: 2.964 ± 0.581
1.297PhePhe: 1.297 ± 0.399
2.408PheGly: 2.408 ± 0.496
1.204PheHis: 1.204 ± 0.405
3.334PheIle: 3.334 ± 0.516
2.686PheLys: 2.686 ± 0.593
3.334PheLeu: 3.334 ± 0.567
1.019PheMet: 1.019 ± 0.304
2.037PheAsn: 2.037 ± 0.434
0.926PhePro: 0.926 ± 0.273
1.389PheGln: 1.389 ± 0.347
0.926PheArg: 0.926 ± 0.314
2.964PheSer: 2.964 ± 0.511
1.76PheThr: 1.76 ± 0.369
2.593PheVal: 2.593 ± 0.608
0.741PheTrp: 0.741 ± 0.31
1.297PheTyr: 1.297 ± 0.34
0.0PheXaa: 0.0 ± 0.0
Gly
2.964GlyAla: 2.964 ± 0.521
0.648GlyCys: 0.648 ± 0.234
2.871GlyAsp: 2.871 ± 0.453
5.094GlyGlu: 5.094 ± 0.707
2.964GlyPhe: 2.964 ± 0.574
2.778GlyGly: 2.778 ± 0.519
1.297GlyHis: 1.297 ± 0.32
6.02GlyIle: 6.02 ± 1.04
6.483GlyLys: 6.483 ± 0.723
4.908GlyLeu: 4.908 ± 0.986
1.76GlyMet: 1.76 ± 0.637
3.149GlyAsn: 3.149 ± 0.563
0.556GlyPro: 0.556 ± 0.258
2.037GlyGln: 2.037 ± 0.452
3.149GlyArg: 3.149 ± 0.587
3.149GlySer: 3.149 ± 0.652
2.778GlyThr: 2.778 ± 0.589
3.982GlyVal: 3.982 ± 0.632
0.926GlyTrp: 0.926 ± 0.216
3.056GlyTyr: 3.056 ± 0.569
0.0GlyXaa: 0.0 ± 0.0
His
1.204HisAla: 1.204 ± 0.304
0.278HisCys: 0.278 ± 0.159
0.833HisAsp: 0.833 ± 0.342
1.667HisGlu: 1.667 ± 0.448
0.741HisPhe: 0.741 ± 0.252
0.37HisGly: 0.37 ± 0.163
0.648HisHis: 0.648 ± 0.214
1.945HisIle: 1.945 ± 0.472
1.019HisLys: 1.019 ± 0.335
1.852HisLeu: 1.852 ± 0.468
0.37HisMet: 0.37 ± 0.17
0.833HisAsn: 0.833 ± 0.244
0.833HisPro: 0.833 ± 0.207
0.741HisGln: 0.741 ± 0.244
1.019HisArg: 1.019 ± 0.238
1.204HisSer: 1.204 ± 0.299
0.463HisThr: 0.463 ± 0.224
1.297HisVal: 1.297 ± 0.307
0.278HisTrp: 0.278 ± 0.138
0.463HisTyr: 0.463 ± 0.204
0.0HisXaa: 0.0 ± 0.0
Ile
4.63IleAla: 4.63 ± 0.738
0.741IleCys: 0.741 ± 0.294
4.538IleAsp: 4.538 ± 0.769
6.297IleGlu: 6.297 ± 0.768
3.241IlePhe: 3.241 ± 0.554
3.612IleGly: 3.612 ± 0.68
1.389IleHis: 1.389 ± 0.332
3.056IleIle: 3.056 ± 0.72
7.038IleLys: 7.038 ± 0.982
6.112IleLeu: 6.112 ± 0.741
1.574IleMet: 1.574 ± 0.316
3.149IleAsn: 3.149 ± 0.504
3.427IlePro: 3.427 ± 0.48
2.964IleGln: 2.964 ± 0.465
3.612IleArg: 3.612 ± 0.584
3.704IleSer: 3.704 ± 0.498
3.89IleThr: 3.89 ± 0.476
5.001IleVal: 5.001 ± 0.712
0.926IleTrp: 0.926 ± 0.288
3.056IleTyr: 3.056 ± 0.47
0.0IleXaa: 0.0 ± 0.0
Lys
6.02LysAla: 6.02 ± 0.686
0.741LysCys: 0.741 ± 0.275
4.353LysAsp: 4.353 ± 0.613
8.335LysGlu: 8.335 ± 0.907
2.408LysPhe: 2.408 ± 0.362
6.112LysGly: 6.112 ± 0.768
1.111LysHis: 1.111 ± 0.255
6.02LysIle: 6.02 ± 0.684
8.798LysLys: 8.798 ± 1.299
6.02LysLeu: 6.02 ± 0.707
3.149LysMet: 3.149 ± 0.501
4.723LysAsn: 4.723 ± 0.687
2.5LysPro: 2.5 ± 0.536
4.538LysGln: 4.538 ± 0.697
5.371LysArg: 5.371 ± 0.797
4.908LysSer: 4.908 ± 0.682
4.445LysThr: 4.445 ± 0.664
3.89LysVal: 3.89 ± 0.596
1.389LysTrp: 1.389 ± 0.318
2.408LysTyr: 2.408 ± 0.462
0.0LysXaa: 0.0 ± 0.0
Leu
6.39LeuAla: 6.39 ± 0.821
0.833LeuCys: 0.833 ± 0.315
5.001LeuAsp: 5.001 ± 0.64
7.131LeuGlu: 7.131 ± 0.874
2.686LeuPhe: 2.686 ± 0.388
5.279LeuGly: 5.279 ± 0.747
1.852LeuHis: 1.852 ± 0.413
4.538LeuIle: 4.538 ± 0.562
6.02LeuLys: 6.02 ± 0.629
6.112LeuLeu: 6.112 ± 0.893
1.667LeuMet: 1.667 ± 0.377
3.89LeuAsn: 3.89 ± 0.626
3.241LeuPro: 3.241 ± 0.534
4.445LeuGln: 4.445 ± 0.556
3.149LeuArg: 3.149 ± 0.464
5.927LeuSer: 5.927 ± 0.59
5.834LeuThr: 5.834 ± 0.758
4.075LeuVal: 4.075 ± 0.557
0.926LeuTrp: 0.926 ± 0.349
2.686LeuTyr: 2.686 ± 0.429
0.0LeuXaa: 0.0 ± 0.0
Met
2.037MetAla: 2.037 ± 0.413
0.0MetCys: 0.0 ± 0.0
1.482MetAsp: 1.482 ± 0.414
2.223MetGlu: 2.223 ± 0.485
1.482MetPhe: 1.482 ± 0.381
1.297MetGly: 1.297 ± 0.318
0.093MetHis: 0.093 ± 0.081
2.223MetIle: 2.223 ± 0.39
3.241MetLys: 3.241 ± 0.581
2.593MetLeu: 2.593 ± 0.496
0.926MetMet: 0.926 ± 0.248
2.686MetAsn: 2.686 ± 0.557
0.926MetPro: 0.926 ± 0.278
1.482MetGln: 1.482 ± 0.279
1.019MetArg: 1.019 ± 0.303
2.5MetSer: 2.5 ± 0.389
1.297MetThr: 1.297 ± 0.328
0.926MetVal: 0.926 ± 0.442
0.093MetTrp: 0.093 ± 0.088
1.111MetTyr: 1.111 ± 0.344
0.0MetXaa: 0.0 ± 0.0
Asn
2.408AsnAla: 2.408 ± 0.464
0.556AsnCys: 0.556 ± 0.214
1.945AsnAsp: 1.945 ± 0.463
2.871AsnGlu: 2.871 ± 0.615
1.111AsnPhe: 1.111 ± 0.377
3.612AsnGly: 3.612 ± 0.7
1.019AsnHis: 1.019 ± 0.301
3.241AsnIle: 3.241 ± 0.525
3.241AsnLys: 3.241 ± 0.468
4.075AsnLeu: 4.075 ± 0.588
1.297AsnMet: 1.297 ± 0.463
2.223AsnAsn: 2.223 ± 0.549
2.408AsnPro: 2.408 ± 0.485
1.667AsnGln: 1.667 ± 0.395
3.89AsnArg: 3.89 ± 0.683
2.593AsnSer: 2.593 ± 0.475
2.223AsnThr: 2.223 ± 0.413
2.778AsnVal: 2.778 ± 0.495
0.833AsnTrp: 0.833 ± 0.294
1.667AsnTyr: 1.667 ± 0.363
0.0AsnXaa: 0.0 ± 0.0
Pro
2.315ProAla: 2.315 ± 0.42
0.093ProCys: 0.093 ± 0.08
1.482ProAsp: 1.482 ± 0.311
3.149ProGlu: 3.149 ± 0.825
1.204ProPhe: 1.204 ± 0.308
2.223ProGly: 2.223 ± 0.444
0.926ProHis: 0.926 ± 0.252
2.5ProIle: 2.5 ± 0.387
3.149ProLys: 3.149 ± 0.639
2.593ProLeu: 2.593 ± 0.489
0.926ProMet: 0.926 ± 0.249
1.574ProAsn: 1.574 ± 0.432
1.204ProPro: 1.204 ± 0.311
0.926ProGln: 0.926 ± 0.336
0.926ProArg: 0.926 ± 0.284
3.056ProSer: 3.056 ± 0.707
1.76ProThr: 1.76 ± 0.347
2.408ProVal: 2.408 ± 0.513
0.463ProTrp: 0.463 ± 0.198
1.019ProTyr: 1.019 ± 0.299
0.0ProXaa: 0.0 ± 0.0
Gln
3.241GlnAla: 3.241 ± 0.476
0.185GlnCys: 0.185 ± 0.12
2.037GlnAsp: 2.037 ± 0.306
3.89GlnGlu: 3.89 ± 0.673
1.574GlnPhe: 1.574 ± 0.356
2.593GlnGly: 2.593 ± 0.635
1.019GlnHis: 1.019 ± 0.223
2.593GlnIle: 2.593 ± 0.518
3.519GlnLys: 3.519 ± 0.634
3.334GlnLeu: 3.334 ± 0.615
1.76GlnMet: 1.76 ± 0.491
1.297GlnAsn: 1.297 ± 0.258
1.76GlnPro: 1.76 ± 0.488
1.574GlnGln: 1.574 ± 0.36
2.778GlnArg: 2.778 ± 0.521
2.223GlnSer: 2.223 ± 0.383
2.686GlnThr: 2.686 ± 0.381
1.852GlnVal: 1.852 ± 0.423
0.741GlnTrp: 0.741 ± 0.289
1.389GlnTyr: 1.389 ± 0.345
0.0GlnXaa: 0.0 ± 0.0
Arg
2.871ArgAla: 2.871 ± 0.518
0.556ArgCys: 0.556 ± 0.245
2.315ArgAsp: 2.315 ± 0.439
3.612ArgGlu: 3.612 ± 0.613
1.667ArgPhe: 1.667 ± 0.434
2.778ArgGly: 2.778 ± 0.6
0.926ArgHis: 0.926 ± 0.272
3.149ArgIle: 3.149 ± 0.544
5.186ArgLys: 5.186 ± 0.708
4.167ArgLeu: 4.167 ± 0.663
1.297ArgMet: 1.297 ± 0.346
2.408ArgAsn: 2.408 ± 0.415
1.574ArgPro: 1.574 ± 0.359
2.593ArgGln: 2.593 ± 0.468
2.593ArgArg: 2.593 ± 0.473
2.593ArgSer: 2.593 ± 0.368
2.315ArgThr: 2.315 ± 0.513
2.871ArgVal: 2.871 ± 0.621
0.833ArgTrp: 0.833 ± 0.271
1.852ArgTyr: 1.852 ± 0.371
0.0ArgXaa: 0.0 ± 0.0
Ser
5.279SerAla: 5.279 ± 1.031
0.741SerCys: 0.741 ± 0.236
3.612SerAsp: 3.612 ± 0.5
5.557SerGlu: 5.557 ± 0.686
2.778SerPhe: 2.778 ± 0.553
4.353SerGly: 4.353 ± 0.773
1.019SerHis: 1.019 ± 0.376
5.742SerIle: 5.742 ± 0.979
4.167SerLys: 4.167 ± 0.758
5.094SerLeu: 5.094 ± 0.773
1.667SerMet: 1.667 ± 0.403
2.223SerAsn: 2.223 ± 0.391
2.593SerPro: 2.593 ± 0.386
2.593SerGln: 2.593 ± 0.592
2.5SerArg: 2.5 ± 0.504
3.704SerSer: 3.704 ± 0.779
3.334SerThr: 3.334 ± 0.593
3.89SerVal: 3.89 ± 0.764
0.37SerTrp: 0.37 ± 0.162
1.945SerTyr: 1.945 ± 0.367
0.0SerXaa: 0.0 ± 0.0
Thr
4.26ThrAla: 4.26 ± 0.672
0.278ThrCys: 0.278 ± 0.149
2.315ThrAsp: 2.315 ± 0.398
3.241ThrGlu: 3.241 ± 0.471
2.037ThrPhe: 2.037 ± 0.382
4.167ThrGly: 4.167 ± 0.611
1.111ThrHis: 1.111 ± 0.338
4.353ThrIle: 4.353 ± 0.687
5.094ThrLys: 5.094 ± 0.728
3.704ThrLeu: 3.704 ± 0.517
1.482ThrMet: 1.482 ± 0.442
1.574ThrAsn: 1.574 ± 0.411
2.593ThrPro: 2.593 ± 0.531
2.5ThrGln: 2.5 ± 0.463
2.315ThrArg: 2.315 ± 0.386
4.167ThrSer: 4.167 ± 0.55
2.408ThrThr: 2.408 ± 0.537
4.908ThrVal: 4.908 ± 0.569
0.37ThrTrp: 0.37 ± 0.161
2.408ThrTyr: 2.408 ± 0.479
0.0ThrXaa: 0.0 ± 0.0
Val
3.982ValAla: 3.982 ± 0.563
0.741ValCys: 0.741 ± 0.258
3.704ValAsp: 3.704 ± 0.574
4.816ValGlu: 4.816 ± 0.69
3.149ValPhe: 3.149 ± 0.561
3.241ValGly: 3.241 ± 0.615
0.556ValHis: 0.556 ± 0.28
4.538ValIle: 4.538 ± 0.557
4.908ValLys: 4.908 ± 0.679
4.445ValLeu: 4.445 ± 0.503
1.667ValMet: 1.667 ± 0.366
2.593ValAsn: 2.593 ± 0.517
1.574ValPro: 1.574 ± 0.37
2.037ValGln: 2.037 ± 0.387
2.408ValArg: 2.408 ± 0.46
5.186ValSer: 5.186 ± 0.571
4.908ValThr: 4.908 ± 0.635
3.427ValVal: 3.427 ± 0.448
1.852ValTrp: 1.852 ± 0.953
1.667ValTyr: 1.667 ± 0.423
0.0ValXaa: 0.0 ± 0.0
Trp
0.741TrpAla: 0.741 ± 0.222
0.093TrpCys: 0.093 ± 0.097
0.833TrpAsp: 0.833 ± 0.365
0.833TrpGlu: 0.833 ± 0.226
0.926TrpPhe: 0.926 ± 0.218
0.833TrpGly: 0.833 ± 0.249
0.463TrpHis: 0.463 ± 0.185
0.741TrpIle: 0.741 ± 0.244
0.926TrpLys: 0.926 ± 0.283
1.76TrpLeu: 1.76 ± 0.375
0.093TrpMet: 0.093 ± 0.084
1.852TrpAsn: 1.852 ± 1.146
0.278TrpPro: 0.278 ± 0.241
0.741TrpGln: 0.741 ± 0.226
0.463TrpArg: 0.463 ± 0.232
1.111TrpSer: 1.111 ± 0.323
0.463TrpThr: 0.463 ± 0.187
0.648TrpVal: 0.648 ± 0.271
0.278TrpTrp: 0.278 ± 0.166
0.463TrpTyr: 0.463 ± 0.21
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.667TyrAla: 1.667 ± 0.438
0.463TyrCys: 0.463 ± 0.189
2.778TyrAsp: 2.778 ± 0.538
2.593TyrGlu: 2.593 ± 0.523
1.204TyrPhe: 1.204 ± 0.297
1.852TyrGly: 1.852 ± 0.45
0.463TyrHis: 0.463 ± 0.191
2.408TyrIle: 2.408 ± 0.415
3.519TyrLys: 3.519 ± 0.451
2.964TyrLeu: 2.964 ± 0.475
1.111TyrMet: 1.111 ± 0.265
1.204TyrAsn: 1.204 ± 0.412
1.204TyrPro: 1.204 ± 0.343
1.482TyrGln: 1.482 ± 0.32
1.945TyrArg: 1.945 ± 0.478
2.408TyrSer: 2.408 ± 0.451
2.5TyrThr: 2.5 ± 0.405
2.871TyrVal: 2.871 ± 0.461
0.278TyrTrp: 0.278 ± 0.159
1.482TyrTyr: 1.482 ± 0.386
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 56 proteins (10799 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski