Amino acid dipeptide frequency for Bacillus virus AP50

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies, i.e. how often each ordered pair of adjacent amino acids occurs.
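As a rough illustration of how such frequencies can be computed, the minimal Python sketch below counts overlapping dipeptides within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the use of one-letter amino acid codes, and the normalization by the total number of dipeptides are assumptions made for this example; the database may normalize differently.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of overlapping dipeptides.

    proteins: iterable of protein sequences in one-letter code.
    Dipeptides are counted within each protein only (never across
    protein boundaries). Normalizing by the total number of dipeptides
    is an assumption of this sketch.
    """
    counts = Counter()
    for seq in proteins:
        seq = seq.upper()
        # Overlapping windows of length 2: positions (0,1), (1,2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Two toy sequences (hypothetical, not taken from the AP50 proteome).
freqs = dipeptide_frequencies(["MKKLA", "AKK"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

Counting per protein avoids creating artificial dipeptides that span the end of one protein and the start of the next.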

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
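The arithmetic behind the expected frequency can be sketched in a few lines of Python. The helper names below are hypothetical, and treating the uniform expectation as the exact cut-off for the red/blue marking is an assumption; the page does not state the threshold it uses.

```python
def expected_uniform_per_mille(alphabet_size=20):
    """Expected per-mille frequency if all ordered pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_per_mille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation (assumed cut-off)."""
    expected = expected_uniform_per_mille(alphabet_size)
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


print(expected_uniform_per_mille(20))  # 400 possible dipeptides
print(expected_uniform_per_mille(21))  # 441 possible dipeptides (with Xaa)
print(classify(10.024))  # LysLys value from the table
print(classify(0.654))   # AlaCys value from the table
```

With 20 amino acids the expectation is 1000/400 = 2.5‰; with Xaa included it drops to 1000/441, roughly 2.27‰.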

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
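For readers who prefer fractions or percentages, the unit conversion is a single multiplication. The function names in this small sketch are illustrative only.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


gly_gly = 9.588  # GlyGly value from the table, in per mille
print(per_mille_to_fraction(gly_gly))
print(per_mille_to_percent(gly_gly))
```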

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency ± error (‰), grouped by first residue. Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.358 ± 2.018
AlaCys: 0.654 ± 0.375
AlaAsp: 1.961 ± 0.588
AlaGlu: 3.269 ± 0.685
AlaPhe: 1.961 ± 0.698
AlaGly: 5.884 ± 1.479
AlaHis: 0.872 ± 0.426
AlaIle: 4.794 ± 1.366
AlaLys: 6.537 ± 1.447
AlaLeu: 3.705 ± 0.699
AlaMet: 1.307 ± 0.409
AlaAsn: 3.051 ± 0.776
AlaPro: 2.179 ± 0.744
AlaGln: 2.833 ± 1.059
AlaArg: 2.833 ± 0.867
AlaSer: 3.705 ± 0.89
AlaThr: 3.705 ± 0.659
AlaVal: 3.705 ± 1.139
AlaTrp: 0.436 ± 0.328
AlaTyr: 2.397 ± 0.689
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.218 ± 0.172
CysGlu: 0.436 ± 0.3
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.436 ± 0.278
CysIle: 0.218 ± 0.215
CysLys: 0.218 ± 0.172
CysLeu: 0.436 ± 0.303
CysMet: 0.654 ± 0.347
CysAsn: 0.436 ± 0.268
CysPro: 0.436 ± 0.259
CysGln: 0.0 ± 0.0
CysArg: 1.09 ± 0.385
CysSer: 0.218 ± 0.185
CysThr: 0.872 ± 0.315
CysVal: 0.654 ± 0.347
CysTrp: 0.0 ± 0.0
CysTyr: 0.218 ± 0.245
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.269 ± 0.845
AspCys: 0.436 ± 0.344
AspAsp: 3.269 ± 1.013
AspGlu: 2.833 ± 0.701
AspPhe: 4.794 ± 1.018
AspGly: 3.269 ± 0.655
AspHis: 1.307 ± 0.571
AspIle: 2.833 ± 0.6
AspLys: 4.576 ± 0.834
AspLeu: 5.666 ± 1.023
AspMet: 2.179 ± 0.71
AspAsn: 1.961 ± 0.795
AspPro: 1.961 ± 0.394
AspGln: 2.615 ± 1.116
AspArg: 3.051 ± 0.633
AspSer: 3.705 ± 0.821
AspThr: 4.576 ± 0.993
AspVal: 5.23 ± 1.131
AspTrp: 0.436 ± 0.343
AspTyr: 1.525 ± 1.036
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.576 ± 0.953
GluCys: 0.218 ± 0.172
GluAsp: 5.448 ± 0.938
GluGlu: 8.716 ± 4.372
GluPhe: 2.397 ± 0.734
GluGly: 3.051 ± 0.865
GluHis: 1.307 ± 0.664
GluIle: 4.358 ± 0.727
GluLys: 3.705 ± 0.826
GluLeu: 4.358 ± 0.969
GluMet: 2.833 ± 0.748
GluAsn: 3.487 ± 0.706
GluPro: 0.872 ± 0.424
GluGln: 5.448 ± 1.009
GluArg: 2.615 ± 0.746
GluSer: 4.576 ± 1.061
GluThr: 4.14 ± 0.935
GluVal: 4.14 ± 1.073
GluTrp: 0.872 ± 0.408
GluTyr: 1.743 ± 0.481
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.269 ± 0.856
PheCys: 0.0 ± 0.0
PheAsp: 2.615 ± 1.047
PheGlu: 3.269 ± 0.857
PhePhe: 2.179 ± 0.673
PheGly: 1.961 ± 0.514
PheHis: 0.218 ± 0.188
PheIle: 3.269 ± 0.759
PheLys: 4.14 ± 1.013
PheLeu: 3.051 ± 0.964
PheMet: 2.397 ± 0.852
PheAsn: 1.743 ± 0.485
PhePro: 1.743 ± 0.528
PheGln: 1.961 ± 0.62
PheArg: 1.961 ± 0.69
PheSer: 2.397 ± 0.88
PheThr: 4.14 ± 1.005
PheVal: 3.705 ± 0.827
PheTrp: 0.218 ± 0.237
PheTyr: 1.09 ± 0.424
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.269 ± 1.046
GlyCys: 0.436 ± 0.222
GlyAsp: 3.922 ± 1.331
GlyGlu: 3.269 ± 0.873
GlyPhe: 3.269 ± 0.977
GlyGly: 9.588 ± 2.661
GlyHis: 1.525 ± 0.63
GlyIle: 4.794 ± 1.107
GlyLys: 6.102 ± 0.78
GlyLeu: 6.973 ± 1.065
GlyMet: 2.179 ± 0.608
GlyAsn: 3.922 ± 0.624
GlyPro: 0.436 ± 0.3
GlyGln: 3.269 ± 0.72
GlyArg: 2.615 ± 0.739
GlySer: 3.269 ± 0.628
GlyThr: 4.358 ± 1.035
GlyVal: 5.448 ± 1.063
GlyTrp: 1.09 ± 0.346
GlyTyr: 3.487 ± 1.037
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.872 ± 0.585
HisCys: 0.0 ± 0.0
HisAsp: 1.525 ± 0.667
HisGlu: 0.218 ± 0.172
HisPhe: 1.525 ± 0.448
HisGly: 0.654 ± 0.332
HisHis: 0.436 ± 0.281
HisIle: 1.961 ± 0.521
HisLys: 0.654 ± 0.363
HisLeu: 1.307 ± 0.571
HisMet: 0.436 ± 0.226
HisAsn: 0.872 ± 0.359
HisPro: 0.436 ± 0.282
HisGln: 0.436 ± 0.285
HisArg: 0.872 ± 0.521
HisSer: 0.654 ± 0.323
HisThr: 0.436 ± 0.268
HisVal: 1.961 ± 0.59
HisTrp: 0.218 ± 0.172
HisTyr: 1.09 ± 0.371
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.269 ± 1.046
IleCys: 0.654 ± 0.367
IleAsp: 3.922 ± 1.034
IleGlu: 3.051 ± 0.961
IlePhe: 2.833 ± 0.814
IleGly: 3.487 ± 0.896
IleHis: 1.09 ± 0.305
IleIle: 3.269 ± 1.248
IleLys: 5.666 ± 1.084
IleLeu: 3.051 ± 0.649
IleMet: 1.09 ± 0.562
IleAsn: 3.487 ± 0.755
IlePro: 3.487 ± 0.9
IleGln: 3.487 ± 0.818
IleArg: 1.307 ± 0.408
IleSer: 2.833 ± 0.845
IleThr: 3.269 ± 1.308
IleVal: 6.537 ± 1.3
IleTrp: 0.654 ± 0.451
IleTyr: 1.743 ± 0.519
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.576 ± 0.925
LysCys: 0.436 ± 0.222
LysAsp: 6.973 ± 1.595
LysGlu: 8.499 ± 1.5
LysPhe: 3.051 ± 0.718
LysGly: 5.23 ± 0.715
LysHis: 0.872 ± 0.544
LysIle: 2.833 ± 0.674
LysLys: 10.024 ± 2.566
LysLeu: 5.448 ± 1.092
LysMet: 3.922 ± 1.086
LysAsn: 4.794 ± 1.004
LysPro: 2.397 ± 0.914
LysGln: 4.358 ± 0.889
LysArg: 3.922 ± 0.98
LysSer: 6.537 ± 1.423
LysThr: 4.358 ± 1.344
LysVal: 4.794 ± 0.988
LysTrp: 0.872 ± 0.402
LysTyr: 1.961 ± 0.76
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.012 ± 0.971
LeuCys: 1.09 ± 0.432
LeuAsp: 3.051 ± 0.791
LeuGlu: 6.973 ± 1.62
LeuPhe: 5.012 ± 1.027
LeuGly: 5.012 ± 1.007
LeuHis: 1.743 ± 0.632
LeuIle: 2.833 ± 1.002
LeuLys: 5.012 ± 0.807
LeuLeu: 7.845 ± 1.578
LeuMet: 2.397 ± 0.93
LeuAsn: 5.448 ± 1.595
LeuPro: 5.666 ± 0.991
LeuGln: 4.14 ± 1.118
LeuArg: 3.487 ± 0.707
LeuSer: 5.666 ± 1.045
LeuThr: 5.012 ± 0.89
LeuVal: 3.487 ± 0.86
LeuTrp: 0.654 ± 0.384
LeuTyr: 3.269 ± 0.65
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.743 ± 0.666
MetCys: 0.436 ± 0.268
MetAsp: 1.307 ± 0.42
MetGlu: 2.833 ± 1.089
MetPhe: 1.743 ± 0.725
MetGly: 2.615 ± 0.746
MetHis: 0.0 ± 0.0
MetIle: 2.833 ± 0.697
MetLys: 3.487 ± 1.013
MetLeu: 3.269 ± 1.099
MetMet: 1.09 ± 0.438
MetAsn: 0.872 ± 0.365
MetPro: 0.654 ± 0.43
MetGln: 0.654 ± 0.35
MetArg: 0.654 ± 0.316
MetSer: 1.743 ± 0.534
MetThr: 3.269 ± 1.149
MetVal: 1.743 ± 0.612
MetTrp: 0.872 ± 0.423
MetTyr: 1.09 ± 0.512
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.705 ± 0.804
AsnCys: 0.218 ± 0.245
AsnAsp: 3.922 ± 0.998
AsnGlu: 3.051 ± 0.91
AsnPhe: 1.307 ± 0.612
AsnGly: 6.319 ± 1.364
AsnHis: 0.654 ± 0.327
AsnIle: 3.487 ± 0.958
AsnLys: 3.705 ± 0.776
AsnLeu: 4.14 ± 0.707
AsnMet: 1.961 ± 0.475
AsnAsn: 3.487 ± 0.718
AsnPro: 1.961 ± 0.547
AsnGln: 2.397 ± 0.869
AsnArg: 1.09 ± 0.428
AsnSer: 3.487 ± 0.549
AsnThr: 1.743 ± 0.561
AsnVal: 2.833 ± 0.78
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.525 ± 0.628
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.269 ± 0.936
ProCys: 0.0 ± 0.0
ProAsp: 1.961 ± 0.737
ProGlu: 1.743 ± 0.487
ProPhe: 1.961 ± 0.662
ProGly: 1.307 ± 0.376
ProHis: 0.872 ± 0.688
ProIle: 1.743 ± 0.553
ProLys: 3.487 ± 1.096
ProLeu: 2.615 ± 0.873
ProMet: 0.654 ± 0.425
ProAsn: 3.051 ± 0.787
ProPro: 1.743 ± 0.703
ProGln: 0.654 ± 0.369
ProArg: 0.436 ± 0.254
ProSer: 3.705 ± 1.149
ProThr: 1.961 ± 0.499
ProVal: 3.487 ± 0.754
ProTrp: 0.218 ± 0.245
ProTyr: 1.961 ± 0.601
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.705 ± 1.299
GlnCys: 0.218 ± 0.185
GlnAsp: 0.654 ± 0.4
GlnGlu: 3.269 ± 1.013
GlnPhe: 1.09 ± 0.37
GlnGly: 2.833 ± 0.908
GlnHis: 0.872 ± 0.44
GlnIle: 2.179 ± 0.712
GlnLys: 1.961 ± 0.547
GlnLeu: 3.705 ± 1.029
GlnMet: 1.961 ± 0.639
GlnAsn: 2.833 ± 1.014
GlnPro: 1.307 ± 0.406
GlnGln: 2.397 ± 1.129
GlnArg: 2.833 ± 0.766
GlnSer: 2.179 ± 0.742
GlnThr: 2.833 ± 0.826
GlnVal: 2.833 ± 0.661
GlnTrp: 0.654 ± 0.469
GlnTyr: 2.179 ± 0.803
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.525 ± 0.339
ArgCys: 0.218 ± 0.186
ArgAsp: 3.051 ± 0.788
ArgGlu: 3.269 ± 0.879
ArgPhe: 0.872 ± 0.352
ArgGly: 3.922 ± 0.841
ArgHis: 0.0 ± 0.0
ArgIle: 2.397 ± 0.938
ArgLys: 4.14 ± 1.005
ArgLeu: 3.922 ± 1.154
ArgMet: 0.654 ± 0.387
ArgAsn: 1.525 ± 0.767
ArgPro: 2.179 ± 0.776
ArgGln: 1.525 ± 0.505
ArgArg: 1.525 ± 0.487
ArgSer: 2.397 ± 0.833
ArgThr: 1.743 ± 0.626
ArgVal: 3.922 ± 1.104
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.743 ± 0.604
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.615 ± 0.894
SerCys: 0.0 ± 0.0
SerAsp: 1.961 ± 0.688
SerGlu: 3.487 ± 0.654
SerPhe: 2.615 ± 0.656
SerGly: 6.755 ± 1.568
SerHis: 1.307 ± 0.485
SerIle: 5.012 ± 1.059
SerLys: 6.537 ± 0.899
SerLeu: 5.012 ± 1.163
SerMet: 1.961 ± 0.735
SerAsn: 2.615 ± 0.483
SerPro: 2.833 ± 0.813
SerGln: 1.525 ± 0.594
SerArg: 2.615 ± 0.77
SerSer: 3.269 ± 1.025
SerThr: 3.487 ± 1.05
SerVal: 4.14 ± 0.786
SerTrp: 0.436 ± 0.284
SerTyr: 3.051 ± 0.687
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.922 ± 0.941
ThrCys: 0.436 ± 0.319
ThrAsp: 4.794 ± 1.163
ThrGlu: 3.269 ± 0.917
ThrPhe: 2.615 ± 0.556
ThrGly: 4.576 ± 1.278
ThrHis: 0.872 ± 0.276
ThrIle: 2.397 ± 0.635
ThrLys: 5.884 ± 1.403
ThrLeu: 7.845 ± 1.641
ThrMet: 0.436 ± 0.334
ThrAsn: 4.14 ± 0.691
ThrPro: 2.179 ± 0.681
ThrGln: 1.743 ± 0.643
ThrArg: 2.615 ± 0.542
ThrSer: 3.705 ± 0.971
ThrThr: 3.051 ± 0.67
ThrVal: 4.576 ± 1.102
ThrTrp: 0.218 ± 0.207
ThrTyr: 2.179 ± 0.661
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.487 ± 1.16
ValCys: 0.654 ± 0.325
ValAsp: 5.012 ± 0.987
ValGlu: 5.012 ± 1.25
ValPhe: 3.487 ± 1.013
ValGly: 4.576 ± 0.993
ValHis: 0.872 ± 0.495
ValIle: 3.051 ± 0.764
ValLys: 5.012 ± 1.16
ValLeu: 6.537 ± 1.23
ValMet: 3.051 ± 0.748
ValAsn: 1.743 ± 0.579
ValPro: 2.833 ± 0.801
ValGln: 1.743 ± 0.6
ValArg: 2.615 ± 0.574
ValSer: 5.448 ± 1.218
ValThr: 6.102 ± 1.197
ValVal: 4.576 ± 0.958
ValTrp: 1.307 ± 0.528
ValTyr: 2.615 ± 0.755
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.307 ± 0.459
TrpCys: 0.218 ± 0.172
TrpAsp: 0.872 ± 0.641
TrpGlu: 0.218 ± 0.172
TrpPhe: 0.436 ± 0.343
TrpGly: 0.218 ± 0.172
TrpHis: 0.0 ± 0.0
TrpIle: 0.436 ± 0.29
TrpLys: 0.872 ± 0.397
TrpLeu: 1.09 ± 0.439
TrpMet: 0.436 ± 0.293
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.218 ± 0.208
TrpArg: 0.654 ± 0.422
TrpSer: 0.872 ± 0.377
TrpThr: 0.436 ± 0.241
TrpVal: 0.654 ± 0.369
TrpTrp: 0.436 ± 0.32
TrpTyr: 0.436 ± 0.313
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.397 ± 0.853
TyrCys: 0.218 ± 0.172
TyrAsp: 3.051 ± 0.945
TyrGlu: 2.179 ± 0.727
TyrPhe: 2.179 ± 0.596
TyrGly: 1.961 ± 0.854
TyrHis: 1.307 ± 0.442
TyrIle: 3.487 ± 1.104
TyrLys: 4.358 ± 1.399
TyrLeu: 2.833 ± 0.543
TyrMet: 1.09 ± 0.591
TyrAsn: 1.525 ± 0.554
TyrPro: 1.525 ± 0.469
TyrGln: 1.09 ± 0.389
TyrArg: 1.743 ± 0.626
TyrSer: 0.872 ± 0.347
TyrThr: 1.961 ± 0.647
TyrVal: 1.307 ± 0.492
TyrTrp: 0.218 ± 0.172
TyrTyr: 1.743 ± 0.456
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 31 proteins (4590 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
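A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported. The function names, the one-letter sequence input, the normalization by total dipeptide count, and the use of the standard deviation as the reported error are all assumptions of this example; the page only states that bootstrapping (x100) was done at the protein level.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, pair):
    """Per-mille frequency of one dipeptide across a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumes a non-empty list of proteins and at least two replicates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences (hypothetical, not taken from the AP50 proteome).
toy_proteome = ["MKKLA", "AKKGG", "MGGKK", "MLLAA"]
print(bootstrap_error(toy_proteome, "KK"))
```

Resampling proteins rather than individual residues keeps each protein's internal sequence intact, so a dipeptide concentrated in a single protein yields a correspondingly large error.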

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski