Amino acid dipeptide frequency for Wellfleet Bay virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred in proteins with equal probability, each would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
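The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used by the database: the function name `dipeptide_per_mille`, the one-letter residue codes, the toy sequences, and the choice to count pairs within each protein separately (never across protein boundaries) are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
UNIFORM_PER_MILLE = 1000.0 / 400      # 2.5 per mille, i.e. 0.25 %


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes 'X' (Xaa).
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        # Count overlapping adjacent pairs within this protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Hypothetical toy input, not the Wellfleet Bay virus proteome.
toy = ["MKKLAE", "AEEK"]
freqs = dipeptide_per_mille(toy)
print(len(freqs))                       # number of ordered pairs, Xaa included
print(freqs["AE"], UNIFORM_PER_MILLE)   # observed vs. uniform expectation
```

In the toy input there are 8 adjacent pairs in total and "AE" occurs twice, so its frequency is far above the uniform expectation of 2.5‰; with a real proteome the same comparison decides whether a dipeptide counts as over- or underrepresented.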

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
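As a small worked example of the unit conversion and of the comparison against the uniform expectation, the sketch below converts two values taken from the table (GluGlu and AlaCys) and labels them. The helper names `per_mille_to_fraction` and `classify` are hypothetical, and the simple greater-than/less-than rule is an assumption; the page does not say whether the colouring also takes the error estimates into account.

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # uniform expectation: 2.5 per mille (0.25 %)


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if value_per_mille > expected:
        return "over"      # would be marked red
    if value_per_mille < expected:
        return "under"     # would be marked blue
    return "expected"


glu_glu = 9.825  # GluGlu, per mille, from the table
ala_cys = 0.797  # AlaCys, per mille, from the table

print(per_mille_to_fraction(glu_glu))        # fraction of all dipeptides
print(classify(glu_glu), classify(ala_cys))  # over under
```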

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.249 ± 1.229
AlaCys: 0.797 ± 0.349
AlaAsp: 1.859 ± 0.808
AlaGlu: 3.983 ± 0.518
AlaPhe: 2.921 ± 0.871
AlaGly: 4.249 ± 0.603
AlaHis: 1.859 ± 0.881
AlaIle: 2.655 ± 0.649
AlaLys: 5.045 ± 1.21
AlaLeu: 5.842 ± 0.845
AlaMet: 1.328 ± 0.328
AlaAsn: 2.124 ± 0.574
AlaPro: 1.328 ± 0.464
AlaGln: 4.249 ± 1.771
AlaArg: 3.717 ± 1.105
AlaSer: 3.983 ± 0.603
AlaThr: 4.249 ± 0.679
AlaVal: 2.655 ± 0.448
AlaTrp: 0.531 ± 0.299
AlaTyr: 2.921 ± 0.541
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.39 ± 0.732
CysCys: 0.531 ± 0.524
CysAsp: 1.062 ± 0.369
CysGlu: 0.531 ± 0.439
CysPhe: 1.859 ± 0.696
CysGly: 0.266 ± 0.237
CysHis: 0.266 ± 0.246
CysIle: 0.531 ± 0.293
CysLys: 1.062 ± 0.404
CysLeu: 1.062 ± 0.486
CysMet: 0.266 ± 0.23
CysAsn: 0.797 ± 0.483
CysPro: 1.062 ± 0.502
CysGln: 0.266 ± 0.237
CysArg: 1.328 ± 0.651
CysSer: 2.124 ± 0.408
CysThr: 0.531 ± 0.293
CysVal: 1.062 ± 0.722
CysTrp: 0.266 ± 0.237
CysTyr: 0.266 ± 0.237
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.39 ± 0.451
AspCys: 1.328 ± 0.294
AspAsp: 3.186 ± 0.998
AspGlu: 5.311 ± 1.081
AspPhe: 2.921 ± 0.685
AspGly: 2.921 ± 1.006
AspHis: 0.531 ± 0.299
AspIle: 1.859 ± 0.572
AspLys: 3.452 ± 1.155
AspLeu: 4.514 ± 1.352
AspMet: 2.39 ± 0.733
AspAsn: 2.39 ± 1.065
AspPro: 3.983 ± 0.861
AspGln: 1.328 ± 0.741
AspArg: 3.717 ± 1.44
AspSer: 3.717 ± 1.163
AspThr: 1.859 ± 1.064
AspVal: 2.39 ± 0.897
AspTrp: 1.328 ± 0.426
AspTyr: 1.062 ± 0.439
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.717 ± 0.839
GluCys: 1.593 ± 0.364
GluAsp: 5.576 ± 0.686
GluGlu: 9.825 ± 2.39
GluPhe: 3.717 ± 0.712
GluGly: 5.045 ± 1.505
GluHis: 1.062 ± 0.632
GluIle: 5.311 ± 1.011
GluLys: 3.983 ± 0.539
GluLeu: 7.435 ± 0.626
GluMet: 4.78 ± 1.088
GluAsn: 3.452 ± 0.71
GluPro: 2.39 ± 0.807
GluGln: 1.593 ± 0.401
GluArg: 3.717 ± 1.005
GluSer: 5.576 ± 1.267
GluThr: 4.514 ± 0.767
GluVal: 4.249 ± 0.642
GluTrp: 0.531 ± 0.299
GluTyr: 2.921 ± 1.024
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.531 ± 0.278
PheCys: 0.531 ± 0.275
PheAsp: 1.593 ± 0.53
PheGlu: 2.921 ± 0.603
PhePhe: 0.531 ± 0.278
PheGly: 1.593 ± 0.917
PheHis: 2.655 ± 0.692
PheIle: 1.859 ± 0.507
PheLys: 2.39 ± 0.74
PheLeu: 3.186 ± 0.968
PheMet: 0.797 ± 0.349
PheAsn: 2.124 ± 0.988
PhePro: 1.328 ± 0.402
PheGln: 0.797 ± 0.305
PheArg: 3.717 ± 0.762
PheSer: 5.576 ± 0.886
PheThr: 2.39 ± 0.857
PheVal: 2.655 ± 0.777
PheTrp: 0.531 ± 0.278
PheTyr: 1.859 ± 0.396
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.983 ± 1.117
GlyCys: 0.531 ± 0.293
GlyAsp: 3.452 ± 0.746
GlyGlu: 4.78 ± 1.28
GlyPhe: 2.39 ± 0.499
GlyGly: 4.514 ± 1.234
GlyHis: 0.797 ± 0.28
GlyIle: 4.514 ± 0.862
GlyLys: 3.717 ± 0.596
GlyLeu: 4.514 ± 1.619
GlyMet: 1.859 ± 0.553
GlyAsn: 2.39 ± 0.943
GlyPro: 3.452 ± 0.598
GlyGln: 1.328 ± 0.427
GlyArg: 2.655 ± 0.649
GlySer: 2.655 ± 0.565
GlyThr: 3.452 ± 1.001
GlyVal: 4.249 ± 1.19
GlyTrp: 2.124 ± 0.648
GlyTyr: 2.124 ± 0.585
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.797 ± 0.305
HisCys: 0.531 ± 0.237
HisAsp: 2.124 ± 0.683
HisGlu: 0.0 ± 0.0
HisPhe: 0.531 ± 0.439
HisGly: 1.328 ± 0.427
HisHis: 0.797 ± 0.485
HisIle: 1.328 ± 0.179
HisLys: 1.062 ± 0.404
HisLeu: 3.717 ± 0.746
HisMet: 0.0 ± 0.0
HisAsn: 1.328 ± 0.427
HisPro: 1.328 ± 0.499
HisGln: 0.266 ± 0.262
HisArg: 1.062 ± 0.617
HisSer: 1.859 ± 0.508
HisThr: 1.062 ± 0.347
HisVal: 1.328 ± 0.436
HisTrp: 0.266 ± 0.219
HisTyr: 1.062 ± 0.701
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.39 ± 0.642
IleCys: 1.062 ± 0.318
IleAsp: 4.249 ± 0.675
IleGlu: 3.717 ± 0.609
IlePhe: 1.062 ± 0.436
IleGly: 4.514 ± 0.882
IleHis: 1.328 ± 0.643
IleIle: 4.249 ± 0.883
IleLys: 3.186 ± 1.257
IleLeu: 5.045 ± 0.715
IleMet: 1.859 ± 1.003
IleAsn: 4.249 ± 1.083
IlePro: 3.186 ± 0.879
IleGln: 2.39 ± 0.373
IleArg: 2.39 ± 0.665
IleSer: 4.78 ± 0.549
IleThr: 4.78 ± 1.353
IleVal: 2.655 ± 0.826
IleTrp: 0.266 ± 0.246
IleTyr: 2.124 ± 0.534
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.452 ± 0.46
LysCys: 1.859 ± 0.842
LysAsp: 3.186 ± 0.493
LysGlu: 7.7 ± 0.615
LysPhe: 1.328 ± 0.582
LysGly: 3.452 ± 0.69
LysHis: 1.593 ± 0.761
LysIle: 5.576 ± 0.736
LysLys: 5.842 ± 1.214
LysLeu: 5.311 ± 1.175
LysMet: 2.124 ± 0.785
LysAsn: 1.062 ± 0.412
LysPro: 1.593 ± 0.633
LysGln: 2.655 ± 0.908
LysArg: 7.169 ± 1.321
LysSer: 3.717 ± 0.843
LysThr: 4.514 ± 0.683
LysVal: 2.655 ± 0.712
LysTrp: 2.921 ± 0.735
LysTyr: 2.39 ± 0.555
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.7 ± 0.916
LeuCys: 1.062 ± 0.487
LeuAsp: 4.514 ± 1.23
LeuGlu: 7.966 ± 1.518
LeuPhe: 2.921 ± 0.685
LeuGly: 4.249 ± 0.797
LeuHis: 0.797 ± 0.408
LeuIle: 4.514 ± 0.918
LeuLys: 5.311 ± 1.109
LeuLeu: 9.294 ± 1.201
LeuMet: 4.514 ± 0.752
LeuAsn: 4.514 ± 0.973
LeuPro: 2.124 ± 0.891
LeuGln: 2.39 ± 0.859
LeuArg: 6.638 ± 1.38
LeuSer: 5.576 ± 0.938
LeuThr: 5.576 ± 0.764
LeuVal: 4.249 ± 0.436
LeuTrp: 1.859 ± 0.319
LeuTyr: 2.921 ± 0.867
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.717 ± 1.141
MetCys: 0.531 ± 0.308
MetAsp: 1.328 ± 0.76
MetGlu: 1.062 ± 0.347
MetPhe: 0.797 ± 0.391
MetGly: 2.655 ± 0.417
MetHis: 1.593 ± 0.782
MetIle: 1.859 ± 0.6
MetLys: 1.859 ± 0.861
MetLeu: 2.921 ± 0.451
MetMet: 1.859 ± 0.785
MetAsn: 2.39 ± 0.694
MetPro: 1.062 ± 0.494
MetGln: 1.062 ± 0.511
MetArg: 1.859 ± 0.601
MetSer: 2.39 ± 0.794
MetThr: 1.062 ± 0.701
MetVal: 2.39 ± 1.119
MetTrp: 0.0 ± 0.0
MetTyr: 0.797 ± 0.471
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.921 ± 0.709
AsnCys: 1.062 ± 0.466
AsnAsp: 1.593 ± 0.501
AsnGlu: 3.186 ± 0.717
AsnPhe: 3.186 ± 0.625
AsnGly: 2.655 ± 0.69
AsnHis: 0.531 ± 0.278
AsnIle: 1.859 ± 0.372
AsnLys: 3.186 ± 0.793
AsnLeu: 5.311 ± 0.957
AsnMet: 0.266 ± 0.332
AsnAsn: 1.859 ± 0.509
AsnPro: 2.39 ± 1.139
AsnGln: 3.186 ± 1.212
AsnArg: 3.186 ± 0.873
AsnSer: 5.311 ± 0.788
AsnThr: 2.39 ± 0.552
AsnVal: 1.859 ± 0.6
AsnTrp: 1.062 ± 0.475
AsnTyr: 1.062 ± 0.429
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.124 ± 0.665
ProCys: 1.062 ± 0.466
ProAsp: 2.124 ± 0.582
ProGlu: 4.249 ± 1.283
ProPhe: 2.124 ± 0.685
ProGly: 2.39 ± 1.035
ProHis: 1.062 ± 0.318
ProIle: 3.186 ± 0.391
ProLys: 2.39 ± 0.869
ProLeu: 2.655 ± 0.505
ProMet: 0.531 ± 0.448
ProAsn: 2.124 ± 0.541
ProPro: 3.186 ± 1.132
ProGln: 1.062 ± 0.222
ProArg: 2.655 ± 0.791
ProSer: 1.328 ± 0.452
ProThr: 1.859 ± 0.525
ProVal: 2.124 ± 0.791
ProTrp: 0.531 ± 0.275
ProTyr: 2.124 ± 0.616
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.186 ± 0.945
GlnCys: 0.0 ± 0.0
GlnAsp: 2.124 ± 1.011
GlnGlu: 3.186 ± 1.033
GlnPhe: 1.328 ± 0.42
GlnGly: 1.328 ± 0.535
GlnHis: 0.266 ± 0.226
GlnIle: 2.921 ± 0.819
GlnLys: 1.593 ± 0.279
GlnLeu: 1.859 ± 0.687
GlnMet: 0.797 ± 0.485
GlnAsn: 1.062 ± 0.568
GlnPro: 1.859 ± 0.637
GlnGln: 0.266 ± 0.226
GlnArg: 1.859 ± 0.364
GlnSer: 1.328 ± 0.328
GlnThr: 1.328 ± 0.674
GlnVal: 3.452 ± 0.576
GlnTrp: 0.797 ± 0.224
GlnTyr: 0.797 ± 0.302
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.983 ± 1.232
ArgCys: 0.797 ± 0.363
ArgAsp: 2.39 ± 0.494
ArgGlu: 4.78 ± 1.491
ArgPhe: 1.859 ± 0.594
ArgGly: 4.249 ± 0.938
ArgHis: 2.124 ± 0.474
ArgIle: 5.311 ± 1.174
ArgLys: 4.514 ± 1.074
ArgLeu: 4.514 ± 0.69
ArgMet: 2.655 ± 0.536
ArgAsn: 2.39 ± 0.403
ArgPro: 2.39 ± 0.668
ArgGln: 1.859 ± 0.546
ArgArg: 5.045 ± 0.783
ArgSer: 3.452 ± 1.501
ArgThr: 2.39 ± 0.865
ArgVal: 5.842 ± 0.959
ArgTrp: 0.266 ± 0.219
ArgTyr: 2.124 ± 0.759
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.514 ± 2.251
SerCys: 0.797 ± 0.349
SerAsp: 3.186 ± 1.09
SerGlu: 4.514 ± 1.522
SerPhe: 3.186 ± 1.011
SerGly: 2.655 ± 0.641
SerHis: 0.531 ± 0.278
SerIle: 3.452 ± 1.243
SerLys: 5.576 ± 1.04
SerLeu: 7.435 ± 0.905
SerMet: 1.859 ± 0.491
SerAsn: 4.249 ± 0.986
SerPro: 1.859 ± 0.732
SerGln: 2.655 ± 0.874
SerArg: 3.717 ± 0.863
SerSer: 4.514 ± 0.951
SerThr: 3.452 ± 0.578
SerVal: 7.169 ± 1.502
SerTrp: 1.062 ± 0.585
SerTyr: 1.859 ± 0.352
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.39 ± 0.706
ThrCys: 1.328 ± 0.531
ThrAsp: 3.186 ± 1.043
ThrGlu: 5.311 ± 1.02
ThrPhe: 2.921 ± 1.007
ThrGly: 4.514 ± 0.847
ThrHis: 1.593 ± 0.605
ThrIle: 2.655 ± 0.67
ThrLys: 4.514 ± 1.217
ThrLeu: 3.717 ± 1.374
ThrMet: 1.593 ± 0.913
ThrAsn: 1.593 ± 0.471
ThrPro: 0.797 ± 0.408
ThrGln: 1.859 ± 0.289
ThrArg: 1.859 ± 0.497
ThrSer: 3.983 ± 1.389
ThrThr: 1.859 ± 0.394
ThrVal: 2.921 ± 0.504
ThrTrp: 1.062 ± 0.466
ThrTyr: 2.124 ± 0.695
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.514 ± 1.054
ValCys: 1.062 ± 0.333
ValAsp: 2.655 ± 0.595
ValGlu: 4.514 ± 1.02
ValPhe: 1.859 ± 0.851
ValGly: 3.452 ± 0.893
ValHis: 1.859 ± 0.735
ValIle: 3.186 ± 1.263
ValLys: 6.904 ± 0.789
ValLeu: 4.78 ± 0.871
ValMet: 1.859 ± 0.636
ValAsn: 4.514 ± 0.264
ValPro: 3.452 ± 1.153
ValGln: 1.062 ± 0.492
ValArg: 3.983 ± 0.636
ValSer: 2.39 ± 0.859
ValThr: 2.39 ± 0.729
ValVal: 2.921 ± 0.734
ValTrp: 1.328 ± 0.477
ValTyr: 1.859 ± 0.289
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.531 ± 0.322
TrpCys: 0.266 ± 0.246
TrpAsp: 1.593 ± 0.741
TrpGlu: 1.062 ± 0.457
TrpPhe: 1.062 ± 0.216
TrpGly: 0.797 ± 0.46
TrpHis: 0.266 ± 0.23
TrpIle: 0.531 ± 0.237
TrpLys: 1.859 ± 0.734
TrpLeu: 1.859 ± 0.631
TrpMet: 0.531 ± 0.304
TrpAsn: 1.593 ± 0.401
TrpPro: 0.266 ± 0.219
TrpGln: 0.266 ± 0.219
TrpArg: 1.062 ± 0.436
TrpSer: 1.859 ± 0.56
TrpThr: 0.266 ± 0.219
TrpVal: 1.859 ± 0.552
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.531 ± 0.308
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.062 ± 0.69
TyrCys: 0.797 ± 0.457
TyrAsp: 1.593 ± 0.39
TyrGlu: 2.39 ± 0.655
TyrPhe: 1.062 ± 0.492
TyrGly: 2.655 ± 0.717
TyrHis: 0.266 ± 0.219
TyrIle: 2.124 ± 0.826
TyrLys: 2.655 ± 0.519
TyrLeu: 3.717 ± 0.995
TyrMet: 1.062 ± 0.467
TyrAsn: 2.124 ± 0.564
TyrPro: 1.859 ± 0.647
TyrGln: 0.797 ± 0.389
TyrArg: 1.593 ± 0.469
TyrSer: 2.124 ± 0.737
TyrThr: 1.859 ± 0.431
TyrVal: 1.859 ± 0.289
TyrTrp: 1.328 ± 0.499
TyrTyr: 1.328 ± 0.534
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (3767 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
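One way such a protein-level bootstrap could look is sketched below. The page only states that bootstrapping was done 100 times at the protein level, so everything else here is an assumption: the function names, the use of the population standard deviation as the error, the fixed random seed, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide, in per mille, over all adjacent pairs."""
    pairs = Counter()
    for seq in proteins:
        pairs.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(pairs.values())
    return 1000.0 * pairs[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Spread of the frequency when whole proteins are resampled."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    # Assumption: report the standard deviation across replicates as the error.
    return statistics.pstdev(estimates)


# Hypothetical toy proteome, one-letter codes.
toy = ["MKKLAE", "AEEK", "GGGG"]
print(dipeptide_per_mille(toy, "AE"), bootstrap_error(toy, "AE"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins happen to be in the proteome.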

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski