Amino acid dipeptide frequency for Eubenangee virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
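A minimal sketch of how such frequencies could be computed is shown below. The function name `dipeptide_frequencies`, the one-letter input sequences, and the choice to normalize by the total number of dipeptides are assumptions made for illustration; the page does not state the exact normalization used by Proteome-pI.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Hypothetical helper: pairs are overlapping and adjacent, and a pair
    never spans two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalize by the total number of dipeptides observed.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not the Eubenangee virus proteome.
freqs = dipeptide_frequencies(["MKKL", "AKK"])
print(freqs)
```

The toy input contains five dipeptides in total (MK, KK, KL, AK, KK), so the resulting values sum to 1000‰.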

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
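The comparison against the uniform expectation can be sketched as follows. The `classify` helper and the simple greater-than/less-than rule are assumptions for illustration; the page does not specify the exact threshold used for the red and blue marking.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids

# Uniform expectation: 1000 per mille spread over 20 * 20 = 400 dipeptides.
EXPECTED_PER_MILLE = 1000.0 / len(AMINO_ACIDS) ** 2

def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if freq_per_mille > expected:
        return "over"    # would be marked red
    if freq_per_mille < expected:
        return "under"   # would be marked blue
    return "expected"

# Two values taken from the table: AlaAla and AlaCys.
print(EXPECTED_PER_MILLE, classify(6.965), classify(0.324))
```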

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error), listed as FirstSecond and grouped by first residue.
Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.965 ± 1.803
AlaCys: 0.324 ± 0.238
AlaAsp: 3.887 ± 0.725
AlaGlu: 4.859 ± 0.994
AlaPhe: 2.753 ± 0.591
AlaGly: 3.239 ± 0.882
AlaHis: 1.62 ± 0.33
AlaIle: 3.887 ± 1.074
AlaLys: 3.401 ± 0.995
AlaLeu: 6.641 ± 1.158
AlaMet: 3.563 ± 0.887
AlaAsn: 3.401 ± 0.825
AlaPro: 1.62 ± 0.498
AlaGln: 2.268 ± 0.665
AlaArg: 4.373 ± 0.862
AlaSer: 3.077 ± 0.619
AlaThr: 3.401 ± 0.535
AlaVal: 6.317 ± 1.155
AlaTrp: 1.458 ± 0.427
AlaTyr: 3.077 ± 0.91
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.486 ± 0.275
CysCys: 0.162 ± 0.172
CysAsp: 0.81 ± 0.383
CysGlu: 0.81 ± 0.371
CysPhe: 0.648 ± 0.354
CysGly: 0.972 ± 0.442
CysHis: 0.162 ± 0.142
CysIle: 0.486 ± 0.314
CysLys: 0.324 ± 0.169
CysLeu: 0.972 ± 0.287
CysMet: 0.324 ± 0.18
CysAsn: 0.486 ± 0.241
CysPro: 0.486 ± 0.325
CysGln: 0.162 ± 0.172
CysArg: 0.972 ± 0.358
CysSer: 0.81 ± 0.321
CysThr: 0.486 ± 0.297
CysVal: 0.81 ± 0.434
CysTrp: 0.162 ± 0.142
CysTyr: 0.486 ± 0.226
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.077 ± 0.766
AspCys: 0.486 ± 0.199
AspAsp: 4.373 ± 0.821
AspGlu: 5.345 ± 1.063
AspPhe: 2.915 ± 0.591
AspGly: 3.887 ± 0.526
AspHis: 1.782 ± 0.879
AspIle: 4.373 ± 0.743
AspLys: 1.944 ± 0.678
AspLeu: 4.859 ± 0.801
AspMet: 1.134 ± 0.34
AspAsn: 1.782 ± 0.692
AspPro: 2.592 ± 0.692
AspGln: 1.782 ± 0.753
AspArg: 4.697 ± 0.783
AspSer: 3.887 ± 0.94
AspThr: 2.106 ± 0.505
AspVal: 6.803 ± 0.825
AspTrp: 0.648 ± 0.383
AspTyr: 1.944 ± 0.374
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.373 ± 1.003
GluCys: 0.162 ± 0.142
GluAsp: 5.183 ± 0.996
GluGlu: 8.26 ± 1.473
GluPhe: 2.106 ± 0.494
GluGly: 4.859 ± 0.878
GluHis: 0.81 ± 0.382
GluIle: 4.859 ± 0.835
GluLys: 5.183 ± 1.041
GluLeu: 7.451 ± 1.129
GluMet: 3.077 ± 0.487
GluAsn: 2.43 ± 0.521
GluPro: 2.43 ± 0.704
GluGln: 1.62 ± 0.591
GluArg: 6.641 ± 1.226
GluSer: 3.239 ± 0.606
GluThr: 4.697 ± 0.53
GluVal: 5.993 ± 0.852
GluTrp: 1.134 ± 0.435
GluTyr: 2.753 ± 0.743
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.62 ± 0.385
PheCys: 0.486 ± 0.306
PheAsp: 4.535 ± 0.728
PheGlu: 2.106 ± 0.31
PhePhe: 1.134 ± 0.422
PheGly: 3.725 ± 0.391
PheHis: 0.972 ± 0.28
PheIle: 2.915 ± 0.618
PheLys: 1.134 ± 0.433
PheLeu: 3.887 ± 0.738
PheMet: 1.134 ± 0.322
PheAsn: 0.81 ± 0.321
PhePro: 1.458 ± 0.392
PheGln: 0.972 ± 0.411
PheArg: 2.915 ± 0.9
PheSer: 2.915 ± 0.612
PheThr: 2.43 ± 0.838
PheVal: 1.782 ± 0.517
PheTrp: 0.486 ± 0.211
PheTyr: 1.782 ± 0.481
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.507 ± 1.284
GlyCys: 0.81 ± 0.322
GlyAsp: 3.401 ± 1.033
GlyGlu: 2.915 ± 0.799
GlyPhe: 2.753 ± 0.611
GlyGly: 4.373 ± 1.321
GlyHis: 1.296 ± 0.387
GlyIle: 3.725 ± 0.788
GlyLys: 5.021 ± 1.003
GlyLeu: 4.535 ± 0.694
GlyMet: 1.296 ± 0.472
GlyAsn: 1.62 ± 0.398
GlyPro: 2.106 ± 0.663
GlyGln: 2.43 ± 0.553
GlyArg: 4.049 ± 0.607
GlySer: 2.915 ± 0.53
GlyThr: 3.239 ± 0.684
GlyVal: 4.049 ± 0.772
GlyTrp: 1.296 ± 0.402
GlyTyr: 2.592 ± 0.647
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.782 ± 0.547
HisCys: 0.486 ± 0.234
HisAsp: 0.648 ± 0.249
HisGlu: 2.106 ± 0.524
HisPhe: 0.486 ± 0.29
HisGly: 1.62 ± 0.364
HisHis: 0.972 ± 0.292
HisIle: 1.296 ± 0.268
HisLys: 1.458 ± 0.664
HisLeu: 2.43 ± 0.531
HisMet: 1.296 ± 0.456
HisAsn: 0.648 ± 0.283
HisPro: 0.972 ± 0.481
HisGln: 0.486 ± 0.256
HisArg: 0.972 ± 0.33
HisSer: 0.648 ± 0.279
HisThr: 0.972 ± 0.476
HisVal: 1.62 ± 0.633
HisTrp: 0.324 ± 0.204
HisTyr: 0.486 ± 0.283
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.211 ± 0.493
IleCys: 0.81 ± 0.342
IleAsp: 3.725 ± 1.017
IleGlu: 5.021 ± 0.545
IlePhe: 2.43 ± 0.626
IleGly: 3.239 ± 0.482
IleHis: 1.134 ± 0.434
IleIle: 2.915 ± 0.637
IleLys: 3.887 ± 0.753
IleLeu: 6.155 ± 0.846
IleMet: 2.753 ± 0.468
IleAsn: 3.401 ± 0.967
IlePro: 2.268 ± 0.743
IleGln: 3.239 ± 0.959
IleArg: 4.697 ± 1.103
IleSer: 3.239 ± 0.717
IleThr: 3.563 ± 0.709
IleVal: 3.887 ± 0.632
IleTrp: 1.134 ± 0.411
IleTyr: 2.43 ± 0.508
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.563 ± 0.624
LysCys: 0.324 ± 0.199
LysAsp: 3.401 ± 0.915
LysGlu: 4.535 ± 1.451
LysPhe: 1.944 ± 0.732
LysGly: 2.915 ± 0.808
LysHis: 0.648 ± 0.32
LysIle: 4.211 ± 0.622
LysLys: 4.859 ± 1.209
LysLeu: 4.535 ± 0.851
LysMet: 1.62 ± 0.564
LysAsn: 2.592 ± 0.571
LysPro: 0.972 ± 0.444
LysGln: 2.43 ± 0.692
LysArg: 5.831 ± 0.886
LysSer: 3.563 ± 0.876
LysThr: 3.563 ± 0.484
LysVal: 3.887 ± 0.86
LysTrp: 0.648 ± 0.38
LysTyr: 2.43 ± 0.649
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.098 ± 0.664
LeuCys: 0.81 ± 0.48
LeuAsp: 5.669 ± 0.903
LeuGlu: 5.507 ± 0.786
LeuPhe: 3.077 ± 0.473
LeuGly: 4.211 ± 1.065
LeuHis: 1.458 ± 0.336
LeuIle: 5.993 ± 1.132
LeuLys: 6.479 ± 0.771
LeuLeu: 5.021 ± 0.805
LeuMet: 2.268 ± 0.679
LeuAsn: 5.183 ± 1.272
LeuPro: 4.049 ± 0.77
LeuGln: 2.43 ± 0.931
LeuArg: 7.127 ± 0.896
LeuSer: 5.669 ± 0.596
LeuThr: 4.211 ± 0.693
LeuVal: 5.345 ± 1.01
LeuTrp: 1.296 ± 0.373
LeuTyr: 2.106 ± 0.814
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.915 ± 0.747
MetCys: 0.324 ± 0.232
MetAsp: 1.944 ± 0.497
MetGlu: 2.106 ± 0.542
MetPhe: 1.944 ± 0.31
MetGly: 1.458 ± 0.388
MetHis: 0.972 ± 0.24
MetIle: 2.106 ± 0.454
MetLys: 1.296 ± 0.34
MetLeu: 4.697 ± 0.702
MetMet: 1.782 ± 0.459
MetAsn: 1.458 ± 0.339
MetPro: 0.81 ± 0.465
MetGln: 1.782 ± 0.682
MetArg: 2.43 ± 0.616
MetSer: 3.077 ± 0.776
MetThr: 2.268 ± 0.53
MetVal: 2.43 ± 0.425
MetTrp: 0.162 ± 0.137
MetTyr: 0.81 ± 0.414
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.106 ± 0.931
AsnCys: 0.486 ± 0.517
AsnAsp: 2.753 ± 0.909
AsnGlu: 2.753 ± 0.448
AsnPhe: 1.944 ± 0.492
AsnGly: 2.753 ± 0.805
AsnHis: 0.324 ± 0.195
AsnIle: 3.887 ± 0.675
AsnLys: 2.106 ± 0.759
AsnLeu: 2.592 ± 0.695
AsnMet: 1.944 ± 0.725
AsnAsn: 0.972 ± 0.362
AsnPro: 2.592 ± 0.571
AsnGln: 1.944 ± 0.458
AsnArg: 2.592 ± 0.415
AsnSer: 1.458 ± 0.755
AsnThr: 2.106 ± 0.348
AsnVal: 3.887 ± 0.614
AsnTrp: 0.162 ± 0.173
AsnTyr: 0.972 ± 0.401
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.592 ± 0.809
ProCys: 0.162 ± 0.142
ProAsp: 2.592 ± 0.568
ProGlu: 2.592 ± 0.451
ProPhe: 1.134 ± 0.369
ProGly: 2.268 ± 0.744
ProHis: 0.972 ± 0.341
ProIle: 2.915 ± 0.377
ProLys: 2.268 ± 0.512
ProLeu: 2.753 ± 0.402
ProMet: 1.458 ± 0.331
ProAsn: 1.458 ± 0.457
ProPro: 2.268 ± 0.508
ProGln: 1.944 ± 0.591
ProArg: 2.268 ± 0.745
ProSer: 2.592 ± 0.522
ProThr: 2.753 ± 0.864
ProVal: 1.458 ± 0.474
ProTrp: 0.324 ± 0.211
ProTyr: 1.944 ± 0.419
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.43 ± 0.9
GlnCys: 0.324 ± 0.204
GlnAsp: 1.62 ± 0.464
GlnGlu: 2.592 ± 0.827
GlnPhe: 0.972 ± 0.399
GlnGly: 1.62 ± 0.475
GlnHis: 0.972 ± 0.314
GlnIle: 3.077 ± 0.761
GlnLys: 1.944 ± 0.562
GlnLeu: 3.725 ± 0.658
GlnMet: 0.972 ± 0.318
GlnAsn: 1.62 ± 0.561
GlnPro: 1.458 ± 0.444
GlnGln: 0.81 ± 0.474
GlnArg: 2.753 ± 0.747
GlnSer: 1.134 ± 0.277
GlnThr: 1.62 ± 0.463
GlnVal: 1.62 ± 0.371
GlnTrp: 0.486 ± 0.323
GlnTyr: 0.972 ± 0.513
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.669 ± 0.77
ArgCys: 1.296 ± 0.464
ArgAsp: 4.049 ± 0.887
ArgGlu: 7.127 ± 1.28
ArgPhe: 3.077 ± 0.608
ArgGly: 3.725 ± 0.751
ArgHis: 0.972 ± 0.332
ArgIle: 5.669 ± 1.392
ArgLys: 5.183 ± 0.954
ArgLeu: 5.507 ± 1.088
ArgMet: 2.592 ± 0.594
ArgAsn: 2.753 ± 0.652
ArgPro: 1.782 ± 0.375
ArgGln: 1.62 ± 0.636
ArgArg: 4.373 ± 0.751
ArgSer: 3.563 ± 0.444
ArgThr: 3.239 ± 0.54
ArgVal: 5.507 ± 0.893
ArgTrp: 1.134 ± 0.391
ArgTyr: 2.592 ± 0.517
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.887 ± 0.833
SerCys: 0.81 ± 0.364
SerAsp: 2.268 ± 0.288
SerGlu: 5.345 ± 0.753
SerPhe: 2.592 ± 0.296
SerGly: 3.239 ± 0.822
SerHis: 2.106 ± 0.435
SerIle: 3.077 ± 0.759
SerLys: 2.592 ± 0.885
SerLeu: 4.859 ± 0.793
SerMet: 2.43 ± 0.619
SerAsn: 1.458 ± 0.309
SerPro: 2.106 ± 0.475
SerGln: 1.62 ± 0.446
SerArg: 2.592 ± 0.784
SerSer: 3.887 ± 0.751
SerThr: 3.563 ± 0.908
SerVal: 4.373 ± 0.925
SerTrp: 1.782 ± 0.28
SerTyr: 2.43 ± 0.619
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.887 ± 0.872
ThrCys: 0.81 ± 0.254
ThrAsp: 2.753 ± 0.32
ThrGlu: 3.725 ± 0.809
ThrPhe: 1.782 ± 0.459
ThrGly: 3.887 ± 0.845
ThrHis: 1.458 ± 0.455
ThrIle: 3.725 ± 0.767
ThrLys: 2.753 ± 0.709
ThrLeu: 5.669 ± 1.416
ThrMet: 1.62 ± 0.301
ThrAsn: 2.106 ± 0.486
ThrPro: 2.753 ± 0.519
ThrGln: 1.62 ± 0.559
ThrArg: 3.563 ± 0.505
ThrSer: 2.753 ± 0.849
ThrThr: 2.753 ± 0.385
ThrVal: 3.887 ± 0.561
ThrTrp: 0.972 ± 0.222
ThrTyr: 0.972 ± 0.563
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.211 ± 0.717
ValCys: 0.81 ± 0.566
ValAsp: 4.859 ± 1.037
ValGlu: 5.669 ± 0.492
ValPhe: 3.239 ± 0.484
ValGly: 3.887 ± 0.845
ValHis: 1.62 ± 0.406
ValIle: 2.592 ± 0.591
ValLys: 4.373 ± 0.795
ValLeu: 6.803 ± 1.257
ValMet: 3.077 ± 0.767
ValAsn: 2.43 ± 0.413
ValPro: 4.049 ± 0.69
ValGln: 2.268 ± 0.627
ValArg: 5.831 ± 1.259
ValSer: 4.697 ± 0.839
ValThr: 4.211 ± 1.063
ValVal: 3.563 ± 0.733
ValTrp: 0.162 ± 0.193
ValTyr: 3.077 ± 0.418
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.324 ± 0.205
TrpCys: 0.162 ± 0.137
TrpAsp: 0.486 ± 0.222
TrpGlu: 1.134 ± 0.494
TrpPhe: 0.972 ± 0.306
TrpGly: 1.134 ± 0.31
TrpHis: 0.81 ± 0.443
TrpIle: 0.81 ± 0.296
TrpLys: 1.134 ± 0.275
TrpLeu: 1.296 ± 0.559
TrpMet: 0.648 ± 0.42
TrpAsn: 1.62 ± 0.595
TrpPro: 0.0 ± 0.0
TrpGln: 0.162 ± 0.142
TrpArg: 0.486 ± 0.215
TrpSer: 0.81 ± 0.326
TrpThr: 0.486 ± 0.228
TrpVal: 0.972 ± 0.269
TrpTrp: 0.162 ± 0.137
TrpTyr: 0.486 ± 0.306
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.43 ± 0.516
TyrCys: 0.972 ± 0.337
TyrAsp: 1.458 ± 0.516
TyrGlu: 2.753 ± 0.585
TyrPhe: 1.296 ± 0.358
TyrGly: 2.915 ± 0.65
TyrHis: 0.648 ± 0.378
TyrIle: 1.62 ± 0.481
TyrLys: 1.134 ± 0.374
TyrLeu: 1.62 ± 0.539
TyrMet: 1.62 ± 0.461
TyrAsn: 2.106 ± 0.438
TyrPro: 1.944 ± 0.387
TyrGln: 1.134 ± 0.474
TyrArg: 2.43 ± 0.463
TyrSer: 3.077 ± 0.895
TyrThr: 1.782 ± 0.382
TyrVal: 3.239 ± 0.647
TyrTrp: 0.162 ± 0.137
TyrTyr: 1.458 ± 0.579
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (6175 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
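One way such a protein-level bootstrap could work is sketched below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported as the error. The helper names, the fixed seed, and the use of the sample standard deviation are assumptions; the page does not describe the exact procedure beyond the number of replicates and the resampling level.

```python
import random
import statistics
from collections import Counter

def pair_frequency(proteins, pair):
    """Frequency of one dipeptide in per mille (hypothetical helper)."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)

# Toy sequences, not the Eubenangee virus proteome.
toy_proteins = ["MKKLA", "AKKG", "GGLA"]
kk_error = bootstrap_error(toy_proteins, "KK")
print(pair_frequency(toy_proteins, "KK"), kk_error)
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.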

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski