Amino acid dipeptide frequency for Hybrid snakehead virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
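The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: it assumes one-letter sequence codes, overlapping dipeptides counted within each protein (never across protein boundaries), and any non-standard letter mapped to X (Xaa). The function name `dipeptide_per_mille` and the toy sequences are hypothetical.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumption: input uses
# one-letter codes; the table itself uses three-letter names).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Expected frequency if all 400 standard dipeptides were equally likely.
UNIFORM_EXPECTATION = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all sequences."""
    counts = Counter()
    for seq in sequences:
        # Overlapping pairs inside one protein; pairs never span two proteins.
        for first, second in zip(seq, seq[1:]):
            first = first if first in STANDARD else "X"
            second = second if second in STANDARD else "X"
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Toy input: 4 + 3 = 7 dipeptides in total, "LL" occurs twice.
toy_freqs = dipeptide_per_mille(["MKLLA", "ALLK"])
over = sorted(dp for dp, f in toy_freqs.items() if f > UNIFORM_EXPECTATION)
```

With real data, comparing each value against `UNIFORM_EXPECTATION` gives the same over/under split that the red and blue marking conveys, under the equal-likelihood assumption stated above.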

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.275 ± 1.668
AlaCys: 0.855 ± 0.898
AlaAsp: 3.99 ± 1.669
AlaGlu: 5.13 ± 3.67
AlaPhe: 2.85 ± 0.926
AlaGly: 2.28 ± 1.359
AlaHis: 1.425 ± 0.589
AlaIle: 3.705 ± 0.891
AlaLys: 3.705 ± 0.544
AlaLeu: 7.125 ± 1.926
AlaMet: 0.855 ± 0.374
AlaAsn: 2.28 ± 0.446
AlaPro: 3.705 ± 3.709
AlaGln: 2.565 ± 0.48
AlaArg: 1.71 ± 0.624
AlaSer: 7.694 ± 1.991
AlaThr: 4.845 ± 1.484
AlaVal: 3.135 ± 1.018
AlaTrp: 1.14 ± 0.379
AlaTyr: 1.71 ± 0.589
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.71 ± 0.722
CysCys: 0.285 ± 0.163
CysAsp: 0.285 ± 0.415
CysGlu: 0.855 ± 0.374
CysPhe: 0.57 ± 0.325
CysGly: 1.14 ± 0.663
CysHis: 0.57 ± 0.726
CysIle: 0.855 ± 0.317
CysLys: 0.855 ± 0.734
CysLeu: 1.71 ± 0.624
CysMet: 0.0 ± 0.0
CysAsn: 1.995 ± 1.73
CysPro: 0.57 ± 0.83
CysGln: 0.57 ± 0.332
CysArg: 0.285 ± 0.163
CysSer: 0.57 ± 0.325
CysThr: 1.425 ± 0.553
CysVal: 0.285 ± 0.163
CysTrp: 0.57 ± 0.325
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.42 ± 1.639
AspCys: 0.855 ± 0.374
AspAsp: 3.99 ± 1.354
AspGlu: 3.42 ± 0.286
AspPhe: 3.99 ± 0.893
AspGly: 2.28 ± 1.056
AspHis: 1.14 ± 0.379
AspIle: 3.135 ± 0.817
AspLys: 2.28 ± 1.027
AspLeu: 5.985 ± 0.304
AspMet: 1.71 ± 0.748
AspAsn: 3.42 ± 0.547
AspPro: 3.42 ± 1.072
AspGln: 1.71 ± 0.748
AspArg: 0.855 ± 0.488
AspSer: 2.28 ± 0.562
AspThr: 1.425 ± 0.49
AspVal: 2.85 ± 1.022
AspTrp: 1.71 ± 0.607
AspTyr: 2.85 ± 1.291
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.705 ± 1.563
GluCys: 0.855 ± 0.953
GluAsp: 3.135 ± 1.06
GluGlu: 4.845 ± 1.865
GluPhe: 1.14 ± 0.466
GluGly: 2.85 ± 1.199
GluHis: 1.425 ± 0.49
GluIle: 7.41 ± 1.403
GluLys: 5.985 ± 1.504
GluLeu: 6.27 ± 1.318
GluMet: 1.995 ± 0.287
GluAsn: 2.565 ± 1.182
GluPro: 0.855 ± 0.488
GluGln: 0.855 ± 1.352
GluArg: 2.85 ± 0.981
GluSer: 5.7 ± 1.043
GluThr: 3.705 ± 0.774
GluVal: 4.845 ± 0.576
GluTrp: 1.425 ± 0.668
GluTyr: 3.135 ± 0.796
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.71 ± 0.748
PheCys: 0.285 ± 0.415
PheAsp: 2.28 ± 0.626
PheGlu: 2.85 ± 0.731
PhePhe: 1.995 ± 0.495
PheGly: 2.28 ± 0.446
PheHis: 1.71 ± 0.532
PheIle: 1.71 ± 0.724
PheLys: 3.705 ± 1.032
PheLeu: 5.985 ± 1.705
PheMet: 1.14 ± 0.541
PheAsn: 1.14 ± 0.919
PhePro: 1.71 ± 0.976
PheGln: 2.28 ± 0.785
PheArg: 2.565 ± 0.48
PheSer: 1.995 ± 1.138
PheThr: 1.14 ± 1.076
PheVal: 1.995 ± 0.68
PheTrp: 0.285 ± 0.163
PheTyr: 0.57 ± 0.34
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.135 ± 0.862
GlyCys: 0.285 ± 0.163
GlyAsp: 3.99 ± 1.203
GlyGlu: 4.275 ± 1.15
GlyPhe: 3.135 ± 0.473
GlyGly: 3.135 ± 0.86
GlyHis: 0.855 ± 0.496
GlyIle: 3.135 ± 1.018
GlyLys: 3.42 ± 0.779
GlyLeu: 8.264 ± 2.056
GlyMet: 1.425 ± 0.655
GlyAsn: 2.565 ± 0.978
GlyPro: 2.28 ± 1.268
GlyGln: 2.85 ± 1.572
GlyArg: 3.135 ± 0.668
GlySer: 3.42 ± 0.828
GlyThr: 4.275 ± 1.417
GlyVal: 2.565 ± 1.36
GlyTrp: 1.71 ± 0.386
GlyTyr: 0.57 ± 0.387
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.855 ± 0.317
HisCys: 0.285 ± 0.163
HisAsp: 0.855 ± 0.734
HisGlu: 1.14 ± 0.651
HisPhe: 1.425 ± 0.813
HisGly: 1.71 ± 1.229
HisHis: 0.57 ± 0.325
HisIle: 0.855 ± 0.317
HisLys: 1.425 ± 0.49
HisLeu: 1.995 ± 0.547
HisMet: 0.57 ± 0.794
HisAsn: 0.855 ± 0.614
HisPro: 2.85 ± 0.897
HisGln: 1.14 ± 0.466
HisArg: 1.425 ± 0.813
HisSer: 1.995 ± 0.552
HisThr: 1.425 ± 0.589
HisVal: 1.71 ± 0.728
HisTrp: 0.855 ± 0.374
HisTyr: 1.425 ± 0.485
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.28 ± 0.645
IleCys: 0.855 ± 0.317
IleAsp: 2.565 ± 0.785
IleGlu: 3.99 ± 0.893
IlePhe: 2.565 ± 1.535
IleGly: 5.13 ± 0.946
IleHis: 1.995 ± 0.467
IleIle: 3.99 ± 0.87
IleLys: 3.705 ± 0.974
IleLeu: 5.13 ± 1.785
IleMet: 1.71 ± 0.728
IleAsn: 2.565 ± 0.48
IlePro: 2.565 ± 0.591
IleGln: 3.135 ± 1.027
IleArg: 2.565 ± 0.775
IleSer: 5.7 ± 1.301
IleThr: 3.42 ± 0.699
IleVal: 2.565 ± 0.48
IleTrp: 0.855 ± 0.431
IleTyr: 2.28 ± 0.726
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.85 ± 0.923
LysCys: 1.425 ± 0.49
LysAsp: 3.135 ± 0.556
LysGlu: 5.7 ± 1.069
LysPhe: 0.855 ± 0.488
LysGly: 3.99 ± 1.538
LysHis: 1.71 ± 0.748
LysIle: 3.42 ± 1.032
LysLys: 5.13 ± 0.815
LysLeu: 7.41 ± 1.284
LysMet: 3.705 ± 1.043
LysAsn: 2.85 ± 0.791
LysPro: 3.135 ± 2.522
LysGln: 1.71 ± 0.728
LysArg: 3.705 ± 0.846
LysSer: 3.705 ± 1.382
LysThr: 5.13 ± 1.269
LysVal: 3.705 ± 1.602
LysTrp: 1.71 ± 0.493
LysTyr: 0.855 ± 0.386
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.119 ± 1.842
LeuCys: 0.855 ± 0.374
LeuAsp: 5.13 ± 1.146
LeuGlu: 7.979 ± 1.491
LeuPhe: 3.99 ± 0.953
LeuGly: 6.84 ± 1.389
LeuHis: 1.425 ± 0.49
LeuIle: 6.555 ± 2.229
LeuLys: 7.41 ± 0.927
LeuLeu: 7.979 ± 1.761
LeuMet: 3.42 ± 0.66
LeuAsn: 6.27 ± 0.906
LeuPro: 2.85 ± 1.372
LeuGln: 3.42 ± 0.909
LeuArg: 6.555 ± 1.638
LeuSer: 8.264 ± 1.86
LeuThr: 7.125 ± 0.716
LeuVal: 5.13 ± 0.6
LeuTrp: 0.57 ± 0.387
LeuTyr: 3.42 ± 0.646
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.565 ± 1.31
MetCys: 1.14 ± 1.145
MetAsp: 2.565 ± 0.48
MetGlu: 1.425 ± 1.147
MetPhe: 1.14 ± 0.502
MetGly: 1.425 ± 0.739
MetHis: 0.57 ± 0.332
MetIle: 1.995 ± 0.267
MetLys: 1.995 ± 0.591
MetLeu: 2.28 ± 0.785
MetMet: 1.995 ± 0.575
MetAsn: 0.855 ± 0.488
MetPro: 1.425 ± 0.872
MetGln: 0.855 ± 0.488
MetArg: 1.425 ± 0.485
MetSer: 2.565 ± 0.718
MetThr: 2.565 ± 0.765
MetVal: 1.425 ± 1.462
MetTrp: 0.285 ± 0.163
MetTyr: 1.14 ± 0.441
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.135 ± 0.873
AsnCys: 0.285 ± 0.415
AsnAsp: 2.565 ± 0.643
AsnGlu: 1.71 ± 0.589
AsnPhe: 1.14 ± 0.449
AsnGly: 2.85 ± 1.722
AsnHis: 1.71 ± 0.624
AsnIle: 1.71 ± 0.722
AsnLys: 4.275 ± 0.891
AsnLeu: 6.555 ± 1.842
AsnMet: 1.14 ± 0.529
AsnAsn: 2.85 ± 0.502
AsnPro: 3.705 ± 1.259
AsnGln: 1.71 ± 0.388
AsnArg: 1.425 ± 0.49
AsnSer: 1.71 ± 0.624
AsnThr: 2.565 ± 0.807
AsnVal: 1.425 ± 0.589
AsnTrp: 1.14 ± 0.348
AsnTyr: 1.425 ± 0.449
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.99 ± 2.366
ProCys: 0.57 ± 0.387
ProAsp: 3.42 ± 1.17
ProGlu: 2.565 ± 0.565
ProPhe: 1.71 ± 0.624
ProGly: 2.28 ± 1.859
ProHis: 1.425 ± 0.49
ProIle: 1.71 ± 1.457
ProLys: 3.135 ± 0.918
ProLeu: 7.41 ± 1.25
ProMet: 1.71 ± 0.651
ProAsn: 1.71 ± 0.743
ProPro: 2.85 ± 2.043
ProGln: 0.855 ± 0.734
ProArg: 1.71 ± 0.743
ProSer: 5.415 ± 1.681
ProThr: 2.28 ± 0.728
ProVal: 1.71 ± 1.062
ProTrp: 0.285 ± 0.163
ProTyr: 3.135 ± 1.933
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.135 ± 1.739
GlnCys: 0.285 ± 0.163
GlnAsp: 1.14 ± 0.68
GlnGlu: 3.42 ± 0.808
GlnPhe: 1.14 ± 0.529
GlnGly: 3.42 ± 0.514
GlnHis: 0.855 ± 0.374
GlnIle: 1.995 ± 1.401
GlnLys: 1.71 ± 0.976
GlnLeu: 1.995 ± 0.963
GlnMet: 1.995 ± 0.946
GlnAsn: 0.855 ± 0.431
GlnPro: 0.855 ± 0.386
GlnGln: 2.28 ± 1.176
GlnArg: 1.995 ± 1.138
GlnSer: 2.85 ± 0.565
GlnThr: 2.28 ± 0.92
GlnVal: 2.28 ± 0.583
GlnTrp: 0.285 ± 0.163
GlnTyr: 1.995 ± 0.552
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.85 ± 0.827
ArgCys: 1.14 ± 0.379
ArgAsp: 2.28 ± 1.064
ArgGlu: 2.85 ± 1.231
ArgPhe: 3.42 ± 1.037
ArgGly: 2.565 ± 1.153
ArgHis: 1.425 ± 0.49
ArgIle: 1.14 ± 0.539
ArgLys: 2.565 ± 0.718
ArgLeu: 3.99 ± 1.203
ArgMet: 1.71 ± 1.013
ArgAsn: 1.995 ± 0.822
ArgPro: 2.85 ± 0.724
ArgGln: 1.995 ± 0.467
ArgArg: 2.85 ± 0.662
ArgSer: 3.135 ± 1.094
ArgThr: 0.855 ± 0.488
ArgVal: 5.415 ± 1.888
ArgTrp: 1.71 ± 0.386
ArgTyr: 1.14 ± 0.741
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.41 ± 1.611
SerCys: 1.425 ± 0.485
SerAsp: 2.85 ± 0.991
SerGlu: 4.845 ± 1.155
SerPhe: 2.565 ± 0.685
SerGly: 4.845 ± 0.917
SerHis: 2.28 ± 0.92
SerIle: 5.415 ± 1.214
SerLys: 4.275 ± 0.919
SerLeu: 6.27 ± 2.472
SerMet: 1.71 ± 0.493
SerAsn: 3.42 ± 0.699
SerPro: 4.275 ± 1.529
SerGln: 3.135 ± 0.837
SerArg: 3.705 ± 0.93
SerSer: 5.7 ± 1.21
SerThr: 4.56 ± 1.028
SerVal: 3.42 ± 0.646
SerTrp: 1.425 ± 0.637
SerTyr: 2.565 ± 2.086
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.85 ± 2.027
ThrCys: 1.425 ± 0.485
ThrAsp: 3.99 ± 0.893
ThrGlu: 3.42 ± 0.896
ThrPhe: 1.425 ± 0.331
ThrGly: 4.845 ± 0.7
ThrHis: 0.855 ± 0.488
ThrIle: 3.99 ± 0.998
ThrLys: 2.565 ± 0.641
ThrLeu: 5.7 ± 1.377
ThrMet: 0.855 ± 0.496
ThrAsn: 2.28 ± 0.753
ThrPro: 3.705 ± 2.35
ThrGln: 1.71 ± 0.681
ThrArg: 3.135 ± 1.073
ThrSer: 4.56 ± 0.744
ThrThr: 2.85 ± 0.839
ThrVal: 3.135 ± 1.249
ThrTrp: 1.995 ± 0.495
ThrTyr: 2.28 ± 0.552
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.28 ± 1.497
ValCys: 1.425 ± 0.485
ValAsp: 1.71 ± 0.681
ValGlu: 1.995 ± 0.575
ValPhe: 1.995 ± 1.086
ValGly: 1.425 ± 2.076
ValHis: 1.425 ± 1.045
ValIle: 4.56 ± 1.501
ValLys: 3.705 ± 2.147
ValLeu: 7.41 ± 0.936
ValMet: 1.425 ± 0.589
ValAsn: 1.995 ± 0.876
ValPro: 3.135 ± 0.699
ValGln: 1.71 ± 1.177
ValArg: 3.135 ± 1.146
ValSer: 5.415 ± 0.385
ValThr: 3.135 ± 0.837
ValVal: 2.565 ± 1.166
ValTrp: 0.57 ± 0.387
ValTyr: 1.425 ± 0.628
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.425 ± 0.364
TrpCys: 0.0 ± 0.0
TrpAsp: 1.425 ± 0.589
TrpGlu: 1.14 ± 0.651
TrpPhe: 0.855 ± 0.5
TrpGly: 1.425 ± 0.331
TrpHis: 1.425 ± 1.045
TrpIle: 0.855 ± 0.317
TrpLys: 1.71 ± 0.681
TrpLeu: 0.855 ± 0.5
TrpMet: 0.285 ± 0.451
TrpAsn: 0.855 ± 0.488
TrpPro: 0.855 ± 0.317
TrpGln: 0.285 ± 0.163
TrpArg: 0.285 ± 0.163
TrpSer: 1.425 ± 0.813
TrpThr: 1.14 ± 0.441
TrpVal: 1.71 ± 0.798
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.57 ± 0.83
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.28 ± 0.931
TyrCys: 0.57 ± 0.325
TyrAsp: 1.425 ± 0.553
TyrGlu: 2.28 ± 1.268
TyrPhe: 1.425 ± 0.49
TyrGly: 1.71 ± 0.388
TyrHis: 0.57 ± 0.332
TyrIle: 1.425 ± 0.364
TyrLys: 2.28 ± 0.292
TyrLeu: 3.99 ± 0.87
TyrMet: 1.995 ± 1.144
TyrAsn: 1.71 ± 0.624
TyrPro: 2.28 ± 0.292
TyrGln: 1.995 ± 0.938
TyrArg: 2.565 ± 1.126
TyrSer: 1.995 ± 0.355
TyrThr: 1.425 ± 2.002
TyrVal: 0.57 ± 0.332
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.57 ± 0.387
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3510 amino acids)
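As a worked example of the per mille scale, the sketch below converts a table value back to an approximate dipeptide count. It assumes that dipeptides are counted as overlapping pairs within each protein, so 5 proteins with 3510 amino acids give 3510 - 5 = 3505 dipeptides; this assumption is not stated by the source, but it is consistent with the smallest non-zero value in the table (0.285‰, about 1/3505). The helper names are hypothetical.

```python
N_PROTEINS = 5
N_AMINO_ACIDS = 3510

# Assumption: each protein of length L contributes L - 1 overlapping dipeptides.
N_DIPEPTIDES = N_AMINO_ACIDS - N_PROTEINS  # 3505


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def approximate_count(value_per_mille, n_dipeptides=N_DIPEPTIDES):
    """Estimate how many times a dipeptide occurs, given its per mille value."""
    return round(per_mille_to_fraction(value_per_mille) * n_dipeptides)


ala_ala = approximate_count(4.275)   # AlaAla
leu_ala = approximate_count(9.119)   # LeuAla, the highest value in the table
cys_cys = approximate_count(0.285)   # CysCys, a single occurrence
```

Under this assumption the table values map back to whole numbers: roughly 15 AlaAla, 32 LeuAla and 1 CysCys.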

Note: The error was estimated by bootstrapping (x100) at the protein level.
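A protein-level bootstrap can be sketched as follows. The source does not describe the exact procedure, so this is only one plausible reading: in each of 100 replicates, draw as many proteins as the proteome contains, with replacement, recompute the per mille frequency of the dipeptide, and report the standard deviation across replicates. The function names, the fixed seed and the use of the sample standard deviation are assumptions made for illustration.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation (in per mille) over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [dipeptide_counts(p) for p in proteins]
    totals = [max(len(p) - 1, 0) for p in proteins]
    values = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        picked = [rng.randrange(len(proteins)) for _ in proteins]
        total = sum(totals[i] for i in picked)
        if total == 0:
            continue  # replicate has no dipeptides at all
        hits = sum(per_protein[i][dipeptide] for i in picked)
        values.append(1000.0 * hits / total)
    return statistics.stdev(values) if len(values) > 1 else 0.0


toy_proteins = ["MKLLA", "ALLKLL", "GGSLL", "MAAAK", "KLLM"]
toy_error = bootstrap_error(toy_proteins, "LL")
```

Because whole proteins are resampled, a dipeptide that never occurs gets an error of exactly zero, which matches the `0.0 ± 0.0` entries in the table.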

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski