Amino acid dipepetide frequency for Hubei dimarhabdovirus virus 3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.686AlaAla: 6.686 ± 3.144
2.14AlaCys: 2.14 ± 0.532
3.744AlaAsp: 3.744 ± 1.041
3.209AlaGlu: 3.209 ± 0.612
2.14AlaPhe: 2.14 ± 0.481
1.872AlaGly: 1.872 ± 0.641
1.07AlaHis: 1.07 ± 0.423
2.14AlaIle: 2.14 ± 0.526
2.407AlaLys: 2.407 ± 0.292
6.954AlaLeu: 6.954 ± 1.348
1.07AlaMet: 1.07 ± 0.425
2.942AlaAsn: 2.942 ± 0.637
2.942AlaPro: 2.942 ± 1.775
2.942AlaGln: 2.942 ± 1.303
2.14AlaArg: 2.14 ± 0.921
3.744AlaSer: 3.744 ± 1.425
2.942AlaThr: 2.942 ± 1.714
4.012AlaVal: 4.012 ± 1.235
0.802AlaTrp: 0.802 ± 0.292
2.675AlaTyr: 2.675 ± 0.693
0.0AlaXaa: 0.0 ± 0.0
Cys
0.267CysAla: 0.267 ± 0.151
0.802CysCys: 0.802 ± 0.368
0.802CysAsp: 0.802 ± 0.439
0.535CysGlu: 0.535 ± 0.775
2.14CysPhe: 2.14 ± 0.532
0.802CysGly: 0.802 ± 0.368
0.267CysHis: 0.267 ± 0.388
0.535CysIle: 0.535 ± 0.314
0.802CysLys: 0.802 ± 0.453
1.872CysLeu: 1.872 ± 0.841
0.0CysMet: 0.0 ± 0.0
0.802CysAsn: 0.802 ± 0.453
0.802CysPro: 0.802 ± 0.689
1.337CysGln: 1.337 ± 0.428
0.267CysArg: 0.267 ± 0.151
3.477CysSer: 3.477 ± 1.625
0.802CysThr: 0.802 ± 0.304
1.07CysVal: 1.07 ± 0.778
0.267CysTrp: 0.267 ± 0.151
0.802CysTyr: 0.802 ± 0.453
0.0CysXaa: 0.0 ± 0.0
Asp
2.675AspAla: 2.675 ± 0.498
0.535AspCys: 0.535 ± 0.302
1.872AspAsp: 1.872 ± 0.493
3.477AspGlu: 3.477 ± 0.8
2.942AspPhe: 2.942 ± 0.664
2.942AspGly: 2.942 ± 1.067
3.477AspHis: 3.477 ± 0.191
2.942AspIle: 2.942 ± 1.052
1.605AspLys: 1.605 ± 0.65
6.419AspLeu: 6.419 ± 1.422
1.605AspMet: 1.605 ± 0.505
3.209AspAsn: 3.209 ± 1.09
3.744AspPro: 3.744 ± 0.653
2.407AspGln: 2.407 ± 0.372
2.14AspArg: 2.14 ± 0.636
3.477AspSer: 3.477 ± 0.637
1.872AspThr: 1.872 ± 0.777
1.872AspVal: 1.872 ± 1.691
0.535AspTrp: 0.535 ± 0.302
2.675AspTyr: 2.675 ± 0.504
0.0AspXaa: 0.0 ± 0.0
Glu
1.605GluAla: 1.605 ± 0.863
1.07GluCys: 1.07 ± 0.381
3.209GluAsp: 3.209 ± 1.258
5.616GluGlu: 5.616 ± 1.169
2.407GluPhe: 2.407 ± 0.826
1.872GluGly: 1.872 ± 0.443
1.605GluHis: 1.605 ± 0.707
5.616GluIle: 5.616 ± 1.438
3.209GluLys: 3.209 ± 0.984
6.419GluLeu: 6.419 ± 0.915
2.675GluMet: 2.675 ± 0.753
2.942GluAsn: 2.942 ± 0.608
2.407GluPro: 2.407 ± 0.664
1.337GluGln: 1.337 ± 0.755
4.279GluArg: 4.279 ± 0.862
6.686GluSer: 6.686 ± 1.356
2.407GluThr: 2.407 ± 1.222
4.279GluVal: 4.279 ± 1.041
1.07GluTrp: 1.07 ± 0.489
2.675GluTyr: 2.675 ± 0.799
0.0GluXaa: 0.0 ± 0.0
Phe
1.872PheAla: 1.872 ± 0.493
0.802PheCys: 0.802 ± 0.368
1.07PheAsp: 1.07 ± 0.604
2.675PheGlu: 2.675 ± 1.207
1.872PhePhe: 1.872 ± 0.57
2.407PheGly: 2.407 ± 0.691
1.337PheHis: 1.337 ± 0.755
4.814PheIle: 4.814 ± 0.483
2.407PheLys: 2.407 ± 1.57
2.675PheLeu: 2.675 ± 0.927
0.802PheMet: 0.802 ± 0.304
1.337PheAsn: 1.337 ± 0.939
3.209PhePro: 3.209 ± 0.842
1.872PheGln: 1.872 ± 0.57
2.407PheArg: 2.407 ± 0.875
2.14PheSer: 2.14 ± 0.603
3.477PheThr: 3.477 ± 1.135
4.279PheVal: 4.279 ± 0.865
0.535PheTrp: 0.535 ± 0.314
1.605PheTyr: 1.605 ± 0.349
0.0PheXaa: 0.0 ± 0.0
Gly
1.605GlyAla: 1.605 ± 0.341
1.337GlyCys: 1.337 ± 0.527
2.942GlyAsp: 2.942 ± 0.34
2.942GlyGlu: 2.942 ± 1.031
2.675GlyPhe: 2.675 ± 0.567
3.744GlyGly: 3.744 ± 1.306
2.407GlyHis: 2.407 ± 0.806
3.744GlyIle: 3.744 ± 1.149
2.675GlyLys: 2.675 ± 0.504
6.419GlyLeu: 6.419 ± 1.385
1.337GlyMet: 1.337 ± 0.361
1.337GlyAsn: 1.337 ± 0.361
1.07GlyPro: 1.07 ± 1.074
2.675GlyGln: 2.675 ± 0.498
1.872GlyArg: 1.872 ± 0.3
3.209GlySer: 3.209 ± 0.721
3.209GlyThr: 3.209 ± 0.536
2.407GlyVal: 2.407 ± 0.441
0.267GlyTrp: 0.267 ± 0.151
1.605GlyTyr: 1.605 ± 0.608
0.0GlyXaa: 0.0 ± 0.0
His
1.337HisAla: 1.337 ± 0.56
0.535HisCys: 0.535 ± 0.314
0.267HisAsp: 0.267 ± 0.388
1.872HisGlu: 1.872 ± 0.801
1.605HisPhe: 1.605 ± 0.707
1.337HisGly: 1.337 ± 0.527
0.535HisHis: 0.535 ± 0.314
1.872HisIle: 1.872 ± 0.801
1.337HisLys: 1.337 ± 0.283
4.547HisLeu: 4.547 ± 2.038
0.802HisMet: 0.802 ± 0.306
0.535HisAsn: 0.535 ± 0.342
3.209HisPro: 3.209 ± 0.505
0.535HisGln: 0.535 ± 0.314
1.872HisArg: 1.872 ± 0.653
1.07HisSer: 1.07 ± 0.477
2.14HisThr: 2.14 ± 0.682
2.14HisVal: 2.14 ± 0.691
0.535HisTrp: 0.535 ± 0.302
0.802HisTyr: 0.802 ± 0.292
0.0HisXaa: 0.0 ± 0.0
Ile
2.14IleAla: 2.14 ± 0.551
1.07IleCys: 1.07 ± 0.425
3.744IleAsp: 3.744 ± 0.52
3.477IleGlu: 3.477 ± 0.691
2.675IlePhe: 2.675 ± 0.863
4.012IleGly: 4.012 ± 1.285
2.407IleHis: 2.407 ± 0.706
3.744IleIle: 3.744 ± 0.503
3.744IleLys: 3.744 ± 0.373
5.349IleLeu: 5.349 ± 2.077
1.872IleMet: 1.872 ± 0.801
3.209IleAsn: 3.209 ± 1.167
3.209IlePro: 3.209 ± 0.792
2.14IleGln: 2.14 ± 0.528
5.082IleArg: 5.082 ± 0.615
4.547IleSer: 4.547 ± 0.607
2.14IleThr: 2.14 ± 0.551
2.675IleVal: 2.675 ± 0.455
1.872IleTrp: 1.872 ± 0.536
2.14IleTyr: 2.14 ± 0.694
0.0IleXaa: 0.0 ± 0.0
Lys
3.477LysAla: 3.477 ± 0.8
1.07LysCys: 1.07 ± 1.551
2.942LysAsp: 2.942 ± 1.024
4.012LysGlu: 4.012 ± 0.626
1.337LysPhe: 1.337 ± 0.841
2.407LysGly: 2.407 ± 1.404
0.802LysHis: 0.802 ± 0.453
2.14LysIle: 2.14 ± 0.905
2.942LysLys: 2.942 ± 1.225
4.012LysLeu: 4.012 ± 0.768
2.407LysMet: 2.407 ± 0.563
1.605LysAsn: 1.605 ± 0.991
1.872LysPro: 1.872 ± 0.493
2.407LysGln: 2.407 ± 1.018
2.942LysArg: 2.942 ± 0.476
4.547LysSer: 4.547 ± 1.322
3.209LysThr: 3.209 ± 0.849
3.744LysVal: 3.744 ± 0.505
1.337LysTrp: 1.337 ± 0.342
2.14LysTyr: 2.14 ± 0.97
0.0LysXaa: 0.0 ± 0.0
Leu
5.884LeuAla: 5.884 ± 1.132
0.802LeuCys: 0.802 ± 0.439
6.686LeuAsp: 6.686 ± 0.446
5.082LeuGlu: 5.082 ± 1.669
3.744LeuPhe: 3.744 ± 1.149
4.814LeuGly: 4.814 ± 0.908
3.209LeuHis: 3.209 ± 1.318
5.616LeuIle: 5.616 ± 0.965
5.082LeuLys: 5.082 ± 1.495
9.628LeuLeu: 9.628 ± 2.393
2.942LeuMet: 2.942 ± 0.799
4.814LeuAsn: 4.814 ± 0.29
2.942LeuPro: 2.942 ± 0.505
2.675LeuGln: 2.675 ± 0.952
7.221LeuArg: 7.221 ± 1.225
10.163LeuSer: 10.163 ± 1.265
7.489LeuThr: 7.489 ± 2.044
6.419LeuVal: 6.419 ± 1.435
0.267LeuTrp: 0.267 ± 0.388
3.477LeuTyr: 3.477 ± 1.812
0.0LeuXaa: 0.0 ± 0.0
Met
1.605MetAla: 1.605 ± 0.588
0.267MetCys: 0.267 ± 0.151
1.605MetAsp: 1.605 ± 0.905
1.337MetGlu: 1.337 ± 0.428
1.337MetPhe: 1.337 ± 0.527
1.337MetGly: 1.337 ± 0.283
0.267MetHis: 0.267 ± 0.283
2.407MetIle: 2.407 ± 0.552
1.07MetLys: 1.07 ± 0.754
1.872MetLeu: 1.872 ± 0.536
0.802MetMet: 0.802 ± 0.507
2.407MetAsn: 2.407 ± 0.646
2.14MetPro: 2.14 ± 0.845
0.535MetGln: 0.535 ± 0.302
1.605MetArg: 1.605 ± 0.625
3.209MetSer: 3.209 ± 0.705
1.872MetThr: 1.872 ± 0.433
0.802MetVal: 0.802 ± 0.682
1.07MetTrp: 1.07 ± 0.381
0.535MetTyr: 0.535 ± 0.245
0.0MetXaa: 0.0 ± 0.0
Asn
1.07AsnAla: 1.07 ± 0.289
0.802AsnCys: 0.802 ± 0.292
3.209AsnAsp: 3.209 ± 0.789
1.07AsnGlu: 1.07 ± 0.489
3.209AsnPhe: 3.209 ± 0.536
1.07AsnGly: 1.07 ± 0.628
2.675AsnHis: 2.675 ± 0.549
2.14AsnIle: 2.14 ± 0.551
1.07AsnLys: 1.07 ± 0.289
6.686AsnLeu: 6.686 ± 0.917
2.407AsnMet: 2.407 ± 0.908
2.942AsnAsn: 2.942 ± 0.816
3.209AsnPro: 3.209 ± 1.855
2.14AsnGln: 2.14 ± 0.94
3.477AsnArg: 3.477 ± 0.494
4.814AsnSer: 4.814 ± 0.69
2.14AsnThr: 2.14 ± 0.794
0.802AsnVal: 0.802 ± 0.453
0.535AsnTrp: 0.535 ± 0.302
2.407AsnTyr: 2.407 ± 1.006
0.0AsnXaa: 0.0 ± 0.0
Pro
5.082ProAla: 5.082 ± 2.651
0.267ProCys: 0.267 ± 0.151
3.744ProAsp: 3.744 ± 0.89
3.477ProGlu: 3.477 ± 1.162
1.872ProPhe: 1.872 ± 0.781
1.872ProGly: 1.872 ± 0.908
1.07ProHis: 1.07 ± 0.604
2.407ProIle: 2.407 ± 0.691
2.407ProLys: 2.407 ± 1.295
4.279ProLeu: 4.279 ± 0.757
0.535ProMet: 0.535 ± 0.786
1.337ProAsn: 1.337 ± 0.325
2.942ProPro: 2.942 ± 0.67
2.14ProGln: 2.14 ± 0.62
5.616ProArg: 5.616 ± 1.081
5.349ProSer: 5.349 ± 1.604
4.012ProThr: 4.012 ± 1.469
0.802ProVal: 0.802 ± 0.304
0.0ProTrp: 0.0 ± 0.0
1.605ProTyr: 1.605 ± 0.434
0.0ProXaa: 0.0 ± 0.0
Gln
2.14GlnAla: 2.14 ± 0.636
1.07GlnCys: 1.07 ± 0.628
1.337GlnAsp: 1.337 ± 0.654
2.407GlnGlu: 2.407 ± 0.292
1.872GlnPhe: 1.872 ± 1.009
2.407GlnGly: 2.407 ± 0.646
1.07GlnHis: 1.07 ± 0.641
2.14GlnIle: 2.14 ± 0.562
2.675GlnLys: 2.675 ± 1.305
2.675GlnLeu: 2.675 ± 0.687
1.07GlnMet: 1.07 ± 0.604
1.337GlnAsn: 1.337 ± 0.527
1.872GlnPro: 1.872 ± 0.989
0.535GlnGln: 0.535 ± 0.342
2.675GlnArg: 2.675 ± 0.367
2.675GlnSer: 2.675 ± 0.769
2.407GlnThr: 2.407 ± 0.372
2.14GlnVal: 2.14 ± 0.534
0.802GlnTrp: 0.802 ± 0.304
0.535GlnTyr: 0.535 ± 0.302
0.0GlnXaa: 0.0 ± 0.0
Arg
5.349ArgAla: 5.349 ± 1.021
0.267ArgCys: 0.267 ± 0.151
1.605ArgAsp: 1.605 ± 0.608
5.349ArgGlu: 5.349 ± 1.029
2.407ArgPhe: 2.407 ± 0.913
3.744ArgGly: 3.744 ± 0.486
1.605ArgHis: 1.605 ± 0.707
3.477ArgIle: 3.477 ± 0.8
3.744ArgLys: 3.744 ± 1.126
4.279ArgLeu: 4.279 ± 0.757
0.535ArgMet: 0.535 ± 0.314
3.209ArgAsn: 3.209 ± 0.745
2.675ArgPro: 2.675 ± 0.628
2.407ArgGln: 2.407 ± 0.818
2.942ArgArg: 2.942 ± 0.921
4.814ArgSer: 4.814 ± 0.825
2.675ArgThr: 2.675 ± 0.905
4.547ArgVal: 4.547 ± 1.468
0.802ArgTrp: 0.802 ± 0.469
3.209ArgTyr: 3.209 ± 0.891
0.0ArgXaa: 0.0 ± 0.0
Ser
6.419SerAla: 6.419 ± 2.399
2.14SerCys: 2.14 ± 1.787
5.082SerAsp: 5.082 ± 0.614
6.151SerGlu: 6.151 ± 1.359
3.477SerPhe: 3.477 ± 1.038
5.884SerGly: 5.884 ± 0.635
0.802SerHis: 0.802 ± 0.453
5.616SerIle: 5.616 ± 0.575
3.744SerLys: 3.744 ± 1.357
9.628SerLeu: 9.628 ± 1.498
1.872SerMet: 1.872 ± 0.618
4.012SerAsn: 4.012 ± 0.641
3.744SerPro: 3.744 ± 0.427
2.407SerGln: 2.407 ± 0.418
5.349SerArg: 5.349 ± 1.52
8.291SerSer: 8.291 ± 2.16
4.814SerThr: 4.814 ± 1.131
4.012SerVal: 4.012 ± 0.642
1.337SerTrp: 1.337 ± 0.485
3.477SerTyr: 3.477 ± 0.884
0.0SerXaa: 0.0 ± 0.0
Thr
4.814ThrAla: 4.814 ± 1.116
1.872ThrCys: 1.872 ± 1.081
2.407ThrAsp: 2.407 ± 0.511
4.279ThrGlu: 4.279 ± 1.593
1.872ThrPhe: 1.872 ± 0.678
1.872ThrGly: 1.872 ± 0.724
1.07ThrHis: 1.07 ± 0.363
4.279ThrIle: 4.279 ± 1.103
2.675ThrLys: 2.675 ± 0.791
4.547ThrLeu: 4.547 ± 0.896
1.07ThrMet: 1.07 ± 0.41
4.279ThrAsn: 4.279 ± 1.459
2.942ThrPro: 2.942 ± 1.948
1.872ThrGln: 1.872 ± 0.458
2.942ThrArg: 2.942 ± 0.646
5.884ThrSer: 5.884 ± 0.889
2.14ThrThr: 2.14 ± 0.551
4.012ThrVal: 4.012 ± 1.003
0.802ThrTrp: 0.802 ± 0.453
1.07ThrTyr: 1.07 ± 0.572
0.0ThrXaa: 0.0 ± 0.0
Val
2.675ValAla: 2.675 ± 0.967
0.0ValCys: 0.0 ± 0.0
2.407ValAsp: 2.407 ± 0.587
3.477ValGlu: 3.477 ± 1.205
1.605ValPhe: 1.605 ± 0.505
2.675ValGly: 2.675 ± 0.983
1.605ValHis: 1.605 ± 0.434
3.744ValIle: 3.744 ± 0.9
4.012ValLys: 4.012 ± 1.097
6.151ValLeu: 6.151 ± 1.693
2.14ValMet: 2.14 ± 0.455
4.279ValAsn: 4.279 ± 0.684
3.209ValPro: 3.209 ± 0.962
1.872ValGln: 1.872 ± 1.345
1.872ValArg: 1.872 ± 0.522
4.547ValSer: 4.547 ± 0.761
4.012ValThr: 4.012 ± 0.417
4.279ValVal: 4.279 ± 1.981
0.802ValTrp: 0.802 ± 0.364
1.337ValTyr: 1.337 ± 0.466
0.0ValXaa: 0.0 ± 0.0
Trp
0.535TrpAla: 0.535 ± 0.302
0.535TrpCys: 0.535 ± 0.775
1.605TrpAsp: 1.605 ± 0.365
0.535TrpGlu: 0.535 ± 0.314
0.802TrpPhe: 0.802 ± 0.453
0.802TrpGly: 0.802 ± 0.368
0.267TrpHis: 0.267 ± 0.151
0.535TrpIle: 0.535 ± 0.558
1.07TrpLys: 1.07 ± 0.604
2.14TrpLeu: 2.14 ± 1.207
0.267TrpMet: 0.267 ± 0.388
0.267TrpAsn: 0.267 ± 0.151
0.267TrpPro: 0.267 ± 0.151
0.267TrpGln: 0.267 ± 0.151
0.535TrpArg: 0.535 ± 0.566
1.07TrpSer: 1.07 ± 0.289
1.337TrpThr: 1.337 ± 0.619
0.535TrpVal: 0.535 ± 0.485
0.0TrpTrp: 0.0 ± 0.0
0.267TrpTyr: 0.267 ± 0.283
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.337TyrAla: 1.337 ± 0.622
1.07TyrCys: 1.07 ± 0.363
2.14TyrAsp: 2.14 ± 0.727
2.407TyrGlu: 2.407 ± 0.814
1.605TyrPhe: 1.605 ± 0.232
1.605TyrGly: 1.605 ± 0.905
1.337TyrHis: 1.337 ± 0.517
1.337TyrIle: 1.337 ± 0.955
2.675TyrLys: 2.675 ± 1.188
2.407TyrLeu: 2.407 ± 0.814
1.605TyrMet: 1.605 ± 0.659
1.07TyrAsn: 1.07 ± 0.395
2.407TyrPro: 2.407 ± 0.368
1.337TyrGln: 1.337 ± 0.522
2.407TyrArg: 2.407 ± 0.65
4.814TyrSer: 4.814 ± 1.102
1.605TyrThr: 1.605 ± 0.434
1.872TyrVal: 1.872 ± 0.591
0.0TyrTrp: 0.0 ± 0.0
1.07TyrTyr: 1.07 ± 0.489
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3740 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski