Amino acid dipeptide frequency for Mirabilis mosaic virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
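As a rough illustration of what such a calculation can look like, the sketch below counts overlapping adjacent pairs within each protein and reports them in per mille. The function name `dipeptide_permille`, the toy sequences, and the choice to normalise by the total number of dipeptides are assumptions made for this example; the exact normalisation used by the database is not stated on this page.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: dipeptides are overlapping adjacent pairs counted inside
    each protein, normalised by the total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        # Pairs never span the boundary between two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up one-letter sequences, not taken from the virus proteome.
toy_proteome = ["MKKLE", "AKKE"]
freqs = dipeptide_permille(toy_proteome)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The toy input uses one-letter codes for brevity, whereas the table below uses three-letter codes (KK corresponds to LysLys).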

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
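The arithmetic behind the expected value and the unit conversion can be sketched as follows. The helper names `classify` and `permille_to_fraction` are hypothetical, and the sketch assumes the 400-dipeptide baseline (0.25%) as the threshold; whether the page's colouring uses exactly this cut-off is an assumption.

```python
# 20 x 20 standard dipeptides: uniform expectation of 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation (assumed threshold)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Two values taken from the table: LysLys and AlaCys.
print(classify(14.687), permille_to_fraction(14.687))
print(classify(0.42), permille_to_fraction(0.42))
```

With this threshold, LysLys (14.687‰) is classified as overrepresented and AlaCys (0.42‰) as underrepresented.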

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.098 ± 1.015
AlaCys: 0.42 ± 0.339
AlaAsp: 2.518 ± 0.928
AlaGlu: 4.616 ± 0.923
AlaPhe: 2.098 ± 0.811
AlaGly: 1.679 ± 0.544
AlaHis: 1.679 ± 0.748
AlaIle: 1.679 ± 1.559
AlaLys: 4.616 ± 1.983
AlaLeu: 2.518 ± 0.86
AlaMet: 1.259 ± 0.642
AlaAsn: 4.616 ± 0.738
AlaPro: 2.518 ± 0.75
AlaGln: 2.098 ± 0.406
AlaArg: 1.259 ± 0.528
AlaSer: 4.616 ± 1.477
AlaThr: 1.679 ± 1.032
AlaVal: 1.259 ± 0.534
AlaTrp: 0.42 ± 0.359
AlaTyr: 0.839 ± 0.641
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.42 ± 0.359
CysAsp: 0.42 ± 0.359
CysGlu: 2.098 ± 0.79
CysPhe: 0.42 ± 0.34
CysGly: 0.42 ± 0.339
CysHis: 0.0 ± 0.0
CysIle: 0.839 ± 0.504
CysLys: 0.839 ± 0.376
CysLeu: 2.098 ± 0.85
CysMet: 0.0 ± 0.0
CysAsn: 1.259 ± 0.491
CysPro: 1.679 ± 0.901
CysGln: 0.839 ± 0.385
CysArg: 1.259 ± 0.621
CysSer: 0.839 ± 0.376
CysThr: 0.42 ± 0.557
CysVal: 0.0 ± 0.0
CysTrp: 0.42 ± 0.359
CysTyr: 0.42 ± 0.34
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.098 ± 0.612
AspCys: 1.679 ± 0.715
AspAsp: 2.937 ± 1.462
AspGlu: 5.036 ± 1.128
AspPhe: 2.098 ± 0.955
AspGly: 1.259 ± 0.671
AspHis: 4.196 ± 1.575
AspIle: 3.357 ± 0.622
AspLys: 5.875 ± 2.204
AspLeu: 3.357 ± 1.706
AspMet: 0.839 ± 0.367
AspAsn: 2.937 ± 0.967
AspPro: 0.839 ± 0.385
AspGln: 2.518 ± 0.994
AspArg: 3.357 ± 0.956
AspSer: 3.777 ± 1.079
AspThr: 2.518 ± 0.845
AspVal: 2.098 ± 0.798
AspTrp: 0.839 ± 0.679
AspTyr: 2.937 ± 0.844
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.616 ± 1.249
GluCys: 0.839 ± 0.679
GluAsp: 6.714 ± 1.436
GluGlu: 8.393 ± 3.104
GluPhe: 3.777 ± 1.509
GluGly: 2.937 ± 0.659
GluHis: 0.839 ± 0.758
GluIle: 9.232 ± 2.319
GluLys: 10.911 ± 1.706
GluLeu: 7.134 ± 1.294
GluMet: 2.518 ± 1.192
GluAsn: 4.616 ± 0.672
GluPro: 1.679 ± 1.152
GluGln: 3.777 ± 0.809
GluArg: 2.518 ± 1.053
GluSer: 2.937 ± 0.536
GluThr: 5.875 ± 2.086
GluVal: 2.937 ± 1.449
GluTrp: 0.0 ± 0.0
GluTyr: 0.839 ± 0.504
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.098 ± 0.855
PheCys: 2.098 ± 0.729
PheAsp: 1.679 ± 0.804
PheGlu: 2.518 ± 0.622
PhePhe: 1.679 ± 0.41
PheGly: 2.937 ± 1.06
PheHis: 0.839 ± 0.681
PheIle: 4.196 ± 0.619
PheLys: 3.777 ± 1.255
PheLeu: 2.518 ± 1.284
PheMet: 0.839 ± 0.419
PheAsn: 0.839 ± 0.417
PhePro: 2.518 ± 0.602
PheGln: 0.839 ± 0.376
PheArg: 1.259 ± 0.641
PheSer: 5.455 ± 1.621
PheThr: 2.937 ± 0.77
PheVal: 0.42 ± 0.359
PheTrp: 1.259 ± 1.021
PheTyr: 0.42 ± 0.339
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.098 ± 1.011
GlyCys: 0.42 ± 0.359
GlyAsp: 2.518 ± 0.491
GlyGlu: 2.937 ± 0.596
GlyPhe: 1.679 ± 1.361
GlyGly: 1.259 ± 0.641
GlyHis: 1.259 ± 0.621
GlyIle: 2.518 ± 0.947
GlyLys: 5.036 ± 0.659
GlyLeu: 4.616 ± 1.431
GlyMet: 0.839 ± 0.561
GlyAsn: 4.616 ± 1.409
GlyPro: 1.679 ± 1.357
GlyGln: 0.0 ± 0.0
GlyArg: 0.839 ± 0.419
GlySer: 3.357 ± 1.296
GlyThr: 4.196 ± 1.077
GlyVal: 2.098 ± 0.692
GlyTrp: 0.42 ± 0.34
GlyTyr: 1.679 ± 0.327
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.839 ± 0.758
HisCys: 0.839 ± 0.54
HisAsp: 0.839 ± 0.385
HisGlu: 0.839 ± 0.504
HisPhe: 2.098 ± 0.528
HisGly: 1.259 ± 0.846
HisHis: 0.0 ± 0.0
HisIle: 2.518 ± 1.24
HisLys: 0.42 ± 0.339
HisLeu: 0.839 ± 0.385
HisMet: 0.839 ± 0.681
HisAsn: 1.259 ± 0.8
HisPro: 1.679 ± 0.717
HisGln: 1.259 ± 0.641
HisArg: 0.42 ± 0.359
HisSer: 1.679 ± 0.654
HisThr: 1.259 ± 0.65
HisVal: 2.518 ± 0.649
HisTrp: 0.0 ± 0.0
HisTyr: 1.679 ± 0.752
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.357 ± 1.136
IleCys: 1.679 ± 0.982
IleAsp: 5.875 ± 1.406
IleGlu: 4.196 ± 1.706
IlePhe: 4.196 ± 1.274
IleGly: 2.098 ± 0.552
IleHis: 1.679 ± 0.838
IleIle: 6.295 ± 1.022
IleLys: 7.554 ± 0.996
IleLeu: 5.455 ± 1.094
IleMet: 0.839 ± 0.505
IleAsn: 6.714 ± 1.572
IlePro: 5.455 ± 1.303
IleGln: 2.518 ± 0.38
IleArg: 3.357 ± 1.28
IleSer: 6.714 ± 2.066
IleThr: 3.357 ± 1.462
IleVal: 3.357 ± 1.387
IleTrp: 0.42 ± 0.34
IleTyr: 4.196 ± 0.752
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.036 ± 0.984
LysCys: 0.42 ± 0.359
LysAsp: 6.295 ± 2.33
LysGlu: 8.393 ± 1.61
LysPhe: 4.196 ± 0.817
LysGly: 6.714 ± 1.921
LysHis: 0.42 ± 0.34
LysIle: 10.491 ± 1.475
LysLys: 14.687 ± 1.009
LysLeu: 7.554 ± 2.025
LysMet: 0.839 ± 0.505
LysAsn: 7.134 ± 2.421
LysPro: 5.455 ± 1.195
LysGln: 4.196 ± 0.916
LysArg: 4.616 ± 1.733
LysSer: 5.036 ± 0.951
LysThr: 5.036 ± 1.795
LysVal: 5.036 ± 1.96
LysTrp: 0.839 ± 0.376
LysTyr: 3.357 ± 1.513
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.455 ± 1.842
LeuCys: 1.679 ± 0.544
LeuAsp: 3.777 ± 1.126
LeuGlu: 5.875 ± 1.442
LeuPhe: 2.518 ± 0.701
LeuGly: 4.196 ± 1.367
LeuHis: 1.259 ± 0.322
LeuIle: 7.134 ± 1.419
LeuLys: 7.973 ± 1.813
LeuLeu: 8.393 ± 1.975
LeuMet: 1.259 ± 0.778
LeuAsn: 3.777 ± 0.649
LeuPro: 4.196 ± 1.427
LeuGln: 2.098 ± 0.785
LeuArg: 5.455 ± 0.789
LeuSer: 6.295 ± 2.125
LeuThr: 4.196 ± 1.018
LeuVal: 4.196 ± 1.08
LeuTrp: 1.259 ± 0.725
LeuTyr: 1.259 ± 0.322
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.42 ± 0.401
MetCys: 0.0 ± 0.0
MetAsp: 0.42 ± 0.401
MetGlu: 1.679 ± 0.835
MetPhe: 0.839 ± 0.504
MetGly: 0.839 ± 0.505
MetHis: 0.0 ± 0.0
MetIle: 1.259 ± 0.778
MetLys: 2.518 ± 0.827
MetLeu: 1.259 ± 0.725
MetMet: 1.259 ± 1.018
MetAsn: 1.259 ± 0.676
MetPro: 0.839 ± 0.417
MetGln: 0.839 ± 0.681
MetArg: 0.839 ± 0.681
MetSer: 2.937 ± 1.257
MetThr: 0.42 ± 0.359
MetVal: 1.259 ± 0.621
MetTrp: 0.42 ± 0.34
MetTyr: 0.42 ± 0.359
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.518 ± 0.684
AsnCys: 0.0 ± 0.0
AsnAsp: 2.937 ± 1.06
AsnGlu: 5.875 ± 1.508
AsnPhe: 2.518 ± 0.894
AsnGly: 1.679 ± 0.95
AsnHis: 1.259 ± 0.624
AsnIle: 4.616 ± 1.267
AsnLys: 5.036 ± 1.289
AsnLeu: 7.554 ± 1.268
AsnMet: 0.839 ± 0.419
AsnAsn: 3.357 ± 0.939
AsnPro: 4.616 ± 1.25
AsnGln: 3.777 ± 1.473
AsnArg: 1.679 ± 1.008
AsnSer: 3.777 ± 1.443
AsnThr: 1.679 ± 0.849
AsnVal: 2.518 ± 1.063
AsnTrp: 0.839 ± 0.641
AsnTyr: 1.259 ± 0.642
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.098 ± 0.612
ProCys: 0.0 ± 0.0
ProAsp: 2.937 ± 0.877
ProGlu: 5.455 ± 1.42
ProPhe: 2.518 ± 0.79
ProGly: 1.679 ± 0.515
ProHis: 0.839 ± 0.58
ProIle: 2.518 ± 0.825
ProLys: 3.357 ± 1.183
ProLeu: 2.937 ± 1.075
ProMet: 0.839 ± 0.655
ProAsn: 3.357 ± 1.001
ProPro: 0.42 ± 0.557
ProGln: 2.098 ± 0.612
ProArg: 2.518 ± 0.731
ProSer: 6.295 ± 2.103
ProThr: 2.937 ± 1.754
ProVal: 4.196 ± 0.804
ProTrp: 0.0 ± 0.0
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.679 ± 0.41
GlnCys: 0.839 ± 0.679
GlnAsp: 2.518 ± 0.491
GlnGlu: 6.295 ± 0.792
GlnPhe: 1.259 ± 0.528
GlnGly: 1.679 ± 0.515
GlnHis: 0.839 ± 0.385
GlnIle: 2.937 ± 0.73
GlnLys: 3.777 ± 1.372
GlnLeu: 3.357 ± 0.939
GlnMet: 1.259 ± 0.407
GlnAsn: 2.518 ± 0.995
GlnPro: 1.259 ± 0.651
GlnGln: 2.937 ± 0.775
GlnArg: 1.259 ± 0.65
GlnSer: 1.259 ± 0.825
GlnThr: 2.937 ± 0.964
GlnVal: 2.518 ± 1.171
GlnTrp: 0.839 ± 0.681
GlnTyr: 0.42 ± 0.339
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.839 ± 0.556
ArgCys: 0.42 ± 0.359
ArgAsp: 0.839 ± 0.417
ArgGlu: 2.518 ± 0.625
ArgPhe: 1.259 ± 0.407
ArgGly: 2.937 ± 1.59
ArgHis: 1.679 ± 0.591
ArgIle: 3.777 ± 0.649
ArgLys: 7.134 ± 1.773
ArgLeu: 2.518 ± 1.04
ArgMet: 1.259 ± 0.322
ArgAsn: 2.098 ± 0.876
ArgPro: 1.679 ± 0.73
ArgGln: 0.42 ± 0.401
ArgArg: 3.777 ± 1.098
ArgSer: 3.357 ± 1.173
ArgThr: 0.42 ± 0.34
ArgVal: 2.937 ± 0.681
ArgTrp: 0.42 ± 0.34
ArgTyr: 1.259 ± 0.491
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 0.839 ± 0.679
SerCys: 0.0 ± 0.0
SerAsp: 3.357 ± 0.741
SerGlu: 8.393 ± 2.478
SerPhe: 2.937 ± 1.042
SerGly: 4.616 ± 0.936
SerHis: 3.357 ± 0.976
SerIle: 7.134 ± 1.353
SerLys: 10.491 ± 1.338
SerLeu: 7.134 ± 0.89
SerMet: 1.679 ± 0.87
SerAsn: 2.518 ± 0.897
SerPro: 4.196 ± 0.584
SerGln: 3.357 ± 0.852
SerArg: 2.098 ± 1.028
SerSer: 8.812 ± 2.034
SerThr: 2.937 ± 1.207
SerVal: 2.098 ± 1.109
SerTrp: 0.839 ± 0.679
SerTyr: 2.937 ± 0.577
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.357 ± 1.382
ThrCys: 0.839 ± 0.376
ThrAsp: 4.616 ± 1.13
ThrGlu: 2.937 ± 1.028
ThrPhe: 2.098 ± 0.863
ThrGly: 2.518 ± 0.815
ThrHis: 1.259 ± 0.641
ThrIle: 3.357 ± 1.453
ThrLys: 3.357 ± 1.068
ThrLeu: 4.196 ± 0.638
ThrMet: 0.42 ± 0.359
ThrAsn: 2.098 ± 0.498
ThrPro: 2.518 ± 0.701
ThrGln: 2.098 ± 0.692
ThrArg: 1.259 ± 0.407
ThrSer: 5.036 ± 1.725
ThrThr: 2.937 ± 0.875
ThrVal: 2.518 ± 0.788
ThrTrp: 0.42 ± 0.639
ThrTyr: 2.098 ± 0.673
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.679 ± 0.796
ValCys: 1.259 ± 1.021
ValAsp: 2.518 ± 1.135
ValGlu: 2.098 ± 0.533
ValPhe: 1.679 ± 0.74
ValGly: 2.518 ± 1.252
ValHis: 0.839 ± 0.504
ValIle: 2.518 ± 1.695
ValLys: 6.295 ± 1.403
ValLeu: 5.455 ± 1.386
ValMet: 1.259 ± 0.589
ValAsn: 2.098 ± 0.673
ValPro: 2.098 ± 0.785
ValGln: 1.679 ± 0.748
ValArg: 2.518 ± 0.847
ValSer: 2.937 ± 1.165
ValThr: 2.098 ± 0.719
ValVal: 2.518 ± 0.974
ValTrp: 0.839 ± 0.376
ValTyr: 2.937 ± 0.652
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.42 ± 0.339
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.839 ± 0.385
TrpPhe: 0.42 ± 0.34
TrpGly: 0.839 ± 0.681
TrpHis: 0.0 ± 0.0
TrpIle: 1.259 ± 0.651
TrpLys: 0.42 ± 0.359
TrpLeu: 0.42 ± 0.506
TrpMet: 0.42 ± 0.34
TrpAsn: 0.42 ± 0.401
TrpPro: 0.839 ± 1.114
TrpGln: 1.259 ± 0.825
TrpArg: 0.839 ± 0.385
TrpSer: 0.839 ± 0.504
TrpThr: 0.839 ± 0.385
TrpVal: 0.839 ± 0.376
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.42 ± 0.639
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.937 ± 0.848
TyrCys: 0.839 ± 0.505
TyrAsp: 0.839 ± 0.504
TyrGlu: 2.098 ± 1.794
TyrPhe: 0.839 ± 0.718
TyrGly: 0.0 ± 0.0
TyrHis: 0.839 ± 0.385
TyrIle: 1.679 ± 0.687
TyrLys: 1.679 ± 0.566
TyrLeu: 2.937 ± 0.868
TyrMet: 0.0 ± 0.0
TyrAsn: 0.839 ± 0.417
TyrPro: 0.839 ± 0.505
TyrGln: 3.777 ± 0.978
TyrArg: 0.42 ± 0.359
TyrSer: 4.196 ± 1.406
TyrThr: 1.259 ± 0.322
TyrVal: 2.518 ± 0.974
TyrTrp: 0.839 ± 0.565
TyrTyr: 0.42 ± 0.359
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2384 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
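Protein-level bootstrapping can be sketched roughly as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the error are all assumptions made for illustration; the page does not specify these details.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Per mille frequency of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Made-up one-letter sequences, not taken from the virus proteome.
toy_proteome = ["MKKLE", "AKKE", "GGSGG"]
print(bootstrap_error(toy_proteome, "KK"))
print(bootstrap_error(toy_proteome, "WW"))
```

A dipeptide that never occurs has a frequency of zero in every resample, so its estimated error is also zero, which matches the 0.0 ± 0.0 entries in the table.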

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski