Amino acid dipeptide frequency for Mokola virus (MOKV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
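As an illustration, dipeptide frequencies can be obtained by sliding a two-residue window along each protein sequence. The following is a minimal Python sketch, assuming overlapping windows that never span two proteins and one-letter input sequences. The function name `dipeptide_per_mille` and the toy sequences are hypothetical; the exact counting procedure used for the table on this page is not stated here.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Assumptions: overlapping windows, counted within each protein only;
    any non-standard letter is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (not Mokola virus data)
freqs = dipeptide_per_mille(["MALLS", "SLLA"])
```

Here the two toy proteins contain seven dipeptides in total, two of which are `LL`; no `SS` pair is counted because windows do not cross protein boundaries.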

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
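The expected value can be worked out directly. The sketch below, in Python, computes the uniform expectation for 400 and 441 dipeptides and labels a few values taken from the table. The helper `label` is hypothetical, and using exactly 2.5‰ as the threshold for the red/blue marking is an assumption; the page does not state how the colouring is decided.

```python
STANDARD_AA = 20

# Uniform expectation in per mille: 1000 / 400 = 2.5 (i.e. 0.25%)
EXPECTED_PER_MILLE = 1000.0 / STANDARD_AA ** 2
# With Xaa as a 21st residue: 1000 / 441, about 2.27 per mille
EXPECTED_WITH_XAA = 1000.0 / (STANDARD_AA + 1) ** 2

def label(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Classify a dipeptide frequency relative to the uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"

# Three values copied from the table (per mille)
examples = {"AlaAla": 2.658, "AlaCys": 1.45, "LeuSer": 10.875}
labels = {pair: label(value) for pair, value in examples.items()}
```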

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
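As a small worked example of the unit conversion, the Python sketch below turns a per-mille value into a fraction and a percentage. The function names are hypothetical; the input value is the LeuSer entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0

leu_ser = 10.875  # LeuSer frequency from the table, in per mille
fraction = per_mille_to_fraction(leu_ser)
percent = per_mille_to_percent(leu_ser)
```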

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.658 ± 0.946
AlaCys: 1.45 ± 0.624
AlaAsp: 2.9 ± 1.027
AlaGlu: 5.075 ± 2.61
AlaPhe: 1.45 ± 0.66
AlaGly: 1.208 ± 0.411
AlaHis: 1.933 ± 1.076
AlaIle: 2.175 ± 1.098
AlaLys: 1.45 ± 0.831
AlaLeu: 5.317 ± 1.211
AlaMet: 1.933 ± 0.463
AlaAsn: 1.692 ± 0.715
AlaPro: 2.175 ± 0.771
AlaGln: 1.692 ± 0.809
AlaArg: 4.592 ± 1.901
AlaSer: 3.625 ± 0.865
AlaThr: 1.692 ± 0.86
AlaVal: 3.142 ± 1.087
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.45 ± 0.743
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.725 ± 0.555
CysCys: 0.483 ± 0.292
CysAsp: 0.483 ± 0.292
CysGlu: 0.242 ± 0.334
CysPhe: 0.483 ± 0.292
CysGly: 0.967 ± 0.584
CysHis: 0.242 ± 0.441
CysIle: 1.692 ± 1.081
CysLys: 0.725 ± 0.796
CysLeu: 2.417 ± 0.886
CysMet: 0.242 ± 0.334
CysAsn: 0.483 ± 0.292
CysPro: 1.208 ± 0.369
CysGln: 0.725 ± 0.427
CysArg: 0.483 ± 0.292
CysSer: 2.417 ± 0.618
CysThr: 0.483 ± 0.667
CysVal: 0.242 ± 0.151
CysTrp: 0.242 ± 0.151
CysTyr: 0.967 ± 0.411
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.45 ± 0.571
AspCys: 0.242 ± 0.304
AspAsp: 5.8 ± 2.04
AspGlu: 3.625 ± 2.058
AspPhe: 3.142 ± 1.396
AspGly: 2.658 ± 1.027
AspHis: 1.933 ± 1.305
AspIle: 7.008 ± 2.97
AspLys: 4.108 ± 2.659
AspLeu: 6.767 ± 2.19
AspMet: 1.692 ± 0.601
AspAsn: 3.142 ± 1.172
AspPro: 5.075 ± 0.621
AspGln: 2.175 ± 1.098
AspArg: 2.417 ± 0.964
AspSer: 4.108 ± 1.361
AspThr: 1.933 ± 0.434
AspVal: 2.9 ± 0.825
AspTrp: 0.483 ± 0.269
AspTyr: 1.933 ± 0.689
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.383 ± 0.752
GluCys: 0.242 ± 0.334
GluAsp: 4.833 ± 2.449
GluGlu: 7.975 ± 3.755
GluPhe: 1.692 ± 0.571
GluGly: 6.283 ± 2.463
GluHis: 1.208 ± 0.894
GluIle: 4.35 ± 0.759
GluLys: 3.383 ± 0.338
GluLeu: 3.142 ± 0.837
GluMet: 2.9 ± 0.765
GluAsn: 0.967 ± 0.494
GluPro: 2.9 ± 0.811
GluGln: 1.208 ± 0.856
GluArg: 4.35 ± 1.561
GluSer: 8.458 ± 2.071
GluThr: 3.383 ± 1.264
GluVal: 3.625 ± 0.908
GluTrp: 0.483 ± 0.302
GluTyr: 0.725 ± 0.934
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.967 ± 0.853
PheCys: 0.242 ± 0.441
PheAsp: 2.658 ± 1.113
PheGlu: 2.9 ± 1.421
PhePhe: 3.142 ± 1.313
PheGly: 1.208 ± 0.534
PheHis: 1.45 ± 0.669
PheIle: 1.208 ± 0.564
PheLys: 2.9 ± 0.845
PheLeu: 5.317 ± 1.137
PheMet: 0.242 ± 0.151
PheAsn: 1.45 ± 0.452
PhePro: 4.35 ± 1.051
PheGln: 1.692 ± 0.689
PheArg: 4.833 ± 0.884
PheSer: 3.867 ± 1.28
PheThr: 1.933 ± 1.172
PheVal: 3.142 ± 1.088
PheTrp: 0.483 ± 0.292
PheTyr: 0.483 ± 0.292
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.658 ± 0.49
GlyCys: 1.208 ± 0.595
GlyAsp: 2.417 ± 1.057
GlyGlu: 5.8 ± 3.598
GlyPhe: 3.383 ± 1.533
GlyGly: 3.625 ± 0.945
GlyHis: 0.725 ± 0.313
GlyIle: 3.142 ± 0.666
GlyLys: 3.625 ± 1.407
GlyLeu: 6.283 ± 1.881
GlyMet: 0.967 ± 0.436
GlyAsn: 3.383 ± 0.798
GlyPro: 1.933 ± 0.719
GlyGln: 2.417 ± 1.075
GlyArg: 2.417 ± 0.902
GlySer: 3.142 ± 0.327
GlyThr: 3.142 ± 1.318
GlyVal: 3.142 ± 0.567
GlyTrp: 0.242 ± 0.151
GlyTyr: 3.383 ± 0.568
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.483 ± 0.269
HisCys: 0.0 ± 0.0
HisAsp: 2.9 ± 2.009
HisGlu: 0.967 ± 0.411
HisPhe: 1.208 ± 0.816
HisGly: 0.725 ± 0.427
HisHis: 0.725 ± 0.313
HisIle: 1.692 ± 0.889
HisLys: 0.967 ± 0.613
HisLeu: 2.175 ± 0.48
HisMet: 0.725 ± 0.608
HisAsn: 0.483 ± 0.292
HisPro: 1.45 ± 0.479
HisGln: 1.45 ± 0.717
HisArg: 0.725 ± 0.452
HisSer: 2.175 ± 0.72
HisThr: 1.933 ± 0.747
HisVal: 0.967 ± 0.411
HisTrp: 0.967 ± 0.411
HisTyr: 0.483 ± 0.302
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.175 ± 0.624
IleCys: 1.208 ± 0.453
IleAsp: 2.9 ± 1.524
IleGlu: 4.833 ± 2.057
IlePhe: 2.658 ± 0.395
IleGly: 3.625 ± 0.877
IleHis: 2.175 ± 0.692
IleIle: 3.867 ± 1.714
IleLys: 4.35 ± 0.272
IleLeu: 8.217 ± 1.526
IleMet: 1.45 ± 0.61
IleAsn: 3.383 ± 0.93
IlePro: 2.9 ± 1.041
IleGln: 3.383 ± 2.067
IleArg: 3.867 ± 0.667
IleSer: 4.833 ± 1.864
IleThr: 2.175 ± 0.86
IleVal: 3.625 ± 1.696
IleTrp: 0.967 ± 0.494
IleTyr: 2.9 ± 1.25
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.9 ± 1.029
LysCys: 0.483 ± 0.302
LysAsp: 4.35 ± 0.79
LysGlu: 2.175 ± 0.824
LysPhe: 0.967 ± 0.652
LysGly: 5.558 ± 1.991
LysHis: 0.483 ± 0.292
LysIle: 3.383 ± 0.969
LysLys: 4.35 ± 1.338
LysLeu: 4.592 ± 0.409
LysMet: 1.933 ± 0.678
LysAsn: 0.967 ± 0.411
LysPro: 1.692 ± 0.56
LysGln: 0.967 ± 0.338
LysArg: 3.625 ± 1.405
LysSer: 5.317 ± 0.79
LysThr: 4.35 ± 1.159
LysVal: 5.558 ± 0.758
LysTrp: 1.208 ± 0.597
LysTyr: 1.692 ± 0.768
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.833 ± 0.921
LeuCys: 1.45 ± 0.452
LeuAsp: 5.8 ± 1.205
LeuGlu: 4.833 ± 1.097
LeuPhe: 4.833 ± 2.173
LeuGly: 5.317 ± 0.949
LeuHis: 1.208 ± 0.894
LeuIle: 8.7 ± 1.163
LeuLys: 7.008 ± 0.683
LeuLeu: 8.7 ± 1.829
LeuMet: 3.383 ± 1.307
LeuAsn: 4.833 ± 1.985
LeuPro: 3.625 ± 1.116
LeuGln: 1.933 ± 0.859
LeuArg: 8.217 ± 1.528
LeuSer: 10.875 ± 1.68
LeuThr: 3.383 ± 1.539
LeuVal: 5.8 ± 1.259
LeuTrp: 2.658 ± 0.837
LeuTyr: 3.625 ± 0.862
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.933 ± 1.126
MetCys: 0.483 ± 0.269
MetAsp: 0.967 ± 0.584
MetGlu: 1.208 ± 0.816
MetPhe: 1.208 ± 0.556
MetGly: 0.725 ± 0.313
MetHis: 0.967 ± 0.815
MetIle: 1.692 ± 1.203
MetLys: 0.725 ± 0.313
MetLeu: 2.658 ± 1.377
MetMet: 0.725 ± 0.313
MetAsn: 3.625 ± 1.875
MetPro: 0.483 ± 0.55
MetGln: 0.483 ± 0.455
MetArg: 0.725 ± 0.452
MetSer: 4.108 ± 0.884
MetThr: 2.658 ± 0.661
MetVal: 0.242 ± 0.151
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.692 ± 0.56
AsnCys: 0.725 ± 0.452
AsnAsp: 3.142 ± 0.98
AsnGlu: 1.692 ± 0.476
AsnPhe: 3.383 ± 1.599
AsnGly: 1.692 ± 1.341
AsnHis: 1.45 ± 0.452
AsnIle: 4.108 ± 1.102
AsnLys: 0.967 ± 0.411
AsnLeu: 3.867 ± 0.688
AsnMet: 0.242 ± 0.441
AsnAsn: 1.208 ± 0.556
AsnPro: 2.9 ± 1.54
AsnGln: 1.208 ± 0.597
AsnArg: 2.175 ± 0.684
AsnSer: 5.075 ± 1.706
AsnThr: 1.208 ± 0.619
AsnVal: 2.9 ± 0.684
AsnTrp: 1.208 ± 0.719
AsnTyr: 1.45 ± 0.61
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.208 ± 0.534
ProCys: 0.967 ± 0.411
ProAsp: 5.075 ± 1.296
ProGlu: 4.108 ± 0.871
ProPhe: 0.967 ± 0.815
ProGly: 2.658 ± 1.345
ProHis: 0.725 ± 0.698
ProIle: 2.175 ± 1.084
ProLys: 1.933 ± 1.206
ProLeu: 6.767 ± 1.155
ProMet: 0.725 ± 0.323
ProAsn: 2.9 ± 0.684
ProPro: 2.417 ± 1.334
ProGln: 0.967 ± 0.603
ProArg: 1.933 ± 0.64
ProSer: 6.283 ± 1.486
ProThr: 2.417 ± 0.312
ProVal: 2.9 ± 1.062
ProTrp: 0.0 ± 0.0
ProTyr: 1.208 ± 0.369
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.692 ± 0.619
GlnCys: 0.483 ± 0.55
GlnAsp: 1.692 ± 0.543
GlnGlu: 2.175 ± 0.614
GlnPhe: 1.45 ± 0.905
GlnGly: 1.692 ± 0.565
GlnHis: 0.483 ± 0.292
GlnIle: 2.9 ± 0.831
GlnLys: 1.692 ± 0.607
GlnLeu: 2.658 ± 0.54
GlnMet: 0.242 ± 0.151
GlnAsn: 1.45 ± 0.61
GlnPro: 0.242 ± 0.151
GlnGln: 1.45 ± 0.514
GlnArg: 1.208 ± 0.754
GlnSer: 1.45 ± 0.624
GlnThr: 2.417 ± 0.581
GlnVal: 2.417 ± 0.581
GlnTrp: 0.725 ± 0.427
GlnTyr: 0.725 ± 0.313
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.108 ± 0.376
ArgCys: 0.967 ± 0.652
ArgAsp: 5.075 ± 1.558
ArgGlu: 2.658 ± 1.186
ArgPhe: 2.658 ± 0.612
ArgGly: 2.9 ± 1.524
ArgHis: 1.933 ± 0.623
ArgIle: 2.658 ± 1.182
ArgLys: 2.175 ± 0.935
ArgLeu: 7.733 ± 2.559
ArgMet: 1.692 ± 0.617
ArgAsn: 1.208 ± 0.593
ArgPro: 1.933 ± 0.64
ArgGln: 1.692 ± 0.838
ArgArg: 3.625 ± 1.347
ArgSer: 4.108 ± 0.526
ArgThr: 4.592 ± 0.513
ArgVal: 5.8 ± 0.665
ArgTrp: 0.725 ± 0.452
ArgTyr: 1.933 ± 0.689
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.108 ± 1.064
SerCys: 1.208 ± 0.652
SerAsp: 4.833 ± 1.059
SerGlu: 6.767 ± 1.561
SerPhe: 5.558 ± 0.675
SerGly: 7.25 ± 1.842
SerHis: 1.45 ± 0.514
SerIle: 5.075 ± 1.644
SerLys: 5.8 ± 1.3
SerLeu: 9.425 ± 2.03
SerMet: 2.658 ± 0.693
SerAsn: 3.142 ± 0.508
SerPro: 3.867 ± 1.2
SerGln: 1.208 ± 0.453
SerArg: 8.7 ± 1.76
SerSer: 10.875 ± 1.924
SerThr: 4.35 ± 1.159
SerVal: 4.35 ± 0.313
SerTrp: 2.417 ± 1.057
SerTyr: 2.9 ± 0.677
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.142 ± 1.313
ThrCys: 0.967 ± 0.338
ThrAsp: 2.175 ± 1.043
ThrGlu: 2.9 ± 1.098
ThrPhe: 1.933 ± 0.62
ThrGly: 2.9 ± 0.402
ThrHis: 1.208 ± 0.369
ThrIle: 3.142 ± 0.947
ThrLys: 2.9 ± 1.115
ThrLeu: 5.075 ± 1.467
ThrMet: 1.692 ± 0.578
ThrAsn: 2.9 ± 0.825
ThrPro: 2.417 ± 0.77
ThrGln: 2.658 ± 1.241
ThrArg: 2.9 ± 1.54
ThrSer: 3.142 ± 0.625
ThrThr: 3.383 ± 1.256
ThrVal: 2.417 ± 0.857
ThrTrp: 1.45 ± 0.514
ThrTyr: 2.175 ± 0.72
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.833 ± 2.442
ValCys: 2.175 ± 1.239
ValAsp: 3.142 ± 0.697
ValGlu: 3.383 ± 1.206
ValPhe: 2.175 ± 0.72
ValGly: 3.142 ± 1.657
ValHis: 1.933 ± 0.554
ValIle: 3.625 ± 1.255
ValLys: 3.142 ± 1.849
ValLeu: 4.108 ± 1.44
ValMet: 0.0 ± 0.0
ValAsn: 1.933 ± 0.689
ValPro: 5.317 ± 1.046
ValGln: 0.967 ± 0.603
ValArg: 1.933 ± 0.5
ValSer: 7.008 ± 0.741
ValThr: 3.625 ± 1.46
ValVal: 4.592 ± 0.801
ValTrp: 1.208 ± 0.664
ValTyr: 2.417 ± 0.901
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.692 ± 0.774
TrpCys: 0.483 ± 0.55
TrpAsp: 0.242 ± 0.441
TrpGlu: 0.967 ± 0.411
TrpPhe: 0.242 ± 0.151
TrpGly: 1.208 ± 0.529
TrpHis: 0.483 ± 0.302
TrpIle: 1.45 ± 0.905
TrpLys: 0.967 ± 0.411
TrpLeu: 1.208 ± 0.597
TrpMet: 0.483 ± 0.55
TrpAsn: 1.692 ± 0.565
TrpPro: 0.242 ± 0.151
TrpGln: 0.0 ± 0.0
TrpArg: 0.242 ± 0.151
TrpSer: 1.692 ± 0.543
TrpThr: 0.725 ± 0.323
TrpVal: 1.208 ± 0.451
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.242 ± 0.151
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.967 ± 0.411
TyrCys: 0.483 ± 0.302
TyrAsp: 1.208 ± 0.534
TyrGlu: 1.45 ± 0.537
TyrPhe: 1.933 ± 0.62
TyrGly: 1.692 ± 0.588
TyrHis: 0.483 ± 0.302
TyrIle: 1.45 ± 0.669
TyrLys: 3.383 ± 0.752
TyrLeu: 4.592 ± 1.092
TyrMet: 1.45 ± 0.61
TyrAsn: 1.45 ± 0.66
TyrPro: 1.208 ± 0.75
TyrGln: 0.967 ± 0.603
TyrArg: 1.208 ± 0.369
TyrSer: 3.383 ± 1.334
TyrThr: 1.692 ± 1.224
TyrVal: 1.692 ± 1.422
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.725 ± 0.323
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (4139 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
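Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resampled values is reported as the error. This is a minimal Python sketch under stated assumptions, not the database's implementation. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the error measure are all assumptions; only the 100 resamples and the protein-level resampling come from the note above.

```python
import random
import statistics
from collections import Counter

def pair_per_mille(proteins, pair):
    """Per-mille frequency of one dipeptide (overlapping windows within each protein)."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)

# Toy proteome (not Mokola virus data), one-letter codes
toy = ["MALLSLLA", "MKLLV", "MSSAAG"]
error = bootstrap_error(toy, "LL")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much the frequency depends on which proteins happen to be in the set.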

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski