Amino acid dipeptide frequency for Vanilla virus X

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
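As an illustration of how such frequencies can be obtained, the following minimal sketch counts overlapping dipeptides in a set of protein sequences and reports them in per mille. The function name `dipeptide_permille`, the one-letter residue alphabet, and the toy sequences are assumptions made for this example; the exact counting rules used by the database (for instance, how Xaa is handled) are not stated on this page.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_permille(proteins):
    """Frequency of each of the 400 standard dipeptides, in per mille.

    Counts are pooled over all proteins (an assumption of this sketch).
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {a + b: 0.0 for a, b in product(STANDARD, repeat=2)}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(STANDARD, repeat=2)}


# Expected value if all 400 dipeptides were equally likely: 2.5 per mille.
UNIFORM = 1000.0 / 400

toy = ["MKTAYIAKQR", "MAALLKAA"]  # hypothetical sequences, not from this proteome
freqs = dipeptide_permille(toy)
print(freqs["AA"], UNIFORM)
```

With real proteome sequences, most of the 400 values would differ from the uniform 2.5‰, which is the deviation the table colouring highlights.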

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
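A short sketch of the unit conversion, using two values from the table (AlaAla and CysAla). The helper names `permille_to_fraction` and `classify` are hypothetical, and the comparison against the uniform expectation of 2.5‰ is an assumption based on the description above; the exact threshold used for the table colouring is not given here.

```python
UNIFORM_PERMILLE = 1000.0 / 400  # 0.25% expressed in per mille, i.e. 2.5


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=UNIFORM_PERMILLE):
    """Label a per mille value relative to the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


examples = {"AlaAla": 7.314, "CysAla": 0.488}  # values taken from the table
for name, value in examples.items():
    print(name, permille_to_fraction(value), classify(value))
```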

For more information, see the sequence space article on Wikipedia.

Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Each entry: dipeptide: frequency ± error (‰), grouped by first residue

Ala
AlaAla: 7.314 ± 0.871
AlaCys: 1.463 ± 0.666
AlaAsp: 4.876 ± 2.7
AlaGlu: 1.463 ± 0.648
AlaPhe: 2.925 ± 0.523
AlaGly: 5.851 ± 1.215
AlaHis: 2.925 ± 1.024
AlaIle: 5.851 ± 3.282
AlaLys: 5.363 ± 2.047
AlaLeu: 7.801 ± 4.647
AlaMet: 2.438 ± 0.947
AlaAsn: 5.851 ± 1.846
AlaPro: 3.901 ± 2.027
AlaGln: 4.876 ± 2.094
AlaArg: 3.901 ± 2.0
AlaSer: 6.338 ± 1.901
AlaThr: 7.314 ± 3.628
AlaVal: 5.363 ± 3.156
AlaTrp: 0.488 ± 1.203
AlaTyr: 4.388 ± 2.303
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.488 ± 0.887
CysCys: 0.0 ± 0.0
CysAsp: 0.488 ± 0.256
CysGlu: 1.95 ± 1.024
CysPhe: 0.0 ± 0.0
CysGly: 1.463 ± 0.666
CysHis: 0.0 ± 0.0
CysIle: 0.975 ± 0.512
CysLys: 0.975 ± 0.867
CysLeu: 0.488 ± 0.834
CysMet: 0.488 ± 0.887
CysAsn: 0.488 ± 0.256
CysPro: 0.975 ± 0.71
CysGln: 0.488 ± 0.256
CysArg: 0.488 ± 0.834
CysSer: 1.463 ± 0.768
CysThr: 2.438 ± 1.027
CysVal: 1.463 ± 0.768
CysTrp: 0.0 ± 0.0
CysTyr: 0.488 ± 0.256
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.413 ± 1.229
AspCys: 0.975 ± 0.512
AspAsp: 4.388 ± 1.683
AspGlu: 2.438 ± 0.846
AspPhe: 2.438 ± 0.759
AspGly: 2.438 ± 1.38
AspHis: 0.975 ± 1.067
AspIle: 6.338 ± 1.634
AspLys: 1.463 ± 0.648
AspLeu: 3.413 ± 0.54
AspMet: 0.488 ± 0.256
AspAsn: 0.975 ± 1.067
AspPro: 3.901 ± 2.586
AspGln: 2.438 ± 1.279
AspArg: 2.925 ± 0.923
AspSer: 2.925 ± 1.024
AspThr: 5.851 ± 0.875
AspVal: 1.95 ± 1.467
AspTrp: 1.463 ± 0.768
AspTyr: 3.413 ± 1.36
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.338 ± 1.901
GluCys: 0.488 ± 0.256
GluAsp: 3.413 ± 1.121
GluGlu: 4.876 ± 1.806
GluPhe: 0.975 ± 0.512
GluGly: 0.488 ± 0.256
GluHis: 0.488 ± 0.834
GluIle: 3.413 ± 1.185
GluLys: 4.876 ± 1.92
GluLeu: 6.338 ± 3.327
GluMet: 0.0 ± 0.0
GluAsn: 2.925 ± 1.024
GluPro: 2.925 ± 0.923
GluGln: 2.438 ± 1.279
GluArg: 2.438 ± 0.759
GluSer: 1.463 ± 0.648
GluThr: 1.463 ± 0.768
GluVal: 2.438 ± 0.759
GluTrp: 1.463 ± 0.768
GluTyr: 1.463 ± 0.666
GluXaa: 0.0 ± 0.0

Phe
PheAla: 5.851 ± 1.846
PheCys: 1.463 ± 0.648
PheAsp: 2.925 ± 1.316
PheGlu: 2.438 ± 1.279
PhePhe: 0.975 ± 0.733
PheGly: 3.413 ± 0.54
PheHis: 1.95 ± 0.717
PheIle: 0.975 ± 0.71
PheLys: 1.95 ± 1.024
PheLeu: 7.314 ± 2.267
PheMet: 0.975 ± 0.512
PheAsn: 0.488 ± 0.256
PhePro: 1.463 ± 0.768
PheGln: 0.975 ± 0.512
PheArg: 2.438 ± 1.168
PheSer: 2.438 ± 1.279
PheThr: 2.925 ± 0.523
PheVal: 2.438 ± 0.846
PheTrp: 0.975 ± 0.733
PheTyr: 1.463 ± 1.49
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 4.388 ± 1.998
GlyCys: 0.975 ± 0.512
GlyAsp: 5.851 ± 1.344
GlyGlu: 2.438 ± 1.279
GlyPhe: 1.95 ± 0.717
GlyGly: 3.413 ± 1.687
GlyHis: 1.95 ± 0.794
GlyIle: 1.95 ± 0.794
GlyLys: 3.413 ± 1.166
GlyLeu: 4.388 ± 2.193
GlyMet: 0.975 ± 0.764
GlyAsn: 1.463 ± 0.768
GlyPro: 1.95 ± 1.024
GlyGln: 2.438 ± 0.621
GlyArg: 1.95 ± 1.13
GlySer: 3.413 ± 1.043
GlyThr: 2.438 ± 0.999
GlyVal: 2.925 ± 3.124
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.488 ± 0.256
GlyXaa: 0.0 ± 0.0

His
HisAla: 3.413 ± 0.87
HisCys: 1.463 ± 1.607
HisAsp: 0.488 ± 0.256
HisGlu: 1.95 ± 1.024
HisPhe: 1.463 ± 0.768
HisGly: 1.463 ± 0.98
HisHis: 4.876 ± 6.647
HisIle: 1.463 ± 0.666
HisLys: 1.95 ± 1.42
HisLeu: 2.438 ± 0.846
HisMet: 0.488 ± 0.677
HisAsn: 0.0 ± 0.0
HisPro: 4.388 ± 1.754
HisGln: 0.975 ± 0.512
HisArg: 2.438 ± 1.727
HisSer: 1.463 ± 0.779
HisThr: 2.925 ± 1.205
HisVal: 0.488 ± 1.203
HisTrp: 0.0 ± 0.0
HisTyr: 0.488 ± 0.834
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.388 ± 2.324
IleCys: 0.975 ± 0.71
IleAsp: 2.925 ± 0.981
IleGlu: 2.438 ± 1.353
IlePhe: 5.363 ± 1.396
IleGly: 2.925 ± 1.332
IleHis: 1.95 ± 2.411
IleIle: 4.876 ± 4.654
IleLys: 6.826 ± 1.259
IleLeu: 6.338 ± 3.691
IleMet: 2.925 ± 0.923
IleAsn: 3.413 ± 1.791
IlePro: 2.925 ± 1.332
IleGln: 4.388 ± 1.134
IleArg: 1.95 ± 1.13
IleSer: 2.438 ± 1.279
IleThr: 4.388 ± 2.193
IleVal: 4.388 ± 2.968
IleTrp: 0.0 ± 0.0
IleTyr: 2.438 ± 2.033
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.876 ± 1.519
LysCys: 1.463 ± 0.768
LysAsp: 2.925 ± 0.523
LysGlu: 2.925 ± 1.535
LysPhe: 0.975 ± 0.512
LysGly: 1.95 ± 0.717
LysHis: 1.463 ± 0.768
LysIle: 3.901 ± 0.663
LysLys: 2.438 ± 1.279
LysLeu: 7.314 ± 3.017
LysMet: 1.95 ± 1.024
LysAsn: 2.925 ± 0.923
LysPro: 3.413 ± 1.53
LysGln: 2.438 ± 1.279
LysArg: 0.975 ± 0.512
LysSer: 4.388 ± 1.569
LysThr: 5.851 ± 1.606
LysVal: 2.438 ± 0.621
LysTrp: 0.0 ± 0.0
LysTyr: 1.95 ± 0.717
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 7.314 ± 0.874
LeuCys: 1.463 ± 1.528
LeuAsp: 2.925 ± 1.559
LeuGlu: 6.826 ± 2.126
LeuPhe: 4.388 ± 2.303
LeuGly: 5.851 ± 1.765
LeuHis: 3.901 ± 1.451
LeuIle: 5.363 ± 3.348
LeuLys: 8.289 ± 2.906
LeuLeu: 9.751 ± 3.858
LeuMet: 0.975 ± 0.512
LeuAsn: 4.876 ± 1.243
LeuPro: 7.314 ± 3.823
LeuGln: 5.363 ± 2.078
LeuArg: 4.876 ± 2.243
LeuSer: 4.876 ± 1.222
LeuThr: 5.851 ± 2.539
LeuVal: 3.901 ± 1.451
LeuTrp: 0.975 ± 0.512
LeuTyr: 1.95 ± 1.024
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.438 ± 0.621
MetCys: 0.0 ± 0.0
MetAsp: 0.975 ± 0.512
MetGlu: 0.488 ± 0.256
MetPhe: 0.488 ± 0.887
MetGly: 0.975 ± 0.512
MetHis: 0.488 ± 1.013
MetIle: 0.975 ± 0.512
MetLys: 0.975 ± 0.512
MetLeu: 2.438 ± 0.759
MetMet: 0.0 ± 0.0
MetAsn: 0.975 ± 0.512
MetPro: 1.463 ± 0.98
MetGln: 0.975 ± 0.512
MetArg: 1.95 ± 1.024
MetSer: 0.0 ± 0.0
MetThr: 0.488 ± 0.256
MetVal: 0.488 ± 0.256
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 5.363 ± 1.406
AsnCys: 2.438 ± 0.845
AsnAsp: 2.925 ± 1.535
AsnGlu: 0.975 ± 0.512
AsnPhe: 3.413 ± 1.229
AsnGly: 0.975 ± 0.512
AsnHis: 0.975 ± 0.71
AsnIle: 3.901 ± 1.478
AsnLys: 0.975 ± 0.733
AsnLeu: 3.901 ± 1.34
AsnMet: 0.488 ± 0.256
AsnAsn: 0.975 ± 0.733
AsnPro: 3.413 ± 1.281
AsnGln: 1.95 ± 0.658
AsnArg: 1.463 ± 0.666
AsnSer: 4.876 ± 1.191
AsnThr: 2.925 ± 1.065
AsnVal: 1.95 ± 1.024
AsnTrp: 0.488 ± 0.887
AsnTyr: 1.95 ± 0.658
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 6.826 ± 3.702
ProCys: 0.975 ± 0.867
ProAsp: 2.438 ± 0.986
ProGlu: 3.901 ± 0.663
ProPhe: 4.388 ± 1.426
ProGly: 2.438 ± 1.353
ProHis: 1.463 ± 2.26
ProIle: 4.876 ± 1.92
ProLys: 3.413 ± 0.891
ProLeu: 6.826 ± 1.912
ProMet: 0.488 ± 0.569
ProAsn: 2.438 ± 1.279
ProPro: 2.438 ± 2.016
ProGln: 1.463 ± 0.768
ProArg: 1.95 ± 1.024
ProSer: 5.363 ± 1.691
ProThr: 5.363 ± 2.582
ProVal: 2.925 ± 2.286
ProTrp: 1.463 ± 0.768
ProTyr: 0.975 ± 0.512
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.901 ± 1.059
GlnCys: 0.488 ± 0.256
GlnAsp: 3.413 ± 0.87
GlnGlu: 0.975 ± 0.512
GlnPhe: 1.463 ± 0.648
GlnGly: 1.95 ± 1.024
GlnHis: 1.95 ± 0.717
GlnIle: 3.901 ± 2.443
GlnLys: 1.463 ± 0.768
GlnLeu: 6.338 ± 1.775
GlnMet: 0.0 ± 0.0
GlnAsn: 3.901 ± 1.316
GlnPro: 2.925 ± 0.981
GlnGln: 1.95 ± 0.658
GlnArg: 1.95 ± 0.771
GlnSer: 2.438 ± 1.279
GlnThr: 3.413 ± 0.54
GlnVal: 2.438 ± 1.38
GlnTrp: 0.975 ± 0.71
GlnTyr: 0.975 ± 0.512
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 3.901 ± 2.934
ArgCys: 0.0 ± 0.0
ArgAsp: 1.463 ± 0.666
ArgGlu: 2.438 ± 1.279
ArgPhe: 3.413 ± 0.54
ArgGly: 1.463 ± 1.356
ArgHis: 1.463 ± 1.923
ArgIle: 1.463 ± 0.98
ArgLys: 3.413 ± 1.791
ArgLeu: 2.438 ± 0.846
ArgMet: 1.463 ± 0.768
ArgAsn: 2.925 ± 1.535
ArgPro: 1.95 ± 0.717
ArgGln: 2.925 ± 1.353
ArgArg: 3.413 ± 1.121
ArgSer: 0.975 ± 1.067
ArgThr: 4.388 ± 2.005
ArgVal: 3.413 ± 2.797
ArgTrp: 0.0 ± 0.0
ArgTyr: 3.901 ± 1.301
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 2.925 ± 1.535
SerCys: 0.975 ± 0.512
SerAsp: 5.363 ± 1.119
SerGlu: 3.413 ± 1.121
SerPhe: 1.95 ± 1.024
SerGly: 2.925 ± 0.523
SerHis: 1.463 ± 0.648
SerIle: 5.363 ± 1.535
SerLys: 2.438 ± 0.846
SerLeu: 5.851 ± 1.579
SerMet: 0.488 ± 0.256
SerAsn: 2.438 ± 0.759
SerPro: 4.388 ± 1.426
SerGln: 4.876 ± 1.391
SerArg: 3.901 ± 1.427
SerSer: 3.901 ± 0.663
SerThr: 2.438 ± 0.999
SerVal: 1.463 ± 1.002
SerTrp: 0.0 ± 0.0
SerTyr: 0.975 ± 0.512
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.363 ± 4.549
ThrCys: 0.0 ± 0.0
ThrAsp: 2.925 ± 1.535
ThrGlu: 3.413 ± 1.791
ThrPhe: 5.851 ± 1.215
ThrGly: 5.363 ± 2.582
ThrHis: 3.413 ± 0.87
ThrIle: 4.876 ± 3.705
ThrLys: 1.463 ± 0.98
ThrLeu: 8.776 ± 4.798
ThrMet: 0.488 ± 0.256
ThrAsn: 3.413 ± 3.39
ThrPro: 8.776 ± 4.06
ThrGln: 0.975 ± 0.512
ThrArg: 3.413 ± 2.633
ThrSer: 3.413 ± 1.229
ThrThr: 2.925 ± 1.316
ThrVal: 4.876 ± 1.692
ThrTrp: 0.488 ± 0.256
ThrTyr: 0.975 ± 0.512
ThrXaa: 0.0 ± 0.0

Val
ValAla: 4.388 ± 3.924
ValCys: 0.0 ± 0.0
ValAsp: 2.438 ± 0.846
ValGlu: 4.388 ± 2.896
ValPhe: 2.438 ± 0.986
ValGly: 3.413 ± 2.49
ValHis: 0.488 ± 0.256
ValIle: 4.388 ± 0.849
ValLys: 3.413 ± 1.229
ValLeu: 1.95 ± 1.733
ValMet: 0.488 ± 0.256
ValAsn: 1.463 ± 0.98
ValPro: 2.925 ± 1.102
ValGln: 2.925 ± 1.024
ValArg: 2.925 ± 1.024
ValSer: 2.438 ± 2.594
ValThr: 4.876 ± 2.941
ValVal: 1.463 ± 0.98
ValTrp: 0.488 ± 0.887
ValTyr: 1.95 ± 0.717
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.95 ± 0.956
TrpCys: 0.0 ± 0.0
TrpAsp: 0.488 ± 0.887
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.488 ± 0.834
TrpIle: 0.0 ± 0.0
TrpLys: 0.975 ± 0.512
TrpLeu: 1.463 ± 0.768
TrpMet: 0.0 ± 0.0
TrpAsn: 0.975 ± 0.733
TrpPro: 0.488 ± 0.256
TrpGln: 0.488 ± 0.887
TrpArg: 0.975 ± 0.512
TrpSer: 0.488 ± 0.256
TrpThr: 0.0 ± 0.0
TrpVal: 0.975 ± 0.512
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 5.851 ± 1.554
TyrCys: 0.0 ± 0.0
TyrAsp: 0.975 ± 0.512
TyrGlu: 0.975 ± 0.512
TyrPhe: 0.975 ± 0.512
TyrGly: 0.0 ± 0.0
TyrHis: 1.95 ± 1.024
TyrIle: 3.413 ± 1.791
TyrLys: 0.488 ± 0.256
TyrLeu: 1.463 ± 0.768
TyrMet: 0.488 ± 0.256
TyrAsn: 3.413 ± 0.87
TyrPro: 0.975 ± 0.71
TyrGln: 1.463 ± 1.308
TyrArg: 0.488 ± 0.887
TyrSer: 2.438 ± 0.846
TyrThr: 2.925 ± 1.332
TyrVal: 1.463 ± 0.666
TyrTrp: 0.488 ± 0.256
TyrTyr: 0.488 ± 0.256
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 5 proteins (2052 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
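Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values serves as the error. This is only an illustration under stated assumptions; the function names, the toy sequences, and the use of the standard deviation as the reported error are assumptions of this sketch, not details confirmed by this page.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Per mille frequency of one dipeptide, pooled over the given proteins."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2 within each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


toy = ["MKTAYIAKQR", "MAALLKAA", "MLLLKAAG"]  # hypothetical sequences
print(pair_permille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Because whole proteins are resampled, a dipeptide that occurs in only one or two proteins can have an error as large as its frequency, while a dipeptide that never occurs gets 0.0 ± 0.0, as seen in the Xaa entries.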

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski