Amino acid dipeptide frequency for Citrus yellow mosaic virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25%. Since this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those that are underrepresented are marked in blue in the table.
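The counting described above can be sketched in Python. This is a minimal illustration, not the database's own code: the function name `dipeptide_per_mille` is hypothetical, and two details are assumptions not stated in the source, namely that pairs are counted only within a protein (not across protein boundaries) and that any letter outside the 20 standard one-letter codes is mapped to `X` (Xaa).

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumptions (not taken from the source): overlapping pairs are counted
    within each protein only, and non-standard letters become 'X'.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything that is not a standard residue to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Every adjacent, overlapping pair of residues is one dipeptide.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


if __name__ == "__main__":
    # Toy input: pairs are AA, AA, AC (first protein) and AC (second).
    print(dipeptide_per_mille(["AAAC", "AC"]))
```

Dipeptides that never occur are simply absent from the returned dictionary; a full 21 x 21 table would report them as 0.0.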

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions. On this scale, the uniform expectation of 0.25% corresponds to 2.5‰.
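The unit conversion and the comparison against the uniform expectation can be sketched as follows. The helper names are hypothetical, and using the plain uniform baseline of 1/400 as the threshold is an assumption; the source does not state the exact cutoff behind the red and blue marking.

```python
# Uniform expectation in per mille: 1000/400 for the 20 standard amino acids,
# 1000/441 if Xaa is counted as a 21st residue.
UNIFORM_400 = 1000.0 / 400
UNIFORM_441 = 1000.0 / 441


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent."""
    return value / 10.0


def classify(value, expected=UNIFORM_400):
    """Label a per mille frequency relative to the assumed baseline."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    # Values taken from the table: LeuLeu 10.409, AlaAla 1.487.
    for name, value in [("LeuLeu", 10.409), ("AlaAla", 1.487)]:
        print(name, per_mille_to_fraction(value), classify(value))
```

With this baseline, LeuLeu (10.409‰) falls above the uniform expectation and AlaAla (1.487‰) below it.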

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
1.487AlaAla: 1.487 ± 0.8
0.0AlaCys: 0.0 ± 0.0
2.974AlaAsp: 2.974 ± 1.17
2.602AlaGlu: 2.602 ± 1.399
3.717AlaPhe: 3.717 ± 1.999
1.115AlaGly: 1.115 ± 0.6
2.602AlaHis: 2.602 ± 0.952
5.204AlaIle: 5.204 ± 2.079
2.23AlaLys: 2.23 ± 1.688
7.807AlaLeu: 7.807 ± 3.109
2.23AlaMet: 2.23 ± 0.95
1.115AlaAsn: 1.115 ± 1.219
2.23AlaPro: 2.23 ± 1.199
4.089AlaGln: 4.089 ± 1.088
4.461AlaArg: 4.461 ± 2.052
2.23AlaSer: 2.23 ± 2.513
3.346AlaThr: 3.346 ± 2.069
2.602AlaVal: 2.602 ± 0.835
0.743AlaTrp: 0.743 ± 0.4
4.833AlaTyr: 4.833 ± 1.418
0.0AlaXaa: 0.0 ± 0.0
Cys
0.743CysAla: 0.743 ± 0.705
0.372CysCys: 0.372 ± 0.2
0.372CysAsp: 0.372 ± 0.2
1.115CysGlu: 1.115 ± 0.665
1.487CysPhe: 1.487 ± 0.8
0.743CysGly: 0.743 ± 0.4
0.743CysHis: 0.743 ± 0.4
0.743CysIle: 0.743 ± 0.705
2.23CysLys: 2.23 ± 0.95
1.115CysLeu: 1.115 ± 0.665
0.372CysMet: 0.372 ± 0.2
1.487CysAsn: 1.487 ± 0.684
1.115CysPro: 1.115 ± 0.698
1.859CysGln: 1.859 ± 1.0
0.743CysArg: 0.743 ± 0.4
0.743CysSer: 0.743 ± 0.917
0.0CysThr: 0.0 ± 0.0
0.743CysVal: 0.743 ± 0.4
0.0CysTrp: 0.0 ± 0.0
1.487CysTyr: 1.487 ± 0.8
0.0CysXaa: 0.0 ± 0.0
Asp
2.602AspAla: 2.602 ± 1.399
1.487AspCys: 1.487 ± 0.684
4.461AspAsp: 4.461 ± 1.648
3.717AspGlu: 3.717 ± 1.999
2.23AspPhe: 2.23 ± 1.199
2.974AspGly: 2.974 ± 1.176
1.115AspHis: 1.115 ± 0.665
1.115AspIle: 1.115 ± 0.6
2.23AspLys: 2.23 ± 0.777
6.32AspLeu: 6.32 ± 1.549
0.743AspMet: 0.743 ± 0.4
2.23AspAsn: 2.23 ± 0.871
4.089AspPro: 4.089 ± 1.471
1.859AspGln: 1.859 ± 0.757
3.346AspArg: 3.346 ± 1.597
2.974AspSer: 2.974 ± 1.176
3.717AspThr: 3.717 ± 1.168
0.743AspVal: 0.743 ± 0.4
0.372AspTrp: 0.372 ± 0.2
1.487AspTyr: 1.487 ± 0.734
0.0AspXaa: 0.0 ± 0.0
Glu
5.204GluAla: 5.204 ± 1.675
1.487GluCys: 1.487 ± 0.684
2.974GluAsp: 2.974 ± 0.772
10.037GluGlu: 10.037 ± 3.079
1.487GluPhe: 1.487 ± 0.801
5.204GluGly: 5.204 ± 2.182
1.487GluHis: 1.487 ± 0.844
3.346GluIle: 3.346 ± 1.597
5.576GluLys: 5.576 ± 3.762
3.717GluLeu: 3.717 ± 2.039
0.743GluMet: 0.743 ± 0.4
2.23GluAsn: 2.23 ± 1.199
1.859GluPro: 1.859 ± 0.729
3.717GluGln: 3.717 ± 0.889
4.089GluArg: 4.089 ± 1.272
2.974GluSer: 2.974 ± 1.601
2.23GluThr: 2.23 ± 2.789
5.576GluVal: 5.576 ± 1.467
1.115GluTrp: 1.115 ± 0.6
2.23GluTyr: 2.23 ± 0.885
0.0GluXaa: 0.0 ± 0.0
Phe
2.602PheAla: 2.602 ± 1.399
0.743PheCys: 0.743 ± 0.4
1.859PheAsp: 1.859 ± 1.0
1.115PheGlu: 1.115 ± 0.6
0.372PhePhe: 0.372 ± 0.2
0.743PheGly: 0.743 ± 0.917
0.743PheHis: 0.743 ± 0.4
4.089PheIle: 4.089 ± 1.088
1.487PheLys: 1.487 ± 0.801
1.859PheLeu: 1.859 ± 1.425
1.115PheMet: 1.115 ± 0.698
3.717PheAsn: 3.717 ± 1.557
0.372PhePro: 0.372 ± 1.012
2.23PheGln: 2.23 ± 1.199
2.602PheArg: 2.602 ± 0.926
2.602PheSer: 2.602 ± 0.952
2.23PheThr: 2.23 ± 1.199
1.487PheVal: 1.487 ± 0.8
0.372PheTrp: 0.372 ± 0.2
1.487PheTyr: 1.487 ± 0.734
0.0PheXaa: 0.0 ± 0.0
Gly
1.487GlyAla: 1.487 ± 0.8
0.743GlyCys: 0.743 ± 0.4
2.974GlyAsp: 2.974 ± 1.599
4.089GlyGlu: 4.089 ± 1.698
1.115GlyPhe: 1.115 ± 0.792
2.974GlyGly: 2.974 ± 0.82
0.372GlyHis: 0.372 ± 0.2
4.833GlyIle: 4.833 ± 0.902
5.204GlyLys: 5.204 ± 1.036
5.204GlyLeu: 5.204 ± 1.661
1.115GlyMet: 1.115 ± 0.698
2.23GlyAsn: 2.23 ± 0.886
1.487GlyPro: 1.487 ± 0.684
0.743GlyGln: 0.743 ± 0.4
3.717GlyArg: 3.717 ± 1.168
2.602GlySer: 2.602 ± 1.474
4.833GlyThr: 4.833 ± 2.179
2.602GlyVal: 2.602 ± 0.858
0.743GlyTrp: 0.743 ± 0.4
2.602GlyTyr: 2.602 ± 1.399
0.0GlyXaa: 0.0 ± 0.0
His
1.859HisAla: 1.859 ± 1.0
0.743HisCys: 0.743 ± 0.4
0.372HisAsp: 0.372 ± 0.2
2.602HisGlu: 2.602 ± 1.012
0.743HisPhe: 0.743 ± 0.74
0.743HisGly: 0.743 ± 0.4
0.743HisHis: 0.743 ± 0.74
2.23HisIle: 2.23 ± 0.95
1.487HisLys: 1.487 ± 1.094
2.602HisLeu: 2.602 ± 1.396
2.23HisMet: 2.23 ± 1.573
1.115HisAsn: 1.115 ± 0.844
1.487HisPro: 1.487 ± 0.712
1.115HisGln: 1.115 ± 0.665
1.487HisArg: 1.487 ± 0.712
2.23HisSer: 2.23 ± 0.855
1.115HisThr: 1.115 ± 0.844
1.487HisVal: 1.487 ± 0.8
0.0HisTrp: 0.0 ± 0.0
0.743HisTyr: 0.743 ± 0.705
0.0HisXaa: 0.0 ± 0.0
Ile
5.204IleAla: 5.204 ± 3.092
1.115IleCys: 1.115 ± 0.665
3.346IleAsp: 3.346 ± 1.799
2.974IleGlu: 2.974 ± 1.599
1.859IlePhe: 1.859 ± 0.779
3.717IleGly: 3.717 ± 1.168
2.23IleHis: 2.23 ± 1.396
4.089IleIle: 4.089 ± 2.103
1.487IleLys: 1.487 ± 1.421
5.576IleLeu: 5.576 ± 2.813
0.743IleMet: 0.743 ± 0.4
2.602IleAsn: 2.602 ± 2.028
3.717IlePro: 3.717 ± 1.517
6.691IleGln: 6.691 ± 2.867
2.974IleArg: 2.974 ± 0.831
6.32IleSer: 6.32 ± 3.754
3.346IleThr: 3.346 ± 1.14
3.717IleVal: 3.717 ± 1.498
0.743IleTrp: 0.743 ± 0.93
1.487IleTyr: 1.487 ± 0.734
0.0IleXaa: 0.0 ± 0.0
Lys
3.346LysAla: 3.346 ± 1.05
0.743LysCys: 0.743 ± 0.4
1.859LysAsp: 1.859 ± 0.729
4.461LysGlu: 4.461 ± 2.365
1.859LysPhe: 1.859 ± 1.0
4.833LysGly: 4.833 ± 2.023
2.602LysHis: 2.602 ± 1.399
2.602LysIle: 2.602 ± 1.022
2.974LysLys: 2.974 ± 2.985
5.204LysLeu: 5.204 ± 3.867
2.23LysMet: 2.23 ± 1.396
4.089LysAsn: 4.089 ± 2.168
2.23LysPro: 2.23 ± 2.17
4.089LysGln: 4.089 ± 3.436
3.346LysArg: 3.346 ± 1.45
4.833LysSer: 4.833 ± 1.524
2.974LysThr: 2.974 ± 1.031
2.602LysVal: 2.602 ± 2.265
1.115LysTrp: 1.115 ± 0.844
1.115LysTyr: 1.115 ± 0.698
0.0LysXaa: 0.0 ± 0.0
Leu
4.461LeuAla: 4.461 ± 2.191
2.602LeuCys: 2.602 ± 1.0
4.461LeuAsp: 4.461 ± 2.374
6.691LeuGlu: 6.691 ± 2.574
2.974LeuPhe: 2.974 ± 2.241
5.204LeuGly: 5.204 ± 2.138
1.859LeuHis: 1.859 ± 1.761
6.691LeuIle: 6.691 ± 3.694
5.576LeuLys: 5.576 ± 1.768
10.409LeuLeu: 10.409 ± 11.601
2.23LeuMet: 2.23 ± 2.823
3.346LeuAsn: 3.346 ± 1.777
4.461LeuPro: 4.461 ± 1.92
5.948LeuGln: 5.948 ± 1.287
4.089LeuArg: 4.089 ± 2.644
5.576LeuSer: 5.576 ± 2.314
5.576LeuThr: 5.576 ± 2.411
7.063LeuVal: 7.063 ± 3.528
1.115LeuTrp: 1.115 ± 0.698
2.974LeuTyr: 2.974 ± 1.528
0.0LeuXaa: 0.0 ± 0.0
Met
0.743MetAla: 0.743 ± 0.4
0.372MetCys: 0.372 ± 0.2
1.487MetAsp: 1.487 ± 0.712
1.487MetGlu: 1.487 ± 0.844
1.859MetPhe: 1.859 ± 0.876
1.487MetGly: 1.487 ± 0.8
0.743MetHis: 0.743 ± 0.4
1.115MetIle: 1.115 ± 1.181
2.602MetLys: 2.602 ± 1.473
1.859MetLeu: 1.859 ± 1.425
1.487MetMet: 1.487 ± 2.386
1.115MetAsn: 1.115 ± 0.858
2.602MetPro: 2.602 ± 1.012
1.859MetGln: 1.859 ± 1.076
1.859MetArg: 1.859 ± 1.425
2.602MetSer: 2.602 ± 1.085
2.23MetThr: 2.23 ± 2.584
1.859MetVal: 1.859 ± 0.779
0.0MetTrp: 0.0 ± 0.0
0.372MetTyr: 0.372 ± 0.2
0.0MetXaa: 0.0 ± 0.0
Asn
1.487AsnAla: 1.487 ± 1.922
0.372AsnCys: 0.372 ± 0.2
1.859AsnAsp: 1.859 ± 1.0
1.115AsnGlu: 1.115 ± 0.665
1.859AsnPhe: 1.859 ± 0.806
1.859AsnGly: 1.859 ± 0.876
1.859AsnHis: 1.859 ± 1.425
2.23AsnIle: 2.23 ± 1.199
2.23AsnLys: 2.23 ± 1.199
6.32AsnLeu: 6.32 ± 4.56
2.23AsnMet: 2.23 ± 0.886
2.602AsnAsn: 2.602 ± 1.312
2.602AsnPro: 2.602 ± 1.116
3.717AsnGln: 3.717 ± 1.269
1.115AsnArg: 1.115 ± 0.665
3.717AsnSer: 3.717 ± 1.725
3.346AsnThr: 3.346 ± 0.851
2.974AsnVal: 2.974 ± 1.083
0.743AsnTrp: 0.743 ± 0.4
0.743AsnTyr: 0.743 ± 0.4
0.0AsnXaa: 0.0 ± 0.0
Pro
5.204ProAla: 5.204 ± 2.014
0.372ProCys: 0.372 ± 0.2
3.346ProAsp: 3.346 ± 1.14
1.859ProGlu: 1.859 ± 0.729
1.859ProPhe: 1.859 ± 1.0
2.974ProGly: 2.974 ± 1.17
2.23ProHis: 2.23 ± 0.924
2.974ProIle: 2.974 ± 1.811
2.974ProLys: 2.974 ± 1.953
4.461ProLeu: 4.461 ± 3.901
1.859ProMet: 1.859 ± 1.0
1.115ProAsn: 1.115 ± 0.6
3.346ProPro: 3.346 ± 1.343
2.974ProGln: 2.974 ± 0.82
1.859ProArg: 1.859 ± 1.0
6.32ProSer: 6.32 ± 2.834
4.461ProThr: 4.461 ± 0.835
2.23ProVal: 2.23 ± 0.871
0.0ProTrp: 0.0 ± 0.0
1.487ProTyr: 1.487 ± 0.734
0.0ProXaa: 0.0 ± 0.0
Gln
2.974GlnAla: 2.974 ± 0.831
0.743GlnCys: 0.743 ± 0.4
4.089GlnAsp: 4.089 ± 1.542
3.717GlnGlu: 3.717 ± 0.958
0.372GlnPhe: 0.372 ± 0.2
2.23GlnGly: 2.23 ± 0.777
2.602GlnHis: 2.602 ± 1.0
4.833GlnIle: 4.833 ± 1.117
2.974GlnLys: 2.974 ± 1.601
6.32GlnLeu: 6.32 ± 3.194
1.859GlnMet: 1.859 ± 0.624
3.346GlnAsn: 3.346 ± 1.501
3.346GlnPro: 3.346 ± 1.046
5.576GlnGln: 5.576 ± 2.05
2.602GlnArg: 2.602 ± 1.335
3.346GlnSer: 3.346 ± 1.078
1.115GlnThr: 1.115 ± 0.698
4.461GlnVal: 4.461 ± 0.835
1.115GlnTrp: 1.115 ± 0.6
1.487GlnTyr: 1.487 ± 0.8
0.0GlnXaa: 0.0 ± 0.0
Arg
2.23ArgAla: 2.23 ± 0.9
0.372ArgCys: 0.372 ± 0.2
3.717ArgAsp: 3.717 ± 1.514
3.346ArgGlu: 3.346 ± 1.493
0.743ArgPhe: 0.743 ± 0.705
1.859ArgGly: 1.859 ± 0.95
1.115ArgHis: 1.115 ± 0.6
5.576ArgIle: 5.576 ± 2.771
2.974ArgLys: 2.974 ± 1.741
6.32ArgLeu: 6.32 ± 1.78
2.602ArgMet: 2.602 ± 1.022
1.859ArgAsn: 1.859 ± 1.0
4.089ArgPro: 4.089 ± 2.199
2.602ArgGln: 2.602 ± 2.329
5.576ArgArg: 5.576 ± 2.011
4.833ArgSer: 4.833 ± 1.117
2.974ArgThr: 2.974 ± 1.368
2.602ArgVal: 2.602 ± 1.633
1.487ArgTrp: 1.487 ± 0.801
1.859ArgTyr: 1.859 ± 0.779
0.0ArgXaa: 0.0 ± 0.0
Ser
3.346SerAla: 3.346 ± 1.429
1.859SerCys: 1.859 ± 0.876
1.859SerAsp: 1.859 ± 0.757
5.204SerGlu: 5.204 ± 2.581
3.346SerPhe: 3.346 ± 1.799
4.461SerGly: 4.461 ± 1.26
1.859SerHis: 1.859 ± 1.765
4.089SerIle: 4.089 ± 1.654
4.089SerLys: 4.089 ± 1.495
6.32SerLeu: 6.32 ± 2.678
1.859SerMet: 1.859 ± 1.306
3.346SerAsn: 3.346 ± 0.851
4.461SerPro: 4.461 ± 1.349
1.115SerGln: 1.115 ± 0.665
5.576SerArg: 5.576 ± 1.44
6.691SerSer: 6.691 ± 1.872
5.576SerThr: 5.576 ± 1.23
2.23SerVal: 2.23 ± 2.165
0.743SerTrp: 0.743 ± 0.4
2.974SerTyr: 2.974 ± 1.189
0.0SerXaa: 0.0 ± 0.0
Thr
4.089ThrAla: 4.089 ± 0.869
1.487ThrCys: 1.487 ± 1.041
2.602ThrAsp: 2.602 ± 1.399
4.461ThrGlu: 4.461 ± 3.544
1.487ThrPhe: 1.487 ± 1.125
4.461ThrGly: 4.461 ± 1.648
1.115ThrHis: 1.115 ± 0.698
2.23ThrIle: 2.23 ± 0.95
2.602ThrLys: 2.602 ± 0.869
4.461ThrLeu: 4.461 ± 4.303
1.487ThrMet: 1.487 ± 0.997
2.602ThrAsn: 2.602 ± 0.926
3.717ThrPro: 3.717 ± 0.925
2.23ThrGln: 2.23 ± 1.199
4.461ThrArg: 4.461 ± 1.007
4.833ThrSer: 4.833 ± 1.437
5.576ThrThr: 5.576 ± 2.448
2.974ThrVal: 2.974 ± 1.176
0.372ThrTrp: 0.372 ± 0.2
2.602ThrTyr: 2.602 ± 1.514
0.0ThrXaa: 0.0 ± 0.0
Val
5.204ValAla: 5.204 ± 1.462
1.487ValCys: 1.487 ± 0.8
2.974ValAsp: 2.974 ± 1.264
3.346ValGlu: 3.346 ± 4.06
2.974ValPhe: 2.974 ± 1.189
1.115ValGly: 1.115 ± 0.698
1.115ValHis: 1.115 ± 0.844
2.23ValIle: 2.23 ± 1.573
4.461ValLys: 4.461 ± 1.788
2.23ValLeu: 2.23 ± 1.573
1.859ValMet: 1.859 ± 1.663
2.974ValAsn: 2.974 ± 2.251
2.974ValPro: 2.974 ± 1.17
3.717ValGln: 3.717 ± 1.999
2.974ValArg: 2.974 ± 1.599
2.602ValSer: 2.602 ± 1.785
3.717ValThr: 3.717 ± 0.904
3.346ValVal: 3.346 ± 1.387
0.372ValTrp: 0.372 ± 0.2
1.487ValTyr: 1.487 ± 0.801
0.0ValXaa: 0.0 ± 0.0
Trp
0.743TrpAla: 0.743 ± 0.4
0.372TrpCys: 0.372 ± 0.2
0.743TrpAsp: 0.743 ± 0.4
1.115TrpGlu: 1.115 ± 0.844
0.372TrpPhe: 0.372 ± 0.2
0.372TrpGly: 0.372 ± 0.2
0.0TrpHis: 0.0 ± 0.0
0.743TrpIle: 0.743 ± 0.4
0.372TrpLys: 0.372 ± 0.2
1.487TrpLeu: 1.487 ± 0.8
0.0TrpMet: 0.0 ± 0.0
0.372TrpAsn: 0.372 ± 0.2
0.372TrpPro: 0.372 ± 0.2
1.487TrpGln: 1.487 ± 0.712
0.743TrpArg: 0.743 ± 0.4
0.743TrpSer: 0.743 ± 0.4
0.743TrpThr: 0.743 ± 0.4
0.743TrpVal: 0.743 ± 0.93
0.372TrpTrp: 0.372 ± 0.2
0.372TrpTyr: 0.372 ± 1.047
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.602TyrAla: 2.602 ± 1.388
0.743TyrCys: 0.743 ± 0.4
1.487TyrAsp: 1.487 ± 0.801
2.23TyrGlu: 2.23 ± 1.588
1.115TyrPhe: 1.115 ± 0.6
2.602TyrGly: 2.602 ± 1.396
0.0TyrHis: 0.0 ± 0.0
2.974TyrIle: 2.974 ± 1.021
3.346TyrLys: 3.346 ± 2.605
3.717TyrLeu: 3.717 ± 1.031
0.372TyrMet: 0.372 ± 0.187
1.487TyrAsn: 1.487 ± 0.8
3.346TyrPro: 3.346 ± 1.052
1.487TyrGln: 1.487 ± 0.734
1.115TyrArg: 1.115 ± 0.6
2.602TyrSer: 2.602 ± 0.869
0.743TyrThr: 0.743 ± 0.4
1.115TyrVal: 1.115 ± 0.6
0.743TyrTrp: 0.743 ± 0.4
1.487TyrTyr: 1.487 ± 0.8
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2691 amino acids)

Note: The error has been estimated by bootstrapping (100 resamplings) at the protein level.
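A protein-level bootstrap of this kind can be sketched as below. The source does not describe the exact procedure, so everything here is an assumption made for illustration: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the sample standard deviation over 100 rounds is reported as the error. The function names are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins):
    """Dipeptide frequencies in per mille, counting pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Estimate the error of one dipeptide frequency by resampling proteins.

    Assumption: the error is the standard deviation of the frequency over
    `rounds` resamples, each drawing len(proteins) proteins with replacement.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(rounds):
        sample = rng.choices(proteins, k=len(proteins))
        values.append(pair_per_mille(sample).get(pair, 0.0))
    return statistics.stdev(values)


if __name__ == "__main__":
    toy_proteome = ["AAAA", "CCCC", "ACAC"]  # made-up sequences
    print(bootstrap_error(toy_proteome, "AA"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, which is one way a dipeptide concentrated in a single protein (such as a repeat) can end up with an error larger than its mean.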

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski