Amino acid dipepetide frequency for Zucchini green mottle mosaic virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.439AlaAla: 10.439 ± 1.106
1.228AlaCys: 1.228 ± 0.449
4.298AlaAsp: 4.298 ± 1.191
2.763AlaGlu: 2.763 ± 0.359
4.912AlaPhe: 4.912 ± 1.248
5.527AlaGly: 5.527 ± 0.753
0.921AlaHis: 0.921 ± 0.607
3.377AlaIle: 3.377 ± 0.745
4.912AlaLys: 4.912 ± 1.383
7.369AlaLeu: 7.369 ± 2.714
0.614AlaMet: 0.614 ± 0.187
3.684AlaAsn: 3.684 ± 1.018
1.228AlaPro: 1.228 ± 0.375
1.228AlaGln: 1.228 ± 0.375
1.535AlaArg: 1.535 ± 0.54
7.062AlaSer: 7.062 ± 4.023
5.22AlaThr: 5.22 ± 1.142
5.22AlaVal: 5.22 ± 0.564
1.535AlaTrp: 1.535 ± 0.425
0.614AlaTyr: 0.614 ± 0.411
0.0AlaXaa: 0.0 ± 0.0
Cys
0.307CysAla: 0.307 ± 0.696
0.614CysCys: 0.614 ± 0.187
0.921CysAsp: 0.921 ± 0.276
2.456CysGlu: 2.456 ± 0.75
1.228CysPhe: 1.228 ± 0.449
1.228CysGly: 1.228 ± 0.449
1.228CysHis: 1.228 ± 0.375
0.921CysIle: 0.921 ± 0.276
0.614CysLys: 0.614 ± 0.663
1.842CysLeu: 1.842 ± 0.372
0.0CysMet: 0.0 ± 0.0
1.228CysAsn: 1.228 ± 0.375
1.228CysPro: 1.228 ± 0.375
0.614CysGln: 0.614 ± 0.187
0.307CysArg: 0.307 ± 0.206
0.0CysSer: 0.0 ± 0.0
2.456CysThr: 2.456 ± 0.75
2.149CysVal: 2.149 ± 0.596
0.0CysTrp: 0.0 ± 0.0
0.614CysTyr: 0.614 ± 0.187
0.0CysXaa: 0.0 ± 0.0
Asp
5.527AspAla: 5.527 ± 0.483
1.842AspCys: 1.842 ± 0.562
4.912AspAsp: 4.912 ± 0.931
4.298AspGlu: 4.298 ± 0.616
3.991AspPhe: 3.991 ± 0.689
1.842AspGly: 1.842 ± 0.562
0.307AspHis: 0.307 ± 0.206
4.298AspIle: 4.298 ± 1.006
4.605AspLys: 4.605 ± 0.642
2.763AspLeu: 2.763 ± 0.895
0.614AspMet: 0.614 ± 0.663
3.07AspAsn: 3.07 ± 0.989
1.842AspPro: 1.842 ± 0.552
0.614AspGln: 0.614 ± 0.187
3.07AspArg: 3.07 ± 0.589
3.684AspSer: 3.684 ± 2.972
4.912AspThr: 4.912 ± 0.785
5.22AspVal: 5.22 ± 1.441
0.614AspTrp: 0.614 ± 0.187
3.07AspTyr: 3.07 ± 0.344
0.0AspXaa: 0.0 ± 0.0
Glu
5.527GluAla: 5.527 ± 1.205
0.307GluCys: 0.307 ± 0.696
2.149GluAsp: 2.149 ± 0.377
2.456GluGlu: 2.456 ± 0.692
4.912GluPhe: 4.912 ± 0.727
3.377GluGly: 3.377 ± 1.104
0.921GluHis: 0.921 ± 0.276
2.149GluIle: 2.149 ± 0.418
2.763GluLys: 2.763 ± 0.411
6.448GluLeu: 6.448 ± 0.55
1.535GluMet: 1.535 ± 0.425
2.456GluAsn: 2.456 ± 0.652
0.307GluPro: 0.307 ± 0.206
2.149GluGln: 2.149 ± 0.596
3.377GluArg: 3.377 ± 0.964
6.448GluSer: 6.448 ± 2.126
3.684GluThr: 3.684 ± 1.124
2.763GluVal: 2.763 ± 0.668
1.228GluTrp: 1.228 ± 0.375
1.535GluTyr: 1.535 ± 0.642
0.0GluXaa: 0.0 ± 0.0
Phe
1.535PheAla: 1.535 ± 0.425
2.456PheCys: 2.456 ± 0.692
4.298PheAsp: 4.298 ± 0.218
3.07PheGlu: 3.07 ± 0.608
4.912PhePhe: 4.912 ± 0.373
2.456PheGly: 2.456 ± 1.353
2.763PheHis: 2.763 ± 0.828
1.842PheIle: 1.842 ± 0.552
1.228PheLys: 1.228 ± 0.449
2.763PheLeu: 2.763 ± 1.089
1.535PheMet: 1.535 ± 0.431
2.456PheAsn: 2.456 ± 0.544
2.763PhePro: 2.763 ± 1.077
2.763PheGln: 2.763 ± 0.774
3.684PheArg: 3.684 ± 0.521
7.676PheSer: 7.676 ± 0.45
3.377PheThr: 3.377 ± 0.391
2.763PheVal: 2.763 ± 0.895
0.307PheTrp: 0.307 ± 0.206
1.228PheTyr: 1.228 ± 0.722
0.0PheXaa: 0.0 ± 0.0
Gly
2.456GlyAla: 2.456 ± 2.125
1.535GlyCys: 1.535 ± 0.425
2.149GlyAsp: 2.149 ± 0.418
2.149GlyGlu: 2.149 ± 0.591
1.842GlyPhe: 1.842 ± 1.162
2.149GlyGly: 2.149 ± 1.032
0.307GlyHis: 0.307 ± 0.206
1.842GlyIle: 1.842 ± 0.5
3.991GlyLys: 3.991 ± 0.359
8.29GlyLeu: 8.29 ± 0.895
0.614GlyMet: 0.614 ± 0.187
2.149GlyAsn: 2.149 ± 0.717
2.149GlyPro: 2.149 ± 0.596
0.921GlyGln: 0.921 ± 0.276
2.149GlyArg: 2.149 ± 0.596
3.377GlySer: 3.377 ± 0.428
1.228GlyThr: 1.228 ± 0.504
5.22GlyVal: 5.22 ± 3.3
0.614GlyTrp: 0.614 ± 0.187
1.535GlyTyr: 1.535 ± 0.425
0.0GlyXaa: 0.0 ± 0.0
His
1.535HisAla: 1.535 ± 0.425
0.614HisCys: 0.614 ± 0.187
1.228HisAsp: 1.228 ± 0.375
1.535HisGlu: 1.535 ± 0.425
0.614HisPhe: 0.614 ± 0.187
0.0HisGly: 0.0 ± 0.0
1.535HisHis: 1.535 ± 0.425
0.921HisIle: 0.921 ± 0.276
1.535HisLys: 1.535 ± 0.425
2.149HisLeu: 2.149 ± 0.717
2.149HisMet: 2.149 ± 0.596
0.307HisAsn: 0.307 ± 0.695
0.0HisPro: 0.0 ± 0.0
0.307HisGln: 0.307 ± 0.206
0.0HisArg: 0.0 ± 0.0
4.605HisSer: 4.605 ± 1.325
1.842HisThr: 1.842 ± 0.562
0.614HisVal: 0.614 ± 0.187
0.0HisTrp: 0.0 ± 0.0
1.228HisTyr: 1.228 ± 0.375
0.0HisXaa: 0.0 ± 0.0
Ile
3.991IleAla: 3.991 ± 0.359
0.921IleCys: 0.921 ± 0.548
2.149IleAsp: 2.149 ± 0.591
3.991IleGlu: 3.991 ± 0.921
2.149IlePhe: 2.149 ± 0.377
3.684IleGly: 3.684 ± 1.018
1.842IleHis: 1.842 ± 0.552
3.377IleIle: 3.377 ± 0.956
1.842IleLys: 1.842 ± 1.162
3.07IleLeu: 3.07 ± 1.22
0.614IleMet: 0.614 ± 0.187
1.535IleAsn: 1.535 ± 0.425
3.684IlePro: 3.684 ± 1.74
0.614IleGln: 0.614 ± 0.411
3.991IleArg: 3.991 ± 0.921
4.912IleSer: 4.912 ± 1.089
3.07IleThr: 3.07 ± 0.589
4.605IleVal: 4.605 ± 1.274
0.307IleTrp: 0.307 ± 0.206
1.842IleTyr: 1.842 ± 0.552
0.0IleXaa: 0.0 ± 0.0
Lys
5.834LysAla: 5.834 ± 1.613
0.0LysCys: 0.0 ± 0.0
2.149LysAsp: 2.149 ± 0.591
2.456LysGlu: 2.456 ± 0.305
3.377LysPhe: 3.377 ± 0.779
2.149LysGly: 2.149 ± 1.032
0.614LysHis: 0.614 ± 0.187
4.912LysIle: 4.912 ± 1.499
5.22LysLys: 5.22 ± 0.667
3.07LysLeu: 3.07 ± 1.863
0.614LysMet: 0.614 ± 0.397
2.763LysAsn: 2.763 ± 0.359
1.842LysPro: 1.842 ± 1.875
1.842LysGln: 1.842 ± 0.562
4.605LysArg: 4.605 ± 0.899
7.369LysSer: 7.369 ± 0.8
3.377LysThr: 3.377 ± 0.964
5.834LysVal: 5.834 ± 0.729
0.921LysTrp: 0.921 ± 0.276
1.228LysTyr: 1.228 ± 0.776
0.307LysXaa: 0.307 ± 0.206
Leu
3.991LeuAla: 3.991 ± 1.263
1.842LeuCys: 1.842 ± 0.552
4.912LeuAsp: 4.912 ± 1.034
7.369LeuGlu: 7.369 ± 1.0
3.991LeuPhe: 3.991 ± 0.804
1.535LeuGly: 1.535 ± 0.9
1.842LeuHis: 1.842 ± 0.562
4.912LeuIle: 4.912 ± 1.259
6.448LeuLys: 6.448 ± 0.707
6.141LeuLeu: 6.141 ± 0.905
1.842LeuMet: 1.842 ± 0.346
2.456LeuAsn: 2.456 ± 1.353
5.527LeuPro: 5.527 ± 0.483
3.991LeuGln: 3.991 ± 0.804
4.298LeuArg: 4.298 ± 0.832
7.983LeuSer: 7.983 ± 1.094
5.22LeuThr: 5.22 ± 0.804
6.755LeuVal: 6.755 ± 1.399
0.614LeuTrp: 0.614 ± 0.187
0.921LeuTyr: 0.921 ± 0.607
0.0LeuXaa: 0.0 ± 0.0
Met
1.535MetAla: 1.535 ± 0.425
0.0MetCys: 0.0 ± 0.0
2.149MetAsp: 2.149 ± 0.596
0.614MetGlu: 0.614 ± 0.187
0.307MetPhe: 0.307 ± 0.696
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
1.228MetIle: 1.228 ± 0.449
0.614MetLys: 0.614 ± 0.187
1.842MetLeu: 1.842 ± 0.562
1.228MetMet: 1.228 ± 0.375
1.228MetAsn: 1.228 ± 0.375
0.614MetPro: 0.614 ± 0.641
1.535MetGln: 1.535 ± 0.425
0.921MetArg: 0.921 ± 0.276
1.535MetSer: 1.535 ± 1.183
0.614MetThr: 0.614 ± 0.187
2.763MetVal: 2.763 ± 0.895
0.0MetTrp: 0.0 ± 0.0
1.228MetTyr: 1.228 ± 0.375
0.0MetXaa: 0.0 ± 0.0
Asn
1.535AsnAla: 1.535 ± 0.554
0.307AsnCys: 0.307 ± 0.696
2.149AsnAsp: 2.149 ± 1.133
2.456AsnGlu: 2.456 ± 0.524
5.22AsnPhe: 5.22 ± 1.512
0.921AsnGly: 0.921 ± 0.607
0.614AsnHis: 0.614 ± 0.187
2.456AsnIle: 2.456 ± 0.75
1.535AsnLys: 1.535 ± 0.425
4.298AsnLeu: 4.298 ± 0.444
0.614AsnMet: 0.614 ± 0.187
0.0AsnAsn: 0.0 ± 0.0
2.456AsnPro: 2.456 ± 1.162
0.614AsnGln: 0.614 ± 0.641
1.842AsnArg: 1.842 ± 0.815
1.228AsnSer: 1.228 ± 0.776
1.228AsnThr: 1.228 ± 0.449
3.991AsnVal: 3.991 ± 0.497
0.307AsnTrp: 0.307 ± 0.696
2.149AsnTyr: 2.149 ± 0.377
0.0AsnXaa: 0.0 ± 0.0
Pro
3.07ProAla: 3.07 ± 0.608
2.149ProCys: 2.149 ± 0.596
2.149ProAsp: 2.149 ± 0.418
2.763ProGlu: 2.763 ± 0.359
0.921ProPhe: 0.921 ± 0.548
3.377ProGly: 3.377 ± 0.391
0.614ProHis: 0.614 ± 0.187
3.377ProIle: 3.377 ± 0.515
1.535ProLys: 1.535 ± 0.425
3.684ProLeu: 3.684 ± 0.521
0.0ProMet: 0.0 ± 0.0
1.842ProAsn: 1.842 ± 0.372
0.921ProPro: 0.921 ± 0.691
1.842ProGln: 1.842 ± 0.815
1.535ProArg: 1.535 ± 0.425
1.842ProSer: 1.842 ± 2.241
2.149ProThr: 2.149 ± 0.591
3.377ProVal: 3.377 ± 0.673
0.307ProTrp: 0.307 ± 0.696
2.149ProTyr: 2.149 ± 1.133
0.0ProXaa: 0.0 ± 0.0
Gln
3.07GlnAla: 3.07 ± 0.989
0.921GlnCys: 0.921 ± 0.276
2.456GlnAsp: 2.456 ± 0.75
1.535GlnGlu: 1.535 ± 0.431
1.535GlnPhe: 1.535 ± 0.569
2.149GlnGly: 2.149 ± 0.591
0.614GlnHis: 0.614 ± 0.187
2.149GlnIle: 2.149 ± 0.591
1.228GlnLys: 1.228 ± 0.375
1.535GlnLeu: 1.535 ± 0.642
2.149GlnMet: 2.149 ± 0.596
0.307GlnAsn: 0.307 ± 0.695
2.149GlnPro: 2.149 ± 0.418
1.535GlnGln: 1.535 ± 0.425
0.921GlnArg: 0.921 ± 0.276
3.07GlnSer: 3.07 ± 0.85
2.149GlnThr: 2.149 ± 0.51
3.07GlnVal: 3.07 ± 0.937
0.614GlnTrp: 0.614 ± 0.187
0.614GlnTyr: 0.614 ± 0.187
0.0GlnXaa: 0.0 ± 0.0
Arg
2.149ArgAla: 2.149 ± 1.452
1.535ArgCys: 1.535 ± 0.425
1.842ArgAsp: 1.842 ± 1.875
1.228ArgGlu: 1.228 ± 1.291
0.0ArgPhe: 0.0 ± 0.0
0.307ArgGly: 0.307 ± 0.696
0.614ArgHis: 0.614 ± 0.187
2.456ArgIle: 2.456 ± 0.652
3.991ArgLys: 3.991 ± 1.114
3.377ArgLeu: 3.377 ± 0.964
0.921ArgMet: 0.921 ± 0.548
2.763ArgAsn: 2.763 ± 1.077
1.535ArgPro: 1.535 ± 0.425
0.921ArgGln: 0.921 ± 0.276
1.842ArgArg: 1.842 ± 0.562
6.755ArgSer: 6.755 ± 0.179
2.149ArgThr: 2.149 ± 0.596
7.369ArgVal: 7.369 ± 0.925
0.0ArgTrp: 0.0 ± 0.0
5.22ArgTyr: 5.22 ± 0.804
0.0ArgXaa: 0.0 ± 0.0
Ser
7.369SerAla: 7.369 ± 1.966
1.228SerCys: 1.228 ± 0.375
3.991SerAsp: 3.991 ± 0.497
5.834SerGlu: 5.834 ± 1.748
5.527SerPhe: 5.527 ± 1.195
4.298SerGly: 4.298 ± 2.083
0.614SerHis: 0.614 ± 0.187
3.991SerIle: 3.991 ± 0.764
7.983SerLys: 7.983 ± 3.946
8.29SerLeu: 8.29 ± 1.432
2.149SerMet: 2.149 ± 0.763
0.921SerAsn: 0.921 ± 0.607
2.456SerPro: 2.456 ± 0.686
3.684SerGln: 3.684 ± 0.731
3.991SerArg: 3.991 ± 2.25
6.141SerSer: 6.141 ± 1.376
4.605SerThr: 4.605 ± 2.278
11.667SerVal: 11.667 ± 3.593
1.842SerTrp: 1.842 ± 0.562
1.842SerTyr: 1.842 ± 0.372
0.0SerXaa: 0.0 ± 0.0
Thr
4.912ThrAla: 4.912 ± 1.259
1.228ThrCys: 1.228 ± 0.375
4.605ThrAsp: 4.605 ± 0.677
3.07ThrGlu: 3.07 ± 0.937
4.605ThrPhe: 4.605 ± 1.274
3.377ThrGly: 3.377 ± 0.783
2.763ThrHis: 2.763 ± 0.774
2.149ThrIle: 2.149 ± 1.032
3.684ThrLys: 3.684 ± 1.124
5.22ThrLeu: 5.22 ± 0.634
0.921ThrMet: 0.921 ± 0.276
1.228ThrAsn: 1.228 ± 0.963
3.07ThrPro: 3.07 ± 0.85
3.991ThrGln: 3.991 ± 0.921
1.842ThrArg: 1.842 ± 0.372
3.684ThrSer: 3.684 ± 1.63
3.377ThrThr: 3.377 ± 0.391
7.983ThrVal: 7.983 ± 1.073
0.614ThrTrp: 0.614 ± 0.187
2.456ThrTyr: 2.456 ± 0.75
0.0ThrXaa: 0.0 ± 0.0
Val
6.448ValAla: 6.448 ± 1.13
1.228ValCys: 1.228 ± 0.375
7.983ValAsp: 7.983 ± 2.029
4.605ValGlu: 4.605 ± 0.881
3.377ValPhe: 3.377 ± 1.138
6.141ValGly: 6.141 ± 0.541
3.684ValHis: 3.684 ± 1.124
4.605ValIle: 4.605 ± 1.256
5.834ValLys: 5.834 ± 1.204
5.22ValLeu: 5.22 ± 1.142
1.228ValMet: 1.228 ± 0.375
4.298ValAsn: 4.298 ± 0.616
2.456ValPro: 2.456 ± 0.305
2.763ValGln: 2.763 ± 0.411
5.527ValArg: 5.527 ± 1.115
6.755ValSer: 6.755 ± 1.558
8.904ValThr: 8.904 ± 1.064
7.062ValVal: 7.062 ± 0.809
2.149ValTrp: 2.149 ± 1.434
2.456ValTyr: 2.456 ± 0.692
0.0ValXaa: 0.0 ± 0.0
Trp
0.614TrpAla: 0.614 ± 0.187
0.0TrpCys: 0.0 ± 0.0
0.614TrpAsp: 0.614 ± 0.187
0.614TrpGlu: 0.614 ± 0.411
1.535TrpPhe: 1.535 ± 0.425
0.614TrpGly: 0.614 ± 0.187
0.0TrpHis: 0.0 ± 0.0
1.228TrpIle: 1.228 ± 0.375
0.614TrpLys: 0.614 ± 0.187
0.307TrpLeu: 0.307 ± 0.206
0.0TrpMet: 0.0 ± 0.0
0.307TrpAsn: 0.307 ± 0.206
0.614TrpPro: 0.614 ± 0.187
0.614TrpGln: 0.614 ± 0.187
1.228TrpArg: 1.228 ± 0.375
1.228TrpSer: 1.228 ± 0.963
0.921TrpThr: 0.921 ± 1.05
0.921TrpVal: 0.921 ± 0.548
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.842TyrAla: 1.842 ± 0.562
0.0TyrCys: 0.0 ± 0.0
3.991TyrAsp: 3.991 ± 1.114
1.228TyrGlu: 1.228 ± 0.449
0.921TyrPhe: 0.921 ± 0.617
1.535TyrGly: 1.535 ± 0.425
0.921TyrHis: 0.921 ± 0.276
0.0TyrIle: 0.0 ± 0.0
0.614TyrLys: 0.614 ± 0.187
4.912TyrLeu: 4.912 ± 0.25
0.307TyrMet: 0.307 ± 0.206
0.921TyrAsn: 0.921 ± 0.607
2.763TyrPro: 2.763 ± 0.828
0.921TyrGln: 0.921 ± 0.276
0.307TyrArg: 0.307 ± 0.206
3.07TyrSer: 3.07 ± 0.589
4.298TyrThr: 4.298 ± 1.191
3.377TyrVal: 3.377 ± 1.812
0.0TyrTrp: 0.0 ± 0.0
1.842TyrTyr: 1.842 ± 0.372
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.307XaaGln: 0.307 ± 0.206
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3258 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski