Amino acid dipeptide frequency for Opium poppy mosaic virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
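As a rough illustration of how such a frequency could be computed, here is a minimal Python sketch. The function name dipeptide_per_mille, the one-letter input format and the toy sequences are assumptions made for this example; the sketch counts overlapping pairs within each protein separately, which may differ from how the database itself handles protein boundaries.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping windows."""
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard residues to X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Slide a window of width 2 along each protein separately.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome).
freqs = dipeptide_per_mille(["MKVLAAGA", "AAGU"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

The non-standard residue U in the second toy sequence is mapped to X, so it contributes to the GX pair rather than being dropped.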

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
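The arithmetic behind the expected value can be sketched in a few lines of Python. The helper names expected_per_mille and classify are made up for this example, and the simple greater-than or less-than comparison is an assumption; the exact rule used for the colouring is not stated here.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Compare an observed value with the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


print(expected_per_mille(20))            # 400 possible dipeptides
print(round(expected_per_mille(21), 3))  # 441 possible dipeptides, with Xaa
print(classify(10.119))                  # ProArg value from the table
print(classify(0.595))                   # AlaHis value from the table
```

With 20 amino acids the expectation is 2.5‰, so ProArg (10.119‰) falls well above it and AlaHis (0.595‰) well below.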

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions; for example, 6.548‰ corresponds to a fraction of 0.006548.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.548 ± 1.492
AlaCys: 3.571 ± 1.351
AlaAsp: 3.571 ± 0.894
AlaGlu: 4.762 ± 2.046
AlaPhe: 1.19 ± 0.601
AlaGly: 4.762 ± 0.384
AlaHis: 0.595 ± 0.729
AlaIle: 4.167 ± 0.82
AlaLys: 7.143 ± 2.319
AlaLeu: 7.738 ± 1.754
AlaMet: 1.786 ± 0.656
AlaAsn: 1.19 ± 0.517
AlaPro: 6.548 ± 2.313
AlaGln: 4.167 ± 1.245
AlaArg: 5.357 ± 1.389
AlaSer: 7.738 ± 1.721
AlaThr: 7.738 ± 1.989
AlaVal: 4.762 ± 0.666
AlaTrp: 0.595 ± 0.347
AlaTyr: 2.381 ± 1.034
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.19 ± 0.694
CysCys: 0.595 ± 0.347
CysAsp: 0.0 ± 0.0
CysGlu: 0.595 ± 0.347
CysPhe: 2.381 ± 0.577
CysGly: 1.19 ± 0.667
CysHis: 0.0 ± 0.0
CysIle: 1.19 ± 0.694
CysLys: 1.19 ± 0.601
CysLeu: 0.595 ± 0.729
CysMet: 0.0 ± 0.0
CysAsn: 1.19 ± 1.569
CysPro: 1.19 ± 0.517
CysGln: 1.786 ± 0.656
CysArg: 3.571 ± 1.068
CysSer: 0.0 ± 0.0
CysThr: 3.571 ± 1.311
CysVal: 2.381 ± 0.573
CysTrp: 0.595 ± 0.785
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.167 ± 1.551
AspCys: 1.19 ± 0.694
AspAsp: 1.786 ± 0.657
AspGlu: 3.571 ± 1.311
AspPhe: 5.952 ± 2.258
AspGly: 3.571 ± 2.035
AspHis: 1.786 ± 0.717
AspIle: 1.786 ± 0.657
AspLys: 1.786 ± 0.676
AspLeu: 4.167 ± 0.997
AspMet: 0.595 ± 0.347
AspAsn: 0.595 ± 0.729
AspPro: 3.571 ± 1.435
AspGln: 3.571 ± 1.311
AspArg: 3.571 ± 0.864
AspSer: 0.595 ± 0.347
AspThr: 0.595 ± 0.347
AspVal: 2.381 ± 0.862
AspTrp: 1.19 ± 0.517
AspTyr: 1.786 ± 0.656
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.143 ± 1.405
GluCys: 1.786 ± 0.679
GluAsp: 2.381 ± 0.573
GluGlu: 8.333 ± 1.823
GluPhe: 0.0 ± 0.0
GluGly: 2.381 ± 0.908
GluHis: 0.595 ± 0.347
GluIle: 4.762 ± 0.666
GluLys: 1.19 ± 0.601
GluLeu: 5.952 ± 1.078
GluMet: 0.595 ± 0.347
GluAsn: 0.595 ± 0.785
GluPro: 3.571 ± 0.89
GluGln: 4.762 ± 1.145
GluArg: 5.357 ± 1.365
GluSer: 1.19 ± 0.694
GluThr: 3.571 ± 1.073
GluVal: 4.762 ± 0.94
GluTrp: 1.19 ± 0.517
GluTyr: 0.595 ± 0.729
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.976 ± 1.14
PheCys: 0.595 ± 0.347
PheAsp: 1.786 ± 1.04
PheGlu: 0.595 ± 0.347
PhePhe: 0.0 ± 0.0
PheGly: 1.19 ± 0.694
PheHis: 0.0 ± 0.0
PheIle: 0.0 ± 0.0
PheLys: 2.976 ± 1.215
PheLeu: 1.786 ± 0.657
PheMet: 0.595 ± 0.471
PheAsn: 1.786 ± 1.04
PhePro: 0.595 ± 0.729
PheGln: 2.381 ± 0.913
PheArg: 0.0 ± 0.0
PheSer: 1.786 ± 0.679
PheThr: 2.381 ± 0.862
PheVal: 2.381 ± 0.913
PheTrp: 0.0 ± 0.0
PheTyr: 1.786 ± 0.657
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.976 ± 1.36
GlyCys: 1.786 ± 0.717
GlyAsp: 5.952 ± 1.839
GlyGlu: 6.548 ± 2.421
GlyPhe: 1.786 ± 1.04
GlyGly: 4.762 ± 1.179
GlyHis: 2.976 ± 1.207
GlyIle: 5.357 ± 1.425
GlyLys: 3.571 ± 0.864
GlyLeu: 5.357 ± 0.675
GlyMet: 1.786 ± 0.656
GlyAsn: 3.571 ± 1.954
GlyPro: 2.381 ± 0.789
GlyGln: 0.595 ± 0.785
GlyArg: 2.381 ± 0.661
GlySer: 4.762 ± 3.573
GlyThr: 0.595 ± 0.347
GlyVal: 4.167 ± 1.088
GlyTrp: 0.595 ± 0.785
GlyTyr: 0.595 ± 0.347
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.786 ± 0.717
HisCys: 0.0 ± 0.0
HisAsp: 0.595 ± 0.729
HisGlu: 1.786 ± 1.04
HisPhe: 0.595 ± 0.347
HisGly: 0.595 ± 0.347
HisHis: 1.19 ± 0.667
HisIle: 0.595 ± 0.347
HisLys: 1.786 ± 0.717
HisLeu: 2.381 ± 0.577
HisMet: 0.595 ± 0.785
HisAsn: 2.381 ± 0.661
HisPro: 1.786 ± 1.414
HisGln: 0.595 ± 0.729
HisArg: 1.19 ± 1.058
HisSer: 0.595 ± 0.347
HisThr: 1.19 ± 1.569
HisVal: 1.786 ± 0.657
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.571 ± 1.863
IleCys: 1.19 ± 1.058
IleAsp: 4.762 ± 1.145
IleGlu: 3.571 ± 0.916
IlePhe: 0.0 ± 0.0
IleGly: 1.19 ± 0.694
IleHis: 1.786 ± 1.04
IleIle: 2.976 ± 1.215
IleLys: 2.381 ± 1.034
IleLeu: 1.786 ± 0.679
IleMet: 1.19 ± 0.601
IleAsn: 2.381 ± 0.913
IlePro: 3.571 ± 1.824
IleGln: 1.19 ± 0.694
IleArg: 4.167 ± 1.158
IleSer: 3.571 ± 1.314
IleThr: 1.786 ± 0.656
IleVal: 2.381 ± 1.31
IleTrp: 0.595 ± 0.729
IleTyr: 1.19 ± 0.601
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.762 ± 1.372
LysCys: 0.0 ± 0.0
LysAsp: 2.976 ± 1.215
LysGlu: 1.786 ± 1.638
LysPhe: 0.595 ± 0.347
LysGly: 2.976 ± 1.215
LysHis: 0.595 ± 0.785
LysIle: 3.571 ± 1.068
LysLys: 2.976 ± 0.451
LysLeu: 6.548 ± 1.492
LysMet: 0.595 ± 0.347
LysAsn: 2.381 ± 1.478
LysPro: 4.167 ± 1.158
LysGln: 1.786 ± 0.656
LysArg: 1.786 ± 1.04
LysSer: 2.976 ± 0.451
LysThr: 3.571 ± 1.536
LysVal: 4.762 ± 1.138
LysTrp: 2.976 ± 1.215
LysTyr: 2.976 ± 1.215
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.167 ± 1.551
LeuCys: 1.19 ± 0.694
LeuAsp: 5.357 ± 1.967
LeuGlu: 6.548 ± 1.943
LeuPhe: 0.595 ± 0.729
LeuGly: 6.548 ± 0.571
LeuHis: 1.19 ± 1.569
LeuIle: 2.381 ± 0.913
LeuLys: 3.571 ± 0.894
LeuLeu: 6.548 ± 1.469
LeuMet: 2.976 ± 0.676
LeuAsn: 4.762 ± 2.379
LeuPro: 7.738 ± 1.724
LeuGln: 5.952 ± 1.86
LeuArg: 6.548 ± 0.873
LeuSer: 5.357 ± 1.535
LeuThr: 1.19 ± 0.601
LeuVal: 4.762 ± 1.111
LeuTrp: 1.19 ± 0.694
LeuTyr: 2.976 ± 1.876
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.786 ± 1.638
MetCys: 0.0 ± 0.0
MetAsp: 2.381 ± 0.913
MetGlu: 2.381 ± 0.913
MetPhe: 0.595 ± 0.347
MetGly: 1.19 ± 0.667
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 1.786 ± 0.656
MetLeu: 1.786 ± 0.657
MetMet: 0.0 ± 0.0
MetAsn: 1.786 ± 0.656
MetPro: 1.19 ± 0.517
MetGln: 3.571 ± 1.954
MetArg: 0.595 ± 0.347
MetSer: 3.571 ± 0.916
MetThr: 0.595 ± 0.347
MetVal: 1.19 ± 0.694
MetTrp: 1.786 ± 0.679
MetTyr: 0.595 ± 0.347
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.571 ± 1.059
AsnCys: 0.595 ± 0.347
AsnAsp: 1.19 ± 0.517
AsnGlu: 2.381 ± 0.913
AsnPhe: 2.381 ± 0.862
AsnGly: 0.595 ± 0.347
AsnHis: 0.595 ± 0.347
AsnIle: 0.595 ± 0.785
AsnLys: 0.595 ± 0.347
AsnLeu: 5.357 ± 0.676
AsnMet: 1.19 ± 1.526
AsnAsn: 2.381 ± 1.387
AsnPro: 1.19 ± 1.569
AsnGln: 0.0 ± 0.0
AsnArg: 2.976 ± 2.128
AsnSer: 4.762 ± 1.206
AsnThr: 1.786 ± 0.811
AsnVal: 1.786 ± 1.414
AsnTrp: 0.595 ± 0.347
AsnTyr: 1.19 ± 0.601
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.571 ± 2.001
ProCys: 2.381 ± 0.913
ProAsp: 1.19 ± 0.601
ProGlu: 1.786 ± 1.414
ProPhe: 0.595 ± 0.729
ProGly: 4.762 ± 1.696
ProHis: 2.381 ± 2.305
ProIle: 1.786 ± 0.811
ProLys: 3.571 ± 2.153
ProLeu: 5.357 ± 2.262
ProMet: 1.786 ± 0.656
ProAsn: 1.786 ± 1.714
ProPro: 8.929 ± 2.1
ProGln: 1.19 ± 0.517
ProArg: 10.119 ± 2.131
ProSer: 7.143 ± 1.727
ProThr: 5.357 ± 1.377
ProVal: 3.571 ± 1.435
ProTrp: 1.19 ± 0.694
ProTyr: 1.786 ± 0.679
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.976 ± 1.207
GlnCys: 1.786 ± 0.676
GlnAsp: 0.0 ± 0.0
GlnGlu: 2.381 ± 1.034
GlnPhe: 0.0 ± 0.0
GlnGly: 2.381 ± 1.478
GlnHis: 1.19 ± 0.667
GlnIle: 1.786 ± 0.656
GlnLys: 1.19 ± 0.601
GlnLeu: 4.762 ± 1.138
GlnMet: 0.0 ± 0.0
GlnAsn: 1.19 ± 0.694
GlnPro: 6.548 ± 0.882
GlnGln: 1.786 ± 0.656
GlnArg: 2.381 ± 1.37
GlnSer: 6.548 ± 1.431
GlnThr: 2.381 ± 0.913
GlnVal: 2.976 ± 0.668
GlnTrp: 0.595 ± 0.785
GlnTyr: 2.381 ± 1.034
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.929 ± 1.777
ArgCys: 1.786 ± 0.679
ArgAsp: 1.19 ± 0.694
ArgGlu: 5.952 ± 1.53
ArgPhe: 2.976 ± 1.138
ArgGly: 4.762 ± 5.314
ArgHis: 1.786 ± 0.657
ArgIle: 2.381 ± 0.577
ArgLys: 2.976 ± 0.451
ArgLeu: 3.571 ± 1.059
ArgMet: 3.571 ± 0.89
ArgAsn: 2.976 ± 1.419
ArgPro: 2.976 ± 1.341
ArgGln: 3.571 ± 1.351
ArgArg: 5.952 ± 6.02
ArgSer: 2.976 ± 0.668
ArgThr: 3.571 ± 1.863
ArgVal: 9.524 ± 1.569
ArgTrp: 1.19 ± 0.517
ArgTyr: 1.786 ± 0.656
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.143 ± 0.841
SerCys: 0.595 ± 0.729
SerAsp: 0.595 ± 0.347
SerGlu: 1.786 ± 0.679
SerPhe: 1.19 ± 0.694
SerGly: 4.762 ± 1.543
SerHis: 1.786 ± 1.04
SerIle: 3.571 ± 0.158
SerLys: 5.357 ± 1.405
SerLeu: 5.357 ± 1.899
SerMet: 2.381 ± 0.577
SerAsn: 0.595 ± 0.729
SerPro: 4.762 ± 1.206
SerGln: 1.19 ± 0.694
SerArg: 7.143 ± 1.397
SerSer: 3.571 ± 1.954
SerThr: 3.571 ± 1.793
SerVal: 4.762 ± 0.384
SerTrp: 1.786 ± 0.656
SerTyr: 2.381 ± 0.913
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.167 ± 0.295
ThrCys: 1.786 ± 0.656
ThrAsp: 4.167 ± 0.533
ThrGlu: 1.786 ± 2.354
ThrPhe: 1.19 ± 0.694
ThrGly: 3.571 ± 0.894
ThrHis: 1.19 ± 0.694
ThrIle: 2.381 ± 1.31
ThrLys: 1.19 ± 0.694
ThrLeu: 4.167 ± 1.629
ThrMet: 3.571 ± 1.262
ThrAsn: 1.19 ± 0.694
ThrPro: 5.952 ± 2.721
ThrGln: 2.976 ± 1.419
ThrArg: 6.548 ± 0.571
ThrSer: 1.786 ± 0.657
ThrThr: 4.762 ± 1.955
ThrVal: 2.381 ± 0.862
ThrTrp: 0.595 ± 0.785
ThrTyr: 0.0 ± 0.0
ThrXaa: 0.0 ± 0.0
Val
ValAla: 10.119 ± 1.773
ValCys: 1.786 ± 0.811
ValAsp: 5.357 ± 1.377
ValGlu: 2.381 ± 1.387
ValPhe: 1.786 ± 0.657
ValGly: 5.952 ± 2.279
ValHis: 1.786 ± 0.811
ValIle: 2.381 ± 0.913
ValLys: 5.357 ± 1.425
ValLeu: 5.952 ± 1.465
ValMet: 1.786 ± 0.656
ValAsn: 1.786 ± 0.657
ValPro: 2.381 ± 0.661
ValGln: 2.381 ± 0.573
ValArg: 1.19 ± 0.694
ValSer: 3.571 ± 0.847
ValThr: 2.381 ± 1.31
ValVal: 7.143 ± 1.87
ValTrp: 0.0 ± 0.0
ValTyr: 3.571 ± 0.89
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.976 ± 1.129
TrpCys: 0.595 ± 0.347
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.595 ± 0.347
TrpPhe: 1.19 ± 0.517
TrpGly: 1.19 ± 1.569
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.595 ± 0.347
TrpLeu: 1.786 ± 0.656
TrpMet: 1.19 ± 0.563
TrpAsn: 0.595 ± 0.347
TrpPro: 0.0 ± 0.0
TrpGln: 0.595 ± 0.729
TrpArg: 2.381 ± 0.573
TrpSer: 0.0 ± 0.0
TrpThr: 0.595 ± 0.347
TrpVal: 0.595 ± 0.729
TrpTrp: 0.595 ± 0.347
TrpTyr: 1.786 ± 0.656
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.786 ± 0.656
TyrCys: 0.0 ± 0.0
TyrAsp: 2.976 ± 0.668
TyrGlu: 1.19 ± 0.694
TyrPhe: 0.595 ± 0.347
TyrGly: 4.167 ± 1.388
TyrHis: 0.0 ± 0.0
TyrIle: 3.571 ± 1.358
TyrLys: 3.571 ± 1.536
TyrLeu: 0.595 ± 0.729
TyrMet: 0.0 ± 0.0
TyrAsn: 1.19 ± 0.517
TyrPro: 1.19 ± 0.601
TyrGln: 1.19 ± 0.694
TyrArg: 1.19 ± 0.694
TyrSer: 1.786 ± 1.638
TyrThr: 4.167 ± 1.551
TyrVal: 0.595 ± 0.347
TyrTrp: 0.0 ± 0.0
TyrTyr: 2.381 ± 1.034
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1681 amino acids)
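To relate the per mille values back to raw occurrences, the following Python sketch converts a few tabulated values into approximate counts. It assumes 1680 dipeptide windows in total (one fewer than the 1681 amino acids); this is only an inference from the fact that values such as 0.595, 1.19 and 1.786 look like multiples of 1/1680, and is not stated by the source. The helper name approx_count is made up for this example.

```python
def approx_count(per_mille, n_windows):
    """Convert a per mille frequency into an approximate occurrence count."""
    return round(per_mille / 1000.0 * n_windows)


# Assumption: 1681 residues give 1680 overlapping dipeptide windows.
N_WINDOWS = 1681 - 1

# Values copied from the table above.
for name, value in [("ProArg", 10.119), ("AlaAla", 6.548), ("AlaHis", 0.595)]:
    print(name, approx_count(value, N_WINDOWS))
```

Under this assumption the smallest non-zero value in the table, 0.595‰, corresponds to a dipeptide seen exactly once in the proteome.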

Note: The error has been estimated by bootstrapping (×100) at the protein level.
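A protein-level bootstrap of this kind could look roughly like the Python sketch below. The function names, the toy sequences and the choice to report the standard deviation over resamples as the error are all assumptions made for illustration; the exact procedure used by the database is not described here.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides in one protein (one-letter code)."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Resample whole proteins with replacement; return (mean, spread) in per mille."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the spread is reported as the sample standard deviation.
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy proteome (hypothetical sequences, not the viral proteins).
mean, err = bootstrap_error(["MKVLAAGA", "AAGAAK", "PPRSPR"], "AA")
print(round(mean, 3), round(err, 3))
```

Because whole proteins are resampled, a proteome with only a handful of proteins can give an error that is large relative to the value itself, which is consistent with entries in the table where the error exceeds the frequency.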

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski