Amino acid dipeptide frequency for Pea necrotic yellow dwarf virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
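As a rough illustration of the calculation described above, the following Python sketch counts dipeptides and compares them with the uniform expectation of 2.5‰. It assumes that dipeptides are counted as overlapping pairs within each protein and uses one-letter codes; the function name `dipeptide_per_mille` and the toy sequences are hypothetical, and the exact counting procedure used by Proteome-pI is not stated here.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for the 400 standard pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())  # includes pairs with non-standard residues
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(AMINO_ACIDS, repeat=2)}


# Uniform expectation: 1 / 400 = 0.25 % = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / len(AMINO_ACIDS) ** 2

toy_proteins = ["MKVLAAGIV", "MAAGKV"]  # made-up sequences, not viral data
freqs = dipeptide_per_mille(toy_proteins)

# Dipeptides above the uniform expectation (shown in red in the table).
over = sorted(dp for dp, f in freqs.items() if f > UNIFORM_EXPECTATION)
# Dipeptides below the uniform expectation (shown in blue in the table).
under = sorted(dp for dp, f in freqs.items() if f < UNIFORM_EXPECTATION)
```

With so few residues, almost every dipeptide that occurs at all lands far above 2.5‰, which is also why a small proteome such as this one shows many zero entries next to a few large ones.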

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
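The unit conversion can be sketched in a few lines of Python. The example uses the GluAsp value from the table (13.375‰); the helper names are hypothetical and chosen only for illustration.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille / 1000.0


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent."""
    return value_per_mille / 10.0


UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille for 400 equally likely dipeptides

glu_asp = 13.375  # GluAsp value from the table, in per mille
fraction = per_mille_to_fraction(glu_asp)
percent = per_mille_to_percent(glu_asp)
fold_over_uniform = glu_asp / UNIFORM_PER_MILLE
```

Here `fold_over_uniform` expresses how many times more frequent the dipeptide is than under the uniform expectation.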

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.0 ± 0.0
AlaCys: 0.787 ± 0.92
AlaAsp: 0.787 ± 0.608
AlaGlu: 3.147 ± 1.75
AlaPhe: 4.721 ± 1.71
AlaGly: 3.934 ± 1.388
AlaHis: 0.787 ± 0.608
AlaIle: 3.147 ± 1.431
AlaLys: 1.574 ± 0.855
AlaLeu: 1.574 ± 1.212
AlaMet: 3.147 ± 1.22
AlaAsn: 0.787 ± 0.608
AlaPro: 2.36 ± 2.258
AlaGln: 1.574 ± 1.506
AlaArg: 2.36 ± 0.907
AlaSer: 3.934 ± 2.056
AlaThr: 0.0 ± 0.0
AlaVal: 3.147 ± 2.337
AlaTrp: 1.574 ± 0.984
AlaTyr: 1.574 ± 0.946
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 1.574 ± 0.916
CysAsp: 3.147 ± 1.622
CysGlu: 0.0 ± 0.0
CysPhe: 0.787 ± 0.608
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 1.574 ± 1.841
CysLys: 0.0 ± 0.0
CysLeu: 1.574 ± 1.006
CysMet: 0.0 ± 0.0
CysAsn: 1.574 ± 0.827
CysPro: 0.0 ± 0.0
CysGln: 0.787 ± 0.812
CysArg: 2.36 ± 1.533
CysSer: 2.36 ± 1.649
CysThr: 1.574 ± 1.024
CysVal: 0.0 ± 0.0
CysTrp: 1.574 ± 0.984
CysTyr: 0.787 ± 0.709
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.147 ± 1.246
AspCys: 0.787 ± 0.812
AspAsp: 5.507 ± 2.691
AspGlu: 5.507 ± 2.592
AspPhe: 1.574 ± 0.916
AspGly: 2.36 ± 0.981
AspHis: 1.574 ± 0.953
AspIle: 6.294 ± 1.859
AspLys: 1.574 ± 1.216
AspLeu: 3.147 ± 1.663
AspMet: 4.721 ± 1.674
AspAsn: 0.787 ± 0.766
AspPro: 0.0 ± 0.0
AspGln: 0.787 ± 0.709
AspArg: 0.787 ± 0.608
AspSer: 3.934 ± 1.654
AspThr: 1.574 ± 1.232
AspVal: 4.721 ± 3.034
AspTrp: 1.574 ± 0.989
AspTyr: 3.147 ± 1.581
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.147 ± 1.881
GluCys: 0.787 ± 0.608
GluAsp: 13.375 ± 2.541
GluGlu: 4.721 ± 1.771
GluPhe: 2.36 ± 0.971
GluGly: 7.081 ± 1.919
GluHis: 0.787 ± 0.709
GluIle: 3.147 ± 1.352
GluLys: 3.934 ± 1.5
GluLeu: 6.294 ± 2.677
GluMet: 1.574 ± 0.916
GluAsn: 0.0 ± 0.0
GluPro: 1.574 ± 0.989
GluGln: 3.147 ± 1.359
GluArg: 5.507 ± 2.399
GluSer: 3.934 ± 1.884
GluThr: 2.36 ± 1.023
GluVal: 6.294 ± 2.647
GluTrp: 0.0 ± 0.0
GluTyr: 4.721 ± 2.382
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.934 ± 1.712
PheCys: 1.574 ± 1.024
PheAsp: 0.787 ± 0.608
PheGlu: 2.36 ± 1.205
PhePhe: 3.934 ± 2.032
PheGly: 0.787 ± 0.709
PheHis: 1.574 ± 0.8
PheIle: 1.574 ± 1.021
PheLys: 2.36 ± 1.776
PheLeu: 4.721 ± 2.106
PheMet: 1.574 ± 1.05
PheAsn: 3.934 ± 1.667
PhePro: 1.574 ± 0.916
PheGln: 1.574 ± 0.8
PheArg: 4.721 ± 2.021
PheSer: 3.147 ± 1.128
PheThr: 4.721 ± 1.024
PheVal: 1.574 ± 1.708
PheTrp: 0.787 ± 0.812
PheTyr: 1.574 ± 0.916
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.147 ± 2.14
GlyCys: 0.0 ± 0.0
GlyAsp: 2.36 ± 1.292
GlyGlu: 7.081 ± 2.91
GlyPhe: 3.147 ± 1.282
GlyGly: 5.507 ± 1.694
GlyHis: 0.787 ± 0.709
GlyIle: 5.507 ± 2.36
GlyLys: 5.507 ± 2.148
GlyLeu: 4.721 ± 2.167
GlyMet: 3.934 ± 1.279
GlyAsn: 2.36 ± 1.576
GlyPro: 3.934 ± 1.34
GlyGln: 1.574 ± 1.05
GlyArg: 3.147 ± 1.031
GlySer: 3.147 ± 1.623
GlyThr: 3.147 ± 1.291
GlyVal: 6.294 ± 1.776
GlyTrp: 0.787 ± 0.608
GlyTyr: 3.934 ± 1.78
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.787 ± 0.854
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 1.574 ± 0.946
HisPhe: 2.36 ± 1.368
HisGly: 0.787 ± 0.812
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 0.0 ± 0.0
HisLeu: 3.147 ± 1.327
HisMet: 1.574 ± 1.01
HisAsn: 0.787 ± 0.766
HisPro: 0.0 ± 0.0
HisGln: 3.147 ± 1.267
HisArg: 0.787 ± 0.813
HisSer: 0.0 ± 0.0
HisThr: 0.787 ± 0.766
HisVal: 1.574 ± 0.948
HisTrp: 0.0 ± 0.0
HisTyr: 0.787 ± 0.709
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.574 ± 1.216
IleCys: 3.934 ± 2.268
IleAsp: 0.787 ± 0.813
IleGlu: 2.36 ± 1.266
IlePhe: 2.36 ± 1.164
IleGly: 3.934 ± 1.683
IleHis: 3.147 ± 1.098
IleIle: 5.507 ± 3.248
IleLys: 4.721 ± 0.997
IleLeu: 3.147 ± 1.352
IleMet: 1.574 ± 1.755
IleAsn: 3.147 ± 1.051
IlePro: 3.934 ± 1.099
IleGln: 1.574 ± 0.916
IleArg: 3.934 ± 1.451
IleSer: 3.934 ± 2.534
IleThr: 5.507 ± 2.872
IleVal: 8.655 ± 2.24
IleTrp: 0.787 ± 0.608
IleTyr: 0.787 ± 0.812
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.147 ± 2.401
LysCys: 0.0 ± 0.0
LysAsp: 2.36 ± 1.488
LysGlu: 5.507 ± 1.869
LysPhe: 1.574 ± 1.08
LysGly: 1.574 ± 0.948
LysHis: 0.787 ± 0.608
LysIle: 2.36 ± 1.574
LysLys: 4.721 ± 1.129
LysLeu: 5.507 ± 2.228
LysMet: 0.0 ± 0.0
LysAsn: 2.36 ± 1.351
LysPro: 1.574 ± 0.8
LysGln: 0.787 ± 0.813
LysArg: 7.081 ± 2.811
LysSer: 3.934 ± 1.046
LysThr: 7.081 ± 1.706
LysVal: 6.294 ± 2.478
LysTrp: 0.787 ± 0.813
LysTyr: 3.147 ± 1.806
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.934 ± 1.482
LeuCys: 0.0 ± 0.0
LeuAsp: 3.147 ± 1.948
LeuGlu: 8.655 ± 1.782
LeuPhe: 4.721 ± 2.121
LeuGly: 7.868 ± 3.096
LeuHis: 1.574 ± 0.946
LeuIle: 4.721 ± 1.01
LeuLys: 6.294 ± 1.622
LeuLeu: 8.655 ± 2.137
LeuMet: 1.574 ± 1.108
LeuAsn: 6.294 ± 2.673
LeuPro: 1.574 ± 0.948
LeuGln: 3.934 ± 1.543
LeuArg: 5.507 ± 2.347
LeuSer: 3.934 ± 1.675
LeuThr: 2.36 ± 1.016
LeuVal: 7.081 ± 2.486
LeuTrp: 1.574 ± 1.233
LeuTyr: 3.147 ± 1.283
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.787 ± 0.753
MetCys: 0.787 ± 0.812
MetAsp: 3.147 ± 1.387
MetGlu: 3.934 ± 1.395
MetPhe: 2.36 ± 1.554
MetGly: 1.574 ± 0.948
MetHis: 0.0 ± 0.0
MetIle: 1.574 ± 1.006
MetLys: 6.294 ± 2.696
MetLeu: 3.147 ± 1.948
MetMet: 2.36 ± 1.392
MetAsn: 0.787 ± 0.854
MetPro: 0.787 ± 0.753
MetGln: 0.787 ± 0.709
MetArg: 3.147 ± 1.718
MetSer: 1.574 ± 0.984
MetThr: 0.787 ± 0.813
MetVal: 3.934 ± 2.191
MetTrp: 0.0 ± 0.0
MetTyr: 0.787 ± 0.753
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 0.0 ± 0.0
AsnCys: 0.0 ± 0.0
AsnAsp: 1.574 ± 1.532
AsnGlu: 1.574 ± 1.055
AsnPhe: 1.574 ± 0.827
AsnGly: 3.934 ± 1.768
AsnHis: 0.787 ± 0.854
AsnIle: 3.934 ± 1.543
AsnLys: 0.787 ± 0.753
AsnLeu: 1.574 ± 1.083
AsnMet: 0.787 ± 0.854
AsnAsn: 1.574 ± 0.855
AsnPro: 3.147 ± 1.051
AsnGln: 0.787 ± 0.709
AsnArg: 1.574 ± 1.093
AsnSer: 4.721 ± 1.962
AsnThr: 1.574 ± 1.216
AsnVal: 2.36 ± 0.981
AsnTrp: 1.574 ± 1.143
AsnTyr: 2.36 ± 0.907
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.0 ± 0.0
ProCys: 0.0 ± 0.0
ProAsp: 0.787 ± 0.753
ProGlu: 1.574 ± 1.006
ProPhe: 1.574 ± 0.921
ProGly: 4.721 ± 2.164
ProHis: 0.0 ± 0.0
ProIle: 4.721 ± 2.054
ProLys: 0.0 ± 0.0
ProLeu: 3.934 ± 0.89
ProMet: 0.0 ± 0.0
ProAsn: 0.787 ± 0.92
ProPro: 2.36 ± 1.702
ProGln: 1.574 ± 0.916
ProArg: 3.934 ± 1.274
ProSer: 4.721 ± 1.867
ProThr: 1.574 ± 1.108
ProVal: 3.147 ± 1.571
ProTrp: 1.574 ± 1.216
ProTyr: 0.787 ± 0.766
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.574 ± 1.093
GlnCys: 0.0 ± 0.0
GlnAsp: 1.574 ± 1.108
GlnGlu: 0.787 ± 0.709
GlnPhe: 0.787 ± 0.753
GlnGly: 5.507 ± 2.98
GlnHis: 1.574 ± 1.01
GlnIle: 0.787 ± 0.766
GlnLys: 1.574 ± 0.948
GlnLeu: 3.147 ± 1.71
GlnMet: 0.787 ± 0.623
GlnAsn: 0.0 ± 0.0
GlnPro: 1.574 ± 0.984
GlnGln: 1.574 ± 0.948
GlnArg: 3.147 ± 1.313
GlnSer: 3.934 ± 1.1
GlnThr: 0.787 ± 0.753
GlnVal: 1.574 ± 0.855
GlnTrp: 0.787 ± 0.753
GlnTyr: 0.787 ± 0.812
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.147 ± 1.452
ArgCys: 3.147 ± 1.512
ArgAsp: 4.721 ± 2.302
ArgGlu: 6.294 ± 2.14
ArgPhe: 3.147 ± 1.676
ArgGly: 2.36 ± 0.896
ArgHis: 1.574 ± 1.206
ArgIle: 3.934 ± 1.768
ArgLys: 7.868 ± 1.622
ArgLeu: 7.081 ± 2.923
ArgMet: 1.574 ± 1.024
ArgAsn: 2.36 ± 1.439
ArgPro: 3.147 ± 1.614
ArgGln: 1.574 ± 0.855
ArgArg: 8.655 ± 2.095
ArgSer: 2.36 ± 1.368
ArgThr: 3.147 ± 1.267
ArgVal: 3.147 ± 1.739
ArgTrp: 0.787 ± 0.709
ArgTyr: 1.574 ± 1.506
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.574 ± 1.055
SerCys: 3.147 ± 1.896
SerAsp: 3.934 ± 2.185
SerGlu: 3.147 ± 1.111
SerPhe: 4.721 ± 1.913
SerGly: 6.294 ± 2.076
SerHis: 0.787 ± 0.766
SerIle: 3.147 ± 1.675
SerLys: 2.36 ± 1.791
SerLeu: 6.294 ± 1.984
SerMet: 4.721 ± 1.024
SerAsn: 2.36 ± 1.366
SerPro: 2.36 ± 1.518
SerGln: 0.787 ± 0.92
SerArg: 3.934 ± 1.796
SerSer: 7.868 ± 4.187
SerThr: 3.934 ± 1.545
SerVal: 3.934 ± 1.84
SerTrp: 0.0 ± 0.0
SerTyr: 3.147 ± 1.893
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.147 ± 1.044
ThrCys: 0.787 ± 0.766
ThrAsp: 0.0 ± 0.0
ThrGlu: 1.574 ± 1.08
ThrPhe: 2.36 ± 1.287
ThrGly: 3.147 ± 1.301
ThrHis: 0.0 ± 0.0
ThrIle: 2.36 ± 1.192
ThrLys: 1.574 ± 1.024
ThrLeu: 3.934 ± 1.337
ThrMet: 1.574 ± 1.532
ThrAsn: 1.574 ± 1.01
ThrPro: 3.934 ± 1.096
ThrGln: 1.574 ± 0.8
ThrArg: 6.294 ± 1.354
ThrSer: 3.934 ± 1.328
ThrThr: 0.787 ± 0.753
ThrVal: 3.934 ± 1.721
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.574 ± 0.827
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.507 ± 3.812
ValCys: 1.574 ± 1.006
ValAsp: 2.36 ± 1.776
ValGlu: 9.441 ± 2.797
ValPhe: 2.36 ± 1.497
ValGly: 2.36 ± 1.314
ValHis: 1.574 ± 0.989
ValIle: 6.294 ± 1.81
ValLys: 7.081 ± 2.289
ValLeu: 8.655 ± 3.525
ValMet: 5.507 ± 1.765
ValAsn: 4.721 ± 1.048
ValPro: 2.36 ± 1.793
ValGln: 1.574 ± 1.627
ValArg: 3.934 ± 1.312
ValSer: 3.934 ± 1.552
ValThr: 2.36 ± 1.497
ValVal: 5.507 ± 2.727
ValTrp: 0.0 ± 0.0
ValTyr: 3.147 ± 1.359
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.787 ± 0.753
TrpCys: 0.787 ± 0.608
TrpAsp: 1.574 ± 0.984
TrpGlu: 0.787 ± 0.608
TrpPhe: 0.787 ± 0.766
TrpGly: 0.787 ± 0.753
TrpHis: 0.0 ± 0.0
TrpIle: 2.36 ± 1.366
TrpLys: 1.574 ± 1.021
TrpLeu: 0.787 ± 0.92
TrpMet: 0.787 ± 0.608
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.787 ± 0.608
TrpArg: 0.0 ± 0.0
TrpSer: 0.0 ± 0.0
TrpThr: 0.0 ± 0.0
TrpVal: 2.36 ± 1.2
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.574 ± 1.216
TyrCys: 0.0 ± 0.0
TyrAsp: 1.574 ± 0.946
TyrGlu: 4.721 ± 1.742
TyrPhe: 1.574 ± 1.418
TyrGly: 5.507 ± 1.538
TyrHis: 0.787 ± 0.766
TyrIle: 3.147 ± 1.387
TyrLys: 0.787 ± 0.753
TyrLeu: 5.507 ± 2.011
TyrMet: 0.0 ± 0.0
TyrAsn: 0.0 ± 0.0
TyrPro: 1.574 ± 1.206
TyrGln: 2.36 ± 1.823
TyrArg: 0.787 ± 0.813
TyrSer: 3.147 ± 1.069
TyrThr: 0.0 ± 0.0
TyrVal: 4.721 ± 1.93
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.787 ± 0.766
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (1272 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
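A minimal Python sketch of what protein-level bootstrapping could look like is given below. It assumes that whole proteins are resampled with replacement 100 times and that the reported error is the standard deviation of the frequency across replicates; neither detail is confirmed here. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within each protein (an assumption).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    replicates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(replicates)


toy_proteins = ["MKVLAAGIV", "MAAGKV", "MKRRLE", "GSSVLE"]  # made-up data
estimate = dipeptide_per_mille(toy_proteins, "AA")
error = bootstrap_error(toy_proteins, "AA")
print(f"AlaAla: {estimate:.3f} ± {error:.3f}")
```

Under this scheme a dipeptide that never occurs in any protein yields 0.0 in every replicate, which is consistent with the 0.0 ± 0.0 entries in the table.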

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski