Amino acid dipeptide frequency for Potato virus M (strain Russian) (PVM)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
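A minimal sketch of how such frequencies could be computed is given below. It assumes that dipeptides are counted as overlapping adjacent pairs within each protein (never across protein boundaries) and that any residue outside the 20 standard one-letter codes is mapped to X (Xaa). The function name `dipeptide_permille` and the toy sequences are hypothetical; this is not necessarily the exact procedure used by Proteome-pI.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all given proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not the PVM proteome).
freqs = dipeptide_permille(["MAAL", "AALU"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

Counting per protein keeps artificial pairs (the last residue of one protein followed by the first residue of the next) out of the statistics.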

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each of the 400 would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
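The comparison against the random expectation can be sketched as follows. The sketch assumes that the threshold is simply the uniform expectation of 1/400 and that the error estimate is ignored; the exact colouring rule used for the table is not stated here, so `classify` and its labels are hypothetical.

```python
# Uniform expectation for 400 dipeptides, expressed in per mille.
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def classify(observed_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if observed_permille > expected:
        return "over"      # would be marked in red
    if observed_permille < expected:
        return "under"     # would be marked in blue
    return "expected"


# Two values taken from the table: ValLeu and TrpTrp.
print("ValLeu", classify(10.432))
print("TrpTrp", classify(0.36))
```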

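All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions. For example, the AlaAla value of 7.194‰ corresponds to a fraction of 0.007194, i.e. 0.7194%.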

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
7.194AlaAla: 7.194 ± 2.598
1.079AlaCys: 1.079 ± 0.752
3.957AlaAsp: 3.957 ± 3.23
4.676AlaGlu: 4.676 ± 2.176
3.597AlaPhe: 3.597 ± 0.853
6.115AlaGly: 6.115 ± 1.695
1.799AlaHis: 1.799 ± 0.618
3.957AlaIle: 3.957 ± 1.798
4.317AlaLys: 4.317 ± 1.693
8.993AlaLeu: 8.993 ± 1.568
2.158AlaMet: 2.158 ± 0.943
1.799AlaAsn: 1.799 ± 0.618
2.878AlaPro: 2.878 ± 1.195
2.518AlaGln: 2.518 ± 0.895
5.755AlaArg: 5.755 ± 1.61
4.317AlaSer: 4.317 ± 0.876
4.317AlaThr: 4.317 ± 0.902
6.475AlaVal: 6.475 ± 2.402
1.799AlaTrp: 1.799 ± 0.935
2.158AlaTyr: 2.158 ± 1.163
0.0AlaXaa: 0.0 ± 0.0
Cys
3.237CysAla: 3.237 ± 1.023
1.079CysCys: 1.079 ± 0.561
0.36CysAsp: 0.36 ± 0.703
0.719CysGlu: 0.719 ± 0.374
2.878CysPhe: 2.878 ± 1.013
2.158CysGly: 2.158 ± 1.092
0.36CysHis: 0.36 ± 0.703
1.079CysIle: 1.079 ± 0.907
2.158CysLys: 2.158 ± 0.726
1.439CysLeu: 1.439 ± 0.579
0.719CysMet: 0.719 ± 0.641
1.439CysAsn: 1.439 ± 0.748
1.079CysPro: 1.079 ± 1.376
0.36CysGln: 0.36 ± 0.187
1.799CysArg: 1.799 ± 1.457
1.079CysSer: 1.079 ± 0.581
1.439CysThr: 1.439 ± 0.748
2.878CysVal: 2.878 ± 1.29
0.36CysTrp: 0.36 ± 0.187
1.079CysTyr: 1.079 ± 0.563
0.0CysXaa: 0.0 ± 0.0
Asp
3.237AspAla: 3.237 ± 2.383
1.079AspCys: 1.079 ± 1.092
1.079AspAsp: 1.079 ± 0.561
5.036AspGlu: 5.036 ± 0.901
3.237AspPhe: 3.237 ± 0.57
3.237AspGly: 3.237 ± 1.471
1.079AspHis: 1.079 ± 0.752
2.878AspIle: 2.878 ± 0.474
1.079AspLys: 1.079 ± 0.561
2.878AspLeu: 2.878 ± 1.158
1.079AspMet: 1.079 ± 0.549
1.439AspAsn: 1.439 ± 0.577
2.518AspPro: 2.518 ± 1.56
0.36AspGln: 0.36 ± 0.744
2.518AspArg: 2.518 ± 0.782
3.237AspSer: 3.237 ± 0.839
1.439AspThr: 1.439 ± 0.553
4.317AspVal: 4.317 ± 1.286
0.719AspTrp: 0.719 ± 0.605
2.518AspTyr: 2.518 ± 1.086
0.0AspXaa: 0.0 ± 0.0
Glu
5.396GluAla: 5.396 ± 0.718
1.079GluCys: 1.079 ± 0.561
2.158GluAsp: 2.158 ± 1.145
5.396GluGlu: 5.396 ± 1.872
3.237GluPhe: 3.237 ± 0.944
4.317GluGly: 4.317 ± 2.28
1.439GluHis: 1.439 ± 0.553
3.237GluIle: 3.237 ± 1.171
4.317GluLys: 4.317 ± 1.036
5.396GluLeu: 5.396 ± 1.547
1.799GluMet: 1.799 ± 0.696
4.317GluAsn: 4.317 ± 1.338
2.878GluPro: 2.878 ± 1.496
2.878GluGln: 2.878 ± 1.496
4.676GluArg: 4.676 ± 1.303
3.597GluSer: 3.597 ± 1.547
2.158GluThr: 2.158 ± 1.814
7.554GluVal: 7.554 ± 1.637
1.439GluTrp: 1.439 ± 0.553
2.518GluTyr: 2.518 ± 0.843
0.0GluXaa: 0.0 ± 0.0
Phe
3.957PheAla: 3.957 ± 0.87
2.158PheCys: 2.158 ± 1.121
3.957PheAsp: 3.957 ± 1.335
5.396PheGlu: 5.396 ± 0.928
1.439PhePhe: 1.439 ± 0.748
2.518PheGly: 2.518 ± 2.496
0.719PheHis: 0.719 ± 0.374
2.518PheIle: 2.518 ± 0.895
1.439PheLys: 1.439 ± 0.748
5.396PheLeu: 5.396 ± 1.872
2.158PheMet: 2.158 ± 1.122
1.439PheAsn: 1.439 ± 0.748
1.079PhePro: 1.079 ± 0.561
0.719PheGln: 0.719 ± 0.605
1.799PheArg: 1.799 ± 0.96
2.878PheSer: 2.878 ± 0.7
1.799PheThr: 1.799 ± 0.97
5.036PheVal: 5.036 ± 1.863
0.0PheTrp: 0.0 ± 0.0
1.079PheTyr: 1.079 ± 0.752
0.0PheXaa: 0.0 ± 0.0
Gly
7.554GlyAla: 7.554 ± 1.866
1.799GlyCys: 1.799 ± 2.137
3.957GlyAsp: 3.957 ± 0.523
3.237GlyGlu: 3.237 ± 1.673
0.36GlyPhe: 0.36 ± 0.707
2.878GlyGly: 2.878 ± 1.516
0.719GlyHis: 0.719 ± 0.374
1.799GlyIle: 1.799 ± 0.647
5.755GlyLys: 5.755 ± 1.44
6.835GlyLeu: 6.835 ± 1.193
2.158GlyMet: 2.158 ± 0.726
1.799GlyAsn: 1.799 ± 0.634
1.799GlyPro: 1.799 ± 0.618
2.878GlyGln: 2.878 ± 1.896
5.755GlyArg: 5.755 ± 1.779
3.597GlySer: 3.597 ± 2.196
3.957GlyThr: 3.957 ± 0.817
5.036GlyVal: 5.036 ± 1.04
1.799GlyTrp: 1.799 ± 0.881
1.079GlyTyr: 1.079 ± 0.561
0.0GlyXaa: 0.0 ± 0.0
His
1.799HisAla: 1.799 ± 0.935
1.439HisCys: 1.439 ± 0.993
0.719HisAsp: 0.719 ± 0.374
1.439HisGlu: 1.439 ± 0.748
0.36HisPhe: 0.36 ± 0.187
1.439HisGly: 1.439 ± 0.973
1.079HisHis: 1.079 ± 0.561
1.439HisIle: 1.439 ± 0.579
2.518HisLys: 2.518 ± 0.782
2.158HisLeu: 2.158 ± 1.122
1.439HisMet: 1.439 ± 0.553
2.158HisAsn: 2.158 ± 1.372
0.36HisPro: 0.36 ± 0.187
0.0HisGln: 0.0 ± 0.0
2.158HisArg: 2.158 ± 0.856
1.439HisSer: 1.439 ± 0.748
0.719HisThr: 0.719 ± 0.793
1.079HisVal: 1.079 ± 0.549
0.0HisTrp: 0.0 ± 0.0
0.719HisTyr: 0.719 ± 0.374
0.0HisXaa: 0.0 ± 0.0
Ile
3.597IleAla: 3.597 ± 1.348
1.439IleCys: 1.439 ± 0.748
2.518IleAsp: 2.518 ± 1.049
3.597IleGlu: 3.597 ± 0.982
0.36IlePhe: 0.36 ± 0.187
0.719IleGly: 0.719 ± 0.374
0.719IleHis: 0.719 ± 0.374
0.719IleIle: 0.719 ± 0.374
1.079IleLys: 1.079 ± 0.549
2.158IleLeu: 2.158 ± 0.734
1.439IleMet: 1.439 ± 0.748
2.158IleAsn: 2.158 ± 0.856
0.719IlePro: 0.719 ± 0.608
0.719IleGln: 0.719 ± 1.025
3.597IleArg: 3.597 ± 1.488
2.878IleSer: 2.878 ± 1.53
3.237IleThr: 3.237 ± 1.194
4.676IleVal: 4.676 ± 2.385
0.719IleTrp: 0.719 ± 0.793
2.518IleTyr: 2.518 ± 0.433
0.0IleXaa: 0.0 ± 0.0
Lys
4.676LysAla: 4.676 ± 0.709
0.719LysCys: 0.719 ± 0.374
4.317LysAsp: 4.317 ± 3.1
2.878LysGlu: 2.878 ± 1.496
3.957LysPhe: 3.957 ± 1.36
2.878LysGly: 2.878 ± 1.496
0.719LysHis: 0.719 ± 0.608
1.079LysIle: 1.079 ± 0.561
2.878LysLys: 2.878 ± 1.107
6.475LysLeu: 6.475 ± 1.944
1.799LysMet: 1.799 ± 0.605
2.158LysAsn: 2.158 ± 1.122
2.878LysPro: 2.878 ± 0.875
1.439LysGln: 1.439 ± 0.748
3.957LysArg: 3.957 ± 1.551
4.676LysSer: 4.676 ± 0.709
1.439LysThr: 1.439 ± 0.748
3.597LysVal: 3.597 ± 0.904
0.719LysTrp: 0.719 ± 0.641
1.799LysTyr: 1.799 ± 0.908
0.0LysXaa: 0.0 ± 0.0
Leu
6.475LeuAla: 6.475 ± 1.314
3.597LeuCys: 3.597 ± 1.337
6.475LeuAsp: 6.475 ± 1.153
7.194LeuGlu: 7.194 ± 1.889
4.676LeuPhe: 4.676 ± 1.209
8.273LeuGly: 8.273 ± 1.928
2.158LeuHis: 2.158 ± 0.734
4.676LeuIle: 4.676 ± 2.936
8.273LeuLys: 8.273 ± 3.16
8.273LeuLeu: 8.273 ± 4.182
1.079LeuMet: 1.079 ± 0.729
3.597LeuAsn: 3.597 ± 0.982
4.317LeuPro: 4.317 ± 1.469
1.799LeuGln: 1.799 ± 0.568
5.755LeuArg: 5.755 ± 1.191
7.554LeuSer: 7.554 ± 2.191
3.597LeuThr: 3.597 ± 0.983
5.755LeuVal: 5.755 ± 1.962
1.439LeuTrp: 1.439 ± 0.577
3.237LeuTyr: 3.237 ± 0.823
0.0LeuXaa: 0.0 ± 0.0
Met
2.518MetAla: 2.518 ± 0.861
1.799MetCys: 1.799 ± 0.647
2.158MetAsp: 2.158 ± 0.734
1.799MetGlu: 1.799 ± 0.935
1.439MetPhe: 1.439 ± 0.748
1.799MetGly: 1.799 ± 1.14
1.079MetHis: 1.079 ± 0.561
1.079MetIle: 1.079 ± 0.907
1.079MetLys: 1.079 ± 0.563
1.799MetLeu: 1.799 ± 0.618
0.36MetMet: 0.36 ± 0.707
1.439MetAsn: 1.439 ± 0.553
1.439MetPro: 1.439 ± 0.757
0.719MetGln: 0.719 ± 0.374
4.317MetArg: 4.317 ± 1.451
1.079MetSer: 1.079 ± 0.549
0.719MetThr: 0.719 ± 0.641
1.079MetVal: 1.079 ± 0.561
0.0MetTrp: 0.0 ± 0.0
0.36MetTyr: 0.36 ± 0.187
0.0MetXaa: 0.0 ± 0.0
Asn
3.597AsnAla: 3.597 ± 1.235
2.518AsnCys: 2.518 ± 1.007
0.36AsnAsp: 0.36 ± 0.187
1.799AsnGlu: 1.799 ± 0.935
1.439AsnPhe: 1.439 ± 0.748
1.439AsnGly: 1.439 ± 0.553
1.079AsnHis: 1.079 ± 0.549
0.719AsnIle: 0.719 ± 0.374
3.237AsnLys: 3.237 ± 1.327
6.115AsnLeu: 6.115 ± 2.058
2.158AsnMet: 2.158 ± 1.097
1.439AsnAsn: 1.439 ± 0.553
1.079AsnPro: 1.079 ± 1.303
0.719AsnGln: 0.719 ± 0.605
2.878AsnArg: 2.878 ± 1.691
3.597AsnSer: 3.597 ± 1.444
1.799AsnThr: 1.799 ± 2.097
3.237AsnVal: 3.237 ± 0.937
1.439AsnTrp: 1.439 ± 0.579
2.158AsnTyr: 2.158 ± 1.122
0.0AsnXaa: 0.0 ± 0.0
Pro
1.439ProAla: 1.439 ± 0.553
0.719ProCys: 0.719 ± 0.374
3.597ProAsp: 3.597 ± 0.514
3.237ProGlu: 3.237 ± 0.577
1.079ProPhe: 1.079 ± 0.907
3.237ProGly: 3.237 ± 1.052
1.079ProHis: 1.079 ± 0.752
1.799ProIle: 1.799 ± 1.341
0.719ProLys: 0.719 ± 0.374
3.957ProLeu: 3.957 ± 2.085
0.719ProMet: 0.719 ± 0.374
1.079ProAsn: 1.079 ± 0.752
2.518ProPro: 2.518 ± 2.244
0.719ProGln: 0.719 ± 0.374
3.597ProArg: 3.597 ± 2.267
2.518ProSer: 2.518 ± 0.598
3.597ProThr: 3.597 ± 2.389
2.158ProVal: 2.158 ± 1.658
0.719ProTrp: 0.719 ± 0.793
2.158ProTyr: 2.158 ± 0.726
0.0ProXaa: 0.0 ± 0.0
Gln
2.518GlnAla: 2.518 ± 0.861
1.079GlnCys: 1.079 ± 0.581
0.36GlnAsp: 0.36 ± 0.187
1.799GlnGlu: 1.799 ± 0.568
1.439GlnPhe: 1.439 ± 0.577
1.439GlnGly: 1.439 ± 1.874
1.079GlnHis: 1.079 ± 0.561
0.719GlnIle: 0.719 ± 0.374
1.079GlnLys: 1.079 ± 0.561
3.237GlnLeu: 3.237 ± 1.176
0.36GlnMet: 0.36 ± 0.187
0.36GlnAsn: 0.36 ± 0.187
2.518GlnPro: 2.518 ± 1.494
1.439GlnGln: 1.439 ± 0.704
0.36GlnArg: 0.36 ± 0.744
2.158GlnSer: 2.158 ± 1.092
0.36GlnThr: 0.36 ± 0.744
3.237GlnVal: 3.237 ± 0.839
0.36GlnTrp: 0.36 ± 0.187
0.719GlnTyr: 0.719 ± 0.605
0.0GlnXaa: 0.0 ± 0.0
Arg
6.115ArgAla: 6.115 ± 1.391
1.799ArgCys: 1.799 ± 1.906
2.158ArgAsp: 2.158 ± 1.634
5.396ArgGlu: 5.396 ± 2.201
5.396ArgPhe: 5.396 ± 1.201
4.676ArgGly: 4.676 ± 2.176
1.439ArgHis: 1.439 ± 0.579
1.799ArgIle: 1.799 ± 1.454
2.518ArgLys: 2.518 ± 0.809
8.273ArgLeu: 8.273 ± 0.62
2.158ArgMet: 2.158 ± 1.122
2.518ArgAsn: 2.518 ± 0.861
3.237ArgPro: 3.237 ± 3.656
0.0ArgGln: 0.0 ± 0.0
6.835ArgArg: 6.835 ± 3.055
5.396ArgSer: 5.396 ± 1.718
2.158ArgThr: 2.158 ± 0.734
4.676ArgVal: 4.676 ± 0.978
0.36ArgTrp: 0.36 ± 0.187
2.518ArgTyr: 2.518 ± 1.309
0.0ArgXaa: 0.0 ± 0.0
Ser
4.676SerAla: 4.676 ± 1.565
1.079SerCys: 1.079 ± 0.561
3.597SerAsp: 3.597 ± 1.232
6.475SerGlu: 6.475 ± 1.237
2.878SerPhe: 2.878 ± 0.875
5.036SerGly: 5.036 ± 1.493
2.158SerHis: 2.158 ± 1.655
2.158SerIle: 2.158 ± 1.786
3.597SerLys: 3.597 ± 1.228
5.396SerLeu: 5.396 ± 1.361
2.158SerMet: 2.158 ± 1.025
3.597SerAsn: 3.597 ± 1.137
2.158SerPro: 2.158 ± 0.822
2.518SerGln: 2.518 ± 0.843
2.158SerArg: 2.158 ± 1.203
4.317SerSer: 4.317 ± 1.482
2.158SerThr: 2.158 ± 0.796
6.475SerVal: 6.475 ± 1.822
0.36SerTrp: 0.36 ± 0.187
2.518SerTyr: 2.518 ± 0.895
0.0SerXaa: 0.0 ± 0.0
Thr
3.597ThrAla: 3.597 ± 3.103
0.36ThrCys: 0.36 ± 0.744
1.079ThrAsp: 1.079 ± 0.549
2.878ThrGlu: 2.878 ± 1.459
4.317ThrPhe: 4.317 ± 1.363
3.237ThrGly: 3.237 ± 1.646
2.158ThrHis: 2.158 ± 1.097
1.799ThrIle: 1.799 ± 0.935
3.957ThrLys: 3.957 ± 1.67
4.676ThrLeu: 4.676 ± 0.709
1.079ThrMet: 1.079 ± 0.584
2.158ThrAsn: 2.158 ± 1.097
1.079ThrPro: 1.079 ± 1.059
0.719ThrGln: 0.719 ± 0.374
2.158ThrArg: 2.158 ± 0.971
3.237ThrSer: 3.237 ± 1.949
0.719ThrThr: 0.719 ± 0.608
2.878ThrVal: 2.878 ± 1.048
0.719ThrTrp: 0.719 ± 0.605
1.439ThrTyr: 1.439 ± 0.579
0.0ThrXaa: 0.0 ± 0.0
Val
5.396ValAla: 5.396 ± 1.389
2.518ValCys: 2.518 ± 1.08
1.439ValAsp: 1.439 ± 0.579
4.317ValGlu: 4.317 ± 0.902
3.597ValPhe: 3.597 ± 2.74
6.115ValGly: 6.115 ± 1.808
2.878ValHis: 2.878 ± 1.012
3.597ValIle: 3.597 ± 1.868
1.439ValLys: 1.439 ± 0.553
10.432ValLeu: 10.432 ± 3.543
1.079ValMet: 1.079 ± 0.561
3.597ValAsn: 3.597 ± 1.344
2.878ValPro: 2.878 ± 1.449
4.317ValGln: 4.317 ± 1.325
6.475ValArg: 6.475 ± 1.187
5.396ValSer: 5.396 ± 1.449
5.396ValThr: 5.396 ± 2.03
9.712ValVal: 9.712 ± 2.609
1.079ValTrp: 1.079 ± 0.752
1.439ValTyr: 1.439 ± 1.347
0.0ValXaa: 0.0 ± 0.0
Trp
1.079TrpAla: 1.079 ± 1.967
0.36TrpCys: 0.36 ± 0.187
0.0TrpAsp: 0.0 ± 0.0
0.719TrpGlu: 0.719 ± 0.374
0.719TrpPhe: 0.719 ± 0.374
1.439TrpGly: 1.439 ± 0.748
0.719TrpHis: 0.719 ± 0.374
0.719TrpIle: 0.719 ± 0.641
0.0TrpLys: 0.0 ± 0.0
1.439TrpLeu: 1.439 ± 0.757
0.719TrpMet: 0.719 ± 0.374
2.158TrpAsn: 2.158 ± 0.469
1.079TrpPro: 1.079 ± 0.911
0.0TrpGln: 0.0 ± 0.0
1.079TrpArg: 1.079 ± 0.561
1.079TrpSer: 1.079 ± 0.581
0.0TrpThr: 0.0 ± 0.0
0.719TrpVal: 0.719 ± 0.374
0.36TrpTrp: 0.36 ± 0.187
0.36TrpTyr: 0.36 ± 0.872
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.799TyrAla: 1.799 ± 0.674
0.0TyrCys: 0.0 ± 0.0
0.719TyrAsp: 0.719 ± 0.374
1.799TyrGlu: 1.799 ± 0.618
1.079TyrPhe: 1.079 ± 0.561
1.799TyrGly: 1.799 ± 0.935
0.0TyrHis: 0.0 ± 0.0
1.799TyrIle: 1.799 ± 0.935
3.597TyrLys: 3.597 ± 1.269
3.237TyrLeu: 3.237 ± 1.322
1.079TyrMet: 1.079 ± 0.561
2.158TyrAsn: 2.158 ± 1.106
2.158TyrPro: 2.158 ± 1.122
1.799TyrGln: 1.799 ± 0.634
1.799TyrArg: 1.799 ± 0.97
1.439TyrSer: 1.439 ± 0.748
3.237TyrThr: 3.237 ± 2.884
2.518TyrVal: 2.518 ± 1.941
0.36TyrTrp: 0.36 ± 0.187
0.719TyrTyr: 0.719 ± 0.374
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2781 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
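A protein-level bootstrap can be sketched roughly as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. The sketch assumes that the reported error is the standard deviation across 100 replicates; the helper names `pair_permille` and `bootstrap_error`, the fixed seed, and the toy sequences are hypothetical, and the actual Proteome-pI procedure may differ in detail.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (per mille) across the given proteins."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs within each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of a dipeptide frequency over bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    values = []
    for _ in range(replicates):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        values.append(pair_permille(sample, pair))
    return statistics.pstdev(values)


# Toy input (hypothetical sequences, not the PVM proteome).
toy = ["AAAA", "LLLL", "AALL"]
print(f"AA: {pair_permille(toy, 'AA'):.3f} ± {bootstrap_error(toy, 'AA'):.3f}")
```

With only 6 proteins in this proteome, each replicate draws 6 proteins with replacement, so a dipeptide concentrated in a single protein can show an error comparable to its frequency.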

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski