Amino acid dipeptide frequency for Common vole polyomavirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
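The calculation can be sketched in a few lines of Python. This is only a minimal illustration, not the code used to build this page: the function name `dipeptide_frequencies`, the toy sequences, and the counting rules (overlapping pairs, no pairs spanning two proteins, unknown letters mapped to Xaa) are all assumptions.

```python
from collections import Counter

# Three-letter codes for the 20 standard amino acids, keyed by one-letter code.
THREE_LETTER = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumptions (not confirmed by the source): overlapping pairs are
    counted, pairs never span two proteins, and any letter outside the
    20 standard residues is mapped to Xaa.
    """
    counts = Counter()
    for seq in proteins:
        residues = [THREE_LETTER.get(aa, "Xaa") for aa in seq.upper()]
        # zip pairs each residue with its successor: overlapping dipeptides.
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences, not taken from the proteome on this page.
freqs = dipeptide_frequencies(["MAAL", "AAB"])
```

In this toy input there are five dipeptides in total, two of which are AlaAla, and the non-standard letter B ends up in the AlaXaa pair.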

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
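The expected value and the over/under-representation labels can be reproduced with a short Python sketch. The threshold used here is the plain uniform expectation; the page does not state the exact rule behind the colouring, so the function `representation` and its cut-off are assumptions.

```python
# Uniform expectation for dipeptides, expressed in per mille.
EXPECTED_400 = 1000.0 / (20 * 20)   # 2.5 per mille, i.e. 0.25%
EXPECTED_441 = 1000.0 / (21 * 21)   # about 2.268 per mille when Xaa is included


def representation(freq_permille, expected=EXPECTED_400):
    """Label a frequency relative to the uniform expectation.

    Assumption: the colour threshold is the plain uniform expectation;
    the page does not state the exact rule it uses.
    """
    if freq_permille > expected:
        return "over"    # marked in red on the page
    if freq_permille < expected:
        return "under"   # marked in blue on the page
    return "expected"


# Values taken from the table: AlaAla, AlaHis and AlaIle.
labels = {
    name: representation(value)
    for name, value in [("AlaAla", 11.394), ("AlaHis", 0.0), ("AlaIle", 2.713)]
}
```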

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
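As a small worked example of the unit conversion, the sketch below converts the AlaAla value from the table into a fraction and a percentage. The helper names are hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per-mille value to a percentage."""
    return value_permille / 10.0


ala_ala = 11.394  # AlaAla frequency from the table, in per mille
fraction = permille_to_fraction(ala_ala)
percent = permille_to_percent(ala_ala)
```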

For more information, see the sequence space article on Wikipedia.

Each entry is listed as dipeptide (first residue + second residue): frequency ± error (‰), grouped by first residue. The second residue follows the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.394 ± 3.543
AlaCys: 1.628 ± 1.03
AlaAsp: 3.798 ± 1.023
AlaGlu: 2.17 ± 1.44
AlaPhe: 0.543 ± 0.437
AlaGly: 5.426 ± 2.821
AlaHis: 0.0 ± 0.0
AlaIle: 2.713 ± 0.718
AlaLys: 2.713 ± 1.274
AlaLeu: 10.309 ± 3.727
AlaMet: 1.628 ± 0.656
AlaAsn: 2.17 ± 1.272
AlaPro: 4.883 ± 2.313
AlaGln: 1.628 ± 0.391
AlaArg: 0.543 ± 0.344
AlaSer: 5.426 ± 2.218
AlaThr: 3.798 ± 2.235
AlaVal: 4.341 ± 1.445
AlaTrp: 2.17 ± 0.726
AlaTyr: 2.17 ± 0.932
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 1.085 ± 0.553
CysAsp: 0.543 ± 0.344
CysGlu: 1.628 ± 0.752
CysPhe: 0.543 ± 0.344
CysGly: 0.543 ± 0.533
CysHis: 0.0 ± 0.0
CysIle: 1.085 ± 1.066
CysLys: 3.798 ± 1.412
CysLeu: 3.256 ± 1.504
CysMet: 0.543 ± 0.344
CysAsn: 1.628 ± 0.752
CysPro: 0.0 ± 0.0
CysGln: 0.543 ± 0.344
CysArg: 0.543 ± 0.344
CysSer: 0.0 ± 0.0
CysThr: 0.543 ± 0.344
CysVal: 1.085 ± 0.424
CysTrp: 1.085 ± 0.553
CysTyr: 3.798 ± 1.694
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.543 ± 0.533
AspCys: 0.543 ± 0.533
AspAsp: 1.628 ± 0.656
AspGlu: 5.969 ± 0.279
AspPhe: 1.085 ± 0.688
AspGly: 3.798 ± 0.789
AspHis: 1.628 ± 0.638
AspIle: 2.17 ± 1.377
AspLys: 2.713 ± 0.945
AspLeu: 1.628 ± 1.033
AspMet: 1.628 ± 0.63
AspAsn: 1.628 ± 0.638
AspPro: 2.17 ± 0.481
AspGln: 1.085 ± 0.688
AspArg: 2.713 ± 0.974
AspSer: 2.713 ± 1.339
AspThr: 2.713 ± 0.707
AspVal: 3.256 ± 0.965
AspTrp: 0.0 ± 0.0
AspTyr: 3.256 ± 1.034
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.596 ± 1.449
GluCys: 0.543 ± 0.344
GluAsp: 5.969 ± 0.279
GluGlu: 9.767 ± 2.908
GluPhe: 3.798 ± 0.599
GluGly: 2.17 ± 0.963
GluHis: 3.256 ± 0.709
GluIle: 2.713 ± 0.661
GluLys: 2.17 ± 1.377
GluLeu: 9.767 ± 3.191
GluMet: 0.543 ± 0.437
GluAsn: 2.17 ± 0.933
GluPro: 0.0 ± 0.0
GluGln: 3.798 ± 1.022
GluArg: 5.969 ± 3.078
GluSer: 3.798 ± 0.748
GluThr: 3.256 ± 1.262
GluVal: 7.054 ± 3.37
GluTrp: 1.085 ± 0.688
GluTyr: 1.085 ± 0.688
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.713 ± 1.253
PheCys: 0.543 ± 0.344
PheAsp: 0.0 ± 0.0
PheGlu: 2.713 ± 0.817
PhePhe: 1.628 ± 1.176
PheGly: 3.256 ± 0.717
PheHis: 0.543 ± 0.344
PheIle: 1.628 ± 0.638
PheLys: 2.17 ± 1.377
PheLeu: 1.628 ± 1.033
PheMet: 3.798 ± 1.003
PheAsn: 2.17 ± 0.933
PhePro: 2.17 ± 0.932
PheGln: 0.543 ± 0.533
PheArg: 2.713 ± 1.19
PheSer: 3.256 ± 1.052
PheThr: 0.0 ± 0.0
PheVal: 2.17 ± 1.377
PheTrp: 0.543 ± 0.437
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.256 ± 1.815
GlyCys: 0.543 ± 0.344
GlyAsp: 3.256 ± 0.88
GlyGlu: 2.17 ± 0.698
GlyPhe: 1.085 ± 0.553
GlyGly: 10.309 ± 1.785
GlyHis: 1.085 ± 0.636
GlyIle: 3.798 ± 1.069
GlyLys: 1.628 ± 1.033
GlyLeu: 5.426 ± 1.077
GlyMet: 1.628 ± 0.638
GlyAsn: 2.17 ± 1.086
GlyPro: 3.256 ± 1.043
GlyGln: 2.713 ± 0.74
GlyArg: 3.798 ± 1.096
GlySer: 6.511 ± 3.413
GlyThr: 6.511 ± 2.886
GlyVal: 8.681 ± 2.336
GlyTrp: 0.543 ± 0.437
GlyTyr: 1.085 ± 0.636
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.543 ± 0.533
HisCys: 2.17 ± 1.03
HisAsp: 0.543 ± 0.344
HisGlu: 0.543 ± 0.344
HisPhe: 1.085 ± 0.424
HisGly: 1.085 ± 0.64
HisHis: 2.17 ± 1.03
HisIle: 1.085 ± 0.424
HisLys: 1.085 ± 0.688
HisLeu: 0.0 ± 0.0
HisMet: 0.543 ± 0.344
HisAsn: 0.543 ± 0.344
HisPro: 1.628 ± 0.752
HisGln: 0.543 ± 0.533
HisArg: 1.628 ± 0.505
HisSer: 0.543 ± 0.344
HisThr: 1.628 ± 0.505
HisVal: 1.628 ± 0.505
HisTrp: 0.0 ± 0.0
HisTyr: 2.17 ± 0.532
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.883 ± 1.654
IleCys: 0.543 ± 0.344
IleAsp: 2.17 ± 1.03
IleGlu: 2.713 ± 0.817
IlePhe: 1.085 ± 0.688
IleGly: 4.883 ± 1.623
IleHis: 1.628 ± 0.752
IleIle: 2.17 ± 0.933
IleLys: 0.543 ± 0.437
IleLeu: 2.17 ± 0.532
IleMet: 1.085 ± 0.553
IleAsn: 3.256 ± 1.224
IlePro: 3.798 ± 0.498
IleGln: 2.713 ± 0.332
IleArg: 3.798 ± 0.888
IleSer: 4.883 ± 2.27
IleThr: 2.17 ± 0.849
IleVal: 2.17 ± 0.923
IleTrp: 1.085 ± 0.636
IleTyr: 0.543 ± 0.344
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.17 ± 0.726
LysCys: 2.713 ± 1.274
LysAsp: 0.543 ± 0.344
LysGlu: 5.969 ± 2.115
LysPhe: 1.085 ± 0.688
LysGly: 3.798 ± 1.529
LysHis: 2.17 ± 1.03
LysIle: 1.085 ± 0.424
LysLys: 3.256 ± 0.553
LysLeu: 1.628 ± 1.033
LysMet: 1.085 ± 0.709
LysAsn: 1.628 ± 0.656
LysPro: 2.713 ± 0.332
LysGln: 0.543 ± 0.437
LysArg: 7.054 ± 1.8
LysSer: 2.713 ± 1.721
LysThr: 2.713 ± 1.253
LysVal: 1.628 ± 1.033
LysTrp: 0.0 ± 0.0
LysTyr: 1.628 ± 0.638
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 1.628 ± 0.86
LeuCys: 1.628 ± 0.63
LeuAsp: 5.969 ± 0.279
LeuGlu: 9.767 ± 1.461
LeuPhe: 1.628 ± 0.638
LeuGly: 4.883 ± 1.103
LeuHis: 0.543 ± 0.344
LeuIle: 3.256 ± 1.109
LeuLys: 3.256 ± 0.88
LeuLeu: 7.596 ± 3.397
LeuMet: 4.883 ± 1.925
LeuAsn: 8.139 ± 2.923
LeuPro: 5.426 ± 1.373
LeuGln: 3.256 ± 0.88
LeuArg: 6.511 ± 1.886
LeuSer: 5.426 ± 0.569
LeuThr: 3.256 ± 1.043
LeuVal: 3.798 ± 0.599
LeuTrp: 1.085 ± 0.553
LeuTyr: 5.969 ± 0.595
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.17 ± 0.389
MetCys: 3.798 ± 1.626
MetAsp: 2.713 ± 1.274
MetGlu: 0.0 ± 0.0
MetPhe: 0.543 ± 0.437
MetGly: 1.085 ± 0.447
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 1.628 ± 0.63
MetLeu: 1.628 ± 0.656
MetMet: 0.0 ± 0.0
MetAsn: 1.628 ± 0.638
MetPro: 1.085 ± 0.636
MetGln: 2.713 ± 1.274
MetArg: 2.17 ± 0.932
MetSer: 1.628 ± 0.656
MetThr: 0.543 ± 0.533
MetVal: 0.543 ± 0.344
MetTrp: 0.0 ± 0.0
MetTyr: 0.543 ± 0.344
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.969 ± 1.56
AsnCys: 1.085 ± 0.424
AsnAsp: 1.085 ± 0.688
AsnGlu: 4.341 ± 1.515
AsnPhe: 4.883 ± 0.823
AsnGly: 0.0 ± 0.0
AsnHis: 0.543 ± 0.533
AsnIle: 3.256 ± 0.509
AsnLys: 4.341 ± 0.711
AsnLeu: 4.883 ± 0.652
AsnMet: 0.543 ± 0.336
AsnAsn: 3.798 ± 0.571
AsnPro: 1.628 ± 0.789
AsnGln: 0.0 ± 0.0
AsnArg: 2.713 ± 1.253
AsnSer: 3.256 ± 0.265
AsnThr: 2.17 ± 0.963
AsnVal: 3.798 ± 1.022
AsnTrp: 1.628 ± 0.63
AsnTyr: 0.543 ± 0.533
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.628 ± 0.638
ProCys: 0.0 ± 0.0
ProAsp: 2.713 ± 1.339
ProGlu: 3.256 ± 0.782
ProPhe: 2.17 ± 1.377
ProGly: 2.713 ± 0.932
ProHis: 1.628 ± 0.727
ProIle: 0.543 ± 0.344
ProLys: 0.543 ± 0.344
ProLeu: 4.883 ± 1.726
ProMet: 0.543 ± 0.533
ProAsn: 1.628 ± 1.31
ProPro: 2.17 ± 0.849
ProGln: 2.713 ± 1.687
ProArg: 2.713 ± 1.274
ProSer: 2.713 ± 0.655
ProThr: 5.426 ± 1.267
ProVal: 4.341 ± 1.731
ProTrp: 0.0 ± 0.0
ProTyr: 3.256 ± 0.509
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.17 ± 0.894
GlnCys: 1.085 ± 0.553
GlnAsp: 2.17 ± 0.389
GlnGlu: 0.543 ± 0.344
GlnPhe: 1.085 ± 0.688
GlnGly: 2.713 ± 0.817
GlnHis: 0.0 ± 0.0
GlnIle: 2.713 ± 0.524
GlnLys: 1.085 ± 0.424
GlnLeu: 3.256 ± 1.219
GlnMet: 0.0 ± 0.0
GlnAsn: 2.713 ± 0.655
GlnPro: 2.713 ± 1.19
GlnGln: 4.341 ± 1.588
GlnArg: 2.17 ± 0.389
GlnSer: 3.256 ± 1.012
GlnThr: 2.713 ± 0.707
GlnVal: 3.798 ± 0.498
GlnTrp: 2.17 ± 1.105
GlnTyr: 3.256 ± 1.908
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.426 ± 2.218
ArgCys: 0.543 ± 0.437
ArgAsp: 3.798 ± 0.86
ArgGlu: 5.426 ± 1.267
ArgPhe: 2.17 ± 0.933
ArgGly: 2.713 ± 1.216
ArgHis: 1.628 ± 0.505
ArgIle: 4.883 ± 1.268
ArgLys: 2.713 ± 1.339
ArgLeu: 5.426 ± 0.912
ArgMet: 1.085 ± 0.553
ArgAsn: 4.341 ± 0.898
ArgPro: 1.628 ± 0.505
ArgGln: 1.628 ± 1.31
ArgArg: 6.511 ± 1.931
ArgSer: 4.883 ± 0.66
ArgThr: 4.341 ± 0.898
ArgVal: 3.256 ± 1.579
ArgTrp: 0.543 ± 0.533
ArgTyr: 0.543 ± 0.344
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.969 ± 2.529
SerCys: 1.628 ± 0.752
SerAsp: 3.256 ± 0.558
SerGlu: 4.341 ± 0.344
SerPhe: 2.17 ± 0.726
SerGly: 8.681 ± 3.116
SerHis: 0.0 ± 0.0
SerIle: 2.17 ± 0.933
SerLys: 3.256 ± 0.265
SerLeu: 6.511 ± 0.922
SerMet: 1.085 ± 0.549
SerAsn: 4.341 ± 1.289
SerPro: 0.543 ± 0.344
SerGln: 4.341 ± 2.754
SerArg: 4.341 ± 0.467
SerSer: 3.798 ± 1.258
SerThr: 7.596 ± 2.517
SerVal: 3.798 ± 0.409
SerTrp: 1.085 ± 0.636
SerTyr: 1.085 ± 0.636
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.341 ± 1.346
ThrCys: 0.543 ± 0.344
ThrAsp: 1.628 ± 1.005
ThrGlu: 2.713 ± 1.274
ThrPhe: 1.085 ± 0.709
ThrGly: 1.085 ± 0.873
ThrHis: 1.085 ± 0.447
ThrIle: 7.054 ± 1.397
ThrLys: 2.713 ± 0.655
ThrLeu: 5.426 ± 0.852
ThrMet: 2.17 ± 0.624
ThrAsn: 1.085 ± 0.636
ThrPro: 5.426 ± 1.677
ThrGln: 3.256 ± 1.034
ThrArg: 3.798 ± 0.86
ThrSer: 1.628 ± 0.638
ThrThr: 3.798 ± 0.593
ThrVal: 6.511 ± 2.37
ThrTrp: 0.543 ± 0.533
ThrTyr: 2.713 ± 1.096
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.798 ± 1.306
ValCys: 0.0 ± 0.0
ValAsp: 0.543 ± 0.437
ValGlu: 8.139 ± 1.967
ValPhe: 4.883 ± 1.082
ValGly: 5.969 ± 1.66
ValHis: 2.17 ± 0.726
ValIle: 3.798 ± 2.084
ValLys: 3.256 ± 1.224
ValLeu: 3.256 ± 1.262
ValMet: 0.0 ± 0.0
ValAsn: 2.713 ± 0.817
ValPro: 4.341 ± 0.743
ValGln: 4.883 ± 1.571
ValArg: 1.628 ± 0.656
ValSer: 7.596 ± 2.971
ValThr: 4.883 ± 1.772
ValVal: 5.969 ± 1.615
ValTrp: 0.543 ± 0.437
ValTyr: 2.17 ± 0.586
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.085 ± 0.873
TrpCys: 0.543 ± 0.344
TrpAsp: 0.543 ± 0.533
TrpGlu: 2.713 ± 1.027
TrpPhe: 0.543 ± 0.533
TrpGly: 2.713 ± 0.937
TrpHis: 0.0 ± 0.0
TrpIle: 1.085 ± 0.709
TrpLys: 1.628 ± 0.752
TrpLeu: 1.085 ± 0.636
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.0 ± 0.0
TrpSer: 1.628 ± 1.03
TrpThr: 0.0 ± 0.0
TrpVal: 2.17 ± 1.272
TrpTrp: 1.085 ± 0.553
TrpTyr: 0.543 ± 0.344
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.17 ± 0.532
TyrCys: 1.085 ± 0.553
TyrAsp: 0.0 ± 0.0
TyrGlu: 1.628 ± 0.727
TyrPhe: 1.085 ± 0.873
TyrGly: 1.628 ± 0.752
TyrHis: 1.085 ± 0.688
TyrIle: 1.085 ± 0.636
TyrLys: 1.628 ± 0.638
TyrLeu: 8.139 ± 1.643
TyrMet: 1.085 ± 0.688
TyrAsn: 2.713 ± 1.216
TyrPro: 0.0 ± 0.0
TyrGln: 2.713 ± 1.612
TyrArg: 2.17 ± 0.389
TyrSer: 4.341 ± 0.743
TyrThr: 1.085 ± 0.873
TyrVal: 0.543 ± 0.344
TyrTrp: 2.17 ± 0.586
TyrTyr: 1.085 ± 0.553
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1844 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
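A protein-level bootstrap of this kind can be sketched in Python as follows. The details are assumptions rather than the database's actual procedure: proteins are resampled with replacement, each replicate contains as many proteins as the original set, and the reported error is taken to be the standard deviation across replicates. The function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Per-mille frequency of a one-letter dipeptide such as 'AA'."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within each protein; no pairs across proteins.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumptions: whole proteins are drawn with replacement, each
    replicate has as many proteins as the input, and the error is the
    population standard deviation across replicates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.pstdev(estimates)


# Hypothetical toy sequences, not taken from the proteome on this page.
toy = ["MAAL", "AAGAA", "MKV"]
error = bootstrap_error(toy, "AA")
```

A dipeptide that never occurs in any protein has a frequency of zero in every resample, so its estimated error is also zero, which matches the `0.0 ± 0.0` entries in the table.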

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski