Amino acid dipepetide frequency for Porcine polyomavirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.124AlaAla: 6.124 ± 3.551
0.612AlaCys: 0.612 ± 0.562
1.837AlaAsp: 1.837 ± 0.858
3.674AlaGlu: 3.674 ± 1.087
1.225AlaPhe: 1.225 ± 0.466
4.287AlaGly: 4.287 ± 2.857
0.0AlaHis: 0.0 ± 0.0
3.674AlaIle: 3.674 ± 1.716
1.837AlaLys: 1.837 ± 1.187
5.511AlaLeu: 5.511 ± 1.431
0.612AlaMet: 0.612 ± 0.396
1.837AlaAsn: 1.837 ± 1.437
2.449AlaPro: 2.449 ± 2.247
2.449AlaGln: 2.449 ± 0.782
1.837AlaArg: 1.837 ± 0.653
3.674AlaSer: 3.674 ± 1.7
3.674AlaThr: 3.674 ± 2.069
6.124AlaVal: 6.124 ± 1.038
0.612AlaTrp: 0.612 ± 0.396
0.612AlaTyr: 0.612 ± 0.717
0.0AlaXaa: 0.0 ± 0.0
Cys
0.612CysAla: 0.612 ± 0.396
0.0CysCys: 0.0 ± 0.0
2.449CysAsp: 2.449 ± 0.828
0.0CysGlu: 0.0 ± 0.0
1.225CysPhe: 1.225 ± 0.753
1.837CysGly: 1.837 ± 1.417
0.0CysHis: 0.0 ± 0.0
0.612CysIle: 0.612 ± 0.717
3.674CysLys: 3.674 ± 1.977
2.449CysLeu: 2.449 ± 0.907
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.612CysGln: 0.612 ± 0.396
0.612CysArg: 0.612 ± 0.396
1.225CysSer: 1.225 ± 0.753
1.837CysThr: 1.837 ± 0.966
2.449CysVal: 2.449 ± 2.116
0.612CysTrp: 0.612 ± 0.562
1.225CysTyr: 1.225 ± 1.435
0.0CysXaa: 0.0 ± 0.0
Asp
1.225AspAla: 1.225 ± 0.466
1.837AspCys: 1.837 ± 1.449
3.674AspAsp: 3.674 ± 1.652
4.899AspGlu: 4.899 ± 0.134
3.062AspPhe: 3.062 ± 0.69
1.837AspGly: 1.837 ± 0.403
0.0AspHis: 0.0 ± 0.0
1.225AspIle: 1.225 ± 0.466
3.062AspLys: 3.062 ± 1.307
4.899AspLeu: 4.899 ± 0.776
2.449AspMet: 2.449 ± 1.322
1.837AspAsn: 1.837 ± 0.584
3.674AspPro: 3.674 ± 0.766
2.449AspGln: 2.449 ± 0.714
1.837AspArg: 1.837 ± 0.966
3.062AspSer: 3.062 ± 1.146
1.225AspThr: 1.225 ± 0.466
3.674AspVal: 3.674 ± 0.671
2.449AspTrp: 2.449 ± 0.789
3.062AspTyr: 3.062 ± 1.979
0.0AspXaa: 0.0 ± 0.0
Glu
4.899GluAla: 4.899 ± 2.063
0.612GluCys: 0.612 ± 0.396
2.449GluAsp: 2.449 ± 0.403
6.124GluGlu: 6.124 ± 3.227
1.225GluPhe: 1.225 ± 0.792
4.287GluGly: 4.287 ± 1.221
1.225GluHis: 1.225 ± 0.753
3.674GluIle: 3.674 ± 1.091
3.062GluLys: 3.062 ± 1.979
9.798GluLeu: 9.798 ± 1.4
0.612GluMet: 0.612 ± 0.396
3.674GluAsn: 3.674 ± 0.892
2.449GluPro: 2.449 ± 0.828
2.449GluGln: 2.449 ± 1.064
1.837GluArg: 1.837 ± 0.858
4.899GluSer: 4.899 ± 1.776
4.287GluThr: 4.287 ± 0.528
3.062GluVal: 3.062 ± 1.289
1.225GluTrp: 1.225 ± 0.753
3.674GluTyr: 3.674 ± 0.519
0.0GluXaa: 0.0 ± 0.0
Phe
3.062PheAla: 3.062 ± 0.931
0.612PheCys: 0.612 ± 0.396
2.449PheAsp: 2.449 ± 1.269
1.837PheGlu: 1.837 ± 1.187
1.225PhePhe: 1.225 ± 0.414
2.449PheGly: 2.449 ± 0.484
2.449PheHis: 2.449 ± 1.064
0.612PheIle: 0.612 ± 0.396
0.612PheLys: 0.612 ± 0.396
2.449PheLeu: 2.449 ± 0.907
0.612PheMet: 0.612 ± 0.396
1.837PheAsn: 1.837 ± 0.584
3.674PhePro: 3.674 ± 0.458
3.062PheGln: 3.062 ± 0.942
1.225PheArg: 1.225 ± 1.124
2.449PheSer: 2.449 ± 0.828
2.449PheThr: 2.449 ± 0.931
1.225PheVal: 1.225 ± 0.414
0.612PheTrp: 0.612 ± 0.562
0.612PheTyr: 0.612 ± 0.562
0.0PheXaa: 0.0 ± 0.0
Gly
4.287GlyAla: 4.287 ± 2.751
1.837GlyCys: 1.837 ± 0.966
3.674GlyAsp: 3.674 ± 1.962
7.348GlyGlu: 7.348 ± 1.453
1.225GlyPhe: 1.225 ± 0.792
12.247GlyGly: 12.247 ± 3.104
1.225GlyHis: 1.225 ± 0.958
4.899GlyIle: 4.899 ± 1.996
4.287GlyLys: 4.287 ± 1.603
6.736GlyLeu: 6.736 ± 3.469
1.225GlyMet: 1.225 ± 1.124
4.287GlyAsn: 4.287 ± 1.019
3.062GlyPro: 3.062 ± 1.058
3.674GlyGln: 3.674 ± 1.4
2.449GlyArg: 2.449 ± 1.716
4.287GlySer: 4.287 ± 1.149
4.287GlyThr: 4.287 ± 2.007
7.961GlyVal: 7.961 ± 3.105
0.0GlyTrp: 0.0 ± 0.0
1.837GlyTyr: 1.837 ± 0.808
0.0GlyXaa: 0.0 ± 0.0
His
0.612HisAla: 0.612 ± 0.562
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
1.225HisGlu: 1.225 ± 0.792
1.225HisPhe: 1.225 ± 0.69
1.225HisGly: 1.225 ± 0.466
0.612HisHis: 0.612 ± 0.479
0.612HisIle: 0.612 ± 0.717
0.612HisLys: 0.612 ± 0.717
3.062HisLeu: 3.062 ± 1.307
1.225HisMet: 1.225 ± 0.792
0.0HisAsn: 0.0 ± 0.0
1.837HisPro: 1.837 ± 0.653
0.0HisGln: 0.0 ± 0.0
1.837HisArg: 1.837 ± 1.187
1.225HisSer: 1.225 ± 0.414
3.062HisThr: 3.062 ± 0.361
0.0HisVal: 0.0 ± 0.0
0.612HisTrp: 0.612 ± 0.479
1.225HisTyr: 1.225 ± 0.466
0.0HisXaa: 0.0 ± 0.0
Ile
1.837IleAla: 1.837 ± 1.437
0.612IleCys: 0.612 ± 0.396
1.837IleAsp: 1.837 ± 1.187
1.225IleGlu: 1.225 ± 0.958
1.225IlePhe: 1.225 ± 0.792
4.287IleGly: 4.287 ± 1.74
2.449IleHis: 2.449 ± 0.977
0.0IleIle: 0.0 ± 0.0
4.287IleLys: 4.287 ± 1.11
3.674IleLeu: 3.674 ± 0.458
0.612IleMet: 0.612 ± 0.619
2.449IleAsn: 2.449 ± 0.907
4.899IlePro: 4.899 ± 1.996
1.225IleGln: 1.225 ± 0.958
1.837IleArg: 1.837 ± 0.403
1.837IleSer: 1.837 ± 1.046
1.225IleThr: 1.225 ± 0.792
3.674IleVal: 3.674 ± 0.766
0.612IleTrp: 0.612 ± 0.396
1.837IleTyr: 1.837 ± 0.966
0.0IleXaa: 0.0 ± 0.0
Lys
1.837LysAla: 1.837 ± 0.966
2.449LysCys: 2.449 ± 1.506
1.225LysAsp: 1.225 ± 0.753
5.511LysGlu: 5.511 ± 2.876
0.612LysPhe: 0.612 ± 0.396
5.511LysGly: 5.511 ± 1.352
2.449LysHis: 2.449 ± 1.269
1.225LysIle: 1.225 ± 0.753
4.899LysLys: 4.899 ± 1.302
3.062LysLeu: 3.062 ± 1.614
0.612LysMet: 0.612 ± 0.396
3.062LysAsn: 3.062 ± 0.931
2.449LysPro: 2.449 ± 1.447
1.225LysGln: 1.225 ± 0.69
5.511LysArg: 5.511 ± 1.126
2.449LysSer: 2.449 ± 0.86
3.674LysThr: 3.674 ± 0.766
4.287LysVal: 4.287 ± 1.318
1.225LysTrp: 1.225 ± 1.435
1.225LysTyr: 1.225 ± 0.792
0.0LysXaa: 0.0 ± 0.0
Leu
7.348LeuAla: 7.348 ± 3.4
3.062LeuCys: 3.062 ± 1.614
6.736LeuAsp: 6.736 ± 0.884
7.961LeuGlu: 7.961 ± 2.802
3.674LeuPhe: 3.674 ± 1.167
8.573LeuGly: 8.573 ± 2.662
1.837LeuHis: 1.837 ± 0.403
4.899LeuIle: 4.899 ± 0.97
3.062LeuLys: 3.062 ± 0.931
11.635LeuLeu: 11.635 ± 1.05
6.124LeuMet: 6.124 ± 2.643
4.899LeuAsn: 4.899 ± 1.578
7.961LeuPro: 7.961 ± 1.315
3.062LeuGln: 3.062 ± 1.058
5.511LeuArg: 5.511 ± 0.757
4.287LeuSer: 4.287 ± 1.564
4.287LeuThr: 4.287 ± 1.089
4.899LeuVal: 4.899 ± 1.557
0.612LeuTrp: 0.612 ± 0.479
6.736LeuTyr: 6.736 ± 0.948
0.0LeuXaa: 0.0 ± 0.0
Met
0.612MetAla: 0.612 ± 0.396
1.837MetCys: 1.837 ± 0.966
2.449MetAsp: 2.449 ± 1.269
0.612MetGlu: 0.612 ± 0.479
0.612MetPhe: 0.612 ± 0.717
1.837MetGly: 1.837 ± 0.719
0.0MetHis: 0.0 ± 0.0
1.225MetIle: 1.225 ± 0.414
1.225MetLys: 1.225 ± 0.753
1.837MetLeu: 1.837 ± 0.667
0.0MetMet: 0.0 ± 0.0
2.449MetAsn: 2.449 ± 0.907
0.0MetPro: 0.0 ± 0.0
1.837MetGln: 1.837 ± 0.966
1.225MetArg: 1.225 ± 0.753
2.449MetSer: 2.449 ± 0.86
1.225MetThr: 1.225 ± 0.414
1.225MetVal: 1.225 ± 0.414
0.0MetTrp: 0.0 ± 0.0
0.612MetTyr: 0.612 ± 0.396
0.0MetXaa: 0.0 ± 0.0
Asn
3.062AsnAla: 3.062 ± 1.233
0.0AsnCys: 0.0 ± 0.0
3.674AsnAsp: 3.674 ± 1.167
2.449AsnGlu: 2.449 ± 0.714
1.225AsnPhe: 1.225 ± 0.414
3.674AsnGly: 3.674 ± 1.16
0.0AsnHis: 0.0 ± 0.0
3.062AsnIle: 3.062 ± 1.272
3.674AsnLys: 3.674 ± 1.05
7.348AsnLeu: 7.348 ± 1.164
0.0AsnMet: 0.0 ± 0.0
2.449AsnAsn: 2.449 ± 1.269
1.225AsnPro: 1.225 ± 0.69
1.837AsnGln: 1.837 ± 0.667
2.449AsnArg: 2.449 ± 0.907
4.899AsnSer: 4.899 ± 0.804
2.449AsnThr: 2.449 ± 1.692
1.837AsnVal: 1.837 ± 0.584
0.612AsnTrp: 0.612 ± 0.396
0.612AsnTyr: 0.612 ± 0.396
0.0AsnXaa: 0.0 ± 0.0
Pro
0.0ProAla: 0.0 ± 0.0
0.612ProCys: 0.612 ± 0.562
4.287ProAsp: 4.287 ± 0.499
5.511ProGlu: 5.511 ± 2.009
1.837ProPhe: 1.837 ± 1.187
2.449ProGly: 2.449 ± 1.309
0.612ProHis: 0.612 ± 0.479
0.612ProIle: 0.612 ± 0.396
3.674ProLys: 3.674 ± 0.766
5.511ProLeu: 5.511 ± 2.968
0.0ProMet: 0.0 ± 0.0
3.674ProAsn: 3.674 ± 1.946
4.899ProPro: 4.899 ± 1.445
3.062ProGln: 3.062 ± 1.395
0.0ProArg: 0.0 ± 0.0
3.062ProSer: 3.062 ± 0.361
5.511ProThr: 5.511 ± 2.248
4.287ProVal: 4.287 ± 0.702
0.0ProTrp: 0.0 ± 0.0
2.449ProTyr: 2.449 ± 1.035
0.0ProXaa: 0.0 ± 0.0
Gln
3.674GlnAla: 3.674 ± 0.766
1.837GlnCys: 1.837 ± 0.966
0.612GlnAsp: 0.612 ± 0.479
3.062GlnGlu: 3.062 ± 1.706
2.449GlnPhe: 2.449 ± 0.828
3.062GlnGly: 3.062 ± 0.821
1.225GlnHis: 1.225 ± 0.414
3.062GlnIle: 3.062 ± 0.666
1.837GlnLys: 1.837 ± 0.584
3.674GlnLeu: 3.674 ± 1.174
1.225GlnMet: 1.225 ± 0.723
0.612GlnAsn: 0.612 ± 0.562
2.449GlnPro: 2.449 ± 0.86
1.837GlnGln: 1.837 ± 0.719
4.287GlnArg: 4.287 ± 1.805
3.062GlnSer: 3.062 ± 0.662
4.899GlnThr: 4.899 ± 0.804
3.062GlnVal: 3.062 ± 0.821
1.225GlnTrp: 1.225 ± 0.737
1.225GlnTyr: 1.225 ± 0.466
0.0GlnXaa: 0.0 ± 0.0
Arg
0.612ArgAla: 0.612 ± 0.562
0.0ArgCys: 0.0 ± 0.0
1.837ArgAsp: 1.837 ± 1.187
2.449ArgGlu: 2.449 ± 0.714
1.837ArgPhe: 1.837 ± 0.584
2.449ArgGly: 2.449 ± 1.035
1.837ArgHis: 1.837 ± 0.966
3.062ArgIle: 3.062 ± 0.931
4.287ArgLys: 4.287 ± 1.857
7.961ArgLeu: 7.961 ± 0.69
0.0ArgMet: 0.0 ± 0.41
1.837ArgAsn: 1.837 ± 0.584
0.612ArgPro: 0.612 ± 0.717
1.837ArgGln: 1.837 ± 1.046
7.961ArgArg: 7.961 ± 1.549
1.225ArgSer: 1.225 ± 0.792
2.449ArgThr: 2.449 ± 1.309
6.124ArgVal: 6.124 ± 1.625
1.225ArgTrp: 1.225 ± 0.753
2.449ArgTyr: 2.449 ± 0.714
0.0ArgXaa: 0.0 ± 0.0
Ser
3.674SerAla: 3.674 ± 1.4
1.225SerCys: 1.225 ± 0.833
3.062SerAsp: 3.062 ± 1.146
1.225SerGlu: 1.225 ± 0.466
2.449SerPhe: 2.449 ± 0.863
7.348SerGly: 7.348 ± 3.906
1.225SerHis: 1.225 ± 0.414
2.449SerIle: 2.449 ± 0.907
1.225SerLys: 1.225 ± 0.753
3.674SerLeu: 3.674 ± 1.503
1.837SerMet: 1.837 ± 0.808
2.449SerAsn: 2.449 ± 0.714
1.225SerPro: 1.225 ± 0.414
3.674SerGln: 3.674 ± 0.582
5.511SerArg: 5.511 ± 1.514
1.837SerSer: 1.837 ± 0.584
4.899SerThr: 4.899 ± 0.804
4.287SerVal: 4.287 ± 1.429
0.612SerTrp: 0.612 ± 0.562
0.612SerTyr: 0.612 ± 0.479
0.0SerXaa: 0.0 ± 0.0
Thr
3.062ThrAla: 3.062 ± 1.233
0.612ThrCys: 0.612 ± 0.396
1.225ThrAsp: 1.225 ± 0.69
4.287ThrGlu: 4.287 ± 2.501
3.062ThrPhe: 3.062 ± 0.662
4.287ThrGly: 4.287 ± 2.383
0.612ThrHis: 0.612 ± 0.479
3.062ThrIle: 3.062 ± 1.447
1.837ThrLys: 1.837 ± 0.584
7.348ThrLeu: 7.348 ± 1.367
3.062ThrMet: 3.062 ± 0.69
0.612ThrAsn: 0.612 ± 0.562
4.899ThrPro: 4.899 ± 0.631
6.124ThrGln: 6.124 ± 2.07
3.062ThrArg: 3.062 ± 1.058
2.449ThrSer: 2.449 ± 0.86
3.674ThrThr: 3.674 ± 1.16
6.124ThrVal: 6.124 ± 1.948
1.225ThrTrp: 1.225 ± 0.753
3.674ThrTyr: 3.674 ± 0.458
0.0ThrXaa: 0.0 ± 0.0
Val
4.899ValAla: 4.899 ± 1.202
0.0ValCys: 0.0 ± 0.0
4.287ValAsp: 4.287 ± 2.007
4.287ValGlu: 4.287 ± 1.219
3.062ValPhe: 3.062 ± 0.931
4.287ValGly: 4.287 ± 2.501
0.612ValHis: 0.612 ± 0.717
1.837ValIle: 1.837 ± 0.403
3.062ValLys: 3.062 ± 0.942
10.41ValLeu: 10.41 ± 2.109
0.612ValMet: 0.612 ± 0.396
6.124ValAsn: 6.124 ± 1.379
2.449ValPro: 2.449 ± 0.828
6.124ValGln: 6.124 ± 2.222
1.837ValArg: 1.837 ± 0.403
2.449ValSer: 2.449 ± 0.403
4.899ValThr: 4.899 ± 2.894
2.449ValVal: 2.449 ± 0.828
2.449ValTrp: 2.449 ± 1.322
3.674ValTyr: 3.674 ± 1.398
0.0ValXaa: 0.0 ± 0.0
Trp
0.612TrpAla: 0.612 ± 0.562
1.225TrpCys: 1.225 ± 0.753
2.449TrpAsp: 2.449 ± 1.506
1.225TrpGlu: 1.225 ± 0.414
1.225TrpPhe: 1.225 ± 0.753
1.837TrpGly: 1.837 ± 1.417
0.0TrpHis: 0.0 ± 0.0
0.612TrpIle: 0.612 ± 0.479
1.225TrpLys: 1.225 ± 0.792
1.225TrpLeu: 1.225 ± 0.414
0.612TrpMet: 0.612 ± 0.562
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
1.225TrpSer: 1.225 ± 0.753
1.837TrpThr: 1.837 ± 0.653
1.837TrpVal: 1.837 ± 1.015
1.225TrpTrp: 1.225 ± 0.753
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.612TyrAla: 0.612 ± 0.396
2.449TyrCys: 2.449 ± 1.506
1.225TyrAsp: 1.225 ± 0.792
0.612TyrGlu: 0.612 ± 0.717
1.837TyrPhe: 1.837 ± 0.904
3.062TyrGly: 3.062 ± 1.307
1.837TyrHis: 1.837 ± 0.966
1.225TyrIle: 1.225 ± 0.466
3.062TyrLys: 3.062 ± 0.821
4.899TyrLeu: 4.899 ± 1.578
1.225TyrMet: 1.225 ± 0.792
2.449TyrAsn: 2.449 ± 0.789
1.837TyrPro: 1.837 ± 1.046
1.837TyrGln: 1.837 ± 0.858
1.837TyrArg: 1.837 ± 0.808
2.449TyrSer: 2.449 ± 0.403
2.449TyrThr: 2.449 ± 0.828
1.225TyrVal: 1.225 ± 0.69
1.225TyrTrp: 1.225 ± 0.792
2.449TyrTyr: 2.449 ± 0.863
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1634 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski