Amino acid dipeptide frequency for Meles meles polyomavirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random, with equal probability, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
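The calculation described above can be sketched in a few lines of Python. This is only a minimal illustration, not the code used by the database: the function names are hypothetical, and it assumes that overlapping pairs are counted within each protein, that pairs never span two proteins, and that any non-standard letter is mapped to X (Xaa).

```python
from collections import Counter

# The 20 standard amino acids (one-letter codes); anything else becomes X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Expected frequency of each dipeptide if all 400 were equally likely, in per mille.
UNIFORM_EXPECTED = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} pooled over all sequences.

    Assumption: overlapping pairs are counted inside each sequence,
    and no pair is formed across the boundary between two sequences.
    """
    counts = Counter()
    for seq in sequences:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_permille > UNIFORM_EXPECTED:
        return "over"   # would be marked in red
    if freq_permille < UNIFORM_EXPECTED:
        return "under"  # would be marked in blue
    return "expected"
```

For example, `classify(8.269)` labels the AlaAla value from the table as overrepresented, while `classify(0.517)` labels AlaTyr as underrepresented.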

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
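The unit conversion can be made explicit with a short sketch. The helper names are hypothetical and serve only to illustrate the arithmetic: a per mille value is multiplied by 10⁻³ to get a fraction, or divided by 10 to get a percentage.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value_permille / 10.0


# LeuLeu is listed at 14.47 per mille, i.e. roughly 1.447% of all dipeptides.
leu_leu_fraction = permille_to_fraction(14.47)
leu_leu_percent = permille_to_percent(14.47)
```

With these helpers, the uniform expectation of 2.5‰ corresponds to the 0.25% quoted above.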

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
8.269AlaAla: 8.269 ± 3.182
0.0AlaCys: 0.0 ± 0.0
1.55AlaAsp: 1.55 ± 0.89
3.101AlaGlu: 3.101 ± 1.171
1.55AlaPhe: 1.55 ± 0.817
4.134AlaGly: 4.134 ± 1.18
0.0AlaHis: 0.0 ± 0.0
3.618AlaIle: 3.618 ± 2.168
3.101AlaLys: 3.101 ± 0.615
8.786AlaLeu: 8.786 ± 2.848
0.0AlaMet: 0.0 ± 0.0
1.55AlaAsn: 1.55 ± 0.646
6.202AlaPro: 6.202 ± 1.321
3.101AlaGln: 3.101 ± 0.975
4.134AlaArg: 4.134 ± 1.73
3.618AlaSer: 3.618 ± 1.062
5.685AlaThr: 5.685 ± 2.1
4.134AlaVal: 4.134 ± 1.406
1.55AlaTrp: 1.55 ± 0.817
0.517AlaTyr: 0.517 ± 0.499
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
1.034CysCys: 1.034 ± 0.377
1.034CysAsp: 1.034 ± 0.377
0.0CysGlu: 0.0 ± 0.0
1.034CysPhe: 1.034 ± 0.377
2.067CysGly: 2.067 ± 1.133
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
2.067CysLys: 2.067 ± 0.754
1.55CysLeu: 1.55 ± 0.817
0.517CysMet: 0.517 ± 0.388
1.034CysAsn: 1.034 ± 0.377
0.0CysPro: 0.0 ± 0.0
2.067CysGln: 2.067 ± 1.183
2.067CysArg: 2.067 ± 1.183
1.034CysSer: 1.034 ± 0.776
0.517CysThr: 0.517 ± 0.388
0.517CysVal: 0.517 ± 0.388
0.517CysTrp: 0.517 ± 0.409
1.55CysTyr: 1.55 ± 1.734
0.0CysXaa: 0.0 ± 0.0
Asp
1.55AspAla: 1.55 ± 0.536
0.517AspCys: 0.517 ± 0.388
2.584AspAsp: 2.584 ± 1.368
6.202AspGlu: 6.202 ± 0.71
1.034AspPhe: 1.034 ± 0.592
4.134AspGly: 4.134 ± 0.707
0.0AspHis: 0.0 ± 0.0
2.584AspIle: 2.584 ± 0.855
4.651AspLys: 4.651 ± 0.465
8.269AspLeu: 8.269 ± 2.201
0.517AspMet: 0.517 ± 0.409
2.067AspAsn: 2.067 ± 1.064
4.134AspPro: 4.134 ± 1.751
2.067AspGln: 2.067 ± 0.997
1.55AspArg: 1.55 ± 0.536
3.618AspSer: 3.618 ± 1.344
2.584AspThr: 2.584 ± 1.285
4.134AspVal: 4.134 ± 1.474
1.55AspTrp: 1.55 ± 0.536
2.067AspTyr: 2.067 ± 0.644
0.0AspXaa: 0.0 ± 0.0
Glu
3.618GluAla: 3.618 ± 0.865
1.55GluCys: 1.55 ± 0.646
5.168GluAsp: 5.168 ± 1.201
11.37GluGlu: 11.37 ± 1.968
3.101GluPhe: 3.101 ± 1.02
4.651GluGly: 4.651 ± 0.949
1.034GluHis: 1.034 ± 0.819
4.134GluIle: 4.134 ± 1.768
6.718GluLys: 6.718 ± 2.11
5.168GluLeu: 5.168 ± 0.678
1.034GluMet: 1.034 ± 0.592
6.202GluAsn: 6.202 ± 0.903
2.067GluPro: 2.067 ± 0.997
1.55GluGln: 1.55 ± 0.536
3.101GluArg: 3.101 ± 0.412
4.651GluSer: 4.651 ± 0.282
3.101GluThr: 3.101 ± 0.858
5.168GluVal: 5.168 ± 0.946
0.0GluTrp: 0.0 ± 0.0
2.067GluTyr: 2.067 ± 0.752
0.0GluXaa: 0.0 ± 0.0
Phe
2.067PheAla: 2.067 ± 0.416
0.517PheCys: 0.517 ± 0.388
1.55PheAsp: 1.55 ± 0.817
5.168PheGlu: 5.168 ± 1.582
1.034PhePhe: 1.034 ± 0.678
3.101PheGly: 3.101 ± 0.478
0.0PheHis: 0.0 ± 0.0
0.0PheIle: 0.0 ± 0.0
2.067PheLys: 2.067 ± 0.633
4.134PheLeu: 4.134 ± 1.165
1.034PheMet: 1.034 ± 0.776
2.067PheAsn: 2.067 ± 0.531
2.584PhePro: 2.584 ± 0.353
0.517PheGln: 0.517 ± 0.388
2.584PheArg: 2.584 ± 0.686
3.618PheSer: 3.618 ± 0.992
1.034PheThr: 1.034 ± 0.594
2.067PheVal: 2.067 ± 0.644
1.034PheTrp: 1.034 ± 0.678
0.517PheTyr: 0.517 ± 0.578
0.0PheXaa: 0.0 ± 0.0
Gly
8.786GlyAla: 8.786 ± 2.482
1.034GlyCys: 1.034 ± 0.776
6.202GlyAsp: 6.202 ± 0.871
4.134GlyGlu: 4.134 ± 1.112
3.618GlyPhe: 3.618 ± 0.63
6.202GlyGly: 6.202 ± 1.088
2.584GlyHis: 2.584 ± 0.686
3.618GlyIle: 3.618 ± 0.926
2.067GlyLys: 2.067 ± 0.997
8.786GlyLeu: 8.786 ± 0.835
1.034GlyMet: 1.034 ± 0.592
1.55GlyAsn: 1.55 ± 0.817
2.584GlyPro: 2.584 ± 0.353
2.067GlyGln: 2.067 ± 0.416
1.55GlyArg: 1.55 ± 0.722
3.101GlySer: 3.101 ± 0.97
4.651GlyThr: 4.651 ± 1.522
4.134GlyVal: 4.134 ± 0.526
0.0GlyTrp: 0.0 ± 0.0
2.067GlyTyr: 2.067 ± 0.958
0.0GlyXaa: 0.0 ± 0.0
His
1.55HisAla: 1.55 ± 0.722
0.517HisCys: 0.517 ± 0.388
0.0HisAsp: 0.0 ± 0.0
1.55HisGlu: 1.55 ± 0.646
0.0HisPhe: 0.0 ± 0.0
0.0HisGly: 0.0 ± 0.0
0.517HisHis: 0.517 ± 0.388
2.067HisIle: 2.067 ± 1.023
0.517HisLys: 0.517 ± 0.388
0.517HisLeu: 0.517 ± 0.388
0.517HisMet: 0.517 ± 0.409
0.517HisAsn: 0.517 ± 0.388
2.584HisPro: 2.584 ± 1.042
0.0HisGln: 0.0 ± 0.0
1.55HisArg: 1.55 ± 0.536
1.55HisSer: 1.55 ± 0.89
0.517HisThr: 0.517 ± 0.388
2.584HisVal: 2.584 ± 1.34
0.0HisTrp: 0.0 ± 0.0
1.55HisTyr: 1.55 ± 0.536
0.0HisXaa: 0.0 ± 0.0
Ile
2.067IleAla: 2.067 ± 1.381
1.034IleCys: 1.034 ± 0.776
3.101IleAsp: 3.101 ± 0.615
6.718IleGlu: 6.718 ± 1.908
1.55IlePhe: 1.55 ± 1.164
1.55IleGly: 1.55 ± 0.722
0.0IleHis: 0.0 ± 0.0
2.584IleIle: 2.584 ± 0.836
2.067IleLys: 2.067 ± 0.633
4.651IleLeu: 4.651 ± 1.375
0.517IleMet: 0.517 ± 0.409
3.101IleAsn: 3.101 ± 0.433
2.584IlePro: 2.584 ± 0.472
1.55IleGln: 1.55 ± 1.164
1.55IleArg: 1.55 ± 0.722
2.067IleSer: 2.067 ± 0.492
3.101IleThr: 3.101 ± 0.818
1.55IleVal: 1.55 ± 0.685
0.517IleTrp: 0.517 ± 0.578
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
4.651LysAla: 4.651 ± 1.089
0.517LysCys: 0.517 ± 0.578
1.55LysAsp: 1.55 ± 0.685
5.168LysGlu: 5.168 ± 1.331
1.55LysPhe: 1.55 ± 0.646
6.718LysGly: 6.718 ± 1.739
2.584LysHis: 2.584 ± 0.748
2.067LysIle: 2.067 ± 0.754
4.134LysLys: 4.134 ± 1.984
6.202LysLeu: 6.202 ± 1.713
0.517LysMet: 0.517 ± 0.388
3.618LysAsn: 3.618 ± 0.388
1.034LysPro: 1.034 ± 0.377
0.0LysGln: 0.0 ± 0.0
8.269LysArg: 8.269 ± 1.172
3.101LysSer: 3.101 ± 1.85
2.584LysThr: 2.584 ± 0.918
2.067LysVal: 2.067 ± 0.754
0.517LysTrp: 0.517 ± 0.388
3.101LysTyr: 3.101 ± 0.615
0.0LysXaa: 0.0 ± 0.0
Leu
4.134LeuAla: 4.134 ± 1.023
3.618LeuCys: 3.618 ± 1.082
4.651LeuAsp: 4.651 ± 1.141
6.202LeuGlu: 6.202 ± 1.109
7.235LeuPhe: 7.235 ± 1.08
4.651LeuGly: 4.651 ± 1.472
0.517LeuHis: 0.517 ± 0.388
3.101LeuIle: 3.101 ± 0.615
3.618LeuLys: 3.618 ± 0.869
14.47LeuLeu: 14.47 ± 1.526
3.101LeuMet: 3.101 ± 1.166
5.168LeuAsn: 5.168 ± 1.456
7.752LeuPro: 7.752 ± 1.757
7.235LeuGln: 7.235 ± 2.442
5.168LeuArg: 5.168 ± 1.57
7.235LeuSer: 7.235 ± 2.208
6.202LeuThr: 6.202 ± 1.321
2.067LeuVal: 2.067 ± 1.059
0.0LeuTrp: 0.0 ± 0.0
4.651LeuTyr: 4.651 ± 1.461
0.0LeuXaa: 0.0 ± 0.0
Met
2.067MetAla: 2.067 ± 0.752
1.034MetCys: 1.034 ± 1.156
2.584MetAsp: 2.584 ± 0.918
0.517MetGlu: 0.517 ± 0.388
0.517MetPhe: 0.517 ± 0.409
1.034MetGly: 1.034 ± 0.488
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
1.55MetLys: 1.55 ± 0.685
0.0MetLeu: 0.0 ± 0.0
0.0MetMet: 0.0 ± 0.0
0.517MetAsn: 0.517 ± 0.388
1.034MetPro: 1.034 ± 0.678
2.584MetGln: 2.584 ± 1.372
1.034MetArg: 1.034 ± 0.592
0.517MetSer: 0.517 ± 0.388
1.55MetThr: 1.55 ± 0.917
0.517MetVal: 0.517 ± 0.388
0.517MetTrp: 0.517 ± 0.409
2.584MetTyr: 2.584 ± 0.918
0.0MetXaa: 0.0 ± 0.0
Asn
2.067AsnAla: 2.067 ± 1.133
0.517AsnCys: 0.517 ± 0.388
1.55AsnAsp: 1.55 ± 0.646
1.55AsnGlu: 1.55 ± 0.685
1.55AsnPhe: 1.55 ± 0.817
2.584AsnGly: 2.584 ± 1.027
1.034AsnHis: 1.034 ± 0.678
1.55AsnIle: 1.55 ± 0.646
4.134AsnLys: 4.134 ± 0.791
2.584AsnLeu: 2.584 ± 0.608
0.0AsnMet: 0.0 ± 0.0
0.0AsnAsn: 0.0 ± 0.0
2.584AsnPro: 2.584 ± 1.081
1.55AsnGln: 1.55 ± 1.164
1.55AsnArg: 1.55 ± 0.536
4.651AsnSer: 4.651 ± 1.53
2.584AsnThr: 2.584 ± 0.918
2.067AsnVal: 2.067 ± 1.296
0.0AsnTrp: 0.0 ± 0.0
3.101AsnTyr: 3.101 ± 0.863
0.0AsnXaa: 0.0 ± 0.0
Pro
1.034ProAla: 1.034 ± 0.819
0.517ProCys: 0.517 ± 0.409
6.202ProAsp: 6.202 ± 1.528
0.517ProGlu: 0.517 ± 0.388
2.067ProPhe: 2.067 ± 0.416
4.134ProGly: 4.134 ± 1.73
0.517ProHis: 0.517 ± 0.409
2.067ProIle: 2.067 ± 0.754
5.685ProLys: 5.685 ± 1.754
3.618ProLeu: 3.618 ± 2.03
1.55ProMet: 1.55 ± 0.51
0.517ProAsn: 0.517 ± 0.578
7.235ProPro: 7.235 ± 2.414
5.168ProGln: 5.168 ± 1.46
2.067ProArg: 2.067 ± 1.252
6.202ProSer: 6.202 ± 1.705
3.618ProThr: 3.618 ± 1.357
5.685ProVal: 5.685 ± 0.752
0.0ProTrp: 0.0 ± 0.0
3.101ProTyr: 3.101 ± 0.412
0.0ProXaa: 0.0 ± 0.0
Gln
3.101GlnAla: 3.101 ± 1.486
0.0GlnCys: 0.0 ± 0.0
5.685GlnAsp: 5.685 ± 1.266
0.517GlnGlu: 0.517 ± 0.388
1.55GlnPhe: 1.55 ± 0.685
1.55GlnGly: 1.55 ± 0.646
0.0GlnHis: 0.0 ± 0.0
5.168GlnIle: 5.168 ± 1.38
4.134GlnLys: 4.134 ± 0.305
5.168GlnLeu: 5.168 ± 1.587
1.034GlnMet: 1.034 ± 0.592
1.55GlnAsn: 1.55 ± 0.536
1.55GlnPro: 1.55 ± 0.685
0.517GlnGln: 0.517 ± 0.388
2.067GlnArg: 2.067 ± 0.924
1.55GlnSer: 1.55 ± 1.164
3.618GlnThr: 3.618 ± 1.142
3.101GlnVal: 3.101 ± 1.648
0.517GlnTrp: 0.517 ± 0.578
2.584GlnTyr: 2.584 ± 0.627
0.0GlnXaa: 0.0 ± 0.0
Arg
2.067ArgAla: 2.067 ± 1.551
2.067ArgCys: 2.067 ± 1.133
1.55ArgAsp: 1.55 ± 0.536
3.101ArgGlu: 3.101 ± 0.924
2.067ArgPhe: 2.067 ± 0.644
2.067ArgGly: 2.067 ± 1.047
3.101ArgHis: 3.101 ± 1.06
1.034ArgIle: 1.034 ± 0.776
4.651ArgLys: 4.651 ± 1.455
4.134ArgLeu: 4.134 ± 1.847
1.55ArgMet: 1.55 ± 0.509
2.584ArgAsn: 2.584 ± 1.081
2.584ArgPro: 2.584 ± 0.627
5.168ArgGln: 5.168 ± 2.68
4.134ArgArg: 4.134 ± 1.673
5.685ArgSer: 5.685 ± 2.463
4.134ArgThr: 4.134 ± 0.672
2.584ArgVal: 2.584 ± 0.627
1.55ArgTrp: 1.55 ± 0.536
3.101ArgTyr: 3.101 ± 0.975
0.0ArgXaa: 0.0 ± 0.0
Ser
4.134SerAla: 4.134 ± 0.526
2.067SerCys: 2.067 ± 0.711
2.067SerAsp: 2.067 ± 0.633
5.168SerGlu: 5.168 ± 0.929
2.067SerPhe: 2.067 ± 0.492
4.134SerGly: 4.134 ± 1.412
2.067SerHis: 2.067 ± 0.416
2.584SerIle: 2.584 ± 0.472
2.584SerLys: 2.584 ± 1.026
5.168SerLeu: 5.168 ± 1.624
1.034SerMet: 1.034 ± 0.608
2.584SerAsn: 2.584 ± 0.985
4.134SerPro: 4.134 ± 1.197
4.134SerGln: 4.134 ± 0.861
7.235SerArg: 7.235 ± 1.913
2.067SerSer: 2.067 ± 0.633
5.168SerThr: 5.168 ± 1.495
2.584SerVal: 2.584 ± 1.31
2.584SerTrp: 2.584 ± 1.372
1.55SerTyr: 1.55 ± 1.228
0.0SerXaa: 0.0 ± 0.0
Thr
3.618ThrAla: 3.618 ± 1.761
0.0ThrCys: 0.0 ± 0.0
5.168ThrAsp: 5.168 ± 1.061
5.168ThrGlu: 5.168 ± 2.174
2.067ThrPhe: 2.067 ± 0.752
4.651ThrGly: 4.651 ± 3.235
1.55ThrHis: 1.55 ± 0.727
1.55ThrIle: 1.55 ± 0.89
0.0ThrLys: 0.0 ± 0.0
6.202ThrLeu: 6.202 ± 2.435
2.067ThrMet: 2.067 ± 0.704
0.517ThrAsn: 0.517 ± 0.388
7.235ThrPro: 7.235 ± 2.194
1.55ThrGln: 1.55 ± 1.164
3.618ThrArg: 3.618 ± 1.997
3.618ThrSer: 3.618 ± 1.142
4.651ThrThr: 4.651 ± 1.669
4.134ThrVal: 4.134 ± 0.305
1.55ThrTrp: 1.55 ± 0.935
2.067ThrTyr: 2.067 ± 0.997
0.0ThrXaa: 0.0 ± 0.0
Val
5.168ValAla: 5.168 ± 1.201
0.517ValCys: 0.517 ± 0.388
3.101ValAsp: 3.101 ± 1.448
5.168ValGlu: 5.168 ± 1.027
1.034ValPhe: 1.034 ± 0.776
4.651ValGly: 4.651 ± 2.055
0.517ValHis: 0.517 ± 0.409
2.067ValIle: 2.067 ± 1.064
4.134ValLys: 4.134 ± 0.791
5.168ValLeu: 5.168 ± 1.54
1.034ValMet: 1.034 ± 0.592
2.067ValAsn: 2.067 ± 0.752
1.55ValPro: 1.55 ± 0.89
2.067ValGln: 2.067 ± 0.416
1.034ValArg: 1.034 ± 0.819
4.651ValSer: 4.651 ± 1.221
4.134ValThr: 4.134 ± 0.545
2.584ValVal: 2.584 ± 0.985
3.101ValTrp: 3.101 ± 1.869
0.517ValTyr: 0.517 ± 0.409
0.0ValXaa: 0.0 ± 0.0
Trp
1.034TrpAla: 1.034 ± 0.678
0.0TrpCys: 0.0 ± 0.0
0.517TrpAsp: 0.517 ± 0.388
2.584TrpGlu: 2.584 ± 0.918
0.517TrpPhe: 0.517 ± 0.578
2.067TrpGly: 2.067 ± 1.398
0.517TrpHis: 0.517 ± 0.409
0.517TrpIle: 0.517 ± 0.578
0.517TrpLys: 0.517 ± 0.388
1.55TrpLeu: 1.55 ± 0.536
1.55TrpMet: 1.55 ± 0.922
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
1.034TrpGln: 1.034 ± 0.678
1.55TrpArg: 1.55 ± 0.935
1.034TrpSer: 1.034 ± 0.592
0.0TrpThr: 0.0 ± 0.0
0.517TrpVal: 0.517 ± 0.388
0.517TrpTrp: 0.517 ± 0.388
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.618TyrAla: 3.618 ± 0.867
1.55TyrCys: 1.55 ± 1.104
0.0TyrAsp: 0.0 ± 0.0
1.55TyrGlu: 1.55 ± 0.646
1.55TyrPhe: 1.55 ± 0.685
5.685TyrGly: 5.685 ± 1.411
2.067TyrHis: 2.067 ± 0.633
1.55TyrIle: 1.55 ± 0.536
1.55TyrLys: 1.55 ± 0.646
4.651TyrLeu: 4.651 ± 1.429
1.034TyrMet: 1.034 ± 0.776
0.0TyrAsn: 0.0 ± 0.0
2.067TyrPro: 2.067 ± 0.958
1.034TyrGln: 1.034 ± 0.819
3.101TyrArg: 3.101 ± 1.171
1.55TyrSer: 1.55 ± 0.51
2.067TyrThr: 2.067 ± 1.252
2.067TyrVal: 2.067 ± 1.252
0.0TyrTrp: 0.0 ± 0.0
2.067TyrTyr: 2.067 ± 0.416
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1936 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
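Protein-level bootstrapping can be sketched as follows. The page states only that the error comes from 100 bootstrap rounds at the protein level, so the details here are assumptions: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the standard deviation of those estimates is reported as the error. The function names are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(sequences, pair):
    """Frequency of one dipeptide, in per mille, pooled over the sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: the error is the standard deviation of the frequency
    across resamples; the database may use a different summary.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)
```

Under this scheme, a dipeptide that never occurs in any protein has a frequency of 0.0 in every resample and therefore an error of 0.0, which matches the "0.0 ± 0.0" entries in the table. With only 5 proteins, the resamples differ strongly from one another, which is one plausible reason the reported errors are large relative to the values.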

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski