Amino acid dipepetide frequency for MW polyomavirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.621AlaAla: 5.621 ± 1.941
1.124AlaCys: 1.124 ± 0.779
3.373AlaAsp: 3.373 ± 1.305
1.124AlaGlu: 1.124 ± 1.261
2.811AlaPhe: 2.811 ± 1.137
1.686AlaGly: 1.686 ± 1.152
0.0AlaHis: 0.0 ± 0.0
3.373AlaIle: 3.373 ± 1.281
2.811AlaLys: 2.811 ± 1.404
6.745AlaLeu: 6.745 ± 3.097
1.124AlaMet: 1.124 ± 0.802
0.562AlaAsn: 0.562 ± 0.401
3.373AlaPro: 3.373 ± 1.305
2.811AlaGln: 2.811 ± 1.137
6.183AlaArg: 6.183 ± 3.234
4.497AlaSer: 4.497 ± 1.561
1.124AlaThr: 1.124 ± 0.53
6.183AlaVal: 6.183 ± 2.903
2.248AlaTrp: 2.248 ± 0.768
0.562AlaTyr: 0.562 ± 0.401
0.0AlaXaa: 0.0 ± 0.0
Cys
1.124CysAla: 1.124 ± 0.779
1.124CysCys: 1.124 ± 0.881
0.0CysAsp: 0.0 ± 0.0
3.373CysGlu: 3.373 ± 1.544
1.124CysPhe: 1.124 ± 1.665
1.686CysGly: 1.686 ± 0.723
0.0CysHis: 0.0 ± 0.0
0.562CysIle: 0.562 ± 0.536
1.686CysLys: 1.686 ± 0.772
2.811CysLeu: 2.811 ± 2.322
0.0CysMet: 0.0 ± 0.0
1.686CysAsn: 1.686 ± 0.772
2.248CysPro: 2.248 ± 1.498
1.124CysGln: 1.124 ± 0.802
0.562CysArg: 0.562 ± 0.536
1.686CysSer: 1.686 ± 0.772
2.248CysThr: 2.248 ± 1.06
2.248CysVal: 2.248 ± 0.768
0.562CysTrp: 0.562 ± 0.401
1.124CysTyr: 1.124 ± 0.779
0.0CysXaa: 0.0 ± 0.0
Asp
2.248AspAla: 2.248 ± 0.98
0.562AspCys: 0.562 ± 0.401
2.248AspAsp: 2.248 ± 1.151
3.935AspGlu: 3.935 ± 2.236
1.686AspPhe: 1.686 ± 1.203
5.059AspGly: 5.059 ± 2.247
0.562AspHis: 0.562 ± 0.401
3.935AspIle: 3.935 ± 0.802
2.811AspLys: 2.811 ± 1.653
3.935AspLeu: 3.935 ± 1.292
2.248AspMet: 2.248 ± 1.102
3.935AspAsn: 3.935 ± 1.144
3.373AspPro: 3.373 ± 1.975
0.562AspGln: 0.562 ± 0.401
2.248AspArg: 2.248 ± 1.006
2.811AspSer: 2.811 ± 1.086
2.248AspThr: 2.248 ± 1.06
5.059AspVal: 5.059 ± 1.301
1.686AspTrp: 1.686 ± 0.802
2.811AspTyr: 2.811 ± 1.685
0.0AspXaa: 0.0 ± 0.0
Glu
6.183GluAla: 6.183 ± 3.114
1.124GluCys: 1.124 ± 0.779
4.497GluAsp: 4.497 ± 2.019
6.745GluGlu: 6.745 ± 2.303
3.373GluPhe: 3.373 ± 1.834
5.059GluGly: 5.059 ± 1.141
2.248GluHis: 2.248 ± 0.866
2.248GluIle: 2.248 ± 1.604
7.307GluLys: 7.307 ± 3.279
4.497GluLeu: 4.497 ± 2.0
1.124GluMet: 1.124 ± 0.926
4.497GluAsn: 4.497 ± 2.349
1.124GluPro: 1.124 ± 0.53
1.686GluGln: 1.686 ± 0.802
0.0GluArg: 0.0 ± 0.0
3.373GluSer: 3.373 ± 1.229
3.373GluThr: 3.373 ± 1.589
7.307GluVal: 7.307 ± 1.48
0.562GluTrp: 0.562 ± 0.401
1.686GluTyr: 1.686 ± 0.802
0.0GluXaa: 0.0 ± 0.0
Phe
2.811PheAla: 2.811 ± 1.262
1.124PheCys: 1.124 ± 0.779
3.373PheAsp: 3.373 ± 1.143
4.497PheGlu: 4.497 ± 1.518
2.811PhePhe: 2.811 ± 1.73
2.248PheGly: 2.248 ± 1.559
1.686PheHis: 1.686 ± 0.772
3.373PheIle: 3.373 ± 1.535
3.935PheLys: 3.935 ± 1.733
3.373PheLeu: 3.373 ± 1.864
0.0PheMet: 0.0 ± 0.0
2.248PheAsn: 2.248 ± 0.866
3.935PhePro: 3.935 ± 1.13
2.248PheGln: 2.248 ± 1.638
1.686PheArg: 1.686 ± 0.802
2.811PheSer: 2.811 ± 0.928
0.0PheThr: 0.0 ± 0.0
0.562PheVal: 0.562 ± 0.401
1.686PheTrp: 1.686 ± 1.209
0.562PheTyr: 0.562 ± 0.401
0.0PheXaa: 0.0 ± 0.0
Gly
3.373GlyAla: 3.373 ± 2.701
2.811GlyCys: 2.811 ± 1.509
5.621GlyAsp: 5.621 ± 1.518
9.556GlyGlu: 9.556 ± 4.177
0.562GlyPhe: 0.562 ± 0.631
5.059GlyGly: 5.059 ± 1.392
0.0GlyHis: 0.0 ± 0.0
3.373GlyIle: 3.373 ± 0.897
1.686GlyLys: 1.686 ± 1.203
5.621GlyLeu: 5.621 ± 2.685
2.811GlyMet: 2.811 ± 1.508
3.373GlyAsn: 3.373 ± 0.519
4.497GlyPro: 4.497 ± 0.899
2.811GlyGln: 2.811 ± 0.956
0.0GlyArg: 0.0 ± 0.0
1.686GlySer: 1.686 ± 0.605
4.497GlyThr: 4.497 ± 3.614
3.935GlyVal: 3.935 ± 2.48
1.686GlyTrp: 1.686 ± 0.802
3.373GlyTyr: 3.373 ± 1.762
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
1.124HisAsp: 1.124 ± 0.802
1.124HisGlu: 1.124 ± 0.802
2.248HisPhe: 2.248 ± 0.65
0.562HisGly: 0.562 ± 0.401
0.562HisHis: 0.562 ± 0.401
0.562HisIle: 0.562 ± 0.832
0.562HisLys: 0.562 ± 0.832
2.248HisLeu: 2.248 ± 0.866
0.562HisMet: 0.562 ± 0.536
1.686HisAsn: 1.686 ± 1.209
1.686HisPro: 1.686 ± 0.917
0.0HisGln: 0.0 ± 0.0
1.686HisArg: 1.686 ± 0.723
0.0HisSer: 0.0 ± 0.0
0.562HisThr: 0.562 ± 0.536
0.562HisVal: 0.562 ± 0.401
0.0HisTrp: 0.0 ± 0.0
0.562HisTyr: 0.562 ± 0.401
0.0HisXaa: 0.0 ± 0.0
Ile
1.124IleAla: 1.124 ± 0.589
3.373IleCys: 3.373 ± 1.863
1.686IleAsp: 1.686 ± 0.987
6.745IleGlu: 6.745 ± 2.85
2.811IlePhe: 2.811 ± 1.137
1.124IleGly: 1.124 ± 0.53
0.562IleHis: 0.562 ± 0.401
3.373IleIle: 3.373 ± 0.738
2.811IleLys: 2.811 ± 1.48
7.307IleLeu: 7.307 ± 1.17
0.562IleMet: 0.562 ± 0.832
2.811IleAsn: 2.811 ± 1.48
4.497IlePro: 4.497 ± 0.899
3.373IleGln: 3.373 ± 2.701
0.562IleArg: 0.562 ± 0.401
2.248IleSer: 2.248 ± 2.522
2.811IleThr: 2.811 ± 0.88
2.248IleVal: 2.248 ± 0.98
0.562IleTrp: 0.562 ± 0.832
0.562IleTyr: 0.562 ± 0.832
0.0IleXaa: 0.0 ± 0.0
Lys
2.248LysAla: 2.248 ± 1.11
1.686LysCys: 1.686 ± 0.723
2.248LysAsp: 2.248 ± 1.604
2.811LysGlu: 2.811 ± 0.928
1.124LysPhe: 1.124 ± 0.802
5.059LysGly: 5.059 ± 1.932
2.248LysHis: 2.248 ± 1.182
1.124LysIle: 1.124 ± 0.881
3.373LysLys: 3.373 ± 1.3
6.183LysLeu: 6.183 ± 2.725
2.811LysMet: 2.811 ± 1.653
2.248LysAsn: 2.248 ± 1.11
0.562LysPro: 0.562 ± 0.536
2.811LysGln: 2.811 ± 2.272
5.059LysArg: 5.059 ± 1.13
0.562LysSer: 0.562 ± 0.401
6.183LysThr: 6.183 ± 2.791
3.373LysVal: 3.373 ± 1.065
0.562LysTrp: 0.562 ± 0.536
2.248LysTyr: 2.248 ± 1.182
0.0LysXaa: 0.0 ± 0.0
Leu
3.373LeuAla: 3.373 ± 1.065
2.811LeuCys: 2.811 ± 0.99
4.497LeuAsp: 4.497 ± 1.549
4.497LeuGlu: 4.497 ± 1.733
3.935LeuPhe: 3.935 ± 2.236
5.059LeuGly: 5.059 ± 3.498
2.248LeuHis: 2.248 ± 0.768
7.87LeuIle: 7.87 ± 2.821
5.059LeuLys: 5.059 ± 1.726
17.426LeuLeu: 17.426 ± 4.647
3.373LeuMet: 3.373 ± 1.834
5.621LeuAsn: 5.621 ± 1.332
3.373LeuPro: 3.373 ± 2.022
5.059LeuGln: 5.059 ± 2.349
6.183LeuArg: 6.183 ± 1.051
10.118LeuSer: 10.118 ± 2.374
2.811LeuThr: 2.811 ± 0.88
5.059LeuVal: 5.059 ± 2.218
1.124LeuTrp: 1.124 ± 1.665
2.248LeuTyr: 2.248 ± 1.06
0.0LeuXaa: 0.0 ± 0.0
Met
2.811MetAla: 2.811 ± 0.928
0.562MetCys: 0.562 ± 0.401
2.248MetAsp: 2.248 ± 1.006
0.562MetGlu: 0.562 ± 0.401
1.124MetPhe: 1.124 ± 0.802
2.248MetGly: 2.248 ± 0.685
0.562MetHis: 0.562 ± 0.832
0.562MetIle: 0.562 ± 0.401
1.124MetLys: 1.124 ± 0.779
1.686MetLeu: 1.686 ± 0.802
0.0MetMet: 0.0 ± 0.0
0.562MetAsn: 0.562 ± 0.401
2.248MetPro: 2.248 ± 1.498
1.124MetGln: 1.124 ± 0.779
0.0MetArg: 0.0 ± 0.0
0.562MetSer: 0.562 ± 0.536
1.686MetThr: 1.686 ± 0.987
0.562MetVal: 0.562 ± 0.401
0.562MetTrp: 0.562 ± 0.536
2.811MetTyr: 2.811 ± 0.928
0.0MetXaa: 0.0 ± 0.0
Asn
5.621AsnAla: 5.621 ± 3.198
0.562AsnCys: 0.562 ± 0.401
2.811AsnAsp: 2.811 ± 0.956
3.373AsnGlu: 3.373 ± 0.738
1.686AsnPhe: 1.686 ± 0.786
0.0AsnGly: 0.0 ± 0.0
0.0AsnHis: 0.0 ± 0.0
4.497AsnIle: 4.497 ± 1.152
2.811AsnLys: 2.811 ± 1.491
3.935AsnLeu: 3.935 ± 1.099
0.562AsnMet: 0.562 ± 0.631
0.0AsnAsn: 0.0 ± 0.0
1.686AsnPro: 1.686 ± 0.772
1.686AsnGln: 1.686 ± 0.987
0.562AsnArg: 0.562 ± 0.401
7.307AsnSer: 7.307 ± 2.387
2.811AsnThr: 2.811 ± 0.88
7.307AsnVal: 7.307 ± 3.105
0.562AsnTrp: 0.562 ± 0.401
1.686AsnTyr: 1.686 ± 0.802
0.0AsnXaa: 0.0 ± 0.0
Pro
3.373ProAla: 3.373 ± 1.928
1.124ProCys: 1.124 ± 1.072
6.183ProAsp: 6.183 ± 1.52
1.686ProGlu: 1.686 ± 1.203
2.811ProPhe: 2.811 ± 1.137
8.432ProGly: 8.432 ± 2.792
0.562ProHis: 0.562 ± 0.401
1.124ProIle: 1.124 ± 0.53
2.248ProLys: 2.248 ± 1.604
5.621ProLeu: 5.621 ± 1.911
0.562ProMet: 0.562 ± 0.536
0.562ProAsn: 0.562 ± 0.536
4.497ProPro: 4.497 ± 1.029
1.686ProGln: 1.686 ± 1.35
2.248ProArg: 2.248 ± 1.06
3.935ProSer: 3.935 ± 1.144
4.497ProThr: 4.497 ± 2.308
5.621ProVal: 5.621 ± 4.043
0.0ProTrp: 0.0 ± 0.0
0.562ProTyr: 0.562 ± 0.536
0.0ProXaa: 0.0 ± 0.0
Gln
2.811GlnAla: 2.811 ± 1.031
1.124GlnCys: 1.124 ± 0.779
1.124GlnAsp: 1.124 ± 0.53
1.124GlnGlu: 1.124 ± 0.779
0.562GlnPhe: 0.562 ± 0.401
1.686GlnGly: 1.686 ± 0.896
0.562GlnHis: 0.562 ± 0.832
2.248GlnIle: 2.248 ± 0.685
2.811GlnLys: 2.811 ± 0.928
4.497GlnLeu: 4.497 ± 2.584
0.562GlnMet: 0.562 ± 0.536
3.373GlnAsn: 3.373 ± 1.604
2.811GlnPro: 2.811 ± 1.459
3.373GlnGln: 3.373 ± 0.79
2.248GlnArg: 2.248 ± 0.65
1.686GlnSer: 1.686 ± 1.203
4.497GlnThr: 4.497 ± 2.349
3.373GlnVal: 3.373 ± 1.793
1.686GlnTrp: 1.686 ± 0.802
2.248GlnTyr: 2.248 ± 0.768
0.0GlnXaa: 0.0 ± 0.0
Arg
2.248ArgAla: 2.248 ± 1.151
0.562ArgCys: 0.562 ± 0.536
2.811ArgAsp: 2.811 ± 0.99
2.248ArgGlu: 2.248 ± 1.182
1.686ArgPhe: 1.686 ± 0.802
2.811ArgGly: 2.811 ± 0.88
2.811ArgHis: 2.811 ± 1.234
1.124ArgIle: 1.124 ± 0.53
1.124ArgLys: 1.124 ± 0.53
4.497ArgLeu: 4.497 ± 2.001
2.248ArgMet: 2.248 ± 1.415
3.935ArgAsn: 3.935 ± 0.532
2.811ArgPro: 2.811 ± 1.086
5.059ArgGln: 5.059 ± 2.414
3.373ArgArg: 3.373 ± 1.3
1.686ArgSer: 1.686 ± 0.896
0.0ArgThr: 0.0 ± 0.0
1.686ArgVal: 1.686 ± 0.917
1.124ArgTrp: 1.124 ± 0.926
2.248ArgTyr: 2.248 ± 1.06
0.0ArgXaa: 0.0 ± 0.0
Ser
1.686SerAla: 1.686 ± 0.786
2.811SerCys: 2.811 ± 1.441
2.248SerAsp: 2.248 ± 1.006
3.935SerGlu: 3.935 ± 2.598
3.935SerPhe: 3.935 ± 1.437
6.745SerGly: 6.745 ± 2.004
0.0SerHis: 0.0 ± 0.0
1.686SerIle: 1.686 ± 0.896
3.373SerLys: 3.373 ± 1.3
8.994SerLeu: 8.994 ± 1.377
0.0SerMet: 0.0 ± 0.0
4.497SerAsn: 4.497 ± 1.706
0.0SerPro: 0.0 ± 0.0
3.373SerGln: 3.373 ± 1.833
3.373SerArg: 3.373 ± 0.852
3.373SerSer: 3.373 ± 1.163
3.373SerThr: 3.373 ± 1.377
3.935SerVal: 3.935 ± 2.76
0.0SerTrp: 0.0 ± 0.0
0.562SerTyr: 0.562 ± 0.631
0.0SerXaa: 0.0 ± 0.0
Thr
4.497ThrAla: 4.497 ± 1.245
1.124ThrCys: 1.124 ± 0.53
2.248ThrAsp: 2.248 ± 1.06
2.248ThrGlu: 2.248 ± 1.498
1.124ThrPhe: 1.124 ± 0.997
1.686ThrGly: 1.686 ± 1.16
0.0ThrHis: 0.0 ± 0.0
4.497ThrIle: 4.497 ± 1.091
2.248ThrLys: 2.248 ± 2.143
3.935ThrLeu: 3.935 ± 1.296
1.124ThrMet: 1.124 ± 0.53
2.248ThrAsn: 2.248 ± 1.498
8.994ThrPro: 8.994 ± 1.999
1.686ThrGln: 1.686 ± 0.605
3.935ThrArg: 3.935 ± 1.144
1.686ThrSer: 1.686 ± 0.989
4.497ThrThr: 4.497 ± 1.78
6.183ThrVal: 6.183 ± 3.177
0.0ThrTrp: 0.0 ± 0.0
1.686ThrTyr: 1.686 ± 0.987
0.0ThrXaa: 0.0 ± 0.0
Val
3.373ValAla: 3.373 ± 1.065
2.248ValCys: 2.248 ± 1.633
3.935ValAsp: 3.935 ± 1.358
5.059ValGlu: 5.059 ± 1.125
3.935ValPhe: 3.935 ± 1.513
3.935ValGly: 3.935 ± 1.823
0.0ValHis: 0.0 ± 0.0
3.935ValIle: 3.935 ± 2.184
3.935ValLys: 3.935 ± 1.296
7.87ValLeu: 7.87 ± 0.832
0.0ValMet: 0.0 ± 0.0
4.497ValAsn: 4.497 ± 1.107
3.373ValPro: 3.373 ± 1.589
3.373ValGln: 3.373 ± 0.74
2.811ValArg: 2.811 ± 0.759
5.059ValSer: 5.059 ± 1.301
5.621ValThr: 5.621 ± 2.688
2.248ValVal: 2.248 ± 0.768
0.562ValTrp: 0.562 ± 0.832
4.497ValTyr: 4.497 ± 1.863
0.0ValXaa: 0.0 ± 0.0
Trp
1.124TrpAla: 1.124 ± 0.926
0.0TrpCys: 0.0 ± 0.0
1.124TrpAsp: 1.124 ± 0.802
2.248TrpGlu: 2.248 ± 0.65
0.562TrpPhe: 0.562 ± 0.832
3.373TrpGly: 3.373 ± 1.455
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
1.124TrpLys: 1.124 ± 0.779
0.562TrpLeu: 0.562 ± 0.401
1.124TrpMet: 1.124 ± 0.926
0.0TrpAsn: 0.0 ± 0.0
0.562TrpPro: 0.562 ± 0.832
1.124TrpGln: 1.124 ± 0.802
1.124TrpArg: 1.124 ± 0.926
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
1.124TrpVal: 1.124 ± 0.779
1.686TrpTrp: 1.686 ± 0.917
0.562TrpTyr: 0.562 ± 0.536
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.686TyrAla: 1.686 ± 0.802
1.124TyrCys: 1.124 ± 0.53
0.562TyrAsp: 0.562 ± 0.536
1.124TyrGlu: 1.124 ± 0.802
5.059TyrPhe: 5.059 ± 1.693
2.811TyrGly: 2.811 ± 0.88
1.686TyrHis: 1.686 ± 0.802
1.686TyrIle: 1.686 ± 1.209
1.686TyrLys: 1.686 ± 0.917
0.0TyrLeu: 0.0 ± 0.0
2.248TyrMet: 2.248 ± 1.11
1.124TyrAsn: 1.124 ± 0.926
1.686TyrPro: 1.686 ± 1.608
0.0TyrGln: 0.0 ± 0.0
2.248TyrArg: 2.248 ± 1.0
2.811TyrSer: 2.811 ± 0.95
2.811TyrThr: 2.811 ± 1.485
1.686TyrVal: 1.686 ± 0.772
0.562TyrTrp: 0.562 ± 0.401
3.935TyrTyr: 3.935 ± 1.513
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1780 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski