Amino acid dipeptide frequency for Myocastor coypus polyomavirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with every amino acid equally likely, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
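As a rough illustration of how such frequencies and the comparison with the uniform expectation could be computed, the following Python sketch counts overlapping residue pairs within each protein and reports them in per mille. The function names, the toy sequences, and the choice to skip pairs containing non-standard letters are assumptions made for this example; the exact normalization used by Proteome-pI is not described on this page.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues (one-letter codes)

# Expected frequency in per mille if all 400 dipeptides were equally likely.
EXPECTED = 1000.0 / len(AMINO_ACIDS) ** 2  # 2.5


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: windows never span two proteins, and pairs containing a
    letter outside AMINO_ACIDS are skipped rather than mapped to Xaa.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            pair = seq[i:i + 2]
            if pair[0] in AMINO_ACIDS and pair[1] in AMINO_ACIDS:
                counts[pair] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(AMINO_ACIDS, repeat=2)}


def classify(freq):
    """Label a per mille frequency relative to the uniform expectation."""
    if freq > EXPECTED:
        return "over"
    if freq < EXPECTED:
        return "under"
    return "expected"


# Hypothetical toy input, not taken from the proteome described here.
toy_proteins = ["MKKLA", "AKK"]
freqs = dipeptide_per_mille(toy_proteins)
print(round(freqs["KK"], 3), classify(freqs["KK"]))
```

In the toy input, KK accounts for 2 of the 6 counted pairs, so it is labelled as overrepresented, while any dipeptide that never occurs is labelled as underrepresented.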

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions; for example, 4.31‰ corresponds to 0.00431 (0.431%).

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second; each cell is the frequency ± error in ‰.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 4.31 ± 1.776 | 0.0 ± 0.0 | 1.616 ± 0.837 | 2.155 ± 1.623 | 1.616 ± 0.539 | 5.388 ± 3.17 | 1.078 ± 0.778 | 2.694 ± 0.803 | 1.078 ± 0.452 | 8.621 ± 3.92 | 0.539 ± 0.442 | 1.078 ± 0.938 | 3.772 ± 1.495 | 2.694 ± 0.833 | 4.849 ± 1.421 | 1.078 ± 0.659 | 3.233 ± 1.487 | 4.31 ± 2.159 | 0.539 ± 0.385 | 2.155 ± 0.638 | 0.0 ± 0.0 |
| Cys | 2.155 ± 1.28 | 0.0 ± 0.0 | 0.539 ± 0.5 | 3.233 ± 1.63 | 0.539 ± 0.385 | 1.078 ± 0.452 | 0.0 ± 0.0 | 1.078 ± 0.555 | 4.31 ± 1.921 | 3.772 ± 1.952 | 0.539 ± 0.438 | 0.539 ± 0.385 | 1.078 ± 0.452 | 1.078 ± 0.771 | 1.078 ± 0.771 | 0.539 ± 0.385 | 0.0 ± 0.0 | 1.078 ± 0.555 | 0.0 ± 0.0 | 1.616 ± 1.501 | 0.0 ± 0.0 |
| Asp | 2.694 ± 0.353 | 1.078 ± 0.555 | 3.233 ± 1.405 | 1.616 ± 0.837 | 1.616 ± 0.815 | 4.849 ± 1.659 | 0.0 ± 0.0 | 5.388 ± 0.699 | 3.772 ± 1.439 | 2.155 ± 0.694 | 1.078 ± 0.665 | 0.539 ± 0.385 | 2.155 ± 1.28 | 2.155 ± 0.585 | 1.616 ± 0.985 | 2.694 ± 1.34 | 2.155 ± 0.787 | 1.616 ± 1.156 | 1.616 ± 0.656 | 1.616 ± 0.656 | 0.0 ± 0.0 |
| Glu | 4.849 ± 1.883 | 2.694 ± 0.989 | 3.772 ± 1.156 | 11.853 ± 3.241 | 1.616 ± 1.156 | 5.927 ± 2.184 | 2.694 ± 0.985 | 2.694 ± 0.833 | 6.466 ± 2.986 | 5.927 ± 1.243 | 1.616 ± 0.697 | 3.772 ± 1.378 | 2.155 ± 1.032 | 2.155 ± 0.742 | 2.694 ± 1.505 | 5.927 ± 2.757 | 3.233 ± 1.108 | 3.233 ± 1.8 | 1.078 ± 0.63 | 2.694 ± 0.619 | 0.0 ± 0.0 |
| Phe | 1.616 ± 0.697 | 1.078 ± 0.771 | 0.0 ± 0.0 | 2.155 ± 0.638 | 2.155 ± 0.973 | 2.694 ± 1.184 | 1.078 ± 0.502 | 0.539 ± 0.469 | 2.155 ± 1.032 | 4.31 ± 1.187 | 0.539 ± 0.385 | 1.078 ± 0.771 | 2.694 ± 0.985 | 2.155 ± 1.076 | 1.078 ± 0.778 | 1.078 ± 0.452 | 4.31 ± 0.826 | 3.233 ± 0.843 | 0.0 ± 0.0 | 0.539 ± 0.385 | 0.0 ± 0.0 |
| Gly | 3.233 ± 2.254 | 1.616 ± 1.156 | 3.233 ± 0.733 | 6.466 ± 1.689 | 1.078 ± 0.659 | 6.466 ± 1.591 | 1.616 ± 0.656 | 3.772 ± 1.183 | 3.233 ± 1.325 | 9.159 ± 2.114 | 1.078 ± 0.555 | 2.155 ± 0.593 | 5.927 ± 2.453 | 5.388 ± 1.581 | 1.616 ± 0.826 | 2.155 ± 0.526 | 5.388 ± 2.398 | 8.082 ± 1.134 | 1.616 ± 0.539 | 1.616 ± 0.539 | 0.0 ± 0.0 |
| His | 2.155 ± 0.742 | 0.539 ± 0.5 | 0.0 ± 0.0 | 1.078 ± 0.771 | 0.539 ± 0.469 | 1.078 ± 0.63 | 0.539 ± 0.385 | 0.0 ± 0.0 | 1.616 ± 0.513 | 3.772 ± 2.064 | 1.078 ± 0.555 | 0.539 ± 0.385 | 1.078 ± 0.555 | 0.539 ± 0.56 | 0.539 ± 0.385 | 0.539 ± 0.385 | 1.078 ± 0.778 | 1.616 ± 0.697 | 0.0 ± 0.0 | 3.233 ± 0.843 | 0.0 ± 0.0 |
| Ile | 1.616 ± 0.991 | 2.694 ± 0.989 | 2.155 ± 0.694 | 4.31 ± 1.745 | 2.155 ± 1.147 | 2.694 ± 1.728 | 0.0 ± 0.0 | 2.155 ± 1.5 | 3.233 ± 1.325 | 5.388 ± 0.905 | 2.155 ± 1.009 | 3.233 ± 0.531 | 3.233 ± 0.744 | 2.155 ± 1.764 | 1.078 ± 0.452 | 4.849 ± 1.883 | 2.694 ± 1.199 | 3.772 ± 1.334 | 1.078 ± 0.452 | 1.078 ± 0.555 | 0.0 ± 0.0 |
| Lys | 3.233 ± 1.405 | 1.616 ± 0.985 | 2.694 ± 0.985 | 1.616 ± 1.156 | 0.0 ± 0.0 | 3.772 ± 1.181 | 2.155 ± 1.009 | 2.155 ± 1.032 | 8.621 ± 1.334 | 4.849 ± 1.946 | 0.539 ± 0.385 | 1.616 ± 0.837 | 2.155 ± 0.844 | 2.155 ± 1.111 | 10.237 ± 0.64 | 4.31 ± 2.053 | 4.31 ± 1.613 | 4.31 ± 1.492 | 0.0 ± 0.0 | 3.233 ± 1.173 | 0.0 ± 0.0 |
| Leu | 3.772 ± 2.523 | 4.31 ± 1.699 | 6.466 ± 1.36 | 10.237 ± 1.702 | 6.466 ± 1.472 | 8.082 ± 2.819 | 2.155 ± 1.46 | 7.543 ± 0.644 | 3.233 ± 1.885 | 5.927 ± 2.797 | 5.927 ± 2.378 | 7.543 ± 0.652 | 3.772 ± 0.76 | 4.849 ± 0.514 | 5.927 ± 0.811 | 1.616 ± 0.697 | 4.31 ± 1.104 | 5.927 ± 1.794 | 1.616 ± 0.815 | 2.694 ± 0.619 | 0.0 ± 0.0 |
| Met | 2.694 ± 0.984 | 0.0 ± 0.0 | 1.616 ± 0.815 | 1.616 ± 0.539 | 1.078 ± 0.452 | 1.616 ± 0.455 | 2.155 ± 0.694 | 0.0 ± 0.0 | 1.078 ± 0.63 | 1.616 ± 0.656 | 0.539 ± 0.469 | 1.078 ± 0.555 | 1.078 ± 0.63 | 2.155 ± 1.147 | 1.078 ± 0.555 | 1.616 ± 0.539 | 1.078 ± 0.63 | 1.078 ± 0.771 | 1.616 ± 0.991 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Asn | 1.616 ± 0.656 | 1.078 ± 0.771 | 2.155 ± 0.904 | 1.616 ± 0.837 | 1.616 ± 0.656 | 1.078 ± 0.938 | 0.0 ± 0.0 | 3.233 ± 0.844 | 1.616 ± 1.156 | 5.388 ± 1.194 | 2.155 ± 1.339 | 2.155 ± 0.38 | 3.772 ± 1.514 | 0.0 ± 0.0 | 1.078 ± 0.645 | 1.616 ± 0.837 | 2.694 ± 0.833 | 2.694 ± 0.984 | 1.078 ± 0.778 | 1.616 ± 0.539 | 0.0 ± 0.0 |
| Pro | 1.616 ± 0.837 | 1.078 ± 0.555 | 4.31 ± 0.851 | 4.849 ± 1.284 | 1.616 ± 0.697 | 5.388 ± 1.77 | 0.539 ± 0.5 | 2.694 ± 0.764 | 5.388 ± 1.666 | 5.927 ± 1.098 | 0.539 ± 0.469 | 0.0 ± 0.0 | 4.849 ± 1.028 | 0.0 ± 0.0 | 3.233 ± 1.194 | 3.233 ± 1.199 | 7.004 ± 1.53 | 4.849 ± 2.485 | 0.539 ± 0.5 | 1.616 ± 0.826 | 0.0 ± 0.0 |
| Gln | 4.31 ± 1.197 | 1.616 ± 0.815 | 0.539 ± 0.385 | 4.31 ± 1.398 | 2.155 ± 0.526 | 3.772 ± 0.92 | 0.539 ± 0.5 | 4.849 ± 1.311 | 1.078 ± 0.555 | 3.233 ± 1.433 | 1.078 ± 0.452 | 1.078 ± 0.452 | 2.155 ± 0.844 | 3.233 ± 1.298 | 1.616 ± 0.826 | 3.233 ± 1.298 | 2.694 ± 1.715 | 1.616 ± 1.243 | 0.539 ± 0.5 | 1.078 ± 0.778 | 0.0 ± 0.0 |
| Arg | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.078 ± 0.555 | 4.31 ± 0.826 | 2.694 ± 1.386 | 3.772 ± 1.14 | 2.155 ± 0.593 | 1.616 ± 0.697 | 3.233 ± 1.078 | 3.772 ± 1.244 | 1.078 ± 0.63 | 3.233 ± 1.086 | 0.539 ± 0.385 | 3.233 ± 1.267 | 2.155 ± 1.28 | 5.388 ± 0.782 | 2.155 ± 0.787 | 5.388 ± 1.03 | 1.078 ± 0.778 | 4.31 ± 1.758 | 0.0 ± 0.0 |
| Ser | 5.388 ± 1.64 | 1.616 ± 0.837 | 1.616 ± 1.156 | 3.233 ± 1.63 | 3.772 ± 1.156 | 3.233 ± 0.845 | 1.616 ± 1.156 | 2.155 ± 1.291 | 4.31 ± 1.388 | 7.543 ± 0.595 | 0.539 ± 0.469 | 2.155 ± 0.38 | 2.694 ± 1.482 | 1.616 ± 0.697 | 1.616 ± 0.837 | 4.31 ± 0.982 | 2.155 ± 0.38 | 2.694 ± 0.547 | 0.539 ± 0.385 | 3.233 ± 1.621 | 0.0 ± 0.0 |
| Thr | 3.233 ± 1.514 | 1.078 ± 0.452 | 1.616 ± 0.985 | 5.388 ± 2.123 | 1.078 ± 0.555 | 3.772 ± 1.332 | 0.0 ± 0.0 | 3.233 ± 0.634 | 2.155 ± 0.844 | 3.772 ± 0.628 | 0.539 ± 0.385 | 3.233 ± 1.219 | 5.927 ± 2.178 | 3.772 ± 0.68 | 3.772 ± 1.347 | 4.31 ± 1.714 | 3.233 ± 1.674 | 3.772 ± 1.367 | 1.078 ± 0.778 | 3.772 ± 1.881 | 0.0 ± 0.0 |
| Val | 3.233 ± 0.866 | 0.539 ± 0.385 | 3.233 ± 0.845 | 4.31 ± 1.478 | 0.539 ± 0.56 | 2.694 ± 2.346 | 1.078 ± 0.452 | 2.155 ± 1.076 | 3.233 ± 1.031 | 10.776 ± 2.461 | 1.616 ± 0.985 | 2.155 ± 0.526 | 7.004 ± 0.901 | 4.31 ± 1.197 | 2.694 ± 1.534 | 5.388 ± 1.0 | 4.31 ± 1.552 | 2.694 ± 0.547 | 0.0 ± 0.0 | 2.694 ± 1.107 | 0.0 ± 0.0 |
| Trp | 1.078 ± 0.778 | 0.539 ± 0.385 | 0.0 ± 0.0 | 0.539 ± 0.469 | 1.078 ± 0.555 | 2.155 ± 0.936 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.616 ± 0.815 | 0.0 ± 0.0 | 1.078 ± 0.778 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.539 ± 0.469 | 1.616 ± 0.826 | 0.539 ± 0.5 | 0.539 ± 0.385 | 1.616 ± 0.786 | 0.0 ± 0.0 | 1.078 ± 0.452 | 0.0 ± 0.0 |
| Tyr | 0.539 ± 0.385 | 1.078 ± 1.001 | 3.772 ± 1.338 | 2.155 ± 0.585 | 1.078 ± 0.938 | 5.388 ± 2.313 | 2.155 ± 1.111 | 3.772 ± 1.14 | 2.155 ± 1.147 | 6.466 ± 1.85 | 0.0 ± 0.0 | 1.078 ± 0.778 | 3.233 ± 1.419 | 0.539 ± 0.385 | 1.616 ± 0.539 | 1.616 ± 0.837 | 2.155 ± 0.844 | 1.078 ± 0.452 | 0.0 ± 0.0 | 2.155 ± 1.556 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 5 proteins (1857 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
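One way to read "bootstrapping at the protein level" is sketched below in Python: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of those estimates is reported as the error. This is only an assumed procedure. The function names, the fixed seed, the toy sequences, and the use of the population standard deviation are all choices made for this example, not details confirmed by the page.

```python
import random
import statistics


def dipeptide_frequency(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples.

    Assumption: each round draws len(proteins) whole proteins with
    replacement; the error is the population standard deviation of the
    per-round frequencies.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_frequency(sample, dipeptide))
    return statistics.pstdev(estimates)


# Hypothetical toy input, not taken from the proteome described here.
toy_proteins = ["MKKLA", "AKK", "GGKKR"]
print(dipeptide_frequency(toy_proteins, "KK"), bootstrap_error(toy_proteins, "KK"))
```

With only a handful of proteins, as in the 5-protein proteome here, each resample can change the composition considerably, which is consistent with errors that are large relative to the frequencies themselves.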

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski