Amino acid dipeptide frequency for Murine polyomavirus (strain A2) (MPyV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 400 equally likely, each would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
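The counting and per-mille scaling described above can be sketched as follows. This is a minimal illustration, not the database's own code: the function names, the toy sequences, and the choice to count overlapping pairs within each protein (never across protein boundaries) are assumptions, and non-standard residues (Xaa) are not handled.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: pairs are counted within each protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(AMINO_ACIDS, repeat=2)}


def classify(freq_per_mille, expected=1000.0 / 400):
    """Label a dipeptide relative to the uniform expectation (2.5 per mille)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Hypothetical toy sequences, not taken from the MPyV proteome.
toy = ["MALLLG", "GGSLL"]
freqs = dipeptide_per_mille(toy)
print(freqs["LL"], classify(freqs["LL"]))
```

A value above 2.5‰ corresponds to a dipeptide that would be marked red in the table, and a value below it to one marked blue.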

For more information, see the sequence space article on Wikipedia.

Second residue (column order): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.204 ± 1.286
AlaCys: 0.434 ± 0.384
AlaAsp: 4.337 ± 1.847
AlaGlu: 3.469 ± 1.443
AlaPhe: 1.301 ± 0.655
AlaGly: 2.602 ± 0.572
AlaHis: 3.469 ± 1.181
AlaIle: 3.036 ± 2.036
AlaLys: 1.735 ± 1.153
AlaLeu: 9.974 ± 3.688
AlaMet: 0.434 ± 0.435
AlaAsn: 0.867 ± 0.577
AlaPro: 3.036 ± 1.171
AlaGln: 2.168 ± 0.69
AlaArg: 3.036 ± 0.412
AlaSer: 5.637 ± 1.217
AlaThr: 3.903 ± 1.032
AlaVal: 2.602 ± 0.71
AlaTrp: 0.434 ± 0.289
AlaTyr: 1.735 ± 0.887
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 2.168 ± 0.561
CysGlu: 1.301 ± 0.655
CysPhe: 1.301 ± 0.611
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 1.301 ± 0.611
CysLys: 2.168 ± 1.173
CysLeu: 5.204 ± 2.578
CysMet: 0.0 ± 0.0
CysAsn: 0.434 ± 0.289
CysPro: 0.867 ± 0.452
CysGln: 0.434 ± 0.289
CysArg: 1.301 ± 0.611
CysSer: 2.602 ± 1.044
CysThr: 1.301 ± 0.655
CysVal: 1.301 ± 0.655
CysTrp: 0.0 ± 0.0
CysTyr: 1.735 ± 0.668
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.036 ± 0.728
AspCys: 0.0 ± 0.0
AspAsp: 1.735 ± 1.155
AspGlu: 2.168 ± 0.521
AspPhe: 4.77 ± 1.862
AspGly: 3.903 ± 1.85
AspHis: 0.0 ± 0.0
AspIle: 4.337 ± 0.894
AspLys: 4.337 ± 1.659
AspLeu: 4.337 ± 0.858
AspMet: 0.867 ± 0.767
AspAsn: 0.434 ± 0.289
AspPro: 5.204 ± 1.48
AspGln: 2.168 ± 0.778
AspArg: 2.602 ± 1.126
AspSer: 1.301 ± 0.655
AspThr: 3.036 ± 0.816
AspVal: 3.036 ± 1.066
AspTrp: 3.036 ± 1.317
AspTyr: 1.301 ± 0.583
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.168 ± 1.036
GluCys: 3.036 ± 1.247
GluAsp: 3.469 ± 0.567
GluGlu: 7.806 ± 2.21
GluPhe: 1.301 ± 0.866
GluGly: 6.505 ± 2.124
GluHis: 0.0 ± 0.0
GluIle: 1.735 ± 0.741
GluLys: 1.735 ± 0.905
GluLeu: 6.071 ± 2.079
GluMet: 0.0 ± 0.0
GluAsn: 4.337 ± 1.447
GluPro: 2.602 ± 0.926
GluGln: 1.301 ± 0.624
GluArg: 2.602 ± 1.044
GluSer: 4.337 ± 0.899
GluThr: 2.168 ± 0.966
GluVal: 5.637 ± 1.508
GluTrp: 0.434 ± 0.289
GluTyr: 1.301 ± 0.624
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.036 ± 0.893
PheCys: 2.602 ± 1.222
PheAsp: 0.867 ± 0.454
PheGlu: 1.301 ± 0.866
PhePhe: 0.867 ± 0.577
PheGly: 4.337 ± 1.614
PheHis: 0.434 ± 0.289
PheIle: 1.301 ± 0.973
PheLys: 2.602 ± 0.726
PheLeu: 3.903 ± 0.918
PheMet: 0.434 ± 0.39
PheAsn: 2.168 ± 0.892
PhePro: 3.903 ± 1.298
PheGln: 1.735 ± 0.668
PheArg: 1.735 ± 1.155
PheSer: 0.867 ± 0.452
PheThr: 2.168 ± 1.173
PheVal: 1.301 ± 0.814
PheTrp: 0.0 ± 0.0
PheTyr: 0.867 ± 0.722
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.469 ± 2.461
GlyCys: 0.0 ± 0.0
GlyAsp: 4.337 ± 0.816
GlyGlu: 4.337 ± 1.664
GlyPhe: 3.036 ± 0.55
GlyGly: 10.408 ± 2.783
GlyHis: 2.168 ± 0.517
GlyIle: 1.735 ± 0.931
GlyLys: 1.735 ± 0.905
GlyLeu: 7.372 ± 2.879
GlyMet: 2.168 ± 0.748
GlyAsn: 2.602 ± 1.152
GlyPro: 3.036 ± 1.418
GlyGln: 1.735 ± 1.153
GlyArg: 3.036 ± 0.757
GlySer: 6.938 ± 1.866
GlyThr: 5.637 ± 1.707
GlyVal: 4.337 ± 1.293
GlyTrp: 1.735 ± 0.778
GlyTyr: 1.735 ± 0.57
GlyXaa: 0.0 ± 0.0
His
HisAla: 4.337 ± 1.026
HisCys: 0.0 ± 0.0
HisAsp: 1.301 ± 0.583
HisGlu: 0.0 ± 0.0
HisPhe: 0.867 ± 0.87
HisGly: 1.301 ± 0.792
HisHis: 0.0 ± 0.0
HisIle: 1.301 ± 0.772
HisLys: 0.434 ± 0.384
HisLeu: 0.867 ± 0.454
HisMet: 0.434 ± 0.289
HisAsn: 0.434 ± 0.468
HisPro: 3.036 ± 0.806
HisGln: 1.301 ± 0.583
HisArg: 3.469 ± 0.495
HisSer: 4.337 ± 1.396
HisThr: 1.301 ± 0.583
HisVal: 0.434 ± 0.384
HisTrp: 0.434 ± 0.289
HisTyr: 0.867 ± 0.454
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.735 ± 0.668
IleCys: 1.301 ± 0.624
IleAsp: 1.735 ± 0.945
IleGlu: 2.602 ± 1.815
IlePhe: 0.0 ± 0.0
IleGly: 0.434 ± 0.289
IleHis: 0.867 ± 0.454
IleIle: 0.867 ± 0.454
IleLys: 2.602 ± 0.7
IleLeu: 6.071 ± 2.818
IleMet: 1.301 ± 0.655
IleAsn: 1.735 ± 0.648
IlePro: 2.168 ± 0.918
IleGln: 2.168 ± 0.853
IleArg: 0.0 ± 0.0
IleSer: 3.036 ± 0.756
IleThr: 2.602 ± 1.127
IleVal: 0.0 ± 0.0
IleTrp: 1.301 ± 0.486
IleTyr: 1.301 ± 0.583
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.602 ± 1.309
LysCys: 3.469 ± 1.009
LysAsp: 3.469 ± 1.221
LysGlu: 4.337 ± 1.745
LysPhe: 1.301 ± 0.583
LysGly: 3.469 ± 1.3
LysHis: 1.735 ± 0.905
LysIle: 0.434 ± 0.384
LysLys: 3.036 ± 1.171
LysLeu: 3.903 ± 1.24
LysMet: 0.434 ± 0.289
LysAsn: 1.735 ± 0.904
LysPro: 1.301 ± 0.866
LysGln: 3.036 ± 1.247
LysArg: 3.903 ± 0.575
LysSer: 1.301 ± 0.788
LysThr: 5.204 ± 1.887
LysVal: 0.434 ± 0.289
LysTrp: 0.434 ± 0.289
LysTyr: 1.735 ± 0.518
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.637 ± 1.359
LeuCys: 2.168 ± 0.561
LeuAsp: 8.673 ± 1.206
LeuGlu: 7.806 ± 1.481
LeuPhe: 4.77 ± 1.459
LeuGly: 6.505 ± 1.652
LeuHis: 4.337 ± 1.142
LeuIle: 6.071 ± 1.066
LeuLys: 4.337 ± 1.655
LeuLeu: 17.78 ± 3.405
LeuMet: 3.036 ± 0.942
LeuAsn: 6.505 ± 0.882
LeuPro: 4.337 ± 0.712
LeuGln: 3.903 ± 1.276
LeuArg: 5.637 ± 2.65
LeuSer: 6.938 ± 2.057
LeuThr: 5.637 ± 1.508
LeuVal: 5.637 ± 1.186
LeuTrp: 2.602 ± 1.222
LeuTyr: 3.903 ± 0.836
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.168 ± 1.313
MetCys: 0.434 ± 0.289
MetAsp: 1.301 ± 0.611
MetGlu: 2.168 ± 0.785
MetPhe: 0.0 ± 0.0
MetGly: 3.903 ± 1.272
MetHis: 0.0 ± 0.0
MetIle: 0.434 ± 0.435
MetLys: 0.0 ± 0.0
MetLeu: 2.602 ± 0.406
MetMet: 0.0 ± 0.0
MetAsn: 1.735 ± 0.668
MetPro: 2.168 ± 1.625
MetGln: 4.337 ± 2.051
MetArg: 0.867 ± 0.87
MetSer: 0.434 ± 0.468
MetThr: 1.735 ± 0.717
MetVal: 2.168 ± 0.616
MetTrp: 0.434 ± 0.384
MetTyr: 0.434 ± 0.384
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.735 ± 0.57
AsnCys: 0.867 ± 0.577
AsnAsp: 0.434 ± 0.289
AsnGlu: 2.168 ± 0.813
AsnPhe: 0.434 ± 0.289
AsnGly: 2.602 ± 0.406
AsnHis: 0.434 ± 0.289
AsnIle: 0.867 ± 0.577
AsnLys: 2.602 ± 1.309
AsnLeu: 6.505 ± 2.303
AsnMet: 1.301 ± 0.8
AsnAsn: 1.301 ± 0.788
AsnPro: 4.337 ± 0.686
AsnGln: 0.867 ± 0.722
AsnArg: 2.168 ± 1.807
AsnSer: 2.168 ± 0.561
AsnThr: 2.602 ± 1.647
AsnVal: 2.602 ± 0.584
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.301 ± 0.788
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.637 ± 1.126
ProCys: 1.301 ± 0.655
ProAsp: 5.204 ± 0.948
ProGlu: 2.168 ± 0.778
ProPhe: 0.434 ± 0.289
ProGly: 3.036 ± 0.597
ProHis: 0.867 ± 0.454
ProIle: 2.168 ± 1.188
ProLys: 2.602 ± 0.926
ProLeu: 4.77 ± 0.297
ProMet: 2.602 ± 0.737
ProAsn: 0.434 ± 0.289
ProPro: 6.505 ± 1.533
ProGln: 4.337 ± 1.721
ProArg: 6.071 ± 1.418
ProSer: 2.602 ± 1.393
ProThr: 5.637 ± 1.345
ProVal: 3.469 ± 1.866
ProTrp: 1.301 ± 0.772
ProTyr: 0.867 ± 0.452
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.602 ± 0.584
GlnCys: 0.434 ± 0.289
GlnAsp: 1.735 ± 0.59
GlnGlu: 2.602 ± 0.584
GlnPhe: 2.602 ± 0.71
GlnGly: 3.036 ± 0.704
GlnHis: 1.735 ± 0.945
GlnIle: 1.301 ± 0.624
GlnLys: 1.735 ± 0.57
GlnLeu: 5.204 ± 1.245
GlnMet: 1.301 ± 0.792
GlnAsn: 0.0 ± 0.0
GlnPro: 1.735 ± 0.904
GlnGln: 3.903 ± 1.512
GlnArg: 4.77 ± 2.067
GlnSer: 4.77 ± 1.37
GlnThr: 2.168 ± 0.853
GlnVal: 3.903 ± 0.899
GlnTrp: 0.867 ± 0.684
GlnTyr: 0.867 ± 0.767
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.77 ± 2.83
ArgCys: 1.301 ± 0.611
ArgAsp: 2.602 ± 0.912
ArgGlu: 3.036 ± 0.561
ArgPhe: 2.168 ± 0.677
ArgGly: 2.168 ± 0.616
ArgHis: 0.867 ± 0.87
ArgIle: 1.301 ± 0.583
ArgLys: 3.036 ± 0.816
ArgLeu: 8.239 ± 2.152
ArgMet: 4.337 ± 1.914
ArgAsn: 2.168 ± 0.813
ArgPro: 1.735 ± 0.905
ArgGln: 3.036 ± 1.317
ArgArg: 6.071 ± 1.755
ArgSer: 3.036 ± 1.497
ArgThr: 3.036 ± 0.695
ArgVal: 3.903 ± 0.459
ArgTrp: 1.301 ± 0.792
ArgTyr: 3.469 ± 1.409
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.036 ± 1.584
SerCys: 3.036 ± 1.066
SerAsp: 3.903 ± 0.918
SerGlu: 3.469 ± 1.007
SerPhe: 2.168 ± 1.173
SerGly: 5.204 ± 1.162
SerHis: 3.036 ± 0.757
SerIle: 0.434 ± 0.289
SerLys: 2.602 ± 1.357
SerLeu: 9.54 ± 1.323
SerMet: 3.036 ± 0.728
SerAsn: 2.168 ± 0.517
SerPro: 3.903 ± 1.575
SerGln: 3.469 ± 1.371
SerArg: 3.903 ± 0.867
SerSer: 8.239 ± 1.3
SerThr: 3.903 ± 1.113
SerVal: 3.903 ± 0.459
SerTrp: 0.0 ± 0.0
SerTyr: 2.168 ± 1.121
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.903 ± 1.019
ThrCys: 1.735 ± 0.518
ThrAsp: 1.301 ± 0.655
ThrGlu: 4.337 ± 1.458
ThrPhe: 3.903 ± 1.347
ThrGly: 4.77 ± 0.83
ThrHis: 1.735 ± 0.739
ThrIle: 2.602 ± 0.971
ThrLys: 4.337 ± 1.687
ThrLeu: 3.903 ± 1.329
ThrMet: 1.735 ± 0.504
ThrAsn: 0.434 ± 0.384
ThrPro: 6.938 ± 1.221
ThrGln: 1.301 ± 0.792
ThrArg: 4.77 ± 0.548
ThrSer: 2.168 ± 0.778
ThrThr: 1.735 ± 0.655
ThrVal: 5.204 ± 2.182
ThrTrp: 2.168 ± 0.892
ThrTyr: 0.434 ± 0.435
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.168 ± 1.443
ValCys: 0.867 ± 0.577
ValAsp: 1.735 ± 0.783
ValGlu: 1.735 ± 1.153
ValPhe: 2.168 ± 0.813
ValGly: 1.735 ± 1.014
ValHis: 3.469 ± 0.495
ValIle: 1.735 ± 1.443
ValLys: 3.469 ± 1.009
ValLeu: 6.071 ± 1.971
ValMet: 0.867 ± 0.593
ValAsn: 3.469 ± 0.717
ValPro: 3.036 ± 0.756
ValGln: 2.168 ± 0.966
ValArg: 3.469 ± 1.245
ValSer: 5.204 ± 2.43
ValThr: 4.337 ± 1.643
ValVal: 5.204 ± 2.59
ValTrp: 1.301 ± 0.583
ValTyr: 3.903 ± 0.93
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.301 ± 0.583
TrpCys: 0.0 ± 0.0
TrpAsp: 0.434 ± 0.289
TrpGlu: 0.867 ± 0.424
TrpPhe: 0.867 ± 0.684
TrpGly: 3.469 ± 1.137
TrpHis: 0.434 ± 0.384
TrpIle: 0.867 ± 0.452
TrpLys: 0.434 ± 0.289
TrpLeu: 1.301 ± 0.817
TrpMet: 0.867 ± 0.722
TrpAsn: 2.168 ± 0.827
TrpPro: 0.0 ± 0.0
TrpGln: 0.867 ± 0.722
TrpArg: 1.301 ± 0.792
TrpSer: 1.301 ± 0.788
TrpThr: 0.0 ± 0.0
TrpVal: 1.301 ± 0.792
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.434 ± 0.289
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.867 ± 0.684
TyrCys: 0.434 ± 0.289
TyrAsp: 0.867 ± 0.452
TyrGlu: 0.867 ± 0.577
TyrPhe: 2.168 ± 0.892
TyrGly: 2.168 ± 0.966
TyrHis: 0.867 ± 0.452
TyrIle: 0.0 ± 0.0
TyrLys: 1.735 ± 0.518
TyrLeu: 3.036 ± 0.818
TyrMet: 1.735 ± 0.908
TyrAsn: 2.168 ± 1.265
TyrPro: 2.168 ± 1.162
TyrGln: 3.036 ± 0.561
TyrArg: 0.867 ± 0.722
TyrSer: 3.903 ± 0.745
TyrThr: 1.301 ± 0.413
TyrVal: 1.735 ± 1.014
TyrTrp: 0.434 ± 0.289
TyrTyr: 2.602 ± 0.623
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2307 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
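A protein-level bootstrap of this kind could look roughly like the sketch below. It is only an illustration under stated assumptions: whole proteins are resampled with replacement, the error is taken as the standard deviation of the resampled frequencies (the source does not say which statistic is reported), and the function names and toy sequences are hypothetical.

```python
import random
from collections import Counter
from statistics import stdev


def dipeptide_freq(sequences, dipeptide):
    """Frequency of one dipeptide in per mille over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread of the dipeptide frequency when whole proteins are resampled."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    # Assumption: the reported error is the standard deviation of the estimates.
    return stdev(estimates)


# Hypothetical toy proteome, not the six MPyV proteins.
toy_proteome = ["MALLLG", "GGSLL", "MKKLLA"]
print(bootstrap_error(toy_proteome, "LL"))
```

With only a handful of proteins, as here, the resampled proteomes differ considerably from one another, which is consistent with the relatively large errors in the table.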

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski