Amino acid dipepetide frequency for Murine polyomavirus (strain BG) (MPyV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.772AlaAla: 4.772 ± 1.15
0.434AlaCys: 0.434 ± 0.389
4.772AlaAsp: 4.772 ± 1.899
3.037AlaGlu: 3.037 ± 1.428
1.302AlaPhe: 1.302 ± 0.612
2.169AlaGly: 2.169 ± 0.781
4.338AlaHis: 4.338 ± 1.936
3.037AlaIle: 3.037 ± 2.072
1.735AlaLys: 1.735 ± 1.079
9.978AlaLeu: 9.978 ± 3.606
0.434AlaMet: 0.434 ± 0.37
0.868AlaAsn: 0.868 ± 0.654
3.471AlaPro: 3.471 ± 1.356
1.302AlaGln: 1.302 ± 0.981
3.037AlaArg: 3.037 ± 0.404
4.338AlaSer: 4.338 ± 0.863
3.905AlaThr: 3.905 ± 0.915
3.037AlaVal: 3.037 ± 0.764
0.434AlaTrp: 0.434 ± 0.327
1.735AlaTyr: 1.735 ± 0.787
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
1.735CysAsp: 1.735 ± 0.493
1.735CysGlu: 1.735 ± 0.898
1.302CysPhe: 1.302 ± 0.55
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
1.302CysIle: 1.302 ± 0.55
2.169CysLys: 2.169 ± 1.205
4.772CysLeu: 4.772 ± 2.38
0.0CysMet: 0.0 ± 0.0
0.434CysAsn: 0.434 ± 0.327
0.868CysPro: 0.868 ± 0.395
0.434CysGln: 0.434 ± 0.327
0.868CysArg: 0.868 ± 0.577
3.037CysSer: 3.037 ± 1.522
1.302CysThr: 1.302 ± 0.612
1.302CysVal: 1.302 ± 0.612
0.0CysTrp: 0.0 ± 0.0
1.735CysTyr: 1.735 ± 0.697
0.0CysXaa: 0.0 ± 0.0
Asp
3.037AspAla: 3.037 ± 0.745
0.0AspCys: 0.0 ± 0.0
1.735AspAsp: 1.735 ± 1.308
2.169AspGlu: 2.169 ± 0.584
4.772AspPhe: 4.772 ± 2.155
4.772AspGly: 4.772 ± 1.467
0.0AspHis: 0.0 ± 0.0
3.905AspIle: 3.905 ± 0.986
4.338AspLys: 4.338 ± 1.497
4.338AspLeu: 4.338 ± 1.001
0.868AspMet: 0.868 ± 0.779
0.434AspAsn: 0.434 ± 0.327
5.206AspPro: 5.206 ± 1.438
2.169AspGln: 2.169 ± 0.703
2.603AspArg: 2.603 ± 0.998
1.735AspSer: 1.735 ± 0.79
3.471AspThr: 3.471 ± 0.999
3.471AspVal: 3.471 ± 0.985
3.037AspTrp: 3.037 ± 1.393
1.735AspTyr: 1.735 ± 0.63
0.0AspXaa: 0.0 ± 0.0
Glu
2.169GluAla: 2.169 ± 1.132
3.037GluCys: 3.037 ± 1.212
3.905GluAsp: 3.905 ± 0.439
8.243GluGlu: 8.243 ± 2.166
1.302GluPhe: 1.302 ± 0.981
6.508GluGly: 6.508 ± 2.192
0.0GluHis: 0.0 ± 0.0
1.735GluIle: 1.735 ± 0.873
1.302GluLys: 1.302 ± 0.981
6.074GluLeu: 6.074 ± 1.796
0.0GluMet: 0.0 ± 0.0
4.772GluAsn: 4.772 ± 1.416
2.603GluPro: 2.603 ± 0.858
1.302GluGln: 1.302 ± 0.646
2.603GluArg: 2.603 ± 1.222
4.338GluSer: 4.338 ± 0.848
2.169GluThr: 2.169 ± 0.984
5.64GluVal: 5.64 ± 1.42
0.434GluTrp: 0.434 ± 0.327
1.302GluTyr: 1.302 ± 0.646
0.0GluXaa: 0.0 ± 0.0
Phe
3.037PheAla: 3.037 ± 0.865
2.603PheCys: 2.603 ± 1.1
0.868PheAsp: 0.868 ± 0.413
1.302PheGlu: 1.302 ± 0.981
0.868PhePhe: 0.868 ± 0.654
4.338PheGly: 4.338 ± 1.464
0.434PheHis: 0.434 ± 0.327
1.302PheIle: 1.302 ± 0.851
2.603PheLys: 2.603 ± 0.786
3.905PheLeu: 3.905 ± 0.927
0.434PheMet: 0.434 ± 0.326
2.169PheAsn: 2.169 ± 0.901
3.037PhePro: 3.037 ± 0.865
1.735PheGln: 1.735 ± 0.697
1.735PheArg: 1.735 ± 1.308
0.434PheSer: 0.434 ± 0.327
2.169PheThr: 2.169 ± 1.205
1.302PheVal: 1.302 ± 0.755
0.0PheTrp: 0.0 ± 0.0
0.868PheTyr: 0.868 ± 0.801
0.0PheXaa: 0.0 ± 0.0
Gly
4.338GlyAla: 4.338 ± 3.199
0.0GlyCys: 0.0 ± 0.0
4.338GlyAsp: 4.338 ± 0.687
3.905GlyGlu: 3.905 ± 1.829
2.603GlyPhe: 2.603 ± 0.362
10.846GlyGly: 10.846 ± 2.313
2.169GlyHis: 2.169 ± 0.521
2.169GlyIle: 2.169 ± 1.132
1.735GlyLys: 1.735 ± 0.898
8.243GlyLeu: 8.243 ± 3.084
2.169GlyMet: 2.169 ± 0.693
2.169GlyAsn: 2.169 ± 1.065
3.037GlyPro: 3.037 ± 1.548
1.735GlyGln: 1.735 ± 1.079
3.037GlyArg: 3.037 ± 0.675
6.941GlySer: 6.941 ± 1.873
5.64GlyThr: 5.64 ± 1.631
4.772GlyVal: 4.772 ± 1.194
1.735GlyTrp: 1.735 ± 0.85
1.735GlyTyr: 1.735 ± 0.63
0.0GlyXaa: 0.0 ± 0.0
His
4.338HisAla: 4.338 ± 0.992
0.0HisCys: 0.0 ± 0.0
1.302HisAsp: 1.302 ± 0.642
0.0HisGlu: 0.0 ± 0.0
0.868HisPhe: 0.868 ± 0.74
1.302HisGly: 1.302 ± 0.89
0.434HisHis: 0.434 ± 0.389
1.302HisIle: 1.302 ± 0.815
0.0HisLys: 0.0 ± 0.0
0.868HisLeu: 0.868 ± 0.413
0.434HisMet: 0.434 ± 0.327
0.434HisAsn: 0.434 ± 0.43
3.037HisPro: 3.037 ± 0.849
1.302HisGln: 1.302 ± 0.642
3.037HisArg: 3.037 ± 0.675
4.338HisSer: 4.338 ± 1.173
1.302HisThr: 1.302 ± 0.642
0.434HisVal: 0.434 ± 0.389
0.868HisTrp: 0.868 ± 0.395
0.868HisTyr: 0.868 ± 0.413
0.0HisXaa: 0.0 ± 0.0
Ile
1.302IleAla: 1.302 ± 0.55
1.302IleCys: 1.302 ± 0.646
1.735IleAsp: 1.735 ± 0.92
2.603IleGlu: 2.603 ± 1.954
0.0IlePhe: 0.0 ± 0.0
0.434IleGly: 0.434 ± 0.327
0.868IleHis: 0.868 ± 0.413
0.868IleIle: 0.868 ± 0.413
2.603IleLys: 2.603 ± 0.791
6.074IleLeu: 6.074 ± 2.37
1.302IleMet: 1.302 ± 0.612
1.735IleAsn: 1.735 ± 0.678
1.735IlePro: 1.735 ± 0.98
2.169IleGln: 2.169 ± 0.874
0.0IleArg: 0.0 ± 0.0
3.471IleSer: 3.471 ± 1.048
2.603IleThr: 2.603 ± 1.151
0.0IleVal: 0.0 ± 0.0
1.302IleTrp: 1.302 ± 0.455
1.302IleTyr: 1.302 ± 0.642
0.0IleXaa: 0.0 ± 0.0
Lys
2.603LysAla: 2.603 ± 1.224
3.471LysCys: 3.471 ± 1.03
3.471LysAsp: 3.471 ± 1.252
4.338LysGlu: 4.338 ± 1.983
1.302LysPhe: 1.302 ± 0.642
3.037LysGly: 3.037 ± 0.953
1.302LysHis: 1.302 ± 0.981
0.434LysIle: 0.434 ± 0.389
3.037LysLys: 3.037 ± 1.262
4.338LysLeu: 4.338 ± 1.348
0.434LysMet: 0.434 ± 0.327
1.735LysAsn: 1.735 ± 0.79
1.735LysPro: 1.735 ± 0.898
3.037LysGln: 3.037 ± 1.212
3.905LysArg: 3.905 ± 0.439
0.868LysSer: 0.868 ± 0.395
4.772LysThr: 4.772 ± 1.502
0.434LysVal: 0.434 ± 0.327
0.434LysTrp: 0.434 ± 0.327
1.735LysTyr: 1.735 ± 0.493
0.0LysXaa: 0.0 ± 0.0
Leu
5.64LeuAla: 5.64 ± 1.405
2.169LeuCys: 2.169 ± 0.568
9.111LeuAsp: 9.111 ± 1.134
7.375LeuGlu: 7.375 ± 1.538
4.772LeuPhe: 4.772 ± 1.549
6.508LeuGly: 6.508 ± 1.443
4.338LeuHis: 4.338 ± 0.93
6.074LeuIle: 6.074 ± 1.031
4.338LeuLys: 4.338 ± 1.879
17.787LeuLeu: 17.787 ± 3.155
3.471LeuMet: 3.471 ± 0.766
6.508LeuAsn: 6.508 ± 0.884
5.206LeuPro: 5.206 ± 1.219
4.338LeuGln: 4.338 ± 1.385
5.64LeuArg: 5.64 ± 2.371
6.941LeuSer: 6.941 ± 1.672
5.64LeuThr: 5.64 ± 1.42
5.64LeuVal: 5.64 ± 1.161
2.603LeuTrp: 2.603 ± 1.1
3.905LeuTyr: 3.905 ± 0.749
0.0LeuXaa: 0.0 ± 0.0
Met
2.169MetAla: 2.169 ± 1.415
0.434MetCys: 0.434 ± 0.327
1.302MetAsp: 1.302 ± 0.55
2.169MetGlu: 2.169 ± 0.8
0.0MetPhe: 0.0 ± 0.0
3.905MetGly: 3.905 ± 1.196
0.0MetHis: 0.0 ± 0.0
0.434MetIle: 0.434 ± 0.37
0.0MetLys: 0.0 ± 0.0
3.037MetLeu: 3.037 ± 0.576
0.0MetMet: 0.0 ± 0.0
1.735MetAsn: 1.735 ± 0.697
2.169MetPro: 2.169 ± 1.406
4.338MetGln: 4.338 ± 1.704
0.868MetArg: 0.868 ± 0.74
0.434MetSer: 0.434 ± 0.43
1.735MetThr: 1.735 ± 0.65
2.169MetVal: 2.169 ± 0.585
0.434MetTrp: 0.434 ± 0.389
0.434MetTyr: 0.434 ± 0.389
0.0MetXaa: 0.0 ± 0.0
Asn
1.735AsnAla: 1.735 ± 0.63
0.868AsnCys: 0.868 ± 0.654
0.434AsnAsp: 0.434 ± 0.327
2.169AsnGlu: 2.169 ± 0.736
0.434AsnPhe: 0.434 ± 0.327
2.169AsnGly: 2.169 ± 0.584
0.434AsnHis: 0.434 ± 0.327
0.868AsnIle: 0.868 ± 0.654
2.603AsnLys: 2.603 ± 1.224
6.508AsnLeu: 6.508 ± 2.231
1.302AsnMet: 1.302 ± 0.832
1.302AsnAsn: 1.302 ± 0.713
4.338AsnPro: 4.338 ± 0.821
0.868AsnGln: 0.868 ± 0.801
2.603AsnArg: 2.603 ± 2.266
1.735AsnSer: 1.735 ± 0.697
3.037AsnThr: 3.037 ± 2.104
2.603AsnVal: 2.603 ± 0.572
0.0AsnTrp: 0.0 ± 0.0
1.302AsnTyr: 1.302 ± 0.713
0.0AsnXaa: 0.0 ± 0.0
Pro
5.64ProAla: 5.64 ± 1.041
0.868ProCys: 0.868 ± 0.654
5.206ProAsp: 5.206 ± 0.804
3.037ProGlu: 3.037 ± 1.006
0.434ProPhe: 0.434 ± 0.327
3.471ProGly: 3.471 ± 1.356
0.868ProHis: 0.868 ± 0.413
2.169ProIle: 2.169 ± 1.139
2.603ProLys: 2.603 ± 0.858
4.772ProLeu: 4.772 ± 0.275
2.603ProMet: 2.603 ± 1.062
0.434ProAsn: 0.434 ± 0.327
6.508ProPro: 6.508 ± 1.353
4.338ProGln: 4.338 ± 1.924
5.206ProArg: 5.206 ± 1.317
2.603ProSer: 2.603 ± 1.563
6.508ProThr: 6.508 ± 1.631
3.471ProVal: 3.471 ± 1.767
1.302ProTrp: 1.302 ± 0.815
0.868ProTyr: 0.868 ± 0.395
0.0ProXaa: 0.0 ± 0.0
Gln
2.603GlnAla: 2.603 ± 0.572
0.434GlnCys: 0.434 ± 0.327
1.735GlnAsp: 1.735 ± 0.601
2.603GlnGlu: 2.603 ± 0.572
2.603GlnPhe: 2.603 ± 0.662
3.037GlnGly: 3.037 ± 0.69
1.735GlnHis: 1.735 ± 0.92
1.302GlnIle: 1.302 ± 0.646
1.735GlnLys: 1.735 ± 0.63
5.64GlnLeu: 5.64 ± 1.043
1.302GlnMet: 1.302 ± 0.89
0.0GlnAsn: 0.0 ± 0.0
1.735GlnPro: 1.735 ± 0.79
3.905GlnGln: 3.905 ± 1.514
4.772GlnArg: 4.772 ± 1.939
4.772GlnSer: 4.772 ± 1.664
2.169GlnThr: 2.169 ± 0.874
3.037GlnVal: 3.037 ± 1.072
0.868GlnTrp: 0.868 ± 0.577
0.868GlnTyr: 0.868 ± 0.779
0.0GlnXaa: 0.0 ± 0.0
Arg
4.772ArgAla: 4.772 ± 2.41
1.302ArgCys: 1.302 ± 0.55
3.037ArgAsp: 3.037 ± 0.675
3.037ArgGlu: 3.037 ± 0.623
1.735ArgPhe: 1.735 ± 0.493
2.603ArgGly: 2.603 ± 0.769
0.868ArgHis: 0.868 ± 0.74
1.302ArgIle: 1.302 ± 0.642
3.037ArgLys: 3.037 ± 0.81
7.809ArgLeu: 7.809 ± 1.793
4.338ArgMet: 4.338 ± 1.704
1.735ArgAsn: 1.735 ± 0.537
1.735ArgPro: 1.735 ± 0.898
3.037ArgGln: 3.037 ± 1.393
6.074ArgArg: 6.074 ± 1.492
2.603ArgSer: 2.603 ± 1.238
2.603ArgThr: 2.603 ± 0.571
3.905ArgVal: 3.905 ± 0.546
1.302ArgTrp: 1.302 ± 0.89
3.471ArgTyr: 3.471 ± 1.559
0.0ArgXaa: 0.0 ± 0.0
Ser
2.169SerAla: 2.169 ± 0.908
3.037SerCys: 3.037 ± 0.969
3.905SerAsp: 3.905 ± 0.927
3.471SerGlu: 3.471 ± 1.071
1.735SerPhe: 1.735 ± 1.308
5.206SerGly: 5.206 ± 1.168
2.169SerHis: 2.169 ± 0.854
0.434SerIle: 0.434 ± 0.327
2.169SerLys: 2.169 ± 0.977
8.677SerLeu: 8.677 ± 0.958
3.471SerMet: 3.471 ± 0.529
2.169SerAsn: 2.169 ± 0.521
4.338SerPro: 4.338 ± 1.633
3.471SerGln: 3.471 ± 1.41
3.471SerArg: 3.471 ± 1.048
7.375SerSer: 7.375 ± 1.499
3.905SerThr: 3.905 ± 0.953
5.206SerVal: 5.206 ± 0.866
0.0SerTrp: 0.0 ± 0.0
2.169SerTyr: 2.169 ± 1.246
0.0SerXaa: 0.0 ± 0.0
Thr
3.471ThrAla: 3.471 ± 1.202
1.735ThrCys: 1.735 ± 0.493
1.735ThrAsp: 1.735 ± 0.79
4.772ThrGlu: 4.772 ± 1.64
3.905ThrPhe: 3.905 ± 1.143
5.206ThrGly: 5.206 ± 1.034
1.302ThrHis: 1.302 ± 0.815
2.603ThrIle: 2.603 ± 0.911
4.772ThrLys: 4.772 ± 1.96
3.905ThrLeu: 3.905 ± 1.377
1.735ThrMet: 1.735 ± 0.535
0.434ThrAsn: 0.434 ± 0.389
7.375ThrPro: 7.375 ± 1.421
1.302ThrGln: 1.302 ± 0.89
4.772ThrArg: 4.772 ± 0.594
2.169ThrSer: 2.169 ± 0.703
1.735ThrThr: 1.735 ± 1.14
5.206ThrVal: 5.206 ± 2.567
1.302ThrTrp: 1.302 ± 0.89
0.434ThrTyr: 0.434 ± 0.37
0.0ThrXaa: 0.0 ± 0.0
Val
2.169ValAla: 2.169 ± 1.635
0.868ValCys: 0.868 ± 0.654
1.735ValAsp: 1.735 ± 0.751
1.735ValGlu: 1.735 ± 1.079
2.169ValPhe: 2.169 ± 0.736
2.169ValGly: 2.169 ± 1.415
3.905ValHis: 3.905 ± 0.546
1.735ValIle: 1.735 ± 1.601
3.471ValLys: 3.471 ± 1.03
6.508ValLeu: 6.508 ± 1.924
0.868ValMet: 0.868 ± 0.53
4.338ValAsn: 4.338 ± 1.101
3.037ValPro: 3.037 ± 0.777
2.169ValGln: 2.169 ± 0.984
3.471ValArg: 3.471 ± 1.37
4.772ValSer: 4.772 ± 1.805
4.338ValThr: 4.338 ± 1.642
4.338ValVal: 4.338 ± 2.091
1.302ValTrp: 1.302 ± 0.642
3.471ValTyr: 3.471 ± 0.8
0.0ValXaa: 0.0 ± 0.0
Trp
1.302TrpAla: 1.302 ± 0.642
0.0TrpCys: 0.0 ± 0.0
0.434TrpAsp: 0.434 ± 0.327
0.868TrpGlu: 0.868 ± 0.471
0.868TrpPhe: 0.868 ± 0.577
3.471TrpGly: 3.471 ± 0.961
0.434TrpHis: 0.434 ± 0.389
0.434TrpIle: 0.434 ± 0.327
0.434TrpLys: 0.434 ± 0.327
1.302TrpLeu: 1.302 ± 0.642
0.868TrpMet: 0.868 ± 0.801
1.735TrpAsn: 1.735 ± 0.697
0.0TrpPro: 0.0 ± 0.0
0.868TrpGln: 0.868 ± 0.801
1.735TrpArg: 1.735 ± 1.116
1.302TrpSer: 1.302 ± 0.713
0.0TrpThr: 0.0 ± 0.0
1.302TrpVal: 1.302 ± 0.89
0.0TrpTrp: 0.0 ± 0.0
0.434TrpTyr: 0.434 ± 0.327
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.868TyrAla: 0.868 ± 0.577
0.434TyrCys: 0.434 ± 0.327
1.302TyrAsp: 1.302 ± 0.713
0.868TyrGlu: 0.868 ± 0.654
2.169TyrPhe: 2.169 ± 0.901
2.169TyrGly: 2.169 ± 0.984
0.868TyrHis: 0.868 ± 0.395
0.0TyrIle: 0.0 ± 0.0
1.735TyrLys: 1.735 ± 0.493
3.471TyrLeu: 3.471 ± 1.214
1.302TyrMet: 1.302 ± 0.851
2.169TyrAsn: 2.169 ± 1.079
2.169TyrPro: 2.169 ± 1.199
3.037TyrGln: 3.037 ± 0.623
0.868TyrArg: 0.868 ± 0.801
4.338TyrSer: 4.338 ± 1.029
0.868TyrThr: 0.868 ± 0.471
1.302TyrVal: 1.302 ± 0.89
0.434TyrTrp: 0.434 ± 0.327
2.603TyrTyr: 2.603 ± 0.647
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2306 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski