Amino acid dipeptide frequency for Mus musculus papillomavirus type 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
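The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: the function names `dipeptide_per_mille` and `classify` are hypothetical, sequences are assumed to be given in one-letter code (the table uses three-letter codes), and pairs are counted only within each protein, which may differ from how the values in the table were obtained.

```python
from collections import Counter

# Uniform expectation for 20 x 20 dipeptides: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption of this sketch: pairs never span two different proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=UNIFORM_EXPECTATION):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"
```

With this labelling, a value such as 3.702‰ would count as overrepresented and 0.411‰ as underrepresented relative to the 2.5‰ uniform expectation.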

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions; for example, 3.702‰ corresponds to 0.003702 (0.3702%).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.702 ± 0.857
AlaCys: 1.645 ± 1.967
AlaAsp: 5.348 ± 1.445
AlaGlu: 2.879 ± 1.336
AlaPhe: 3.702 ± 1.371
AlaGly: 3.702 ± 0.888
AlaHis: 0.823 ± 0.73
AlaIle: 0.411 ± 0.352
AlaLys: 2.879 ± 1.32
AlaLeu: 5.759 ± 2.096
AlaMet: 1.234 ± 0.565
AlaAsn: 2.468 ± 1.478
AlaPro: 2.468 ± 0.544
AlaGln: 2.057 ± 0.728
AlaArg: 2.879 ± 0.711
AlaSer: 4.525 ± 1.215
AlaThr: 3.291 ± 0.984
AlaVal: 2.057 ± 0.471
AlaTrp: 0.823 ± 0.444
AlaTyr: 1.234 ± 0.565
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.234 ± 0.686
CysCys: 1.234 ± 0.739
CysAsp: 1.645 ± 0.609
CysGlu: 0.823 ± 0.347
CysPhe: 0.823 ± 0.552
CysGly: 1.234 ± 1.32
CysHis: 0.411 ± 0.365
CysIle: 0.823 ± 0.678
CysLys: 1.645 ± 1.414
CysLeu: 1.234 ± 0.686
CysMet: 0.411 ± 0.662
CysAsn: 0.411 ± 0.325
CysPro: 1.645 ± 0.792
CysGln: 1.234 ± 0.398
CysArg: 0.411 ± 0.662
CysSer: 2.057 ± 1.063
CysThr: 0.411 ± 0.365
CysVal: 1.645 ± 1.53
CysTrp: 1.234 ± 0.454
CysTyr: 0.411 ± 0.325
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.645 ± 0.833
AspCys: 1.234 ± 0.454
AspAsp: 2.468 ± 0.622
AspGlu: 3.702 ± 1.052
AspPhe: 4.936 ± 1.552
AspGly: 3.291 ± 0.671
AspHis: 1.234 ± 0.853
AspIle: 4.114 ± 1.552
AspLys: 2.468 ± 0.96
AspLeu: 6.17 ± 2.159
AspMet: 0.823 ± 0.423
AspAsn: 4.936 ± 1.079
AspPro: 3.702 ± 1.505
AspGln: 2.057 ± 1.012
AspArg: 2.468 ± 0.882
AspSer: 7.816 ± 2.111
AspThr: 5.348 ± 1.17
AspVal: 5.348 ± 1.707
AspTrp: 0.411 ± 0.325
AspTyr: 1.234 ± 0.786
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.702 ± 1.408
GluCys: 0.823 ± 0.695
GluAsp: 2.879 ± 0.988
GluGlu: 5.759 ± 2.256
GluPhe: 1.234 ± 0.702
GluGly: 2.879 ± 0.992
GluHis: 1.234 ± 0.565
GluIle: 2.468 ± 0.796
GluLys: 2.468 ± 1.962
GluLeu: 6.17 ± 1.647
GluMet: 1.645 ± 0.616
GluAsn: 4.114 ± 0.822
GluPro: 2.879 ± 0.59
GluGln: 3.291 ± 1.278
GluArg: 2.879 ± 0.775
GluSer: 4.114 ± 1.259
GluThr: 3.291 ± 1.121
GluVal: 5.348 ± 1.041
GluTrp: 0.411 ± 0.352
GluTyr: 1.645 ± 0.472
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.468 ± 0.577
PheCys: 2.468 ± 1.434
PheAsp: 2.057 ± 0.763
PheGlu: 3.702 ± 1.03
PhePhe: 2.468 ± 1.222
PheGly: 2.879 ± 0.923
PheHis: 1.234 ± 1.318
PheIle: 2.879 ± 1.095
PheLys: 3.291 ± 1.098
PheLeu: 3.291 ± 0.785
PheMet: 0.823 ± 0.65
PheAsn: 1.234 ± 0.704
PhePro: 2.057 ± 1.106
PheGln: 0.823 ± 0.502
PheArg: 1.645 ± 0.486
PheSer: 2.468 ± 0.575
PheThr: 1.645 ± 0.694
PheVal: 2.468 ± 1.069
PheTrp: 1.234 ± 0.633
PheTyr: 2.468 ± 0.639
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.468 ± 0.583
GlyCys: 1.645 ± 0.61
GlyAsp: 5.348 ± 1.495
GlyGlu: 4.525 ± 1.171
GlyPhe: 2.468 ± 0.774
GlyGly: 4.936 ± 2.224
GlyHis: 1.645 ± 0.694
GlyIle: 3.291 ± 0.944
GlyLys: 3.291 ± 1.496
GlyLeu: 4.525 ± 1.44
GlyMet: 0.0 ± 0.0
GlyAsn: 4.525 ± 0.882
GlyPro: 6.993 ± 3.05
GlyGln: 2.879 ± 0.853
GlyArg: 4.114 ± 1.054
GlySer: 7.404 ± 0.993
GlyThr: 5.759 ± 2.252
GlyVal: 4.936 ± 1.288
GlyTrp: 0.411 ± 0.325
GlyTyr: 1.234 ± 1.317
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.823 ± 0.347
HisCys: 0.823 ± 0.552
HisAsp: 0.823 ± 0.444
HisGlu: 0.823 ± 0.413
HisPhe: 0.823 ± 0.423
HisGly: 0.0 ± 0.0
HisHis: 1.234 ± 0.647
HisIle: 0.823 ± 0.502
HisLys: 1.234 ± 0.732
HisLeu: 1.645 ± 1.286
HisMet: 0.823 ± 0.695
HisAsn: 0.823 ± 0.707
HisPro: 2.057 ± 1.092
HisGln: 0.411 ± 0.352
HisArg: 0.823 ± 0.444
HisSer: 0.823 ± 0.444
HisThr: 1.645 ± 0.619
HisVal: 0.823 ± 0.347
HisTrp: 0.823 ± 0.502
HisTyr: 0.411 ± 0.352
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.879 ± 1.156
IleCys: 0.411 ± 0.365
IleAsp: 3.702 ± 1.092
IleGlu: 2.879 ± 0.877
IlePhe: 1.234 ± 1.094
IleGly: 5.348 ± 1.208
IleHis: 0.823 ± 0.718
IleIle: 2.879 ± 1.462
IleLys: 1.234 ± 0.735
IleLeu: 3.702 ± 0.791
IleMet: 0.0 ± 0.0
IleAsn: 1.645 ± 0.806
IlePro: 0.823 ± 0.423
IleGln: 1.645 ± 0.978
IleArg: 1.645 ± 0.292
IleSer: 6.17 ± 2.302
IleThr: 2.057 ± 0.567
IleVal: 3.702 ± 1.694
IleTrp: 0.411 ± 0.352
IleTyr: 0.823 ± 0.387
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.468 ± 1.543
LysCys: 1.645 ± 0.841
LysAsp: 1.234 ± 0.909
LysGlu: 4.114 ± 1.232
LysPhe: 1.234 ± 1.094
LysGly: 4.936 ± 1.542
LysHis: 1.645 ± 0.989
LysIle: 0.0 ± 0.157
LysLys: 2.468 ± 1.251
LysLeu: 3.702 ± 2.145
LysMet: 1.234 ± 0.924
LysAsn: 1.645 ± 0.751
LysPro: 2.057 ± 0.771
LysGln: 0.823 ± 0.528
LysArg: 4.114 ± 0.939
LysSer: 2.468 ± 1.613
LysThr: 4.525 ± 1.193
LysVal: 4.525 ± 1.148
LysTrp: 0.411 ± 0.365
LysTyr: 2.468 ± 0.621
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.936 ± 1.639
LeuCys: 0.411 ± 0.365
LeuAsp: 6.17 ± 0.958
LeuGlu: 5.348 ± 0.954
LeuPhe: 4.525 ± 1.355
LeuGly: 7.816 ± 2.12
LeuHis: 1.645 ± 0.816
LeuIle: 5.759 ± 1.323
LeuLys: 4.525 ± 1.133
LeuLeu: 9.872 ± 3.433
LeuMet: 0.823 ± 0.627
LeuAsn: 2.468 ± 1.179
LeuPro: 4.936 ± 1.248
LeuGln: 5.348 ± 1.395
LeuArg: 5.348 ± 1.367
LeuSer: 4.114 ± 1.292
LeuThr: 4.936 ± 1.004
LeuVal: 4.525 ± 1.398
LeuTrp: 0.823 ± 0.444
LeuTyr: 3.291 ± 0.583
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.645 ± 0.639
MetCys: 0.411 ± 0.365
MetAsp: 1.234 ± 0.653
MetGlu: 1.234 ± 0.86
MetPhe: 1.645 ± 0.486
MetGly: 0.411 ± 0.325
MetHis: 0.411 ± 0.359
MetIle: 0.0 ± 0.0
MetLys: 0.823 ± 0.413
MetLeu: 0.823 ± 0.444
MetMet: 0.411 ± 0.365
MetAsn: 2.057 ± 0.728
MetPro: 0.823 ± 0.413
MetGln: 1.234 ± 0.638
MetArg: 0.823 ± 0.413
MetSer: 0.411 ± 0.325
MetThr: 1.234 ± 0.633
MetVal: 1.234 ± 0.651
MetTrp: 0.823 ± 0.707
MetTyr: 0.411 ± 0.352
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.234 ± 0.565
AsnCys: 0.411 ± 0.359
AsnAsp: 2.057 ± 0.876
AsnGlu: 4.114 ± 0.875
AsnPhe: 2.468 ± 1.041
AsnGly: 3.291 ± 0.438
AsnHis: 0.0 ± 0.0
AsnIle: 0.823 ± 0.707
AsnLys: 3.702 ± 0.78
AsnLeu: 4.114 ± 1.2
AsnMet: 0.411 ± 0.365
AsnAsn: 2.468 ± 0.709
AsnPro: 3.291 ± 1.316
AsnGln: 1.645 ± 0.627
AsnArg: 3.702 ± 0.85
AsnSer: 4.525 ± 1.385
AsnThr: 5.348 ± 1.133
AsnVal: 1.234 ± 0.732
AsnTrp: 1.645 ± 0.61
AsnTyr: 1.234 ± 0.441
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.879 ± 1.39
ProCys: 0.823 ± 0.502
ProAsp: 7.816 ± 2.162
ProGlu: 3.702 ± 1.591
ProPhe: 2.468 ± 0.886
ProGly: 3.291 ± 1.969
ProHis: 0.411 ± 0.523
ProIle: 3.291 ± 1.588
ProLys: 3.702 ± 1.121
ProLeu: 7.404 ± 1.113
ProMet: 1.234 ± 0.975
ProAsn: 2.879 ± 1.281
ProPro: 7.816 ± 2.789
ProGln: 2.057 ± 1.045
ProArg: 2.057 ± 1.134
ProSer: 4.525 ± 1.331
ProThr: 3.702 ± 1.474
ProVal: 2.057 ± 0.675
ProTrp: 0.0 ± 0.0
ProTyr: 1.645 ± 0.933
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.879 ± 0.921
GlnCys: 0.823 ± 0.413
GlnAsp: 4.525 ± 1.025
GlnGlu: 0.823 ± 0.65
GlnPhe: 1.645 ± 0.694
GlnGly: 2.879 ± 0.504
GlnHis: 0.0 ± 0.0
GlnIle: 0.823 ± 0.564
GlnLys: 1.645 ± 1.003
GlnLeu: 4.525 ± 1.444
GlnMet: 1.645 ± 0.472
GlnAsn: 1.234 ± 0.846
GlnPro: 4.114 ± 1.573
GlnGln: 2.879 ± 1.184
GlnArg: 2.057 ± 1.106
GlnSer: 1.645 ± 0.694
GlnThr: 2.057 ± 1.13
GlnVal: 2.468 ± 0.509
GlnTrp: 1.234 ± 0.694
GlnTyr: 0.823 ± 0.347
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.879 ± 0.674
ArgCys: 1.645 ± 0.797
ArgAsp: 2.057 ± 1.016
ArgGlu: 2.468 ± 0.708
ArgPhe: 3.291 ± 0.898
ArgGly: 3.702 ± 1.348
ArgHis: 2.057 ± 0.782
ArgIle: 2.057 ± 1.059
ArgLys: 4.114 ± 0.72
ArgLeu: 6.993 ± 1.747
ArgMet: 0.411 ± 0.437
ArgAsn: 2.057 ± 0.776
ArgPro: 2.468 ± 0.591
ArgGln: 3.702 ± 0.794
ArgArg: 6.17 ± 1.09
ArgSer: 3.291 ± 1.221
ArgThr: 1.645 ± 0.735
ArgVal: 4.936 ± 2.018
ArgTrp: 0.411 ± 0.352
ArgTyr: 2.057 ± 0.717
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.759 ± 1.238
SerCys: 0.411 ± 0.662
SerAsp: 4.114 ± 0.941
SerGlu: 3.291 ± 1.193
SerPhe: 3.291 ± 1.889
SerGly: 8.227 ± 2.261
SerHis: 0.411 ± 0.352
SerIle: 3.291 ± 1.299
SerLys: 1.645 ± 0.694
SerLeu: 6.582 ± 1.796
SerMet: 1.234 ± 0.319
SerAsn: 3.702 ± 1.278
SerPro: 4.525 ± 0.946
SerGln: 1.645 ± 0.694
SerArg: 5.348 ± 1.359
SerSer: 10.284 ± 2.339
SerThr: 6.582 ± 0.901
SerVal: 6.582 ± 2.341
SerTrp: 0.0 ± 0.0
SerTyr: 2.468 ± 0.824
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.114 ± 1.373
ThrCys: 1.645 ± 0.94
ThrAsp: 4.936 ± 1.255
ThrGlu: 2.468 ± 0.511
ThrPhe: 1.645 ± 0.628
ThrGly: 6.582 ± 1.858
ThrHis: 0.411 ± 0.352
ThrIle: 3.291 ± 1.324
ThrLys: 1.234 ± 0.588
ThrLeu: 4.936 ± 0.619
ThrMet: 2.057 ± 0.717
ThrAsn: 2.468 ± 1.353
ThrPro: 6.17 ± 1.417
ThrGln: 2.468 ± 0.574
ThrArg: 5.348 ± 0.898
ThrSer: 5.348 ± 1.038
ThrThr: 6.17 ± 0.806
ThrVal: 6.17 ± 1.84
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.645 ± 0.735
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.879 ± 0.59
ValCys: 1.234 ± 0.964
ValAsp: 4.525 ± 0.961
ValGlu: 4.936 ± 1.604
ValPhe: 2.468 ± 0.577
ValGly: 3.702 ± 1.022
ValHis: 1.645 ± 0.987
ValIle: 5.759 ± 2.825
ValLys: 2.468 ± 0.588
ValLeu: 2.879 ± 0.817
ValMet: 1.234 ± 0.702
ValAsn: 4.525 ± 0.634
ValPro: 3.702 ± 1.496
ValGln: 2.057 ± 0.471
ValArg: 3.291 ± 1.822
ValSer: 5.348 ± 1.64
ValThr: 6.17 ± 2.64
ValVal: 3.702 ± 1.513
ValTrp: 0.823 ± 0.347
ValTyr: 3.291 ± 0.888
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.411 ± 0.325
TrpCys: 0.411 ± 0.325
TrpAsp: 0.411 ± 0.365
TrpGlu: 0.823 ± 0.502
TrpPhe: 0.0 ± 0.0
TrpGly: 0.411 ± 0.352
TrpHis: 0.823 ± 0.502
TrpIle: 0.823 ± 0.65
TrpLys: 1.645 ± 0.989
TrpLeu: 1.234 ± 0.565
TrpMet: 0.0 ± 0.0
TrpAsn: 0.823 ± 0.347
TrpPro: 0.411 ± 0.365
TrpGln: 0.411 ± 0.352
TrpArg: 1.234 ± 0.718
TrpSer: 0.411 ± 0.352
TrpThr: 1.234 ± 0.786
TrpVal: 0.823 ± 0.444
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.879 ± 1.122
TyrCys: 0.823 ± 0.695
TyrAsp: 2.057 ± 0.699
TyrGlu: 0.823 ± 0.444
TyrPhe: 1.234 ± 0.565
TyrGly: 2.879 ± 1.233
TyrHis: 0.411 ± 0.352
TyrIle: 0.411 ± 0.361
TyrLys: 1.645 ± 0.735
TyrLeu: 2.057 ± 0.675
TyrMet: 1.234 ± 0.397
TyrAsn: 1.234 ± 0.651
TyrPro: 0.823 ± 0.347
TyrGln: 2.057 ± 0.9
TyrArg: 2.057 ± 0.892
TyrSer: 1.645 ± 0.841
TyrThr: 2.057 ± 0.826
TyrVal: 2.057 ± 0.73
TyrTrp: 0.411 ± 0.352
TyrTyr: 1.645 ± 1.11
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2432 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
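A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions, not the database's implementation: the names `pair_counts` and `bootstrap_error` are hypothetical, sequences are assumed to be in one-letter code, pairs are counted within each protein, and the error is taken to be the standard deviation of the resampled per-mille frequencies, which the note above does not specify.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread (per mille) of one dipeptide frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Draw whole proteins with replacement; the sample keeps the original size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # a missing pair counts as 0
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the reported error is the standard deviation of the estimates.
    return statistics.pstdev(estimates)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so with only 7 proteins a single unusual protein can widen the error considerably.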

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski