Amino acid dipeptide frequency for Gammapapillomavirus 22

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
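
The uniform expectation and the over/under-representation marking described above can be sketched in Python. This is only a minimal illustration, not the database's actual pipeline: the function names, the counting of overlapping pairs within each protein, and the normalization by the total number of counted pairs are all assumptions.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"           # X stands in for Xaa (non-standard or unknown)
EXPECTED_PERMILLE = 1000.0 / 400    # uniform expectation: 0.25% = 2.5 per mille


def dipeptide_permille(proteins):
    """Return the frequency (per mille) of all 441 dipeptides.

    Assumption: overlapping pairs are counted within each protein and
    normalized by the total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(permille):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    if permille > EXPECTED_PERMILLE:
        return "over"    # would be marked red in the table
    if permille < EXPECTED_PERMILLE:
        return "under"   # would be marked blue in the table
    return "expected"


if __name__ == "__main__":
    freqs = dipeptide_permille(["AAAC", "LL"])  # toy sequences, not real data
    print(freqs["AA"], classify(freqs["AA"]))
```

With real data, `proteins` would hold the protein sequences of the proteome as one-letter strings.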

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
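
As a small worked conversion, the per mille values in the table can be turned into fractions or percentages as follows. The helper names are hypothetical and exist only for this illustration.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


if __name__ == "__main__":
    # AlaAla is listed at 4.517 per mille in the table.
    print(permille_to_fraction(4.517))  # about 0.004517
    print(permille_to_percent(4.517))   # about 0.4517 (percent)
```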

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.517 ± 2.35
AlaCys: 1.232 ± 0.634
AlaAsp: 4.107 ± 0.928
AlaGlu: 4.517 ± 1.113
AlaPhe: 2.875 ± 0.786
AlaGly: 2.464 ± 0.917
AlaHis: 0.821 ± 0.343
AlaIle: 4.107 ± 1.012
AlaLys: 4.107 ± 0.728
AlaLeu: 4.517 ± 1.685
AlaMet: 0.411 ± 0.369
AlaAsn: 4.107 ± 1.701
AlaPro: 4.517 ± 1.33
AlaGln: 1.643 ± 0.679
AlaArg: 1.232 ± 0.817
AlaSer: 1.643 ± 0.573
AlaThr: 3.285 ± 1.394
AlaVal: 1.643 ± 0.679
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.643 ± 0.732
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.821 ± 0.343
CysCys: 0.821 ± 0.575
CysAsp: 1.232 ± 1.108
CysGlu: 0.821 ± 0.567
CysPhe: 2.053 ± 0.923
CysGly: 0.411 ± 0.655
CysHis: 0.0 ± 0.0
CysIle: 1.232 ± 0.304
CysLys: 2.875 ± 1.428
CysLeu: 1.643 ± 1.149
CysMet: 0.0 ± 0.0
CysAsn: 2.053 ± 0.986
CysPro: 2.464 ± 1.365
CysGln: 0.821 ± 0.565
CysArg: 0.411 ± 0.484
CysSer: 1.643 ± 1.142
CysThr: 1.643 ± 0.977
CysVal: 2.464 ± 0.934
CysTrp: 0.411 ± 0.317
CysTyr: 1.643 ± 0.983
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.464 ± 0.948
AspCys: 2.053 ± 1.026
AspAsp: 4.107 ± 1.478
AspGlu: 6.16 ± 1.929
AspPhe: 2.464 ± 0.794
AspGly: 2.875 ± 1.217
AspHis: 1.232 ± 0.619
AspIle: 3.696 ± 0.931
AspLys: 2.053 ± 0.539
AspLeu: 6.16 ± 1.708
AspMet: 1.232 ± 0.738
AspAsn: 2.875 ± 0.567
AspPro: 4.107 ± 1.478
AspGln: 0.821 ± 0.652
AspArg: 1.643 ± 0.599
AspSer: 3.285 ± 1.138
AspThr: 7.392 ± 1.38
AspVal: 3.285 ± 1.487
AspTrp: 0.821 ± 0.343
AspTyr: 2.464 ± 0.676
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.875 ± 1.401
GluCys: 1.643 ± 0.544
GluAsp: 4.517 ± 1.522
GluGlu: 6.571 ± 0.716
GluPhe: 1.232 ± 0.304
GluGly: 3.696 ± 0.883
GluHis: 0.411 ± 0.484
GluIle: 2.875 ± 0.924
GluLys: 4.517 ± 1.473
GluLeu: 6.982 ± 1.966
GluMet: 0.821 ± 0.343
GluAsn: 4.107 ± 0.744
GluPro: 1.232 ± 0.72
GluGln: 2.875 ± 0.637
GluArg: 3.696 ± 1.8
GluSer: 5.749 ± 1.232
GluThr: 4.928 ± 1.613
GluVal: 3.285 ± 0.771
GluTrp: 1.643 ± 1.054
GluTyr: 2.053 ± 0.761
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.053 ± 0.8
PheCys: 2.875 ± 0.923
PheAsp: 2.875 ± 1.135
PheGlu: 3.285 ± 1.172
PhePhe: 2.464 ± 1.27
PheGly: 4.517 ± 0.887
PheHis: 0.411 ± 0.326
PheIle: 0.411 ± 0.326
PheLys: 3.285 ± 1.609
PheLeu: 4.517 ± 1.102
PheMet: 0.821 ± 0.427
PheAsn: 3.696 ± 0.987
PhePro: 2.875 ± 0.536
PheGln: 0.821 ± 0.343
PheArg: 2.053 ± 0.381
PheSer: 2.053 ± 0.958
PheThr: 2.053 ± 0.499
PheVal: 2.053 ± 0.565
PheTrp: 0.821 ± 0.343
PheTyr: 0.821 ± 0.433
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.285 ± 0.838
GlyCys: 0.411 ± 0.326
GlyAsp: 4.517 ± 0.886
GlyGlu: 2.053 ± 0.809
GlyPhe: 1.643 ± 0.458
GlyGly: 1.643 ± 0.68
GlyHis: 2.053 ± 0.85
GlyIle: 3.696 ± 0.964
GlyLys: 2.464 ± 0.924
GlyLeu: 3.696 ± 1.276
GlyMet: 0.0 ± 0.0
GlyAsn: 2.464 ± 0.803
GlyPro: 2.875 ± 0.798
GlyGln: 1.232 ± 0.702
GlyArg: 3.285 ± 1.37
GlySer: 2.875 ± 0.785
GlyThr: 6.16 ± 1.874
GlyVal: 2.053 ± 0.871
GlyTrp: 0.821 ± 0.405
GlyTyr: 1.643 ± 0.601
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.411 ± 0.326
HisCys: 0.0 ± 0.0
HisAsp: 0.821 ± 0.652
HisGlu: 0.411 ± 0.369
HisPhe: 0.411 ± 0.361
HisGly: 1.232 ± 0.702
HisHis: 0.411 ± 0.317
HisIle: 1.643 ± 1.126
HisLys: 1.232 ± 0.634
HisLeu: 2.053 ± 0.681
HisMet: 0.411 ± 0.445
HisAsn: 1.232 ± 0.516
HisPro: 2.053 ± 1.009
HisGln: 0.411 ± 0.445
HisArg: 0.411 ± 0.484
HisSer: 0.821 ± 0.405
HisThr: 1.232 ± 0.394
HisVal: 0.0 ± 0.0
HisTrp: 0.821 ± 0.575
HisTyr: 0.411 ± 0.326
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.517 ± 1.148
IleCys: 1.232 ± 1.321
IleAsp: 3.285 ± 1.337
IleGlu: 4.928 ± 1.365
IlePhe: 1.643 ± 0.572
IleGly: 3.285 ± 0.838
IleHis: 0.821 ± 0.652
IleIle: 2.464 ± 1.059
IleLys: 1.232 ± 0.77
IleLeu: 3.696 ± 0.846
IleMet: 0.821 ± 0.405
IleAsn: 2.464 ± 1.019
IlePro: 3.285 ± 1.535
IleGln: 3.285 ± 0.728
IleArg: 2.053 ± 0.495
IleSer: 4.928 ± 2.259
IleThr: 2.464 ± 1.102
IleVal: 2.464 ± 0.644
IleTrp: 1.232 ± 0.479
IleTyr: 0.821 ± 0.4
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.285 ± 0.836
LysCys: 1.643 ± 0.544
LysAsp: 4.107 ± 1.853
LysGlu: 4.517 ± 1.33
LysPhe: 2.053 ± 1.338
LysGly: 3.285 ± 1.216
LysHis: 2.464 ± 1.32
LysIle: 2.875 ± 1.03
LysLys: 2.875 ± 0.73
LysLeu: 4.517 ± 1.744
LysMet: 1.643 ± 0.672
LysAsn: 3.285 ± 1.255
LysPro: 0.821 ± 0.565
LysGln: 2.875 ± 0.858
LysArg: 5.339 ± 0.965
LysSer: 2.875 ± 1.378
LysThr: 4.107 ± 1.151
LysVal: 3.285 ± 0.856
LysTrp: 0.411 ± 0.445
LysTyr: 3.285 ± 1.199
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.696 ± 0.577
LeuCys: 1.232 ± 0.694
LeuAsp: 3.696 ± 1.197
LeuGlu: 4.517 ± 1.414
LeuPhe: 3.285 ± 1.986
LeuGly: 6.16 ± 2.025
LeuHis: 1.232 ± 0.304
LeuIle: 2.875 ± 0.73
LeuLys: 6.982 ± 2.701
LeuLeu: 11.499 ± 2.713
LeuMet: 3.285 ± 1.672
LeuAsn: 4.928 ± 1.424
LeuPro: 5.749 ± 1.416
LeuGln: 7.392 ± 2.124
LeuArg: 3.285 ± 0.444
LeuSer: 6.16 ± 1.939
LeuThr: 6.16 ± 1.509
LeuVal: 5.339 ± 1.969
LeuTrp: 0.821 ± 0.465
LeuTyr: 4.517 ± 0.989
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.821 ± 0.567
MetCys: 1.232 ± 0.559
MetAsp: 1.643 ± 0.662
MetGlu: 1.232 ± 0.634
MetPhe: 0.821 ± 0.343
MetGly: 0.411 ± 0.369
MetHis: 0.0 ± 0.0
MetIle: 0.821 ± 0.669
MetLys: 0.411 ± 0.445
MetLeu: 2.053 ± 0.714
MetMet: 0.0 ± 0.0
MetAsn: 1.232 ± 0.394
MetPro: 0.0 ± 0.0
MetGln: 0.411 ± 0.445
MetArg: 0.821 ± 0.739
MetSer: 2.053 ± 1.19
MetThr: 0.0 ± 0.0
MetVal: 0.821 ± 0.509
MetTrp: 0.411 ± 0.317
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.285 ± 0.771
AsnCys: 1.643 ± 1.158
AsnAsp: 2.053 ± 0.618
AsnGlu: 2.464 ± 0.846
AsnPhe: 1.643 ± 0.606
AsnGly: 2.053 ± 0.881
AsnHis: 0.411 ± 0.369
AsnIle: 2.464 ± 1.048
AsnLys: 3.696 ± 1.327
AsnLeu: 5.339 ± 1.707
AsnMet: 0.411 ± 0.361
AsnAsn: 3.285 ± 0.936
AsnPro: 3.696 ± 1.85
AsnGln: 2.053 ± 0.923
AsnArg: 2.053 ± 0.955
AsnSer: 5.339 ± 1.242
AsnThr: 2.053 ± 0.85
AsnVal: 5.339 ± 1.006
AsnTrp: 0.411 ± 0.317
AsnTyr: 1.643 ± 0.841
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.464 ± 1.324
ProCys: 0.821 ± 0.705
ProAsp: 4.107 ± 1.899
ProGlu: 4.107 ± 0.993
ProPhe: 2.875 ± 0.929
ProGly: 0.821 ± 0.527
ProHis: 0.411 ± 0.317
ProIle: 4.107 ± 1.814
ProLys: 4.928 ± 1.199
ProLeu: 5.749 ± 0.799
ProMet: 0.411 ± 0.369
ProAsn: 2.053 ± 0.451
ProPro: 7.392 ± 1.869
ProGln: 2.875 ± 1.041
ProArg: 3.696 ± 0.767
ProSer: 3.696 ± 1.194
ProThr: 5.749 ± 2.446
ProVal: 4.928 ± 1.497
ProTrp: 0.411 ± 0.326
ProTyr: 3.285 ± 1.447
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.464 ± 1.434
GlnCys: 0.411 ± 0.361
GlnAsp: 2.464 ± 0.859
GlnGlu: 2.875 ± 1.107
GlnPhe: 2.053 ± 0.492
GlnGly: 1.232 ± 0.836
GlnHis: 0.411 ± 0.369
GlnIle: 2.875 ± 1.328
GlnLys: 1.232 ± 0.516
GlnLeu: 3.696 ± 1.003
GlnMet: 1.643 ± 0.634
GlnAsn: 2.053 ± 0.924
GlnPro: 2.464 ± 0.784
GlnGln: 2.464 ± 0.939
GlnArg: 2.875 ± 0.813
GlnSer: 0.821 ± 0.343
GlnThr: 2.875 ± 0.706
GlnVal: 2.464 ± 0.877
GlnTrp: 0.821 ± 0.405
GlnTyr: 1.232 ± 0.776
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.285 ± 1.172
ArgCys: 2.053 ± 1.067
ArgAsp: 2.053 ± 0.824
ArgGlu: 3.285 ± 1.11
ArgPhe: 2.464 ± 0.544
ArgGly: 1.643 ± 0.627
ArgHis: 2.053 ± 1.043
ArgIle: 2.464 ± 1.719
ArgLys: 4.517 ± 1.268
ArgLeu: 4.517 ± 0.889
ArgMet: 1.232 ± 0.772
ArgAsn: 2.053 ± 0.809
ArgPro: 4.107 ± 1.358
ArgGln: 0.411 ± 0.361
ArgArg: 6.982 ± 2.474
ArgSer: 4.928 ± 1.006
ArgThr: 2.464 ± 0.701
ArgVal: 3.285 ± 1.699
ArgTrp: 0.411 ± 0.317
ArgTyr: 2.053 ± 1.11
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.517 ± 0.563
SerCys: 1.643 ± 1.588
SerAsp: 4.517 ± 1.813
SerGlu: 3.285 ± 0.775
SerPhe: 4.107 ± 1.309
SerGly: 4.517 ± 0.621
SerHis: 0.411 ± 0.361
SerIle: 4.107 ± 1.762
SerLys: 3.696 ± 0.997
SerLeu: 9.035 ± 2.081
SerMet: 0.411 ± 0.326
SerAsn: 2.464 ± 0.757
SerPro: 2.875 ± 1.062
SerGln: 2.053 ± 0.73
SerArg: 2.875 ± 1.599
SerSer: 5.749 ± 1.928
SerThr: 5.339 ± 1.819
SerVal: 4.107 ± 1.478
SerTrp: 0.411 ± 0.326
SerTyr: 2.464 ± 0.557
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.107 ± 1.059
ThrCys: 2.053 ± 0.539
ThrAsp: 3.285 ± 0.868
ThrGlu: 4.107 ± 0.651
ThrPhe: 3.285 ± 1.154
ThrGly: 4.517 ± 1.802
ThrHis: 1.643 ± 0.265
ThrIle: 2.464 ± 1.239
ThrLys: 1.643 ± 0.458
ThrLeu: 5.749 ± 2.116
ThrMet: 0.821 ± 0.739
ThrAsn: 3.285 ± 0.951
ThrPro: 6.16 ± 0.825
ThrGln: 2.875 ± 0.953
ThrArg: 4.107 ± 1.191
ThrSer: 4.928 ± 0.972
ThrThr: 6.571 ± 1.92
ThrVal: 8.214 ± 2.307
ThrTrp: 0.411 ± 0.361
ThrTyr: 2.464 ± 1.185
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.643 ± 0.434
ValCys: 0.411 ± 0.484
ValAsp: 5.339 ± 1.028
ValGlu: 2.875 ± 1.148
ValPhe: 3.285 ± 0.816
ValGly: 1.643 ± 0.681
ValHis: 1.232 ± 0.829
ValIle: 3.285 ± 1.502
ValLys: 4.107 ± 1.348
ValLeu: 3.285 ± 0.752
ValMet: 0.411 ± 0.326
ValAsn: 1.643 ± 1.045
ValPro: 5.749 ± 1.825
ValGln: 2.053 ± 0.668
ValArg: 5.339 ± 1.758
ValSer: 6.982 ± 1.481
ValThr: 4.517 ± 1.802
ValVal: 5.339 ± 1.627
ValTrp: 0.821 ± 0.567
ValTyr: 2.053 ± 0.881
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.232 ± 0.577
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.643 ± 1.042
TrpPhe: 0.411 ± 0.369
TrpGly: 0.411 ± 0.326
TrpHis: 0.0 ± 0.0
TrpIle: 0.821 ± 0.739
TrpLys: 0.821 ± 0.575
TrpLeu: 1.232 ± 0.708
TrpMet: 0.0 ± 0.0
TrpAsn: 1.232 ± 0.738
TrpPro: 0.411 ± 0.326
TrpGln: 0.0 ± 0.0
TrpArg: 1.643 ± 0.544
TrpSer: 0.0 ± 0.0
TrpThr: 1.643 ± 0.627
TrpVal: 0.411 ± 0.369
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.821 ± 0.527
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.643 ± 0.683
TyrCys: 2.053 ± 1.1
TyrAsp: 2.464 ± 0.552
TyrGlu: 2.053 ± 0.407
TyrPhe: 3.696 ± 1.354
TyrGly: 2.053 ± 0.681
TyrHis: 0.0 ± 0.0
TyrIle: 1.643 ± 0.434
TyrLys: 2.875 ± 1.261
TyrLeu: 2.875 ± 1.041
TyrMet: 0.0 ± 0.0
TyrAsn: 1.232 ± 0.634
TyrPro: 2.053 ± 0.702
TyrGln: 2.053 ± 0.959
TyrArg: 2.464 ± 1.267
TyrSer: 2.053 ± 0.607
TyrThr: 2.053 ± 1.184
TyrVal: 1.232 ± 0.738
TyrTrp: 0.821 ± 0.652
TyrTyr: 2.464 ± 0.523
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2436 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
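
Protein-level bootstrapping of this kind can be sketched as follows. This is a hedged illustration rather than the database's actual procedure: the function names are hypothetical, and it assumes that the reported error is the standard deviation across replicates and that each replicate resamples whole proteins with replacement.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide, counting overlapping pairs.

    Assumption: pairs are counted within each protein and normalized by
    the total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each replicate draws len(proteins) proteins with replacement; the
    error is taken as the sample standard deviation over replicates
    (an assumption, the source does not name the statistic).
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MALLK", "LLGAA", "MKLLA"]  # toy sequences, not real data
    print(dipeptide_permille(toy, "LL"), bootstrap_error(toy, "LL"))
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, which is presumably why the note specifies the protein level.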

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski