Amino acid dipepetide frequency for Gammapapillomavirus 23

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.168AlaAla: 6.168 ± 1.811
0.822AlaCys: 0.822 ± 0.687
4.934AlaAsp: 4.934 ± 1.123
4.112AlaGlu: 4.112 ± 1.507
2.056AlaPhe: 2.056 ± 0.995
3.289AlaGly: 3.289 ± 1.38
0.411AlaHis: 0.411 ± 0.359
2.467AlaIle: 2.467 ± 0.863
3.289AlaLys: 3.289 ± 1.404
4.934AlaLeu: 4.934 ± 1.929
1.234AlaMet: 1.234 ± 0.667
0.822AlaAsn: 0.822 ± 0.726
2.056AlaPro: 2.056 ± 1.397
1.234AlaGln: 1.234 ± 0.485
3.289AlaArg: 3.289 ± 1.406
2.467AlaSer: 2.467 ± 0.822
5.345AlaThr: 5.345 ± 1.123
4.112AlaVal: 4.112 ± 2.177
0.0AlaTrp: 0.0 ± 0.0
1.234AlaTyr: 1.234 ± 0.623
0.0AlaXaa: 0.0 ± 0.0
Cys
0.411CysAla: 0.411 ± 0.341
1.234CysCys: 1.234 ± 0.87
1.234CysAsp: 1.234 ± 0.841
0.411CysGlu: 0.411 ± 0.686
0.411CysPhe: 0.411 ± 0.686
0.822CysGly: 0.822 ± 0.589
0.0CysHis: 0.0 ± 0.0
1.645CysIle: 1.645 ± 1.257
1.645CysLys: 1.645 ± 1.999
2.467CysLeu: 2.467 ± 2.062
0.411CysMet: 0.411 ± 0.664
1.645CysAsn: 1.645 ± 0.749
2.056CysPro: 2.056 ± 1.106
0.822CysGln: 0.822 ± 0.427
1.234CysArg: 1.234 ± 1.33
2.056CysSer: 2.056 ± 1.001
0.822CysThr: 0.822 ± 0.415
1.645CysVal: 1.645 ± 2.17
1.234CysTrp: 1.234 ± 0.4
0.411CysTyr: 0.411 ± 0.341
0.0CysXaa: 0.0 ± 0.0
Asp
5.345AspAla: 5.345 ± 1.223
0.822AspCys: 0.822 ± 0.427
5.757AspAsp: 5.757 ± 2.085
4.523AspGlu: 4.523 ± 1.677
4.112AspPhe: 4.112 ± 1.019
2.056AspGly: 2.056 ± 1.058
1.234AspHis: 1.234 ± 0.4
3.701AspIle: 3.701 ± 0.943
1.645AspLys: 1.645 ± 0.629
5.757AspLeu: 5.757 ± 2.17
0.822AspMet: 0.822 ± 0.726
4.112AspAsn: 4.112 ± 1.617
4.523AspPro: 4.523 ± 1.509
1.645AspGln: 1.645 ± 0.672
2.056AspArg: 2.056 ± 0.578
3.289AspSer: 3.289 ± 1.514
5.345AspThr: 5.345 ± 1.739
5.345AspVal: 5.345 ± 1.862
1.234AspTrp: 1.234 ± 0.841
2.056AspTyr: 2.056 ± 0.833
0.0AspXaa: 0.0 ± 0.0
Glu
5.757GluAla: 5.757 ± 1.555
0.822GluCys: 0.822 ± 0.687
4.112GluAsp: 4.112 ± 0.684
6.579GluGlu: 6.579 ± 2.565
1.645GluPhe: 1.645 ± 1.335
4.523GluGly: 4.523 ± 1.436
0.822GluHis: 0.822 ± 0.469
3.289GluIle: 3.289 ± 1.431
4.523GluLys: 4.523 ± 2.161
4.523GluLeu: 4.523 ± 1.229
2.467GluMet: 2.467 ± 1.336
4.112GluAsn: 4.112 ± 1.32
2.467GluPro: 2.467 ± 0.905
3.289GluGln: 3.289 ± 0.788
3.289GluArg: 3.289 ± 1.372
4.523GluSer: 4.523 ± 1.247
3.701GluThr: 3.701 ± 1.347
2.056GluVal: 2.056 ± 0.533
0.822GluTrp: 0.822 ± 0.446
1.645GluTyr: 1.645 ± 1.444
0.0GluXaa: 0.0 ± 0.0
Phe
1.645PheAla: 1.645 ± 0.927
1.645PheCys: 1.645 ± 0.802
2.056PheAsp: 2.056 ± 0.808
2.878PheGlu: 2.878 ± 1.159
2.467PhePhe: 2.467 ± 1.257
2.467PheGly: 2.467 ± 1.034
0.822PheHis: 0.822 ± 0.692
2.056PheIle: 2.056 ± 1.095
3.289PheLys: 3.289 ± 1.358
6.579PheLeu: 6.579 ± 1.183
1.645PheMet: 1.645 ± 0.999
3.701PheAsn: 3.701 ± 1.78
2.878PhePro: 2.878 ± 0.643
1.234PheGln: 1.234 ± 0.856
2.878PheArg: 2.878 ± 1.067
2.056PheSer: 2.056 ± 0.828
2.056PheThr: 2.056 ± 1.088
2.467PheVal: 2.467 ± 1.07
0.822PheTrp: 0.822 ± 0.415
2.467PheTyr: 2.467 ± 0.904
0.0PheXaa: 0.0 ± 0.0
Gly
2.056GlyAla: 2.056 ± 1.02
0.411GlyCys: 0.411 ± 0.363
3.701GlyAsp: 3.701 ± 0.878
2.878GlyGlu: 2.878 ± 1.574
1.645GlyPhe: 1.645 ± 0.336
3.289GlyGly: 3.289 ± 1.089
0.822GlyHis: 0.822 ± 0.469
4.112GlyIle: 4.112 ± 1.666
2.056GlyLys: 2.056 ± 1.094
4.523GlyLeu: 4.523 ± 1.219
1.234GlyMet: 1.234 ± 0.701
4.112GlyAsn: 4.112 ± 1.125
3.701GlyPro: 3.701 ± 1.118
2.467GlyGln: 2.467 ± 1.522
3.289GlyArg: 3.289 ± 0.884
5.345GlySer: 5.345 ± 1.826
3.289GlyThr: 3.289 ± 1.236
3.701GlyVal: 3.701 ± 1.579
0.0GlyTrp: 0.0 ± 0.0
0.411GlyTyr: 0.411 ± 0.606
0.0GlyXaa: 0.0 ± 0.0
His
0.822HisAla: 0.822 ± 0.682
0.822HisCys: 0.822 ± 0.687
0.0HisAsp: 0.0 ± 0.0
0.822HisGlu: 0.822 ± 0.446
0.822HisPhe: 0.822 ± 0.427
0.822HisGly: 0.822 ± 0.589
0.0HisHis: 0.0 ± 0.0
0.822HisIle: 0.822 ± 0.427
0.822HisLys: 0.822 ± 0.446
2.056HisLeu: 2.056 ± 0.691
0.411HisMet: 0.411 ± 0.659
1.234HisAsn: 1.234 ± 0.4
0.411HisPro: 0.411 ± 0.363
0.822HisGln: 0.822 ± 0.659
1.645HisArg: 1.645 ± 1.263
0.411HisSer: 0.411 ± 0.428
2.056HisThr: 2.056 ± 0.75
0.0HisVal: 0.0 ± 0.0
1.234HisTrp: 1.234 ± 0.826
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.467IleAla: 2.467 ± 0.992
1.234IleCys: 1.234 ± 0.581
3.701IleAsp: 3.701 ± 1.342
5.345IleGlu: 5.345 ± 1.551
2.878IlePhe: 2.878 ± 0.518
4.523IleGly: 4.523 ± 2.036
1.234IleHis: 1.234 ± 0.618
2.467IleIle: 2.467 ± 1.079
1.645IleLys: 1.645 ± 0.629
2.878IleLeu: 2.878 ± 0.908
0.822IleMet: 0.822 ± 0.659
2.467IleAsn: 2.467 ± 0.89
4.112IlePro: 4.112 ± 1.061
3.289IleGln: 3.289 ± 1.38
2.467IleArg: 2.467 ± 1.102
3.701IleSer: 3.701 ± 1.101
2.056IleThr: 2.056 ± 1.088
3.289IleVal: 3.289 ± 0.52
0.411IleTrp: 0.411 ± 0.341
0.822IleTyr: 0.822 ± 0.469
0.0IleXaa: 0.0 ± 0.0
Lys
1.645LysAla: 1.645 ± 0.846
3.289LysCys: 3.289 ± 1.346
3.289LysAsp: 3.289 ± 1.346
3.701LysGlu: 3.701 ± 1.032
2.878LysPhe: 2.878 ± 0.55
2.056LysGly: 2.056 ± 1.15
1.234LysHis: 1.234 ± 0.668
1.645LysIle: 1.645 ± 1.124
2.878LysLys: 2.878 ± 1.42
4.112LysLeu: 4.112 ± 2.19
1.234LysMet: 1.234 ± 0.665
1.234LysAsn: 1.234 ± 0.804
1.234LysPro: 1.234 ± 0.431
0.822LysGln: 0.822 ± 0.415
5.757LysArg: 5.757 ± 1.272
3.701LysSer: 3.701 ± 1.422
2.878LysThr: 2.878 ± 1.194
2.467LysVal: 2.467 ± 0.761
0.822LysTrp: 0.822 ± 0.707
3.289LysTyr: 3.289 ± 1.121
0.0LysXaa: 0.0 ± 0.0
Leu
4.934LeuAla: 4.934 ± 2.596
1.234LeuCys: 1.234 ± 0.856
4.523LeuAsp: 4.523 ± 1.436
7.401LeuGlu: 7.401 ± 1.078
8.224LeuPhe: 8.224 ± 2.197
5.757LeuGly: 5.757 ± 2.158
1.234LeuHis: 1.234 ± 0.431
3.289LeuIle: 3.289 ± 1.001
4.934LeuLys: 4.934 ± 1.512
6.579LeuLeu: 6.579 ± 2.265
1.645LeuMet: 1.645 ± 1.07
2.878LeuAsn: 2.878 ± 0.559
4.523LeuPro: 4.523 ± 1.854
6.168LeuGln: 6.168 ± 3.125
2.878LeuArg: 2.878 ± 1.205
6.99LeuSer: 6.99 ± 1.712
4.523LeuThr: 4.523 ± 1.109
7.401LeuVal: 7.401 ± 1.274
0.411LeuTrp: 0.411 ± 0.363
4.934LeuTyr: 4.934 ± 1.282
0.0LeuXaa: 0.0 ± 0.0
Met
0.411MetAla: 0.411 ± 0.363
0.0MetCys: 0.0 ± 0.0
1.645MetAsp: 1.645 ± 0.902
1.234MetGlu: 1.234 ± 0.942
0.411MetPhe: 0.411 ± 0.363
0.822MetGly: 0.822 ± 0.659
0.411MetHis: 0.411 ± 0.341
0.822MetIle: 0.822 ± 0.574
0.822MetLys: 0.822 ± 0.687
2.467MetLeu: 2.467 ± 0.987
0.0MetMet: 0.0 ± 0.0
1.645MetAsn: 1.645 ± 0.716
0.411MetPro: 0.411 ± 0.341
2.467MetGln: 2.467 ± 1.079
1.234MetArg: 1.234 ± 0.856
0.822MetSer: 0.822 ± 0.726
0.822MetThr: 0.822 ± 0.427
0.822MetVal: 0.822 ± 0.415
0.0MetTrp: 0.0 ± 0.0
0.822MetTyr: 0.822 ± 0.455
0.0MetXaa: 0.0 ± 0.0
Asn
3.289AsnAla: 3.289 ± 1.078
1.645AsnCys: 1.645 ± 1.257
1.234AsnAsp: 1.234 ± 0.701
1.234AsnGlu: 1.234 ± 0.431
1.234AsnPhe: 1.234 ± 0.667
2.878AsnGly: 2.878 ± 1.086
1.234AsnHis: 1.234 ± 1.022
3.289AsnIle: 3.289 ± 1.422
1.645AsnLys: 1.645 ± 0.672
4.934AsnLeu: 4.934 ± 1.028
0.822AsnMet: 0.822 ± 0.427
3.701AsnAsn: 3.701 ± 1.436
4.934AsnPro: 4.934 ± 1.398
4.112AsnGln: 4.112 ± 1.669
2.056AsnArg: 2.056 ± 0.777
3.701AsnSer: 3.701 ± 1.113
3.289AsnThr: 3.289 ± 0.622
4.523AsnVal: 4.523 ± 0.797
1.234AsnTrp: 1.234 ± 0.667
2.056AsnTyr: 2.056 ± 0.578
0.0AsnXaa: 0.0 ± 0.0
Pro
3.289ProAla: 3.289 ± 1.116
0.411ProCys: 0.411 ± 0.341
5.345ProAsp: 5.345 ± 1.942
4.112ProGlu: 4.112 ± 1.788
2.878ProPhe: 2.878 ± 1.03
2.056ProGly: 2.056 ± 1.07
0.0ProHis: 0.0 ± 0.0
3.701ProIle: 3.701 ± 2.397
3.289ProLys: 3.289 ± 0.547
4.934ProLeu: 4.934 ± 0.986
0.0ProMet: 0.0 ± 0.0
2.878ProAsn: 2.878 ± 0.625
8.224ProPro: 8.224 ± 2.53
2.878ProGln: 2.878 ± 1.592
1.645ProArg: 1.645 ± 0.762
3.701ProSer: 3.701 ± 1.099
5.757ProThr: 5.757 ± 2.039
3.289ProVal: 3.289 ± 0.876
0.411ProTrp: 0.411 ± 0.363
3.701ProTyr: 3.701 ± 1.405
0.0ProXaa: 0.0 ± 0.0
Gln
2.467GlnAla: 2.467 ± 0.794
0.822GlnCys: 0.822 ± 0.584
3.701GlnAsp: 3.701 ± 1.505
2.056GlnGlu: 2.056 ± 0.691
1.645GlnPhe: 1.645 ± 0.587
2.467GlnGly: 2.467 ± 0.489
0.822GlnHis: 0.822 ± 0.584
4.112GlnIle: 4.112 ± 1.318
1.645GlnLys: 1.645 ± 0.846
6.168GlnLeu: 6.168 ± 1.43
1.234GlnMet: 1.234 ± 0.947
2.056GlnAsn: 2.056 ± 0.765
0.822GlnPro: 0.822 ± 0.502
2.056GlnGln: 2.056 ± 0.949
2.056GlnArg: 2.056 ± 1.796
1.234GlnSer: 1.234 ± 0.804
3.289GlnThr: 3.289 ± 0.672
1.234GlnVal: 1.234 ± 0.803
1.645GlnTrp: 1.645 ± 0.834
1.645GlnTyr: 1.645 ± 1.075
0.0GlnXaa: 0.0 ± 0.0
Arg
4.523ArgAla: 4.523 ± 1.168
1.234ArgCys: 1.234 ± 0.637
2.878ArgAsp: 2.878 ± 0.835
2.467ArgGlu: 2.467 ± 1.074
0.0ArgPhe: 0.0 ± 0.0
2.878ArgGly: 2.878 ± 1.312
1.234ArgHis: 1.234 ± 0.721
1.645ArgIle: 1.645 ± 1.168
4.934ArgLys: 4.934 ± 0.7
6.99ArgLeu: 6.99 ± 1.911
1.645ArgMet: 1.645 ± 0.93
3.701ArgAsn: 3.701 ± 1.168
4.112ArgPro: 4.112 ± 1.939
2.467ArgGln: 2.467 ± 0.904
5.757ArgArg: 5.757 ± 2.442
5.757ArgSer: 5.757 ± 2.084
2.467ArgThr: 2.467 ± 1.091
3.289ArgVal: 3.289 ± 1.175
0.411ArgTrp: 0.411 ± 0.428
2.056ArgTyr: 2.056 ± 0.637
0.0ArgXaa: 0.0 ± 0.0
Ser
1.645SerAla: 1.645 ± 0.891
1.234SerCys: 1.234 ± 0.875
5.345SerAsp: 5.345 ± 1.309
4.523SerGlu: 4.523 ± 1.573
6.99SerPhe: 6.99 ± 1.914
5.345SerGly: 5.345 ± 1.413
2.467SerHis: 2.467 ± 1.522
4.934SerIle: 4.934 ± 1.155
2.467SerLys: 2.467 ± 1.441
6.99SerLeu: 6.99 ± 1.33
0.0SerMet: 0.0 ± 0.0
2.056SerAsn: 2.056 ± 1.28
3.701SerPro: 3.701 ± 1.301
1.645SerGln: 1.645 ± 0.83
4.934SerArg: 4.934 ± 1.732
4.934SerSer: 4.934 ± 2.703
5.757SerThr: 5.757 ± 1.655
2.467SerVal: 2.467 ± 1.204
0.0SerTrp: 0.0 ± 0.0
2.467SerTyr: 2.467 ± 0.926
0.0SerXaa: 0.0 ± 0.0
Thr
1.234ThrAla: 1.234 ± 0.761
1.234ThrCys: 1.234 ± 0.87
3.289ThrAsp: 3.289 ± 1.502
4.934ThrGlu: 4.934 ± 1.57
2.878ThrPhe: 2.878 ± 1.159
4.112ThrGly: 4.112 ± 1.1
1.234ThrHis: 1.234 ± 1.176
4.523ThrIle: 4.523 ± 1.889
2.878ThrLys: 2.878 ± 1.176
5.345ThrLeu: 5.345 ± 1.364
1.234ThrMet: 1.234 ± 0.431
4.523ThrAsn: 4.523 ± 1.684
6.168ThrPro: 6.168 ± 1.312
2.056ThrGln: 2.056 ± 0.637
4.934ThrArg: 4.934 ± 1.228
5.345ThrSer: 5.345 ± 1.878
6.168ThrThr: 6.168 ± 2.012
4.523ThrVal: 4.523 ± 0.966
0.822ThrTrp: 0.822 ± 0.682
1.234ThrTyr: 1.234 ± 0.431
0.0ThrXaa: 0.0 ± 0.0
Val
3.289ValAla: 3.289 ± 0.825
1.234ValCys: 1.234 ± 1.33
5.757ValAsp: 5.757 ± 1.716
2.056ValGlu: 2.056 ± 0.916
2.056ValPhe: 2.056 ± 0.828
1.234ValGly: 1.234 ± 0.721
0.822ValHis: 0.822 ± 0.659
2.056ValIle: 2.056 ± 0.991
1.645ValLys: 1.645 ± 0.752
3.701ValLeu: 3.701 ± 1.532
0.822ValMet: 0.822 ± 0.726
3.289ValAsn: 3.289 ± 0.878
5.345ValPro: 5.345 ± 2.079
1.645ValGln: 1.645 ± 0.685
4.112ValArg: 4.112 ± 2.219
6.168ValSer: 6.168 ± 1.958
5.345ValThr: 5.345 ± 2.823
2.056ValVal: 2.056 ± 0.472
1.234ValTrp: 1.234 ± 0.826
2.056ValTyr: 2.056 ± 0.472
0.0ValXaa: 0.0 ± 0.0
Trp
0.822TrpAla: 0.822 ± 0.446
0.411TrpCys: 0.411 ± 0.341
0.822TrpAsp: 0.822 ± 0.726
1.234TrpGlu: 1.234 ± 0.942
0.411TrpPhe: 0.411 ± 0.428
0.0TrpGly: 0.0 ± 0.0
0.411TrpHis: 0.411 ± 0.428
0.411TrpIle: 0.411 ± 0.341
1.234TrpLys: 1.234 ± 1.022
1.645TrpLeu: 1.645 ± 0.83
0.0TrpMet: 0.0 ± 0.0
0.411TrpAsn: 0.411 ± 0.428
0.411TrpPro: 0.411 ± 0.363
0.411TrpGln: 0.411 ± 0.428
1.645TrpArg: 1.645 ± 1.444
0.822TrpSer: 0.822 ± 0.707
0.822TrpThr: 0.822 ± 0.469
0.822TrpVal: 0.822 ± 0.682
0.0TrpTrp: 0.0 ± 0.0
0.411TrpTyr: 0.411 ± 0.341
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.056TyrAla: 2.056 ± 0.637
2.056TyrCys: 2.056 ± 2.677
2.056TyrAsp: 2.056 ± 0.637
2.467TyrGlu: 2.467 ± 1.543
2.467TyrPhe: 2.467 ± 0.799
1.645TyrGly: 1.645 ± 0.716
0.0TyrHis: 0.0 ± 0.0
0.822TyrIle: 0.822 ± 0.415
2.878TyrLys: 2.878 ± 1.071
2.878TyrLeu: 2.878 ± 1.585
0.0TyrMet: 0.0 ± 0.0
2.467TyrAsn: 2.467 ± 1.053
0.411TyrPro: 0.411 ± 0.363
1.645TyrGln: 1.645 ± 0.62
3.289TyrArg: 3.289 ± 0.891
2.878TyrSer: 2.878 ± 1.435
2.878TyrThr: 2.878 ± 0.643
0.411TyrVal: 0.411 ± 0.341
0.411TyrTrp: 0.411 ± 0.363
4.523TyrTyr: 4.523 ± 1.835
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2433 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski