Amino acid dipepetide frequency for Gammapapillomavirus 8

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.132AlaAla: 4.132 ± 1.736
0.413AlaCys: 0.413 ± 0.489
2.893AlaAsp: 2.893 ± 1.086
3.306AlaGlu: 3.306 ± 1.668
3.306AlaPhe: 3.306 ± 1.481
3.306AlaGly: 3.306 ± 1.52
1.24AlaHis: 1.24 ± 0.339
3.306AlaIle: 3.306 ± 0.883
4.132AlaLys: 4.132 ± 0.713
5.785AlaLeu: 5.785 ± 1.188
0.0AlaMet: 0.0 ± 0.0
3.306AlaAsn: 3.306 ± 1.259
0.826AlaPro: 0.826 ± 0.466
1.24AlaGln: 1.24 ± 0.63
2.893AlaArg: 2.893 ± 0.954
1.653AlaSer: 1.653 ± 0.229
4.132AlaThr: 4.132 ± 1.034
4.959AlaVal: 4.959 ± 1.021
0.413AlaTrp: 0.413 ± 0.489
0.826AlaTyr: 0.826 ± 0.44
0.0AlaXaa: 0.0 ± 0.0
Cys
0.413CysAla: 0.413 ± 0.482
1.653CysCys: 1.653 ± 1.132
0.413CysAsp: 0.413 ± 0.436
0.413CysGlu: 0.413 ± 0.436
2.066CysPhe: 2.066 ± 1.059
0.826CysGly: 0.826 ± 0.573
0.826CysHis: 0.826 ± 0.721
1.653CysIle: 1.653 ± 1.053
2.066CysLys: 2.066 ± 1.006
0.826CysLeu: 0.826 ± 0.978
0.0CysMet: 0.0 ± 0.0
0.826CysAsn: 0.826 ± 0.657
1.24CysPro: 1.24 ± 0.784
0.826CysGln: 0.826 ± 0.44
1.24CysArg: 1.24 ± 1.468
1.24CysSer: 1.24 ± 0.771
1.24CysThr: 1.24 ± 0.768
0.826CysVal: 0.826 ± 0.657
1.24CysTrp: 1.24 ± 0.47
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.959AspAla: 4.959 ± 1.139
2.066AspCys: 2.066 ± 0.695
4.132AspAsp: 4.132 ± 1.251
5.785AspGlu: 5.785 ± 1.855
2.479AspPhe: 2.479 ± 1.758
1.653AspGly: 1.653 ± 0.881
0.826AspHis: 0.826 ± 0.44
5.785AspIle: 5.785 ± 1.488
2.066AspLys: 2.066 ± 1.232
5.372AspLeu: 5.372 ± 1.007
2.066AspMet: 2.066 ± 0.672
5.372AspAsn: 5.372 ± 0.756
4.545AspPro: 4.545 ± 1.36
2.066AspGln: 2.066 ± 0.74
2.066AspArg: 2.066 ± 0.478
2.893AspSer: 2.893 ± 0.542
3.306AspThr: 3.306 ± 0.495
3.306AspVal: 3.306 ± 1.653
0.826AspTrp: 0.826 ± 0.424
2.893AspTyr: 2.893 ± 0.542
0.0AspXaa: 0.0 ± 0.0
Glu
3.306GluAla: 3.306 ± 1.579
0.413GluCys: 0.413 ± 0.328
4.132GluAsp: 4.132 ± 1.801
7.851GluGlu: 7.851 ± 2.428
2.893GluPhe: 2.893 ± 0.797
1.653GluGly: 1.653 ± 0.229
1.653GluHis: 1.653 ± 0.663
2.479GluIle: 2.479 ± 0.726
2.066GluLys: 2.066 ± 0.753
5.785GluLeu: 5.785 ± 0.939
0.826GluMet: 0.826 ± 0.424
5.372GluAsn: 5.372 ± 1.682
2.066GluPro: 2.066 ± 0.812
2.893GluGln: 2.893 ± 0.622
2.479GluArg: 2.479 ± 0.995
4.545GluSer: 4.545 ± 1.704
4.959GluThr: 4.959 ± 1.059
4.132GluVal: 4.132 ± 1.378
1.24GluTrp: 1.24 ± 0.734
1.653GluTyr: 1.653 ± 0.747
0.0GluXaa: 0.0 ± 0.0
Phe
2.479PheAla: 2.479 ± 1.129
1.653PheCys: 1.653 ± 1.047
2.893PheAsp: 2.893 ± 0.877
4.959PheGlu: 4.959 ± 2.293
2.066PhePhe: 2.066 ± 0.745
3.306PheGly: 3.306 ± 1.135
0.826PheHis: 0.826 ± 0.488
2.893PheIle: 2.893 ± 0.788
2.893PheLys: 2.893 ± 0.954
3.719PheLeu: 3.719 ± 1.242
1.653PheMet: 1.653 ± 1.03
2.066PheAsn: 2.066 ± 0.879
2.066PhePro: 2.066 ± 0.478
2.479PheGln: 2.479 ± 0.757
1.653PheArg: 1.653 ± 0.5
1.653PheSer: 1.653 ± 0.875
1.653PheThr: 1.653 ± 0.853
2.066PheVal: 2.066 ± 1.124
1.24PheTrp: 1.24 ± 0.643
3.306PheTyr: 3.306 ± 1.526
0.0PheXaa: 0.0 ± 0.0
Gly
2.066GlyAla: 2.066 ± 0.948
0.826GlyCys: 0.826 ± 0.625
4.959GlyAsp: 4.959 ± 1.382
3.719GlyGlu: 3.719 ± 0.955
1.24GlyPhe: 1.24 ± 0.339
3.306GlyGly: 3.306 ± 1.586
2.066GlyHis: 2.066 ± 0.757
3.306GlyIle: 3.306 ± 0.814
2.479GlyLys: 2.479 ± 1.538
3.306GlyLeu: 3.306 ± 0.916
1.24GlyMet: 1.24 ± 0.393
2.479GlyAsn: 2.479 ± 0.785
2.066GlyPro: 2.066 ± 0.84
2.479GlyGln: 2.479 ± 0.623
2.893GlyArg: 2.893 ± 1.258
4.959GlySer: 4.959 ± 1.784
6.612GlyThr: 6.612 ± 1.565
2.893GlyVal: 2.893 ± 1.49
0.0GlyTrp: 0.0 ± 0.0
1.24GlyTyr: 1.24 ± 0.76
0.0GlyXaa: 0.0 ± 0.0
His
0.413HisAla: 0.413 ± 0.436
0.826HisCys: 0.826 ± 0.424
0.413HisAsp: 0.413 ± 0.547
0.826HisGlu: 0.826 ± 0.625
2.066HisPhe: 2.066 ± 0.933
0.826HisGly: 0.826 ± 0.721
0.826HisHis: 0.826 ± 0.713
0.826HisIle: 0.826 ± 0.442
1.24HisLys: 1.24 ± 0.614
4.132HisLeu: 4.132 ± 1.484
0.0HisMet: 0.0 ± 0.0
0.413HisAsn: 0.413 ± 0.396
2.893HisPro: 2.893 ± 0.638
0.0HisGln: 0.0 ± 0.0
0.826HisArg: 0.826 ± 0.486
0.826HisSer: 0.826 ± 0.652
0.413HisThr: 0.413 ± 0.436
0.413HisVal: 0.413 ± 0.328
0.826HisTrp: 0.826 ± 0.488
0.826HisTyr: 0.826 ± 0.44
0.0HisXaa: 0.0 ± 0.0
Ile
2.893IleAla: 2.893 ± 0.934
0.413IleCys: 0.413 ± 0.436
4.545IleAsp: 4.545 ± 1.164
4.959IleGlu: 4.959 ± 1.431
2.066IlePhe: 2.066 ± 0.718
3.306IleGly: 3.306 ± 1.279
1.24IleHis: 1.24 ± 0.57
3.719IleIle: 3.719 ± 1.536
2.066IleLys: 2.066 ± 0.825
4.545IleLeu: 4.545 ± 0.845
0.0IleMet: 0.0 ± 0.0
1.653IleAsn: 1.653 ± 0.661
2.479IlePro: 2.479 ± 1.196
2.479IleGln: 2.479 ± 1.1
1.653IleArg: 1.653 ± 0.83
5.785IleSer: 5.785 ± 2.059
2.893IleThr: 2.893 ± 1.873
5.372IleVal: 5.372 ± 1.947
0.0IleTrp: 0.0 ± 0.0
2.893IleTyr: 2.893 ± 0.838
0.0IleXaa: 0.0 ± 0.0
Lys
2.066LysAla: 2.066 ± 1.045
2.479LysCys: 2.479 ± 1.249
0.826LysAsp: 0.826 ± 0.872
2.479LysGlu: 2.479 ± 1.678
2.479LysPhe: 2.479 ± 1.075
3.719LysGly: 3.719 ± 1.212
2.066LysHis: 2.066 ± 1.346
3.719LysIle: 3.719 ± 0.949
2.066LysLys: 2.066 ± 0.736
4.545LysLeu: 4.545 ± 1.418
1.653LysMet: 1.653 ± 0.528
1.24LysAsn: 1.24 ± 0.62
2.479LysPro: 2.479 ± 0.946
3.719LysGln: 3.719 ± 1.413
5.372LysArg: 5.372 ± 1.121
3.719LysSer: 3.719 ± 1.673
2.479LysThr: 2.479 ± 0.995
2.893LysVal: 2.893 ± 0.875
1.24LysTrp: 1.24 ± 0.393
2.066LysTyr: 2.066 ± 1.201
0.0LysXaa: 0.0 ± 0.0
Leu
5.372LeuAla: 5.372 ± 1.343
0.826LeuCys: 0.826 ± 0.625
6.198LeuAsp: 6.198 ± 1.592
4.132LeuGlu: 4.132 ± 1.632
5.785LeuPhe: 5.785 ± 2.326
6.612LeuGly: 6.612 ± 2.049
1.24LeuHis: 1.24 ± 0.822
4.132LeuIle: 4.132 ± 0.801
5.372LeuLys: 5.372 ± 1.427
8.678LeuLeu: 8.678 ± 2.055
1.24LeuMet: 1.24 ± 0.762
3.719LeuAsn: 3.719 ± 1.23
6.612LeuPro: 6.612 ± 1.352
6.612LeuGln: 6.612 ± 1.815
6.198LeuArg: 6.198 ± 1.092
9.504LeuSer: 9.504 ± 1.768
6.198LeuThr: 6.198 ± 0.9
4.132LeuVal: 4.132 ± 1.796
0.413LeuTrp: 0.413 ± 0.436
2.893LeuTyr: 2.893 ± 1.432
0.0LeuXaa: 0.0 ± 0.0
Met
1.24MetAla: 1.24 ± 0.47
0.413MetCys: 0.413 ± 0.436
0.413MetAsp: 0.413 ± 0.436
1.653MetGlu: 1.653 ± 1.049
0.413MetPhe: 0.413 ± 0.436
0.826MetGly: 0.826 ± 0.44
0.0MetHis: 0.0 ± 0.0
0.826MetIle: 0.826 ± 0.486
0.0MetLys: 0.0 ± 0.0
2.066MetLeu: 2.066 ± 0.901
0.0MetMet: 0.0 ± 0.0
1.24MetAsn: 1.24 ± 0.339
0.413MetPro: 0.413 ± 0.396
1.24MetGln: 1.24 ± 0.665
1.24MetArg: 1.24 ± 0.57
0.826MetSer: 0.826 ± 0.442
0.826MetThr: 0.826 ± 0.486
0.413MetVal: 0.413 ± 0.328
0.0MetTrp: 0.0 ± 0.0
1.24MetTyr: 1.24 ± 0.468
0.0MetXaa: 0.0 ± 0.0
Asn
2.479AsnAla: 2.479 ± 0.937
0.413AsnCys: 0.413 ± 0.489
2.066AsnAsp: 2.066 ± 0.688
2.479AsnGlu: 2.479 ± 1.08
2.893AsnPhe: 2.893 ± 1.308
4.132AsnGly: 4.132 ± 1.849
0.0AsnHis: 0.0 ± 0.0
4.959AsnIle: 4.959 ± 3.376
2.479AsnLys: 2.479 ± 1.096
4.545AsnLeu: 4.545 ± 1.481
0.826AsnMet: 0.826 ± 0.667
2.479AsnAsn: 2.479 ± 0.972
2.066AsnPro: 2.066 ± 0.725
2.066AsnGln: 2.066 ± 0.825
2.893AsnArg: 2.893 ± 0.636
4.132AsnSer: 4.132 ± 1.826
3.719AsnThr: 3.719 ± 1.236
3.719AsnVal: 3.719 ± 0.561
1.653AsnTrp: 1.653 ± 0.74
0.826AsnTyr: 0.826 ± 0.546
0.0AsnXaa: 0.0 ± 0.0
Pro
4.959ProAla: 4.959 ± 1.467
0.413ProCys: 0.413 ± 0.436
5.372ProAsp: 5.372 ± 1.706
2.066ProGlu: 2.066 ± 0.78
1.24ProPhe: 1.24 ± 0.57
1.24ProGly: 1.24 ± 0.585
0.0ProHis: 0.0 ± 0.0
2.893ProIle: 2.893 ± 1.196
2.893ProLys: 2.893 ± 0.94
5.785ProLeu: 5.785 ± 1.363
0.413ProMet: 0.413 ± 0.436
2.066ProAsn: 2.066 ± 0.515
7.851ProPro: 7.851 ± 3.516
2.479ProGln: 2.479 ± 1.365
3.719ProArg: 3.719 ± 1.702
4.959ProSer: 4.959 ± 2.217
5.785ProThr: 5.785 ± 1.983
2.893ProVal: 2.893 ± 0.829
0.0ProTrp: 0.0 ± 0.0
2.893ProTyr: 2.893 ± 0.94
0.0ProXaa: 0.0 ± 0.0
Gln
2.066GlnAla: 2.066 ± 0.407
1.24GlnCys: 1.24 ± 0.682
4.132GlnAsp: 4.132 ± 1.546
2.479GlnGlu: 2.479 ± 1.007
2.479GlnPhe: 2.479 ± 0.971
2.479GlnGly: 2.479 ± 1.057
0.826GlnHis: 0.826 ± 0.556
2.479GlnIle: 2.479 ± 0.476
1.653GlnLys: 1.653 ± 1.155
5.785GlnLeu: 5.785 ± 2.239
1.653GlnMet: 1.653 ± 0.229
2.479GlnAsn: 2.479 ± 0.804
1.653GlnPro: 1.653 ± 0.643
2.479GlnGln: 2.479 ± 1.096
1.653GlnArg: 1.653 ± 0.905
3.719GlnSer: 3.719 ± 1.219
1.653GlnThr: 1.653 ± 0.687
2.893GlnVal: 2.893 ± 1.083
0.413GlnTrp: 0.413 ± 0.328
1.653GlnTyr: 1.653 ± 1.248
0.0GlnXaa: 0.0 ± 0.0
Arg
2.066ArgAla: 2.066 ± 1.044
2.479ArgCys: 2.479 ± 1.491
2.479ArgAsp: 2.479 ± 0.921
0.826ArgGlu: 0.826 ± 0.44
2.066ArgPhe: 2.066 ± 0.445
3.306ArgGly: 3.306 ± 1.017
2.066ArgHis: 2.066 ± 1.081
2.066ArgIle: 2.066 ± 1.346
5.372ArgLys: 5.372 ± 0.991
6.198ArgLeu: 6.198 ± 1.248
0.413ArgMet: 0.413 ± 0.387
2.893ArgAsn: 2.893 ± 0.624
4.545ArgPro: 4.545 ± 2.861
1.653ArgGln: 1.653 ± 0.619
3.719ArgArg: 3.719 ± 1.644
6.198ArgSer: 6.198 ± 2.427
2.479ArgThr: 2.479 ± 1.087
2.066ArgVal: 2.066 ± 0.973
0.0ArgTrp: 0.0 ± 0.0
0.826ArgTyr: 0.826 ± 0.631
0.0ArgXaa: 0.0 ± 0.0
Ser
2.479SerAla: 2.479 ± 1.225
0.413SerCys: 0.413 ± 0.482
3.719SerAsp: 3.719 ± 1.183
5.372SerGlu: 5.372 ± 0.966
2.893SerPhe: 2.893 ± 0.872
6.198SerGly: 6.198 ± 2.149
2.066SerHis: 2.066 ± 0.761
2.893SerIle: 2.893 ± 0.788
3.719SerLys: 3.719 ± 1.394
11.157SerLeu: 11.157 ± 1.614
0.826SerMet: 0.826 ± 0.635
2.893SerAsn: 2.893 ± 1.976
3.719SerPro: 3.719 ± 1.42
3.719SerGln: 3.719 ± 1.359
3.306SerArg: 3.306 ± 1.78
8.678SerSer: 8.678 ± 2.868
6.198SerThr: 6.198 ± 1.827
2.893SerVal: 2.893 ± 1.04
1.24SerTrp: 1.24 ± 0.816
2.066SerTyr: 2.066 ± 0.679
0.0SerXaa: 0.0 ± 0.0
Thr
3.719ThrAla: 3.719 ± 1.314
1.653ThrCys: 1.653 ± 0.512
5.785ThrAsp: 5.785 ± 1.981
3.719ThrGlu: 3.719 ± 1.243
3.719ThrPhe: 3.719 ± 0.883
4.132ThrGly: 4.132 ± 0.988
0.413ThrHis: 0.413 ± 0.489
2.066ThrIle: 2.066 ± 0.444
3.306ThrLys: 3.306 ± 1.197
5.372ThrLeu: 5.372 ± 1.772
1.653ThrMet: 1.653 ± 0.648
3.719ThrAsn: 3.719 ± 0.679
5.372ThrPro: 5.372 ± 2.116
2.893ThrGln: 2.893 ± 1.496
3.719ThrArg: 3.719 ± 0.844
4.545ThrSer: 4.545 ± 0.834
4.545ThrThr: 4.545 ± 1.307
7.025ThrVal: 7.025 ± 0.995
0.413ThrTrp: 0.413 ± 0.328
1.653ThrTyr: 1.653 ± 0.656
0.0ThrXaa: 0.0 ± 0.0
Val
1.653ValAla: 1.653 ± 0.758
0.413ValCys: 0.413 ± 0.482
5.372ValAsp: 5.372 ± 1.272
3.719ValGlu: 3.719 ± 1.251
2.066ValPhe: 2.066 ± 0.407
2.066ValGly: 2.066 ± 0.963
0.826ValHis: 0.826 ± 0.466
3.306ValIle: 3.306 ± 1.036
2.066ValLys: 2.066 ± 0.869
3.306ValLeu: 3.306 ± 0.982
0.413ValMet: 0.413 ± 0.436
3.719ValAsn: 3.719 ± 0.755
4.545ValPro: 4.545 ± 1.356
2.893ValGln: 2.893 ± 0.972
4.132ValArg: 4.132 ± 1.445
6.198ValSer: 6.198 ± 0.813
4.545ValThr: 4.545 ± 1.102
1.653ValVal: 1.653 ± 0.661
1.653ValTrp: 1.653 ± 0.815
1.653ValTyr: 1.653 ± 0.687
0.0ValXaa: 0.0 ± 0.0
Trp
1.24TrpAla: 1.24 ± 0.647
0.0TrpCys: 0.0 ± 0.0
1.653TrpAsp: 1.653 ± 1.111
0.0TrpGlu: 0.0 ± 0.0
0.826TrpPhe: 0.826 ± 0.424
0.413TrpGly: 0.413 ± 0.328
0.413TrpHis: 0.413 ± 0.387
0.826TrpIle: 0.826 ± 0.486
1.653TrpLys: 1.653 ± 0.568
1.653TrpLeu: 1.653 ± 0.921
0.0TrpMet: 0.0 ± 0.0
0.413TrpAsn: 0.413 ± 0.436
0.413TrpPro: 0.413 ± 0.436
0.826TrpGln: 0.826 ± 0.872
0.826TrpArg: 0.826 ± 0.573
0.0TrpSer: 0.0 ± 0.0
1.653TrpThr: 1.653 ± 1.111
0.413TrpVal: 0.413 ± 0.328
0.0TrpTrp: 0.0 ± 0.0
0.413TrpTyr: 0.413 ± 0.387
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.066TyrAla: 2.066 ± 0.852
0.826TyrCys: 0.826 ± 0.978
2.066TyrAsp: 2.066 ± 0.444
1.653TyrGlu: 1.653 ± 0.661
2.479TyrPhe: 2.479 ± 1.15
0.826TyrGly: 0.826 ± 0.488
0.413TyrHis: 0.413 ± 0.547
0.826TyrIle: 0.826 ± 0.442
3.719TyrLys: 3.719 ± 1.472
3.306TyrLeu: 3.306 ± 0.82
0.0TyrMet: 0.0 ± 0.0
2.479TyrAsn: 2.479 ± 0.576
2.066TyrPro: 2.066 ± 0.637
0.826TyrGln: 0.826 ± 0.486
1.24TyrArg: 1.24 ± 0.784
0.826TyrSer: 0.826 ± 0.466
4.132TyrThr: 4.132 ± 1.045
1.24TyrVal: 1.24 ± 0.47
0.826TyrTrp: 0.826 ± 0.488
2.479TyrTyr: 2.479 ± 0.995
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2421 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski