Amino acid dipeptide frequency for Human papillomavirus 167

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly with equal probability, each of the 400 would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.
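The counting described above can be sketched in Python as follows. This is only an illustration, not the database's own code: the function names (`dipeptide_permille`, `label`), the one-letter alphabet, the use of `X` for Xaa, and the normalization by the total number of counted dipeptides are all assumptions. The comparison against the uniform 2.5‰ expectation mirrors the red/blue marking, whose exact rule is not stated here.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids (one-letter codes); "X" stands for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"

# Uniform expectation over the 400 standard dipeptides, in per mille.
UNIFORM_PERMILLE = 1000.0 / 400  # 2.5

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to X (assumption).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }

def label(value, expected=UNIFORM_PERMILLE):
    """Classify a per mille value relative to the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"
```

With the values from the table, `label(3.986)` gives `"over"` (AlaAla) and `label(0.886)` gives `"under"` (AlaCys).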

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions (for example, 3.986‰ corresponds to 0.003986).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.986AlaAla: 3.986 ± 1.467
0.886AlaCys: 0.886 ± 0.643
3.986AlaAsp: 3.986 ± 1.074
6.2AlaGlu: 6.2 ± 1.494
3.543AlaPhe: 3.543 ± 0.762
0.886AlaGly: 0.886 ± 0.396
1.329AlaHis: 1.329 ± 0.671
3.986AlaIle: 3.986 ± 1.155
3.543AlaLys: 3.543 ± 0.737
3.543AlaLeu: 3.543 ± 0.888
0.0AlaMet: 0.0 ± 0.0
2.214AlaAsn: 2.214 ± 0.594
3.986AlaPro: 3.986 ± 1.155
0.886AlaGln: 0.886 ± 0.52
2.657AlaArg: 2.657 ± 1.024
2.657AlaSer: 2.657 ± 0.928
3.986AlaThr: 3.986 ± 0.953
3.986AlaVal: 3.986 ± 0.858
0.0AlaTrp: 0.0 ± 0.0
1.771AlaTyr: 1.771 ± 0.782
0.0AlaXaa: 0.0 ± 0.0
Cys
1.771CysAla: 1.771 ± 1.416
2.214CysCys: 2.214 ± 2.426
2.214CysAsp: 2.214 ± 1.562
0.443CysGlu: 0.443 ± 0.591
2.214CysPhe: 2.214 ± 1.016
0.443CysGly: 0.443 ± 0.591
0.0CysHis: 0.0 ± 0.0
1.329CysIle: 1.329 ± 0.839
0.886CysLys: 0.886 ± 0.732
2.657CysLeu: 2.657 ± 2.422
0.0CysMet: 0.0 ± 0.0
1.771CysAsn: 1.771 ± 1.121
1.329CysPro: 1.329 ± 0.703
0.443CysGln: 0.443 ± 0.408
0.886CysArg: 0.886 ± 0.643
2.657CysSer: 2.657 ± 1.919
1.329CysThr: 1.329 ± 0.673
1.329CysVal: 1.329 ± 0.927
0.443CysTrp: 0.443 ± 0.362
0.443CysTyr: 0.443 ± 0.366
0.0CysXaa: 0.0 ± 0.0
Asp
3.1AspAla: 3.1 ± 0.488
2.214AspCys: 2.214 ± 0.691
3.543AspAsp: 3.543 ± 0.719
4.872AspGlu: 4.872 ± 0.814
3.986AspPhe: 3.986 ± 1.074
3.543AspGly: 3.543 ± 1.947
0.886AspHis: 0.886 ± 0.724
3.986AspIle: 3.986 ± 2.196
2.214AspLys: 2.214 ± 0.993
7.086AspLeu: 7.086 ± 1.473
0.443AspMet: 0.443 ± 0.366
4.429AspAsn: 4.429 ± 1.368
5.757AspPro: 5.757 ± 1.775
1.329AspGln: 1.329 ± 0.671
2.214AspArg: 2.214 ± 0.659
4.872AspSer: 4.872 ± 1.573
3.1AspThr: 3.1 ± 1.09
5.757AspVal: 5.757 ± 1.19
0.0AspTrp: 0.0 ± 0.0
2.657AspTyr: 2.657 ± 1.168
0.0AspXaa: 0.0 ± 0.0
Glu
6.643GluAla: 6.643 ± 1.908
1.771GluCys: 1.771 ± 0.932
3.543GluAsp: 3.543 ± 0.807
6.2GluGlu: 6.2 ± 2.817
0.886GluPhe: 0.886 ± 0.738
2.214GluGly: 2.214 ± 1.048
2.214GluHis: 2.214 ± 0.831
4.429GluIle: 4.429 ± 1.718
0.443GluLys: 0.443 ± 0.621
4.429GluLeu: 4.429 ± 1.362
1.329GluMet: 1.329 ± 0.637
2.657GluAsn: 2.657 ± 0.594
3.543GluPro: 3.543 ± 2.288
3.1GluGln: 3.1 ± 1.11
2.214GluArg: 2.214 ± 0.868
5.314GluSer: 5.314 ± 1.444
4.429GluThr: 4.429 ± 1.322
3.986GluVal: 3.986 ± 1.091
2.214GluTrp: 2.214 ± 0.922
2.214GluTyr: 2.214 ± 0.463
0.0GluXaa: 0.0 ± 0.0
Phe
3.1PheAla: 3.1 ± 0.537
2.657PheCys: 2.657 ± 1.97
3.1PheAsp: 3.1 ± 0.534
3.543PheGlu: 3.543 ± 1.473
3.1PhePhe: 3.1 ± 1.042
2.214PheGly: 2.214 ± 0.508
0.886PheHis: 0.886 ± 0.443
3.543PheIle: 3.543 ± 1.334
4.872PheLys: 4.872 ± 1.647
5.314PheLeu: 5.314 ± 1.584
1.771PheMet: 1.771 ± 0.645
2.214PheAsn: 2.214 ± 1.576
1.329PhePro: 1.329 ± 0.655
0.443PheGln: 0.443 ± 0.362
1.771PheArg: 1.771 ± 1.449
3.986PheSer: 3.986 ± 0.856
1.329PheThr: 1.329 ± 0.768
3.1PheVal: 3.1 ± 0.488
1.329PheTrp: 1.329 ± 0.671
3.986PheTyr: 3.986 ± 1.023
0.0PheXaa: 0.0 ± 0.0
Gly
0.886GlyAla: 0.886 ± 0.692
0.886GlyCys: 0.886 ± 0.714
5.757GlyAsp: 5.757 ± 1.73
3.986GlyGlu: 3.986 ± 1.138
0.886GlyPhe: 0.886 ± 0.459
1.771GlyGly: 1.771 ± 0.81
2.214GlyHis: 2.214 ± 0.861
2.657GlyIle: 2.657 ± 1.123
4.872GlyLys: 4.872 ± 1.935
2.657GlyLeu: 2.657 ± 1.378
0.0GlyMet: 0.0 ± 0.0
3.986GlyAsn: 3.986 ± 1.06
3.543GlyPro: 3.543 ± 1.278
0.443GlyGln: 0.443 ± 0.408
3.1GlyArg: 3.1 ± 1.148
3.986GlySer: 3.986 ± 1.846
4.429GlyThr: 4.429 ± 2.189
2.214GlyVal: 2.214 ± 1.014
0.0GlyTrp: 0.0 ± 0.0
1.771GlyTyr: 1.771 ± 0.729
0.0GlyXaa: 0.0 ± 0.0
His
0.443HisAla: 0.443 ± 0.362
0.0HisCys: 0.0 ± 0.0
0.886HisAsp: 0.886 ± 0.396
0.443HisGlu: 0.443 ± 0.362
2.214HisPhe: 2.214 ± 0.699
1.329HisGly: 1.329 ± 0.754
0.0HisHis: 0.0 ± 0.0
2.214HisIle: 2.214 ± 1.357
0.443HisLys: 0.443 ± 0.413
0.443HisLeu: 0.443 ± 0.362
0.886HisMet: 0.886 ± 0.459
1.329HisAsn: 1.329 ± 0.341
2.214HisPro: 2.214 ± 1.186
1.771HisGln: 1.771 ± 0.614
1.329HisArg: 1.329 ± 0.795
2.214HisSer: 2.214 ± 0.89
1.771HisThr: 1.771 ± 0.784
0.0HisVal: 0.0 ± 0.0
1.329HisTrp: 1.329 ± 0.729
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
1.771IleAla: 1.771 ± 0.69
1.329IleCys: 1.329 ± 0.673
2.214IleAsp: 2.214 ± 0.393
4.872IleGlu: 4.872 ± 0.949
3.1IlePhe: 3.1 ± 0.665
3.543IleGly: 3.543 ± 1.532
1.771IleHis: 1.771 ± 1.172
3.1IleIle: 3.1 ± 1.712
1.329IleLys: 1.329 ± 0.664
5.314IleLeu: 5.314 ± 1.729
0.443IleMet: 0.443 ± 0.591
3.1IleAsn: 3.1 ± 1.093
3.986IlePro: 3.986 ± 1.721
3.1IleGln: 3.1 ± 1.347
2.214IleArg: 2.214 ± 1.411
4.872IleSer: 4.872 ± 1.527
3.1IleThr: 3.1 ± 0.836
3.986IleVal: 3.986 ± 1.202
0.0IleTrp: 0.0 ± 0.0
2.657IleTyr: 2.657 ± 0.886
0.0IleXaa: 0.0 ± 0.0
Lys
3.986LysAla: 3.986 ± 1.371
2.214LysCys: 2.214 ± 1.813
2.214LysAsp: 2.214 ± 0.671
3.1LysGlu: 3.1 ± 0.951
3.1LysPhe: 3.1 ± 1.415
3.1LysGly: 3.1 ± 0.706
1.329LysHis: 1.329 ± 0.795
1.329LysIle: 1.329 ± 0.686
2.214LysLys: 2.214 ± 1.188
3.543LysLeu: 3.543 ± 1.689
1.329LysMet: 1.329 ± 0.664
5.314LysAsn: 5.314 ± 1.478
2.657LysPro: 2.657 ± 1.006
1.771LysGln: 1.771 ± 1.006
3.543LysArg: 3.543 ± 0.713
5.757LysSer: 5.757 ± 1.445
3.543LysThr: 3.543 ± 1.067
2.214LysVal: 2.214 ± 0.873
1.329LysTrp: 1.329 ± 0.421
1.771LysTyr: 1.771 ± 0.252
0.0LysXaa: 0.0 ± 0.0
Leu
3.1LeuAla: 3.1 ± 1.449
2.657LeuCys: 2.657 ± 1.429
7.972LeuAsp: 7.972 ± 1.888
3.986LeuGlu: 3.986 ± 1.629
2.657LeuPhe: 2.657 ± 0.768
6.2LeuGly: 6.2 ± 2.4
1.771LeuHis: 1.771 ± 1.02
4.429LeuIle: 4.429 ± 1.36
4.872LeuLys: 4.872 ± 1.395
6.2LeuLeu: 6.2 ± 1.301
1.771LeuMet: 1.771 ± 0.871
3.543LeuAsn: 3.543 ± 1.012
1.771LeuPro: 1.771 ± 0.678
6.643LeuGln: 6.643 ± 2.148
4.872LeuArg: 4.872 ± 1.02
5.757LeuSer: 5.757 ± 0.964
6.643LeuThr: 6.643 ± 1.017
4.872LeuVal: 4.872 ± 0.838
0.443LeuTrp: 0.443 ± 0.366
6.2LeuTyr: 6.2 ± 1.258
0.0LeuXaa: 0.0 ± 0.0
Met
0.443MetAla: 0.443 ± 0.366
0.443MetCys: 0.443 ± 0.362
2.214MetAsp: 2.214 ± 0.749
0.443MetGlu: 0.443 ± 0.413
0.886MetPhe: 0.886 ± 0.732
1.329MetGly: 1.329 ± 0.671
0.0MetHis: 0.0 ± 0.0
1.329MetIle: 1.329 ± 0.449
0.886MetLys: 0.886 ± 0.724
0.0MetLeu: 0.0 ± 0.0
0.0MetMet: 0.0 ± 0.0
1.329MetAsn: 1.329 ± 0.449
1.329MetPro: 1.329 ± 0.933
0.443MetGln: 0.443 ± 0.413
0.886MetArg: 0.886 ± 0.729
0.0MetSer: 0.0 ± 0.0
0.886MetThr: 0.886 ± 0.724
0.886MetVal: 0.886 ± 0.396
0.0MetTrp: 0.0 ± 0.0
0.443MetTyr: 0.443 ± 0.408
0.0MetXaa: 0.0 ± 0.0
Asn
3.1AsnAla: 3.1 ± 0.667
1.771AsnCys: 1.771 ± 0.846
2.214AsnAsp: 2.214 ± 0.702
3.1AsnGlu: 3.1 ± 1.158
2.214AsnPhe: 2.214 ± 1.125
1.771AsnGly: 1.771 ± 0.705
0.0AsnHis: 0.0 ± 0.0
3.543AsnIle: 3.543 ± 0.914
3.986AsnLys: 3.986 ± 1.161
7.972AsnLeu: 7.972 ± 2.372
0.886AsnMet: 0.886 ± 0.672
3.543AsnAsn: 3.543 ± 1.047
2.214AsnPro: 2.214 ± 1.116
2.657AsnGln: 2.657 ± 0.687
3.986AsnArg: 3.986 ± 1.474
4.429AsnSer: 4.429 ± 1.845
3.986AsnThr: 3.986 ± 0.759
3.543AsnVal: 3.543 ± 1.219
0.886AsnTrp: 0.886 ± 0.459
1.329AsnTyr: 1.329 ± 0.703
0.0AsnXaa: 0.0 ± 0.0
Pro
3.1ProAla: 3.1 ± 1.439
0.443ProCys: 0.443 ± 0.362
6.2ProAsp: 6.2 ± 2.181
3.1ProGlu: 3.1 ± 0.589
1.329ProPhe: 1.329 ± 0.686
0.886ProGly: 0.886 ± 0.816
0.443ProHis: 0.443 ± 0.366
3.1ProIle: 3.1 ± 1.094
5.314ProLys: 5.314 ± 0.999
5.314ProLeu: 5.314 ± 1.918
0.443ProMet: 0.443 ± 0.366
3.1ProAsn: 3.1 ± 0.791
5.757ProPro: 5.757 ± 1.464
3.1ProGln: 3.1 ± 1.687
0.443ProArg: 0.443 ± 0.362
3.543ProSer: 3.543 ± 1.845
4.429ProThr: 4.429 ± 1.704
3.1ProVal: 3.1 ± 1.588
0.0ProTrp: 0.0 ± 0.0
2.214ProTyr: 2.214 ± 1.098
0.0ProXaa: 0.0 ± 0.0
Gln
1.771GlnAla: 1.771 ± 1.128
0.443GlnCys: 0.443 ± 0.362
0.886GlnAsp: 0.886 ± 0.396
2.657GlnGlu: 2.657 ± 0.448
1.771GlnPhe: 1.771 ± 0.639
2.657GlnGly: 2.657 ± 0.898
1.771GlnHis: 1.771 ± 0.914
2.214GlnIle: 2.214 ± 0.594
1.771GlnLys: 1.771 ± 1.25
5.314GlnLeu: 5.314 ± 1.909
3.1GlnMet: 3.1 ± 1.028
0.443GlnAsn: 0.443 ± 0.366
2.214GlnPro: 2.214 ± 0.838
1.771GlnGln: 1.771 ± 0.708
2.657GlnArg: 2.657 ± 1.237
2.214GlnSer: 2.214 ± 0.901
1.329GlnThr: 1.329 ± 0.449
3.1GlnVal: 3.1 ± 0.989
0.443GlnTrp: 0.443 ± 0.362
2.657GlnTyr: 2.657 ± 0.555
0.0GlnXaa: 0.0 ± 0.0
Arg
3.986ArgAla: 3.986 ± 1.11
0.886ArgCys: 0.886 ± 0.643
3.1ArgAsp: 3.1 ± 1.068
2.657ArgGlu: 2.657 ± 1.227
3.986ArgPhe: 3.986 ± 1.958
3.986ArgGly: 3.986 ± 1.919
1.771ArgHis: 1.771 ± 1.006
2.657ArgIle: 2.657 ± 1.195
5.314ArgLys: 5.314 ± 1.091
5.757ArgLeu: 5.757 ± 1.46
0.443ArgMet: 0.443 ± 0.366
3.1ArgAsn: 3.1 ± 0.69
1.771ArgPro: 1.771 ± 0.79
1.329ArgGln: 1.329 ± 0.421
7.529ArgArg: 7.529 ± 3.113
3.1ArgSer: 3.1 ± 1.028
1.771ArgThr: 1.771 ± 0.729
1.771ArgVal: 1.771 ± 1.029
0.0ArgTrp: 0.0 ± 0.0
2.657ArgTyr: 2.657 ± 0.99
0.0ArgXaa: 0.0 ± 0.0
Ser
3.543SerAla: 3.543 ± 1.41
0.443SerCys: 0.443 ± 0.362
3.986SerAsp: 3.986 ± 0.644
2.657SerGlu: 2.657 ± 1.186
5.314SerPhe: 5.314 ± 1.065
4.872SerGly: 4.872 ± 1.508
0.443SerHis: 0.443 ± 0.408
2.214SerIle: 2.214 ± 1.077
2.214SerLys: 2.214 ± 1.019
7.529SerLeu: 7.529 ± 0.899
0.0SerMet: 0.0 ± 0.0
4.429SerAsn: 4.429 ± 1.845
3.1SerPro: 3.1 ± 1.265
4.872SerGln: 4.872 ± 1.794
5.757SerArg: 5.757 ± 1.65
5.757SerSer: 5.757 ± 2.099
6.2SerThr: 6.2 ± 2.619
3.986SerVal: 3.986 ± 0.741
0.886SerTrp: 0.886 ± 0.827
3.1SerTyr: 3.1 ± 0.673
0.0SerXaa: 0.0 ± 0.0
Thr
3.1ThrAla: 3.1 ± 1.06
0.886ThrCys: 0.886 ± 1.182
4.429ThrAsp: 4.429 ± 0.706
5.314ThrGlu: 5.314 ± 1.413
3.543ThrPhe: 3.543 ± 1.143
4.429ThrGly: 4.429 ± 1.257
0.886ThrHis: 0.886 ± 0.396
4.872ThrIle: 4.872 ± 1.908
2.214ThrLys: 2.214 ± 1.077
5.314ThrLeu: 5.314 ± 1.715
0.0ThrMet: 0.0 ± 0.0
4.872ThrAsn: 4.872 ± 1.687
3.543ThrPro: 3.543 ± 1.624
1.771ThrGln: 1.771 ± 0.678
3.543ThrArg: 3.543 ± 0.821
3.543ThrSer: 3.543 ± 2.09
3.543ThrThr: 3.543 ± 0.661
3.986ThrVal: 3.986 ± 1.403
0.443ThrTrp: 0.443 ± 0.362
2.214ThrTyr: 2.214 ± 0.463
0.0ThrXaa: 0.0 ± 0.0
Val
2.657ValAla: 2.657 ± 1.205
1.329ValCys: 1.329 ± 0.927
4.872ValAsp: 4.872 ± 2.387
3.543ValGlu: 3.543 ± 1.457
3.1ValPhe: 3.1 ± 1.366
2.657ValGly: 2.657 ± 0.687
2.214ValHis: 2.214 ± 0.916
3.1ValIle: 3.1 ± 2.084
4.429ValLys: 4.429 ± 1.368
3.543ValLeu: 3.543 ± 1.334
0.886ValMet: 0.886 ± 0.64
3.543ValAsn: 3.543 ± 0.862
2.214ValPro: 2.214 ± 0.616
2.657ValGln: 2.657 ± 0.687
3.543ValArg: 3.543 ± 0.947
4.429ValSer: 4.429 ± 1.726
3.543ValThr: 3.543 ± 1.146
3.1ValVal: 3.1 ± 1.883
2.214ValTrp: 2.214 ± 1.265
0.443ValTyr: 0.443 ± 0.366
0.0ValXaa: 0.0 ± 0.0
Trp
0.886TrpAla: 0.886 ± 0.724
0.0TrpCys: 0.0 ± 0.0
0.886TrpAsp: 0.886 ± 0.52
0.443TrpGlu: 0.443 ± 0.362
0.886TrpPhe: 0.886 ± 0.442
0.443TrpGly: 0.443 ± 0.366
0.443TrpHis: 0.443 ± 0.413
0.443TrpIle: 0.443 ± 0.362
1.329TrpLys: 1.329 ± 0.655
0.886TrpLeu: 0.886 ± 0.724
0.0TrpMet: 0.0 ± 0.0
0.443TrpAsn: 0.443 ± 0.413
0.0TrpPro: 0.0 ± 0.0
0.886TrpGln: 0.886 ± 0.732
1.329TrpArg: 1.329 ± 0.729
0.443TrpSer: 0.443 ± 0.413
0.886TrpThr: 0.886 ± 0.827
0.886TrpVal: 0.886 ± 0.459
0.0TrpTrp: 0.0 ± 0.0
0.443TrpTyr: 0.443 ± 0.413
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.657TyrAla: 2.657 ± 0.599
1.329TyrCys: 1.329 ± 0.615
1.771TyrAsp: 1.771 ± 0.708
2.214TyrGlu: 2.214 ± 0.74
4.872TyrPhe: 4.872 ± 1.395
1.771TyrGly: 1.771 ± 0.678
0.886TyrHis: 0.886 ± 0.827
1.329TyrIle: 1.329 ± 0.698
2.214TyrLys: 2.214 ± 0.466
3.543TyrLeu: 3.543 ± 0.862
0.0TyrMet: 0.0 ± 0.0
1.771TyrAsn: 1.771 ± 0.599
3.1TyrPro: 3.1 ± 0.791
1.771TyrGln: 1.771 ± 0.599
3.1TyrArg: 3.1 ± 1.06
1.771TyrSer: 1.771 ± 1.148
2.214TyrThr: 2.214 ± 0.758
2.657TyrVal: 2.657 ± 0.713
0.0TyrTrp: 0.0 ± 0.0
0.886TyrTyr: 0.886 ± 0.827
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2259 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
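A protein-level bootstrap of this kind might look like the sketch below. The details are assumptions, since the page does not describe the procedure: proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the error is taken as the standard deviation over the replicates. The names `dipeptide_counts` and `bootstrap_error`, and the `seed` parameter, are hypothetical.

```python
import random
import statistics
from collections import Counter

def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the spread (in per mille) of one dipeptide's frequency.

    Whole proteins are resampled with replacement, so the resampling unit
    is the protein rather than the individual dipeptide.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Standard deviation across replicates (an assumed error definition).
    return statistics.pstdev(estimates)
```

With only 6 proteins, the replicates differ mainly in which proteins are drawn more than once, which would explain why some errors in the table are comparable to the values themselves.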

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski