Amino acid dipepetide frequency for human papillomavirus 200

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.788AlaAla: 5.788 ± 0.793
0.89AlaCys: 0.89 ± 0.665
4.452AlaAsp: 4.452 ± 1.105
3.117AlaGlu: 3.117 ± 0.759
5.343AlaPhe: 5.343 ± 1.1
3.562AlaGly: 3.562 ± 1.161
0.445AlaHis: 0.445 ± 0.372
1.336AlaIle: 1.336 ± 0.821
4.007AlaLys: 4.007 ± 1.154
3.117AlaLeu: 3.117 ± 0.786
0.445AlaMet: 0.445 ± 0.395
2.671AlaAsn: 2.671 ± 1.378
2.671AlaPro: 2.671 ± 1.373
1.781AlaGln: 1.781 ± 0.775
4.007AlaArg: 4.007 ± 1.491
6.233AlaSer: 6.233 ± 1.685
4.007AlaThr: 4.007 ± 2.359
3.117AlaVal: 3.117 ± 0.928
0.0AlaTrp: 0.0 ± 0.0
2.226AlaTyr: 2.226 ± 0.378
0.0AlaXaa: 0.0 ± 0.0
Cys
0.445CysAla: 0.445 ± 0.372
1.781CysCys: 1.781 ± 1.289
0.0CysAsp: 0.0 ± 0.0
0.89CysGlu: 0.89 ± 0.665
1.781CysPhe: 1.781 ± 1.004
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
1.781CysIle: 1.781 ± 0.763
2.671CysLys: 2.671 ± 1.521
2.226CysLeu: 2.226 ± 1.181
0.445CysMet: 0.445 ± 0.526
0.445CysAsn: 0.445 ± 0.372
1.336CysPro: 1.336 ± 1.669
1.336CysGln: 1.336 ± 1.128
0.89CysArg: 0.89 ± 1.155
0.445CysSer: 0.445 ± 0.848
2.226CysThr: 2.226 ± 1.287
0.89CysVal: 0.89 ± 0.665
1.781CysTrp: 1.781 ± 0.74
2.671CysTyr: 2.671 ± 1.151
0.0CysXaa: 0.0 ± 0.0
Asp
4.007AspAla: 4.007 ± 1.41
0.89AspCys: 0.89 ± 0.398
3.562AspAsp: 3.562 ± 1.013
4.007AspGlu: 4.007 ± 1.289
3.562AspPhe: 3.562 ± 0.91
1.336AspGly: 1.336 ± 0.446
0.89AspHis: 0.89 ± 0.789
4.452AspIle: 4.452 ± 1.042
1.781AspLys: 1.781 ± 0.804
7.569AspLeu: 7.569 ± 1.779
0.89AspMet: 0.89 ± 0.41
3.562AspAsn: 3.562 ± 0.823
5.343AspPro: 5.343 ± 2.579
0.445AspGln: 0.445 ± 0.372
1.336AspArg: 1.336 ± 0.834
5.343AspSer: 5.343 ± 1.64
4.452AspThr: 4.452 ± 0.946
2.671AspVal: 2.671 ± 1.489
1.336AspTrp: 1.336 ± 0.372
2.671AspTyr: 2.671 ± 0.505
0.0AspXaa: 0.0 ± 0.0
Glu
2.671GluAla: 2.671 ± 0.961
1.336GluCys: 1.336 ± 1.185
3.117GluAsp: 3.117 ± 1.043
4.898GluGlu: 4.898 ± 1.533
1.781GluPhe: 1.781 ± 0.999
2.226GluGly: 2.226 ± 1.068
1.336GluHis: 1.336 ± 0.898
2.671GluIle: 2.671 ± 0.505
1.781GluLys: 1.781 ± 1.145
6.233GluLeu: 6.233 ± 2.255
0.89GluMet: 0.89 ± 0.665
4.898GluAsn: 4.898 ± 2.013
3.117GluPro: 3.117 ± 0.461
2.226GluGln: 2.226 ± 0.513
4.452GluArg: 4.452 ± 1.249
8.459GluSer: 8.459 ± 2.281
4.452GluThr: 4.452 ± 0.904
2.226GluVal: 2.226 ± 0.751
0.445GluTrp: 0.445 ± 0.332
2.226GluTyr: 2.226 ± 1.202
0.0GluXaa: 0.0 ± 0.0
Phe
4.007PheAla: 4.007 ± 0.721
1.781PheCys: 1.781 ± 1.302
3.562PheAsp: 3.562 ± 0.369
3.562PheGlu: 3.562 ± 1.806
1.781PhePhe: 1.781 ± 0.76
3.562PheGly: 3.562 ± 1.034
1.336PheHis: 1.336 ± 0.699
3.562PheIle: 3.562 ± 0.871
4.452PheLys: 4.452 ± 1.942
5.788PheLeu: 5.788 ± 1.682
0.89PheMet: 0.89 ± 0.845
3.117PheAsn: 3.117 ± 1.013
1.781PhePro: 1.781 ± 1.145
1.336PheGln: 1.336 ± 0.483
0.89PheArg: 0.89 ± 0.402
2.226PheSer: 2.226 ± 0.856
3.562PheThr: 3.562 ± 0.817
3.117PheVal: 3.117 ± 0.865
1.336PheTrp: 1.336 ± 0.637
1.781PheTyr: 1.781 ± 0.804
0.0PheXaa: 0.0 ± 0.0
Gly
2.226GlyAla: 2.226 ± 0.513
0.89GlyCys: 0.89 ± 0.472
4.007GlyAsp: 4.007 ± 1.637
4.007GlyGlu: 4.007 ± 0.737
0.89GlyPhe: 0.89 ± 0.745
4.898GlyGly: 4.898 ± 2.312
1.336GlyHis: 1.336 ± 0.597
3.117GlyIle: 3.117 ± 0.648
2.226GlyLys: 2.226 ± 0.965
5.788GlyLeu: 5.788 ± 1.648
0.445GlyMet: 0.445 ± 0.332
4.452GlyAsn: 4.452 ± 1.112
3.117GlyPro: 3.117 ± 1.363
0.445GlyGln: 0.445 ± 0.332
2.671GlyArg: 2.671 ± 1.005
6.233GlySer: 6.233 ± 1.95
4.007GlyThr: 4.007 ± 0.368
3.562GlyVal: 3.562 ± 1.161
0.0GlyTrp: 0.0 ± 0.0
0.89GlyTyr: 0.89 ± 0.486
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
0.445HisAsp: 0.445 ± 0.395
0.0HisGlu: 0.0 ± 0.0
0.445HisPhe: 0.445 ± 0.526
0.89HisGly: 0.89 ± 0.468
0.0HisHis: 0.0 ± 0.0
2.226HisIle: 2.226 ± 0.837
0.89HisLys: 0.89 ± 0.468
2.671HisLeu: 2.671 ± 1.506
0.445HisMet: 0.445 ± 0.372
0.445HisAsn: 0.445 ± 0.526
0.89HisPro: 0.89 ± 0.472
0.89HisGln: 0.89 ± 0.468
0.0HisArg: 0.0 ± 0.0
0.445HisSer: 0.445 ± 0.395
0.89HisThr: 0.89 ± 0.472
1.336HisVal: 1.336 ± 0.372
0.445HisTrp: 0.445 ± 0.526
0.445HisTyr: 0.445 ± 0.423
0.0HisXaa: 0.0 ± 0.0
Ile
2.671IleAla: 2.671 ± 1.643
0.445IleCys: 0.445 ± 0.372
3.562IleAsp: 3.562 ± 1.034
4.007IleGlu: 4.007 ± 1.637
3.562IlePhe: 3.562 ± 1.954
2.226IleGly: 2.226 ± 1.079
0.0IleHis: 0.0 ± 0.0
3.117IleIle: 3.117 ± 1.207
2.671IleLys: 2.671 ± 1.015
3.562IleLeu: 3.562 ± 0.586
0.0IleMet: 0.0 ± 0.0
4.007IleAsn: 4.007 ± 1.575
3.562IlePro: 3.562 ± 1.511
4.452IleGln: 4.452 ± 0.756
2.671IleArg: 2.671 ± 2.231
3.562IleSer: 3.562 ± 0.817
4.452IleThr: 4.452 ± 1.059
4.007IleVal: 4.007 ± 0.777
0.89IleTrp: 0.89 ± 0.554
1.781IleTyr: 1.781 ± 0.712
0.0IleXaa: 0.0 ± 0.0
Lys
2.671LysAla: 2.671 ± 0.891
2.671LysCys: 2.671 ± 1.151
2.226LysAsp: 2.226 ± 0.834
4.007LysGlu: 4.007 ± 1.747
4.007LysPhe: 4.007 ± 0.996
2.226LysGly: 2.226 ± 0.835
0.445LysHis: 0.445 ± 0.332
2.671LysIle: 2.671 ± 1.032
2.226LysLys: 2.226 ± 1.286
7.124LysLeu: 7.124 ± 2.271
1.336LysMet: 1.336 ± 0.676
2.226LysAsn: 2.226 ± 0.378
0.89LysPro: 0.89 ± 0.745
1.781LysGln: 1.781 ± 0.804
4.898LysArg: 4.898 ± 0.948
4.007LysSer: 4.007 ± 1.646
2.671LysThr: 2.671 ± 0.666
1.781LysVal: 1.781 ± 1.047
0.445LysTrp: 0.445 ± 0.395
2.671LysTyr: 2.671 ± 1.419
0.0LysXaa: 0.0 ± 0.0
Leu
5.343LeuAla: 5.343 ± 1.341
2.226LeuCys: 2.226 ± 1.666
6.679LeuAsp: 6.679 ± 1.643
4.898LeuGlu: 4.898 ± 1.807
7.569LeuPhe: 7.569 ± 1.491
4.898LeuGly: 4.898 ± 1.775
0.445LeuHis: 0.445 ± 0.372
5.343LeuIle: 5.343 ± 1.1
4.898LeuLys: 4.898 ± 1.608
5.343LeuLeu: 5.343 ± 1.493
1.781LeuMet: 1.781 ± 0.518
3.562LeuAsn: 3.562 ± 0.555
5.788LeuPro: 5.788 ± 1.73
6.233LeuGln: 6.233 ± 1.96
4.898LeuArg: 4.898 ± 1.098
9.35LeuSer: 9.35 ± 1.885
4.007LeuThr: 4.007 ± 1.034
6.679LeuVal: 6.679 ± 1.421
0.89LeuTrp: 0.89 ± 0.472
2.671LeuTyr: 2.671 ± 0.677
0.0LeuXaa: 0.0 ± 0.0
Met
0.89MetAla: 0.89 ± 0.402
0.445MetCys: 0.445 ± 0.332
0.445MetAsp: 0.445 ± 0.423
1.336MetGlu: 1.336 ± 0.607
0.0MetPhe: 0.0 ± 0.0
1.336MetGly: 1.336 ± 1.117
0.0MetHis: 0.0 ± 0.0
0.89MetIle: 0.89 ± 0.665
0.89MetLys: 0.89 ± 0.468
0.89MetLeu: 0.89 ± 0.665
0.0MetMet: 0.0 ± 0.0
0.445MetAsn: 0.445 ± 0.372
0.445MetPro: 0.445 ± 0.332
1.781MetGln: 1.781 ± 0.847
0.445MetArg: 0.445 ± 0.848
1.781MetSer: 1.781 ± 0.884
0.445MetThr: 0.445 ± 0.372
0.89MetVal: 0.89 ± 0.665
0.445MetTrp: 0.445 ± 0.395
0.445MetTyr: 0.445 ± 0.332
0.0MetXaa: 0.0 ± 0.0
Asn
3.562AsnAla: 3.562 ± 1.003
2.671AsnCys: 2.671 ± 1.687
4.452AsnAsp: 4.452 ± 1.248
3.117AsnGlu: 3.117 ± 0.964
3.117AsnPhe: 3.117 ± 0.838
3.562AsnGly: 3.562 ± 1.218
0.0AsnHis: 0.0 ± 0.0
3.562AsnIle: 3.562 ± 1.807
4.007AsnLys: 4.007 ± 0.7
4.007AsnLeu: 4.007 ± 0.931
0.89AsnMet: 0.89 ± 0.789
2.226AsnAsn: 2.226 ± 0.573
4.007AsnPro: 4.007 ± 1.847
0.89AsnGln: 0.89 ± 0.402
2.226AsnArg: 2.226 ± 0.56
5.343AsnSer: 5.343 ± 1.641
4.452AsnThr: 4.452 ± 1.188
3.117AsnVal: 3.117 ± 0.595
0.445AsnTrp: 0.445 ± 0.332
1.336AsnTyr: 1.336 ± 1.014
0.0AsnXaa: 0.0 ± 0.0
Pro
4.898ProAla: 4.898 ± 2.851
0.445ProCys: 0.445 ± 0.372
4.452ProAsp: 4.452 ± 1.594
3.117ProGlu: 3.117 ± 1.038
1.781ProPhe: 1.781 ± 0.573
3.562ProGly: 3.562 ± 1.353
0.445ProHis: 0.445 ± 0.423
3.562ProIle: 3.562 ± 1.958
3.117ProLys: 3.117 ± 1.038
5.343ProLeu: 5.343 ± 2.023
1.336ProMet: 1.336 ± 0.857
3.117ProAsn: 3.117 ± 0.525
3.562ProPro: 3.562 ± 0.854
2.226ProGln: 2.226 ± 1.642
4.007ProArg: 4.007 ± 1.324
5.343ProSer: 5.343 ± 1.291
2.671ProThr: 2.671 ± 1.233
0.89ProVal: 0.89 ± 0.745
0.445ProTrp: 0.445 ± 0.423
2.226ProTyr: 2.226 ± 1.224
0.0ProXaa: 0.0 ± 0.0
Gln
0.0GlnAla: 0.0 ± 0.0
0.445GlnCys: 0.445 ± 0.372
2.226GlnAsp: 2.226 ± 0.856
4.007GlnGlu: 4.007 ± 1.162
3.562GlnPhe: 3.562 ± 1.455
2.226GlnGly: 2.226 ± 0.674
0.89GlnHis: 0.89 ± 0.645
1.781GlnIle: 1.781 ± 0.629
0.0GlnLys: 0.0 ± 0.0
5.343GlnLeu: 5.343 ± 1.495
0.89GlnMet: 0.89 ± 0.665
2.671GlnAsn: 2.671 ± 1.15
2.226GlnPro: 2.226 ± 0.716
2.226GlnGln: 2.226 ± 1.153
1.781GlnArg: 1.781 ± 0.775
2.226GlnSer: 2.226 ± 0.895
2.671GlnThr: 2.671 ± 0.811
2.226GlnVal: 2.226 ± 0.961
0.445GlnTrp: 0.445 ± 0.332
1.781GlnTyr: 1.781 ± 1.102
0.0GlnXaa: 0.0 ± 0.0
Arg
4.007ArgAla: 4.007 ± 1.182
2.226ArgCys: 2.226 ± 1.081
1.781ArgAsp: 1.781 ± 0.568
1.781ArgGlu: 1.781 ± 0.629
1.336ArgPhe: 1.336 ± 0.483
2.226ArgGly: 2.226 ± 1.502
1.336ArgHis: 1.336 ± 0.754
3.117ArgIle: 3.117 ± 2.473
3.562ArgLys: 3.562 ± 0.557
5.343ArgLeu: 5.343 ± 0.635
0.445ArgMet: 0.445 ± 0.332
5.343ArgAsn: 5.343 ± 1.411
3.117ArgPro: 3.117 ± 1.902
2.226ArgGln: 2.226 ± 0.664
4.898ArgArg: 4.898 ± 2.073
4.007ArgSer: 4.007 ± 0.856
1.336ArgThr: 1.336 ± 0.769
2.226ArgVal: 2.226 ± 0.896
0.445ArgTrp: 0.445 ± 0.395
0.445ArgTyr: 0.445 ± 0.372
0.0ArgXaa: 0.0 ± 0.0
Ser
5.343SerAla: 5.343 ± 1.027
0.445SerCys: 0.445 ± 0.848
4.452SerAsp: 4.452 ± 1.477
6.679SerGlu: 6.679 ± 1.671
4.007SerPhe: 4.007 ± 1.183
5.788SerGly: 5.788 ± 1.825
1.781SerHis: 1.781 ± 1.026
4.007SerIle: 4.007 ± 1.396
3.117SerLys: 3.117 ± 1.065
9.35SerLeu: 9.35 ± 1.194
1.781SerMet: 1.781 ± 1.172
3.562SerAsn: 3.562 ± 1.375
4.898SerPro: 4.898 ± 2.198
4.007SerGln: 4.007 ± 0.861
4.452SerArg: 4.452 ± 1.413
9.35SerSer: 9.35 ± 3.508
6.679SerThr: 6.679 ± 1.336
4.007SerVal: 4.007 ± 1.409
1.336SerTrp: 1.336 ± 0.997
2.671SerTyr: 2.671 ± 0.505
0.0SerXaa: 0.0 ± 0.0
Thr
4.452ThrAla: 4.452 ± 1.052
2.671ThrCys: 2.671 ± 0.956
4.007ThrAsp: 4.007 ± 1.012
5.343ThrGlu: 5.343 ± 0.778
1.781ThrPhe: 1.781 ± 0.501
4.007ThrGly: 4.007 ± 1.093
1.336ThrHis: 1.336 ± 0.483
3.562ThrIle: 3.562 ± 0.588
2.226ThrLys: 2.226 ± 0.995
4.452ThrLeu: 4.452 ± 0.984
0.445ThrMet: 0.445 ± 0.372
3.117ThrAsn: 3.117 ± 1.065
4.007ThrPro: 4.007 ± 0.834
1.336ThrGln: 1.336 ± 0.886
2.226ThrArg: 2.226 ± 1.079
5.343ThrSer: 5.343 ± 2.513
2.671ThrThr: 2.671 ± 0.683
7.569ThrVal: 7.569 ± 1.569
0.0ThrTrp: 0.0 ± 0.0
1.336ThrTyr: 1.336 ± 0.599
0.0ThrXaa: 0.0 ± 0.0
Val
2.671ValAla: 2.671 ± 1.017
0.89ValCys: 0.89 ± 1.052
4.452ValAsp: 4.452 ± 0.742
2.226ValGlu: 2.226 ± 1.248
2.671ValPhe: 2.671 ± 1.4
4.898ValGly: 4.898 ± 1.421
1.336ValHis: 1.336 ± 0.411
2.226ValIle: 2.226 ± 0.706
3.562ValLys: 3.562 ± 0.556
3.562ValLeu: 3.562 ± 0.96
0.445ValMet: 0.445 ± 0.372
3.562ValAsn: 3.562 ± 1.161
4.007ValPro: 4.007 ± 1.813
1.781ValGln: 1.781 ± 0.884
1.336ValArg: 1.336 ± 0.411
4.898ValSer: 4.898 ± 1.004
3.562ValThr: 3.562 ± 1.567
1.781ValVal: 1.781 ± 0.796
1.781ValTrp: 1.781 ± 1.152
2.226ValTyr: 2.226 ± 1.028
0.0ValXaa: 0.0 ± 0.0
Trp
0.89TrpAla: 0.89 ± 0.665
0.445TrpCys: 0.445 ± 0.332
0.89TrpAsp: 0.89 ± 0.472
0.0TrpGlu: 0.0 ± 0.0
0.445TrpPhe: 0.445 ± 0.395
0.445TrpGly: 0.445 ± 0.372
0.445TrpHis: 0.445 ± 0.395
0.0TrpIle: 0.0 ± 0.0
1.781TrpLys: 1.781 ± 0.74
2.671TrpLeu: 2.671 ± 0.656
0.0TrpMet: 0.0 ± 0.0
0.89TrpAsn: 0.89 ± 0.402
0.89TrpPro: 0.89 ± 0.472
0.445TrpGln: 0.445 ± 0.372
1.336TrpArg: 1.336 ± 1.092
0.445TrpSer: 0.445 ± 0.372
0.89TrpThr: 0.89 ± 0.789
0.89TrpVal: 0.89 ± 0.665
0.0TrpTrp: 0.0 ± 0.0
0.445TrpTyr: 0.445 ± 0.423
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.671TyrAla: 2.671 ± 0.804
0.445TyrCys: 0.445 ± 0.526
1.336TyrAsp: 1.336 ± 0.411
0.445TyrGlu: 0.445 ± 0.372
4.007TyrPhe: 4.007 ± 0.721
1.336TyrGly: 1.336 ± 0.446
0.0TyrHis: 0.0 ± 0.0
2.226TyrIle: 2.226 ± 0.751
3.117TyrLys: 3.117 ± 0.765
3.562TyrLeu: 3.562 ± 0.77
0.0TyrMet: 0.0 ± 0.0
2.671TyrAsn: 2.671 ± 1.548
0.89TyrPro: 0.89 ± 0.845
1.781TyrGln: 1.781 ± 1.047
1.781TyrArg: 1.781 ± 0.985
2.671TyrSer: 2.671 ± 1.194
1.781TyrThr: 1.781 ± 0.675
0.89TyrVal: 0.89 ± 0.402
1.336TyrTrp: 1.336 ± 0.754
1.781TyrTyr: 1.781 ± 0.936
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2247 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski