Amino acid dipepetide frequency for Human papillomavirus 122

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.979AlaAla: 1.979 ± 0.715
1.187AlaCys: 1.187 ± 0.883
5.144AlaAsp: 5.144 ± 1.582
3.562AlaGlu: 3.562 ± 0.951
2.374AlaPhe: 2.374 ± 0.771
1.979AlaGly: 1.979 ± 1.005
0.396AlaHis: 0.396 ± 0.297
2.374AlaIle: 2.374 ± 0.591
3.957AlaLys: 3.957 ± 0.907
3.562AlaLeu: 3.562 ± 0.817
0.396AlaMet: 0.396 ± 0.375
2.77AlaAsn: 2.77 ± 1.01
3.166AlaPro: 3.166 ± 1.172
4.353AlaGln: 4.353 ± 1.576
5.144AlaArg: 5.144 ± 1.018
2.77AlaSer: 2.77 ± 1.612
4.353AlaThr: 4.353 ± 0.619
3.957AlaVal: 3.957 ± 1.549
0.0AlaTrp: 0.0 ± 0.0
2.77AlaTyr: 2.77 ± 1.406
0.0AlaXaa: 0.0 ± 0.0
Cys
1.187CysAla: 1.187 ± 0.704
0.791CysCys: 0.791 ± 0.402
0.396CysAsp: 0.396 ± 0.342
0.396CysGlu: 0.396 ± 0.375
1.583CysPhe: 1.583 ± 0.531
0.791CysGly: 0.791 ± 0.843
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
1.187CysLys: 1.187 ± 0.726
2.374CysLeu: 2.374 ± 1.324
0.396CysMet: 0.396 ± 0.375
0.791CysAsn: 0.791 ± 0.843
2.374CysPro: 2.374 ± 0.702
1.583CysGln: 1.583 ± 0.934
1.979CysArg: 1.979 ± 0.979
1.583CysSer: 1.583 ± 0.934
0.791CysThr: 0.791 ± 0.544
0.396CysVal: 0.396 ± 0.375
0.396CysTrp: 0.396 ± 0.297
1.583CysTyr: 1.583 ± 0.88
0.0CysXaa: 0.0 ± 0.0
Asp
4.749AspAla: 4.749 ± 0.931
0.396AspCys: 0.396 ± 0.342
2.77AspAsp: 2.77 ± 0.904
2.374AspGlu: 2.374 ± 0.639
2.374AspPhe: 2.374 ± 0.745
3.957AspGly: 3.957 ± 1.162
2.374AspHis: 2.374 ± 0.398
3.166AspIle: 3.166 ± 0.896
1.583AspLys: 1.583 ± 0.879
7.915AspLeu: 7.915 ± 2.065
0.791AspMet: 0.791 ± 0.75
1.979AspAsn: 1.979 ± 0.713
4.353AspPro: 4.353 ± 1.263
2.77AspGln: 2.77 ± 1.061
2.374AspArg: 2.374 ± 0.785
3.166AspSer: 3.166 ± 1.115
5.54AspThr: 5.54 ± 1.491
3.957AspVal: 3.957 ± 1.583
1.187AspTrp: 1.187 ± 0.619
1.583AspTyr: 1.583 ± 0.558
0.0AspXaa: 0.0 ± 0.0
Glu
3.957GluAla: 3.957 ± 0.961
1.583GluCys: 1.583 ± 1.117
3.562GluAsp: 3.562 ± 0.723
5.936GluGlu: 5.936 ± 2.251
0.791GluPhe: 0.791 ± 0.584
6.727GluGly: 6.727 ± 2.832
1.583GluHis: 1.583 ± 0.273
2.374GluIle: 2.374 ± 1.423
3.562GluLys: 3.562 ± 1.718
5.54GluLeu: 5.54 ± 2.766
1.979GluMet: 1.979 ± 0.833
1.187GluAsn: 1.187 ± 0.658
2.77GluPro: 2.77 ± 0.837
3.957GluGln: 3.957 ± 0.827
3.166GluArg: 3.166 ± 0.81
5.936GluSer: 5.936 ± 1.689
3.562GluThr: 3.562 ± 1.17
5.936GluVal: 5.936 ± 0.667
0.791GluTrp: 0.791 ± 0.367
1.583GluTyr: 1.583 ± 1.499
0.0GluXaa: 0.0 ± 0.0
Phe
4.353PheAla: 4.353 ± 1.294
0.791PheCys: 0.791 ± 0.544
4.353PheAsp: 4.353 ± 0.724
4.353PheGlu: 4.353 ± 1.61
1.583PhePhe: 1.583 ± 0.591
1.979PheGly: 1.979 ± 0.816
0.791PheHis: 0.791 ± 0.544
1.583PheIle: 1.583 ± 0.531
1.187PheLys: 1.187 ± 0.619
3.562PheLeu: 3.562 ± 0.869
0.396PheMet: 0.396 ± 0.426
2.374PheAsn: 2.374 ± 0.466
2.374PhePro: 2.374 ± 0.639
2.374PheGln: 2.374 ± 0.643
0.396PheArg: 0.396 ± 0.375
1.979PheSer: 1.979 ± 0.553
1.583PheThr: 1.583 ± 0.699
1.583PheVal: 1.583 ± 1.499
1.583PheTrp: 1.583 ± 0.734
0.791PheTyr: 0.791 ± 0.438
0.0PheXaa: 0.0 ± 0.0
Gly
3.562GlyAla: 3.562 ± 1.143
1.187GlyCys: 1.187 ± 0.327
4.353GlyAsp: 4.353 ± 0.717
7.123GlyGlu: 7.123 ± 2.382
1.187GlyPhe: 1.187 ± 0.68
6.727GlyGly: 6.727 ± 2.842
2.77GlyHis: 2.77 ± 1.098
2.77GlyIle: 2.77 ± 0.612
2.77GlyLys: 2.77 ± 1.231
3.957GlyLeu: 3.957 ± 0.801
0.396GlyMet: 0.396 ± 0.342
3.957GlyAsn: 3.957 ± 1.884
4.353GlyPro: 4.353 ± 1.53
1.979GlyGln: 1.979 ± 0.82
7.519GlyArg: 7.519 ± 3.109
3.166GlySer: 3.166 ± 1.171
4.749GlyThr: 4.749 ± 2.124
3.957GlyVal: 3.957 ± 1.176
0.0GlyTrp: 0.0 ± 0.0
1.979GlyTyr: 1.979 ± 1.062
0.0GlyXaa: 0.0 ± 0.0
His
0.396HisAla: 0.396 ± 0.375
1.187HisCys: 1.187 ± 0.881
0.396HisAsp: 0.396 ± 0.375
0.396HisGlu: 0.396 ± 0.571
0.396HisPhe: 0.396 ± 0.369
1.583HisGly: 1.583 ± 0.805
0.396HisHis: 0.396 ± 0.342
1.187HisIle: 1.187 ± 0.69
1.979HisLys: 1.979 ± 1.194
1.187HisLeu: 1.187 ± 0.752
0.0HisMet: 0.0 ± 0.0
1.583HisAsn: 1.583 ± 0.591
2.77HisPro: 2.77 ± 1.129
1.187HisGln: 1.187 ± 0.733
0.791HisArg: 0.791 ± 0.544
1.979HisSer: 1.979 ± 0.679
0.396HisThr: 0.396 ± 0.375
0.791HisVal: 0.791 ± 0.45
1.187HisTrp: 1.187 ± 0.392
0.791HisTyr: 0.791 ± 0.367
0.0HisXaa: 0.0 ± 0.0
Ile
2.374IleAla: 2.374 ± 0.992
0.0IleCys: 0.0 ± 0.0
5.144IleAsp: 5.144 ± 1.952
5.144IleGlu: 5.144 ± 1.95
1.187IlePhe: 1.187 ± 0.718
3.166IleGly: 3.166 ± 1.163
1.187IleHis: 1.187 ± 0.69
3.166IleIle: 3.166 ± 1.718
0.791IleLys: 0.791 ± 0.594
3.166IleLeu: 3.166 ± 0.731
0.791IleMet: 0.791 ± 0.507
3.562IleAsn: 3.562 ± 1.341
2.77IlePro: 2.77 ± 1.275
1.583IleGln: 1.583 ± 0.764
2.77IleArg: 2.77 ± 0.867
3.166IleSer: 3.166 ± 1.21
0.791IleThr: 0.791 ± 0.45
4.749IleVal: 4.749 ± 0.971
0.791IleTrp: 0.791 ± 0.544
1.583IleTyr: 1.583 ± 0.273
0.0IleXaa: 0.0 ± 0.0
Lys
3.957LysAla: 3.957 ± 0.934
1.187LysCys: 1.187 ± 0.392
2.374LysAsp: 2.374 ± 1.329
1.979LysGlu: 1.979 ± 0.89
2.374LysPhe: 2.374 ± 1.196
2.77LysGly: 2.77 ± 0.996
1.187LysHis: 1.187 ± 0.538
1.583LysIle: 1.583 ± 0.899
1.187LysLys: 1.187 ± 0.685
3.562LysLeu: 3.562 ± 1.413
0.0LysMet: 0.0 ± 0.0
0.791LysAsn: 0.791 ± 0.45
2.77LysPro: 2.77 ± 1.788
1.979LysGln: 1.979 ± 0.772
4.353LysArg: 4.353 ± 1.366
3.562LysSer: 3.562 ± 1.341
1.979LysThr: 1.979 ± 0.907
3.166LysVal: 3.166 ± 1.303
0.396LysTrp: 0.396 ± 0.342
2.77LysTyr: 2.77 ± 0.496
0.0LysXaa: 0.0 ± 0.0
Leu
5.54LeuAla: 5.54 ± 1.659
2.374LeuCys: 2.374 ± 1.296
4.749LeuAsp: 4.749 ± 0.891
5.54LeuGlu: 5.54 ± 1.267
3.562LeuPhe: 3.562 ± 1.244
5.54LeuGly: 5.54 ± 1.569
1.979LeuHis: 1.979 ± 0.91
3.166LeuIle: 3.166 ± 1.549
3.957LeuLys: 3.957 ± 0.584
10.685LeuLeu: 10.685 ± 2.6
1.583LeuMet: 1.583 ± 0.624
1.583LeuAsn: 1.583 ± 0.868
4.353LeuPro: 4.353 ± 1.622
6.332LeuGln: 6.332 ± 0.783
4.749LeuArg: 4.749 ± 1.439
8.31LeuSer: 8.31 ± 1.125
4.749LeuThr: 4.749 ± 1.334
5.144LeuVal: 5.144 ± 1.353
1.187LeuTrp: 1.187 ± 0.68
1.979LeuTyr: 1.979 ± 0.777
0.0LeuXaa: 0.0 ± 0.0
Met
0.791MetAla: 0.791 ± 0.438
0.396MetCys: 0.396 ± 0.297
1.187MetAsp: 1.187 ± 0.685
0.791MetGlu: 0.791 ± 0.449
1.979MetPhe: 1.979 ± 0.99
0.396MetGly: 0.396 ± 0.375
0.0MetHis: 0.0 ± 0.0
1.187MetIle: 1.187 ± 0.874
0.396MetLys: 0.396 ± 0.342
1.583MetLeu: 1.583 ± 0.939
0.0MetMet: 0.0 ± 0.373
0.791MetAsn: 0.791 ± 0.367
0.396MetPro: 0.396 ± 0.297
0.396MetGln: 0.396 ± 0.297
0.791MetArg: 0.791 ± 0.402
2.374MetSer: 2.374 ± 0.819
0.0MetThr: 0.0 ± 0.0
0.791MetVal: 0.791 ± 0.402
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.374AsnAla: 2.374 ± 0.959
2.374AsnCys: 2.374 ± 1.345
1.187AsnAsp: 1.187 ± 0.392
1.979AsnGlu: 1.979 ± 0.589
2.374AsnPhe: 2.374 ± 0.776
2.374AsnGly: 2.374 ± 1.387
0.396AsnHis: 0.396 ± 0.297
3.562AsnIle: 3.562 ± 1.348
3.562AsnLys: 3.562 ± 0.82
0.791AsnLeu: 0.791 ± 0.594
0.0AsnMet: 0.0 ± 0.0
2.374AsnAsn: 2.374 ± 1.347
1.979AsnPro: 1.979 ± 0.597
1.187AsnGln: 1.187 ± 1.124
2.374AsnArg: 2.374 ± 1.321
3.166AsnSer: 3.166 ± 2.032
2.374AsnThr: 2.374 ± 1.071
2.77AsnVal: 2.77 ± 0.885
0.0AsnTrp: 0.0 ± 0.0
1.187AsnTyr: 1.187 ± 0.68
0.0AsnXaa: 0.0 ± 0.0
Pro
3.957ProAla: 3.957 ± 0.679
1.583ProCys: 1.583 ± 0.736
6.727ProAsp: 6.727 ± 2.003
2.77ProGlu: 2.77 ± 0.692
1.187ProPhe: 1.187 ± 0.761
2.77ProGly: 2.77 ± 1.825
0.396ProHis: 0.396 ± 0.369
3.166ProIle: 3.166 ± 2.072
2.77ProLys: 2.77 ± 1.051
5.936ProLeu: 5.936 ± 1.103
0.396ProMet: 0.396 ± 0.297
1.583ProAsn: 1.583 ± 0.5
9.497ProPro: 9.497 ± 3.17
3.166ProGln: 3.166 ± 1.392
3.166ProArg: 3.166 ± 1.516
3.957ProSer: 3.957 ± 2.367
7.123ProThr: 7.123 ± 1.459
3.562ProVal: 3.562 ± 0.756
0.396ProTrp: 0.396 ± 0.342
0.791ProTyr: 0.791 ± 0.642
0.0ProXaa: 0.0 ± 0.0
Gln
1.979GlnAla: 1.979 ± 0.664
1.187GlnCys: 1.187 ± 0.67
2.374GlnAsp: 2.374 ± 0.854
3.562GlnGlu: 3.562 ± 1.369
3.562GlnPhe: 3.562 ± 1.133
4.749GlnGly: 4.749 ± 1.645
1.583GlnHis: 1.583 ± 0.694
2.374GlnIle: 2.374 ± 0.857
1.583GlnLys: 1.583 ± 1.016
5.54GlnLeu: 5.54 ± 1.145
1.583GlnMet: 1.583 ± 0.721
1.583GlnAsn: 1.583 ± 0.653
2.77GlnPro: 2.77 ± 1.232
3.957GlnGln: 3.957 ± 1.342
3.166GlnArg: 3.166 ± 0.807
3.166GlnSer: 3.166 ± 0.739
3.562GlnThr: 3.562 ± 0.914
3.562GlnVal: 3.562 ± 0.839
0.396GlnTrp: 0.396 ± 0.426
2.77GlnTyr: 2.77 ± 0.783
0.0GlnXaa: 0.0 ± 0.0
Arg
3.957ArgAla: 3.957 ± 1.483
0.396ArgCys: 0.396 ± 0.522
2.77ArgAsp: 2.77 ± 0.883
4.749ArgGlu: 4.749 ± 0.989
2.374ArgPhe: 2.374 ± 0.542
6.727ArgGly: 6.727 ± 2.627
2.77ArgHis: 2.77 ± 0.996
1.979ArgIle: 1.979 ± 0.589
4.353ArgLys: 4.353 ± 0.741
7.519ArgLeu: 7.519 ± 1.471
0.791ArgMet: 0.791 ± 0.45
1.583ArgAsn: 1.583 ± 0.536
2.77ArgPro: 2.77 ± 1.341
3.562ArgGln: 3.562 ± 1.119
5.936ArgArg: 5.936 ± 1.696
8.31ArgSer: 8.31 ± 4.302
4.353ArgThr: 4.353 ± 1.114
2.374ArgVal: 2.374 ± 1.024
0.396ArgTrp: 0.396 ± 0.522
2.374ArgTyr: 2.374 ± 0.542
0.0ArgXaa: 0.0 ± 0.0
Ser
2.77SerAla: 2.77 ± 0.929
0.396SerCys: 0.396 ± 0.522
3.562SerAsp: 3.562 ± 1.341
3.957SerGlu: 3.957 ± 0.952
3.957SerPhe: 3.957 ± 0.91
5.54SerGly: 5.54 ± 2.39
0.791SerHis: 0.791 ± 0.402
5.144SerIle: 5.144 ± 1.59
1.979SerLys: 1.979 ± 0.701
7.519SerLeu: 7.519 ± 1.183
0.791SerMet: 0.791 ± 0.594
2.77SerAsn: 2.77 ± 1.131
3.562SerPro: 3.562 ± 1.379
3.166SerGln: 3.166 ± 1.07
7.915SerArg: 7.915 ± 3.897
7.915SerSer: 7.915 ± 2.755
6.727SerThr: 6.727 ± 1.333
4.749SerVal: 4.749 ± 1.407
0.791SerTrp: 0.791 ± 0.402
1.979SerTyr: 1.979 ± 0.727
0.0SerXaa: 0.0 ± 0.0
Thr
3.166ThrAla: 3.166 ± 1.153
1.187ThrCys: 1.187 ± 0.535
3.957ThrAsp: 3.957 ± 1.172
3.166ThrGlu: 3.166 ± 0.844
1.979ThrPhe: 1.979 ± 0.561
3.562ThrGly: 3.562 ± 0.769
0.0ThrHis: 0.0 ± 0.0
1.979ThrIle: 1.979 ± 1.039
1.583ThrLys: 1.583 ± 0.734
4.353ThrLeu: 4.353 ± 1.553
2.374ThrMet: 2.374 ± 1.033
3.166ThrAsn: 3.166 ± 0.55
5.936ThrPro: 5.936 ± 2.95
3.166ThrGln: 3.166 ± 1.501
5.54ThrArg: 5.54 ± 1.098
5.144ThrSer: 5.144 ± 1.662
2.77ThrThr: 2.77 ± 1.717
6.332ThrVal: 6.332 ± 1.622
1.187ThrTrp: 1.187 ± 0.685
1.979ThrTyr: 1.979 ± 0.961
0.0ThrXaa: 0.0 ± 0.0
Val
3.166ValAla: 3.166 ± 1.272
1.979ValCys: 1.979 ± 0.918
3.166ValAsp: 3.166 ± 1.344
6.727ValGlu: 6.727 ± 1.634
1.979ValPhe: 1.979 ± 0.899
5.144ValGly: 5.144 ± 1.273
1.187ValHis: 1.187 ± 0.449
3.166ValIle: 3.166 ± 0.74
2.374ValLys: 2.374 ± 0.591
4.353ValLeu: 4.353 ± 1.044
1.187ValMet: 1.187 ± 0.684
1.583ValAsn: 1.583 ± 0.273
3.957ValPro: 3.957 ± 0.722
4.749ValGln: 4.749 ± 1.277
5.144ValArg: 5.144 ± 1.424
3.957ValSer: 3.957 ± 1.894
4.353ValThr: 4.353 ± 1.333
1.979ValVal: 1.979 ± 0.91
0.791ValTrp: 0.791 ± 0.45
1.979ValTyr: 1.979 ± 0.934
0.0ValXaa: 0.0 ± 0.0
Trp
0.396TrpAla: 0.396 ± 0.375
0.0TrpCys: 0.0 ± 0.0
0.791TrpAsp: 0.791 ± 0.367
1.187TrpGlu: 1.187 ± 0.752
0.396TrpPhe: 0.396 ± 0.297
0.396TrpGly: 0.396 ± 0.375
0.0TrpHis: 0.0 ± 0.0
0.791TrpIle: 0.791 ± 0.594
0.791TrpLys: 0.791 ± 0.544
1.187TrpLeu: 1.187 ± 0.68
0.396TrpMet: 0.396 ± 0.297
0.396TrpAsn: 0.396 ± 0.375
0.0TrpPro: 0.0 ± 0.0
1.187TrpGln: 1.187 ± 0.51
0.0TrpArg: 0.0 ± 0.0
1.187TrpSer: 1.187 ± 0.685
1.583TrpThr: 1.583 ± 0.718
0.791TrpVal: 0.791 ± 0.402
0.0TrpTrp: 0.0 ± 0.0
0.396TrpTyr: 0.396 ± 0.297
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.583TyrAla: 1.583 ± 0.699
0.396TyrCys: 0.396 ± 0.522
0.396TyrAsp: 0.396 ± 0.297
0.791TyrGlu: 0.791 ± 0.632
2.77TyrPhe: 2.77 ± 1.431
1.583TyrGly: 1.583 ± 0.273
0.791TyrHis: 0.791 ± 0.438
3.166TyrIle: 3.166 ± 0.907
2.374TyrLys: 2.374 ± 0.827
2.77TyrLeu: 2.77 ± 0.879
0.0TyrMet: 0.0 ± 0.0
1.979TyrAsn: 1.979 ± 1.429
1.979TyrPro: 1.979 ± 0.934
2.374TyrGln: 2.374 ± 0.784
2.77TyrArg: 2.77 ± 0.887
1.187TyrSer: 1.187 ± 0.748
1.187TyrThr: 1.187 ± 0.41
2.374TyrVal: 2.374 ± 0.721
0.396TyrTrp: 0.396 ± 0.426
2.77TyrTyr: 2.77 ± 1.323
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2528 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski