Amino acid dipeptide frequency for Human papillomavirus 172

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
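As a minimal sketch of how such a frequency could be computed (not necessarily the procedure used for this page), the Python snippet below counts overlapping residue pairs in one-letter sequences and reports them in per mille. The function name `dipeptide_per_mille`, the mapping of every non-standard letter to X, and the choice to count pairs within each protein only are all assumptions made for illustration.

```python
from collections import Counter
from itertools import product

# 20 standard residues plus X (Xaa) for anything non-standard or unknown.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all 441 pairs.

    Assumption: overlapping pairs are counted within each protein,
    so no pair spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard letter to X before pairing.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


freqs = dipeptide_per_mille(["MKLLA", "LLB"])  # toy input, not real proteins
print(round(freqs["LL"], 1))  # 2 of 6 pairs -> 333.3
```

The toy input yields six pairs in total, two of which are LL; the letter B is not a standard residue and is therefore counted as X.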

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red in the table, and underrepresented ones are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
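The arithmetic above can be sketched in a few lines of Python. The helper names `per_mille_to_fraction` and `classify` are hypothetical, and the threshold is assumed to be the uniform expectation over the 400 standard dipeptides (2.5‰); the exact criterion behind the red and blue marking is not stated here.

```python
# Uniform expectation: 1 of 400 standard dipeptides, i.e. 0.25% or 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the (assumed) uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# Three values copied from the table.
sample = {"LeuLeu": 9.399, "AlaMet": 0.409, "AlaAla": 2.861}
for name, value in sample.items():
    print(name, per_mille_to_fraction(value), classify(value))
```

Under this assumption LeuLeu (9.399‰) and AlaAla (2.861‰) fall above the expectation and AlaMet (0.409‰) below it.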

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.861 ± 1.131
AlaCys: 2.452 ± 1.024
AlaAsp: 2.861 ± 0.799
AlaGlu: 5.721 ± 0.789
AlaPhe: 2.452 ± 0.652
AlaGly: 3.269 ± 1.532
AlaHis: 1.635 ± 0.628
AlaIle: 4.087 ± 0.885
AlaLys: 2.861 ± 0.493
AlaLeu: 3.678 ± 0.946
AlaMet: 0.409 ± 0.31
AlaAsn: 2.452 ± 0.734
AlaPro: 3.678 ± 1.349
AlaGln: 2.043 ± 0.803
AlaArg: 1.226 ± 0.407
AlaSer: 3.678 ± 2.137
AlaThr: 1.635 ± 0.643
AlaVal: 1.635 ± 0.585
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.635 ± 0.716
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.409 ± 0.31
CysCys: 2.452 ± 1.656
CysAsp: 1.635 ± 0.942
CysGlu: 0.409 ± 0.326
CysPhe: 1.635 ± 0.716
CysGly: 0.409 ± 0.595
CysHis: 0.409 ± 0.495
CysIle: 2.861 ± 0.891
CysLys: 3.269 ± 1.112
CysLeu: 2.861 ± 2.014
CysMet: 0.409 ± 0.31
CysAsn: 0.817 ± 0.543
CysPro: 2.043 ± 1.08
CysGln: 0.409 ± 0.326
CysArg: 1.226 ± 1.222
CysSer: 1.226 ± 0.76
CysThr: 2.452 ± 1.012
CysVal: 0.817 ± 0.652
CysTrp: 0.817 ± 0.649
CysTyr: 1.635 ± 0.998
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.087 ± 0.878
AspCys: 2.861 ± 0.746
AspAsp: 4.087 ± 3.375
AspGlu: 4.904 ± 1.356
AspPhe: 2.861 ± 0.921
AspGly: 2.861 ± 1.506
AspHis: 0.817 ± 0.456
AspIle: 4.495 ± 2.059
AspLys: 3.678 ± 0.876
AspLeu: 6.13 ± 1.686
AspMet: 1.635 ± 0.789
AspAsn: 2.043 ± 0.667
AspPro: 6.13 ± 1.872
AspGln: 0.817 ± 0.383
AspArg: 2.043 ± 1.011
AspSer: 3.678 ± 1.239
AspThr: 5.721 ± 1.162
AspVal: 6.13 ± 1.777
AspTrp: 2.043 ± 0.862
AspTyr: 0.817 ± 0.383
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.861 ± 0.967
GluCys: 0.817 ± 0.62
GluAsp: 4.904 ± 1.12
GluGlu: 3.678 ± 1.499
GluPhe: 2.043 ± 0.567
GluGly: 3.678 ± 0.704
GluHis: 1.635 ± 0.64
GluIle: 2.452 ± 0.51
GluLys: 3.269 ± 1.285
GluLeu: 7.356 ± 2.015
GluMet: 0.409 ± 0.361
GluAsn: 5.721 ± 0.98
GluPro: 2.452 ± 0.928
GluGln: 1.635 ± 1.305
GluArg: 2.452 ± 1.079
GluSer: 4.087 ± 1.213
GluThr: 3.678 ± 1.457
GluVal: 4.087 ± 0.584
GluTrp: 1.226 ± 0.794
GluTyr: 2.452 ± 1.275
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.452 ± 0.878
PheCys: 2.452 ± 1.537
PheAsp: 4.087 ± 0.986
PheGlu: 2.861 ± 1.07
PhePhe: 1.635 ± 0.756
PheGly: 2.043 ± 0.504
PheHis: 0.409 ± 0.326
PheIle: 3.678 ± 1.146
PheLys: 2.861 ± 1.075
PheLeu: 4.495 ± 1.106
PheMet: 1.226 ± 0.694
PheAsn: 3.678 ± 0.745
PhePro: 2.452 ± 0.816
PheGln: 2.043 ± 0.567
PheArg: 1.226 ± 0.672
PheSer: 3.269 ± 0.76
PheThr: 1.226 ± 0.929
PheVal: 3.678 ± 0.664
PheTrp: 0.817 ± 0.383
PheTyr: 1.226 ± 0.599
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.861 ± 1.055
GlyCys: 0.409 ± 0.326
GlyAsp: 5.313 ± 1.554
GlyGlu: 4.087 ± 1.269
GlyPhe: 2.452 ± 0.492
GlyGly: 1.635 ± 0.643
GlyHis: 1.635 ± 0.955
GlyIle: 4.087 ± 0.916
GlyLys: 2.861 ± 1.104
GlyLeu: 2.861 ± 1.953
GlyMet: 0.0 ± 0.0
GlyAsn: 4.904 ± 1.58
GlyPro: 2.043 ± 0.895
GlyGln: 2.452 ± 1.12
GlyArg: 2.452 ± 0.779
GlySer: 4.904 ± 1.723
GlyThr: 3.678 ± 1.112
GlyVal: 3.269 ± 1.456
GlyTrp: 0.409 ± 0.31
GlyTyr: 0.817 ± 0.653
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.409 ± 0.326
HisCys: 0.817 ± 0.62
HisAsp: 1.226 ± 0.599
HisGlu: 1.226 ± 0.599
HisPhe: 1.226 ± 0.749
HisGly: 0.817 ± 0.456
HisHis: 0.409 ± 0.326
HisIle: 0.409 ± 0.361
HisLys: 0.817 ± 0.631
HisLeu: 2.043 ± 1.022
HisMet: 0.409 ± 0.503
HisAsn: 2.043 ± 0.618
HisPro: 2.043 ± 1.062
HisGln: 0.0 ± 0.0
HisArg: 0.817 ± 0.383
HisSer: 0.0 ± 0.0
HisThr: 0.817 ± 0.422
HisVal: 0.409 ± 0.395
HisTrp: 0.409 ± 0.495
HisTyr: 0.817 ± 0.422
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.452 ± 0.86
IleCys: 0.0 ± 0.0
IleAsp: 6.13 ± 2.225
IleGlu: 4.495 ± 1.341
IlePhe: 2.043 ± 0.441
IleGly: 2.861 ± 1.365
IleHis: 0.409 ± 0.361
IleIle: 4.087 ± 1.37
IleLys: 3.269 ± 1.2
IleLeu: 2.043 ± 0.775
IleMet: 0.0 ± 0.0
IleAsn: 3.678 ± 1.573
IlePro: 4.087 ± 1.504
IleGln: 3.269 ± 0.68
IleArg: 2.861 ± 0.839
IleSer: 5.721 ± 1.297
IleThr: 4.495 ± 1.62
IleVal: 3.678 ± 1.654
IleTrp: 1.226 ± 0.417
IleTyr: 2.043 ± 0.744
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.452 ± 1.317
LysCys: 2.043 ± 1.014
LysAsp: 2.861 ± 1.031
LysGlu: 4.904 ± 1.29
LysPhe: 3.269 ± 1.782
LysGly: 3.269 ± 1.157
LysHis: 1.226 ± 0.672
LysIle: 2.861 ± 1.0
LysLys: 2.861 ± 1.107
LysLeu: 3.269 ± 0.722
LysMet: 1.635 ± 0.595
LysAsn: 2.452 ± 1.084
LysPro: 2.043 ± 0.758
LysGln: 2.043 ± 0.78
LysArg: 6.13 ± 0.853
LysSer: 2.452 ± 0.835
LysThr: 4.495 ± 0.942
LysVal: 3.269 ± 1.324
LysTrp: 0.817 ± 0.631
LysTyr: 2.043 ± 0.978
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 2.861 ± 1.105
LeuCys: 2.861 ± 1.383
LeuAsp: 3.269 ± 1.361
LeuGlu: 5.721 ± 0.973
LeuPhe: 4.087 ± 1.383
LeuGly: 4.495 ± 1.059
LeuHis: 2.043 ± 0.697
LeuIle: 2.861 ± 1.185
LeuLys: 5.721 ± 1.934
LeuLeu: 9.399 ± 2.027
LeuMet: 0.817 ± 0.755
LeuAsn: 1.635 ± 0.29
LeuPro: 4.495 ± 0.908
LeuGln: 4.495 ± 1.18
LeuArg: 3.269 ± 1.317
LeuSer: 8.173 ± 3.434
LeuThr: 7.765 ± 1.13
LeuVal: 5.721 ± 1.589
LeuTrp: 0.409 ± 0.326
LeuTyr: 4.495 ± 1.256
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.817 ± 0.577
MetCys: 0.817 ± 0.653
MetAsp: 0.817 ± 0.653
MetGlu: 0.409 ± 0.495
MetPhe: 1.226 ± 0.527
MetGly: 1.226 ± 0.616
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.0 ± 0.0
MetLeu: 0.817 ± 0.456
MetMet: 0.817 ± 0.372
MetAsn: 2.452 ± 0.652
MetPro: 0.409 ± 0.503
MetGln: 0.409 ± 0.31
MetArg: 1.226 ± 0.805
MetSer: 0.817 ± 0.422
MetThr: 0.0 ± 0.0
MetVal: 1.226 ± 0.677
MetTrp: 0.409 ± 0.395
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.269 ± 0.771
AsnCys: 1.226 ± 0.47
AsnAsp: 2.861 ± 0.647
AsnGlu: 5.313 ± 1.338
AsnPhe: 2.452 ± 1.082
AsnGly: 3.269 ± 1.277
AsnHis: 0.409 ± 0.395
AsnIle: 1.635 ± 1.049
AsnLys: 5.721 ± 1.342
AsnLeu: 4.904 ± 1.044
AsnMet: 0.409 ± 0.31
AsnAsn: 2.043 ± 0.453
AsnPro: 4.087 ± 1.723
AsnGln: 2.043 ± 0.78
AsnArg: 1.635 ± 0.543
AsnSer: 4.087 ± 1.542
AsnThr: 1.635 ± 0.77
AsnVal: 2.043 ± 0.578
AsnTrp: 0.409 ± 0.395
AsnTyr: 1.226 ± 0.639
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.861 ± 0.997
ProCys: 0.817 ± 0.824
ProAsp: 4.904 ± 1.949
ProGlu: 3.678 ± 1.773
ProPhe: 4.087 ± 1.142
ProGly: 2.452 ± 0.652
ProHis: 0.817 ± 1.007
ProIle: 3.269 ± 1.976
ProLys: 4.087 ± 1.56
ProLeu: 4.904 ± 0.866
ProMet: 0.409 ± 0.31
ProAsn: 1.635 ± 0.896
ProPro: 6.539 ± 1.886
ProGln: 3.678 ± 1.124
ProArg: 3.269 ± 1.176
ProSer: 6.13 ± 2.117
ProThr: 6.13 ± 2.597
ProVal: 2.452 ± 0.779
ProTrp: 0.0 ± 0.0
ProTyr: 3.678 ± 1.603
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.269 ± 1.069
GlnCys: 0.817 ± 0.509
GlnAsp: 4.087 ± 0.97
GlnGlu: 2.452 ± 0.886
GlnPhe: 2.043 ± 0.686
GlnGly: 1.635 ± 0.718
GlnHis: 0.817 ± 0.631
GlnIle: 2.861 ± 0.824
GlnLys: 1.635 ± 0.533
GlnLeu: 3.269 ± 1.467
GlnMet: 2.043 ± 0.995
GlnAsn: 0.409 ± 0.595
GlnPro: 2.452 ± 0.998
GlnGln: 2.043 ± 0.798
GlnArg: 1.226 ± 0.417
GlnSer: 2.043 ± 0.953
GlnThr: 2.043 ± 0.697
GlnVal: 1.635 ± 0.633
GlnTrp: 0.409 ± 0.31
GlnTyr: 2.043 ± 0.864
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.269 ± 0.825
ArgCys: 2.861 ± 1.592
ArgAsp: 1.635 ± 1.081
ArgGlu: 1.635 ± 0.802
ArgPhe: 2.043 ± 0.374
ArgGly: 2.861 ± 0.664
ArgHis: 0.817 ± 0.653
ArgIle: 1.226 ± 0.714
ArgLys: 3.269 ± 0.63
ArgLeu: 6.539 ± 1.241
ArgMet: 0.0 ± 0.0
ArgAsn: 3.678 ± 0.85
ArgPro: 5.313 ± 2.952
ArgGln: 3.678 ± 0.705
ArgArg: 6.13 ± 3.071
ArgSer: 3.678 ± 1.263
ArgThr: 2.043 ± 0.651
ArgVal: 2.043 ± 0.504
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.635 ± 0.533
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.087 ± 0.916
SerCys: 0.0 ± 0.0
SerAsp: 5.313 ± 1.224
SerGlu: 2.043 ± 0.78
SerPhe: 2.861 ± 0.822
SerGly: 4.904 ± 0.746
SerHis: 1.226 ± 0.599
SerIle: 5.721 ± 2.326
SerLys: 1.635 ± 0.78
SerLeu: 8.582 ± 2.186
SerMet: 0.817 ± 0.543
SerAsn: 3.678 ± 1.11
SerPro: 4.904 ± 1.162
SerGln: 2.861 ± 1.041
SerArg: 4.087 ± 1.557
SerSer: 4.904 ± 2.129
SerThr: 5.313 ± 1.888
SerVal: 5.313 ± 1.482
SerTrp: 1.226 ± 0.407
SerTyr: 2.043 ± 0.504
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.087 ± 1.007
ThrCys: 2.043 ± 0.823
ThrAsp: 4.087 ± 1.347
ThrGlu: 2.452 ± 1.019
ThrPhe: 3.678 ± 0.949
ThrGly: 5.721 ± 1.282
ThrHis: 1.226 ± 0.925
ThrIle: 7.356 ± 2.062
ThrLys: 1.635 ± 0.942
ThrLeu: 3.269 ± 0.791
ThrMet: 0.817 ± 0.654
ThrAsn: 2.043 ± 0.862
ThrPro: 4.087 ± 1.036
ThrGln: 0.817 ± 0.385
ThrArg: 5.313 ± 1.62
ThrSer: 5.313 ± 2.218
ThrThr: 6.13 ± 1.342
ThrVal: 6.13 ± 2.419
ThrTrp: 1.226 ± 0.547
ThrTyr: 2.043 ± 0.864
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.043 ± 0.744
ValCys: 0.817 ± 0.422
ValAsp: 5.313 ± 0.887
ValGlu: 2.452 ± 0.947
ValPhe: 2.043 ± 0.594
ValGly: 2.861 ± 0.895
ValHis: 1.226 ± 0.713
ValIle: 3.269 ± 0.766
ValLys: 2.452 ± 0.912
ValLeu: 4.495 ± 1.219
ValMet: 0.817 ± 0.653
ValAsn: 2.861 ± 0.951
ValPro: 5.313 ± 2.112
ValGln: 2.452 ± 0.69
ValArg: 2.452 ± 1.904
ValSer: 6.539 ± 1.33
ValThr: 4.495 ± 1.672
ValVal: 4.904 ± 1.365
ValTrp: 0.817 ± 0.577
ValTyr: 2.043 ± 0.453
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.226 ± 0.916
TrpCys: 0.0 ± 0.0
TrpAsp: 0.817 ± 0.653
TrpGlu: 0.409 ± 0.395
TrpPhe: 0.409 ± 0.31
TrpGly: 0.409 ± 0.326
TrpHis: 0.0 ± 0.0
TrpIle: 1.226 ± 0.929
TrpLys: 1.226 ± 0.6
TrpLeu: 0.817 ± 0.456
TrpMet: 0.0 ± 0.0
TrpAsn: 0.409 ± 0.326
TrpPro: 0.409 ± 0.326
TrpGln: 0.409 ± 0.31
TrpArg: 2.043 ± 1.014
TrpSer: 0.409 ± 0.361
TrpThr: 2.452 ± 1.084
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.817 ± 0.631
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.635 ± 0.716
TyrCys: 1.635 ± 1.236
TyrAsp: 1.635 ± 0.636
TyrGlu: 2.043 ± 0.78
TyrPhe: 3.269 ± 1.19
TyrGly: 2.452 ± 1.124
TyrHis: 0.0 ± 0.0
TyrIle: 0.409 ± 0.326
TyrLys: 3.269 ± 0.917
TyrLeu: 3.269 ± 1.205
TyrMet: 0.409 ± 0.395
TyrAsn: 2.043 ± 1.115
TyrPro: 0.817 ± 0.383
TyrGln: 1.635 ± 0.64
TyrArg: 3.269 ± 0.767
TyrSer: 0.409 ± 0.361
TyrThr: 3.269 ± 1.481
TyrVal: 1.226 ± 0.643
TyrTrp: 0.817 ± 0.653
TyrTyr: 3.269 ± 1.081
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2448 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
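A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions rather than the database's actual code: the helper names `pair_per_mille` and `bootstrap_error` are hypothetical, pairs are counted within each protein, whole proteins are resampled with replacement, and the reported error is taken to be the population standard deviation across 100 replicates.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Per mille frequency of one two-letter pair, counted within each protein."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(replicates):
        # Draw as many whole proteins as the proteome has, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        values.append(pair_per_mille(resample, pair))
    return statistics.pstdev(values)


toy = ["MKLLA", "LLKAA", "AKKLM"]  # toy sequences, not the real proteome
print(bootstrap_error(toy, "LL"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so with only a handful of proteins the estimated error can be large relative to the frequency itself.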

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski