Amino acid dipeptide frequency for Human papillomavirus 139

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
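To make the quantity concrete, here is a minimal sketch of how such frequencies could be computed. It is not the Proteome-pI implementation: the function name `dipeptide_frequencies`, the one-letter sequence input, and the choice to count pairs within each protein and normalize by the pooled number of pairs are all assumptions, so the results need not match the table digit for digit.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins.

    Hypothetical helper: `proteins` is a list of one-letter sequences.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy sequences, made up for illustration (not from the HPV 139 proteome).
freqs = dipeptide_frequencies(["MSSA", "SSL"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

In the toy input there are five pairs in total and "SS" occurs twice, so its frequency is 400‰.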

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
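The arithmetic behind the expected value, and the red/blue marking, can be sketched as follows. The names `expected_per_mille` and `classify` are hypothetical, and the plain comparison against the uniform expectation is an assumption: the page does not say whether the marking also takes the error estimate into account.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille.

    Use alphabet_size=21 to include Xaa (441 possible dipeptides).
    """
    return 1000.0 / alphabet_size ** 2

def classify(observed_per_mille, expected=None):
    """Label a value 'over' (red), 'under' (blue) or 'expected'.

    Assumption: a simple threshold at the uniform expectation.
    """
    if expected is None:
        expected = expected_per_mille()
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "expected"

print(expected_per_mille())   # 2.5
print(classify(11.959))       # SerSer value from the table
print(classify(0.412))        # CysAla value from the table
```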

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Second residue (column order): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.124 ± 1.806
AlaCys: 1.237 ± 1.35
AlaAsp: 3.711 ± 1.038
AlaGlu: 4.948 ± 1.505
AlaPhe: 2.062 ± 0.736
AlaGly: 3.711 ± 1.042
AlaHis: 1.237 ± 0.891
AlaIle: 2.887 ± 0.712
AlaLys: 2.062 ± 0.768
AlaLeu: 4.948 ± 1.73
AlaMet: 1.649 ± 0.838
AlaAsn: 3.711 ± 1.26
AlaPro: 2.887 ± 1.432
AlaGln: 1.649 ± 1.593
AlaArg: 2.062 ± 0.685
AlaSer: 4.124 ± 0.912
AlaThr: 4.948 ± 1.256
AlaVal: 1.649 ± 0.748
AlaTrp: 1.237 ± 0.837
AlaTyr: 2.474 ± 0.901
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.412 ± 0.398
CysCys: 0.825 ± 0.645
CysAsp: 1.649 ± 0.632
CysGlu: 0.825 ± 0.419
CysPhe: 1.237 ± 0.967
CysGly: 0.412 ± 0.514
CysHis: 0.412 ± 0.514
CysIle: 0.825 ± 0.645
CysLys: 2.887 ± 0.783
CysLeu: 0.825 ± 0.9
CysMet: 0.412 ± 0.45
CysAsn: 0.412 ± 0.322
CysPro: 1.237 ± 0.822
CysGln: 0.825 ± 0.645
CysArg: 1.649 ± 0.869
CysSer: 2.062 ± 1.13
CysThr: 1.237 ± 0.789
CysVal: 1.649 ± 1.304
CysTrp: 0.825 ± 0.419
CysTyr: 0.412 ± 0.398
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.948 ± 1.418
AspCys: 0.825 ± 0.419
AspAsp: 6.186 ± 3.022
AspGlu: 4.948 ± 1.915
AspPhe: 2.062 ± 1.196
AspGly: 4.124 ± 0.774
AspHis: 0.825 ± 0.419
AspIle: 5.361 ± 0.783
AspLys: 2.887 ± 1.046
AspLeu: 5.361 ± 1.24
AspMet: 0.825 ± 0.796
AspAsn: 2.887 ± 0.854
AspPro: 4.124 ± 1.039
AspGln: 1.649 ± 0.605
AspArg: 1.649 ± 1.278
AspSer: 4.536 ± 1.162
AspThr: 3.711 ± 0.776
AspVal: 6.598 ± 2.21
AspTrp: 1.237 ± 0.733
AspTyr: 1.237 ± 0.71
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.536 ± 1.297
GluCys: 0.825 ± 0.645
GluAsp: 4.124 ± 1.055
GluGlu: 4.124 ± 1.003
GluPhe: 2.887 ± 0.892
GluGly: 4.536 ± 1.921
GluHis: 1.237 ± 0.65
GluIle: 2.474 ± 0.994
GluLys: 2.474 ± 0.903
GluLeu: 4.124 ± 1.582
GluMet: 0.412 ± 0.322
GluAsn: 7.01 ± 2.253
GluPro: 3.299 ± 1.346
GluGln: 2.474 ± 0.838
GluArg: 2.887 ± 1.519
GluSer: 3.711 ± 1.468
GluThr: 4.948 ± 1.183
GluVal: 3.299 ± 1.045
GluTrp: 0.825 ± 0.645
GluTyr: 1.649 ± 1.038
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.124 ± 0.922
PheCys: 1.237 ± 0.874
PheAsp: 2.062 ± 0.487
PheGlu: 2.062 ± 0.72
PhePhe: 2.062 ± 1.113
PheGly: 2.474 ± 0.907
PheHis: 1.237 ± 0.632
PheIle: 4.124 ± 0.991
PheLys: 2.062 ± 1.322
PheLeu: 3.299 ± 1.144
PheMet: 0.825 ± 0.645
PheAsn: 2.062 ± 0.487
PhePro: 1.237 ± 0.603
PheGln: 1.649 ± 0.471
PheArg: 1.237 ± 0.751
PheSer: 3.711 ± 1.589
PheThr: 2.474 ± 0.766
PheVal: 4.124 ± 1.757
PheTrp: 1.237 ± 0.751
PheTyr: 2.062 ± 0.872
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.474 ± 1.078
GlyCys: 1.649 ± 0.711
GlyAsp: 6.186 ± 1.749
GlyGlu: 4.536 ± 1.807
GlyPhe: 0.825 ± 0.419
GlyGly: 3.299 ± 1.966
GlyHis: 1.237 ± 0.674
GlyIle: 3.299 ± 0.971
GlyLys: 3.711 ± 0.668
GlyLeu: 6.186 ± 1.218
GlyMet: 0.412 ± 0.398
GlyAsn: 3.711 ± 0.786
GlyPro: 2.887 ± 1.135
GlyGln: 1.237 ± 1.194
GlyArg: 2.887 ± 1.02
GlySer: 4.948 ± 0.929
GlyThr: 5.361 ± 2.141
GlyVal: 2.474 ± 1.158
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.412 ± 0.352
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.412 ± 0.398
HisCys: 1.237 ± 0.749
HisAsp: 1.237 ± 0.747
HisGlu: 0.825 ± 0.419
HisPhe: 1.649 ± 0.635
HisGly: 0.825 ± 0.883
HisHis: 0.0 ± 0.0
HisIle: 0.825 ± 0.705
HisLys: 2.062 ± 1.361
HisLeu: 1.237 ± 0.71
HisMet: 0.0 ± 0.0
HisAsn: 0.825 ± 0.419
HisPro: 2.062 ± 1.034
HisGln: 0.412 ± 0.322
HisArg: 0.0 ± 0.0
HisSer: 1.237 ± 0.603
HisThr: 0.412 ± 0.398
HisVal: 0.825 ± 0.608
HisTrp: 0.825 ± 0.385
HisTyr: 1.649 ± 0.765
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.825 ± 0.494
IleCys: 1.237 ± 0.632
IleAsp: 4.536 ± 2.168
IleGlu: 2.474 ± 1.237
IlePhe: 1.237 ± 0.751
IleGly: 4.124 ± 0.768
IleHis: 0.0 ± 0.0
IleIle: 1.237 ± 0.816
IleLys: 3.299 ± 1.367
IleLeu: 4.124 ± 1.383
IleMet: 2.474 ± 1.445
IleAsn: 2.474 ± 0.755
IlePro: 2.887 ± 1.243
IleGln: 0.825 ± 0.419
IleArg: 2.062 ± 1.419
IleSer: 2.887 ± 0.553
IleThr: 4.124 ± 1.5
IleVal: 6.598 ± 1.982
IleTrp: 0.825 ± 0.481
IleTyr: 0.412 ± 0.398
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.649 ± 1.283
LysCys: 0.412 ± 0.398
LysAsp: 2.062 ± 0.931
LysGlu: 5.361 ± 2.078
LysPhe: 0.825 ± 0.556
LysGly: 4.124 ± 1.679
LysHis: 0.412 ± 0.322
LysIle: 2.887 ± 1.155
LysLys: 4.536 ± 1.973
LysLeu: 3.711 ± 1.084
LysMet: 2.062 ± 0.62
LysAsn: 2.474 ± 1.198
LysPro: 1.237 ± 0.71
LysGln: 2.474 ± 1.178
LysArg: 5.773 ± 1.505
LysSer: 4.536 ± 1.825
LysThr: 0.825 ± 0.645
LysVal: 2.887 ± 0.822
LysTrp: 0.825 ± 0.696
LysTyr: 2.062 ± 1.027
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.598 ± 1.546
LeuCys: 2.062 ± 1.489
LeuAsp: 6.186 ± 1.08
LeuGlu: 6.598 ± 2.506
LeuPhe: 5.361 ± 0.691
LeuGly: 3.711 ± 1.102
LeuHis: 1.237 ± 0.412
LeuIle: 2.474 ± 0.66
LeuLys: 5.361 ± 2.288
LeuLeu: 8.66 ± 2.334
LeuMet: 1.237 ± 0.759
LeuAsn: 3.711 ± 0.907
LeuPro: 5.361 ± 2.548
LeuGln: 6.186 ± 1.494
LeuArg: 4.948 ± 1.224
LeuSer: 7.835 ± 1.902
LeuThr: 3.299 ± 0.816
LeuVal: 4.948 ± 1.301
LeuTrp: 0.0 ± 0.0
LeuTyr: 2.887 ± 0.942
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.649 ± 0.748
MetCys: 0.825 ± 0.419
MetAsp: 0.0 ± 0.0
MetGlu: 0.825 ± 0.634
MetPhe: 0.825 ± 0.645
MetGly: 0.0 ± 0.0
MetHis: 0.412 ± 0.514
MetIle: 0.0 ± 0.0
MetLys: 1.237 ± 0.684
MetLeu: 1.237 ± 0.837
MetMet: 0.412 ± 0.398
MetAsn: 1.649 ± 1.127
MetPro: 0.412 ± 0.322
MetGln: 0.412 ± 0.398
MetArg: 1.237 ± 0.383
MetSer: 2.474 ± 0.959
MetThr: 1.649 ± 0.719
MetVal: 0.825 ± 0.645
MetTrp: 0.0 ± 0.0
MetTyr: 0.412 ± 0.398
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.124 ± 1.213
AsnCys: 0.412 ± 0.352
AsnAsp: 2.474 ± 1.897
AsnGlu: 2.887 ± 1.276
AsnPhe: 2.474 ± 0.824
AsnGly: 2.887 ± 0.806
AsnHis: 1.237 ± 0.747
AsnIle: 2.474 ± 1.038
AsnLys: 1.237 ± 0.412
AsnLeu: 3.299 ± 1.266
AsnMet: 0.0 ± 0.0
AsnAsn: 2.887 ± 1.552
AsnPro: 5.361 ± 1.368
AsnGln: 2.474 ± 0.907
AsnArg: 2.887 ± 0.775
AsnSer: 4.536 ± 1.944
AsnThr: 4.948 ± 0.966
AsnVal: 3.711 ± 1.397
AsnTrp: 0.825 ± 0.696
AsnTyr: 1.649 ± 0.659
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.299 ± 1.203
ProCys: 1.237 ± 0.855
ProAsp: 4.536 ± 1.916
ProGlu: 2.474 ± 0.955
ProPhe: 0.825 ± 0.494
ProGly: 2.474 ± 1.448
ProHis: 0.412 ± 0.622
ProIle: 0.412 ± 0.352
ProLys: 2.474 ± 0.901
ProLeu: 6.186 ± 1.848
ProMet: 0.825 ± 0.374
ProAsn: 2.887 ± 0.982
ProPro: 5.361 ± 1.519
ProGln: 4.948 ± 1.747
ProArg: 4.124 ± 1.897
ProSer: 6.186 ± 1.915
ProThr: 3.299 ± 1.974
ProVal: 4.536 ± 0.703
ProTrp: 0.825 ± 0.759
ProTyr: 1.649 ± 1.038
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.711 ± 0.967
GlnCys: 1.237 ± 0.684
GlnAsp: 2.887 ± 0.482
GlnGlu: 4.536 ± 1.363
GlnPhe: 2.474 ± 1.096
GlnGly: 0.825 ± 0.419
GlnHis: 0.0 ± 0.0
GlnIle: 2.887 ± 0.822
GlnLys: 0.412 ± 0.398
GlnLeu: 4.948 ± 1.342
GlnMet: 0.412 ± 0.322
GlnAsn: 1.649 ± 0.928
GlnPro: 2.474 ± 0.588
GlnGln: 2.474 ± 1.096
GlnArg: 2.062 ± 0.957
GlnSer: 2.474 ± 0.903
GlnThr: 3.711 ± 1.21
GlnVal: 1.649 ± 0.553
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.412 ± 0.398
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.299 ± 0.738
ArgCys: 2.062 ± 0.768
ArgAsp: 1.237 ± 0.809
ArgGlu: 2.887 ± 1.277
ArgPhe: 2.474 ± 1.078
ArgGly: 1.649 ± 0.997
ArgHis: 2.474 ± 0.842
ArgIle: 2.474 ± 1.324
ArgLys: 4.124 ± 0.692
ArgLeu: 8.247 ± 2.548
ArgMet: 1.649 ± 0.805
ArgAsn: 1.237 ± 0.632
ArgPro: 2.474 ± 1.08
ArgGln: 2.474 ± 1.12
ArgArg: 9.897 ± 4.284
ArgSer: 2.474 ± 0.957
ArgThr: 2.062 ± 0.685
ArgVal: 4.536 ± 2.112
ArgTrp: 0.412 ± 0.398
ArgTyr: 1.237 ± 0.837
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.536 ± 1.55
SerCys: 0.0 ± 0.0
SerAsp: 4.948 ± 0.89
SerGlu: 2.887 ± 1.326
SerPhe: 5.361 ± 0.855
SerGly: 5.361 ± 1.133
SerHis: 2.887 ± 0.924
SerIle: 5.361 ± 2.291
SerLys: 2.887 ± 1.234
SerLeu: 7.01 ± 1.734
SerMet: 0.825 ± 0.645
SerAsn: 4.948 ± 1.617
SerPro: 4.124 ± 1.372
SerGln: 2.474 ± 0.766
SerArg: 6.186 ± 1.228
SerSer: 11.959 ± 2.354
SerThr: 5.773 ± 1.118
SerVal: 4.536 ± 0.814
SerTrp: 1.237 ± 0.733
SerTyr: 1.237 ± 0.773
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.649 ± 0.635
ThrCys: 2.062 ± 0.806
ThrAsp: 4.948 ± 1.438
ThrGlu: 2.887 ± 0.823
ThrPhe: 4.124 ± 1.689
ThrGly: 7.423 ± 2.363
ThrHis: 0.825 ± 0.419
ThrIle: 3.299 ± 1.257
ThrLys: 1.237 ± 0.702
ThrLeu: 5.773 ± 1.749
ThrMet: 0.0 ± 0.0
ThrAsn: 2.062 ± 0.685
ThrPro: 4.948 ± 1.863
ThrGln: 2.062 ± 0.847
ThrArg: 2.887 ± 0.616
ThrSer: 6.598 ± 1.523
ThrThr: 2.474 ± 0.588
ThrVal: 5.773 ± 1.43
ThrTrp: 0.0 ± 0.0
ThrTyr: 0.825 ± 0.608
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.299 ± 1.262
ValCys: 1.237 ± 0.5
ValAsp: 2.887 ± 0.822
ValGlu: 3.299 ± 0.759
ValPhe: 3.299 ± 0.759
ValGly: 4.124 ± 1.348
ValHis: 1.649 ± 0.635
ValIle: 4.124 ± 1.464
ValLys: 1.649 ± 0.563
ValLeu: 5.773 ± 1.316
ValMet: 0.825 ± 0.732
ValAsn: 3.299 ± 1.238
ValPro: 5.773 ± 1.277
ValGln: 3.299 ± 1.106
ValArg: 2.474 ± 1.087
ValSer: 5.773 ± 0.958
ValThr: 4.948 ± 1.716
ValVal: 3.711 ± 0.878
ValTrp: 0.412 ± 0.398
ValTyr: 4.124 ± 1.5
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 2.474 ± 1.639
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.825 ± 0.494
TrpIle: 0.825 ± 0.645
TrpLys: 0.825 ± 0.419
TrpLeu: 1.649 ± 0.722
TrpMet: 0.412 ± 0.398
TrpAsn: 1.649 ± 1.127
TrpPro: 0.412 ± 0.398
TrpGln: 0.412 ± 0.322
TrpArg: 0.412 ± 0.45
TrpSer: 0.412 ± 0.398
TrpThr: 0.825 ± 0.796
TrpVal: 0.825 ± 0.494
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.412 ± 0.322
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.649 ± 0.594
TyrCys: 0.412 ± 0.514
TyrAsp: 1.649 ± 0.758
TyrGlu: 2.474 ± 1.112
TyrPhe: 3.299 ± 1.184
TyrGly: 1.649 ± 0.511
TyrHis: 0.0 ± 0.0
TyrIle: 0.412 ± 0.322
TyrLys: 3.711 ± 1.368
TyrLeu: 2.062 ± 1.023
TyrMet: 0.412 ± 0.352
TyrAsn: 1.237 ± 0.702
TyrPro: 0.412 ± 0.398
TyrGln: 0.825 ± 0.419
TyrArg: 2.062 ± 0.769
TyrSer: 2.062 ± 0.445
TyrThr: 0.825 ± 0.385
TyrVal: 1.237 ± 0.582
TyrTrp: 0.825 ± 0.385
TyrTyr: 2.887 ± 1.614
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2426 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
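One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those values is reported. This is a sketch under stated assumptions rather than the database's procedure; the helper names, the use of the sample standard deviation as the error, and the fixed seed are all hypothetical choices.

```python
import random
import statistics
from collections import Counter

def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille), pooled over the proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: the reported error is the spread across replicates.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)

# Toy sequences, made up for illustration (not from the HPV 139 proteome).
toy = ["MSSA", "SSL", "MKR"]
print(bootstrap_error(toy, "SS"))
```

With only 7 proteins in this proteome, each resample can differ noticeably from the original set, which is consistent with the fairly large errors in the table. A dipeptide that never occurs gives 0 in every replicate, hence the 0.0 ± 0.0 entries.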

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski