Amino acid dipepetide frequency for Human papillomavirus 141

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.324AlaAla: 3.324 ± 1.135
2.077AlaCys: 2.077 ± 1.587
4.57AlaAsp: 4.57 ± 0.842
2.493AlaGlu: 2.493 ± 0.681
3.324AlaPhe: 3.324 ± 0.732
1.662AlaGly: 1.662 ± 1.153
0.415AlaHis: 0.415 ± 0.338
4.985AlaIle: 4.985 ± 1.042
2.908AlaLys: 2.908 ± 1.39
4.57AlaLeu: 4.57 ± 1.625
0.415AlaMet: 0.415 ± 0.338
2.493AlaAsn: 2.493 ± 1.115
4.155AlaPro: 4.155 ± 1.773
2.493AlaGln: 2.493 ± 1.24
2.077AlaArg: 2.077 ± 0.758
3.324AlaSer: 3.324 ± 0.906
4.155AlaThr: 4.155 ± 0.83
0.831AlaVal: 0.831 ± 0.461
0.831AlaTrp: 0.831 ± 0.7
0.831AlaTyr: 0.831 ± 0.7
0.0AlaXaa: 0.0 ± 0.0
Cys
1.246CysAla: 1.246 ± 0.927
1.246CysCys: 1.246 ± 0.833
2.077CysAsp: 2.077 ± 1.069
1.246CysGlu: 1.246 ± 0.862
1.246CysPhe: 1.246 ± 0.817
0.0CysGly: 0.0 ± 0.0
0.415CysHis: 0.415 ± 0.35
1.662CysIle: 1.662 ± 0.86
4.155CysLys: 4.155 ± 1.439
2.077CysLeu: 2.077 ± 2.099
0.0CysMet: 0.0 ± 0.0
2.077CysAsn: 2.077 ± 1.017
1.662CysPro: 1.662 ± 0.855
1.246CysGln: 1.246 ± 1.049
0.415CysArg: 0.415 ± 0.532
1.246CysSer: 1.246 ± 0.862
0.0CysThr: 0.0 ± 0.0
1.246CysVal: 1.246 ± 0.862
1.246CysTrp: 1.246 ± 0.817
1.246CysTyr: 1.246 ± 1.596
0.0CysXaa: 0.0 ± 0.0
Asp
2.493AspAla: 2.493 ± 0.449
2.493AspCys: 2.493 ± 1.618
4.985AspAsp: 4.985 ± 1.607
5.401AspGlu: 5.401 ± 2.017
2.077AspPhe: 2.077 ± 0.664
3.324AspGly: 3.324 ± 0.627
1.246AspHis: 1.246 ± 0.707
4.155AspIle: 4.155 ± 2.089
2.493AspLys: 2.493 ± 1.311
7.894AspLeu: 7.894 ± 2.396
1.246AspMet: 1.246 ± 0.545
3.739AspAsn: 3.739 ± 0.801
3.739AspPro: 3.739 ± 0.999
1.662AspGln: 1.662 ± 0.747
2.077AspArg: 2.077 ± 1.247
6.232AspSer: 6.232 ± 1.443
7.894AspThr: 7.894 ± 1.284
4.155AspVal: 4.155 ± 1.621
1.246AspTrp: 1.246 ± 0.661
0.415AspTyr: 0.415 ± 0.426
0.0AspXaa: 0.0 ± 0.0
Glu
3.324GluAla: 3.324 ± 1.037
1.662GluCys: 1.662 ± 0.923
5.401GluAsp: 5.401 ± 1.267
7.063GluGlu: 7.063 ± 2.342
1.662GluPhe: 1.662 ± 0.809
3.324GluGly: 3.324 ± 1.535
1.246GluHis: 1.246 ± 0.491
4.57GluIle: 4.57 ± 1.038
2.493GluLys: 2.493 ± 1.361
4.985GluLeu: 4.985 ± 1.254
0.831GluMet: 0.831 ± 0.7
3.324GluAsn: 3.324 ± 0.968
1.662GluPro: 1.662 ± 0.783
1.662GluGln: 1.662 ± 0.969
1.662GluArg: 1.662 ± 0.706
6.232GluSer: 6.232 ± 1.392
3.739GluThr: 3.739 ± 1.145
2.908GluVal: 2.908 ± 1.769
1.662GluTrp: 1.662 ± 1.218
1.246GluTyr: 1.246 ± 0.678
0.0GluXaa: 0.0 ± 0.0
Phe
1.662PheAla: 1.662 ± 0.66
0.831PheCys: 0.831 ± 0.598
4.155PheAsp: 4.155 ± 1.255
2.908PheGlu: 2.908 ± 1.035
4.155PhePhe: 4.155 ± 1.624
2.077PheGly: 2.077 ± 0.832
1.662PheHis: 1.662 ± 0.713
3.324PheIle: 3.324 ± 1.006
3.739PheLys: 3.739 ± 1.332
4.155PheLeu: 4.155 ± 1.552
0.831PheMet: 0.831 ± 0.387
2.077PheAsn: 2.077 ± 0.835
2.493PhePro: 2.493 ± 0.864
1.246PheGln: 1.246 ± 0.833
2.077PheArg: 2.077 ± 0.462
2.493PheSer: 2.493 ± 1.013
2.077PheThr: 2.077 ± 0.911
2.493PheVal: 2.493 ± 1.213
0.831PheTrp: 0.831 ± 0.677
1.662PheTyr: 1.662 ± 0.74
0.0PheXaa: 0.0 ± 0.0
Gly
2.077GlyAla: 2.077 ± 1.256
0.831GlyCys: 0.831 ± 0.512
2.908GlyAsp: 2.908 ± 1.156
3.739GlyGlu: 3.739 ± 1.223
0.831GlyPhe: 0.831 ± 0.404
2.493GlyGly: 2.493 ± 0.943
1.246GlyHis: 1.246 ± 0.814
2.908GlyIle: 2.908 ± 1.788
2.908GlyLys: 2.908 ± 0.977
6.232GlyLeu: 6.232 ± 1.316
0.415GlyMet: 0.415 ± 0.402
2.908GlyAsn: 2.908 ± 1.008
3.324GlyPro: 3.324 ± 2.234
1.662GlyGln: 1.662 ± 0.686
2.493GlyArg: 2.493 ± 0.978
3.324GlySer: 3.324 ± 0.9
4.155GlyThr: 4.155 ± 1.409
2.493GlyVal: 2.493 ± 0.835
0.0GlyTrp: 0.0 ± 0.0
0.831GlyTyr: 0.831 ± 0.491
0.0GlyXaa: 0.0 ± 0.0
His
1.662HisAla: 1.662 ± 0.542
0.415HisCys: 0.415 ± 0.35
0.831HisAsp: 0.831 ± 0.461
1.662HisGlu: 1.662 ± 0.705
0.415HisPhe: 0.415 ± 0.338
1.246HisGly: 1.246 ± 0.619
0.0HisHis: 0.0 ± 0.0
0.415HisIle: 0.415 ± 0.338
2.077HisLys: 2.077 ± 0.855
2.493HisLeu: 2.493 ± 1.107
0.415HisMet: 0.415 ± 0.532
0.831HisAsn: 0.831 ± 0.461
1.662HisPro: 1.662 ± 0.886
0.831HisGln: 0.831 ± 0.528
0.415HisArg: 0.415 ± 0.426
0.415HisSer: 0.415 ± 0.35
1.246HisThr: 1.246 ± 0.643
0.0HisVal: 0.0 ± 0.0
0.831HisTrp: 0.831 ± 0.598
1.246HisTyr: 1.246 ± 0.656
0.0HisXaa: 0.0 ± 0.0
Ile
3.324IleAla: 3.324 ± 0.915
0.831IleCys: 0.831 ± 0.512
6.647IleAsp: 6.647 ± 1.551
3.324IleGlu: 3.324 ± 1.336
3.324IlePhe: 3.324 ± 0.859
3.739IleGly: 3.739 ± 2.695
0.831IleHis: 0.831 ± 0.634
3.324IleIle: 3.324 ± 0.521
2.493IleLys: 2.493 ± 0.96
4.155IleLeu: 4.155 ± 1.178
0.831IleMet: 0.831 ± 0.7
2.493IleAsn: 2.493 ± 0.561
3.324IlePro: 3.324 ± 1.652
4.57IleGln: 4.57 ± 0.602
1.246IleArg: 1.246 ± 0.545
3.739IleSer: 3.739 ± 1.292
4.155IleThr: 4.155 ± 0.915
2.077IleVal: 2.077 ± 0.5
0.831IleTrp: 0.831 ± 0.598
1.246IleTyr: 1.246 ± 0.756
0.0IleXaa: 0.0 ± 0.0
Lys
3.324LysAla: 3.324 ± 0.915
1.246LysCys: 1.246 ± 0.817
3.324LysAsp: 3.324 ± 2.049
3.324LysGlu: 3.324 ± 1.361
2.908LysPhe: 2.908 ± 1.4
1.662LysGly: 1.662 ± 0.634
1.246LysHis: 1.246 ± 1.049
1.246LysIle: 1.246 ± 0.678
2.493LysLys: 2.493 ± 0.47
5.401LysLeu: 5.401 ± 2.221
1.246LysMet: 1.246 ± 0.873
3.739LysAsn: 3.739 ± 1.222
2.493LysPro: 2.493 ± 1.15
4.155LysGln: 4.155 ± 0.941
5.401LysArg: 5.401 ± 0.738
4.57LysSer: 4.57 ± 2.631
2.077LysThr: 2.077 ± 0.826
3.739LysVal: 3.739 ± 0.669
0.831LysTrp: 0.831 ± 0.512
2.908LysTyr: 2.908 ± 1.552
0.0LysXaa: 0.0 ± 0.0
Leu
5.816LeuAla: 5.816 ± 0.966
2.908LeuCys: 2.908 ± 1.786
6.232LeuAsp: 6.232 ± 1.191
5.401LeuGlu: 5.401 ± 1.589
4.57LeuPhe: 4.57 ± 1.013
4.155LeuGly: 4.155 ± 0.768
1.662LeuHis: 1.662 ± 0.543
3.739LeuIle: 3.739 ± 1.445
5.816LeuLys: 5.816 ± 2.284
9.971LeuLeu: 9.971 ± 2.777
1.662LeuMet: 1.662 ± 1.151
2.908LeuAsn: 2.908 ± 1.254
4.155LeuPro: 4.155 ± 0.548
3.324LeuGln: 3.324 ± 1.017
4.57LeuArg: 4.57 ± 1.166
7.894LeuSer: 7.894 ± 3.217
5.816LeuThr: 5.816 ± 1.372
6.232LeuVal: 6.232 ± 1.722
0.415LeuTrp: 0.415 ± 0.532
5.816LeuTyr: 5.816 ± 1.119
0.0LeuXaa: 0.0 ± 0.0
Met
2.077MetAla: 2.077 ± 1.042
1.662MetCys: 1.662 ± 0.783
1.662MetAsp: 1.662 ± 0.783
0.831MetGlu: 0.831 ± 0.645
0.0MetPhe: 0.0 ± 0.0
0.831MetGly: 0.831 ± 0.608
0.0MetHis: 0.0 ± 0.0
0.831MetIle: 0.831 ± 0.404
0.831MetLys: 0.831 ± 0.7
1.246MetLeu: 1.246 ± 0.597
0.415MetMet: 0.415 ± 0.553
1.246MetAsn: 1.246 ± 0.597
0.415MetPro: 0.415 ± 0.553
0.831MetGln: 0.831 ± 0.391
0.831MetArg: 0.831 ± 0.608
0.0MetSer: 0.0 ± 0.0
2.908MetThr: 2.908 ± 0.977
0.415MetVal: 0.415 ± 0.35
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.908AsnAla: 2.908 ± 0.769
0.415AsnCys: 0.415 ± 0.426
1.246AsnAsp: 1.246 ± 0.713
1.246AsnGlu: 1.246 ± 0.458
1.662AsnPhe: 1.662 ± 0.95
2.493AsnGly: 2.493 ± 0.47
0.831AsnHis: 0.831 ± 0.404
4.985AsnIle: 4.985 ± 1.817
2.077AsnLys: 2.077 ± 0.386
3.739AsnLeu: 3.739 ± 1.397
0.831AsnMet: 0.831 ± 0.67
0.831AsnAsn: 0.831 ± 0.512
4.985AsnPro: 4.985 ± 0.676
3.324AsnGln: 3.324 ± 1.32
4.155AsnArg: 4.155 ± 0.71
3.324AsnSer: 3.324 ± 1.292
4.155AsnThr: 4.155 ± 0.621
3.324AsnVal: 3.324 ± 1.27
0.415AsnTrp: 0.415 ± 0.35
0.831AsnTyr: 0.831 ± 0.853
0.0AsnXaa: 0.0 ± 0.0
Pro
3.324ProAla: 3.324 ± 1.655
1.246ProCys: 1.246 ± 0.598
4.155ProAsp: 4.155 ± 1.211
4.155ProGlu: 4.155 ± 0.725
1.662ProPhe: 1.662 ± 0.886
2.908ProGly: 2.908 ± 1.49
0.415ProHis: 0.415 ± 0.35
2.077ProIle: 2.077 ± 0.462
4.985ProLys: 4.985 ± 1.546
6.647ProLeu: 6.647 ± 1.903
0.831ProMet: 0.831 ± 0.391
1.246ProAsn: 1.246 ± 0.776
8.725ProPro: 8.725 ± 1.795
0.415ProGln: 0.415 ± 0.402
2.908ProArg: 2.908 ± 1.312
7.478ProSer: 7.478 ± 2.092
3.739ProThr: 3.739 ± 1.035
3.324ProVal: 3.324 ± 1.168
0.0ProTrp: 0.0 ± 0.0
2.493ProTyr: 2.493 ± 0.978
0.0ProXaa: 0.0 ± 0.0
Gln
0.831GlnAla: 0.831 ± 0.461
1.246GlnCys: 1.246 ± 1.084
2.908GlnAsp: 2.908 ± 0.653
2.908GlnGlu: 2.908 ± 0.891
2.493GlnPhe: 2.493 ± 1.07
1.662GlnGly: 1.662 ± 0.95
1.246GlnHis: 1.246 ± 0.458
3.324GlnIle: 3.324 ± 0.521
1.662GlnLys: 1.662 ± 0.261
6.232GlnLeu: 6.232 ± 0.751
2.077GlnMet: 2.077 ± 0.827
2.493GlnAsn: 2.493 ± 0.705
1.662GlnPro: 1.662 ± 0.74
3.739GlnGln: 3.739 ± 0.991
2.493GlnArg: 2.493 ± 1.193
1.246GlnSer: 1.246 ± 0.389
2.493GlnThr: 2.493 ± 0.605
3.324GlnVal: 3.324 ± 1.347
0.415GlnTrp: 0.415 ± 0.35
2.908GlnTyr: 2.908 ± 0.941
0.0GlnXaa: 0.0 ± 0.0
Arg
2.493ArgAla: 2.493 ± 1.24
2.493ArgCys: 2.493 ± 1.701
2.908ArgAsp: 2.908 ± 1.151
2.077ArgGlu: 2.077 ± 0.462
2.077ArgPhe: 2.077 ± 1.29
3.324ArgGly: 3.324 ± 1.218
2.077ArgHis: 2.077 ± 1.397
2.908ArgIle: 2.908 ± 1.123
3.739ArgLys: 3.739 ± 1.193
4.985ArgLeu: 4.985 ± 1.269
0.831ArgMet: 0.831 ± 0.404
1.662ArgAsn: 1.662 ± 0.634
2.493ArgPro: 2.493 ± 0.988
2.908ArgGln: 2.908 ± 0.943
8.309ArgArg: 8.309 ± 3.218
5.816ArgSer: 5.816 ± 1.467
1.662ArgThr: 1.662 ± 0.813
2.493ArgVal: 2.493 ± 0.681
0.0ArgTrp: 0.0 ± 0.0
1.662ArgTyr: 1.662 ± 0.582
0.0ArgXaa: 0.0 ± 0.0
Ser
3.324SerAla: 3.324 ± 1.548
1.246SerCys: 1.246 ± 0.862
4.57SerAsp: 4.57 ± 1.36
3.324SerGlu: 3.324 ± 1.236
4.57SerPhe: 4.57 ± 0.976
2.908SerGly: 2.908 ± 0.69
1.662SerHis: 1.662 ± 1.153
4.57SerIle: 4.57 ± 0.706
2.908SerLys: 2.908 ± 0.954
8.309SerLeu: 8.309 ± 2.087
0.415SerMet: 0.415 ± 0.35
4.985SerAsn: 4.985 ± 1.884
4.155SerPro: 4.155 ± 1.098
3.324SerGln: 3.324 ± 0.974
5.401SerArg: 5.401 ± 1.3
6.232SerSer: 6.232 ± 2.526
5.401SerThr: 5.401 ± 1.495
4.985SerVal: 4.985 ± 1.362
0.0SerTrp: 0.0 ± 0.0
2.077SerTyr: 2.077 ± 0.916
0.0SerXaa: 0.0 ± 0.0
Thr
2.077ThrAla: 2.077 ± 0.462
0.415ThrCys: 0.415 ± 0.35
4.985ThrAsp: 4.985 ± 1.17
4.155ThrGlu: 4.155 ± 0.648
2.077ThrPhe: 2.077 ± 0.616
4.155ThrGly: 4.155 ± 1.617
0.415ThrHis: 0.415 ± 0.426
2.077ThrIle: 2.077 ± 0.386
2.077ThrLys: 2.077 ± 0.836
4.57ThrLeu: 4.57 ± 1.41
1.246ThrMet: 1.246 ± 0.619
2.908ThrAsn: 2.908 ± 0.477
7.063ThrPro: 7.063 ± 2.404
2.908ThrGln: 2.908 ± 0.977
4.155ThrArg: 4.155 ± 1.185
6.232ThrSer: 6.232 ± 1.357
3.739ThrThr: 3.739 ± 1.21
4.57ThrVal: 4.57 ± 1.27
1.662ThrTrp: 1.662 ± 0.737
1.662ThrTyr: 1.662 ± 1.093
0.0ThrXaa: 0.0 ± 0.0
Val
3.324ValAla: 3.324 ± 0.951
1.246ValCys: 1.246 ± 1.291
4.155ValAsp: 4.155 ± 0.695
2.908ValGlu: 2.908 ± 0.986
2.908ValPhe: 2.908 ± 0.607
3.324ValGly: 3.324 ± 1.186
2.077ValHis: 2.077 ± 0.611
2.493ValIle: 2.493 ± 0.956
2.908ValLys: 2.908 ± 0.627
1.662ValLeu: 1.662 ± 0.543
1.246ValMet: 1.246 ± 0.8
4.155ValAsn: 4.155 ± 1.47
3.739ValPro: 3.739 ± 0.688
3.324ValGln: 3.324 ± 0.631
3.739ValArg: 3.739 ± 1.789
3.324ValSer: 3.324 ± 0.807
1.662ValThr: 1.662 ± 0.886
2.077ValVal: 2.077 ± 0.685
0.831ValTrp: 0.831 ± 0.512
2.908ValTyr: 2.908 ± 0.427
0.0ValXaa: 0.0 ± 0.0
Trp
0.831TrpAla: 0.831 ± 0.404
0.0TrpCys: 0.0 ± 0.0
0.415TrpAsp: 0.415 ± 0.338
0.415TrpGlu: 0.415 ± 0.426
0.831TrpPhe: 0.831 ± 0.461
0.415TrpGly: 0.415 ± 0.338
0.415TrpHis: 0.415 ± 0.426
1.246TrpIle: 1.246 ± 0.817
1.662TrpLys: 1.662 ± 0.776
0.0TrpLeu: 0.0 ± 0.0
0.0TrpMet: 0.0 ± 0.0
0.415TrpAsn: 0.415 ± 0.338
0.415TrpPro: 0.415 ± 0.338
0.831TrpGln: 0.831 ± 0.391
1.246TrpArg: 1.246 ± 0.656
0.0TrpSer: 0.0 ± 0.0
1.246TrpThr: 1.246 ± 0.879
2.493TrpVal: 2.493 ± 0.836
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.493TyrAla: 2.493 ± 0.644
1.246TyrCys: 1.246 ± 1.079
0.831TyrAsp: 0.831 ± 0.605
2.077TyrGlu: 2.077 ± 0.758
4.155TyrPhe: 4.155 ± 0.513
2.077TyrGly: 2.077 ± 0.916
0.0TyrHis: 0.0 ± 0.0
1.662TyrIle: 1.662 ± 0.66
2.908TyrLys: 2.908 ± 0.915
3.739TyrLeu: 3.739 ± 1.445
0.831TyrMet: 0.831 ± 0.512
1.662TyrAsn: 1.662 ± 0.806
0.415TyrPro: 0.415 ± 0.338
2.908TyrGln: 2.908 ± 1.076
1.246TyrArg: 1.246 ± 0.678
1.246TyrSer: 1.246 ± 0.64
0.831TyrThr: 0.831 ± 0.528
0.831TyrVal: 0.831 ± 0.461
0.831TyrTrp: 0.831 ± 0.608
3.739TyrTyr: 3.739 ± 1.802
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2408 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski