Amino acid dipepetide frequency for Human papillomavirus 136

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.57AlaAla: 4.57 ± 1.722
1.662AlaCys: 1.662 ± 1.096
5.401AlaAsp: 5.401 ± 0.696
3.324AlaGlu: 3.324 ± 1.54
3.324AlaPhe: 3.324 ± 0.6
2.077AlaGly: 2.077 ± 0.894
0.415AlaHis: 0.415 ± 0.342
5.401AlaIle: 5.401 ± 1.026
3.324AlaLys: 3.324 ± 1.624
4.57AlaLeu: 4.57 ± 1.782
0.415AlaMet: 0.415 ± 0.342
2.493AlaAsn: 2.493 ± 1.128
2.908AlaPro: 2.908 ± 0.951
2.908AlaGln: 2.908 ± 0.969
2.908AlaArg: 2.908 ± 0.921
4.155AlaSer: 4.155 ± 0.988
2.908AlaThr: 2.908 ± 0.986
2.077AlaVal: 2.077 ± 0.807
0.415AlaTrp: 0.415 ± 0.326
0.831AlaTyr: 0.831 ± 0.653
0.0AlaXaa: 0.0 ± 0.0
Cys
0.831CysAla: 0.831 ± 0.834
1.246CysCys: 1.246 ± 0.846
2.077CysAsp: 2.077 ± 0.98
1.662CysGlu: 1.662 ± 0.749
0.831CysPhe: 0.831 ± 0.647
0.415CysGly: 0.415 ± 0.455
0.831CysHis: 0.831 ± 0.54
1.662CysIle: 1.662 ± 0.816
2.493CysLys: 2.493 ± 1.283
2.908CysLeu: 2.908 ± 3.169
0.0CysMet: 0.0 ± 0.0
1.246CysAsn: 1.246 ± 0.846
1.662CysPro: 1.662 ± 0.851
0.415CysGln: 0.415 ± 0.326
2.077CysArg: 2.077 ± 1.284
0.831CysSer: 0.831 ± 0.54
0.831CysThr: 0.831 ± 0.377
1.662CysVal: 1.662 ± 0.915
0.831CysTrp: 0.831 ± 0.418
1.662CysTyr: 1.662 ± 1.617
0.0CysXaa: 0.0 ± 0.0
Asp
2.908AspAla: 2.908 ± 0.675
3.324AspCys: 3.324 ± 1.635
4.57AspAsp: 4.57 ± 1.974
4.985AspGlu: 4.985 ± 1.215
1.246AspPhe: 1.246 ± 0.616
2.908AspGly: 2.908 ± 0.97
0.0AspHis: 0.0 ± 0.0
4.155AspIle: 4.155 ± 1.572
3.324AspLys: 3.324 ± 1.645
6.647AspLeu: 6.647 ± 2.002
0.831AspMet: 0.831 ± 0.57
4.985AspAsn: 4.985 ± 1.744
4.985AspPro: 4.985 ± 1.416
2.077AspGln: 2.077 ± 0.648
2.077AspArg: 2.077 ± 1.107
4.985AspSer: 4.985 ± 1.63
4.155AspThr: 4.155 ± 1.406
6.647AspVal: 6.647 ± 2.187
0.831AspTrp: 0.831 ± 0.377
0.415AspTyr: 0.415 ± 0.417
0.0AspXaa: 0.0 ± 0.0
Glu
2.908GluAla: 2.908 ± 1.049
1.662GluCys: 1.662 ± 0.728
4.985GluAsp: 4.985 ± 1.354
6.232GluGlu: 6.232 ± 2.178
2.493GluPhe: 2.493 ± 0.855
2.908GluGly: 2.908 ± 1.695
1.662GluHis: 1.662 ± 0.588
3.739GluIle: 3.739 ± 1.569
1.246GluLys: 1.246 ± 0.758
6.647GluLeu: 6.647 ± 1.609
0.831GluMet: 0.831 ± 0.653
4.155GluAsn: 4.155 ± 0.658
2.493GluPro: 2.493 ± 0.827
1.662GluGln: 1.662 ± 0.705
2.908GluArg: 2.908 ± 1.054
6.647GluSer: 6.647 ± 1.313
2.077GluThr: 2.077 ± 0.828
4.155GluVal: 4.155 ± 2.058
1.662GluTrp: 1.662 ± 1.163
1.246GluTyr: 1.246 ± 0.697
0.0GluXaa: 0.0 ± 0.0
Phe
1.246PheAla: 1.246 ± 0.595
0.831PheCys: 0.831 ± 0.569
3.739PheAsp: 3.739 ± 0.873
4.57PheGlu: 4.57 ± 1.111
4.155PhePhe: 4.155 ± 1.428
1.246PheGly: 1.246 ± 0.697
1.246PheHis: 1.246 ± 0.467
4.155PheIle: 4.155 ± 1.563
3.324PheLys: 3.324 ± 1.805
4.57PheLeu: 4.57 ± 1.489
0.831PheMet: 0.831 ± 0.385
1.662PheAsn: 1.662 ± 0.758
1.246PhePro: 1.246 ± 0.651
0.831PheGln: 0.831 ± 0.626
2.077PheArg: 2.077 ± 0.967
2.493PheSer: 2.493 ± 0.947
1.246PheThr: 1.246 ± 0.67
3.739PheVal: 3.739 ± 1.423
0.831PheTrp: 0.831 ± 0.685
2.077PheTyr: 2.077 ± 0.854
0.0PheXaa: 0.0 ± 0.0
Gly
2.493GlyAla: 2.493 ± 1.394
0.831GlyCys: 0.831 ± 0.476
2.908GlyAsp: 2.908 ± 1.282
2.077GlyGlu: 2.077 ± 0.836
1.246GlyPhe: 1.246 ± 0.685
2.908GlyGly: 2.908 ± 1.222
1.246GlyHis: 1.246 ± 0.776
1.662GlyIle: 1.662 ± 0.723
2.908GlyLys: 2.908 ± 0.861
7.063GlyLeu: 7.063 ± 1.708
0.831GlyMet: 0.831 ± 0.49
2.077GlyAsn: 2.077 ± 0.337
3.324GlyPro: 3.324 ± 0.713
2.908GlyGln: 2.908 ± 1.014
3.739GlyArg: 3.739 ± 1.374
2.493GlySer: 2.493 ± 0.644
3.324GlyThr: 3.324 ± 1.155
3.739GlyVal: 3.739 ± 1.143
0.0GlyTrp: 0.0 ± 0.0
0.831GlyTyr: 0.831 ± 0.49
0.0GlyXaa: 0.0 ± 0.0
His
1.246HisAla: 1.246 ± 0.595
0.831HisCys: 0.831 ± 0.418
0.415HisAsp: 0.415 ± 0.417
0.831HisGlu: 0.831 ± 0.398
0.0HisPhe: 0.0 ± 0.0
1.246HisGly: 1.246 ± 0.567
0.415HisHis: 0.415 ± 0.455
0.415HisIle: 0.415 ± 0.342
1.246HisLys: 1.246 ± 0.382
2.908HisLeu: 2.908 ± 0.741
0.831HisMet: 0.831 ± 0.529
0.831HisAsn: 0.831 ± 0.418
1.662HisPro: 1.662 ± 0.83
0.831HisGln: 0.831 ± 0.398
0.415HisArg: 0.415 ± 0.417
0.831HisSer: 0.831 ± 0.653
1.246HisThr: 1.246 ± 0.823
0.415HisVal: 0.415 ± 0.36
0.831HisTrp: 0.831 ± 0.569
1.662HisTyr: 1.662 ± 0.782
0.0HisXaa: 0.0 ± 0.0
Ile
2.493IleAla: 2.493 ± 0.855
0.415IleCys: 0.415 ± 0.529
4.57IleAsp: 4.57 ± 1.174
5.401IleGlu: 5.401 ± 2.243
2.493IlePhe: 2.493 ± 0.965
4.155IleGly: 4.155 ± 1.844
0.831IleHis: 0.831 ± 0.54
1.246IleIle: 1.246 ± 0.616
1.662IleLys: 1.662 ± 1.22
3.739IleLeu: 3.739 ± 1.067
0.415IleMet: 0.415 ± 0.489
4.155IleAsn: 4.155 ± 1.341
3.324IlePro: 3.324 ± 1.124
2.493IleGln: 2.493 ± 0.717
2.493IleArg: 2.493 ± 0.985
4.985IleSer: 4.985 ± 1.941
3.324IleThr: 3.324 ± 1.16
2.493IleVal: 2.493 ± 0.912
0.415IleTrp: 0.415 ± 0.529
1.246IleTyr: 1.246 ± 0.717
0.0IleXaa: 0.0 ± 0.0
Lys
2.493LysAla: 2.493 ± 0.436
1.246LysCys: 1.246 ± 0.768
3.324LysAsp: 3.324 ± 1.122
2.908LysGlu: 2.908 ± 1.455
4.57LysPhe: 4.57 ± 1.186
2.908LysGly: 2.908 ± 1.259
1.662LysHis: 1.662 ± 0.902
2.077LysIle: 2.077 ± 1.013
2.493LysLys: 2.493 ± 0.867
3.324LysLeu: 3.324 ± 1.606
1.662LysMet: 1.662 ± 0.902
2.077LysAsn: 2.077 ± 1.036
2.493LysPro: 2.493 ± 1.249
4.985LysGln: 4.985 ± 1.834
6.647LysArg: 6.647 ± 1.355
2.077LysSer: 2.077 ± 0.789
0.831LysThr: 0.831 ± 0.398
3.739LysVal: 3.739 ± 1.191
0.831LysTrp: 0.831 ± 0.476
1.246LysTyr: 1.246 ± 0.595
0.0LysXaa: 0.0 ± 0.0
Leu
7.063LeuAla: 7.063 ± 1.311
2.908LeuCys: 2.908 ± 1.488
5.401LeuAsp: 5.401 ± 1.201
2.908LeuGlu: 2.908 ± 1.054
4.985LeuPhe: 4.985 ± 1.084
6.232LeuGly: 6.232 ± 2.033
1.246LeuHis: 1.246 ± 0.632
4.985LeuIle: 4.985 ± 1.441
7.478LeuLys: 7.478 ± 2.996
13.295LeuLeu: 13.295 ± 2.804
1.662LeuMet: 1.662 ± 0.807
3.324LeuAsn: 3.324 ± 1.577
4.155LeuPro: 4.155 ± 0.782
5.816LeuGln: 5.816 ± 1.303
6.232LeuArg: 6.232 ± 1.892
7.478LeuSer: 7.478 ± 1.836
3.739LeuThr: 3.739 ± 0.479
5.816LeuVal: 5.816 ± 1.341
0.415LeuTrp: 0.415 ± 0.342
4.985LeuTyr: 4.985 ± 0.629
0.0LeuXaa: 0.0 ± 0.0
Met
1.662MetAla: 1.662 ± 0.911
1.246MetCys: 1.246 ± 0.641
1.662MetAsp: 1.662 ± 0.737
0.415MetGlu: 0.415 ± 0.529
0.0MetPhe: 0.0 ± 0.0
0.831MetGly: 0.831 ± 0.567
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.415MetLys: 0.415 ± 0.326
0.831MetLeu: 0.831 ± 0.91
0.415MetMet: 0.415 ± 0.457
1.662MetAsn: 1.662 ± 0.705
0.831MetPro: 0.831 ± 0.54
0.0MetGln: 0.0 ± 0.0
0.831MetArg: 0.831 ± 0.569
0.831MetSer: 0.831 ± 0.49
2.077MetThr: 2.077 ± 0.337
1.246MetVal: 1.246 ± 0.767
0.415MetTrp: 0.415 ± 0.417
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.739AsnAla: 3.739 ± 0.993
0.831AsnCys: 0.831 ± 0.418
2.493AsnAsp: 2.493 ± 1.03
3.739AsnGlu: 3.739 ± 0.781
1.662AsnPhe: 1.662 ± 0.98
1.246AsnGly: 1.246 ± 0.835
0.415AsnHis: 0.415 ± 0.326
5.401AsnIle: 5.401 ± 1.727
2.908AsnLys: 2.908 ± 1.129
4.985AsnLeu: 4.985 ± 1.373
0.415AsnMet: 0.415 ± 0.326
2.077AsnAsn: 2.077 ± 0.798
4.155AsnPro: 4.155 ± 1.494
2.493AsnGln: 2.493 ± 1.21
3.739AsnArg: 3.739 ± 1.063
3.739AsnSer: 3.739 ± 1.165
3.324AsnThr: 3.324 ± 0.68
3.324AsnVal: 3.324 ± 1.651
0.831AsnTrp: 0.831 ± 0.418
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
3.324ProAla: 3.324 ± 1.084
1.246ProCys: 1.246 ± 0.595
4.57ProAsp: 4.57 ± 2.161
3.739ProGlu: 3.739 ± 1.223
2.493ProPhe: 2.493 ± 0.815
2.077ProGly: 2.077 ± 1.799
0.0ProHis: 0.0 ± 0.0
2.908ProIle: 2.908 ± 1.376
3.739ProLys: 3.739 ± 1.302
7.063ProLeu: 7.063 ± 1.905
0.0ProMet: 0.0 ± 0.0
3.324ProAsn: 3.324 ± 0.785
8.309ProPro: 8.309 ± 1.092
1.662ProGln: 1.662 ± 0.527
4.155ProArg: 4.155 ± 1.268
6.647ProSer: 6.647 ± 2.338
3.739ProThr: 3.739 ± 1.09
2.493ProVal: 2.493 ± 0.936
0.0ProTrp: 0.0 ± 0.0
2.493ProTyr: 2.493 ± 1.372
0.0ProXaa: 0.0 ± 0.0
Gln
2.077GlnAla: 2.077 ± 0.804
0.831GlnCys: 0.831 ± 0.676
2.493GlnAsp: 2.493 ± 0.535
3.739GlnGlu: 3.739 ± 1.049
2.077GlnPhe: 2.077 ± 0.843
2.077GlnGly: 2.077 ± 0.967
1.246GlnHis: 1.246 ± 0.382
2.493GlnIle: 2.493 ± 1.199
1.246GlnLys: 1.246 ± 0.467
6.232GlnLeu: 6.232 ± 0.863
1.662GlnMet: 1.662 ± 0.753
1.662GlnAsn: 1.662 ± 0.685
2.493GlnPro: 2.493 ± 1.035
2.908GlnGln: 2.908 ± 1.068
1.662GlnArg: 1.662 ± 0.511
3.739GlnSer: 3.739 ± 1.957
3.324GlnThr: 3.324 ± 1.396
3.324GlnVal: 3.324 ± 1.108
0.831GlnTrp: 0.831 ± 0.653
1.662GlnTyr: 1.662 ± 0.557
0.0GlnXaa: 0.0 ± 0.0
Arg
4.57ArgAla: 4.57 ± 0.54
3.739ArgCys: 3.739 ± 2.282
3.324ArgAsp: 3.324 ± 0.983
2.908ArgGlu: 2.908 ± 0.552
1.662ArgPhe: 1.662 ± 0.77
3.324ArgGly: 3.324 ± 0.944
3.324ArgHis: 3.324 ± 1.535
2.493ArgIle: 2.493 ± 0.711
3.739ArgLys: 3.739 ± 0.772
7.894ArgLeu: 7.894 ± 1.475
0.831ArgMet: 0.831 ± 0.57
1.662ArgAsn: 1.662 ± 0.593
3.324ArgPro: 3.324 ± 1.35
2.493ArgGln: 2.493 ± 0.436
5.401ArgArg: 5.401 ± 2.825
2.493ArgSer: 2.493 ± 0.653
2.493ArgThr: 2.493 ± 0.566
2.493ArgVal: 2.493 ± 0.644
0.0ArgTrp: 0.0 ± 0.0
1.662ArgTyr: 1.662 ± 0.896
0.0ArgXaa: 0.0 ± 0.0
Ser
3.739SerAla: 3.739 ± 1.138
0.831SerCys: 0.831 ± 0.54
2.493SerAsp: 2.493 ± 0.916
3.739SerGlu: 3.739 ± 1.205
3.739SerPhe: 3.739 ± 1.306
1.662SerGly: 1.662 ± 0.528
2.077SerHis: 2.077 ± 1.216
2.908SerIle: 2.908 ± 1.164
3.324SerLys: 3.324 ± 0.74
9.971SerLeu: 9.971 ± 2.337
0.0SerMet: 0.0 ± 0.308
4.57SerAsn: 4.57 ± 1.374
6.232SerPro: 6.232 ± 1.638
2.493SerGln: 2.493 ± 0.676
4.57SerArg: 4.57 ± 0.712
5.401SerSer: 5.401 ± 1.182
6.232SerThr: 6.232 ± 1.724
4.155SerVal: 4.155 ± 1.649
0.415SerTrp: 0.415 ± 0.326
2.493SerTyr: 2.493 ± 0.763
0.0SerXaa: 0.0 ± 0.0
Thr
2.908ThrAla: 2.908 ± 0.908
0.831ThrCys: 0.831 ± 0.569
4.155ThrAsp: 4.155 ± 0.964
2.908ThrGlu: 2.908 ± 0.86
1.662ThrPhe: 1.662 ± 0.489
3.739ThrGly: 3.739 ± 0.732
0.0ThrHis: 0.0 ± 0.0
1.662ThrIle: 1.662 ± 0.766
1.662ThrLys: 1.662 ± 0.753
2.077ThrLeu: 2.077 ± 0.983
1.662ThrMet: 1.662 ± 0.911
3.739ThrAsn: 3.739 ± 0.84
5.401ThrPro: 5.401 ± 1.85
2.908ThrGln: 2.908 ± 0.993
3.739ThrArg: 3.739 ± 1.139
3.324ThrSer: 3.324 ± 0.864
4.985ThrThr: 4.985 ± 1.299
4.57ThrVal: 4.57 ± 1.282
1.246ThrTrp: 1.246 ± 0.697
2.077ThrTyr: 2.077 ± 0.672
0.0ThrXaa: 0.0 ± 0.0
Val
3.324ValAla: 3.324 ± 0.717
0.831ValCys: 0.831 ± 0.671
5.401ValAsp: 5.401 ± 0.95
3.739ValGlu: 3.739 ± 0.975
2.493ValPhe: 2.493 ± 0.796
4.155ValGly: 4.155 ± 1.232
2.077ValHis: 2.077 ± 0.626
2.077ValIle: 2.077 ± 1.06
2.908ValLys: 2.908 ± 0.86
2.908ValLeu: 2.908 ± 0.569
0.831ValMet: 0.831 ± 0.671
3.324ValAsn: 3.324 ± 1.303
3.324ValPro: 3.324 ± 1.401
4.57ValGln: 4.57 ± 1.352
2.077ValArg: 2.077 ± 1.017
5.401ValSer: 5.401 ± 1.594
3.739ValThr: 3.739 ± 0.993
1.246ValVal: 1.246 ± 0.579
0.831ValTrp: 0.831 ± 0.476
3.739ValTyr: 3.739 ± 1.205
0.0ValXaa: 0.0 ± 0.0
Trp
0.831TrpAla: 0.831 ± 0.398
0.0TrpCys: 0.0 ± 0.0
0.415TrpAsp: 0.415 ± 0.342
0.831TrpGlu: 0.831 ± 0.476
0.831TrpPhe: 0.831 ± 0.418
0.415TrpGly: 0.415 ± 0.342
0.415TrpHis: 0.415 ± 0.417
1.662TrpIle: 1.662 ± 0.835
1.662TrpLys: 1.662 ± 0.772
0.0TrpLeu: 0.0 ± 0.0
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.415TrpPro: 0.415 ± 0.342
0.415TrpGln: 0.415 ± 0.342
0.831TrpArg: 0.831 ± 0.647
0.831TrpSer: 0.831 ± 0.377
0.831TrpThr: 0.831 ± 0.834
1.662TrpVal: 1.662 ± 0.511
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.662TyrAla: 1.662 ± 0.705
0.831TyrCys: 0.831 ± 1.059
1.662TyrAsp: 1.662 ± 0.816
1.662TyrGlu: 1.662 ± 1.063
4.155TyrPhe: 4.155 ± 0.677
2.077TyrGly: 2.077 ± 0.633
0.0TyrHis: 0.0 ± 0.0
1.246TyrIle: 1.246 ± 0.467
2.493TyrLys: 2.493 ± 1.187
2.493TyrLeu: 2.493 ± 1.13
0.831TyrMet: 0.831 ± 0.476
2.493TyrAsn: 2.493 ± 0.666
1.246TyrPro: 1.246 ± 0.641
2.908TyrGln: 2.908 ± 0.928
1.246TyrArg: 1.246 ± 0.467
2.077TyrSer: 2.077 ± 0.729
0.831TyrThr: 0.831 ± 0.398
0.0TyrVal: 0.0 ± 0.0
0.415TyrTrp: 0.415 ± 0.457
3.324TyrTyr: 3.324 ± 0.651
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2408 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski