Amino acid dipepetide frequency for Human papillomavirus type 190

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.002AlaAla: 7.002 ± 1.676
2.188AlaCys: 2.188 ± 1.439
4.376AlaAsp: 4.376 ± 0.818
3.939AlaGlu: 3.939 ± 0.665
3.063AlaPhe: 3.063 ± 0.568
3.063AlaGly: 3.063 ± 1.056
0.875AlaHis: 0.875 ± 0.659
3.501AlaIle: 3.501 ± 1.044
3.501AlaLys: 3.501 ± 1.121
5.252AlaLeu: 5.252 ± 1.427
0.0AlaMet: 0.0 ± 0.0
1.751AlaAsn: 1.751 ± 1.033
3.063AlaPro: 3.063 ± 0.763
3.063AlaGln: 3.063 ± 0.349
2.626AlaArg: 2.626 ± 0.827
2.626AlaSer: 2.626 ± 0.846
4.376AlaThr: 4.376 ± 1.255
3.939AlaVal: 3.939 ± 1.087
0.875AlaTrp: 0.875 ± 0.52
1.751AlaTyr: 1.751 ± 1.033
0.0AlaXaa: 0.0 ± 0.0
Cys
1.313CysAla: 1.313 ± 0.498
1.313CysCys: 1.313 ± 1.057
2.188CysAsp: 2.188 ± 0.977
0.875CysGlu: 0.875 ± 0.378
0.875CysPhe: 0.875 ± 0.574
1.313CysGly: 1.313 ± 0.527
0.0CysHis: 0.0 ± 0.0
0.875CysIle: 0.875 ± 0.999
2.188CysLys: 2.188 ± 0.961
2.626CysLeu: 2.626 ± 1.56
0.875CysMet: 0.875 ± 0.52
0.875CysAsn: 0.875 ± 0.378
2.188CysPro: 2.188 ± 0.569
0.0CysGln: 0.0 ± 0.0
0.438CysArg: 0.438 ± 0.499
1.313CysSer: 1.313 ± 1.201
0.875CysThr: 0.875 ± 0.378
0.875CysVal: 0.875 ± 0.52
1.313CysTrp: 1.313 ± 0.469
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
6.127AspAla: 6.127 ± 1.634
2.626AspCys: 2.626 ± 1.705
3.501AspAsp: 3.501 ± 0.905
4.376AspGlu: 4.376 ± 0.949
2.626AspPhe: 2.626 ± 0.901
2.626AspGly: 2.626 ± 0.393
0.875AspHis: 0.875 ± 0.378
4.376AspIle: 4.376 ± 0.961
2.188AspLys: 2.188 ± 0.879
3.939AspLeu: 3.939 ± 1.301
0.875AspMet: 0.875 ± 0.378
3.939AspAsn: 3.939 ± 0.929
4.814AspPro: 4.814 ± 1.207
2.626AspGln: 2.626 ± 1.304
2.188AspArg: 2.188 ± 0.434
6.127AspSer: 6.127 ± 1.753
6.565AspThr: 6.565 ± 1.812
3.939AspVal: 3.939 ± 2.859
0.875AspTrp: 0.875 ± 0.705
2.188AspTyr: 2.188 ± 0.636
0.0AspXaa: 0.0 ± 0.0
Glu
4.376GluAla: 4.376 ± 1.956
0.438GluCys: 0.438 ± 0.33
3.501GluAsp: 3.501 ± 0.529
5.689GluGlu: 5.689 ± 1.953
1.751GluPhe: 1.751 ± 0.813
2.626GluGly: 2.626 ± 0.393
0.438GluHis: 0.438 ± 0.369
1.313GluIle: 1.313 ± 0.598
3.063GluLys: 3.063 ± 1.037
3.939GluLeu: 3.939 ± 0.624
0.438GluMet: 0.438 ± 0.352
3.939GluAsn: 3.939 ± 1.115
3.063GluPro: 3.063 ± 0.82
5.689GluGln: 5.689 ± 1.087
1.313GluArg: 1.313 ± 0.469
4.814GluSer: 4.814 ± 1.571
2.188GluThr: 2.188 ± 0.723
5.689GluVal: 5.689 ± 1.388
1.751GluTrp: 1.751 ± 0.226
3.501GluTyr: 3.501 ± 1.59
0.0GluXaa: 0.0 ± 0.0
Phe
1.313PheAla: 1.313 ± 0.535
1.313PheCys: 1.313 ± 0.535
1.751PheAsp: 1.751 ± 0.668
3.501PheGlu: 3.501 ± 1.276
3.063PhePhe: 3.063 ± 0.799
1.313PheGly: 1.313 ± 0.652
0.0PheHis: 0.0 ± 0.0
2.626PheIle: 2.626 ± 0.701
4.814PheLys: 4.814 ± 1.808
4.814PheLeu: 4.814 ± 0.953
0.438PheMet: 0.438 ± 0.33
2.188PheAsn: 2.188 ± 0.699
1.751PhePro: 1.751 ± 0.798
3.939PheGln: 3.939 ± 1.408
1.313PheArg: 1.313 ± 0.362
1.313PheSer: 1.313 ± 0.642
1.751PheThr: 1.751 ± 0.831
2.188PheVal: 2.188 ± 0.903
0.875PheTrp: 0.875 ± 0.378
2.188PheTyr: 2.188 ± 1.071
0.0PheXaa: 0.0 ± 0.0
Gly
3.501GlyAla: 3.501 ± 0.736
1.313GlyCys: 1.313 ± 0.627
4.814GlyAsp: 4.814 ± 1.132
5.689GlyGlu: 5.689 ± 1.64
0.875GlyPhe: 0.875 ± 0.378
5.252GlyGly: 5.252 ± 2.439
1.751GlyHis: 1.751 ± 1.019
2.188GlyIle: 2.188 ± 1.004
1.751GlyLys: 1.751 ± 0.655
4.376GlyLeu: 4.376 ± 1.228
0.875GlyMet: 0.875 ± 0.509
2.626GlyAsn: 2.626 ± 0.789
1.751GlyPro: 1.751 ± 0.226
3.939GlyGln: 3.939 ± 1.449
4.376GlyArg: 4.376 ± 1.127
4.376GlySer: 4.376 ± 1.034
3.501GlyThr: 3.501 ± 1.447
3.501GlyVal: 3.501 ± 0.554
0.0GlyTrp: 0.0 ± 0.0
0.875GlyTyr: 0.875 ± 0.395
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
1.751HisCys: 1.751 ± 0.875
0.438HisAsp: 0.438 ± 0.352
0.875HisGlu: 0.875 ± 0.378
1.751HisPhe: 1.751 ± 0.668
0.438HisGly: 0.438 ± 0.473
0.875HisHis: 0.875 ± 0.824
1.751HisIle: 1.751 ± 0.661
0.438HisLys: 0.438 ± 0.412
1.313HisLeu: 1.313 ± 0.644
0.438HisMet: 0.438 ± 0.544
0.438HisAsn: 0.438 ± 0.352
1.751HisPro: 1.751 ± 0.634
0.875HisGln: 0.875 ± 0.378
1.313HisArg: 1.313 ± 0.362
1.313HisSer: 1.313 ± 0.362
1.313HisThr: 1.313 ± 0.362
1.751HisVal: 1.751 ± 0.501
0.438HisTrp: 0.438 ± 0.33
0.438HisTyr: 0.438 ± 0.33
0.0HisXaa: 0.0 ± 0.0
Ile
2.188IleAla: 2.188 ± 0.91
0.875IleCys: 0.875 ± 0.596
2.626IleAsp: 2.626 ± 1.254
5.252IleGlu: 5.252 ± 0.876
2.626IlePhe: 2.626 ± 0.584
5.252IleGly: 5.252 ± 1.544
0.875IleHis: 0.875 ± 0.509
2.626IleIle: 2.626 ± 1.303
0.875IleLys: 0.875 ± 0.543
3.939IleLeu: 3.939 ± 0.829
0.0IleMet: 0.0 ± 0.0
3.939IleAsn: 3.939 ± 0.538
2.626IlePro: 2.626 ± 1.405
1.751IleGln: 1.751 ± 0.978
3.063IleArg: 3.063 ± 1.324
3.063IleSer: 3.063 ± 0.984
2.626IleThr: 2.626 ± 0.886
3.939IleVal: 3.939 ± 1.291
0.438IleTrp: 0.438 ± 0.499
2.188IleTyr: 2.188 ± 0.392
0.0IleXaa: 0.0 ± 0.0
Lys
2.626LysAla: 2.626 ± 0.908
1.751LysCys: 1.751 ± 0.524
3.501LysAsp: 3.501 ± 1.01
3.063LysGlu: 3.063 ± 1.521
2.626LysPhe: 2.626 ± 0.972
1.313LysGly: 1.313 ± 0.957
1.751LysHis: 1.751 ± 1.033
1.313LysIle: 1.313 ± 0.783
1.751LysLys: 1.751 ± 0.798
4.814LysLeu: 4.814 ± 1.606
1.313LysMet: 1.313 ± 0.611
2.626LysAsn: 2.626 ± 0.796
0.438LysPro: 0.438 ± 0.352
3.939LysGln: 3.939 ± 1.642
7.002LysArg: 7.002 ± 0.779
2.626LysSer: 2.626 ± 1.262
2.626LysThr: 2.626 ± 1.056
3.063LysVal: 3.063 ± 0.788
0.438LysTrp: 0.438 ± 0.412
3.063LysTyr: 3.063 ± 1.097
0.0LysXaa: 0.0 ± 0.0
Leu
3.939LeuAla: 3.939 ± 0.429
1.751LeuCys: 1.751 ± 0.709
6.565LeuAsp: 6.565 ± 0.824
4.814LeuGlu: 4.814 ± 1.099
7.002LeuPhe: 7.002 ± 1.916
5.252LeuGly: 5.252 ± 2.271
3.063LeuHis: 3.063 ± 1.414
6.127LeuIle: 6.127 ± 1.72
3.939LeuLys: 3.939 ± 1.789
7.877LeuLeu: 7.877 ± 1.408
0.875LeuMet: 0.875 ± 0.435
2.188LeuAsn: 2.188 ± 0.404
4.814LeuPro: 4.814 ± 0.836
7.002LeuGln: 7.002 ± 0.987
3.501LeuArg: 3.501 ± 0.755
5.252LeuSer: 5.252 ± 1.893
5.689LeuThr: 5.689 ± 0.55
4.376LeuVal: 4.376 ± 0.674
0.875LeuTrp: 0.875 ± 0.406
5.252LeuTyr: 5.252 ± 1.027
0.0LeuXaa: 0.0 ± 0.0
Met
2.188MetAla: 2.188 ± 1.006
0.875MetCys: 0.875 ± 0.378
0.0MetAsp: 0.0 ± 0.0
1.751MetGlu: 1.751 ± 0.618
0.0MetPhe: 0.0 ± 0.0
0.438MetGly: 0.438 ± 0.352
0.438MetHis: 0.438 ± 0.352
0.438MetIle: 0.438 ± 0.352
1.313MetLys: 1.313 ± 1.057
1.313MetLeu: 1.313 ± 0.605
0.0MetMet: 0.0 ± 0.0
0.875MetAsn: 0.875 ± 0.378
0.438MetPro: 0.438 ± 0.352
0.0MetGln: 0.0 ± 0.0
0.438MetArg: 0.438 ± 0.352
0.438MetSer: 0.438 ± 0.499
0.875MetThr: 0.875 ± 0.378
1.313MetVal: 1.313 ± 0.652
0.438MetTrp: 0.438 ± 0.412
1.313MetTyr: 1.313 ± 0.469
0.0MetXaa: 0.0 ± 0.0
Asn
2.626AsnAla: 2.626 ± 0.369
0.875AsnCys: 0.875 ± 0.543
2.626AsnAsp: 2.626 ± 1.442
1.751AsnGlu: 1.751 ± 0.655
1.313AsnPhe: 1.313 ± 0.703
2.626AsnGly: 2.626 ± 0.948
0.0AsnHis: 0.0 ± 0.0
2.188AsnIle: 2.188 ± 0.649
2.626AsnLys: 2.626 ± 0.637
3.501AsnLeu: 3.501 ± 0.384
1.751AsnMet: 1.751 ± 1.002
2.626AsnAsn: 2.626 ± 1.127
3.939AsnPro: 3.939 ± 1.41
2.626AsnGln: 2.626 ± 1.133
2.626AsnArg: 2.626 ± 0.708
3.063AsnSer: 3.063 ± 1.329
4.376AsnThr: 4.376 ± 1.688
2.188AsnVal: 2.188 ± 0.589
0.875AsnTrp: 0.875 ± 0.47
1.313AsnTyr: 1.313 ± 0.652
0.0AsnXaa: 0.0 ± 0.0
Pro
2.626ProAla: 2.626 ± 1.004
0.438ProCys: 0.438 ± 0.499
5.689ProAsp: 5.689 ± 1.698
2.626ProGlu: 2.626 ± 0.917
0.438ProPhe: 0.438 ± 0.412
1.751ProGly: 1.751 ± 0.601
0.0ProHis: 0.0 ± 0.0
4.376ProIle: 4.376 ± 1.593
4.376ProLys: 4.376 ± 1.059
8.315ProLeu: 8.315 ± 1.517
0.0ProMet: 0.0 ± 0.0
2.188ProAsn: 2.188 ± 0.966
4.814ProPro: 4.814 ± 1.132
1.313ProGln: 1.313 ± 0.615
3.501ProArg: 3.501 ± 1.037
3.939ProSer: 3.939 ± 0.847
4.376ProThr: 4.376 ± 1.725
3.063ProVal: 3.063 ± 1.63
0.0ProTrp: 0.0 ± 0.0
2.626ProTyr: 2.626 ± 1.118
0.0ProXaa: 0.0 ± 0.0
Gln
3.063GlnAla: 3.063 ± 0.682
1.751GlnCys: 1.751 ± 0.815
2.188GlnAsp: 2.188 ± 0.367
1.751GlnGlu: 1.751 ± 0.798
3.063GlnPhe: 3.063 ± 0.635
2.188GlnGly: 2.188 ± 0.914
2.188GlnHis: 2.188 ± 1.081
3.063GlnIle: 3.063 ± 0.868
3.063GlnLys: 3.063 ± 0.682
7.002GlnLeu: 7.002 ± 2.196
1.313GlnMet: 1.313 ± 1.057
1.751GlnAsn: 1.751 ± 1.019
3.063GlnPro: 3.063 ± 0.763
2.626GlnGln: 2.626 ± 1.351
3.063GlnArg: 3.063 ± 1.441
4.814GlnSer: 4.814 ± 1.44
2.188GlnThr: 2.188 ± 0.699
2.188GlnVal: 2.188 ± 0.836
0.438GlnTrp: 0.438 ± 0.352
2.188GlnTyr: 2.188 ± 0.899
0.0GlnXaa: 0.0 ± 0.0
Arg
3.063ArgAla: 3.063 ± 1.62
0.438ArgCys: 0.438 ± 0.499
3.063ArgAsp: 3.063 ± 1.329
2.626ArgGlu: 2.626 ± 1.178
1.751ArgPhe: 1.751 ± 0.65
3.939ArgGly: 3.939 ± 1.134
2.188ArgHis: 2.188 ± 0.745
0.438ArgIle: 0.438 ± 0.369
4.376ArgLys: 4.376 ± 0.482
4.814ArgLeu: 4.814 ± 0.801
0.875ArgMet: 0.875 ± 0.47
3.063ArgAsn: 3.063 ± 0.54
3.939ArgPro: 3.939 ± 1.568
2.626ArgGln: 2.626 ± 1.023
6.127ArgArg: 6.127 ± 2.405
4.814ArgSer: 4.814 ± 1.157
2.188ArgThr: 2.188 ± 0.392
6.127ArgVal: 6.127 ± 1.389
0.0ArgTrp: 0.0 ± 0.0
0.875ArgTyr: 0.875 ± 0.43
0.0ArgXaa: 0.0 ± 0.0
Ser
3.501SerAla: 3.501 ± 0.612
0.875SerCys: 0.875 ± 0.52
5.252SerAsp: 5.252 ± 1.682
1.751SerGlu: 1.751 ± 0.79
1.751SerPhe: 1.751 ± 0.482
5.252SerGly: 5.252 ± 1.777
1.751SerHis: 1.751 ± 0.482
2.626SerIle: 2.626 ± 1.067
2.626SerLys: 2.626 ± 1.621
7.002SerLeu: 7.002 ± 1.231
1.313SerMet: 1.313 ± 0.652
3.063SerAsn: 3.063 ± 1.156
2.626SerPro: 2.626 ± 0.968
3.063SerGln: 3.063 ± 0.811
3.939SerArg: 3.939 ± 1.05
9.19SerSer: 9.19 ± 2.723
5.689SerThr: 5.689 ± 1.887
7.002SerVal: 7.002 ± 1.435
1.313SerTrp: 1.313 ± 0.527
2.188SerTyr: 2.188 ± 0.666
0.0SerXaa: 0.0 ± 0.0
Thr
3.501ThrAla: 3.501 ± 0.949
0.0ThrCys: 0.0 ± 0.0
6.127ThrAsp: 6.127 ± 1.095
3.501ThrGlu: 3.501 ± 1.129
2.188ThrPhe: 2.188 ± 0.958
5.689ThrGly: 5.689 ± 1.89
0.875ThrHis: 0.875 ± 0.607
5.252ThrIle: 5.252 ± 1.768
2.626ThrLys: 2.626 ± 1.103
4.814ThrLeu: 4.814 ± 1.443
0.875ThrMet: 0.875 ± 0.705
3.063ThrAsn: 3.063 ± 0.932
3.501ThrPro: 3.501 ± 1.327
3.501ThrGln: 3.501 ± 1.176
2.188ThrArg: 2.188 ± 0.629
3.501ThrSer: 3.501 ± 0.988
4.376ThrThr: 4.376 ± 1.435
3.501ThrVal: 3.501 ± 0.957
0.438ThrTrp: 0.438 ± 0.412
2.626ThrTyr: 2.626 ± 0.737
0.0ThrXaa: 0.0 ± 0.0
Val
4.814ValAla: 4.814 ± 0.699
0.875ValCys: 0.875 ± 0.518
6.565ValAsp: 6.565 ± 1.517
5.252ValGlu: 5.252 ± 1.496
2.188ValPhe: 2.188 ± 0.954
3.063ValGly: 3.063 ± 1.862
1.751ValHis: 1.751 ± 0.668
3.501ValIle: 3.501 ± 1.014
1.751ValLys: 1.751 ± 0.682
4.814ValLeu: 4.814 ± 1.278
0.875ValMet: 0.875 ± 0.378
0.875ValAsn: 0.875 ± 0.395
5.689ValPro: 5.689 ± 0.994
2.626ValGln: 2.626 ± 0.862
4.376ValArg: 4.376 ± 1.069
5.689ValSer: 5.689 ± 0.666
3.501ValThr: 3.501 ± 1.602
0.875ValVal: 0.875 ± 0.395
0.875ValTrp: 0.875 ± 0.509
3.063ValTyr: 3.063 ± 0.606
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.875TrpAsp: 0.875 ± 0.509
0.0TrpGlu: 0.0 ± 0.0
0.438TrpPhe: 0.438 ± 0.412
1.313TrpGly: 1.313 ± 0.535
0.438TrpHis: 0.438 ± 0.412
0.875TrpIle: 0.875 ± 0.705
1.751TrpLys: 1.751 ± 1.028
1.313TrpLeu: 1.313 ± 0.469
0.438TrpMet: 0.438 ± 0.353
0.875TrpAsn: 0.875 ± 0.509
0.0TrpPro: 0.0 ± 0.0
0.438TrpGln: 0.438 ± 0.33
1.313TrpArg: 1.313 ± 0.646
0.438TrpSer: 0.438 ± 0.369
1.313TrpThr: 1.313 ± 0.866
0.875TrpVal: 0.875 ± 0.47
0.0TrpTrp: 0.0 ± 0.0
0.438TrpTyr: 0.438 ± 0.352
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.063TyrAla: 3.063 ± 1.243
0.875TyrCys: 0.875 ± 0.596
1.751TyrAsp: 1.751 ± 0.97
0.438TyrGlu: 0.438 ± 0.352
3.063TyrPhe: 3.063 ± 0.975
2.626TyrGly: 2.626 ± 0.737
0.0TyrHis: 0.0 ± 0.0
1.313TyrIle: 1.313 ± 0.612
2.626TyrLys: 2.626 ± 0.96
4.376TyrLeu: 4.376 ± 0.991
0.875TyrMet: 0.875 ± 0.509
2.626TyrAsn: 2.626 ± 0.574
2.188TyrPro: 2.188 ± 0.742
1.313TyrGln: 1.313 ± 0.469
2.626TyrArg: 2.626 ± 0.957
3.063TyrSer: 3.063 ± 0.712
1.751TyrThr: 1.751 ± 0.571
2.626TyrVal: 2.626 ± 0.96
0.875TyrTrp: 0.875 ± 0.509
2.626TyrTyr: 2.626 ± 1.021
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2286 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski