Amino acid dipeptide frequency for Human papillomavirus 154

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
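A minimal sketch of how such dipeptide frequencies could be counted is given below. It assumes one-letter amino acid codes, overlapping pairs, and pairs counted only within a protein (never across protein boundaries); the function name and the toy sequences are hypothetical, and the database's own normalisation may differ.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption of this sketch: pairs are counted within each protein only.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real HPV154 sequences): pairs are MA, AA, AK and AK, KL.
freqs = dipeptide_frequencies(["MAAK", "AKL"])
```

Here `freqs["AK"]` is 400 ‰ because two of the five counted pairs are AK.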

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins and all amino acids were equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
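The uniform expectation and the over/under comparison can be sketched as follows. The strict comparison against 2.5 ‰ is an assumption; the page does not state the exact rule (for example, any tolerance) used for the colouring, and the names `EXPECTED_PER_MILLE` and `classify` are hypothetical.

```python
STANDARD_AMINO_ACIDS = 20

# 20 * 20 = 400 ordered pairs, so a uniform share is 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / STANDARD_AMINO_ACIDS ** 2


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"   # would be marked red
    if freq_per_mille < expected:
        return "under"  # would be marked blue
    return "as expected"


# Two values taken from the table: LeuLeu and AlaGly.
leu_leu = classify(9.087)
ala_gly = classify(0.413)
```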

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
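As a small worked example of the unit conversion, the helpers below (hypothetical names) turn a per mille value into a plain fraction or a percentage.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1%)."""
    return value * 0.1


# LeuLeu from the table: 9.087 per mille.
leu_leu_fraction = per_mille_to_fraction(9.087)
leu_leu_percent = per_mille_to_percent(9.087)
```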

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰), listed as dipeptide: frequency ± error and grouped by the first residue.
Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 3.304 ± 1.807
AlaCys: 0.826 ± 0.872
AlaAsp: 2.478 ± 0.835
AlaGlu: 4.131 ± 1.141
AlaPhe: 2.891 ± 0.541
AlaGly: 0.413 ± 0.367
AlaHis: 0.826 ± 0.515
AlaIle: 3.304 ± 0.976
AlaLys: 4.544 ± 1.369
AlaLeu: 6.609 ± 2.32
AlaMet: 1.239 ± 0.67
AlaAsn: 1.239 ± 0.661
AlaPro: 4.131 ± 1.415
AlaGln: 1.652 ± 1.083
AlaArg: 2.065 ± 0.769
AlaSer: 4.131 ± 1.498
AlaThr: 4.131 ± 0.877
AlaVal: 0.826 ± 0.389
AlaTrp: 1.239 ± 0.645
AlaTyr: 2.065 ± 1.415
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.239 ± 0.645
CysCys: 1.239 ± 0.731
CysAsp: 1.652 ± 0.778
CysGlu: 2.065 ± 1.023
CysPhe: 1.239 ± 0.54
CysGly: 0.413 ± 0.323
CysHis: 0.413 ± 0.436
CysIle: 1.239 ± 0.731
CysLys: 1.652 ± 0.767
CysLeu: 1.652 ± 1.313
CysMet: 0.413 ± 0.388
CysAsn: 1.652 ± 0.956
CysPro: 1.652 ± 0.795
CysGln: 0.0 ± 0.0
CysArg: 0.826 ± 0.872
CysSer: 1.239 ± 0.531
CysThr: 1.652 ± 0.886
CysVal: 0.826 ± 0.554
CysTrp: 0.826 ± 0.455
CysTyr: 0.826 ± 0.682
CysXaa: 0.0 ± 0.0

Asp
AspAla: 2.891 ± 0.852
AspCys: 1.652 ± 0.946
AspAsp: 4.544 ± 1.712
AspGlu: 6.196 ± 2.22
AspPhe: 2.891 ± 0.788
AspGly: 2.891 ± 0.816
AspHis: 0.826 ± 0.646
AspIle: 4.131 ± 2.196
AspLys: 1.652 ± 0.701
AspLeu: 7.435 ± 2.209
AspMet: 1.239 ± 0.548
AspAsn: 3.717 ± 0.826
AspPro: 4.131 ± 1.451
AspGln: 3.304 ± 0.817
AspArg: 1.239 ± 0.727
AspSer: 2.891 ± 1.35
AspThr: 5.783 ± 1.045
AspVal: 5.37 ± 1.676
AspTrp: 0.413 ± 0.352
AspTyr: 0.826 ± 0.389
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.131 ± 1.063
GluCys: 0.826 ± 0.704
GluAsp: 5.37 ± 1.724
GluGlu: 7.022 ± 1.999
GluPhe: 2.891 ± 1.113
GluGly: 2.891 ± 0.738
GluHis: 2.065 ± 0.752
GluIle: 2.891 ± 1.297
GluLys: 2.065 ± 1.088
GluLeu: 4.957 ± 1.41
GluMet: 1.239 ± 0.529
GluAsn: 5.37 ± 1.082
GluPro: 1.239 ± 0.75
GluGln: 1.652 ± 0.748
GluArg: 3.717 ± 1.37
GluSer: 4.544 ± 1.307
GluThr: 2.891 ± 0.928
GluVal: 4.544 ± 1.046
GluTrp: 0.413 ± 0.323
GluTyr: 1.652 ± 0.653
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.065 ± 0.771
PheCys: 2.065 ± 0.762
PheAsp: 4.957 ± 0.789
PheGlu: 3.304 ± 1.274
PhePhe: 3.717 ± 1.34
PheGly: 0.0 ± 0.0
PheHis: 0.826 ± 0.515
PheIle: 4.131 ± 0.875
PheLys: 2.478 ± 1.494
PheLeu: 5.37 ± 2.938
PheMet: 1.652 ± 0.659
PheAsn: 1.652 ± 0.471
PhePro: 0.826 ± 0.514
PheGln: 1.239 ± 0.731
PheArg: 1.652 ± 0.551
PheSer: 4.131 ± 1.472
PheThr: 1.239 ± 0.283
PheVal: 2.065 ± 1.137
PheTrp: 1.652 ± 0.767
PheTyr: 2.065 ± 0.818
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.717 ± 0.865
GlyCys: 0.413 ± 0.367
GlyAsp: 3.304 ± 1.137
GlyGlu: 2.478 ± 0.797
GlyPhe: 2.065 ± 1.041
GlyGly: 4.957 ± 2.552
GlyHis: 1.652 ± 0.832
GlyIle: 2.891 ± 1.558
GlyLys: 2.891 ± 0.692
GlyLeu: 7.022 ± 1.281
GlyMet: 0.0 ± 0.0
GlyAsn: 3.717 ± 1.034
GlyPro: 3.304 ± 0.87
GlyGln: 2.065 ± 0.714
GlyArg: 2.891 ± 1.07
GlySer: 2.891 ± 0.761
GlyThr: 2.891 ± 0.788
GlyVal: 2.891 ± 0.768
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.239 ± 0.577
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.413 ± 0.436
HisCys: 0.413 ± 0.377
HisAsp: 1.239 ± 0.59
HisGlu: 1.652 ± 0.858
HisPhe: 0.826 ± 0.735
HisGly: 1.239 ± 0.638
HisHis: 0.826 ± 0.517
HisIle: 0.826 ± 0.53
HisLys: 1.239 ± 0.421
HisLeu: 2.478 ± 0.958
HisMet: 0.413 ± 0.364
HisAsn: 1.239 ± 0.969
HisPro: 2.065 ± 1.137
HisGln: 1.239 ± 0.558
HisArg: 0.826 ± 0.432
HisSer: 2.065 ± 1.005
HisThr: 0.413 ± 0.367
HisVal: 0.413 ± 0.384
HisTrp: 0.826 ± 0.581
HisTyr: 0.826 ± 0.439
HisXaa: 0.0 ± 0.0

Ile
IleAla: 3.304 ± 0.655
IleCys: 0.413 ± 0.377
IleAsp: 4.957 ± 1.304
IleGlu: 2.478 ± 1.105
IlePhe: 2.478 ± 0.728
IleGly: 4.131 ± 1.822
IleHis: 2.065 ± 1.165
IleIle: 2.478 ± 0.991
IleLys: 2.891 ± 0.599
IleLeu: 3.304 ± 0.611
IleMet: 0.0 ± 0.0
IleAsn: 7.435 ± 2.469
IlePro: 3.717 ± 1.318
IleGln: 3.304 ± 0.97
IleArg: 2.065 ± 0.679
IleSer: 2.891 ± 0.738
IleThr: 3.717 ± 0.744
IleVal: 3.304 ± 0.625
IleTrp: 0.0 ± 0.0
IleTyr: 3.304 ± 0.601
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.478 ± 0.762
LysCys: 0.826 ± 0.455
LysAsp: 4.131 ± 1.703
LysGlu: 3.717 ± 1.705
LysPhe: 3.304 ± 1.54
LysGly: 1.652 ± 0.778
LysHis: 2.478 ± 1.494
LysIle: 2.891 ± 0.928
LysLys: 2.478 ± 0.41
LysLeu: 5.37 ± 2.085
LysMet: 1.239 ± 0.558
LysAsn: 2.065 ± 0.733
LysPro: 1.652 ± 1.202
LysGln: 3.304 ± 0.509
LysArg: 4.957 ± 1.324
LysSer: 3.304 ± 1.968
LysThr: 1.239 ± 0.743
LysVal: 2.891 ± 0.541
LysTrp: 1.239 ± 0.661
LysTyr: 1.652 ± 0.886
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 5.37 ± 0.757
LeuCys: 1.652 ± 0.788
LeuAsp: 5.37 ± 1.871
LeuGlu: 5.783 ± 1.172
LeuPhe: 4.544 ± 1.177
LeuGly: 7.022 ± 1.916
LeuHis: 2.478 ± 1.121
LeuIle: 3.717 ± 0.984
LeuLys: 6.609 ± 1.688
LeuLeu: 9.087 ± 1.879
LeuMet: 1.239 ± 0.652
LeuAsn: 2.478 ± 0.925
LeuPro: 7.435 ± 1.089
LeuGln: 6.196 ± 2.129
LeuArg: 6.196 ± 1.937
LeuSer: 7.022 ± 1.582
LeuThr: 2.478 ± 0.771
LeuVal: 4.131 ± 1.406
LeuTrp: 0.0 ± 0.0
LeuTyr: 4.131 ± 0.836
LeuXaa: 0.0 ± 0.0

Met
MetAla: 0.826 ± 0.389
MetCys: 2.065 ± 1.005
MetAsp: 1.239 ± 0.67
MetGlu: 0.826 ± 0.682
MetPhe: 0.826 ± 0.385
MetGly: 1.652 ± 0.436
MetHis: 0.0 ± 0.0
MetIle: 0.826 ± 0.517
MetLys: 0.0 ± 0.0
MetLeu: 0.826 ± 0.517
MetMet: 0.826 ± 0.478
MetAsn: 0.826 ± 0.517
MetPro: 0.826 ± 0.554
MetGln: 0.413 ± 0.367
MetArg: 1.652 ± 0.778
MetSer: 2.478 ± 1.028
MetThr: 1.652 ± 0.648
MetVal: 1.239 ± 0.63
MetTrp: 0.0 ± 0.0
MetTyr: 0.826 ± 0.385
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.891 ± 0.828
AsnCys: 1.239 ± 0.421
AsnAsp: 2.065 ± 0.925
AsnGlu: 3.304 ± 1.527
AsnPhe: 0.826 ± 0.735
AsnGly: 2.065 ± 0.974
AsnHis: 0.413 ± 0.352
AsnIle: 5.37 ± 1.499
AsnLys: 3.717 ± 0.545
AsnLeu: 2.478 ± 0.51
AsnMet: 0.826 ± 0.704
AsnAsn: 4.957 ± 2.04
AsnPro: 2.478 ± 0.51
AsnGln: 4.544 ± 1.391
AsnArg: 2.891 ± 0.543
AsnSer: 3.717 ± 1.304
AsnThr: 4.957 ± 0.756
AsnVal: 2.891 ± 0.956
AsnTrp: 1.652 ± 0.648
AsnTyr: 0.826 ± 0.514
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 4.544 ± 0.715
ProCys: 1.239 ± 0.528
ProAsp: 4.544 ± 1.296
ProGlu: 3.717 ± 0.651
ProPhe: 1.652 ± 0.433
ProGly: 2.065 ± 1.491
ProHis: 0.413 ± 0.377
ProIle: 3.304 ± 1.79
ProLys: 3.717 ± 0.977
ProLeu: 7.022 ± 2.433
ProMet: 0.826 ± 0.385
ProAsn: 2.478 ± 1.297
ProPro: 9.087 ± 0.994
ProGln: 2.891 ± 0.543
ProArg: 2.891 ± 0.843
ProSer: 5.37 ± 1.026
ProThr: 4.957 ± 1.418
ProVal: 2.478 ± 0.52
ProTrp: 0.0 ± 0.0
ProTyr: 1.652 ± 0.802
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.065 ± 1.07
GlnCys: 0.413 ± 0.436
GlnAsp: 2.891 ± 0.679
GlnGlu: 2.065 ± 0.732
GlnPhe: 3.717 ± 0.622
GlnGly: 2.478 ± 0.778
GlnHis: 0.413 ± 0.367
GlnIle: 3.304 ± 0.433
GlnLys: 2.065 ± 0.954
GlnLeu: 6.196 ± 0.822
GlnMet: 2.478 ± 1.234
GlnAsn: 1.652 ± 0.853
GlnPro: 2.891 ± 0.458
GlnGln: 2.065 ± 0.936
GlnArg: 1.239 ± 0.558
GlnSer: 1.239 ± 0.421
GlnThr: 2.891 ± 0.9
GlnVal: 2.478 ± 0.58
GlnTrp: 1.239 ± 0.747
GlnTyr: 1.652 ± 0.778
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 3.304 ± 0.712
ArgCys: 1.652 ± 0.869
ArgAsp: 3.304 ± 1.324
ArgGlu: 2.065 ± 0.437
ArgPhe: 2.065 ± 0.974
ArgGly: 3.717 ± 1.036
ArgHis: 1.652 ± 1.15
ArgIle: 2.891 ± 0.674
ArgLys: 4.131 ± 0.418
ArgLeu: 5.37 ± 1.398
ArgMet: 0.413 ± 0.323
ArgAsn: 2.478 ± 1.051
ArgPro: 2.478 ± 0.953
ArgGln: 1.652 ± 0.671
ArgArg: 7.848 ± 2.975
ArgSer: 5.37 ± 1.71
ArgThr: 2.478 ± 0.736
ArgVal: 1.239 ± 1.102
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.065 ± 0.899
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 3.304 ± 0.859
SerCys: 1.652 ± 0.952
SerAsp: 2.478 ± 0.797
SerGlu: 3.717 ± 1.356
SerPhe: 3.717 ± 0.914
SerGly: 2.891 ± 0.692
SerHis: 1.239 ± 0.758
SerIle: 3.304 ± 0.821
SerLys: 2.065 ± 1.415
SerLeu: 6.196 ± 1.72
SerMet: 2.065 ± 0.598
SerAsn: 4.131 ± 1.563
SerPro: 5.37 ± 1.785
SerGln: 2.891 ± 1.073
SerArg: 4.131 ± 0.676
SerSer: 7.022 ± 1.296
SerThr: 7.022 ± 2.032
SerVal: 4.957 ± 1.801
SerTrp: 0.413 ± 0.384
SerTyr: 1.652 ± 0.777
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 1.652 ± 0.767
ThrCys: 0.826 ± 0.581
ThrAsp: 4.131 ± 0.713
ThrGlu: 2.891 ± 1.313
ThrPhe: 2.478 ± 1.367
ThrGly: 5.37 ± 1.295
ThrHis: 1.239 ± 0.714
ThrIle: 4.131 ± 1.881
ThrLys: 2.891 ± 0.792
ThrLeu: 4.131 ± 1.365
ThrMet: 1.239 ± 0.645
ThrAsn: 2.065 ± 0.743
ThrPro: 4.131 ± 1.766
ThrGln: 1.652 ± 0.633
ThrArg: 3.717 ± 0.607
ThrSer: 4.957 ± 1.334
ThrThr: 6.196 ± 1.582
ThrVal: 5.37 ± 1.129
ThrTrp: 1.239 ± 0.417
ThrTyr: 3.304 ± 1.262
ThrXaa: 0.0 ± 0.0

Val
ValAla: 2.065 ± 0.955
ValCys: 0.826 ± 0.52
ValAsp: 3.717 ± 1.592
ValGlu: 2.478 ± 0.847
ValPhe: 1.239 ± 0.415
ValGly: 4.957 ± 2.279
ValHis: 0.413 ± 0.367
ValIle: 2.891 ± 1.656
ValLys: 1.652 ± 0.471
ValLeu: 4.544 ± 0.644
ValMet: 1.239 ± 0.811
ValAsn: 2.478 ± 0.556
ValPro: 4.957 ± 1.84
ValGln: 3.304 ± 0.907
ValArg: 2.065 ± 0.624
ValSer: 3.304 ± 0.779
ValThr: 4.957 ± 0.793
ValVal: 2.478 ± 0.913
ValTrp: 1.239 ± 0.595
ValTyr: 1.239 ± 0.67
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.826 ± 0.385
TrpCys: 0.413 ± 0.436
TrpAsp: 0.413 ± 0.367
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.826 ± 0.455
TrpGly: 0.0 ± 0.0
TrpHis: 0.413 ± 0.323
TrpIle: 2.065 ± 1.186
TrpLys: 1.652 ± 0.753
TrpLeu: 0.413 ± 0.352
TrpMet: 0.413 ± 0.352
TrpAsn: 0.826 ± 0.735
TrpPro: 0.413 ± 0.367
TrpGln: 0.413 ± 0.367
TrpArg: 1.239 ± 0.595
TrpSer: 0.413 ± 0.384
TrpThr: 1.239 ± 0.661
TrpVal: 0.413 ± 0.367
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.652 ± 0.642
TyrCys: 2.065 ± 1.265
TyrAsp: 1.239 ± 0.856
TyrGlu: 2.478 ± 0.771
TyrPhe: 2.891 ± 0.844
TyrGly: 2.478 ± 0.967
TyrHis: 0.826 ± 0.767
TyrIle: 1.652 ± 1.076
TyrLys: 2.478 ± 0.977
TyrLeu: 2.478 ± 0.605
TyrMet: 0.413 ± 0.323
TyrAsn: 1.652 ± 0.701
TyrPro: 2.065 ± 0.57
TyrGln: 2.065 ± 0.915
TyrArg: 2.065 ± 0.786
TyrSer: 1.239 ± 0.415
TyrThr: 0.826 ± 0.426
TyrVal: 1.239 ± 0.645
TyrTrp: 0.0 ± 0.0
TyrTyr: 2.478 ± 0.567
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2422 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
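A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the estimates is reported. Using the standard deviation as the error, the one-letter codes, the fixed seed, and the function names are all assumptions of this sketch, not details confirmed by the page.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences (hypothetical, not the 7 HPV154 proteins).
toy_error = bootstrap_error(["MAAK", "AKL", "LLLL"], "AK")
```

With only a handful of proteins, as here, each resample can differ substantially from the original proteome, which is one reason the reported errors can be large relative to the values.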

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski