Amino acid dipepetide frequency for Human papillomavirus 68

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.873AlaAla: 3.873 ± 1.189
0.861AlaCys: 0.861 ± 0.88
3.442AlaAsp: 3.442 ± 1.475
2.151AlaGlu: 2.151 ± 0.989
3.442AlaPhe: 3.442 ± 1.39
2.582AlaGly: 2.582 ± 1.083
0.43AlaHis: 0.43 ± 0.394
3.873AlaIle: 3.873 ± 1.033
2.582AlaLys: 2.582 ± 1.004
4.733AlaLeu: 4.733 ± 1.091
1.721AlaMet: 1.721 ± 0.966
1.291AlaAsn: 1.291 ± 0.78
3.873AlaPro: 3.873 ± 1.843
3.873AlaGln: 3.873 ± 1.315
3.012AlaArg: 3.012 ± 0.85
3.012AlaSer: 3.012 ± 0.993
6.454AlaThr: 6.454 ± 1.279
1.721AlaVal: 1.721 ± 0.954
0.43AlaTrp: 0.43 ± 0.394
1.721AlaTyr: 1.721 ± 0.73
0.0AlaXaa: 0.0 ± 0.0
Cys
1.291CysAla: 1.291 ± 0.855
1.721CysCys: 1.721 ± 1.506
0.43CysAsp: 0.43 ± 0.348
0.43CysGlu: 0.43 ± 0.348
0.861CysPhe: 0.861 ± 0.456
1.291CysGly: 1.291 ± 0.711
0.43CysHis: 0.43 ± 0.585
2.582CysIle: 2.582 ± 0.839
2.582CysLys: 2.582 ± 0.591
1.721CysLeu: 1.721 ± 0.728
1.721CysMet: 1.721 ± 1.172
2.151CysAsn: 2.151 ± 1.473
2.582CysPro: 2.582 ± 0.969
1.721CysGln: 1.721 ± 0.953
1.721CysArg: 1.721 ± 2.38
1.291CysSer: 1.291 ± 0.711
2.582CysThr: 2.582 ± 0.982
3.012CysVal: 3.012 ± 1.537
1.291CysTrp: 1.291 ± 0.756
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.582AspAla: 2.582 ± 1.219
2.151AspCys: 2.151 ± 0.947
3.012AspAsp: 3.012 ± 1.254
4.733AspGlu: 4.733 ± 3.112
2.151AspPhe: 2.151 ± 0.757
4.733AspGly: 4.733 ± 0.915
1.291AspHis: 1.291 ± 0.924
4.303AspIle: 4.303 ± 1.735
2.151AspLys: 2.151 ± 1.142
3.442AspLeu: 3.442 ± 2.44
1.291AspMet: 1.291 ± 0.663
3.442AspAsn: 3.442 ± 1.185
3.012AspPro: 3.012 ± 1.275
0.861AspGln: 0.861 ± 0.435
0.861AspArg: 0.861 ± 0.773
7.315AspSer: 7.315 ± 1.663
8.176AspThr: 8.176 ± 1.504
3.442AspVal: 3.442 ± 1.251
1.291AspTrp: 1.291 ± 0.663
1.721AspTyr: 1.721 ± 1.255
0.0AspXaa: 0.0 ± 0.0
Glu
0.861GluAla: 0.861 ± 0.667
0.0GluCys: 0.0 ± 0.0
3.442GluAsp: 3.442 ± 1.843
1.721GluGlu: 1.721 ± 0.817
1.291GluPhe: 1.291 ± 0.867
3.012GluGly: 3.012 ± 1.624
0.861GluHis: 0.861 ± 0.516
2.582GluIle: 2.582 ± 1.682
2.151GluLys: 2.151 ± 0.892
5.164GluLeu: 5.164 ± 0.857
0.0GluMet: 0.0 ± 0.0
3.873GluAsn: 3.873 ± 1.532
4.303GluPro: 4.303 ± 2.018
2.151GluGln: 2.151 ± 1.078
1.721GluArg: 1.721 ± 0.979
3.012GluSer: 3.012 ± 0.889
3.442GluThr: 3.442 ± 1.668
3.873GluVal: 3.873 ± 0.802
0.861GluTrp: 0.861 ± 0.456
0.861GluTyr: 0.861 ± 0.85
0.0GluXaa: 0.0 ± 0.0
Phe
3.012PheAla: 3.012 ± 1.771
1.291PheCys: 1.291 ± 0.673
2.582PheAsp: 2.582 ± 1.422
1.291PheGlu: 1.291 ± 0.78
2.582PhePhe: 2.582 ± 0.831
1.291PheGly: 1.291 ± 0.756
1.291PheHis: 1.291 ± 1.613
1.721PheIle: 1.721 ± 0.812
3.873PheLys: 3.873 ± 1.515
5.594PheLeu: 5.594 ± 1.496
0.861PheMet: 0.861 ± 0.667
0.861PheAsn: 0.861 ± 0.435
1.721PhePro: 1.721 ± 0.966
0.43PheGln: 0.43 ± 0.425
0.861PheArg: 0.861 ± 0.435
2.582PheSer: 2.582 ± 1.487
2.582PheThr: 2.582 ± 1.495
2.582PheVal: 2.582 ± 1.357
1.291PheTrp: 1.291 ± 0.786
0.43PheTyr: 0.43 ± 0.828
0.0PheXaa: 0.0 ± 0.0
Gly
1.721GlyAla: 1.721 ± 0.395
1.291GlyCys: 1.291 ± 0.455
6.454GlyAsp: 6.454 ± 1.598
2.151GlyGlu: 2.151 ± 0.881
0.861GlyPhe: 0.861 ± 0.457
2.582GlyGly: 2.582 ± 1.078
1.721GlyHis: 1.721 ± 1.255
4.733GlyIle: 4.733 ± 1.28
3.012GlyLys: 3.012 ± 0.869
3.012GlyLeu: 3.012 ± 0.81
0.861GlyMet: 0.861 ± 0.516
2.151GlyAsn: 2.151 ± 0.744
2.582GlyPro: 2.582 ± 0.996
0.861GlyGln: 0.861 ± 0.88
2.151GlyArg: 2.151 ± 1.171
2.151GlySer: 2.151 ± 0.645
9.897GlyThr: 9.897 ± 4.525
4.303GlyVal: 4.303 ± 1.503
0.43GlyTrp: 0.43 ± 0.348
2.151GlyTyr: 2.151 ± 0.645
0.0GlyXaa: 0.0 ± 0.0
His
0.43HisAla: 0.43 ± 0.585
0.861HisCys: 0.861 ± 0.883
0.861HisAsp: 0.861 ± 0.832
0.43HisGlu: 0.43 ± 0.585
1.291HisPhe: 1.291 ± 0.663
2.582HisGly: 2.582 ± 0.906
0.43HisHis: 0.43 ± 0.585
1.291HisIle: 1.291 ± 0.516
1.291HisLys: 1.291 ± 1.106
1.291HisLeu: 1.291 ± 0.79
0.0HisMet: 0.0 ± 0.0
2.582HisAsn: 2.582 ± 1.083
1.721HisPro: 1.721 ± 0.804
1.721HisGln: 1.721 ± 1.173
1.291HisArg: 1.291 ± 0.78
0.861HisSer: 0.861 ± 0.695
1.721HisThr: 1.721 ± 0.889
1.291HisVal: 1.291 ± 0.673
0.861HisTrp: 0.861 ± 0.516
2.151HisTyr: 2.151 ± 1.141
0.0HisXaa: 0.0 ± 0.0
Ile
3.442IleAla: 3.442 ± 1.02
1.721IleCys: 1.721 ± 0.886
3.442IleAsp: 3.442 ± 0.806
2.582IleGlu: 2.582 ± 1.142
1.291IlePhe: 1.291 ± 0.41
3.012IleGly: 3.012 ± 1.727
1.721IleHis: 1.721 ± 0.625
2.151IleIle: 2.151 ± 0.522
3.012IleLys: 3.012 ± 1.087
3.873IleLeu: 3.873 ± 1.695
0.0IleMet: 0.0 ± 0.0
1.721IleAsn: 1.721 ± 0.704
3.012IlePro: 3.012 ± 1.495
2.582IleGln: 2.582 ± 1.308
3.873IleArg: 3.873 ± 1.396
4.303IleSer: 4.303 ± 1.3
4.733IleThr: 4.733 ± 0.925
3.873IleVal: 3.873 ± 1.391
0.0IleTrp: 0.0 ± 0.0
2.582IleTyr: 2.582 ± 1.063
0.0IleXaa: 0.0 ± 0.0
Lys
1.721LysAla: 1.721 ± 0.395
3.442LysCys: 3.442 ± 1.725
2.582LysAsp: 2.582 ± 1.117
1.291LysGlu: 1.291 ± 0.86
3.442LysPhe: 3.442 ± 1.433
2.151LysGly: 2.151 ± 0.799
1.721LysHis: 1.721 ± 0.73
2.582LysIle: 2.582 ± 1.554
3.012LysLys: 3.012 ± 0.814
2.582LysLeu: 2.582 ± 2.563
0.43LysMet: 0.43 ± 0.386
2.582LysAsn: 2.582 ± 1.005
2.582LysPro: 2.582 ± 1.113
3.012LysGln: 3.012 ± 0.755
6.024LysArg: 6.024 ± 0.985
3.442LysSer: 3.442 ± 1.519
3.873LysThr: 3.873 ± 1.932
2.582LysVal: 2.582 ± 1.061
0.43LysTrp: 0.43 ± 0.392
1.721LysTyr: 1.721 ± 0.625
0.0LysXaa: 0.0 ± 0.0
Leu
3.012LeuAla: 3.012 ± 1.186
2.151LeuCys: 2.151 ± 1.138
5.594LeuAsp: 5.594 ± 1.25
4.303LeuGlu: 4.303 ± 1.637
3.012LeuPhe: 3.012 ± 1.612
3.442LeuGly: 3.442 ± 1.405
2.582LeuHis: 2.582 ± 0.922
2.582LeuIle: 2.582 ± 2.09
4.733LeuLys: 4.733 ± 1.287
6.454LeuLeu: 6.454 ± 2.078
1.291LeuMet: 1.291 ± 0.676
2.582LeuAsn: 2.582 ± 1.725
4.303LeuPro: 4.303 ± 1.583
8.606LeuGln: 8.606 ± 1.209
3.442LeuArg: 3.442 ± 1.668
4.733LeuSer: 4.733 ± 1.465
4.303LeuThr: 4.303 ± 1.162
5.164LeuVal: 5.164 ± 2.911
0.861LeuTrp: 0.861 ± 0.516
4.733LeuTyr: 4.733 ± 0.79
0.0LeuXaa: 0.0 ± 0.0
Met
1.721MetAla: 1.721 ± 0.728
1.291MetCys: 1.291 ± 0.79
1.291MetAsp: 1.291 ± 0.614
0.43MetGlu: 0.43 ± 0.348
0.861MetPhe: 0.861 ± 0.477
0.0MetGly: 0.0 ± 0.0
1.721MetHis: 1.721 ± 1.16
0.861MetIle: 0.861 ± 0.7
0.861MetLys: 0.861 ± 0.784
2.582MetLeu: 2.582 ± 2.09
0.43MetMet: 0.43 ± 0.392
0.861MetAsn: 0.861 ± 0.516
0.861MetPro: 0.861 ± 0.695
0.861MetGln: 0.861 ± 0.457
0.43MetArg: 0.43 ± 0.348
2.151MetSer: 2.151 ± 1.197
0.43MetThr: 0.43 ± 0.392
2.582MetVal: 2.582 ± 1.061
0.43MetTrp: 0.43 ± 0.638
0.861MetTyr: 0.861 ± 0.682
0.0MetXaa: 0.0 ± 0.0
Asn
3.012AsnAla: 3.012 ± 1.358
1.721AsnCys: 1.721 ± 1.022
2.151AsnAsp: 2.151 ± 1.142
1.291AsnGlu: 1.291 ± 0.671
2.582AsnPhe: 2.582 ± 1.399
3.012AsnGly: 3.012 ± 1.269
0.43AsnHis: 0.43 ± 0.585
3.012AsnIle: 3.012 ± 1.446
3.442AsnLys: 3.442 ± 1.388
1.721AsnLeu: 1.721 ± 1.147
1.291AsnMet: 1.291 ± 0.663
1.291AsnAsn: 1.291 ± 0.612
4.303AsnPro: 4.303 ± 0.954
0.861AsnGln: 0.861 ± 0.435
2.151AsnArg: 2.151 ± 1.066
3.012AsnSer: 3.012 ± 0.924
6.024AsnThr: 6.024 ± 1.229
3.442AsnVal: 3.442 ± 1.053
0.43AsnTrp: 0.43 ± 0.348
1.291AsnTyr: 1.291 ± 0.516
0.0AsnXaa: 0.0 ± 0.0
Pro
6.024ProAla: 6.024 ± 2.722
0.861ProCys: 0.861 ± 0.667
5.594ProAsp: 5.594 ± 1.272
1.721ProGlu: 1.721 ± 0.635
0.861ProPhe: 0.861 ± 0.435
0.861ProGly: 0.861 ± 0.477
0.0ProHis: 0.0 ± 0.0
3.442ProIle: 3.442 ± 1.335
3.012ProLys: 3.012 ± 0.939
7.315ProLeu: 7.315 ± 1.774
2.582ProMet: 2.582 ± 1.21
3.012ProAsn: 3.012 ± 0.954
4.733ProPro: 4.733 ± 1.609
1.291ProGln: 1.291 ± 0.78
1.291ProArg: 1.291 ± 0.531
5.594ProSer: 5.594 ± 2.572
4.733ProThr: 4.733 ± 1.267
4.303ProVal: 4.303 ± 1.869
0.43ProTrp: 0.43 ± 0.585
2.582ProTyr: 2.582 ± 1.104
0.0ProXaa: 0.0 ± 0.0
Gln
3.012GlnAla: 3.012 ± 1.549
2.151GlnCys: 2.151 ± 1.102
2.151GlnAsp: 2.151 ± 0.799
2.151GlnGlu: 2.151 ± 0.987
1.291GlnPhe: 1.291 ± 0.786
1.291GlnGly: 1.291 ± 0.663
0.861GlnHis: 0.861 ± 0.644
0.861GlnIle: 0.861 ± 0.457
1.721GlnLys: 1.721 ± 0.73
6.024GlnLeu: 6.024 ± 2.506
1.721GlnMet: 1.721 ± 0.589
0.861GlnAsn: 0.861 ± 0.456
2.151GlnPro: 2.151 ± 0.747
1.721GlnGln: 1.721 ± 0.816
2.582GlnArg: 2.582 ± 1.09
2.582GlnSer: 2.582 ± 1.235
2.151GlnThr: 2.151 ± 0.522
3.012GlnVal: 3.012 ± 1.459
1.291GlnTrp: 1.291 ± 0.711
1.291GlnTyr: 1.291 ± 0.455
0.0GlnXaa: 0.0 ± 0.0
Arg
2.151ArgAla: 2.151 ± 1.216
0.861ArgCys: 0.861 ± 0.773
2.582ArgAsp: 2.582 ± 1.065
3.012ArgGlu: 3.012 ± 1.641
1.291ArgPhe: 1.291 ± 0.87
3.012ArgGly: 3.012 ± 1.304
3.012ArgHis: 3.012 ± 1.714
0.861ArgIle: 0.861 ± 0.435
3.873ArgLys: 3.873 ± 1.263
5.164ArgLeu: 5.164 ± 1.479
0.43ArgMet: 0.43 ± 0.638
1.721ArgAsn: 1.721 ± 1.022
2.582ArgPro: 2.582 ± 0.815
2.151ArgGln: 2.151 ± 1.513
5.164ArgArg: 5.164 ± 3.27
2.582ArgSer: 2.582 ± 0.796
3.873ArgThr: 3.873 ± 2.318
2.151ArgVal: 2.151 ± 1.212
0.43ArgTrp: 0.43 ± 0.348
2.582ArgTyr: 2.582 ± 0.966
0.0ArgXaa: 0.0 ± 0.0
Ser
4.303SerAla: 4.303 ± 1.717
1.721SerCys: 1.721 ± 0.728
3.442SerAsp: 3.442 ± 1.129
3.873SerGlu: 3.873 ± 1.052
1.721SerPhe: 1.721 ± 0.932
5.594SerGly: 5.594 ± 2.802
1.291SerHis: 1.291 ± 0.78
3.442SerIle: 3.442 ± 1.02
3.012SerLys: 3.012 ± 1.593
3.873SerLeu: 3.873 ± 0.765
2.582SerMet: 2.582 ± 0.894
3.873SerAsn: 3.873 ± 1.082
4.303SerPro: 4.303 ± 0.954
0.861SerGln: 0.861 ± 0.516
4.303SerArg: 4.303 ± 1.576
7.315SerSer: 7.315 ± 2.325
9.036SerThr: 9.036 ± 2.954
4.733SerVal: 4.733 ± 0.79
0.43SerTrp: 0.43 ± 0.392
2.582SerTyr: 2.582 ± 1.326
0.0SerXaa: 0.0 ± 0.0
Thr
3.873ThrAla: 3.873 ± 1.289
2.582ThrCys: 2.582 ± 0.736
5.164ThrAsp: 5.164 ± 1.649
5.164ThrGlu: 5.164 ± 1.168
3.012ThrPhe: 3.012 ± 1.59
7.745ThrGly: 7.745 ± 2.267
2.151ThrHis: 2.151 ± 1.147
6.024ThrIle: 6.024 ± 1.023
3.012ThrLys: 3.012 ± 1.225
6.885ThrLeu: 6.885 ± 1.759
1.721ThrMet: 1.721 ± 0.395
3.873ThrAsn: 3.873 ± 0.829
5.594ThrPro: 5.594 ± 2.805
3.012ThrGln: 3.012 ± 0.863
2.151ThrArg: 2.151 ± 0.969
7.315ThrSer: 7.315 ± 3.15
9.036ThrThr: 9.036 ± 2.403
7.745ThrVal: 7.745 ± 1.694
1.291ThrTrp: 1.291 ± 0.776
2.582ThrTyr: 2.582 ± 1.245
0.0ThrXaa: 0.0 ± 0.0
Val
3.442ValAla: 3.442 ± 1.128
3.012ValCys: 3.012 ± 1.847
4.733ValAsp: 4.733 ± 0.738
3.442ValGlu: 3.442 ± 0.55
3.873ValPhe: 3.873 ± 2.434
3.012ValGly: 3.012 ± 1.475
1.291ValHis: 1.291 ± 0.843
1.721ValIle: 1.721 ± 1.148
0.861ValLys: 0.861 ± 0.456
3.873ValLeu: 3.873 ± 1.688
0.861ValMet: 0.861 ± 0.78
3.873ValAsn: 3.873 ± 0.858
5.164ValPro: 5.164 ± 1.088
3.012ValGln: 3.012 ± 0.923
3.012ValArg: 3.012 ± 1.083
6.885ValSer: 6.885 ± 2.371
3.873ValThr: 3.873 ± 0.93
3.873ValVal: 3.873 ± 1.282
2.151ValTrp: 2.151 ± 1.307
5.164ValTyr: 5.164 ± 3.506
0.0ValXaa: 0.0 ± 0.0
Trp
1.291TrpAla: 1.291 ± 0.786
1.291TrpCys: 1.291 ± 0.671
0.43TrpAsp: 0.43 ± 0.392
0.0TrpGlu: 0.0 ± 0.0
0.861TrpPhe: 0.861 ± 0.695
0.861TrpGly: 0.861 ± 0.784
1.291TrpHis: 1.291 ± 0.812
1.721TrpIle: 1.721 ± 0.852
0.861TrpLys: 0.861 ± 0.456
0.861TrpLeu: 0.861 ± 0.435
0.0TrpMet: 0.0 ± 0.0
2.151TrpAsn: 2.151 ± 1.171
0.43TrpPro: 0.43 ± 0.348
0.0TrpGln: 0.0 ± 0.0
0.861TrpArg: 0.861 ± 0.435
0.43TrpSer: 0.43 ± 0.348
1.291TrpThr: 1.291 ± 0.87
0.861TrpVal: 0.861 ± 0.695
0.0TrpTrp: 0.0 ± 0.0
0.861TrpTyr: 0.861 ± 0.456
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.303TyrAla: 4.303 ± 1.188
0.43TyrCys: 0.43 ± 0.828
2.582TyrAsp: 2.582 ± 1.254
3.012TyrGlu: 3.012 ± 1.729
2.151TyrPhe: 2.151 ± 0.522
3.012TyrGly: 3.012 ± 0.755
0.43TyrHis: 0.43 ± 0.394
2.582TyrIle: 2.582 ± 1.547
2.151TyrLys: 2.151 ± 0.614
2.151TyrLeu: 2.151 ± 1.081
1.291TyrMet: 1.291 ± 0.776
1.721TyrAsn: 1.721 ± 1.031
0.43TyrPro: 0.43 ± 0.425
1.291TyrGln: 1.291 ± 0.455
2.582TyrArg: 2.582 ± 1.083
1.721TyrSer: 1.721 ± 1.062
1.721TyrThr: 1.721 ± 1.393
2.582TyrVal: 2.582 ± 0.815
1.721TyrTrp: 1.721 ± 0.73
3.442TyrTyr: 3.442 ± 1.313
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2325 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski