Amino acid dipepetide frequency for Gammapapillomavirus 16

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.595AlaAla: 4.595 ± 2.477
1.671AlaCys: 1.671 ± 1.064
2.924AlaAsp: 2.924 ± 0.568
4.177AlaGlu: 4.177 ± 0.887
2.924AlaPhe: 2.924 ± 0.568
1.671AlaGly: 1.671 ± 1.043
0.418AlaHis: 0.418 ± 0.344
3.759AlaIle: 3.759 ± 0.583
4.595AlaLys: 4.595 ± 1.083
5.013AlaLeu: 5.013 ± 2.033
0.835AlaMet: 0.835 ± 0.663
3.342AlaAsn: 3.342 ± 0.726
3.759AlaPro: 3.759 ± 0.964
1.671AlaGln: 1.671 ± 0.63
3.342AlaArg: 3.342 ± 1.964
3.759AlaSer: 3.759 ± 0.734
3.759AlaThr: 3.759 ± 0.538
3.342AlaVal: 3.342 ± 0.648
0.835AlaTrp: 0.835 ± 0.687
2.506AlaTyr: 2.506 ± 1.187
0.0AlaXaa: 0.0 ± 0.0
Cys
0.418CysAla: 0.418 ± 0.528
0.835CysCys: 0.835 ± 0.671
1.671CysAsp: 1.671 ± 1.342
0.835CysGlu: 0.835 ± 0.675
1.671CysPhe: 1.671 ± 1.137
0.418CysGly: 0.418 ± 0.337
0.418CysHis: 0.418 ± 0.337
1.671CysIle: 1.671 ± 1.957
2.089CysLys: 2.089 ± 1.365
2.506CysLeu: 2.506 ± 1.902
0.0CysMet: 0.0 ± 0.0
0.418CysAsn: 0.418 ± 0.337
1.671CysPro: 1.671 ± 0.984
2.506CysGln: 2.506 ± 0.618
0.835CysArg: 0.835 ± 0.434
1.671CysSer: 1.671 ± 1.521
1.671CysThr: 1.671 ± 0.601
0.418CysVal: 0.418 ± 0.528
0.418CysTrp: 0.418 ± 0.337
1.671CysTyr: 1.671 ± 1.521
0.0CysXaa: 0.0 ± 0.0
Asp
3.342AspAla: 3.342 ± 0.75
2.506AspCys: 2.506 ± 1.259
5.43AspAsp: 5.43 ± 1.966
4.177AspGlu: 4.177 ± 1.184
1.253AspPhe: 1.253 ± 0.579
2.089AspGly: 2.089 ± 1.056
0.418AspHis: 0.418 ± 0.337
4.595AspIle: 4.595 ± 1.703
1.671AspLys: 1.671 ± 1.375
7.101AspLeu: 7.101 ± 1.701
0.835AspMet: 0.835 ± 0.41
4.177AspAsn: 4.177 ± 1.285
3.759AspPro: 3.759 ± 1.038
3.342AspGln: 3.342 ± 0.918
2.506AspArg: 2.506 ± 1.379
2.506AspSer: 2.506 ± 0.67
5.43AspThr: 5.43 ± 1.649
4.595AspVal: 4.595 ± 2.092
0.835AspTrp: 0.835 ± 0.393
1.671AspTyr: 1.671 ± 0.603
0.0AspXaa: 0.0 ± 0.0
Glu
5.848GluAla: 5.848 ± 1.307
0.0GluCys: 0.0 ± 0.0
4.595GluAsp: 4.595 ± 1.072
7.519GluGlu: 7.519 ± 4.017
2.924GluPhe: 2.924 ± 1.394
2.506GluGly: 2.506 ± 1.321
2.506GluHis: 2.506 ± 0.649
4.595GluIle: 4.595 ± 1.576
2.924GluLys: 2.924 ± 2.132
4.595GluLeu: 4.595 ± 1.909
1.253GluMet: 1.253 ± 0.642
7.101GluAsn: 7.101 ± 1.926
4.177GluPro: 4.177 ± 0.734
3.342GluGln: 3.342 ± 1.756
2.089GluArg: 2.089 ± 0.695
3.759GluSer: 3.759 ± 1.392
2.506GluThr: 2.506 ± 0.524
2.924GluVal: 2.924 ± 0.587
0.835GluTrp: 0.835 ± 0.675
0.835GluTyr: 0.835 ± 0.674
0.0GluXaa: 0.0 ± 0.0
Phe
1.671PheAla: 1.671 ± 0.603
0.835PheCys: 0.835 ± 0.671
2.924PheAsp: 2.924 ± 1.238
2.506PheGlu: 2.506 ± 1.58
2.506PhePhe: 2.506 ± 0.735
2.924PheGly: 2.924 ± 1.237
0.418PheHis: 0.418 ± 0.344
1.253PheIle: 1.253 ± 0.728
2.506PheLys: 2.506 ± 1.198
4.177PheLeu: 4.177 ± 1.287
1.253PheMet: 1.253 ± 0.632
2.089PheAsn: 2.089 ± 0.683
1.671PhePro: 1.671 ± 0.997
2.924PheGln: 2.924 ± 1.215
3.342PheArg: 3.342 ± 0.574
2.506PheSer: 2.506 ± 0.901
2.506PheThr: 2.506 ± 0.934
1.671PheVal: 1.671 ± 0.603
0.835PheTrp: 0.835 ± 0.41
2.089PheTyr: 2.089 ± 0.467
0.0PheXaa: 0.0 ± 0.0
Gly
2.924GlyAla: 2.924 ± 0.892
1.253GlyCys: 1.253 ± 0.444
2.506GlyAsp: 2.506 ± 1.347
2.089GlyGlu: 2.089 ± 0.742
0.0GlyPhe: 0.0 ± 0.0
5.013GlyGly: 5.013 ± 1.779
1.671GlyHis: 1.671 ± 0.803
3.342GlyIle: 3.342 ± 0.885
2.924GlyLys: 2.924 ± 0.837
6.683GlyLeu: 6.683 ± 2.019
1.253GlyMet: 1.253 ± 0.822
5.013GlyAsn: 5.013 ± 0.879
1.253GlyPro: 1.253 ± 0.444
1.671GlyGln: 1.671 ± 0.652
2.924GlyArg: 2.924 ± 1.191
6.266GlySer: 6.266 ± 1.289
2.924GlyThr: 2.924 ± 1.523
0.835GlyVal: 0.835 ± 0.393
0.0GlyTrp: 0.0 ± 0.0
2.089GlyTyr: 2.089 ± 1.019
0.0GlyXaa: 0.0 ± 0.0
His
2.089HisAla: 2.089 ± 0.714
0.418HisCys: 0.418 ± 0.337
0.418HisAsp: 0.418 ± 0.536
1.671HisGlu: 1.671 ± 0.315
1.671HisPhe: 1.671 ± 0.84
0.418HisGly: 0.418 ± 0.337
0.418HisHis: 0.418 ± 0.33
0.418HisIle: 0.418 ± 0.33
0.835HisLys: 0.835 ± 0.41
1.671HisLeu: 1.671 ± 0.671
0.418HisMet: 0.418 ± 0.49
1.671HisAsn: 1.671 ± 0.681
2.089HisPro: 2.089 ± 1.023
1.671HisGln: 1.671 ± 1.157
1.253HisArg: 1.253 ± 1.126
1.671HisSer: 1.671 ± 0.749
1.671HisThr: 1.671 ± 0.478
0.418HisVal: 0.418 ± 0.33
1.253HisTrp: 1.253 ± 0.398
0.835HisTyr: 0.835 ± 0.734
0.0HisXaa: 0.0 ± 0.0
Ile
2.506IleAla: 2.506 ± 0.735
0.835IleCys: 0.835 ± 0.725
5.43IleAsp: 5.43 ± 1.245
6.266IleGlu: 6.266 ± 0.944
2.506IlePhe: 2.506 ± 0.663
4.595IleGly: 4.595 ± 1.737
1.671IleHis: 1.671 ± 1.012
2.506IleIle: 2.506 ± 0.716
1.671IleLys: 1.671 ± 0.971
4.177IleLeu: 4.177 ± 1.421
0.418IleMet: 0.418 ± 0.337
2.506IleAsn: 2.506 ± 0.571
3.342IlePro: 3.342 ± 1.113
0.418IleGln: 0.418 ± 0.337
1.671IleArg: 1.671 ± 1.291
4.595IleSer: 4.595 ± 1.006
2.924IleThr: 2.924 ± 0.931
3.342IleVal: 3.342 ± 1.265
0.0IleTrp: 0.0 ± 0.0
2.924IleTyr: 2.924 ± 0.723
0.0IleXaa: 0.0 ± 0.0
Lys
2.089LysAla: 2.089 ± 0.697
2.089LysCys: 2.089 ± 1.923
2.506LysAsp: 2.506 ± 0.726
3.342LysGlu: 3.342 ± 1.051
1.253LysPhe: 1.253 ± 0.668
2.506LysGly: 2.506 ± 0.702
1.671LysHis: 1.671 ± 1.064
2.089LysIle: 2.089 ± 0.687
2.506LysLys: 2.506 ± 0.899
5.848LysLeu: 5.848 ± 1.271
0.418LysMet: 0.418 ± 0.397
3.342LysAsn: 3.342 ± 0.91
1.253LysPro: 1.253 ± 0.68
2.924LysGln: 2.924 ± 0.638
5.43LysArg: 5.43 ± 1.024
3.342LysSer: 3.342 ± 1.505
3.342LysThr: 3.342 ± 1.107
4.177LysVal: 4.177 ± 0.795
0.418LysTrp: 0.418 ± 0.344
1.671LysTyr: 1.671 ± 0.876
0.0LysXaa: 0.0 ± 0.0
Leu
6.266LeuAla: 6.266 ± 0.54
1.671LeuCys: 1.671 ± 0.898
6.266LeuAsp: 6.266 ± 1.19
6.683LeuGlu: 6.683 ± 2.095
5.848LeuPhe: 5.848 ± 1.39
5.43LeuGly: 5.43 ± 1.716
3.342LeuHis: 3.342 ± 1.713
5.43LeuIle: 5.43 ± 0.642
5.43LeuLys: 5.43 ± 1.043
10.025LeuLeu: 10.025 ± 2.615
2.089LeuMet: 2.089 ± 1.163
2.506LeuAsn: 2.506 ± 1.241
5.43LeuPro: 5.43 ± 1.549
3.759LeuGln: 3.759 ± 0.636
2.924LeuArg: 2.924 ± 1.086
10.025LeuSer: 10.025 ± 2.273
5.013LeuThr: 5.013 ± 1.163
4.177LeuVal: 4.177 ± 3.012
0.418LeuTrp: 0.418 ± 0.337
5.43LeuTyr: 5.43 ± 1.522
0.0LeuXaa: 0.0 ± 0.0
Met
0.835MetAla: 0.835 ± 0.41
0.418MetCys: 0.418 ± 0.344
0.835MetAsp: 0.835 ± 0.438
0.418MetGlu: 0.418 ± 0.671
0.835MetPhe: 0.835 ± 0.438
0.835MetGly: 0.835 ± 0.41
0.0MetHis: 0.0 ± 0.0
0.418MetIle: 0.418 ± 0.337
1.253MetLys: 1.253 ± 0.642
0.835MetLeu: 0.835 ± 0.438
0.0MetMet: 0.0 ± 0.0
2.506MetAsn: 2.506 ± 0.534
1.253MetPro: 1.253 ± 0.55
1.253MetGln: 1.253 ± 0.448
1.253MetArg: 1.253 ± 0.655
1.671MetSer: 1.671 ± 0.597
1.253MetThr: 1.253 ± 0.398
2.089MetVal: 2.089 ± 1.289
0.0MetTrp: 0.0 ± 0.0
0.418MetTyr: 0.418 ± 0.397
0.0MetXaa: 0.0 ± 0.0
Asn
3.759AsnAla: 3.759 ± 1.362
1.253AsnCys: 1.253 ± 0.674
2.506AsnAsp: 2.506 ± 0.895
3.342AsnGlu: 3.342 ± 1.551
2.089AsnPhe: 2.089 ± 0.697
3.759AsnGly: 3.759 ± 0.987
0.835AsnHis: 0.835 ± 0.438
5.43AsnIle: 5.43 ± 1.236
3.759AsnLys: 3.759 ± 1.514
6.266AsnLeu: 6.266 ± 1.278
0.418AsnMet: 0.418 ± 0.337
2.506AsnAsn: 2.506 ± 1.108
3.342AsnPro: 3.342 ± 1.33
2.506AsnGln: 2.506 ± 1.635
4.595AsnArg: 4.595 ± 1.151
2.924AsnSer: 2.924 ± 0.837
4.177AsnThr: 4.177 ± 0.97
2.089AsnVal: 2.089 ± 0.885
0.835AsnTrp: 0.835 ± 0.438
1.253AsnTyr: 1.253 ± 0.674
0.0AsnXaa: 0.0 ± 0.0
Pro
4.595ProAla: 4.595 ± 1.415
0.835ProCys: 0.835 ± 0.725
4.177ProAsp: 4.177 ± 1.522
4.177ProGlu: 4.177 ± 1.269
2.089ProPhe: 2.089 ± 0.806
2.089ProGly: 2.089 ± 1.32
0.418ProHis: 0.418 ± 0.536
2.506ProIle: 2.506 ± 1.583
2.506ProLys: 2.506 ± 0.934
9.19ProLeu: 9.19 ± 1.371
0.418ProMet: 0.418 ± 0.536
1.671ProAsn: 1.671 ± 0.315
6.266ProPro: 6.266 ± 1.951
2.089ProGln: 2.089 ± 1.323
0.835ProArg: 0.835 ± 0.438
4.595ProSer: 4.595 ± 1.577
6.266ProThr: 6.266 ± 2.059
1.253ProVal: 1.253 ± 0.375
0.0ProTrp: 0.0 ± 0.0
2.506ProTyr: 2.506 ± 0.779
0.0ProXaa: 0.0 ± 0.0
Gln
1.253GlnAla: 1.253 ± 0.398
0.418GlnCys: 0.418 ± 0.671
2.089GlnAsp: 2.089 ± 0.808
2.089GlnGlu: 2.089 ± 0.822
1.671GlnPhe: 1.671 ± 0.749
2.506GlnGly: 2.506 ± 0.649
1.671GlnHis: 1.671 ± 0.597
2.089GlnIle: 2.089 ± 0.767
1.253GlnLys: 1.253 ± 1.092
6.683GlnLeu: 6.683 ± 1.639
1.671GlnMet: 1.671 ± 0.602
3.342GlnAsn: 3.342 ± 1.104
2.089GlnPro: 2.089 ± 0.697
2.089GlnGln: 2.089 ± 0.567
2.089GlnArg: 2.089 ± 1.037
1.253GlnSer: 1.253 ± 0.448
1.671GlnThr: 1.671 ± 0.63
2.089GlnVal: 2.089 ± 0.902
0.835GlnTrp: 0.835 ± 0.438
0.835GlnTyr: 0.835 ± 0.687
0.0GlnXaa: 0.0 ± 0.0
Arg
3.759ArgAla: 3.759 ± 1.055
1.671ArgCys: 1.671 ± 1.276
2.924ArgAsp: 2.924 ± 0.811
2.089ArgGlu: 2.089 ± 0.841
1.253ArgPhe: 1.253 ± 0.668
3.759ArgGly: 3.759 ± 1.271
2.089ArgHis: 2.089 ± 1.05
1.671ArgIle: 1.671 ± 0.774
4.595ArgLys: 4.595 ± 0.659
5.43ArgLeu: 5.43 ± 1.63
0.835ArgMet: 0.835 ± 0.41
3.342ArgAsn: 3.342 ± 1.528
2.924ArgPro: 2.924 ± 1.032
0.835ArgGln: 0.835 ± 0.578
7.101ArgArg: 7.101 ± 2.772
3.342ArgSer: 3.342 ± 1.375
1.671ArgThr: 1.671 ± 0.852
2.924ArgVal: 2.924 ± 1.542
0.0ArgTrp: 0.0 ± 0.0
1.671ArgTyr: 1.671 ± 0.656
0.0ArgXaa: 0.0 ± 0.0
Ser
3.759SerAla: 3.759 ± 1.354
1.253SerCys: 1.253 ± 1.082
4.177SerAsp: 4.177 ± 1.041
5.848SerGlu: 5.848 ± 1.72
2.506SerPhe: 2.506 ± 0.905
3.342SerGly: 3.342 ± 0.835
2.089SerHis: 2.089 ± 1.258
2.506SerIle: 2.506 ± 0.966
2.924SerLys: 2.924 ± 1.259
4.595SerLeu: 4.595 ± 1.574
1.253SerMet: 1.253 ± 0.744
5.43SerAsn: 5.43 ± 2.236
4.595SerPro: 4.595 ± 0.902
2.089SerGln: 2.089 ± 0.763
3.759SerArg: 3.759 ± 1.216
8.772SerSer: 8.772 ± 2.539
6.266SerThr: 6.266 ± 1.537
4.177SerVal: 4.177 ± 0.663
0.835SerTrp: 0.835 ± 0.41
2.924SerTyr: 2.924 ± 0.89
0.0SerXaa: 0.0 ± 0.0
Thr
2.506ThrAla: 2.506 ± 1.052
2.089ThrCys: 2.089 ± 1.008
5.013ThrAsp: 5.013 ± 0.931
4.177ThrGlu: 4.177 ± 0.728
2.924ThrPhe: 2.924 ± 1.086
5.013ThrGly: 5.013 ± 1.299
0.835ThrHis: 0.835 ± 0.399
3.342ThrIle: 3.342 ± 1.308
2.506ThrLys: 2.506 ± 0.649
6.266ThrLeu: 6.266 ± 1.738
2.506ThrMet: 2.506 ± 1.626
2.924ThrAsn: 2.924 ± 0.638
3.759ThrPro: 3.759 ± 1.035
1.671ThrGln: 1.671 ± 0.946
2.924ThrArg: 2.924 ± 1.202
5.013ThrSer: 5.013 ± 2.125
4.177ThrThr: 4.177 ± 1.641
4.595ThrVal: 4.595 ± 1.71
1.253ThrTrp: 1.253 ± 0.822
1.253ThrTyr: 1.253 ± 0.444
0.0ThrXaa: 0.0 ± 0.0
Val
3.759ValAla: 3.759 ± 0.697
0.835ValCys: 0.835 ± 0.49
3.759ValAsp: 3.759 ± 0.917
3.342ValGlu: 3.342 ± 0.927
1.671ValPhe: 1.671 ± 0.591
2.506ValGly: 2.506 ± 0.961
1.671ValHis: 1.671 ± 0.671
2.506ValIle: 2.506 ± 1.178
3.342ValLys: 3.342 ± 1.056
3.759ValLeu: 3.759 ± 1.061
0.418ValMet: 0.418 ± 0.337
1.671ValAsn: 1.671 ± 0.627
4.595ValPro: 4.595 ± 1.211
1.671ValGln: 1.671 ± 0.774
2.506ValArg: 2.506 ± 0.905
2.089ValSer: 2.089 ± 0.659
3.759ValThr: 3.759 ± 1.142
2.089ValVal: 2.089 ± 0.806
2.089ValTrp: 2.089 ± 0.989
2.089ValTyr: 2.089 ± 0.508
0.0ValXaa: 0.0 ± 0.0
Trp
0.835TrpAla: 0.835 ± 0.393
0.418TrpCys: 0.418 ± 0.344
0.418TrpAsp: 0.418 ± 0.344
0.0TrpGlu: 0.0 ± 0.0
1.253TrpPhe: 1.253 ± 0.674
0.418TrpGly: 0.418 ± 0.344
0.418TrpHis: 0.418 ± 0.397
1.671TrpIle: 1.671 ± 0.963
0.418TrpLys: 0.418 ± 0.337
1.253TrpLeu: 1.253 ± 0.668
0.418TrpMet: 0.418 ± 0.397
0.0TrpAsn: 0.0 ± 0.0
0.418TrpPro: 0.418 ± 0.344
0.418TrpGln: 0.418 ± 0.344
0.835TrpArg: 0.835 ± 0.725
0.0TrpSer: 0.0 ± 0.0
1.671TrpThr: 1.671 ± 0.681
0.835TrpVal: 0.835 ± 0.434
0.0TrpTrp: 0.0 ± 0.0
0.418TrpTyr: 0.418 ± 0.397
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.089TyrAla: 2.089 ± 0.644
2.506TyrCys: 2.506 ± 1.657
1.671TyrAsp: 1.671 ± 0.315
2.089TyrGlu: 2.089 ± 1.674
4.177TyrPhe: 4.177 ± 1.752
0.835TyrGly: 0.835 ± 0.41
0.0TyrHis: 0.0 ± 0.0
2.506TyrIle: 2.506 ± 0.548
2.506TyrLys: 2.506 ± 1.06
2.089TyrLeu: 2.089 ± 0.714
1.253TyrMet: 1.253 ± 0.807
2.089TyrAsn: 2.089 ± 0.713
0.835TyrPro: 0.835 ± 0.652
0.835TyrGln: 0.835 ± 0.434
1.671TyrArg: 1.671 ± 0.867
3.342TyrSer: 3.342 ± 1.066
2.089TyrThr: 2.089 ± 0.869
2.089TyrVal: 2.089 ± 0.713
0.418TyrTrp: 0.418 ± 0.344
2.089TyrTyr: 2.089 ± 1.019
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2395 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski