Amino acid dipepetide frequency for Human papillomavirus 5

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.545AlaAla: 2.545 ± 1.149
1.272AlaCys: 1.272 ± 0.92
4.771AlaAsp: 4.771 ± 0.692
4.453AlaGlu: 4.453 ± 1.299
2.226AlaPhe: 2.226 ± 0.793
2.863AlaGly: 2.863 ± 0.63
1.908AlaHis: 1.908 ± 0.839
1.59AlaIle: 1.59 ± 1.048
3.181AlaLys: 3.181 ± 1.054
3.817AlaLeu: 3.817 ± 0.955
1.59AlaMet: 1.59 ± 0.734
2.545AlaAsn: 2.545 ± 0.634
2.545AlaPro: 2.545 ± 0.77
3.499AlaGln: 3.499 ± 1.37
4.453AlaArg: 4.453 ± 1.243
1.59AlaSer: 1.59 ± 0.743
4.135AlaThr: 4.135 ± 0.985
4.453AlaVal: 4.453 ± 0.639
0.636AlaTrp: 0.636 ± 0.438
2.226AlaTyr: 2.226 ± 0.432
0.0AlaXaa: 0.0 ± 0.0
Cys
0.636CysAla: 0.636 ± 0.298
1.272CysCys: 1.272 ± 1.102
0.318CysAsp: 0.318 ± 0.26
1.272CysGlu: 1.272 ± 0.753
0.954CysPhe: 0.954 ± 0.364
1.272CysGly: 1.272 ± 1.138
0.0CysHis: 0.0 ± 0.0
0.636CysIle: 0.636 ± 0.298
2.545CysLys: 2.545 ± 0.799
0.954CysLeu: 0.954 ± 0.754
0.318CysMet: 0.318 ± 0.353
0.318CysAsn: 0.318 ± 0.36
1.272CysPro: 1.272 ± 0.597
0.318CysGln: 0.318 ± 0.36
2.226CysArg: 2.226 ± 0.929
0.636CysSer: 0.636 ± 0.52
0.954CysThr: 0.954 ± 0.414
0.318CysVal: 0.318 ± 0.36
0.318CysTrp: 0.318 ± 0.26
0.318CysTyr: 0.318 ± 0.272
0.0CysXaa: 0.0 ± 0.0
Asp
1.908AspAla: 1.908 ± 0.555
1.59AspCys: 1.59 ± 0.935
2.863AspAsp: 2.863 ± 0.952
2.226AspGlu: 2.226 ± 0.317
2.545AspPhe: 2.545 ± 0.657
2.545AspGly: 2.545 ± 0.505
0.318AspHis: 0.318 ± 0.271
5.725AspIle: 5.725 ± 1.899
2.863AspLys: 2.863 ± 1.129
4.453AspLeu: 4.453 ± 1.252
1.272AspMet: 1.272 ± 0.481
3.181AspAsn: 3.181 ± 0.828
3.499AspPro: 3.499 ± 0.679
2.863AspGln: 2.863 ± 1.119
1.59AspArg: 1.59 ± 0.857
4.135AspSer: 4.135 ± 0.389
4.135AspThr: 4.135 ± 0.523
3.181AspVal: 3.181 ± 1.018
1.59AspTrp: 1.59 ± 0.804
1.59AspTyr: 1.59 ± 0.816
0.0AspXaa: 0.0 ± 0.0
Glu
4.453GluAla: 4.453 ± 1.252
0.954GluCys: 0.954 ± 0.78
2.863GluAsp: 2.863 ± 0.811
6.361GluGlu: 6.361 ± 2.168
1.59GluPhe: 1.59 ± 0.935
6.679GluGly: 6.679 ± 3.972
0.954GluHis: 0.954 ± 0.47
2.226GluIle: 2.226 ± 1.094
1.272GluLys: 1.272 ± 0.76
4.135GluLeu: 4.135 ± 1.505
0.0GluMet: 0.0 ± 0.0
1.908GluAsn: 1.908 ± 0.528
3.181GluPro: 3.181 ± 1.05
3.181GluGln: 3.181 ± 1.177
2.863GluArg: 2.863 ± 0.809
4.135GluSer: 4.135 ± 1.507
3.817GluThr: 3.817 ± 1.079
5.089GluVal: 5.089 ± 1.237
0.636GluTrp: 0.636 ± 0.36
0.954GluTyr: 0.954 ± 0.816
0.0GluXaa: 0.0 ± 0.0
Phe
2.545PheAla: 2.545 ± 0.763
0.636PheCys: 0.636 ± 0.482
2.545PheAsp: 2.545 ± 0.659
2.863PheGlu: 2.863 ± 1.463
1.272PhePhe: 1.272 ± 0.481
1.59PheGly: 1.59 ± 0.525
0.954PheHis: 0.954 ± 0.624
1.59PheIle: 1.59 ± 0.539
1.908PheLys: 1.908 ± 0.835
2.545PheLeu: 2.545 ± 0.895
0.0PheMet: 0.0 ± 0.0
2.226PheAsn: 2.226 ± 0.812
1.272PhePro: 1.272 ± 0.724
1.272PheGln: 1.272 ± 0.412
1.908PheArg: 1.908 ± 0.827
3.499PheSer: 3.499 ± 0.644
0.318PheThr: 0.318 ± 0.26
1.59PheVal: 1.59 ± 0.608
1.272PheTrp: 1.272 ± 0.596
2.226PheTyr: 2.226 ± 0.89
0.0PheXaa: 0.0 ± 0.0
Gly
4.453GlyAla: 4.453 ± 0.664
0.954GlyCys: 0.954 ± 0.375
5.089GlyAsp: 5.089 ± 1.372
4.453GlyGlu: 4.453 ± 0.739
1.908GlyPhe: 1.908 ± 0.696
8.27GlyGly: 8.27 ± 2.589
4.771GlyHis: 4.771 ± 2.676
2.226GlyIle: 2.226 ± 0.743
1.59GlyLys: 1.59 ± 0.608
4.135GlyLeu: 4.135 ± 1.093
0.0GlyMet: 0.0 ± 0.0
2.863GlyAsn: 2.863 ± 0.679
6.361GlyPro: 6.361 ± 3.801
2.545GlyGln: 2.545 ± 0.441
10.178GlyArg: 10.178 ± 3.221
7.634GlySer: 7.634 ± 1.901
4.453GlyThr: 4.453 ± 0.839
3.499GlyVal: 3.499 ± 0.918
0.0GlyTrp: 0.0 ± 0.0
1.272GlyTyr: 1.272 ± 0.783
0.0GlyXaa: 0.0 ± 0.0
His
0.636HisAla: 0.636 ± 0.361
0.318HisCys: 0.318 ± 0.26
1.59HisAsp: 1.59 ± 0.711
1.59HisGlu: 1.59 ± 0.996
1.908HisPhe: 1.908 ± 0.636
0.954HisGly: 0.954 ± 0.516
1.272HisHis: 1.272 ± 1.374
0.954HisIle: 0.954 ± 0.436
2.226HisLys: 2.226 ± 0.874
1.272HisLeu: 1.272 ± 0.81
0.0HisMet: 0.0 ± 0.0
1.908HisAsn: 1.908 ± 0.689
2.545HisPro: 2.545 ± 1.29
1.59HisGln: 1.59 ± 0.832
1.272HisArg: 1.272 ± 0.394
1.59HisSer: 1.59 ± 0.467
1.908HisThr: 1.908 ± 0.774
1.272HisVal: 1.272 ± 0.702
0.954HisTrp: 0.954 ± 0.35
0.636HisTyr: 0.636 ± 0.52
0.0HisXaa: 0.0 ± 0.0
Ile
2.545IleAla: 2.545 ± 0.985
0.318IleCys: 0.318 ± 0.36
2.226IleAsp: 2.226 ± 0.89
2.545IleGlu: 2.545 ± 1.127
0.954IlePhe: 0.954 ± 0.553
3.499IleGly: 3.499 ± 1.172
0.954IleHis: 0.954 ± 0.532
2.545IleIle: 2.545 ± 1.104
2.226IleLys: 2.226 ± 1.117
2.863IleLeu: 2.863 ± 0.76
0.0IleMet: 0.0 ± 0.0
2.226IleAsn: 2.226 ± 0.726
2.226IlePro: 2.226 ± 1.083
1.908IleGln: 1.908 ± 1.15
1.908IleArg: 1.908 ± 0.919
3.181IleSer: 3.181 ± 0.559
1.59IleThr: 1.59 ± 0.593
2.226IleVal: 2.226 ± 0.415
0.636IleTrp: 0.636 ± 0.438
3.181IleTyr: 3.181 ± 0.825
0.0IleXaa: 0.0 ± 0.0
Lys
5.089LysAla: 5.089 ± 1.135
0.318LysCys: 0.318 ± 0.329
1.908LysAsp: 1.908 ± 1.064
3.499LysGlu: 3.499 ± 0.974
1.908LysPhe: 1.908 ± 0.976
5.407LysGly: 5.407 ± 0.964
1.59LysHis: 1.59 ± 0.621
1.272LysIle: 1.272 ± 0.553
1.908LysLys: 1.908 ± 0.819
4.135LysLeu: 4.135 ± 0.812
0.636LysMet: 0.636 ± 0.404
2.545LysAsn: 2.545 ± 1.027
0.954LysPro: 0.954 ± 1.057
2.226LysGln: 2.226 ± 0.375
3.817LysArg: 3.817 ± 0.433
3.181LysSer: 3.181 ± 1.535
2.226LysThr: 2.226 ± 0.42
4.771LysVal: 4.771 ± 2.198
0.636LysTrp: 0.636 ± 0.392
2.226LysTyr: 2.226 ± 0.432
0.0LysXaa: 0.0 ± 0.0
Leu
2.863LeuAla: 2.863 ± 1.171
1.272LeuCys: 1.272 ± 0.527
5.089LeuAsp: 5.089 ± 1.26
5.089LeuGlu: 5.089 ± 0.751
3.817LeuPhe: 3.817 ± 0.987
6.361LeuGly: 6.361 ± 1.065
2.545LeuHis: 2.545 ± 1.016
2.545LeuIle: 2.545 ± 0.924
3.817LeuLys: 3.817 ± 1.093
9.86LeuLeu: 9.86 ± 2.445
1.59LeuMet: 1.59 ± 0.593
0.954LeuAsn: 0.954 ± 0.624
3.817LeuPro: 3.817 ± 1.24
5.725LeuGln: 5.725 ± 1.246
2.863LeuArg: 2.863 ± 1.033
5.407LeuSer: 5.407 ± 1.69
6.043LeuThr: 6.043 ± 0.757
3.181LeuVal: 3.181 ± 1.35
0.636LeuTrp: 0.636 ± 0.298
0.954LeuTyr: 0.954 ± 0.579
0.0LeuXaa: 0.0 ± 0.0
Met
1.59MetAla: 1.59 ± 0.589
0.0MetCys: 0.0 ± 0.0
0.954MetAsp: 0.954 ± 0.354
0.318MetGlu: 0.318 ± 0.268
0.636MetPhe: 0.636 ± 0.544
0.954MetGly: 0.954 ± 0.397
0.0MetHis: 0.0 ± 0.0
0.318MetIle: 0.318 ± 0.299
0.954MetLys: 0.954 ± 0.537
1.272MetLeu: 1.272 ± 0.634
0.0MetMet: 0.0 ± 0.0
0.954MetAsn: 0.954 ± 0.557
0.0MetPro: 0.0 ± 0.0
0.318MetGln: 0.318 ± 0.272
0.318MetArg: 0.318 ± 0.26
1.272MetSer: 1.272 ± 0.724
0.636MetThr: 0.636 ± 0.386
0.954MetVal: 0.954 ± 0.568
0.318MetTrp: 0.318 ± 0.268
0.318MetTyr: 0.318 ± 0.272
0.0MetXaa: 0.0 ± 0.0
Asn
2.863AsnAla: 2.863 ± 0.741
0.954AsnCys: 0.954 ± 0.644
1.59AsnAsp: 1.59 ± 0.445
2.226AsnGlu: 2.226 ± 1.094
1.59AsnPhe: 1.59 ± 0.893
2.863AsnGly: 2.863 ± 1.352
0.954AsnHis: 0.954 ± 0.466
2.226AsnIle: 2.226 ± 0.538
1.59AsnLys: 1.59 ± 0.315
1.272AsnLeu: 1.272 ± 0.788
0.636AsnMet: 0.636 ± 0.494
1.59AsnAsn: 1.59 ± 0.481
3.181AsnPro: 3.181 ± 1.288
2.863AsnGln: 2.863 ± 1.11
2.863AsnArg: 2.863 ± 0.631
3.181AsnSer: 3.181 ± 1.006
3.181AsnThr: 3.181 ± 0.694
1.272AsnVal: 1.272 ± 0.513
0.0AsnTrp: 0.0 ± 0.0
0.954AsnTyr: 0.954 ± 0.466
0.0AsnXaa: 0.0 ± 0.0
Pro
3.181ProAla: 3.181 ± 1.059
1.59ProCys: 1.59 ± 0.657
5.089ProAsp: 5.089 ± 1.421
3.817ProGlu: 3.817 ± 1.535
0.954ProPhe: 0.954 ± 0.47
4.453ProGly: 4.453 ± 1.821
1.272ProHis: 1.272 ± 0.818
1.59ProIle: 1.59 ± 0.878
3.499ProLys: 3.499 ± 1.081
4.453ProLeu: 4.453 ± 1.276
0.318ProMet: 0.318 ± 0.26
2.545ProAsn: 2.545 ± 0.985
11.45ProPro: 11.45 ± 7.331
2.863ProGln: 2.863 ± 1.006
5.089ProArg: 5.089 ± 1.751
3.817ProSer: 3.817 ± 1.488
4.771ProThr: 4.771 ± 1.107
6.679ProVal: 6.679 ± 2.143
0.636ProTrp: 0.636 ± 0.425
1.59ProTyr: 1.59 ± 0.775
0.0ProXaa: 0.0 ± 0.0
Gln
3.181GlnAla: 3.181 ± 0.635
0.954GlnCys: 0.954 ± 0.47
3.181GlnAsp: 3.181 ± 0.907
1.272GlnGlu: 1.272 ± 0.63
1.908GlnPhe: 1.908 ± 0.848
2.545GlnGly: 2.545 ± 1.275
0.954GlnHis: 0.954 ± 0.305
2.545GlnIle: 2.545 ± 0.689
1.59GlnLys: 1.59 ± 0.628
5.407GlnLeu: 5.407 ± 0.815
1.59GlnMet: 1.59 ± 0.619
0.954GlnAsn: 0.954 ± 0.403
3.181GlnPro: 3.181 ± 0.919
5.089GlnGln: 5.089 ± 0.818
5.089GlnArg: 5.089 ± 1.524
1.59GlnSer: 1.59 ± 0.464
6.361GlnThr: 6.361 ± 1.329
2.226GlnVal: 2.226 ± 1.189
0.318GlnTrp: 0.318 ± 0.26
1.272GlnTyr: 1.272 ± 0.493
0.0GlnXaa: 0.0 ± 0.0
Arg
5.089ArgAla: 5.089 ± 1.114
0.636ArgCys: 0.636 ± 0.46
3.817ArgAsp: 3.817 ± 0.843
2.226ArgGlu: 2.226 ± 0.49
1.59ArgPhe: 1.59 ± 0.446
11.768ArgGly: 11.768 ± 3.866
1.272ArgHis: 1.272 ± 0.588
0.954ArgIle: 0.954 ± 0.693
4.135ArgLys: 4.135 ± 1.393
6.361ArgLeu: 6.361 ± 0.769
0.636ArgMet: 0.636 ± 0.38
3.499ArgAsn: 3.499 ± 0.672
3.499ArgPro: 3.499 ± 0.747
1.908ArgGln: 1.908 ± 0.48
9.542ArgArg: 9.542 ± 3.396
17.494ArgSer: 17.494 ± 8.063
1.272ArgThr: 1.272 ± 0.593
3.499ArgVal: 3.499 ± 1.03
0.318ArgTrp: 0.318 ± 0.344
2.863ArgTyr: 2.863 ± 0.744
0.0ArgXaa: 0.0 ± 0.0
Ser
3.181SerAla: 3.181 ± 0.815
0.636SerCys: 0.636 ± 0.38
2.863SerAsp: 2.863 ± 1.227
2.863SerGlu: 2.863 ± 1.075
3.181SerPhe: 3.181 ± 0.873
4.771SerGly: 4.771 ± 1.045
1.272SerHis: 1.272 ± 0.534
2.863SerIle: 2.863 ± 0.728
5.407SerLys: 5.407 ± 1.344
7.952SerLeu: 7.952 ± 1.086
0.954SerMet: 0.954 ± 0.375
1.272SerAsn: 1.272 ± 0.596
5.725SerPro: 5.725 ± 2.184
3.499SerGln: 3.499 ± 0.947
13.041SerArg: 13.041 ± 6.065
11.45SerSer: 11.45 ± 3.833
9.542SerThr: 9.542 ± 2.838
4.135SerVal: 4.135 ± 0.829
0.954SerTrp: 0.954 ± 0.397
1.908SerTyr: 1.908 ± 0.592
0.0SerXaa: 0.0 ± 0.0
Thr
3.181ThrAla: 3.181 ± 1.095
2.226ThrCys: 2.226 ± 0.445
2.545ThrAsp: 2.545 ± 0.966
4.453ThrGlu: 4.453 ± 0.849
1.908ThrPhe: 1.908 ± 0.545
4.453ThrGly: 4.453 ± 1.491
0.954ThrHis: 0.954 ± 0.399
2.226ThrIle: 2.226 ± 0.99
2.226ThrLys: 2.226 ± 0.492
2.545ThrLeu: 2.545 ± 1.117
0.318ThrMet: 0.318 ± 0.268
3.181ThrAsn: 3.181 ± 0.617
7.316ThrPro: 7.316 ± 1.636
4.135ThrGln: 4.135 ± 0.981
6.997ThrArg: 6.997 ± 1.731
6.997ThrSer: 6.997 ± 2.408
9.224ThrThr: 9.224 ± 3.589
5.089ThrVal: 5.089 ± 1.558
0.636ThrTrp: 0.636 ± 0.38
1.908ThrTyr: 1.908 ± 0.815
0.0ThrXaa: 0.0 ± 0.0
Val
3.817ValAla: 3.817 ± 0.663
0.636ValCys: 0.636 ± 0.474
3.817ValAsp: 3.817 ± 0.786
3.817ValGlu: 3.817 ± 1.099
2.226ValPhe: 2.226 ± 0.691
2.545ValGly: 2.545 ± 0.459
2.863ValHis: 2.863 ± 1.206
2.863ValIle: 2.863 ± 0.884
3.181ValLys: 3.181 ± 0.618
3.181ValLeu: 3.181 ± 0.735
0.0ValMet: 0.0 ± 0.0
1.908ValAsn: 1.908 ± 0.555
5.725ValPro: 5.725 ± 2.062
3.181ValGln: 3.181 ± 1.218
5.089ValArg: 5.089 ± 1.121
4.771ValSer: 4.771 ± 0.995
4.453ValThr: 4.453 ± 1.868
1.908ValVal: 1.908 ± 0.992
0.636ValTrp: 0.636 ± 0.544
1.908ValTyr: 1.908 ± 0.994
0.0ValXaa: 0.0 ± 0.0
Trp
0.954TrpAla: 0.954 ± 0.508
0.318TrpCys: 0.318 ± 0.26
0.636TrpAsp: 0.636 ± 0.544
0.636TrpGlu: 0.636 ± 0.392
0.0TrpPhe: 0.0 ± 0.0
0.318TrpGly: 0.318 ± 0.344
0.636TrpHis: 0.636 ± 0.343
0.954TrpIle: 0.954 ± 0.78
1.59TrpLys: 1.59 ± 0.842
0.954TrpLeu: 0.954 ± 0.496
0.318TrpMet: 0.318 ± 0.322
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.636TrpGln: 0.636 ± 0.343
0.0TrpArg: 0.0 ± 0.0
0.954TrpSer: 0.954 ± 0.397
0.954TrpThr: 0.954 ± 0.579
1.272TrpVal: 1.272 ± 0.553
0.0TrpTrp: 0.0 ± 0.0
0.318TrpTyr: 0.318 ± 0.26
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.59TyrAla: 1.59 ± 0.593
0.318TyrCys: 0.318 ± 0.36
0.318TyrAsp: 0.318 ± 0.268
1.272TyrGlu: 1.272 ± 0.76
0.954TyrPhe: 0.954 ± 0.816
2.545TyrGly: 2.545 ± 0.749
0.954TyrHis: 0.954 ± 0.479
1.908TyrIle: 1.908 ± 1.285
2.545TyrLys: 2.545 ± 0.881
2.863TyrLeu: 2.863 ± 0.908
1.59TyrMet: 1.59 ± 0.683
1.272TyrAsn: 1.272 ± 1.087
1.908TyrPro: 1.908 ± 0.411
1.59TyrGln: 1.59 ± 0.435
1.272TyrArg: 1.272 ± 0.452
0.954TyrSer: 0.954 ± 0.553
2.545TyrThr: 2.545 ± 0.774
1.908TyrVal: 1.908 ± 0.802
0.318TyrTrp: 0.318 ± 0.352
2.226TyrTyr: 2.226 ± 1.013
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (3145 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski