Amino acid dipeptide frequency for Human papillomavirus 31

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
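As an illustration, a minimal Python sketch of such a calculation is given below. It is not the Proteome-pI implementation: the function name `dipeptide_frequencies`, the one-letter sequence encoding, and the normalization by the total number of overlapping pairs are all assumptions made for this example.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille for a list of protein sequences.

    Assumption: pairs are counted in overlapping windows of length 2 and
    normalized by the total number of pairs across all proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Slide over adjacent residues; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the HPV31 proteome).
toy = ["MKKLA", "MKA"]
freqs = dipeptide_frequencies(toy)
```

With the toy input, six pairs are counted in total (MK twice; KK, KL, LA and KA once each), so the per mille values sum to 1000.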

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
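The uniform expectation can be checked with a short Python sketch. The helper names `expected_per_mille` and `classify` are hypothetical, and which baseline (400 or 441 dipeptides) the table's colouring actually uses is not stated here, so the 20-letter alphabet is only an assumed default.

```python
def expected_per_mille(alphabet_size=20):
    """Frequency (in per mille) of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Label a value relative to the uniform expectation (assumed baseline)."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "expected"


baseline_20 = expected_per_mille(20)  # 400 dipeptides
baseline_21 = expected_per_mille(21)  # 441 dipeptides, Xaa included

# Two values taken from the table: SerThr (13.158) and AlaHis (0.0).
ser_thr = classify(13.158)
ala_his = classify(0.0)
```

Under this assumption, a table value above 2.5‰ counts as overrepresented and a value below it as underrepresented.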

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
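For example, the unit conversion can be written as a small Python sketch (the function names are illustrative only):

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


# AlaAla is listed as 4.644 per mille in the table.
ala_ala_fraction = per_mille_to_fraction(4.644)  # about 0.004644
ala_ala_percent = per_mille_to_percent(4.644)    # about 0.4644 %
```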

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.644 ± 0.923
AlaCys: 2.322 ± 0.724
AlaAsp: 3.483 ± 1.051
AlaGlu: 4.257 ± 1.561
AlaPhe: 1.161 ± 0.729
AlaGly: 4.644 ± 0.95
AlaHis: 0.0 ± 0.0
AlaIle: 3.87 ± 1.31
AlaLys: 2.709 ± 1.245
AlaLeu: 5.031 ± 1.267
AlaMet: 0.774 ± 0.369
AlaAsn: 2.709 ± 1.074
AlaPro: 4.644 ± 2.095
AlaGln: 2.322 ± 1.006
AlaArg: 1.935 ± 0.79
AlaSer: 1.935 ± 1.209
AlaThr: 6.579 ± 2.235
AlaVal: 2.709 ± 0.782
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.548 ± 0.809
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.935 ± 0.586
CysCys: 0.774 ± 0.72
CysAsp: 1.161 ± 0.645
CysGlu: 0.387 ± 0.493
CysPhe: 1.161 ± 1.509
CysGly: 1.548 ± 0.625
CysHis: 0.387 ± 0.322
CysIle: 3.483 ± 1.325
CysLys: 2.709 ± 1.087
CysLeu: 2.709 ± 0.894
CysMet: 0.387 ± 0.322
CysAsn: 1.548 ± 0.882
CysPro: 2.709 ± 0.652
CysGln: 1.548 ± 0.769
CysArg: 0.774 ± 0.645
CysSer: 1.161 ± 0.624
CysThr: 2.709 ± 1.458
CysVal: 3.096 ± 0.98
CysTrp: 1.161 ± 0.504
CysTyr: 0.387 ± 0.505
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.935 ± 0.586
AspCys: 0.774 ± 0.365
AspAsp: 2.709 ± 0.887
AspGlu: 2.709 ± 0.821
AspPhe: 3.096 ± 0.901
AspGly: 4.257 ± 1.03
AspHis: 0.387 ± 0.375
AspIle: 3.87 ± 1.543
AspLys: 2.322 ± 1.328
AspLeu: 3.096 ± 1.536
AspMet: 0.774 ± 0.365
AspAsn: 2.322 ± 1.186
AspPro: 3.483 ± 1.717
AspGln: 1.161 ± 0.656
AspArg: 0.774 ± 0.645
AspSer: 5.031 ± 2.074
AspThr: 6.192 ± 2.409
AspVal: 3.87 ± 1.616
AspTrp: 1.548 ± 0.596
AspTyr: 2.322 ± 1.06
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.483 ± 1.056
GluCys: 0.774 ± 0.447
GluAsp: 3.483 ± 1.269
GluGlu: 4.257 ± 0.943
GluPhe: 1.161 ± 0.504
GluGly: 2.709 ± 1.292
GluHis: 2.709 ± 0.682
GluIle: 1.935 ± 0.678
GluLys: 2.322 ± 1.119
GluLeu: 4.257 ± 1.415
GluMet: 0.774 ± 0.455
GluAsn: 3.483 ± 1.099
GluPro: 1.548 ± 0.816
GluGln: 2.322 ± 0.782
GluArg: 1.161 ± 0.729
GluSer: 1.935 ± 1.007
GluThr: 5.031 ± 1.042
GluVal: 2.322 ± 0.924
GluTrp: 0.387 ± 0.322
GluTyr: 1.161 ± 0.71
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.161 ± 0.78
PheCys: 1.161 ± 1.187
PheAsp: 2.709 ± 0.526
PheGlu: 0.387 ± 0.322
PhePhe: 1.935 ± 0.716
PheGly: 2.709 ± 0.974
PheHis: 0.774 ± 0.503
PheIle: 1.935 ± 0.477
PheLys: 3.483 ± 1.438
PheLeu: 4.644 ± 1.335
PheMet: 0.387 ± 0.322
PheAsn: 1.161 ± 0.639
PhePro: 3.096 ± 0.704
PheGln: 0.774 ± 0.365
PheArg: 0.774 ± 0.526
PheSer: 2.322 ± 0.552
PheThr: 2.709 ± 0.851
PheVal: 3.096 ± 1.099
PheTrp: 0.774 ± 0.365
PheTyr: 1.935 ± 0.588
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.322 ± 1.32
GlyCys: 1.548 ± 0.596
GlyAsp: 4.644 ± 1.0
GlyGlu: 3.096 ± 1.034
GlyPhe: 1.935 ± 0.716
GlyGly: 2.709 ± 1.073
GlyHis: 1.935 ± 0.951
GlyIle: 3.87 ± 1.204
GlyLys: 3.483 ± 1.071
GlyLeu: 3.483 ± 1.969
GlyMet: 1.548 ± 1.29
GlyAsn: 1.935 ± 0.779
GlyPro: 1.161 ± 0.729
GlyGln: 3.096 ± 1.207
GlyArg: 2.322 ± 1.04
GlySer: 4.644 ± 1.84
GlyThr: 5.031 ± 0.934
GlyVal: 4.257 ± 0.941
GlyTrp: 0.387 ± 0.322
GlyTyr: 2.322 ± 0.595
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.709 ± 0.748
HisCys: 0.387 ± 0.505
HisAsp: 0.0 ± 0.0
HisGlu: 1.548 ± 0.579
HisPhe: 0.774 ± 0.365
HisGly: 1.548 ± 0.595
HisHis: 0.774 ± 0.578
HisIle: 1.161 ± 1.124
HisLys: 1.161 ± 0.555
HisLeu: 2.322 ± 0.691
HisMet: 0.0 ± 0.0
HisAsn: 2.322 ± 1.01
HisPro: 2.322 ± 0.872
HisGln: 0.387 ± 0.375
HisArg: 1.548 ± 0.71
HisSer: 1.935 ± 0.863
HisThr: 0.774 ± 0.653
HisVal: 0.0 ± 0.0
HisTrp: 1.161 ± 0.611
HisTyr: 1.935 ± 0.816
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.096 ± 1.36
IleCys: 1.935 ± 0.996
IleAsp: 3.483 ± 1.186
IleGlu: 3.483 ± 1.361
IlePhe: 1.548 ± 0.868
IleGly: 1.935 ± 0.606
IleHis: 2.322 ± 0.751
IleIle: 1.548 ± 0.536
IleLys: 0.774 ± 0.365
IleLeu: 4.257 ± 0.988
IleMet: 1.161 ± 0.587
IleAsn: 1.161 ± 0.442
IlePro: 4.644 ± 2.185
IleGln: 0.774 ± 0.365
IleArg: 2.709 ± 0.811
IleSer: 6.966 ± 1.569
IleThr: 3.483 ± 0.67
IleVal: 6.192 ± 1.584
IleTrp: 0.0 ± 0.0
IleTyr: 2.322 ± 1.118
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.483 ± 1.012
LysCys: 2.322 ± 1.028
LysAsp: 1.161 ± 0.593
LysGlu: 1.935 ± 0.979
LysPhe: 3.096 ± 1.516
LysGly: 3.096 ± 1.225
LysHis: 1.935 ± 1.115
LysIle: 3.096 ± 0.538
LysLys: 3.483 ± 1.536
LysLeu: 3.87 ± 1.672
LysMet: 0.774 ± 0.683
LysAsn: 3.483 ± 1.213
LysPro: 1.548 ± 1.163
LysGln: 2.709 ± 0.883
LysArg: 6.192 ± 1.249
LysSer: 3.87 ± 1.817
LysThr: 2.709 ± 0.804
LysVal: 2.322 ± 0.703
LysTrp: 0.774 ± 0.578
LysTyr: 2.709 ± 0.76
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 1.935 ± 0.779
LeuCys: 3.483 ± 1.248
LeuAsp: 3.87 ± 0.928
LeuGlu: 4.257 ± 1.921
LeuPhe: 2.709 ± 1.186
LeuGly: 4.257 ± 1.291
LeuHis: 3.096 ± 1.161
LeuIle: 3.096 ± 1.518
LeuLys: 6.579 ± 1.704
LeuLeu: 8.514 ± 2.977
LeuMet: 2.709 ± 1.241
LeuAsn: 3.483 ± 1.039
LeuPro: 2.322 ± 0.86
LeuGln: 6.966 ± 2.086
LeuArg: 5.805 ± 0.991
LeuSer: 7.74 ± 1.939
LeuThr: 3.483 ± 1.567
LeuVal: 2.322 ± 1.028
LeuTrp: 0.774 ± 0.573
LeuTyr: 4.257 ± 1.095
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.161 ± 0.623
MetCys: 0.387 ± 0.322
MetAsp: 1.548 ± 0.567
MetGlu: 0.774 ± 0.455
MetPhe: 0.774 ± 0.526
MetGly: 1.548 ± 0.533
MetHis: 0.387 ± 0.375
MetIle: 0.387 ± 0.503
MetLys: 0.0 ± 0.0
MetLeu: 2.709 ± 1.492
MetMet: 0.387 ± 0.375
MetAsn: 0.387 ± 0.35
MetPro: 0.0 ± 0.0
MetGln: 1.548 ± 0.298
MetArg: 0.774 ± 0.568
MetSer: 1.935 ± 0.93
MetThr: 1.161 ± 0.611
MetVal: 2.709 ± 1.283
MetTrp: 0.0 ± 0.0
MetTyr: 0.387 ± 0.375
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.709 ± 0.624
AsnCys: 3.096 ± 1.176
AsnAsp: 1.935 ± 0.891
AsnGlu: 1.548 ± 0.625
AsnPhe: 1.161 ± 0.71
AsnGly: 1.935 ± 1.188
AsnHis: 0.387 ± 0.375
AsnIle: 5.418 ± 1.343
AsnLys: 3.096 ± 0.856
AsnLeu: 0.774 ± 0.365
AsnMet: 0.387 ± 0.35
AsnAsn: 3.096 ± 0.538
AsnPro: 3.87 ± 0.935
AsnGln: 1.548 ± 0.505
AsnArg: 1.548 ± 0.963
AsnSer: 3.87 ± 1.46
AsnThr: 5.805 ± 2.511
AsnVal: 1.548 ± 0.298
AsnTrp: 0.774 ± 0.455
AsnTyr: 1.161 ± 0.555
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.966 ± 2.5
ProCys: 0.774 ± 0.365
ProAsp: 3.483 ± 1.43
ProGlu: 2.709 ± 0.962
ProPhe: 1.935 ± 0.779
ProGly: 1.161 ± 0.696
ProHis: 1.161 ± 0.656
ProIle: 3.096 ± 1.752
ProLys: 3.87 ± 1.497
ProLeu: 6.966 ± 2.175
ProMet: 0.387 ± 0.372
ProAsn: 3.096 ± 1.101
ProPro: 3.87 ± 1.307
ProGln: 0.774 ± 0.434
ProArg: 1.935 ± 1.075
ProSer: 3.87 ± 2.281
ProThr: 6.579 ± 2.476
ProVal: 2.709 ± 0.81
ProTrp: 0.774 ± 0.986
ProTyr: 2.322 ± 0.995
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.709 ± 0.964
GlnCys: 0.774 ± 0.583
GlnAsp: 1.548 ± 0.887
GlnGlu: 1.161 ± 0.741
GlnPhe: 1.161 ± 0.71
GlnGly: 1.161 ± 0.587
GlnHis: 0.387 ± 0.322
GlnIle: 1.161 ± 0.347
GlnLys: 0.774 ± 0.7
GlnLeu: 5.031 ± 1.001
GlnMet: 2.322 ± 0.871
GlnAsn: 0.387 ± 0.322
GlnPro: 3.096 ± 0.952
GlnGln: 2.709 ± 1.062
GlnArg: 2.709 ± 1.347
GlnSer: 2.709 ± 1.594
GlnThr: 3.87 ± 0.656
GlnVal: 5.031 ± 1.404
GlnTrp: 0.774 ± 0.645
GlnTyr: 0.387 ± 0.35
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.096 ± 0.952
ArgCys: 1.548 ± 0.948
ArgAsp: 1.548 ± 0.741
ArgGlu: 1.548 ± 0.62
ArgPhe: 1.548 ± 0.86
ArgGly: 1.548 ± 0.816
ArgHis: 1.548 ± 0.707
ArgIle: 1.161 ± 0.741
ArgLys: 1.935 ± 0.588
ArgLeu: 6.966 ± 1.076
ArgMet: 0.0 ± 0.0
ArgAsn: 1.935 ± 0.625
ArgPro: 5.031 ± 1.532
ArgGln: 1.161 ± 0.967
ArgArg: 4.257 ± 1.41
ArgSer: 3.096 ± 1.164
ArgThr: 3.87 ± 0.811
ArgVal: 1.548 ± 0.868
ArgTrp: 1.548 ± 0.995
ArgTyr: 2.709 ± 0.648
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.418 ± 1.76
SerCys: 1.935 ± 0.735
SerAsp: 5.805 ± 1.622
SerGlu: 3.096 ± 0.683
SerPhe: 4.257 ± 1.663
SerGly: 5.805 ± 2.518
SerHis: 0.774 ± 0.404
SerIle: 4.644 ± 2.324
SerLys: 4.644 ± 1.387
SerLeu: 2.709 ± 0.526
SerMet: 1.935 ± 0.877
SerAsn: 5.418 ± 1.444
SerPro: 2.322 ± 0.789
SerGln: 3.096 ± 0.849
SerArg: 2.322 ± 0.985
SerSer: 5.418 ± 1.493
SerThr: 13.158 ± 3.242
SerVal: 7.353 ± 2.322
SerTrp: 0.387 ± 0.322
SerTyr: 1.935 ± 0.909
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.257 ± 0.547
ThrCys: 3.483 ± 1.068
ThrAsp: 4.644 ± 0.775
ThrGlu: 3.87 ± 1.721
ThrPhe: 3.096 ± 1.029
ThrGly: 5.805 ± 1.454
ThrHis: 2.709 ± 1.152
ThrIle: 2.322 ± 0.804
ThrLys: 4.257 ± 1.394
ThrLeu: 6.192 ± 0.969
ThrMet: 1.161 ± 0.769
ThrAsn: 3.483 ± 0.745
ThrPro: 7.74 ± 2.049
ThrGln: 3.096 ± 0.944
ThrArg: 3.096 ± 0.737
ThrSer: 9.675 ± 2.694
ThrThr: 10.449 ± 2.362
ThrVal: 7.74 ± 1.624
ThrTrp: 1.161 ± 0.555
ThrTyr: 2.709 ± 1.064
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.548 ± 0.509
ValCys: 2.322 ± 1.04
ValAsp: 3.87 ± 0.76
ValGlu: 2.322 ± 1.016
ValPhe: 3.096 ± 0.731
ValGly: 3.483 ± 1.608
ValHis: 1.548 ± 1.09
ValIle: 4.644 ± 2.073
ValLys: 2.709 ± 0.846
ValLeu: 4.257 ± 1.788
ValMet: 1.161 ± 0.706
ValAsn: 2.322 ± 1.049
ValPro: 3.87 ± 1.412
ValGln: 2.709 ± 0.959
ValArg: 2.709 ± 0.625
ValSer: 11.997 ± 2.864
ValThr: 4.644 ± 1.413
ValVal: 3.096 ± 1.125
ValTrp: 0.387 ± 0.35
ValTyr: 3.096 ± 1.411
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.161 ± 0.587
TrpCys: 0.774 ± 0.645
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.774 ± 0.447
TrpPhe: 0.387 ± 0.322
TrpGly: 1.161 ± 0.639
TrpHis: 1.161 ± 0.555
TrpIle: 0.774 ± 0.645
TrpLys: 1.548 ± 0.797
TrpLeu: 0.387 ± 0.322
TrpMet: 0.387 ± 0.35
TrpAsn: 0.387 ± 0.35
TrpPro: 0.387 ± 0.322
TrpGln: 0.0 ± 0.0
TrpArg: 1.161 ± 0.504
TrpSer: 0.0 ± 0.0
TrpThr: 1.935 ± 1.02
TrpVal: 0.387 ± 0.503
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.161 ± 0.666
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.935 ± 0.637
TyrCys: 1.161 ± 0.729
TyrAsp: 1.548 ± 0.715
TyrGlu: 3.096 ± 1.064
TyrPhe: 1.935 ± 0.53
TyrGly: 2.709 ± 0.739
TyrHis: 0.387 ± 0.35
TyrIle: 2.322 ± 0.707
TyrLys: 2.709 ± 1.039
TyrLeu: 3.483 ± 1.042
TyrMet: 1.161 ± 0.626
TyrAsn: 1.548 ± 0.753
TyrPro: 0.774 ± 0.606
TyrGln: 1.161 ± 0.609
TyrArg: 3.096 ± 1.249
TyrSer: 2.322 ± 1.099
TyrThr: 0.774 ± 0.749
TyrVal: 3.483 ± 0.698
TyrTrp: 1.161 ± 0.416
TyrTyr: 2.322 ± 1.049
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (2585 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
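A protein-level bootstrap of this kind might look like the following Python sketch. It is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the spread is reported as a population standard deviation. Whether the ± values in the table are standard deviations is not stated in the note, and the function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(sequences):
    """Per mille frequency of every overlapping dipeptide in the sequences."""
    counts = Counter(
        seq[i:i + 2] for seq in sequences for i in range(len(seq) - 1)
    )
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Spread of one dipeptide frequency over protein-level resamples.

    Assumption: the error is the standard deviation across replicates.
    """
    rng = random.Random(seed)
    values = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        values.append(dipeptide_per_mille(sample).get(pair, 0.0))
    return statistics.pstdev(values)


# Toy proteome (hypothetical sequences, not the HPV31 proteins).
toy = ["MKKLA", "MKA", "AKKM"]
kk_error = bootstrap_error(toy, "KK")
ww_error = bootstrap_error(toy, "WW")  # pair never occurs in the toy data
```

Resampling whole proteins rather than individual residues keeps the within-protein composition intact. A dipeptide that never occurs yields 0.0 in every replicate and therefore an error of 0.0, which matches the 0.0 ± 0.0 pattern of absent dipeptides in the table.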

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski