Amino acid dipeptide frequency for Human papillomavirus 157

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs in the proteome.
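As a rough illustration of the calculation, the sketch below counts overlapping pairs of adjacent residues in a set of protein sequences and reports them in per mille. The function name `dipeptide_permille`, the toy sequences, and the choice to normalize by the total number of pairs are assumptions made for this example; the exact normalization used by Proteome-pI is not stated on this page.

```python
from collections import Counter

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for overlapping residue pairs.

    Assumption: frequencies are normalized by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Each sequence is scanned separately, so no pair spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (one-letter codes), not taken from the HPV 157 proteome.
freqs = dipeptide_permille(["MKKA", "AKK"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

The toy input contains five pairs in total (MK, KK, KA from the first sequence and AK, KK from the second), so the frequencies sum to 1000‰.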

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
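The arithmetic behind the expected value can be sketched as follows. The helper names and the simple threshold rule are assumptions for illustration; in particular, this page does not state which expectation the red/blue marking is actually based on, so the uniform 1/400 baseline is used here only because it is the one described in the text.

```python
STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes

def expected_permille(alphabet=STANDARD):
    """Uniform expectation: every ordered pair is equally likely."""
    n_pairs = len(alphabet) ** 2  # 400 for 20 residues, 441 with Xaa
    return 1000.0 / n_pairs

def classify(observed_permille, alphabet=STANDARD):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    expected = expected_permille(alphabet)
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"

print(expected_permille())                # 20 residues -> 400 pairs
print(expected_permille(STANDARD + "X"))  # 21 residues -> 441 pairs
print(classify(6.231))                    # AlaAla value from the table
print(classify(0.445))                    # AlaHis value from the table
```

With 20 residues the expectation is 2.5‰, so AlaAla (6.231‰) falls above it and AlaHis (0.445‰) below it.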

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰, value ± error), grouped by first residue; second residue in the order Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 6.231 ± 1.552
AlaCys: 0.89 ± 1.158
AlaAsp: 4.005 ± 1.148
AlaGlu: 7.121 ± 0.911
AlaPhe: 3.115 ± 0.631
AlaGly: 3.115 ± 1.101
AlaHis: 0.445 ± 0.374
AlaIle: 3.56 ± 1.399
AlaLys: 3.115 ± 0.619
AlaLeu: 3.115 ± 0.627
AlaMet: 1.78 ± 1.087
AlaAsn: 1.78 ± 0.613
AlaPro: 2.67 ± 1.527
AlaGln: 2.67 ± 0.521
AlaArg: 3.115 ± 0.919
AlaSer: 4.45 ± 1.425
AlaThr: 4.005 ± 0.947
AlaVal: 3.56 ± 1.35
AlaTrp: 0.445 ± 0.361
AlaTyr: 2.225 ± 0.413
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.78 ± 1.001
CysCys: 1.78 ± 1.049
CysAsp: 0.445 ± 0.361
CysGlu: 0.445 ± 0.361
CysPhe: 2.225 ± 1.344
CysGly: 0.89 ± 0.579
CysHis: 0.89 ± 0.656
CysIle: 0.89 ± 0.613
CysLys: 1.78 ± 1.014
CysLeu: 2.67 ± 1.546
CysMet: 0.89 ± 0.721
CysAsn: 1.78 ± 0.765
CysPro: 0.89 ± 0.656
CysGln: 0.89 ± 0.579
CysArg: 1.78 ± 1.769
CysSer: 1.335 ± 1.096
CysThr: 0.89 ± 0.575
CysVal: 0.89 ± 0.579
CysTrp: 1.335 ± 0.424
CysTyr: 0.445 ± 0.579
CysXaa: 0.0 ± 0.0

Asp
AspAla: 6.676 ± 1.758
AspCys: 2.225 ± 0.61
AspAsp: 4.45 ± 1.209
AspGlu: 4.005 ± 0.628
AspPhe: 3.115 ± 1.404
AspGly: 3.115 ± 1.011
AspHis: 1.335 ± 0.661
AspIle: 5.34 ± 2.069
AspLys: 1.335 ± 0.679
AspLeu: 6.676 ± 1.454
AspMet: 1.335 ± 0.624
AspAsn: 2.225 ± 0.934
AspPro: 4.005 ± 1.184
AspGln: 1.335 ± 0.721
AspArg: 2.67 ± 1.021
AspSer: 6.676 ± 1.695
AspThr: 2.67 ± 0.406
AspVal: 5.785 ± 2.18
AspTrp: 0.89 ± 0.395
AspTyr: 1.335 ± 0.658
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.56 ± 1.35
GluCys: 0.89 ± 0.721
GluAsp: 5.34 ± 1.332
GluGlu: 6.231 ± 1.617
GluPhe: 2.225 ± 0.719
GluGly: 2.67 ± 1.125
GluHis: 0.89 ± 0.442
GluIle: 3.115 ± 0.821
GluLys: 2.67 ± 0.839
GluLeu: 4.45 ± 1.221
GluMet: 1.335 ± 0.365
GluAsn: 4.895 ± 1.149
GluPro: 2.225 ± 1.029
GluGln: 2.67 ± 1.176
GluArg: 2.67 ± 1.181
GluSer: 5.785 ± 2.111
GluThr: 4.895 ± 0.83
GluVal: 3.115 ± 0.627
GluTrp: 0.445 ± 0.439
GluTyr: 2.225 ± 0.923
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.67 ± 0.424
PheCys: 0.89 ± 0.575
PheAsp: 4.005 ± 1.001
PheGlu: 4.005 ± 1.56
PhePhe: 4.005 ± 1.083
PheGly: 2.67 ± 1.016
PheHis: 0.89 ± 0.462
PheIle: 1.78 ± 0.591
PheLys: 5.34 ± 1.981
PheLeu: 4.005 ± 0.879
PheMet: 0.89 ± 0.422
PheAsn: 1.78 ± 0.79
PhePro: 1.78 ± 0.228
PheGln: 1.335 ± 0.958
PheArg: 1.78 ± 0.613
PheSer: 3.56 ± 1.117
PheThr: 4.005 ± 0.958
PheVal: 1.335 ± 0.365
PheTrp: 0.89 ± 0.395
PheTyr: 2.225 ± 0.663
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 2.67 ± 0.484
GlyCys: 1.335 ± 0.833
GlyAsp: 4.895 ± 1.39
GlyGlu: 4.005 ± 0.851
GlyPhe: 1.78 ± 0.66
GlyGly: 2.225 ± 1.547
GlyHis: 1.78 ± 1.034
GlyIle: 2.67 ± 1.096
GlyLys: 2.225 ± 1.047
GlyLeu: 4.005 ± 1.304
GlyMet: 0.0 ± 0.0
GlyAsn: 3.115 ± 0.656
GlyPro: 2.225 ± 0.751
GlyGln: 0.89 ± 0.462
GlyArg: 3.115 ± 0.664
GlySer: 4.45 ± 1.027
GlyThr: 5.34 ± 1.867
GlyVal: 2.225 ± 1.547
GlyTrp: 0.445 ± 0.361
GlyTyr: 1.335 ± 0.439
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.335 ± 0.679
HisCys: 0.445 ± 0.404
HisAsp: 0.89 ± 0.808
HisGlu: 0.0 ± 0.0
HisPhe: 1.335 ± 0.699
HisGly: 0.89 ± 0.871
HisHis: 0.445 ± 0.439
HisIle: 1.78 ± 0.668
HisLys: 2.225 ± 1.031
HisLeu: 3.56 ± 1.576
HisMet: 0.0 ± 0.0
HisAsn: 0.445 ± 0.361
HisPro: 1.335 ± 0.73
HisGln: 0.445 ± 0.361
HisArg: 0.89 ± 0.501
HisSer: 1.335 ± 0.473
HisThr: 0.445 ± 0.523
HisVal: 0.445 ± 0.361
HisTrp: 1.335 ± 0.661
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0

Ile
IleAla: 2.225 ± 1.361
IleCys: 1.335 ± 1.003
IleAsp: 2.67 ± 1.533
IleGlu: 3.56 ± 0.856
IlePhe: 3.115 ± 1.059
IleGly: 2.67 ± 1.051
IleHis: 0.0 ± 0.0
IleIle: 3.115 ± 1.542
IleLys: 1.78 ± 0.79
IleLeu: 4.895 ± 1.373
IleMet: 0.445 ± 0.579
IleAsn: 2.67 ± 0.519
IlePro: 3.115 ± 0.925
IleGln: 3.56 ± 0.835
IleArg: 1.335 ± 0.827
IleSer: 5.34 ± 1.401
IleThr: 5.34 ± 1.046
IleVal: 3.115 ± 1.622
IleTrp: 0.445 ± 0.374
IleTyr: 1.335 ± 0.73
IleXaa: 0.0 ± 0.0

Lys
LysAla: 0.89 ± 0.721
LysCys: 2.67 ± 1.356
LysAsp: 2.67 ± 1.575
LysGlu: 4.005 ± 1.383
LysPhe: 3.115 ± 1.27
LysGly: 1.78 ± 1.087
LysHis: 0.89 ± 0.512
LysIle: 1.335 ± 0.699
LysLys: 1.78 ± 1.025
LysLeu: 3.115 ± 1.488
LysMet: 1.78 ± 1.113
LysAsn: 3.56 ± 2.02
LysPro: 1.78 ± 0.92
LysGln: 2.67 ± 0.737
LysArg: 5.34 ± 1.518
LysSer: 4.005 ± 1.319
LysThr: 2.67 ± 0.521
LysVal: 2.67 ± 0.555
LysTrp: 0.0 ± 0.0
LysTyr: 3.56 ± 0.978
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 7.121 ± 1.465
LeuCys: 1.78 ± 1.071
LeuAsp: 7.121 ± 1.487
LeuGlu: 3.56 ± 0.979
LeuPhe: 4.45 ± 1.26
LeuGly: 4.005 ± 1.742
LeuHis: 1.335 ± 0.612
LeuIle: 5.785 ± 1.177
LeuLys: 5.785 ± 1.902
LeuLeu: 5.785 ± 1.41
LeuMet: 0.89 ± 0.455
LeuAsn: 2.225 ± 1.173
LeuPro: 2.67 ± 1.272
LeuGln: 4.45 ± 0.548
LeuArg: 5.34 ± 2.19
LeuSer: 6.676 ± 1.167
LeuThr: 3.115 ± 0.872
LeuVal: 7.121 ± 1.623
LeuTrp: 0.89 ± 0.747
LeuTyr: 5.34 ± 1.304
LeuXaa: 0.0 ± 0.0

Met
MetAla: 0.445 ± 0.404
MetCys: 0.0 ± 0.0
MetAsp: 0.445 ± 0.361
MetGlu: 1.78 ± 1.138
MetPhe: 1.335 ± 0.658
MetGly: 1.335 ± 0.658
MetHis: 0.0 ± 0.0
MetIle: 0.445 ± 0.361
MetLys: 1.335 ± 0.884
MetLeu: 1.78 ± 1.025
MetMet: 0.0 ± 0.0
MetAsn: 1.335 ± 0.679
MetPro: 0.445 ± 0.361
MetGln: 0.445 ± 0.523
MetArg: 0.89 ± 0.395
MetSer: 1.335 ± 0.365
MetThr: 0.89 ± 0.473
MetVal: 1.78 ± 0.985
MetTrp: 0.0 ± 0.0
MetTyr: 0.445 ± 0.361
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.115 ± 0.656
AsnCys: 1.78 ± 1.132
AsnAsp: 3.56 ± 0.711
AsnGlu: 2.67 ± 0.484
AsnPhe: 1.78 ± 0.79
AsnGly: 2.67 ± 0.739
AsnHis: 0.89 ± 0.512
AsnIle: 3.115 ± 1.137
AsnLys: 2.225 ± 0.644
AsnLeu: 4.895 ± 1.063
AsnMet: 0.89 ± 0.721
AsnAsn: 2.67 ± 0.879
AsnPro: 3.115 ± 1.287
AsnGln: 1.78 ± 0.613
AsnArg: 1.78 ± 0.768
AsnSer: 3.56 ± 1.698
AsnThr: 6.231 ± 1.037
AsnVal: 3.56 ± 1.248
AsnTrp: 0.445 ± 0.361
AsnTyr: 0.89 ± 0.871
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 4.45 ± 2.717
ProCys: 0.445 ± 0.374
ProAsp: 6.676 ± 2.263
ProGlu: 2.67 ± 0.928
ProPhe: 0.445 ± 0.361
ProGly: 0.89 ± 0.462
ProHis: 0.445 ± 0.361
ProIle: 2.225 ± 0.5
ProLys: 3.115 ± 1.292
ProLeu: 4.895 ± 1.014
ProMet: 0.0 ± 0.0
ProAsn: 2.225 ± 0.751
ProPro: 4.895 ± 1.619
ProGln: 3.56 ± 0.939
ProArg: 3.115 ± 0.548
ProSer: 2.225 ± 1.054
ProThr: 4.895 ± 2.473
ProVal: 4.005 ± 1.765
ProTrp: 0.445 ± 0.439
ProTyr: 2.67 ± 1.267
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 0.0 ± 0.0
GlnCys: 1.335 ± 1.291
GlnAsp: 1.78 ± 0.581
GlnGlu: 3.56 ± 1.832
GlnPhe: 1.335 ± 0.523
GlnGly: 4.005 ± 1.07
GlnHis: 1.78 ± 0.905
GlnIle: 2.67 ± 0.741
GlnLys: 1.78 ± 0.925
GlnLeu: 5.34 ± 1.646
GlnMet: 1.335 ± 0.77
GlnAsn: 0.89 ± 0.641
GlnPro: 3.115 ± 1.471
GlnGln: 2.225 ± 0.863
GlnArg: 2.225 ± 0.5
GlnSer: 0.445 ± 0.361
GlnThr: 1.335 ± 0.833
GlnVal: 3.56 ± 1.205
GlnTrp: 0.89 ± 0.721
GlnTyr: 2.67 ± 0.839
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.005 ± 1.031
ArgCys: 1.335 ± 0.791
ArgAsp: 2.225 ± 1.311
ArgGlu: 2.67 ± 0.848
ArgPhe: 1.335 ± 0.473
ArgGly: 2.67 ± 0.63
ArgHis: 2.67 ± 0.63
ArgIle: 2.225 ± 0.9
ArgLys: 4.005 ± 0.435
ArgLeu: 5.785 ± 1.003
ArgMet: 0.89 ± 0.473
ArgAsn: 4.45 ± 1.552
ArgPro: 3.115 ± 1.419
ArgGln: 1.78 ± 1.025
ArgArg: 5.34 ± 3.209
ArgSer: 2.67 ± 0.597
ArgThr: 4.895 ± 1.884
ArgVal: 2.67 ± 1.312
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.78 ± 0.747
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.45 ± 1.378
SerCys: 0.89 ± 0.579
SerAsp: 5.34 ± 0.812
SerGlu: 3.56 ± 1.354
SerPhe: 4.005 ± 0.888
SerGly: 4.895 ± 0.849
SerHis: 0.445 ± 0.361
SerIle: 3.56 ± 1.184
SerLys: 3.115 ± 1.249
SerLeu: 6.231 ± 0.868
SerMet: 1.335 ± 0.658
SerAsn: 4.45 ± 2.851
SerPro: 4.45 ± 1.259
SerGln: 3.56 ± 1.375
SerArg: 4.005 ± 1.876
SerSer: 7.121 ± 2.148
SerThr: 6.676 ± 2.623
SerVal: 5.785 ± 1.652
SerTrp: 0.0 ± 0.0
SerTyr: 1.335 ± 0.473
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.005 ± 1.596
ThrCys: 1.78 ± 1.112
ThrAsp: 3.56 ± 1.271
ThrGlu: 3.56 ± 0.682
ThrPhe: 3.115 ± 1.09
ThrGly: 4.005 ± 1.184
ThrHis: 1.335 ± 0.852
ThrIle: 3.115 ± 0.626
ThrLys: 1.335 ± 0.679
ThrLeu: 8.011 ± 1.741
ThrMet: 0.89 ± 0.619
ThrAsn: 5.34 ± 1.037
ThrPro: 4.895 ± 2.044
ThrGln: 2.67 ± 0.644
ThrArg: 4.45 ± 0.521
ThrSer: 5.34 ± 2.104
ThrThr: 3.115 ± 1.415
ThrVal: 5.785 ± 1.658
ThrTrp: 0.89 ± 0.501
ThrTyr: 1.78 ± 0.881
ThrXaa: 0.0 ± 0.0

Val
ValAla: 2.225 ± 0.815
ValCys: 1.335 ± 0.928
ValAsp: 5.34 ± 1.035
ValGlu: 3.56 ± 1.769
ValPhe: 3.56 ± 0.601
ValGly: 3.56 ± 1.499
ValHis: 1.78 ± 0.657
ValIle: 3.56 ± 0.951
ValLys: 3.115 ± 1.405
ValLeu: 3.115 ± 0.806
ValMet: 0.445 ± 0.374
ValAsn: 3.56 ± 0.978
ValPro: 4.45 ± 1.527
ValGln: 3.115 ± 1.903
ValArg: 3.56 ± 1.049
ValSer: 7.121 ± 0.888
ValThr: 4.45 ± 1.271
ValVal: 3.115 ± 1.948
ValTrp: 0.89 ± 0.753
ValTyr: 1.78 ± 0.747
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.335 ± 0.699
TrpCys: 0.445 ± 0.361
TrpAsp: 0.89 ± 0.473
TrpGlu: 0.445 ± 0.439
TrpPhe: 0.0 ± 0.0
TrpGly: 0.445 ± 0.374
TrpHis: 0.445 ± 0.439
TrpIle: 0.89 ± 0.721
TrpLys: 0.89 ± 0.575
TrpLeu: 1.335 ± 0.679
TrpMet: 0.0 ± 0.0
TrpAsn: 0.89 ± 0.747
TrpPro: 0.445 ± 0.374
TrpGln: 0.445 ± 0.374
TrpArg: 0.445 ± 0.579
TrpSer: 0.445 ± 0.439
TrpThr: 1.78 ± 0.629
TrpVal: 0.445 ± 0.439
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.225 ± 0.727
TyrCys: 1.335 ± 0.661
TyrAsp: 0.89 ± 0.442
TyrGlu: 0.89 ± 0.462
TyrPhe: 4.45 ± 1.13
TyrGly: 2.225 ± 0.823
TyrHis: 0.89 ± 0.613
TyrIle: 0.89 ± 0.395
TyrLys: 1.78 ± 0.581
TyrLeu: 1.78 ± 0.629
TyrMet: 0.89 ± 0.395
TyrAsn: 1.335 ± 0.679
TyrPro: 2.67 ± 1.421
TyrGln: 1.78 ± 0.668
TyrArg: 2.67 ± 0.519
TyrSer: 1.335 ± 0.66
TyrThr: 1.335 ± 0.439
TyrVal: 2.67 ± 0.521
TyrTrp: 1.335 ± 0.73
TyrTyr: 1.78 ± 0.794
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 6 proteins (2248 amino acids)

Note: The error has been estimated by bootstrapping (100 resamplings) at the protein level.
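A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is reported as the error. This is a minimal sketch under stated assumptions, not the database's actual code: the function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the error measure are all chosen for illustration.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Frequency of one dipeptide in per mille over all overlapping pairs."""
    counts = Counter(seq[i:i + 2] for seq in proteins for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, not individual residues.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)

# Toy sequences (one-letter codes), not taken from the HPV 157 proteome.
proteins = ["MKKAL", "AKKG", "GALKK", "MAAL"]
value = pair_permille(proteins, "KK")
err = bootstrap_error(proteins, "KK")
print(f"KK: {value:.3f} ± {err:.3f}")
```

Resampling at the protein level keeps each sequence intact, so with only 6 proteins the estimated errors can be large relative to the frequencies themselves, as seen in the table.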

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski