Amino acid dipepetide frequency for Human papillomavirus type 131

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.541AlaAla: 6.541 ± 1.602
1.635AlaCys: 1.635 ± 0.689
3.679AlaAsp: 3.679 ± 0.991
5.724AlaGlu: 5.724 ± 1.949
4.906AlaPhe: 4.906 ± 1.221
3.679AlaGly: 3.679 ± 1.347
1.226AlaHis: 1.226 ± 0.737
2.044AlaIle: 2.044 ± 0.61
2.044AlaLys: 2.044 ± 0.738
4.088AlaLeu: 4.088 ± 0.597
0.409AlaMet: 0.409 ± 0.321
3.271AlaAsn: 3.271 ± 1.12
2.862AlaPro: 2.862 ± 1.132
1.226AlaGln: 1.226 ± 0.353
2.453AlaArg: 2.453 ± 0.61
4.088AlaSer: 4.088 ± 1.114
4.497AlaThr: 4.497 ± 0.731
2.862AlaVal: 2.862 ± 0.613
0.0AlaTrp: 0.0 ± 0.0
1.635AlaTyr: 1.635 ± 0.5
0.0AlaXaa: 0.0 ± 0.0
Cys
0.818CysAla: 0.818 ± 0.335
2.862CysCys: 2.862 ± 1.615
0.0CysAsp: 0.0 ± 0.0
1.226CysGlu: 1.226 ± 0.656
2.044CysPhe: 2.044 ± 1.045
0.409CysGly: 0.409 ± 0.321
0.818CysHis: 0.818 ± 0.534
1.226CysIle: 1.226 ± 0.962
1.635CysLys: 1.635 ± 1.037
2.453CysLeu: 2.453 ± 1.357
0.0CysMet: 0.0 ± 0.0
1.226CysAsn: 1.226 ± 0.629
0.818CysPro: 0.818 ± 0.676
0.0CysGln: 0.0 ± 0.0
1.226CysArg: 1.226 ± 0.666
3.271CysSer: 3.271 ± 2.523
0.818CysThr: 0.818 ± 0.641
1.226CysVal: 1.226 ± 0.541
1.635CysTrp: 1.635 ± 0.684
0.409CysTyr: 0.409 ± 0.431
0.0CysXaa: 0.0 ± 0.0
Asp
3.271AspAla: 3.271 ± 0.679
2.044AspCys: 2.044 ± 0.645
3.679AspAsp: 3.679 ± 0.974
2.044AspGlu: 2.044 ± 0.781
2.862AspPhe: 2.862 ± 0.796
2.862AspGly: 2.862 ± 0.793
1.635AspHis: 1.635 ± 0.904
4.906AspIle: 4.906 ± 1.459
2.044AspLys: 2.044 ± 0.945
10.63AspLeu: 10.63 ± 1.608
0.409AspMet: 0.409 ± 0.338
2.862AspAsn: 2.862 ± 0.738
4.088AspPro: 4.088 ± 0.744
0.409AspGln: 0.409 ± 0.321
1.635AspArg: 1.635 ± 1.037
4.906AspSer: 4.906 ± 0.937
4.497AspThr: 4.497 ± 0.593
6.132AspVal: 6.132 ± 1.096
0.818AspTrp: 0.818 ± 0.431
1.635AspTyr: 1.635 ± 1.391
0.0AspXaa: 0.0 ± 0.0
Glu
2.044GluAla: 2.044 ± 0.805
0.409GluCys: 0.409 ± 0.321
3.679GluAsp: 3.679 ± 1.301
5.315GluGlu: 5.315 ± 0.852
2.044GluPhe: 2.044 ± 0.9
4.497GluGly: 4.497 ± 0.538
1.226GluHis: 1.226 ± 0.771
1.635GluIle: 1.635 ± 0.6
2.862GluLys: 2.862 ± 0.932
4.497GluLeu: 4.497 ± 1.65
2.453GluMet: 2.453 ± 0.771
6.132GluAsn: 6.132 ± 1.05
4.497GluPro: 4.497 ± 1.38
2.453GluGln: 2.453 ± 0.405
2.453GluArg: 2.453 ± 0.885
5.315GluSer: 5.315 ± 1.624
4.497GluThr: 4.497 ± 1.431
3.271GluVal: 3.271 ± 1.492
1.635GluTrp: 1.635 ± 0.505
1.635GluTyr: 1.635 ± 0.678
0.0GluXaa: 0.0 ± 0.0
Phe
2.453PheAla: 2.453 ± 0.377
1.226PheCys: 1.226 ± 0.785
2.044PheAsp: 2.044 ± 0.76
4.088PheGlu: 4.088 ± 1.742
3.271PhePhe: 3.271 ± 0.892
3.271PheGly: 3.271 ± 1.654
1.226PheHis: 1.226 ± 0.875
2.862PheIle: 2.862 ± 0.825
4.906PheLys: 4.906 ± 2.168
4.906PheLeu: 4.906 ± 2.098
0.818PheMet: 0.818 ± 0.365
2.044PheAsn: 2.044 ± 0.879
1.635PhePro: 1.635 ± 0.5
2.044PheGln: 2.044 ± 0.624
1.226PheArg: 1.226 ± 0.353
2.862PheSer: 2.862 ± 1.127
3.679PheThr: 3.679 ± 1.58
2.453PheVal: 2.453 ± 0.809
1.226PheTrp: 1.226 ± 0.562
1.635PheTyr: 1.635 ± 0.539
0.0PheXaa: 0.0 ± 0.0
Gly
1.635GlyAla: 1.635 ± 0.678
1.226GlyCys: 1.226 ± 0.737
4.906GlyAsp: 4.906 ± 0.959
4.088GlyGlu: 4.088 ± 1.056
2.044GlyPhe: 2.044 ± 0.647
3.679GlyGly: 3.679 ± 1.716
2.044GlyHis: 2.044 ± 1.21
2.453GlyIle: 2.453 ± 0.746
3.679GlyLys: 3.679 ± 0.648
6.132GlyLeu: 6.132 ± 1.634
0.409GlyMet: 0.409 ± 0.425
4.088GlyAsn: 4.088 ± 1.079
4.088GlyPro: 4.088 ± 1.285
0.409GlyGln: 0.409 ± 0.407
2.862GlyArg: 2.862 ± 0.997
4.497GlySer: 4.497 ± 0.76
3.679GlyThr: 3.679 ± 1.747
2.453GlyVal: 2.453 ± 1.007
0.409GlyTrp: 0.409 ± 0.458
1.226GlyTyr: 1.226 ± 0.456
0.0GlyXaa: 0.0 ± 0.0
His
1.226HisAla: 1.226 ± 0.592
0.0HisCys: 0.0 ± 0.0
1.635HisAsp: 1.635 ± 0.785
0.409HisGlu: 0.409 ± 0.33
0.818HisPhe: 0.818 ± 0.36
0.818HisGly: 0.818 ± 0.633
0.818HisHis: 0.818 ± 0.546
1.226HisIle: 1.226 ± 0.367
1.635HisLys: 1.635 ± 0.801
4.497HisLeu: 4.497 ± 1.5
0.409HisMet: 0.409 ± 0.321
0.409HisAsn: 0.409 ± 0.407
2.044HisPro: 2.044 ± 0.794
1.226HisGln: 1.226 ± 0.823
0.818HisArg: 0.818 ± 0.633
0.818HisSer: 0.818 ± 0.442
1.635HisThr: 1.635 ± 0.778
0.409HisVal: 0.409 ± 0.338
0.818HisTrp: 0.818 ± 0.504
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.679IleAla: 3.679 ± 1.191
1.226IleCys: 1.226 ± 0.562
3.271IleAsp: 3.271 ± 0.599
4.906IleGlu: 4.906 ± 1.195
1.226IlePhe: 1.226 ± 0.725
3.679IleGly: 3.679 ± 1.731
0.818IleHis: 0.818 ± 0.5
2.044IleIle: 2.044 ± 1.231
1.635IleLys: 1.635 ± 0.684
3.271IleLeu: 3.271 ± 0.882
0.0IleMet: 0.0 ± 0.0
2.453IleAsn: 2.453 ± 1.215
2.862IlePro: 2.862 ± 1.059
1.635IleGln: 1.635 ± 0.678
2.453IleArg: 2.453 ± 1.3
4.088IleSer: 4.088 ± 1.22
4.088IleThr: 4.088 ± 0.857
6.95IleVal: 6.95 ± 2.115
0.409IleTrp: 0.409 ± 0.321
0.818IleTyr: 0.818 ± 0.814
0.0IleXaa: 0.0 ± 0.0
Lys
2.862LysAla: 2.862 ± 1.0
1.226LysCys: 1.226 ± 0.629
2.862LysAsp: 2.862 ± 0.816
2.453LysGlu: 2.453 ± 1.465
4.497LysPhe: 4.497 ± 1.716
2.044LysGly: 2.044 ± 0.998
0.818LysHis: 0.818 ± 0.546
2.044LysIle: 2.044 ± 0.443
2.453LysLys: 2.453 ± 1.275
3.271LysLeu: 3.271 ± 1.144
1.226LysMet: 1.226 ± 0.866
3.271LysAsn: 3.271 ± 0.822
1.635LysPro: 1.635 ± 0.482
2.453LysGln: 2.453 ± 0.803
4.906LysArg: 4.906 ± 0.849
5.315LysSer: 5.315 ± 2.025
3.679LysThr: 3.679 ± 1.631
3.679LysVal: 3.679 ± 0.923
0.409LysTrp: 0.409 ± 0.458
2.044LysTyr: 2.044 ± 0.724
0.0LysXaa: 0.0 ± 0.0
Leu
5.315LeuAla: 5.315 ± 1.398
2.453LeuCys: 2.453 ± 1.104
5.724LeuAsp: 5.724 ± 1.669
4.088LeuGlu: 4.088 ± 1.132
8.177LeuPhe: 8.177 ± 1.837
6.132LeuGly: 6.132 ± 2.066
2.862LeuHis: 2.862 ± 1.334
6.95LeuIle: 6.95 ± 1.391
6.132LeuLys: 6.132 ± 2.27
8.994LeuLeu: 8.994 ± 2.644
2.044LeuMet: 2.044 ± 0.97
2.453LeuAsn: 2.453 ± 0.596
6.132LeuPro: 6.132 ± 1.102
4.088LeuGln: 4.088 ± 0.646
3.679LeuArg: 3.679 ± 1.527
7.768LeuSer: 7.768 ± 1.857
6.541LeuThr: 6.541 ± 1.118
5.315LeuVal: 5.315 ± 0.856
1.226LeuTrp: 1.226 ± 0.737
4.088LeuTyr: 4.088 ± 0.543
0.0LeuXaa: 0.0 ± 0.0
Met
1.226MetAla: 1.226 ± 0.511
0.818MetCys: 0.818 ± 0.335
0.409MetAsp: 0.409 ± 0.33
0.818MetGlu: 0.818 ± 0.442
0.409MetPhe: 0.409 ± 0.321
1.226MetGly: 1.226 ± 0.643
0.0MetHis: 0.0 ± 0.0
0.409MetIle: 0.409 ± 0.321
0.818MetLys: 0.818 ± 0.641
1.635MetLeu: 1.635 ± 0.5
0.0MetMet: 0.0 ± 0.0
0.818MetAsn: 0.818 ± 0.335
0.409MetPro: 0.409 ± 0.321
0.818MetGln: 0.818 ± 0.442
0.818MetArg: 0.818 ± 0.534
1.635MetSer: 1.635 ± 0.5
0.409MetThr: 0.409 ± 0.338
1.635MetVal: 1.635 ± 0.851
0.0MetTrp: 0.0 ± 0.0
0.818MetTyr: 0.818 ± 0.641
0.0MetXaa: 0.0 ± 0.0
Asn
3.679AsnAla: 3.679 ± 1.314
1.635AsnCys: 1.635 ± 1.193
2.453AsnAsp: 2.453 ± 0.833
2.044AsnGlu: 2.044 ± 0.586
1.635AsnPhe: 1.635 ± 0.67
2.044AsnGly: 2.044 ± 0.642
0.818AsnHis: 0.818 ± 0.546
5.315AsnIle: 5.315 ± 1.871
2.862AsnLys: 2.862 ± 0.813
4.088AsnLeu: 4.088 ± 0.935
1.635AsnMet: 1.635 ± 0.812
4.088AsnAsn: 4.088 ± 0.836
2.862AsnPro: 2.862 ± 1.03
0.818AsnGln: 0.818 ± 0.447
2.862AsnArg: 2.862 ± 0.914
4.497AsnSer: 4.497 ± 1.331
3.271AsnThr: 3.271 ± 1.482
4.088AsnVal: 4.088 ± 1.405
0.818AsnTrp: 0.818 ± 0.641
1.635AsnTyr: 1.635 ± 0.854
0.0AsnXaa: 0.0 ± 0.0
Pro
2.862ProAla: 2.862 ± 1.408
0.818ProCys: 0.818 ± 0.587
5.315ProAsp: 5.315 ± 1.344
3.271ProGlu: 3.271 ± 0.624
1.635ProPhe: 1.635 ± 0.721
1.635ProGly: 1.635 ± 0.692
1.635ProHis: 1.635 ± 1.342
3.271ProIle: 3.271 ± 2.056
3.679ProLys: 3.679 ± 0.886
5.315ProLeu: 5.315 ± 1.009
0.409ProMet: 0.409 ± 0.321
3.271ProAsn: 3.271 ± 1.221
6.132ProPro: 6.132 ± 2.143
1.635ProGln: 1.635 ± 0.482
2.862ProArg: 2.862 ± 0.647
5.315ProSer: 5.315 ± 2.097
4.088ProThr: 4.088 ± 1.69
3.271ProVal: 3.271 ± 1.154
0.0ProTrp: 0.0 ± 0.0
2.453ProTyr: 2.453 ± 1.192
0.0ProXaa: 0.0 ± 0.0
Gln
1.226GlnAla: 1.226 ± 0.367
0.0GlnCys: 0.0 ± 0.0
1.635GlnAsp: 1.635 ± 0.976
2.862GlnGlu: 2.862 ± 1.021
1.226GlnPhe: 1.226 ± 0.725
1.226GlnGly: 1.226 ± 0.357
0.409GlnHis: 0.409 ± 0.431
1.635GlnIle: 1.635 ± 0.5
1.226GlnLys: 1.226 ± 0.768
5.315GlnLeu: 5.315 ± 1.78
1.226GlnMet: 1.226 ± 0.353
0.409GlnAsn: 0.409 ± 0.338
2.044GlnPro: 2.044 ± 1.265
3.271GlnGln: 3.271 ± 0.867
0.409GlnArg: 0.409 ± 0.407
0.818GlnSer: 0.818 ± 0.335
2.044GlnThr: 2.044 ± 0.738
3.679GlnVal: 3.679 ± 0.683
0.818GlnTrp: 0.818 ± 0.641
1.226GlnTyr: 1.226 ± 0.592
0.0GlnXaa: 0.0 ± 0.0
Arg
2.862ArgAla: 2.862 ± 1.066
1.635ArgCys: 1.635 ± 0.927
4.088ArgAsp: 4.088 ± 1.1
2.044ArgGlu: 2.044 ± 0.624
1.635ArgPhe: 1.635 ± 0.647
2.453ArgGly: 2.453 ± 0.58
1.226ArgHis: 1.226 ± 0.68
1.226ArgIle: 1.226 ± 1.025
4.497ArgLys: 4.497 ± 0.894
8.177ArgLeu: 8.177 ± 1.208
0.0ArgMet: 0.0 ± 0.0
3.271ArgAsn: 3.271 ± 1.395
2.862ArgPro: 2.862 ± 1.411
2.044ArgGln: 2.044 ± 0.803
3.679ArgArg: 3.679 ± 1.337
2.453ArgSer: 2.453 ± 0.446
2.044ArgThr: 2.044 ± 0.642
1.635ArgVal: 1.635 ± 0.678
0.818ArgTrp: 0.818 ± 0.442
0.409ArgTyr: 0.409 ± 0.338
0.0ArgXaa: 0.0 ± 0.0
Ser
5.315SerAla: 5.315 ± 1.619
1.226SerCys: 1.226 ± 0.785
5.315SerAsp: 5.315 ± 2.244
5.315SerGlu: 5.315 ± 1.523
2.453SerPhe: 2.453 ± 0.959
2.453SerGly: 2.453 ± 0.58
2.044SerHis: 2.044 ± 0.815
2.453SerIle: 2.453 ± 1.009
2.862SerLys: 2.862 ± 1.166
7.359SerLeu: 7.359 ± 1.988
1.226SerMet: 1.226 ± 0.706
5.724SerAsn: 5.724 ± 1.666
4.497SerPro: 4.497 ± 1.443
1.635SerGln: 1.635 ± 0.531
4.497SerArg: 4.497 ± 1.483
7.359SerSer: 7.359 ± 1.268
5.724SerThr: 5.724 ± 1.597
5.724SerVal: 5.724 ± 2.054
0.409SerTrp: 0.409 ± 0.321
2.453SerTyr: 2.453 ± 0.935
0.0SerXaa: 0.0 ± 0.0
Thr
4.088ThrAla: 4.088 ± 1.361
1.635ThrCys: 1.635 ± 0.537
3.679ThrAsp: 3.679 ± 1.28
5.724ThrGlu: 5.724 ± 1.223
2.044ThrPhe: 2.044 ± 0.769
4.906ThrGly: 4.906 ± 1.004
0.409ThrHis: 0.409 ± 0.338
3.679ThrIle: 3.679 ± 1.057
1.226ThrLys: 1.226 ± 0.791
5.315ThrLeu: 5.315 ± 0.679
0.409ThrMet: 0.409 ± 0.338
2.862ThrAsn: 2.862 ± 0.911
4.497ThrPro: 4.497 ± 1.109
2.862ThrGln: 2.862 ± 0.634
4.088ThrArg: 4.088 ± 1.19
4.497ThrSer: 4.497 ± 2.633
3.271ThrThr: 3.271 ± 1.413
6.95ThrVal: 6.95 ± 1.758
0.409ThrTrp: 0.409 ± 0.407
2.044ThrTyr: 2.044 ± 0.792
0.0ThrXaa: 0.0 ± 0.0
Val
5.315ValAla: 5.315 ± 0.884
0.818ValCys: 0.818 ± 0.5
5.315ValAsp: 5.315 ± 0.907
4.497ValGlu: 4.497 ± 1.265
4.088ValPhe: 4.088 ± 1.211
5.315ValGly: 5.315 ± 1.393
1.226ValHis: 1.226 ± 0.597
3.271ValIle: 3.271 ± 0.732
4.497ValLys: 4.497 ± 1.209
5.724ValLeu: 5.724 ± 1.308
1.226ValMet: 1.226 ± 0.353
2.453ValAsn: 2.453 ± 0.613
3.271ValPro: 3.271 ± 1.299
2.862ValGln: 2.862 ± 0.58
2.453ValArg: 2.453 ± 1.044
3.679ValSer: 3.679 ± 0.616
3.679ValThr: 3.679 ± 1.258
2.453ValVal: 2.453 ± 0.786
0.818ValTrp: 0.818 ± 0.504
2.044ValTyr: 2.044 ± 0.969
0.0ValXaa: 0.0 ± 0.0
Trp
0.818TrpAla: 0.818 ± 0.641
0.0TrpCys: 0.0 ± 0.0
1.635TrpAsp: 1.635 ± 0.809
0.818TrpGlu: 0.818 ± 0.633
0.0TrpPhe: 0.0 ± 0.0
0.818TrpGly: 0.818 ± 0.335
0.409TrpHis: 0.409 ± 0.407
1.226TrpIle: 1.226 ± 0.562
0.409TrpLys: 0.409 ± 0.321
1.635TrpLeu: 1.635 ± 0.537
0.0TrpMet: 0.0 ± 0.0
0.818TrpAsn: 0.818 ± 0.335
0.818TrpPro: 0.818 ± 0.46
0.409TrpGln: 0.409 ± 0.338
1.635TrpArg: 1.635 ± 0.927
1.226TrpSer: 1.226 ± 0.562
0.818TrpThr: 0.818 ± 0.814
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.409TrpTyr: 0.409 ± 0.321
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.635TyrAla: 1.635 ± 0.505
1.226TyrCys: 1.226 ± 0.785
2.044TyrAsp: 2.044 ± 0.869
0.818TyrGlu: 0.818 ± 0.863
2.862TyrPhe: 2.862 ± 0.493
2.862TyrGly: 2.862 ± 1.345
0.0TyrHis: 0.0 ± 0.0
1.635TyrIle: 1.635 ± 0.531
2.453TyrLys: 2.453 ± 0.913
2.862TyrLeu: 2.862 ± 0.846
0.409TyrMet: 0.409 ± 0.33
1.226TyrAsn: 1.226 ± 0.357
0.818TyrPro: 0.818 ± 0.597
0.409TyrGln: 0.409 ± 0.407
2.044TyrArg: 2.044 ± 0.903
1.635TyrSer: 1.635 ± 0.671
1.635TyrThr: 1.635 ± 0.678
0.818TyrVal: 0.818 ± 0.335
1.226TyrTrp: 1.226 ± 0.68
1.635TyrTyr: 1.635 ± 0.748
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2447 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski