Amino acid dipeptide frequency for Human papillomavirus 4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
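As an illustration, dipeptide frequencies can be computed by sliding a window of length two along each sequence. The sketch below is not the database's own code: the function name `dipeptide_permille`, the one-letter input format, the mapping of any non-standard letter to `X` (Xaa), and the choice not to count pairs across protein boundaries are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille}, pooled over all sequences.

    Hypothetical helper for illustration; the database may count differently.
    """
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes 'X' (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins here.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real proteome data.
freqs = dipeptide_permille(["MKLLA", "MKUA"])
```

With the toy input, seven dipeptides are counted in total, so `freqs["MK"]` is 2/7 expressed in per mille.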

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
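The unit conversion and the comparison against a uniform expectation can be sketched as follows. The function names are hypothetical, and using the uniform value (1/400, i.e. 2.5‰) as the threshold for over- and underrepresentation is an assumption based on the description above; the database may apply a different criterion when colouring cells.

```python
N_STANDARD = 20  # standard amino acids; use 21 to include Xaa


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def uniform_expectation_permille(n_residues=N_STANDARD):
    """Per mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / (n_residues ** 2)


def classify(value_permille, expected=None):
    """Label a table value relative to the (assumed) uniform expectation."""
    if expected is None:
        expected = uniform_expectation_permille()
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"
```

For example, the ArgArg value of 9.946‰ corresponds to a fraction of about 0.0099 and lies above the 2.5‰ uniform expectation, while 0.383‰ lies below it.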

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second. Each cell is frequency ± error (‰).

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 3.06 ± 1.345 | 0.765 ± 0.637 | 5.356 ± 2.037 | 4.591 ± 1.614 | 2.295 ± 1.035 | 1.913 ± 0.788 | 0.765 ± 0.403 | 3.443 ± 1.719 | 3.06 ± 1.044 | 6.121 ± 1.273 | 1.53 ± 0.57 | 1.913 ± 0.602 | 3.06 ± 0.6 | 3.06 ± 1.174 | 3.06 ± 1.148 | 2.295 ± 0.806 | 5.356 ± 0.773 | 3.443 ± 0.951 | 0.0 ± 0.0 | 1.53 ± 0.926 | 0.0 ± 0.0
Cys | 0.383 ± 0.388 | 2.678 ± 1.405 | 2.295 ± 1.514 | 0.383 ± 0.388 | 0.383 ± 0.316 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.295 ± 1.957 | 1.148 ± 0.696 | 1.53 ± 1.043 | 0.0 ± 0.0 | 0.765 ± 0.607 | 1.148 ± 1.078 | 0.383 ± 0.366 | 1.148 ± 1.087 | 3.443 ± 1.794 | 2.678 ± 0.831 | 0.0 ± 0.0 | 1.53 ± 0.518 | 1.53 ± 1.038 | 0.0 ± 0.0
Asp | 6.121 ± 1.686 | 2.295 ± 0.839 | 4.208 ± 1.66 | 3.443 ± 1.46 | 2.678 ± 1.03 | 3.06 ± 1.129 | 0.765 ± 0.455 | 4.591 ± 1.292 | 2.678 ± 1.449 | 6.886 ± 2.457 | 1.148 ± 0.421 | 4.591 ± 1.868 | 4.591 ± 1.33 | 2.295 ± 0.702 | 0.765 ± 0.442 | 4.591 ± 0.816 | 3.443 ± 0.778 | 4.973 ± 1.892 | 0.765 ± 0.632 | 1.148 ± 0.575 | 0.0 ± 0.0
Glu | 2.678 ± 0.567 | 1.53 ± 0.983 | 3.443 ± 1.077 | 8.416 ± 3.947 | 1.53 ± 0.892 | 3.06 ± 1.317 | 0.383 ± 0.359 | 3.06 ± 1.221 | 1.53 ± 0.998 | 4.208 ± 0.81 | 1.53 ± 0.655 | 3.06 ± 0.699 | 2.678 ± 1.592 | 4.973 ± 1.129 | 3.826 ± 1.011 | 5.738 ± 1.456 | 4.591 ± 1.401 | 3.06 ± 1.639 | 0.765 ± 0.632 | 1.913 ± 0.63 | 0.0 ± 0.0
Phe | 1.148 ± 0.433 | 1.53 ± 1.038 | 3.826 ± 1.927 | 3.443 ± 1.227 | 1.913 ± 1.056 | 2.678 ± 1.015 | 0.383 ± 0.359 | 1.53 ± 1.031 | 1.913 ± 0.727 | 3.826 ± 1.186 | 1.148 ± 0.545 | 1.53 ± 1.031 | 1.53 ± 0.518 | 1.913 ± 0.586 | 2.295 ± 0.816 | 3.443 ± 1.14 | 2.295 ± 0.633 | 3.443 ± 1.357 | 0.765 ± 0.403 | 1.53 ± 0.49 | 0.0 ± 0.0
Gly | 3.06 ± 1.333 | 0.383 ± 0.359 | 4.208 ± 1.289 | 3.826 ± 1.588 | 1.913 ± 0.717 | 6.121 ± 2.085 | 0.765 ± 0.719 | 3.443 ± 1.265 | 2.678 ± 0.704 | 5.738 ± 2.352 | 0.0 ± 0.0 | 2.678 ± 1.231 | 4.208 ± 1.043 | 2.295 ± 1.384 | 4.591 ± 1.148 | 6.121 ± 1.831 | 5.738 ± 1.856 | 2.295 ± 0.83 | 0.0 ± 0.0 | 1.913 ± 0.729 | 0.0 ± 0.0
His | 1.148 ± 0.696 | 0.383 ± 0.316 | 0.383 ± 0.366 | 0.765 ± 0.403 | 1.53 ± 0.294 | 0.383 ± 0.316 | 0.383 ± 0.423 | 1.148 ± 0.723 | 0.765 ± 0.455 | 2.295 ± 0.522 | 0.0 ± 0.0 | 1.913 ± 1.144 | 1.148 ± 0.722 | 0.383 ± 0.316 | 1.148 ± 0.594 | 2.295 ± 0.93 | 1.148 ± 0.768 | 0.765 ± 0.404 | 0.383 ± 0.359 | 0.765 ± 0.432 | 0.0 ± 0.0
Ile | 3.06 ± 0.485 | 1.148 ± 1.008 | 4.591 ± 1.63 | 3.443 ± 1.638 | 2.295 ± 0.804 | 4.973 ± 2.917 | 0.765 ± 0.432 | 1.53 ± 0.892 | 1.913 ± 1.004 | 3.06 ± 1.147 | 0.383 ± 0.388 | 1.53 ± 0.589 | 3.06 ± 2.487 | 2.295 ± 0.951 | 2.295 ± 1.551 | 3.826 ± 1.022 | 2.678 ± 0.961 | 3.826 ± 1.41 | 0.0 ± 0.0 | 2.295 ± 0.804 | 0.0 ± 0.0
Lys | 1.913 ± 1.162 | 1.53 ± 0.513 | 1.53 ± 1.107 | 2.295 ± 0.996 | 1.148 ± 0.629 | 2.678 ± 1.32 | 1.913 ± 0.96 | 1.913 ± 1.041 | 2.295 ± 0.878 | 5.738 ± 2.038 | 1.148 ± 0.676 | 0.765 ± 0.404 | 1.913 ± 1.038 | 3.826 ± 1.14 | 5.356 ± 1.344 | 6.121 ± 2.5 | 1.148 ± 0.713 | 4.208 ± 0.984 | 0.765 ± 0.519 | 0.765 ± 0.632 | 0.0 ± 0.0
Leu | 3.06 ± 1.098 | 1.913 ± 0.621 | 4.591 ± 1.448 | 5.356 ± 1.513 | 4.973 ± 1.337 | 6.886 ± 1.911 | 1.53 ± 1.014 | 3.826 ± 0.648 | 8.034 ± 2.45 | 7.651 ± 1.805 | 2.295 ± 1.017 | 4.591 ± 1.389 | 4.208 ± 0.756 | 4.973 ± 1.178 | 4.973 ± 1.317 | 6.503 ± 1.267 | 5.738 ± 1.124 | 4.973 ± 1.401 | 0.765 ± 0.644 | 5.738 ± 0.87 | 0.0 ± 0.0
Met | 1.53 ± 0.685 | 1.148 ± 0.629 | 0.765 ± 0.632 | 0.765 ± 0.455 | 0.765 ± 0.446 | 0.765 ± 0.719 | 0.383 ± 0.388 | 1.148 ± 0.591 | 0.383 ± 0.468 | 2.678 ± 1.304 | 0.0 ± 0.0 | 0.765 ± 0.455 | 0.765 ± 0.404 | 0.383 ± 0.366 | 0.765 ± 0.443 | 1.913 ± 0.621 | 0.0 ± 0.0 | 0.765 ± 0.632 | 0.0 ± 0.0 | 1.53 ± 1.264 | 0.0 ± 0.0
Asn | 3.443 ± 1.273 | 2.678 ± 1.219 | 1.53 ± 0.493 | 1.148 ± 0.65 | 2.678 ± 0.926 | 2.295 ± 1.358 | 0.765 ± 0.607 | 2.295 ± 1.133 | 1.53 ± 0.57 | 3.443 ± 0.846 | 0.0 ± 0.0 | 2.295 ± 1.283 | 3.06 ± 1.306 | 1.53 ± 0.773 | 3.826 ± 1.243 | 4.208 ± 1.679 | 1.53 ± 1.047 | 0.765 ± 0.446 | 2.678 ± 0.88 | 1.148 ± 0.551 | 0.0 ± 0.0
Pro | 5.738 ± 1.718 | 0.383 ± 0.359 | 3.06 ± 1.31 | 2.678 ± 0.638 | 2.295 ± 1.059 | 2.678 ± 1.367 | 1.148 ± 0.722 | 3.06 ± 1.146 | 3.06 ± 0.51 | 4.591 ± 1.637 | 1.913 ± 0.96 | 2.295 ± 0.974 | 7.651 ± 2.813 | 1.53 ± 0.589 | 3.06 ± 1.358 | 6.121 ± 1.154 | 5.738 ± 2.459 | 1.148 ± 0.722 | 0.0 ± 0.0 | 1.148 ± 1.078 | 0.0 ± 0.0
Gln | 4.208 ± 1.237 | 0.765 ± 0.534 | 2.295 ± 0.985 | 3.06 ± 0.699 | 1.53 ± 0.807 | 3.06 ± 0.878 | 0.765 ± 0.404 | 2.295 ± 1.016 | 1.913 ± 1.146 | 4.973 ± 1.201 | 1.148 ± 0.551 | 1.53 ± 0.702 | 1.913 ± 0.514 | 2.678 ± 0.52 | 1.148 ± 0.906 | 4.208 ± 1.192 | 2.295 ± 0.981 | 3.06 ± 0.898 | 0.765 ± 0.432 | 1.913 ± 1.388 | 0.0 ± 0.0
Arg | 4.208 ± 0.91 | 1.53 ± 0.611 | 4.208 ± 1.308 | 1.913 ± 0.643 | 3.443 ± 1.655 | 6.121 ± 2.03 | 1.913 ± 0.664 | 2.678 ± 0.948 | 6.503 ± 2.409 | 5.356 ± 1.555 | 1.913 ± 0.783 | 1.913 ± 0.758 | 3.06 ± 1.467 | 2.678 ± 1.314 | 9.946 ± 4.051 | 2.678 ± 0.861 | 1.148 ± 0.539 | 2.295 ± 1.465 | 0.765 ± 0.442 | 1.148 ± 0.594 | 0.0 ± 0.0
Ser | 3.443 ± 0.602 | 0.383 ± 0.316 | 6.503 ± 0.978 | 4.973 ± 0.437 | 3.443 ± 0.911 | 6.886 ± 2.665 | 3.443 ± 0.696 | 1.913 ± 0.809 | 0.765 ± 0.632 | 9.564 ± 1.415 | 0.383 ± 0.316 | 3.443 ± 2.11 | 5.738 ± 2.111 | 3.443 ± 0.894 | 7.651 ± 1.707 | 9.564 ± 1.134 | 7.651 ± 2.861 | 3.06 ± 1.339 | 0.383 ± 0.359 | 3.443 ± 0.99 | 0.0 ± 0.0
Thr | 2.678 ± 0.467 | 1.148 ± 1.072 | 4.208 ± 0.972 | 7.269 ± 2.126 | 2.678 ± 1.229 | 3.443 ± 0.99 | 0.765 ± 0.455 | 3.06 ± 2.501 | 1.148 ± 0.433 | 6.503 ± 1.591 | 1.148 ± 0.629 | 1.53 ± 1.047 | 4.591 ± 0.869 | 2.295 ± 0.62 | 3.443 ± 1.079 | 6.121 ± 1.554 | 3.06 ± 0.767 | 3.826 ± 0.713 | 0.383 ± 0.316 | 3.443 ± 1.783 | 0.0 ± 0.0
Val | 2.678 ± 0.891 | 1.148 ± 0.561 | 3.826 ± 1.022 | 3.06 ± 0.827 | 1.913 ± 0.667 | 3.06 ± 0.873 | 1.53 ± 0.551 | 3.06 ± 0.909 | 3.443 ± 0.639 | 3.826 ± 1.196 | 0.765 ± 0.719 | 2.295 ± 0.401 | 3.826 ± 1.402 | 2.295 ± 0.902 | 2.678 ± 0.467 | 4.591 ± 1.028 | 3.06 ± 1.282 | 1.913 ± 0.588 | 1.53 ± 0.854 | 0.765 ± 0.446 | 0.0 ± 0.0
Trp | 0.383 ± 0.316 | 0.0 ± 0.0 | 1.53 ± 0.779 | 0.383 ± 0.388 | 0.383 ± 0.388 | 0.383 ± 0.423 | 0.383 ± 0.388 | 0.765 ± 0.632 | 1.148 ± 0.629 | 1.53 ± 0.647 | 0.0 ± 0.0 | 0.765 ± 0.607 | 0.383 ± 0.359 | 0.383 ± 0.359 | 1.148 ± 0.829 | 0.0 ± 0.0 | 1.148 ± 0.758 | 1.53 ± 0.518 | 0.0 ± 0.0 | 0.765 ± 0.632 | 0.0 ± 0.0
Tyr | 2.678 ± 0.515 | 0.383 ± 0.494 | 2.678 ± 0.885 | 0.765 ± 0.443 | 2.678 ± 1.094 | 1.53 ± 0.518 | 0.383 ± 0.316 | 1.913 ± 1.009 | 3.06 ± 1.44 | 3.06 ± 1.063 | 0.383 ± 0.308 | 2.678 ± 0.721 | 0.765 ± 0.503 | 1.913 ± 0.727 | 2.295 ± 0.794 | 2.295 ± 1.296 | 2.295 ± 0.582 | 1.913 ± 0.829 | 0.765 ± 0.455 | 1.148 ± 0.63 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 8 proteins (2615 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
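A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the recomputed values is taken as the error. This is only an illustration under stated assumptions, not the database's implementation. The function names, the fixed random seed, and the use of the sample standard deviation as the error measure are assumptions.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide's per mille
    frequency over n_rounds protein-level bootstrap resamples.

    Hypothetical helper; the error definition is an assumption.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy sequences, not real proteome data.
mean_mk, err_mk = bootstrap_error(["MKLLA", "MKKA", "AAAA"], "MK")
```

A dipeptide that never occurs yields 0.0 in every resample, which is consistent with the 0.0 ± 0.0 entries in the table.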

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski