Amino acid dipeptide frequency for Uncia uncia papillomavirus type 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
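As an illustration of what such a calculation might look like, here is a minimal Python sketch. It is not the database's own code: the function name `dipeptide_per_mille`, the use of one-letter codes, and the choice to count overlapping pairs within each protein (never across protein boundaries) are assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Assumption: overlapping pairs, counted within one protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


# Two toy sequences (hypothetical, not taken from this proteome).
freqs = dipeptide_per_mille(["MKLLA", "LLU"])
```

In this toy input, "U" is mapped to X, so the second sequence contributes the pairs LL and LX.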

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred randomly and with equal probability in proteins, each of the 400 would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
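The arithmetic behind the expected value can be sketched in a few lines of Python. The function names and the simple "above or below the uniform expectation" rule are assumptions for illustration; the exact threshold used for the red and blue marking is not specified here.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000 / alphabet_size ** 2


def classify(observed, expected=None):
    """Label a per-mille value relative to the uniform expectation."""
    if expected is None:
        expected = expected_per_mille(20)
    if observed > expected:
        return "over"   # would be marked in red
    if observed < expected:
        return "under"  # would be marked in blue
    return "expected"


# Values taken from the table: LeuLeu 14.115 and AlaCys 0.855 (per mille).
leu_leu = classify(14.115)
ala_cys = classify(0.855)
```

With 20 amino acids the expectation is 1000 / 400 = 2.5‰; with Xaa included it is 1000 / 441, roughly 2.27‰.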

All values are given in per mille (‰), so they must be multiplied by 10^-3 to obtain fractions (for example, 6.843‰ = 0.006843, i.e. about 0.68%).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.843 ± 2.009
AlaCys: 0.855 ± 0.925
AlaAsp: 5.133 ± 0.97
AlaGlu: 4.705 ± 1.557
AlaPhe: 2.994 ± 1.753
AlaGly: 3.422 ± 0.779
AlaHis: 0.0 ± 0.0
AlaIle: 1.711 ± 0.888
AlaLys: 3.422 ± 1.935
AlaLeu: 5.988 ± 0.881
AlaMet: 0.855 ± 0.512
AlaAsn: 0.855 ± 0.433
AlaPro: 3.422 ± 1.053
AlaGln: 1.711 ± 0.639
AlaArg: 2.994 ± 1.008
AlaSer: 4.705 ± 0.991
AlaThr: 5.56 ± 1.021
AlaVal: 3.849 ± 1.438
AlaTrp: 0.428 ± 0.32
AlaTyr: 2.139 ± 1.256
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.139 ± 1.787
CysCys: 0.855 ± 1.117
CysAsp: 0.428 ± 0.347
CysGlu: 1.283 ± 1.042
CysPhe: 1.283 ± 0.436
CysGly: 2.139 ± 2.089
CysHis: 0.0 ± 0.0
CysIle: 0.428 ± 0.347
CysLys: 2.994 ± 1.154
CysLeu: 2.566 ± 2.026
CysMet: 1.711 ± 1.226
CysAsn: 0.428 ± 0.32
CysPro: 2.139 ± 0.68
CysGln: 0.855 ± 0.703
CysArg: 1.283 ± 0.686
CysSer: 1.283 ± 0.873
CysThr: 1.283 ± 0.686
CysVal: 1.283 ± 1.677
CysTrp: 0.428 ± 0.347
CysTyr: 0.428 ± 0.723
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.566 ± 1.042
AspCys: 2.139 ± 1.736
AspAsp: 4.277 ± 1.235
AspGlu: 3.849 ± 1.385
AspPhe: 1.283 ± 0.502
AspGly: 2.139 ± 1.37
AspHis: 0.855 ± 0.371
AspIle: 3.849 ± 1.012
AspLys: 3.422 ± 1.535
AspLeu: 9.41 ± 2.754
AspMet: 0.428 ± 0.32
AspAsn: 2.139 ± 0.857
AspPro: 4.705 ± 1.743
AspGln: 1.711 ± 0.61
AspArg: 2.566 ± 1.326
AspSer: 5.133 ± 1.157
AspThr: 2.139 ± 0.716
AspVal: 2.566 ± 0.959
AspTrp: 2.139 ± 1.285
AspTyr: 0.428 ± 0.32
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.422 ± 1.346
GluCys: 1.283 ± 1.042
GluAsp: 5.56 ± 1.027
GluGlu: 7.271 ± 4.532
GluPhe: 2.139 ± 1.119
GluGly: 2.994 ± 1.26
GluHis: 0.855 ± 0.576
GluIle: 3.422 ± 1.316
GluLys: 3.849 ± 1.325
GluLeu: 2.994 ± 0.967
GluMet: 0.428 ± 0.347
GluAsn: 3.849 ± 0.994
GluPro: 3.422 ± 0.722
GluGln: 6.843 ± 2.37
GluArg: 3.422 ± 0.948
GluSer: 2.994 ± 0.72
GluThr: 5.133 ± 1.115
GluVal: 4.277 ± 0.76
GluTrp: 0.428 ± 0.347
GluTyr: 0.855 ± 0.64
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.422 ± 0.853
PheCys: 2.139 ± 0.922
PheAsp: 3.422 ± 0.736
PheGlu: 2.139 ± 0.994
PhePhe: 2.994 ± 1.295
PheGly: 2.994 ± 0.743
PheHis: 0.0 ± 0.0
PheIle: 0.855 ± 0.433
PheLys: 2.994 ± 1.097
PheLeu: 5.56 ± 1.854
PheMet: 0.855 ± 0.694
PheAsn: 0.855 ± 0.64
PhePro: 2.566 ± 0.998
PheGln: 2.139 ± 0.909
PheArg: 2.139 ± 0.71
PheSer: 2.566 ± 1.368
PheThr: 2.994 ± 1.951
PheVal: 0.855 ± 0.755
PheTrp: 1.711 ± 1.035
PheTyr: 0.855 ± 0.64
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.849 ± 1.062
GlyCys: 0.855 ± 0.576
GlyAsp: 5.56 ± 0.643
GlyGlu: 4.705 ± 1.311
GlyPhe: 2.994 ± 0.702
GlyGly: 6.843 ± 2.963
GlyHis: 1.711 ± 0.644
GlyIle: 2.139 ± 0.423
GlyLys: 3.849 ± 1.639
GlyLeu: 6.416 ± 2.084
GlyMet: 0.855 ± 0.719
GlyAsn: 3.422 ± 1.316
GlyPro: 1.711 ± 0.624
GlyGln: 3.849 ± 0.446
GlyArg: 3.422 ± 1.31
GlySer: 7.271 ± 2.176
GlyThr: 2.994 ± 1.155
GlyVal: 5.988 ± 1.521
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.428 ± 0.357
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.855 ± 0.64
HisCys: 0.428 ± 0.347
HisAsp: 0.0 ± 0.0
HisGlu: 2.139 ± 0.423
HisPhe: 0.428 ± 0.357
HisGly: 0.428 ± 0.723
HisHis: 0.0 ± 0.0
HisIle: 1.283 ± 1.07
HisLys: 0.855 ± 0.478
HisLeu: 1.711 ± 1.069
HisMet: 0.0 ± 0.0
HisAsn: 0.428 ± 0.559
HisPro: 0.855 ± 0.433
HisGln: 1.283 ± 0.796
HisArg: 0.0 ± 0.0
HisSer: 0.855 ± 0.371
HisThr: 0.855 ± 0.64
HisVal: 1.711 ± 0.61
HisTrp: 0.855 ± 0.487
HisTyr: 0.855 ± 0.542
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.139 ± 0.909
IleCys: 0.0 ± 0.0
IleAsp: 0.855 ± 0.429
IleGlu: 2.994 ± 1.187
IlePhe: 1.711 ± 0.61
IleGly: 3.849 ± 1.675
IleHis: 0.428 ± 0.32
IleIle: 1.283 ± 0.4
IleLys: 1.283 ± 0.673
IleLeu: 2.139 ± 1.41
IleMet: 0.0 ± 0.0
IleAsn: 1.711 ± 0.622
IlePro: 1.711 ± 1.161
IleGln: 2.139 ± 0.934
IleArg: 0.855 ± 0.542
IleSer: 4.705 ± 2.167
IleThr: 0.855 ± 0.487
IleVal: 2.994 ± 0.72
IleTrp: 0.855 ± 0.619
IleTyr: 1.283 ± 0.419
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.849 ± 1.06
LysCys: 1.711 ± 0.591
LysAsp: 0.855 ± 0.478
LysGlu: 2.566 ± 1.103
LysPhe: 3.422 ± 1.343
LysGly: 5.56 ± 1.938
LysHis: 0.855 ± 0.694
LysIle: 1.283 ± 0.64
LysLys: 3.849 ± 1.575
LysLeu: 5.988 ± 2.532
LysMet: 0.855 ± 0.904
LysAsn: 1.711 ± 0.764
LysPro: 2.994 ± 0.986
LysGln: 2.566 ± 1.227
LysArg: 4.705 ± 0.932
LysSer: 2.994 ± 1.261
LysThr: 3.849 ± 2.035
LysVal: 1.283 ± 0.673
LysTrp: 0.0 ± 0.0
LysTyr: 2.139 ± 0.468
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.843 ± 1.409
LeuCys: 2.566 ± 1.913
LeuAsp: 3.849 ± 0.744
LeuGlu: 6.416 ± 1.457
LeuPhe: 4.705 ± 1.338
LeuGly: 6.843 ± 2.149
LeuHis: 2.139 ± 0.504
LeuIle: 1.283 ± 0.4
LeuLys: 3.849 ± 1.261
LeuLeu: 14.115 ± 3.333
LeuMet: 1.283 ± 0.803
LeuAsn: 2.994 ± 0.632
LeuPro: 5.133 ± 1.43
LeuGln: 6.843 ± 2.186
LeuArg: 7.699 ± 2.404
LeuSer: 9.41 ± 2.625
LeuThr: 5.56 ± 1.329
LeuVal: 6.416 ± 1.3
LeuTrp: 0.855 ± 0.478
LeuTyr: 3.849 ± 1.157
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.139 ± 0.988
MetCys: 0.428 ± 0.347
MetAsp: 2.139 ± 0.716
MetGlu: 0.855 ± 0.478
MetPhe: 0.428 ± 0.32
MetGly: 0.428 ± 0.347
MetHis: 0.0 ± 0.0
MetIle: 0.855 ± 0.734
MetLys: 0.428 ± 0.559
MetLeu: 1.283 ± 0.419
MetMet: 0.0 ± 0.0
MetAsn: 0.855 ± 0.487
MetPro: 0.428 ± 0.723
MetGln: 0.855 ± 0.429
MetArg: 0.855 ± 0.619
MetSer: 0.855 ± 0.694
MetThr: 0.428 ± 0.347
MetVal: 0.428 ± 0.32
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.283 ± 1.042
AsnCys: 0.855 ± 0.645
AsnAsp: 2.566 ± 0.541
AsnGlu: 0.428 ± 0.32
AsnPhe: 0.855 ± 0.433
AsnGly: 1.283 ± 0.673
AsnHis: 1.711 ± 1.468
AsnIle: 1.283 ± 0.673
AsnLys: 2.139 ± 0.966
AsnLeu: 1.711 ± 0.792
AsnMet: 0.428 ± 0.32
AsnAsn: 1.283 ± 0.673
AsnPro: 3.422 ± 1.057
AsnGln: 1.283 ± 0.673
AsnArg: 2.139 ± 0.856
AsnSer: 2.566 ± 0.515
AsnThr: 2.139 ± 0.934
AsnVal: 3.849 ± 0.707
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.855 ± 0.429
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.277 ± 1.966
ProCys: 0.855 ± 0.645
ProAsp: 2.566 ± 0.962
ProGlu: 5.56 ± 1.678
ProPhe: 2.139 ± 1.124
ProGly: 2.994 ± 1.373
ProHis: 0.855 ± 0.713
ProIle: 2.566 ± 1.35
ProLys: 3.849 ± 1.086
ProLeu: 4.705 ± 1.454
ProMet: 0.0 ± 0.0
ProAsn: 2.566 ± 1.142
ProPro: 9.41 ± 3.986
ProGln: 2.139 ± 1.403
ProArg: 5.56 ± 2.211
ProSer: 4.705 ± 1.712
ProThr: 5.133 ± 1.408
ProVal: 5.988 ± 1.579
ProTrp: 0.428 ± 0.487
ProTyr: 1.283 ± 0.665
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.283 ± 0.68
GlnCys: 1.711 ± 0.622
GlnAsp: 1.283 ± 0.64
GlnGlu: 3.849 ± 1.039
GlnPhe: 1.711 ± 0.804
GlnGly: 3.849 ± 0.931
GlnHis: 1.283 ± 0.607
GlnIle: 1.283 ± 0.419
GlnLys: 1.283 ± 0.976
GlnLeu: 5.56 ± 1.643
GlnMet: 1.283 ± 0.74
GlnAsn: 2.566 ± 0.982
GlnPro: 3.422 ± 1.34
GlnGln: 3.422 ± 0.616
GlnArg: 2.139 ± 0.446
GlnSer: 2.566 ± 1.054
GlnThr: 4.277 ± 1.051
GlnVal: 1.711 ± 0.828
GlnTrp: 1.283 ± 0.686
GlnTyr: 1.283 ± 0.4
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.994 ± 0.729
ArgCys: 2.139 ± 1.281
ArgAsp: 3.422 ± 1.232
ArgGlu: 2.994 ± 1.656
ArgPhe: 2.994 ± 1.023
ArgGly: 5.988 ± 2.066
ArgHis: 1.711 ± 0.644
ArgIle: 1.711 ± 0.722
ArgLys: 3.849 ± 1.198
ArgLeu: 8.127 ± 1.721
ArgMet: 1.283 ± 0.436
ArgAsn: 0.855 ± 0.694
ArgPro: 5.56 ± 3.053
ArgGln: 0.428 ± 0.487
ArgArg: 4.705 ± 0.731
ArgSer: 5.988 ± 0.78
ArgThr: 2.139 ± 0.758
ArgVal: 5.988 ± 1.92
ArgTrp: 1.283 ± 0.436
ArgTyr: 1.711 ± 0.818
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.705 ± 1.359
SerCys: 1.711 ± 1.592
SerAsp: 6.843 ± 1.76
SerGlu: 3.422 ± 1.75
SerPhe: 1.711 ± 0.939
SerGly: 7.699 ± 2.527
SerHis: 0.428 ± 0.357
SerIle: 1.283 ± 0.64
SerLys: 3.422 ± 0.603
SerLeu: 10.265 ± 2.443
SerMet: 0.428 ± 0.347
SerAsn: 1.283 ± 0.64
SerPro: 4.705 ± 0.817
SerGln: 2.994 ± 0.854
SerArg: 5.988 ± 1.622
SerSer: 5.988 ± 2.075
SerThr: 4.705 ± 1.54
SerVal: 6.843 ± 0.915
SerTrp: 0.855 ± 0.429
SerTyr: 2.994 ± 0.967
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.422 ± 0.644
ThrCys: 0.855 ± 0.371
ThrAsp: 2.994 ± 0.81
ThrGlu: 2.566 ± 0.98
ThrPhe: 3.849 ± 0.772
ThrGly: 3.422 ± 1.756
ThrHis: 0.428 ± 0.32
ThrIle: 1.711 ± 0.9
ThrLys: 1.711 ± 0.62
ThrLeu: 4.277 ± 0.93
ThrMet: 1.711 ± 0.61
ThrAsn: 1.283 ± 0.673
ThrPro: 4.705 ± 1.202
ThrGln: 2.994 ± 0.995
ThrArg: 8.127 ± 1.508
ThrSer: 4.705 ± 1.982
ThrThr: 4.705 ± 1.743
ThrVal: 4.705 ± 0.865
ThrTrp: 0.428 ± 0.487
ThrTyr: 1.283 ± 0.96
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.994 ± 1.389
ValCys: 2.566 ± 1.388
ValAsp: 4.705 ± 0.932
ValGlu: 5.56 ± 1.084
ValPhe: 5.133 ± 1.821
ValGly: 2.994 ± 1.225
ValHis: 1.283 ± 0.4
ValIle: 2.566 ± 1.658
ValLys: 3.849 ± 1.23
ValLeu: 5.988 ± 0.629
ValMet: 0.855 ± 0.649
ValAsn: 1.711 ± 0.875
ValPro: 5.56 ± 1.453
ValGln: 2.139 ± 0.796
ValArg: 4.705 ± 1.572
ValSer: 6.843 ± 2.096
ValThr: 3.422 ± 1.131
ValVal: 2.994 ± 1.142
ValTrp: 0.855 ± 0.64
ValTyr: 0.428 ± 0.559
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.283 ± 0.712
TrpCys: 0.428 ± 0.32
TrpAsp: 0.428 ± 0.347
TrpGlu: 0.855 ± 0.64
TrpPhe: 0.0 ± 0.0
TrpGly: 1.283 ± 0.419
TrpHis: 0.428 ± 0.32
TrpIle: 1.711 ± 0.859
TrpLys: 1.283 ± 0.849
TrpLeu: 1.711 ± 1.035
TrpMet: 0.0 ± 0.0
TrpAsn: 0.855 ± 0.478
TrpPro: 0.0 ± 0.0
TrpGln: 0.428 ± 0.487
TrpArg: 0.428 ± 0.487
TrpSer: 0.0 ± 0.0
TrpThr: 1.283 ± 1.46
TrpVal: 0.855 ± 0.694
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.428 ± 0.347
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.283 ± 0.436
TyrCys: 0.855 ± 0.864
TyrAsp: 0.855 ± 0.371
TyrGlu: 1.283 ± 1.012
TyrPhe: 1.283 ± 0.436
TyrGly: 1.711 ± 0.804
TyrHis: 0.855 ± 0.487
TyrIle: 1.283 ± 0.673
TyrLys: 1.283 ± 0.712
TyrLeu: 2.139 ± 1.124
TyrMet: 0.0 ± 0.0
TyrAsn: 0.428 ± 0.32
TyrPro: 1.711 ± 0.867
TyrGln: 0.428 ± 0.347
TyrArg: 2.139 ± 0.807
TyrSer: 2.139 ± 1.291
TyrThr: 0.428 ± 0.32
TyrVal: 2.566 ± 1.177
TyrTrp: 0.855 ± 0.429
TyrTyr: 2.994 ± 1.324
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2339 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
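A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the error is taken to be the sample standard deviation across replicates, and the function name `bootstrap_error`, the `seed` parameter, and the pair-counting rule are hypothetical rather than taken from the database.

```python
import random
import statistics
from collections import Counter


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Spread (sample SD, per mille) of one dipeptide frequency across replicates."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        counts = Counter()
        for seq in sample:
            # Assumption: overlapping pairs, counted within one protein only.
            counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
        total = sum(counts.values())
        estimates.append(1000 * counts[dipeptide] / total if total else 0.0)
    return statistics.stdev(estimates)


# Toy sequences (hypothetical, not taken from this proteome).
toy = ["MKLLA", "LLSV", "MAAK", "GLLP"]
err = bootstrap_error(toy, "LL")
```

Because resampling is done per protein, a dipeptide concentrated in one or two proteins gets a larger error than one spread evenly across the proteome, which is consistent with the large errors on some frequent pairs in the table.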

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski