Amino acid dipepetide frequency for Gammapapillomavirus 11

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.417AlaAla: 2.417 ± 1.178
2.417AlaCys: 2.417 ± 1.376
5.238AlaAsp: 5.238 ± 1.135
0.806AlaGlu: 0.806 ± 0.414
4.029AlaPhe: 4.029 ± 1.155
2.015AlaGly: 2.015 ± 1.102
0.806AlaHis: 0.806 ± 0.431
4.835AlaIle: 4.835 ± 1.582
2.82AlaLys: 2.82 ± 1.115
5.641AlaLeu: 5.641 ± 1.921
0.806AlaMet: 0.806 ± 0.314
3.223AlaAsn: 3.223 ± 1.052
2.82AlaPro: 2.82 ± 1.125
2.417AlaGln: 2.417 ± 1.183
2.417AlaArg: 2.417 ± 0.847
4.835AlaSer: 4.835 ± 1.271
3.223AlaThr: 3.223 ± 0.772
1.612AlaVal: 1.612 ± 0.46
1.209AlaTrp: 1.209 ± 0.539
1.612AlaTyr: 1.612 ± 0.903
0.0AlaXaa: 0.0 ± 0.0
Cys
0.806CysAla: 0.806 ± 0.615
1.209CysCys: 1.209 ± 0.775
1.209CysAsp: 1.209 ± 0.922
2.82CysGlu: 2.82 ± 1.582
1.209CysPhe: 1.209 ± 0.39
0.403CysGly: 0.403 ± 0.352
0.806CysHis: 0.806 ± 0.452
1.209CysIle: 1.209 ± 0.583
3.626CysLys: 3.626 ± 1.316
1.209CysLeu: 1.209 ± 1.541
0.0CysMet: 0.0 ± 0.0
0.806CysAsn: 0.806 ± 0.623
1.612CysPro: 1.612 ± 0.842
0.403CysGln: 0.403 ± 0.307
0.403CysArg: 0.403 ± 0.514
1.209CysSer: 1.209 ± 0.789
0.806CysThr: 0.806 ± 0.581
0.806CysVal: 0.806 ± 0.581
0.806CysTrp: 0.806 ± 0.445
1.612CysTyr: 1.612 ± 1.61
0.0CysXaa: 0.0 ± 0.0
Asp
5.641AspAla: 5.641 ± 1.089
1.612AspCys: 1.612 ± 0.895
6.044AspAsp: 6.044 ± 2.069
4.029AspGlu: 4.029 ± 1.698
1.209AspPhe: 1.209 ± 0.688
2.82AspGly: 2.82 ± 1.111
0.806AspHis: 0.806 ± 0.452
3.223AspIle: 3.223 ± 1.588
1.612AspLys: 1.612 ± 0.827
5.641AspLeu: 5.641 ± 3.63
1.612AspMet: 1.612 ± 0.658
4.029AspAsn: 4.029 ± 0.884
3.223AspPro: 3.223 ± 1.265
2.015AspGln: 2.015 ± 0.77
2.015AspArg: 2.015 ± 0.945
4.432AspSer: 4.432 ± 0.89
5.641AspThr: 5.641 ± 1.733
4.835AspVal: 4.835 ± 2.327
0.806AspTrp: 0.806 ± 0.314
1.209AspTyr: 1.209 ± 0.713
0.0AspXaa: 0.0 ± 0.0
Glu
2.417GluAla: 2.417 ± 0.601
0.806GluCys: 0.806 ± 0.615
4.835GluAsp: 4.835 ± 1.581
7.252GluGlu: 7.252 ± 2.129
2.417GluPhe: 2.417 ± 1.03
2.417GluGly: 2.417 ± 0.601
1.612GluHis: 1.612 ± 0.74
3.626GluIle: 3.626 ± 0.544
4.029GluLys: 4.029 ± 1.813
4.029GluLeu: 4.029 ± 1.022
0.806GluMet: 0.806 ± 0.387
5.238GluAsn: 5.238 ± 1.035
2.417GluPro: 2.417 ± 1.077
2.82GluGln: 2.82 ± 1.269
2.417GluArg: 2.417 ± 1.022
4.835GluSer: 4.835 ± 1.589
2.82GluThr: 2.82 ± 1.045
4.029GluVal: 4.029 ± 0.904
0.806GluTrp: 0.806 ± 0.832
1.612GluTyr: 1.612 ± 0.632
0.0GluXaa: 0.0 ± 0.0
Phe
2.417PheAla: 2.417 ± 0.745
2.82PheCys: 2.82 ± 1.299
3.626PheAsp: 3.626 ± 0.887
4.029PheGlu: 4.029 ± 1.325
3.223PhePhe: 3.223 ± 0.999
0.403PheGly: 0.403 ± 0.329
0.806PheHis: 0.806 ± 0.431
2.417PheIle: 2.417 ± 1.182
2.015PheLys: 2.015 ± 1.187
4.835PheLeu: 4.835 ± 1.732
1.209PheMet: 1.209 ± 0.736
2.015PheAsn: 2.015 ± 0.565
1.612PhePro: 1.612 ± 0.516
1.612PheGln: 1.612 ± 0.603
0.806PheArg: 0.806 ± 0.314
2.82PheSer: 2.82 ± 0.941
2.015PheThr: 2.015 ± 0.585
4.432PheVal: 4.432 ± 1.113
1.612PheTrp: 1.612 ± 0.687
2.417PheTyr: 2.417 ± 0.667
0.0PheXaa: 0.0 ± 0.0
Gly
2.417GlyAla: 2.417 ± 0.769
0.403GlyCys: 0.403 ± 0.31
3.223GlyAsp: 3.223 ± 1.322
3.223GlyGlu: 3.223 ± 1.115
1.612GlyPhe: 1.612 ± 0.954
4.029GlyGly: 4.029 ± 1.454
1.612GlyHis: 1.612 ± 0.762
4.029GlyIle: 4.029 ± 1.283
3.223GlyLys: 3.223 ± 1.01
5.641GlyLeu: 5.641 ± 1.523
0.806GlyMet: 0.806 ± 0.414
4.835GlyAsn: 4.835 ± 0.86
2.015GlyPro: 2.015 ± 0.883
0.806GlyGln: 0.806 ± 0.42
2.82GlyArg: 2.82 ± 1.164
4.432GlySer: 4.432 ± 0.928
1.612GlyThr: 1.612 ± 0.553
2.417GlyVal: 2.417 ± 0.919
0.0GlyTrp: 0.0 ± 0.0
1.209GlyTyr: 1.209 ± 0.489
0.0GlyXaa: 0.0 ± 0.0
His
0.806HisAla: 0.806 ± 0.572
0.0HisCys: 0.0 ± 0.0
0.403HisAsp: 0.403 ± 0.416
0.403HisGlu: 0.403 ± 0.329
3.223HisPhe: 3.223 ± 0.708
0.806HisGly: 0.806 ± 0.623
1.209HisHis: 1.209 ± 0.722
0.403HisIle: 0.403 ± 0.31
2.015HisLys: 2.015 ± 0.565
0.806HisLeu: 0.806 ± 0.832
0.403HisMet: 0.403 ± 0.334
0.403HisAsn: 0.403 ± 0.416
2.417HisPro: 2.417 ± 1.375
1.612HisGln: 1.612 ± 0.796
0.403HisArg: 0.403 ± 0.329
2.015HisSer: 2.015 ± 0.676
1.209HisThr: 1.209 ± 0.723
0.403HisVal: 0.403 ± 0.329
0.806HisTrp: 0.806 ± 0.572
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.82IleAla: 2.82 ± 0.824
0.403IleCys: 0.403 ± 0.514
4.432IleAsp: 4.432 ± 1.212
2.82IleGlu: 2.82 ± 0.941
2.015IlePhe: 2.015 ± 0.709
3.626IleGly: 3.626 ± 1.515
0.806IleHis: 0.806 ± 0.658
2.82IleIle: 2.82 ± 0.961
1.612IleLys: 1.612 ± 0.784
4.432IleLeu: 4.432 ± 1.397
1.612IleMet: 1.612 ± 0.862
5.641IleAsn: 5.641 ± 1.816
3.626IlePro: 3.626 ± 1.537
4.029IleGln: 4.029 ± 1.823
4.432IleArg: 4.432 ± 1.193
3.223IleSer: 3.223 ± 0.821
2.417IleThr: 2.417 ± 0.644
5.641IleVal: 5.641 ± 1.502
0.0IleTrp: 0.0 ± 0.0
3.223IleTyr: 3.223 ± 0.656
0.0IleXaa: 0.0 ± 0.0
Lys
3.626LysAla: 3.626 ± 1.136
2.417LysCys: 2.417 ± 0.68
2.82LysAsp: 2.82 ± 1.118
2.417LysGlu: 2.417 ± 1.082
3.223LysPhe: 3.223 ± 1.141
2.417LysGly: 2.417 ± 0.78
1.209LysHis: 1.209 ± 0.688
2.82LysIle: 2.82 ± 1.307
4.432LysLys: 4.432 ± 0.767
5.238LysLeu: 5.238 ± 1.854
1.612LysMet: 1.612 ± 0.796
2.417LysAsn: 2.417 ± 1.61
1.612LysPro: 1.612 ± 0.842
3.626LysGln: 3.626 ± 1.367
5.238LysArg: 5.238 ± 0.746
3.223LysSer: 3.223 ± 1.169
1.209LysThr: 1.209 ± 0.642
3.223LysVal: 3.223 ± 0.552
1.209LysTrp: 1.209 ± 0.541
2.82LysTyr: 2.82 ± 1.15
0.0LysXaa: 0.0 ± 0.0
Leu
6.849LeuAla: 6.849 ± 1.719
3.223LeuCys: 3.223 ± 1.435
6.044LeuAsp: 6.044 ± 1.276
6.446LeuGlu: 6.446 ± 1.505
4.432LeuPhe: 4.432 ± 1.324
4.029LeuGly: 4.029 ± 1.263
1.612LeuHis: 1.612 ± 0.787
5.238LeuIle: 5.238 ± 1.839
8.058LeuLys: 8.058 ± 2.736
9.67LeuLeu: 9.67 ± 1.792
1.209LeuMet: 1.209 ± 0.866
4.029LeuAsn: 4.029 ± 0.76
6.446LeuPro: 6.446 ± 1.631
4.432LeuGln: 4.432 ± 1.145
3.626LeuArg: 3.626 ± 0.688
5.238LeuSer: 5.238 ± 1.671
4.029LeuThr: 4.029 ± 1.298
4.029LeuVal: 4.029 ± 1.695
0.403LeuTrp: 0.403 ± 0.352
4.432LeuTyr: 4.432 ± 1.017
0.0LeuXaa: 0.0 ± 0.0
Met
1.612MetAla: 1.612 ± 0.705
1.612MetCys: 1.612 ± 0.692
1.209MetAsp: 1.209 ± 0.417
0.806MetGlu: 0.806 ± 0.564
0.0MetPhe: 0.0 ± 0.0
1.209MetGly: 1.209 ± 0.669
0.403MetHis: 0.403 ± 0.352
1.209MetIle: 1.209 ± 0.541
0.806MetLys: 0.806 ± 0.452
1.209MetLeu: 1.209 ± 0.468
0.806MetMet: 0.806 ± 0.594
2.015MetAsn: 2.015 ± 1.025
0.806MetPro: 0.806 ± 0.615
0.806MetGln: 0.806 ± 0.431
1.209MetArg: 1.209 ± 0.539
1.612MetSer: 1.612 ± 0.743
2.015MetThr: 2.015 ± 0.77
1.209MetVal: 1.209 ± 0.737
0.0MetTrp: 0.0 ± 0.0
0.806MetTyr: 0.806 ± 0.387
0.0MetXaa: 0.0 ± 0.0
Asn
3.626AsnAla: 3.626 ± 0.572
0.403AsnCys: 0.403 ± 0.416
1.209AsnAsp: 1.209 ± 0.762
4.029AsnGlu: 4.029 ± 1.26
2.82AsnPhe: 2.82 ± 0.621
3.223AsnGly: 3.223 ± 0.941
0.403AsnHis: 0.403 ± 0.307
4.029AsnIle: 4.029 ± 1.352
2.82AsnLys: 2.82 ± 0.488
7.655AsnLeu: 7.655 ± 2.619
1.209AsnMet: 1.209 ± 0.922
4.029AsnAsn: 4.029 ± 0.971
1.612AsnPro: 1.612 ± 0.609
3.223AsnGln: 3.223 ± 0.815
2.82AsnArg: 2.82 ± 0.956
3.626AsnSer: 3.626 ± 0.883
4.432AsnThr: 4.432 ± 1.393
2.015AsnVal: 2.015 ± 0.452
1.612AsnTrp: 1.612 ± 0.891
0.806AsnTyr: 0.806 ± 0.572
0.0AsnXaa: 0.0 ± 0.0
Pro
3.223ProAla: 3.223 ± 1.676
0.806ProCys: 0.806 ± 0.572
3.223ProAsp: 3.223 ± 0.597
4.029ProGlu: 4.029 ± 0.755
2.015ProPhe: 2.015 ± 0.646
2.82ProGly: 2.82 ± 1.136
0.403ProHis: 0.403 ± 0.329
3.223ProIle: 3.223 ± 1.568
4.432ProLys: 4.432 ± 0.655
5.641ProLeu: 5.641 ± 1.369
0.403ProMet: 0.403 ± 0.307
3.626ProAsn: 3.626 ± 1.149
8.864ProPro: 8.864 ± 2.164
2.015ProGln: 2.015 ± 0.375
3.223ProArg: 3.223 ± 1.16
4.432ProSer: 4.432 ± 0.951
5.641ProThr: 5.641 ± 1.219
1.612ProVal: 1.612 ± 0.687
0.0ProTrp: 0.0 ± 0.0
1.612ProTyr: 1.612 ± 0.872
0.0ProXaa: 0.0 ± 0.0
Gln
1.209GlnAla: 1.209 ± 0.541
0.806GlnCys: 0.806 ± 0.581
2.015GlnAsp: 2.015 ± 0.923
3.626GlnGlu: 3.626 ± 1.283
2.417GlnPhe: 2.417 ± 0.471
2.417GlnGly: 2.417 ± 0.919
0.806GlnHis: 0.806 ± 0.42
3.626GlnIle: 3.626 ± 1.169
1.612GlnLys: 1.612 ± 0.66
6.044GlnLeu: 6.044 ± 1.029
2.015GlnMet: 2.015 ± 0.649
2.015GlnAsn: 2.015 ± 0.64
2.015GlnPro: 2.015 ± 0.81
2.82GlnGln: 2.82 ± 1.144
1.209GlnArg: 1.209 ± 0.762
1.612GlnSer: 1.612 ± 0.743
3.223GlnThr: 3.223 ± 0.686
3.223GlnVal: 3.223 ± 0.954
1.209GlnTrp: 1.209 ± 0.642
2.417GlnTyr: 2.417 ± 0.953
0.0GlnXaa: 0.0 ± 0.0
Arg
2.82ArgAla: 2.82 ± 0.732
1.209ArgCys: 1.209 ± 1.053
3.223ArgAsp: 3.223 ± 0.766
2.015ArgGlu: 2.015 ± 0.896
3.223ArgPhe: 3.223 ± 0.698
3.626ArgGly: 3.626 ± 1.426
2.82ArgHis: 2.82 ± 1.005
3.223ArgIle: 3.223 ± 0.701
4.432ArgLys: 4.432 ± 0.634
6.044ArgLeu: 6.044 ± 1.261
0.0ArgMet: 0.0 ± 0.0
2.417ArgAsn: 2.417 ± 0.734
3.223ArgPro: 3.223 ± 1.046
1.612ArgGln: 1.612 ± 0.706
6.849ArgArg: 6.849 ± 3.593
4.835ArgSer: 4.835 ± 1.486
0.806ArgThr: 0.806 ± 0.53
2.015ArgVal: 2.015 ± 1.092
0.0ArgTrp: 0.0 ± 0.0
1.612ArgTyr: 1.612 ± 1.201
0.0ArgXaa: 0.0 ± 0.0
Ser
2.015SerAla: 2.015 ± 1.186
0.403SerCys: 0.403 ± 0.329
2.417SerAsp: 2.417 ± 1.042
3.626SerGlu: 3.626 ± 0.971
2.015SerPhe: 2.015 ± 0.69
2.82SerGly: 2.82 ± 0.37
2.015SerHis: 2.015 ± 0.657
3.223SerIle: 3.223 ± 0.866
2.015SerLys: 2.015 ± 1.537
6.446SerLeu: 6.446 ± 1.708
2.015SerMet: 2.015 ± 0.513
3.626SerAsn: 3.626 ± 1.611
6.849SerPro: 6.849 ± 2.421
2.417SerGln: 2.417 ± 0.632
4.835SerArg: 4.835 ± 1.364
5.641SerSer: 5.641 ± 1.673
8.058SerThr: 8.058 ± 2.831
4.432SerVal: 4.432 ± 1.388
0.0SerTrp: 0.0 ± 0.0
2.417SerTyr: 2.417 ± 1.093
0.0SerXaa: 0.0 ± 0.0
Thr
4.432ThrAla: 4.432 ± 1.903
0.806ThrCys: 0.806 ± 0.572
4.835ThrAsp: 4.835 ± 0.822
2.82ThrGlu: 2.82 ± 0.75
2.417ThrPhe: 2.417 ± 0.774
4.432ThrGly: 4.432 ± 1.255
0.0ThrHis: 0.0 ± 0.0
2.82ThrIle: 2.82 ± 1.235
1.612ThrLys: 1.612 ± 0.935
3.223ThrLeu: 3.223 ± 1.017
2.417ThrMet: 2.417 ± 0.774
1.612ThrAsn: 1.612 ± 0.935
4.835ThrPro: 4.835 ± 1.868
3.223ThrGln: 3.223 ± 1.675
3.223ThrArg: 3.223 ± 0.811
4.029ThrSer: 4.029 ± 1.244
4.835ThrThr: 4.835 ± 1.614
4.432ThrVal: 4.432 ± 1.432
0.806ThrTrp: 0.806 ± 0.42
2.417ThrTyr: 2.417 ± 0.707
0.0ThrXaa: 0.0 ± 0.0
Val
2.82ValAla: 2.82 ± 0.771
0.806ValCys: 0.806 ± 1.079
3.626ValAsp: 3.626 ± 1.25
2.82ValGlu: 2.82 ± 1.041
2.015ValPhe: 2.015 ± 0.612
4.029ValGly: 4.029 ± 1.607
0.806ValHis: 0.806 ± 0.62
3.223ValIle: 3.223 ± 1.529
1.612ValLys: 1.612 ± 0.706
4.432ValLeu: 4.432 ± 1.401
2.015ValMet: 2.015 ± 0.624
2.82ValAsn: 2.82 ± 0.742
4.029ValPro: 4.029 ± 1.763
4.432ValGln: 4.432 ± 1.058
4.029ValArg: 4.029 ± 1.196
3.223ValSer: 3.223 ± 0.531
3.223ValThr: 3.223 ± 0.821
3.223ValVal: 3.223 ± 1.093
1.209ValTrp: 1.209 ± 0.618
1.209ValTyr: 1.209 ± 0.544
0.0ValXaa: 0.0 ± 0.0
Trp
0.806TrpAla: 0.806 ± 0.387
0.0TrpCys: 0.0 ± 0.0
0.403TrpAsp: 0.403 ± 0.31
0.0TrpGlu: 0.0 ± 0.0
1.209TrpPhe: 1.209 ± 0.612
0.403TrpGly: 0.403 ± 0.31
0.403TrpHis: 0.403 ± 0.416
2.015TrpIle: 2.015 ± 1.062
2.015TrpLys: 2.015 ± 0.823
0.806TrpLeu: 0.806 ± 0.615
0.0TrpMet: 0.0 ± 0.0
0.403TrpAsn: 0.403 ± 0.31
0.403TrpPro: 0.403 ± 0.31
0.403TrpGln: 0.403 ± 0.31
1.612TrpArg: 1.612 ± 0.697
0.403TrpSer: 0.403 ± 0.31
0.806TrpThr: 0.806 ± 0.832
0.403TrpVal: 0.403 ± 0.31
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.417TyrAla: 2.417 ± 0.812
0.806TyrCys: 0.806 ± 0.581
2.417TyrAsp: 2.417 ± 1.05
3.223TyrGlu: 3.223 ± 1.492
2.82TyrPhe: 2.82 ± 0.794
2.417TyrGly: 2.417 ± 0.741
0.403TyrHis: 0.403 ± 0.329
2.417TyrIle: 2.417 ± 0.597
2.015TyrLys: 2.015 ± 0.832
4.835TyrLeu: 4.835 ± 1.544
0.0TyrMet: 0.0 ± 0.0
0.403TyrAsn: 0.403 ± 0.416
0.806TyrPro: 0.806 ± 0.62
1.612TyrGln: 1.612 ± 0.73
2.417TyrArg: 2.417 ± 0.967
1.612TyrSer: 1.612 ± 0.696
1.209TyrThr: 1.209 ± 0.676
1.612TyrVal: 1.612 ± 1.026
0.0TyrTrp: 0.0 ± 0.0
2.417TyrTyr: 2.417 ± 0.601
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2483 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski