Amino acid dipeptide frequency for Gammapapillomavirus 12

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
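As an illustration of what a dipeptide frequency is, here is a minimal Python sketch. It is not the code used to build this page. It assumes one-letter amino acid codes (the table uses three-letter codes), counts overlapping pairs inside each protein only, and normalises by the total number of pairs counted. The function name dipeptide_frequencies and the toy sequences are hypothetical.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: pairs are counted with overlapping windows of length 2
    and never span the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # 1000 * fraction gives per mille
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from this proteome)
toy = ["MKKLA", "AKK"]
freqs = dipeptide_frequencies(toy)
```

For the toy input, six pairs are counted in total (MK, KK, KL, LA, AK, KK), so freqs["KK"] is two sixths of 1000.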

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
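The arithmetic behind these numbers can be sketched in a few lines of Python. The page does not state the exact rule used for colouring, so the comparison against the uniform 2.5‰ baseline in the hypothetical helper mark below is an assumption, not the database's documented method.

```python
STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids

n_pairs = len(STANDARD) ** 2                 # 20 * 20 ordered pairs
n_pairs_with_xaa = (len(STANDARD) + 1) ** 2  # 21 * 21 when Xaa is included
expected_permille = 1000.0 / n_pairs         # uniform baseline in per mille


def mark(observed_permille, expected=expected_permille):
    """Assumed colouring rule: above baseline is red, below is blue."""
    if observed_permille > expected:
        return "red"
    if observed_permille < expected:
        return "blue"
    return "none"
```

Under this assumed rule, a value of 8.41‰ would be marked red and a value of 0.421‰ would be marked blue.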

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
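A small Python sketch of the unit conversion, using hypothetical helper names; it only restates the per mille definition.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


# Example with one value of the magnitude shown in the table
fraction = permille_to_fraction(5.467)
percent = permille_to_percent(5.467)
```

Here fraction is approximately 0.005467 and percent is approximately 0.5467.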

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.467 ± 1.132
AlaCys: 1.682 ± 0.749
AlaAsp: 5.046 ± 1.166
AlaGlu: 5.046 ± 1.189
AlaPhe: 4.205 ± 0.932
AlaGly: 3.364 ± 1.232
AlaHis: 1.262 ± 0.475
AlaIle: 3.364 ± 0.741
AlaLys: 2.523 ± 0.813
AlaLeu: 3.785 ± 1.513
AlaMet: 0.421 ± 0.326
AlaAsn: 2.523 ± 0.802
AlaPro: 1.262 ± 0.89
AlaGln: 1.682 ± 0.246
AlaArg: 5.887 ± 1.496
AlaSer: 7.149 ± 0.485
AlaThr: 5.467 ± 1.039
AlaVal: 4.205 ± 1.436
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.262 ± 0.375
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.262 ± 0.691
CysCys: 2.944 ± 1.916
CysAsp: 1.262 ± 0.752
CysGlu: 1.682 ± 0.71
CysPhe: 2.103 ± 1.051
CysGly: 0.0 ± 0.0
CysHis: 0.421 ± 0.367
CysIle: 0.841 ± 0.373
CysLys: 1.682 ± 0.788
CysLeu: 1.682 ± 1.167
CysMet: 0.0 ± 0.0
CysAsn: 0.841 ± 0.567
CysPro: 1.262 ± 0.86
CysGln: 2.103 ± 0.883
CysArg: 0.421 ± 0.477
CysSer: 1.262 ± 0.767
CysThr: 1.262 ± 0.641
CysVal: 1.262 ± 0.568
CysTrp: 1.682 ± 0.706
CysTyr: 1.262 ± 0.571
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.467 ± 1.058
AspCys: 1.262 ± 0.353
AspAsp: 2.944 ± 1.089
AspGlu: 5.467 ± 0.829
AspPhe: 3.364 ± 0.946
AspGly: 3.364 ± 1.065
AspHis: 0.841 ± 0.411
AspIle: 6.308 ± 1.454
AspLys: 1.262 ± 0.599
AspLeu: 4.626 ± 0.976
AspMet: 0.421 ± 0.367
AspAsn: 4.205 ± 0.738
AspPro: 5.046 ± 1.436
AspGln: 0.421 ± 0.317
AspArg: 2.103 ± 1.106
AspSer: 3.364 ± 0.627
AspThr: 4.626 ± 1.007
AspVal: 6.308 ± 1.259
AspTrp: 1.262 ± 0.568
AspTyr: 1.262 ± 0.58
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.887 ± 2.042
GluCys: 1.682 ± 1.071
GluAsp: 4.626 ± 1.312
GluGlu: 7.569 ± 1.389
GluPhe: 2.523 ± 1.381
GluGly: 3.364 ± 0.662
GluHis: 1.262 ± 0.626
GluIle: 2.944 ± 1.118
GluLys: 2.523 ± 1.271
GluLeu: 6.728 ± 0.605
GluMet: 0.421 ± 0.326
GluAsn: 4.626 ± 1.001
GluPro: 2.944 ± 1.046
GluGln: 2.523 ± 0.783
GluArg: 4.626 ± 0.964
GluSer: 3.364 ± 1.342
GluThr: 4.626 ± 1.463
GluVal: 2.103 ± 1.243
GluTrp: 0.421 ± 0.367
GluTyr: 3.364 ± 1.083
GluXaa: 0.0 ± 0.0
Phe
PheAla: 5.046 ± 0.703
PheCys: 2.103 ± 1.339
PheAsp: 2.103 ± 0.67
PheGlu: 3.364 ± 1.689
PhePhe: 2.523 ± 0.819
PheGly: 1.682 ± 0.643
PheHis: 1.262 ± 0.744
PheIle: 2.103 ± 1.091
PheLys: 3.785 ± 1.127
PheLeu: 4.626 ± 1.48
PheMet: 0.841 ± 0.635
PheAsn: 2.944 ± 0.749
PhePro: 1.682 ± 0.794
PheGln: 1.682 ± 0.503
PheArg: 2.103 ± 0.69
PheSer: 2.523 ± 0.644
PheThr: 1.682 ± 0.643
PheVal: 2.944 ± 0.938
PheTrp: 0.841 ± 0.408
PheTyr: 1.682 ± 1.019
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.785 ± 1.366
GlyCys: 0.841 ± 0.499
GlyAsp: 6.308 ± 1.413
GlyGlu: 3.364 ± 1.254
GlyPhe: 2.944 ± 0.422
GlyGly: 3.364 ± 1.575
GlyHis: 0.841 ± 0.734
GlyIle: 2.944 ± 0.547
GlyLys: 3.785 ± 1.471
GlyLeu: 3.785 ± 1.711
GlyMet: 0.841 ± 0.563
GlyAsn: 4.626 ± 1.18
GlyPro: 2.944 ± 0.721
GlyGln: 1.682 ± 1.008
GlyArg: 1.682 ± 0.799
GlySer: 4.626 ± 1.72
GlyThr: 4.205 ± 1.099
GlyVal: 2.944 ± 1.013
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.421 ± 0.438
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.262 ± 0.744
HisCys: 1.262 ± 0.996
HisAsp: 0.421 ± 0.317
HisGlu: 0.841 ± 0.401
HisPhe: 0.841 ± 0.373
HisGly: 1.262 ± 0.631
HisHis: 0.0 ± 0.0
HisIle: 1.682 ± 0.976
HisLys: 1.262 ± 0.571
HisLeu: 2.944 ± 0.973
HisMet: 0.0 ± 0.35
HisAsn: 0.421 ± 0.326
HisPro: 2.103 ± 1.079
HisGln: 0.841 ± 0.602
HisArg: 0.421 ± 0.499
HisSer: 0.421 ± 0.394
HisThr: 0.421 ± 0.317
HisVal: 0.421 ± 0.438
HisTrp: 0.421 ± 0.438
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.103 ± 0.989
IleCys: 0.841 ± 0.499
IleAsp: 1.682 ± 0.734
IleGlu: 3.785 ± 0.689
IlePhe: 1.682 ± 0.643
IleGly: 3.364 ± 0.984
IleHis: 1.262 ± 0.626
IleIle: 1.262 ± 0.785
IleLys: 2.944 ± 1.133
IleLeu: 3.364 ± 0.653
IleMet: 0.0 ± 0.0
IleAsn: 1.682 ± 0.931
IlePro: 3.364 ± 1.167
IleGln: 2.944 ± 0.975
IleArg: 3.785 ± 0.919
IleSer: 2.523 ± 0.828
IleThr: 4.205 ± 1.523
IleVal: 7.149 ± 2.197
IleTrp: 0.421 ± 0.367
IleTyr: 1.682 ± 0.618
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.262 ± 0.568
LysCys: 2.523 ± 0.955
LysAsp: 1.682 ± 0.506
LysGlu: 4.205 ± 1.803
LysPhe: 3.364 ± 1.436
LysGly: 2.944 ± 1.142
LysHis: 0.421 ± 0.326
LysIle: 2.103 ± 0.602
LysLys: 5.046 ± 1.114
LysLeu: 5.467 ± 1.493
LysMet: 1.682 ± 1.042
LysAsn: 4.626 ± 0.859
LysPro: 2.103 ± 0.846
LysGln: 2.103 ± 0.807
LysArg: 4.626 ± 0.972
LysSer: 5.046 ± 1.875
LysThr: 1.682 ± 1.264
LysVal: 1.682 ± 0.513
LysTrp: 1.262 ± 0.829
LysTyr: 2.523 ± 0.726
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.046 ± 1.336
LeuCys: 1.682 ± 0.549
LeuAsp: 7.149 ± 1.547
LeuGlu: 5.046 ± 1.183
LeuPhe: 6.308 ± 1.07
LeuGly: 5.887 ± 1.937
LeuHis: 1.682 ± 0.709
LeuIle: 4.205 ± 0.694
LeuLys: 5.467 ± 1.675
LeuLeu: 7.569 ± 1.176
LeuMet: 0.421 ± 0.346
LeuAsn: 3.364 ± 0.752
LeuPro: 5.887 ± 1.094
LeuGln: 6.308 ± 0.825
LeuArg: 3.364 ± 1.31
LeuSer: 7.569 ± 1.724
LeuThr: 3.364 ± 0.675
LeuVal: 5.887 ± 1.115
LeuTrp: 1.262 ± 0.822
LeuTyr: 5.887 ± 1.109
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.262 ± 0.641
MetCys: 0.0 ± 0.0
MetAsp: 1.682 ± 0.816
MetGlu: 1.262 ± 0.602
MetPhe: 0.841 ± 0.408
MetGly: 1.262 ± 0.705
MetHis: 0.0 ± 0.0
MetIle: 0.841 ± 0.652
MetLys: 0.841 ± 0.601
MetLeu: 0.421 ± 0.477
MetMet: 0.0 ± 0.0
MetAsn: 1.262 ± 0.58
MetPro: 0.841 ± 0.563
MetGln: 0.421 ± 0.367
MetArg: 0.841 ± 0.521
MetSer: 1.682 ± 0.549
MetThr: 0.421 ± 0.326
MetVal: 0.421 ± 0.326
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.205 ± 1.193
AsnCys: 1.262 ± 0.767
AsnAsp: 2.103 ± 0.476
AsnGlu: 2.103 ± 0.626
AsnPhe: 1.682 ± 0.817
AsnGly: 2.103 ± 1.204
AsnHis: 0.0 ± 0.0
AsnIle: 3.785 ± 1.173
AsnLys: 2.944 ± 0.704
AsnLeu: 5.046 ± 0.798
AsnMet: 2.103 ± 0.875
AsnAsn: 4.626 ± 0.782
AsnPro: 2.944 ± 1.278
AsnGln: 2.103 ± 1.018
AsnArg: 3.785 ± 0.471
AsnSer: 3.364 ± 1.234
AsnThr: 2.103 ± 0.617
AsnVal: 3.364 ± 0.629
AsnTrp: 0.841 ± 0.567
AsnTyr: 1.682 ± 0.506
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.467 ± 1.707
ProCys: 0.0 ± 0.0
ProAsp: 5.467 ± 1.788
ProGlu: 3.364 ± 0.579
ProPhe: 0.841 ± 0.567
ProGly: 2.103 ± 0.976
ProHis: 1.262 ± 1.003
ProIle: 1.262 ± 0.952
ProLys: 2.944 ± 0.549
ProLeu: 5.887 ± 0.497
ProMet: 0.421 ± 0.326
ProAsn: 2.944 ± 1.011
ProPro: 6.308 ± 2.317
ProGln: 2.103 ± 0.328
ProArg: 2.523 ± 1.101
ProSer: 4.205 ± 0.936
ProThr: 5.467 ± 2.008
ProVal: 2.103 ± 1.202
ProTrp: 0.421 ± 0.438
ProTyr: 2.103 ± 0.988
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.682 ± 0.534
GlnCys: 0.421 ± 0.477
GlnAsp: 2.523 ± 0.547
GlnGlu: 3.785 ± 0.986
GlnPhe: 0.841 ± 0.634
GlnGly: 2.523 ± 1.319
GlnHis: 1.682 ± 0.851
GlnIle: 2.103 ± 0.654
GlnLys: 1.682 ± 0.821
GlnLeu: 4.205 ± 1.023
GlnMet: 1.682 ± 0.969
GlnAsn: 0.841 ± 0.411
GlnPro: 1.682 ± 1.008
GlnGln: 3.785 ± 0.787
GlnArg: 1.682 ± 0.506
GlnSer: 0.841 ± 0.411
GlnThr: 1.262 ± 0.952
GlnVal: 3.364 ± 0.87
GlnTrp: 1.262 ± 0.599
GlnTyr: 2.103 ± 1.484
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.523 ± 0.447
ArgCys: 2.103 ± 1.081
ArgAsp: 2.523 ± 0.534
ArgGlu: 2.523 ± 0.74
ArgPhe: 2.103 ± 0.729
ArgGly: 3.785 ± 1.184
ArgHis: 2.523 ± 1.031
ArgIle: 0.421 ± 0.317
ArgLys: 4.626 ± 1.083
ArgLeu: 8.41 ± 1.222
ArgMet: 0.421 ± 0.367
ArgAsn: 1.682 ± 1.041
ArgPro: 3.785 ± 1.186
ArgGln: 2.103 ± 0.631
ArgArg: 5.467 ± 2.047
ArgSer: 3.785 ± 0.625
ArgThr: 2.523 ± 1.067
ArgVal: 2.523 ± 0.95
ArgTrp: 0.841 ± 0.876
ArgTyr: 1.262 ± 0.65
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.364 ± 1.715
SerCys: 1.682 ± 0.69
SerAsp: 4.205 ± 1.359
SerGlu: 4.626 ± 0.921
SerPhe: 2.523 ± 0.677
SerGly: 3.785 ± 0.723
SerHis: 0.841 ± 0.408
SerIle: 2.944 ± 0.885
SerLys: 1.682 ± 0.931
SerLeu: 8.41 ± 1.712
SerMet: 2.523 ± 1.224
SerAsn: 3.785 ± 1.394
SerPro: 3.785 ± 0.694
SerGln: 1.262 ± 0.626
SerArg: 2.944 ± 1.128
SerSer: 5.467 ± 1.733
SerThr: 6.308 ± 1.47
SerVal: 3.785 ± 0.703
SerTrp: 0.0 ± 0.0
SerTyr: 3.364 ± 1.068
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.205 ± 0.893
ThrCys: 1.262 ± 0.785
ThrAsp: 3.364 ± 1.387
ThrGlu: 3.785 ± 0.944
ThrPhe: 1.682 ± 0.746
ThrGly: 4.205 ± 1.289
ThrHis: 0.421 ± 0.326
ThrIle: 6.728 ± 2.82
ThrLys: 2.103 ± 1.024
ThrLeu: 6.728 ± 1.229
ThrMet: 0.421 ± 0.326
ThrAsn: 2.523 ± 0.866
ThrPro: 4.205 ± 1.582
ThrGln: 1.262 ± 0.404
ThrArg: 2.944 ± 0.897
ThrSer: 2.103 ± 0.897
ThrThr: 4.626 ± 1.312
ThrVal: 5.467 ± 1.696
ThrTrp: 0.421 ± 0.326
ThrTyr: 1.682 ± 0.799
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.364 ± 0.87
ValCys: 0.841 ± 0.638
ValAsp: 6.308 ± 1.429
ValGlu: 4.205 ± 1.878
ValPhe: 3.364 ± 1.172
ValGly: 5.887 ± 1.442
ValHis: 1.262 ± 0.475
ValIle: 1.682 ± 0.959
ValLys: 4.626 ± 1.661
ValLeu: 5.046 ± 1.858
ValMet: 0.421 ± 0.367
ValAsn: 1.262 ± 0.623
ValPro: 2.944 ± 1.013
ValGln: 3.364 ± 0.967
ValArg: 2.944 ± 1.121
ValSer: 6.728 ± 0.811
ValThr: 3.364 ± 1.006
ValVal: 1.262 ± 0.624
ValTrp: 0.421 ± 0.367
ValTyr: 1.262 ± 0.475
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.841 ± 0.408
TrpCys: 0.421 ± 0.326
TrpAsp: 0.421 ± 0.367
TrpGlu: 0.421 ± 0.499
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.421 ± 0.438
TrpIle: 1.682 ± 1.071
TrpLys: 1.682 ± 1.047
TrpLeu: 1.682 ± 1.043
TrpMet: 0.421 ± 0.477
TrpAsn: 0.421 ± 0.438
TrpPro: 0.421 ± 0.367
TrpGln: 0.0 ± 0.0
TrpArg: 1.682 ± 1.203
TrpSer: 0.0 ± 0.0
TrpThr: 1.262 ± 0.785
TrpVal: 0.841 ± 0.411
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.421 ± 0.326
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.103 ± 0.945
TyrCys: 0.421 ± 0.435
TyrAsp: 2.103 ± 0.642
TyrGlu: 1.682 ± 0.577
TyrPhe: 3.785 ± 0.663
TyrGly: 1.682 ± 0.706
TyrHis: 0.0 ± 0.0
TyrIle: 0.841 ± 0.488
TyrLys: 2.944 ± 0.495
TyrLeu: 2.944 ± 1.083
TyrMet: 0.421 ± 0.326
TyrAsn: 2.944 ± 0.808
TyrPro: 1.682 ± 0.821
TyrGln: 1.262 ± 0.375
TyrArg: 2.103 ± 1.081
TyrSer: 1.262 ± 0.705
TyrThr: 1.262 ± 0.475
TyrVal: 2.523 ± 0.698
TyrTrp: 1.262 ± 0.822
TyrTyr: 2.944 ± 1.499
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2379 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
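A minimal sketch of what bootstrapping at the protein level could look like is given below. It is an assumption-laden illustration, not the database's actual procedure: it resamples whole proteins with replacement 100 times, recomputes the per mille frequency of one dipeptide each time, and reports the population standard deviation of those estimates. The function names, the fixed seed, the one-letter codes and the choice of standard deviation as the error measure are all hypothetical.

```python
import random
from collections import Counter
from statistics import pstdev


def dipeptide_permille(proteins, pair):
    """Per mille frequency of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency over resampled proteomes (assumed: pstdev)."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(dipeptide_permille(sample, pair))
    return pstdev(estimates)


# Toy input (hypothetical sequences, not taken from this proteome)
toy = ["MKKLA", "AKK", "GGKKA"]
err = bootstrap_error(toy, "KK")
```

With only 7 proteins in this proteome, each resample can differ noticeably from the original set, which may explain why some errors in the table are large relative to the values themselves.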

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski