Amino acid dipeptide frequency for Gammapapillomavirus 18

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
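As an illustration of the calculation described above, the following minimal Python sketch counts overlapping dipeptides in a set of protein sequences and compares each frequency with the uniform expectation of 2.5‰. It is not the code used by the database: the function names, the one-letter alphabet with `X` standing in for Xaa, and the choice to normalize by the total number of dipeptides are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code; "X" stands in for Xaa (assumption).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"

# Uniform expectation over the 400 standard dipeptides, in per mille.
UNIFORM_EXPECTED_PERMILLE = 1000 / len(STANDARD) ** 2  # 2.5


def dipeptide_permille(proteins):
    """Return the frequency (in per mille) of all 441 dipeptides.

    Dipeptides are counted within each protein only (never across
    protein boundaries) and normalized by the total dipeptide count.
    """
    counts = Counter()
    for seq in proteins:
        # Map any non-standard or unknown residue to "X".
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_permille):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    if freq_permille > UNIFORM_EXPECTED_PERMILLE:
        return "over"   # would be marked red in the table
    if freq_permille < UNIFORM_EXPECTED_PERMILLE:
        return "under"  # would be marked blue in the table
    return "expected"


if __name__ == "__main__":
    toy_proteins = ["MKLLAAC", "MAACLL"]  # hypothetical sequences, not real data
    freqs = dipeptide_permille(toy_proteins)
    for dipeptide in ("AA", "LL", "WW"):
        print(dipeptide, round(freqs[dipeptide], 3), classify(freqs[dipeptide]))
```

A real proteome would be read from a FASTA file instead of the toy list, and the exact denominator used by the database may differ from the one assumed here.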

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions (for example, 3.101‰ corresponds to 0.003101, or 0.3101%).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.101AlaAla: 3.101 ± 1.117
1.772AlaCys: 1.772 ± 0.903
4.874AlaAsp: 4.874 ± 0.993
3.101AlaGlu: 3.101 ± 0.776
3.101AlaPhe: 3.101 ± 1.064
3.545AlaGly: 3.545 ± 1.55
0.886AlaHis: 0.886 ± 0.409
2.215AlaIle: 2.215 ± 0.757
2.215AlaLys: 2.215 ± 1.214
4.431AlaLeu: 4.431 ± 1.742
0.886AlaMet: 0.886 ± 0.446
1.772AlaAsn: 1.772 ± 0.633
2.215AlaPro: 2.215 ± 0.88
1.329AlaGln: 1.329 ± 0.841
3.101AlaArg: 3.101 ± 0.773
1.772AlaSer: 1.772 ± 0.839
3.101AlaThr: 3.101 ± 0.692
1.329AlaVal: 1.329 ± 0.995
0.0AlaTrp: 0.0 ± 0.0
1.329AlaTyr: 1.329 ± 0.679
0.0AlaXaa: 0.0 ± 0.0
Cys
1.329CysAla: 1.329 ± 0.996
2.215CysCys: 2.215 ± 1.879
0.886CysAsp: 0.886 ± 0.664
1.772CysGlu: 1.772 ± 1.309
1.329CysPhe: 1.329 ± 0.679
0.443CysGly: 0.443 ± 0.59
0.886CysHis: 0.886 ± 0.621
2.658CysIle: 2.658 ± 1.684
2.658CysLys: 2.658 ± 1.075
0.886CysLeu: 0.886 ± 1.151
0.443CysMet: 0.443 ± 0.576
0.443CysAsn: 0.443 ± 0.332
1.772CysPro: 1.772 ± 0.868
1.329CysGln: 1.329 ± 0.675
0.443CysArg: 0.443 ± 0.576
2.658CysSer: 2.658 ± 1.614
3.101CysThr: 3.101 ± 1.509
1.329CysVal: 1.329 ± 1.405
0.886CysTrp: 0.886 ± 0.463
0.886CysTyr: 0.886 ± 1.151
0.0CysXaa: 0.0 ± 0.0
Asp
2.658AspAla: 2.658 ± 0.634
2.658AspCys: 2.658 ± 1.084
2.658AspAsp: 2.658 ± 0.921
2.215AspGlu: 2.215 ± 0.389
4.874AspPhe: 4.874 ± 0.993
3.101AspGly: 3.101 ± 1.64
0.0AspHis: 0.0 ± 0.0
6.646AspIle: 6.646 ± 1.943
2.215AspLys: 2.215 ± 1.534
6.646AspLeu: 6.646 ± 1.532
0.443AspMet: 0.443 ± 0.396
2.658AspAsn: 2.658 ± 1.05
4.874AspPro: 4.874 ± 1.175
1.329AspGln: 1.329 ± 0.763
0.886AspArg: 0.886 ± 0.481
4.431AspSer: 4.431 ± 1.037
5.317AspThr: 5.317 ± 1.726
7.089AspVal: 7.089 ± 1.306
0.886AspTrp: 0.886 ± 0.446
1.329AspTyr: 1.329 ± 0.684
0.0AspXaa: 0.0 ± 0.0
Glu
4.431GluAla: 4.431 ± 1.249
1.772GluCys: 1.772 ± 1.017
3.545GluAsp: 3.545 ± 1.032
8.861GluGlu: 8.861 ± 1.954
1.772GluPhe: 1.772 ± 1.524
3.988GluGly: 3.988 ± 1.38
0.886GluHis: 0.886 ± 0.88
3.988GluIle: 3.988 ± 1.484
3.101GluLys: 3.101 ± 1.385
7.089GluLeu: 7.089 ± 1.834
0.443GluMet: 0.443 ± 0.332
3.988GluAsn: 3.988 ± 0.798
3.101GluPro: 3.101 ± 1.192
3.545GluGln: 3.545 ± 0.716
3.545GluArg: 3.545 ± 1.706
5.317GluSer: 5.317 ± 1.713
3.988GluThr: 3.988 ± 1.163
2.215GluVal: 2.215 ± 0.777
0.886GluTrp: 0.886 ± 0.664
1.772GluTyr: 1.772 ± 1.004
0.0GluXaa: 0.0 ± 0.0
Phe
1.329PheAla: 1.329 ± 0.429
2.658PheCys: 2.658 ± 1.871
2.215PheAsp: 2.215 ± 0.855
3.545PheGlu: 3.545 ± 1.264
2.215PhePhe: 2.215 ± 0.988
3.545PheGly: 3.545 ± 0.65
0.443PheHis: 0.443 ± 0.332
0.886PheIle: 0.886 ± 0.409
3.988PheLys: 3.988 ± 1.618
3.988PheLeu: 3.988 ± 2.127
1.772PheMet: 1.772 ± 0.659
3.101PheAsn: 3.101 ± 1.429
2.658PhePro: 2.658 ± 0.614
1.772PheGln: 1.772 ± 1.309
2.658PheArg: 2.658 ± 0.789
2.215PheSer: 2.215 ± 1.061
2.658PheThr: 2.658 ± 1.259
4.874PheVal: 4.874 ± 0.63
0.886PheTrp: 0.886 ± 0.446
2.658PheTyr: 2.658 ± 0.8
0.0PheXaa: 0.0 ± 0.0
Gly
2.658GlyAla: 2.658 ± 0.634
1.329GlyCys: 1.329 ± 0.429
6.646GlyAsp: 6.646 ± 1.543
3.101GlyGlu: 3.101 ± 1.183
3.988GlyPhe: 3.988 ± 0.888
2.658GlyGly: 2.658 ± 0.692
1.772GlyHis: 1.772 ± 0.24
1.772GlyIle: 1.772 ± 1.149
3.988GlyLys: 3.988 ± 1.097
3.101GlyLeu: 3.101 ± 1.598
0.886GlyMet: 0.886 ± 0.793
3.101GlyAsn: 3.101 ± 1.166
1.329GlyPro: 1.329 ± 0.776
1.772GlyGln: 1.772 ± 0.654
3.101GlyArg: 3.101 ± 1.889
4.431GlySer: 4.431 ± 1.331
3.988GlyThr: 3.988 ± 1.531
2.658GlyVal: 2.658 ± 0.987
0.0GlyTrp: 0.0 ± 0.0
0.443GlyTyr: 0.443 ± 0.396
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
0.443HisAsp: 0.443 ± 0.59
0.443HisGlu: 0.443 ± 0.332
0.443HisPhe: 0.443 ± 0.332
0.886HisGly: 0.886 ± 0.621
0.0HisHis: 0.0 ± 0.0
0.886HisIle: 0.886 ± 0.409
0.886HisLys: 0.886 ± 0.446
0.443HisLeu: 0.443 ± 0.381
0.443HisMet: 0.443 ± 0.332
0.443HisAsn: 0.443 ± 0.396
1.772HisPro: 1.772 ± 0.593
0.0HisGln: 0.0 ± 0.0
0.886HisArg: 0.886 ± 0.88
1.329HisSer: 1.329 ± 0.732
0.886HisThr: 0.886 ± 0.793
0.443HisVal: 0.443 ± 0.381
0.886HisTrp: 0.886 ± 0.88
1.329HisTyr: 1.329 ± 0.361
0.0HisXaa: 0.0 ± 0.0
Ile
2.658IleAla: 2.658 ± 1.339
0.443IleCys: 0.443 ± 0.576
4.874IleAsp: 4.874 ± 0.444
5.76IleGlu: 5.76 ± 1.676
1.772IlePhe: 1.772 ± 0.905
2.658IleGly: 2.658 ± 0.486
0.886IleHis: 0.886 ± 0.621
4.431IleIle: 4.431 ± 1.147
2.658IleLys: 2.658 ± 0.563
2.658IleLeu: 2.658 ± 1.118
0.886IleMet: 0.886 ± 0.499
2.658IleAsn: 2.658 ± 0.858
3.101IlePro: 3.101 ± 2.205
4.874IleGln: 4.874 ± 0.591
1.329IleArg: 1.329 ± 0.632
4.874IleSer: 4.874 ± 1.328
4.431IleThr: 4.431 ± 1.493
2.215IleVal: 2.215 ± 0.675
0.443IleTrp: 0.443 ± 0.332
2.658IleTyr: 2.658 ± 1.333
0.0IleXaa: 0.0 ± 0.0
Lys
2.658LysAla: 2.658 ± 1.05
2.215LysCys: 2.215 ± 0.866
1.772LysAsp: 1.772 ± 1.117
2.658LysGlu: 2.658 ± 0.858
1.772LysPhe: 1.772 ± 0.56
3.101LysGly: 3.101 ± 0.943
0.443LysHis: 0.443 ± 0.332
4.431LysIle: 4.431 ± 1.735
3.101LysLys: 3.101 ± 1.409
3.101LysLeu: 3.101 ± 0.792
0.443LysMet: 0.443 ± 0.396
2.658LysAsn: 2.658 ± 1.063
2.215LysPro: 2.215 ± 1.193
2.215LysGln: 2.215 ± 1.032
6.203LysArg: 6.203 ± 0.854
3.988LysSer: 3.988 ± 1.097
3.988LysThr: 3.988 ± 1.481
3.988LysVal: 3.988 ± 0.736
0.443LysTrp: 0.443 ± 0.332
2.215LysTyr: 2.215 ± 1.301
0.0LysXaa: 0.0 ± 0.0
Leu
6.203LeuAla: 6.203 ± 1.948
0.886LeuCys: 0.886 ± 0.722
4.431LeuAsp: 4.431 ± 1.399
6.646LeuGlu: 6.646 ± 1.186
4.431LeuPhe: 4.431 ± 1.746
5.76LeuGly: 5.76 ± 1.866
1.329LeuHis: 1.329 ± 0.71
2.215LeuIle: 2.215 ± 0.947
3.988LeuLys: 3.988 ± 1.597
8.861LeuLeu: 8.861 ± 0.914
1.329LeuMet: 1.329 ± 0.787
4.874LeuAsn: 4.874 ± 1.377
6.646LeuPro: 6.646 ± 1.556
5.76LeuGln: 5.76 ± 1.771
3.101LeuArg: 3.101 ± 0.93
5.317LeuSer: 5.317 ± 1.562
6.203LeuThr: 6.203 ± 0.914
3.988LeuVal: 3.988 ± 1.591
1.772LeuTrp: 1.772 ± 1.117
5.317LeuTyr: 5.317 ± 1.472
0.0LeuXaa: 0.0 ± 0.0
Met
0.443MetAla: 0.443 ± 0.576
0.443MetCys: 0.443 ± 0.396
1.329MetAsp: 1.329 ± 1.189
0.886MetGlu: 0.886 ± 0.588
1.772MetPhe: 1.772 ± 0.893
0.443MetGly: 0.443 ± 0.332
0.0MetHis: 0.0 ± 0.0
0.443MetIle: 0.443 ± 0.332
0.443MetLys: 0.443 ± 0.332
1.329MetLeu: 1.329 ± 0.679
0.0MetMet: 0.0 ± 0.0
1.772MetAsn: 1.772 ± 0.746
0.0MetPro: 0.0 ± 0.0
0.0MetGln: 0.0 ± 0.0
1.329MetArg: 1.329 ± 0.802
2.215MetSer: 2.215 ± 0.423
0.886MetThr: 0.886 ± 0.446
1.329MetVal: 1.329 ± 0.429
0.0MetTrp: 0.0 ± 0.0
0.443MetTyr: 0.443 ± 0.396
0.0MetXaa: 0.0 ± 0.0
Asn
0.443AsnAla: 0.443 ± 0.332
1.329AsnCys: 1.329 ± 0.762
3.101AsnAsp: 3.101 ± 0.868
3.545AsnGlu: 3.545 ± 0.745
3.545AsnPhe: 3.545 ± 1.353
1.329AsnGly: 1.329 ± 0.832
0.0AsnHis: 0.0 ± 0.0
2.658AsnIle: 2.658 ± 1.291
4.874AsnLys: 4.874 ± 1.367
7.532AsnLeu: 7.532 ± 1.678
0.886AsnMet: 0.886 ± 0.446
2.215AsnAsn: 2.215 ± 0.849
4.874AsnPro: 4.874 ± 1.32
3.101AsnGln: 3.101 ± 1.684
2.215AsnArg: 2.215 ± 0.8
3.545AsnSer: 3.545 ± 2.228
2.658AsnThr: 2.658 ± 1.014
2.658AsnVal: 2.658 ± 0.722
0.443AsnTrp: 0.443 ± 0.44
1.329AsnTyr: 1.329 ± 0.429
0.0AsnXaa: 0.0 ± 0.0
Pro
2.658ProAla: 2.658 ± 1.228
1.329ProCys: 1.329 ± 0.968
4.431ProAsp: 4.431 ± 1.335
4.874ProGlu: 4.874 ± 2.177
1.329ProPhe: 1.329 ± 0.924
1.772ProGly: 1.772 ± 0.767
0.443ProHis: 0.443 ± 0.44
3.545ProIle: 3.545 ± 1.567
3.101ProLys: 3.101 ± 0.877
6.203ProLeu: 6.203 ± 1.754
0.443ProMet: 0.443 ± 0.332
4.431ProAsn: 4.431 ± 1.394
7.532ProPro: 7.532 ± 1.576
2.658ProGln: 2.658 ± 0.533
3.545ProArg: 3.545 ± 1.301
4.431ProSer: 4.431 ± 1.326
3.545ProThr: 3.545 ± 1.439
3.101ProVal: 3.101 ± 1.269
0.886ProTrp: 0.886 ± 0.481
2.658ProTyr: 2.658 ± 1.012
0.0ProXaa: 0.0 ± 0.0
Gln
1.329GlnAla: 1.329 ± 0.472
1.329GlnCys: 1.329 ± 1.208
1.772GlnAsp: 1.772 ± 1.117
2.658GlnGlu: 2.658 ± 0.858
3.988GlnPhe: 3.988 ± 1.508
2.215GlnGly: 2.215 ± 0.777
0.443GlnHis: 0.443 ± 0.396
4.431GlnIle: 4.431 ± 0.993
1.772GlnLys: 1.772 ± 0.62
3.101GlnLeu: 3.101 ± 1.345
2.215GlnMet: 2.215 ± 1.101
2.215GlnAsn: 2.215 ± 0.947
1.772GlnPro: 1.772 ± 1.337
2.658GlnGln: 2.658 ± 0.944
4.431GlnArg: 4.431 ± 1.399
2.215GlnSer: 2.215 ± 0.887
2.215GlnThr: 2.215 ± 1.175
1.329GlnVal: 1.329 ± 0.653
0.443GlnTrp: 0.443 ± 0.332
1.772GlnTyr: 1.772 ± 1.117
0.0GlnXaa: 0.0 ± 0.0
Arg
3.545ArgAla: 3.545 ± 1.567
1.329ArgCys: 1.329 ± 0.613
3.101ArgAsp: 3.101 ± 0.995
3.101ArgGlu: 3.101 ± 1.159
1.329ArgPhe: 1.329 ± 0.776
4.431ArgGly: 4.431 ± 2.001
2.215ArgHis: 2.215 ± 0.759
1.772ArgIle: 1.772 ± 0.767
2.215ArgLys: 2.215 ± 1.054
7.532ArgLeu: 7.532 ± 1.271
1.329ArgMet: 1.329 ± 0.766
3.988ArgAsn: 3.988 ± 1.263
2.658ArgPro: 2.658 ± 1.336
1.772ArgGln: 1.772 ± 0.598
5.76ArgArg: 5.76 ± 2.204
2.658ArgSer: 2.658 ± 1.41
3.545ArgThr: 3.545 ± 0.693
2.658ArgVal: 2.658 ± 0.828
0.443ArgTrp: 0.443 ± 0.576
2.658ArgTyr: 2.658 ± 1.21
0.0ArgXaa: 0.0 ± 0.0
Ser
1.329SerAla: 1.329 ± 0.691
1.772SerCys: 1.772 ± 1.077
3.988SerAsp: 3.988 ± 0.888
3.988SerGlu: 3.988 ± 1.008
2.215SerPhe: 2.215 ± 0.61
3.545SerGly: 3.545 ± 0.591
0.886SerHis: 0.886 ± 0.42
1.329SerIle: 1.329 ± 0.684
3.988SerLys: 3.988 ± 1.595
8.418SerLeu: 8.418 ± 1.181
0.443SerMet: 0.443 ± 0.332
3.101SerAsn: 3.101 ± 1.392
6.646SerPro: 6.646 ± 1.964
3.988SerGln: 3.988 ± 1.795
3.988SerArg: 3.988 ± 2.183
5.76SerSer: 5.76 ± 2.843
5.317SerThr: 5.317 ± 1.801
5.76SerVal: 5.76 ± 2.526
0.0SerTrp: 0.0 ± 0.0
1.329SerTyr: 1.329 ± 0.361
0.0SerXaa: 0.0 ± 0.0
Thr
2.215ThrAla: 2.215 ± 1.283
2.215ThrCys: 2.215 ± 1.147
3.545ThrAsp: 3.545 ± 1.333
4.431ThrGlu: 4.431 ± 1.062
2.215ThrPhe: 2.215 ± 0.389
5.317ThrGly: 5.317 ± 1.295
0.0ThrHis: 0.0 ± 0.0
7.532ThrIle: 7.532 ± 1.747
1.329ThrLys: 1.329 ± 0.763
4.431ThrLeu: 4.431 ± 1.372
0.0ThrMet: 0.0 ± 0.0
3.101ThrAsn: 3.101 ± 0.84
5.317ThrPro: 5.317 ± 2.142
0.443ThrGln: 0.443 ± 0.332
4.431ThrArg: 4.431 ± 0.84
3.545ThrSer: 3.545 ± 1.301
3.988ThrThr: 3.988 ± 1.555
8.861ThrVal: 8.861 ± 2.985
0.443ThrTrp: 0.443 ± 0.332
3.101ThrTyr: 3.101 ± 0.992
0.0ThrXaa: 0.0 ± 0.0
Val
4.874ValAla: 4.874 ± 1.277
1.772ValCys: 1.772 ± 0.768
6.203ValAsp: 6.203 ± 1.137
4.431ValGlu: 4.431 ± 1.008
3.101ValPhe: 3.101 ± 1.155
2.658ValGly: 2.658 ± 0.486
0.886ValHis: 0.886 ± 0.42
2.215ValIle: 2.215 ± 1.054
1.772ValLys: 1.772 ± 0.722
4.874ValLeu: 4.874 ± 1.216
1.329ValMet: 1.329 ± 0.429
4.431ValAsn: 4.431 ± 1.318
3.101ValPro: 3.101 ± 1.264
3.101ValGln: 3.101 ± 1.553
4.431ValArg: 4.431 ± 1.667
3.988ValSer: 3.988 ± 1.278
2.658ValThr: 2.658 ± 0.486
4.431ValVal: 4.431 ± 1.331
0.886ValTrp: 0.886 ± 0.481
1.772ValTyr: 1.772 ± 0.56
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.886TrpAsp: 0.886 ± 0.793
0.443TrpGlu: 0.443 ± 0.396
0.886TrpPhe: 0.886 ± 0.463
0.443TrpGly: 0.443 ± 0.396
0.443TrpHis: 0.443 ± 0.44
0.886TrpIle: 0.886 ± 0.664
2.215TrpLys: 2.215 ± 0.866
1.329TrpLeu: 1.329 ± 0.776
0.0TrpMet: 0.0 ± 0.0
0.443TrpAsn: 0.443 ± 0.44
0.443TrpPro: 0.443 ± 0.396
0.443TrpGln: 0.443 ± 0.332
0.886TrpArg: 0.886 ± 0.672
0.0TrpSer: 0.0 ± 0.0
1.329TrpThr: 1.329 ± 0.832
0.443TrpVal: 0.443 ± 0.332
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.215TyrAla: 2.215 ± 0.779
0.886TyrCys: 0.886 ± 0.588
2.215TyrAsp: 2.215 ± 0.929
1.772TyrGlu: 1.772 ± 1.018
4.431TyrPhe: 4.431 ± 0.617
0.886TyrGly: 0.886 ± 0.409
0.0TyrHis: 0.0 ± 0.0
1.329TyrIle: 1.329 ± 0.429
3.101TyrLys: 3.101 ± 0.497
3.101TyrLeu: 3.101 ± 1.847
0.443TyrMet: 0.443 ± 0.396
0.886TyrAsn: 0.886 ± 0.446
0.886TyrPro: 0.886 ± 0.481
2.215TyrGln: 2.215 ± 0.389
1.772TyrArg: 1.772 ± 0.903
3.101TyrSer: 3.101 ± 1.319
2.658TyrThr: 2.658 ± 0.634
2.215TyrVal: 2.215 ± 0.849
0.886TyrTrp: 0.886 ± 0.793
3.101TyrTyr: 3.101 ± 1.162
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2258 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
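The protein-level bootstrap mentioned in the note can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequencies are recomputed for each replicate, and the spread across replicates is reported as the error. This is only a hedged sketch of the general technique. The function names, the fixed random seed, and the use of the sample standard deviation as the error measure are assumptions, not details confirmed by the source.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins):
    """Per mille frequency of each observed dipeptide (overlapping, per protein)."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000 * n / total for dp, n in counts.items()}


def bootstrap_error(proteins, n_replicates=100, seed=0):
    """Estimate the error of each dipeptide frequency by resampling proteins.

    Each replicate draws len(proteins) proteins with replacement, so the
    resampling unit is the whole protein, not the individual residue.
    Requires n_replicates >= 2 for the standard deviation to be defined.
    """
    rng = random.Random(seed)  # fixed seed for reproducibility (assumption)
    replicates = [
        dipeptide_permille(rng.choices(proteins, k=len(proteins)))
        for _ in range(n_replicates)
    ]
    observed = set().union(*replicates)
    return {
        # A dipeptide absent from a replicate has frequency 0 there.
        dp: statistics.stdev([rep.get(dp, 0.0) for rep in replicates])
        for dp in observed
    }


if __name__ == "__main__":
    toy_proteins = ["MKLLAAC", "MAACLL", "LLKKAA"]  # hypothetical sequences
    errors = bootstrap_error(toy_proteins)
    for dp in sorted(errors)[:5]:
        print(dp, round(errors[dp], 3))
```

With only a handful of proteins, as in this proteome, the replicates differ considerably from one another, which is consistent with errors that are large relative to the frequencies themselves.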

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski