Amino acid dipeptide frequency for Gammapapillomavirus 19

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally likely, each dipeptide should be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red in the table, and underrepresented ones are marked in blue.
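The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: the function names, the mapping of non-standard residues to X, and the choice to count overlapping pairs and normalize by the number of pairs are all assumptions. The smallest non-zero value in the table, 0.429‰, equals 1000/2331, which suggests the page may normalize by the total number of amino acids instead, so small differences from the tabulated values are to be expected.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes

# Expected frequency if all 400 dipeptides were equally likely: 2.5 per mille (0.25%)
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins.

    Assumption: overlapping adjacent pairs are counted, and the result is
    normalized by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Pairs never span the boundary between two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(value, expected=UNIFORM_PER_MILLE):
    """Label a per-mille value relative to the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Toy input (hypothetical sequences, not from this proteome).
toy_freqs = dipeptide_per_mille(["MKKLA", "AAB"])
```

With the values from the table, `classify(4.292)` labels AlaAla as overrepresented and `classify(0.429)` labels AlaCys as underrepresented relative to the uniform 2.5‰.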

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
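As a small worked example of the unit conversion, the helpers below turn a per-mille value into a fraction or a percentage. The function names are illustrative only.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (1 per mille = 0.1%)."""
    return value * 0.1


# AspLeu is listed at 9.442 per mille in the table.
asp_leu_fraction = per_mille_to_fraction(9.442)
asp_leu_percent = per_mille_to_percent(9.442)

# The uniform expectation of 2.5 per mille corresponds to 0.25%.
uniform_percent = per_mille_to_percent(2.5)
```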

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.292 ± 1.231
AlaCys: 0.429 ± 0.54
AlaAsp: 2.146 ± 0.432
AlaGlu: 5.15 ± 1.859
AlaPhe: 3.863 ± 0.926
AlaGly: 2.575 ± 1.326
AlaHis: 0.858 ± 0.451
AlaIle: 2.575 ± 1.227
AlaLys: 3.433 ± 1.337
AlaLeu: 6.438 ± 2.023
AlaMet: 0.429 ± 0.34
AlaAsn: 2.146 ± 1.161
AlaPro: 3.433 ± 1.073
AlaGln: 2.146 ± 0.984
AlaArg: 3.004 ± 0.554
AlaSer: 4.721 ± 0.896
AlaThr: 4.292 ± 1.668
AlaVal: 3.863 ± 1.832
AlaTrp: 1.288 ± 1.019
AlaTyr: 1.288 ± 0.716
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.858 ± 0.427
CysCys: 1.717 ± 0.895
CysAsp: 1.717 ± 1.043
CysGlu: 1.288 ± 1.621
CysPhe: 1.717 ± 0.786
CysGly: 1.288 ± 0.666
CysHis: 0.0 ± 0.0
CysIle: 1.717 ± 1.771
CysLys: 0.858 ± 0.427
CysLeu: 1.288 ± 1.048
CysMet: 0.0 ± 0.0
CysAsn: 0.858 ± 0.68
CysPro: 1.717 ± 0.812
CysGln: 0.0 ± 0.0
CysArg: 0.858 ± 1.068
CysSer: 1.717 ± 0.722
CysThr: 2.146 ± 0.849
CysVal: 1.717 ± 1.22
CysTrp: 0.858 ± 0.95
CysTyr: 0.429 ± 0.534
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.863 ± 1.525
AspCys: 2.575 ± 1.285
AspAsp: 5.579 ± 0.993
AspGlu: 3.433 ± 1.53
AspPhe: 2.146 ± 1.005
AspGly: 2.575 ± 1.217
AspHis: 0.429 ± 0.34
AspIle: 4.721 ± 2.611
AspLys: 1.717 ± 0.635
AspLeu: 9.442 ± 1.637
AspMet: 0.429 ± 0.363
AspAsn: 3.863 ± 0.657
AspPro: 6.009 ± 1.628
AspGln: 0.858 ± 0.662
AspArg: 3.004 ± 0.809
AspSer: 4.292 ± 0.924
AspThr: 5.579 ± 0.936
AspVal: 6.867 ± 2.763
AspTrp: 0.429 ± 0.363
AspTyr: 1.717 ± 0.73
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.004 ± 1.577
GluCys: 1.288 ± 0.681
GluAsp: 4.292 ± 1.718
GluGlu: 7.296 ± 2.203
GluPhe: 1.717 ± 0.841
GluGly: 1.717 ± 0.255
GluHis: 0.858 ± 0.463
GluIle: 4.292 ± 2.102
GluLys: 2.575 ± 0.947
GluLeu: 5.15 ± 1.833
GluMet: 0.858 ± 0.572
GluAsn: 4.721 ± 0.881
GluPro: 4.292 ± 1.071
GluGln: 3.863 ± 1.196
GluArg: 3.863 ± 2.284
GluSer: 3.004 ± 2.245
GluThr: 2.146 ± 0.413
GluVal: 3.433 ± 1.36
GluTrp: 1.717 ± 0.984
GluTyr: 1.717 ± 0.807
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.004 ± 0.589
PheCys: 2.146 ± 1.15
PheAsp: 3.863 ± 0.891
PheGlu: 3.863 ± 1.457
PhePhe: 3.004 ± 1.539
PheGly: 1.717 ± 0.569
PheHis: 1.288 ± 1.066
PheIle: 3.863 ± 0.833
PheLys: 3.004 ± 1.286
PheLeu: 3.863 ± 1.455
PheMet: 1.288 ± 0.687
PheAsn: 2.146 ± 0.804
PhePro: 2.146 ± 0.633
PheGln: 0.858 ± 0.679
PheArg: 1.717 ± 0.664
PheSer: 2.146 ± 0.852
PheThr: 3.004 ± 1.054
PheVal: 3.004 ± 0.941
PheTrp: 0.858 ± 0.726
PheTyr: 1.717 ± 0.664
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.146 ± 0.432
GlyCys: 0.429 ± 0.363
GlyAsp: 6.438 ± 1.399
GlyGlu: 3.004 ± 0.917
GlyPhe: 1.717 ± 0.723
GlyGly: 3.004 ± 0.97
GlyHis: 2.146 ± 0.854
GlyIle: 3.004 ± 0.809
GlyLys: 3.004 ± 1.061
GlyLeu: 4.292 ± 1.414
GlyMet: 0.0 ± 0.0
GlyAsn: 2.146 ± 0.838
GlyPro: 3.433 ± 1.01
GlyGln: 1.717 ± 0.758
GlyArg: 3.863 ± 1.025
GlySer: 3.004 ± 0.792
GlyThr: 4.292 ± 1.223
GlyVal: 3.004 ± 1.142
GlyTrp: 0.429 ± 0.475
GlyTyr: 1.717 ± 1.41
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.717 ± 0.656
HisCys: 0.429 ± 0.34
HisAsp: 0.0 ± 0.0
HisGlu: 0.0 ± 0.0
HisPhe: 1.288 ± 0.693
HisGly: 0.429 ± 0.54
HisHis: 0.0 ± 0.0
HisIle: 1.717 ± 0.599
HisLys: 0.0 ± 0.0
HisLeu: 2.146 ± 1.109
HisMet: 0.0 ± 0.0
HisAsn: 0.429 ± 0.363
HisPro: 3.004 ± 1.824
HisGln: 0.429 ± 0.34
HisArg: 1.288 ± 0.668
HisSer: 1.288 ± 0.767
HisThr: 0.0 ± 0.0
HisVal: 0.858 ± 0.942
HisTrp: 0.429 ± 0.475
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.004 ± 0.743
IleCys: 0.429 ± 0.54
IleAsp: 5.579 ± 2.439
IleGlu: 5.579 ± 0.991
IlePhe: 2.575 ± 0.85
IleGly: 3.004 ± 1.455
IleHis: 1.288 ± 0.917
IleIle: 2.146 ± 0.96
IleLys: 2.575 ± 0.961
IleLeu: 3.863 ± 1.889
IleMet: 0.0 ± 0.0
IleAsn: 2.146 ± 1.443
IlePro: 4.721 ± 1.581
IleGln: 2.575 ± 1.059
IleArg: 0.858 ± 0.662
IleSer: 4.721 ± 1.096
IleThr: 3.004 ± 0.786
IleVal: 3.863 ± 0.833
IleTrp: 1.288 ± 1.074
IleTyr: 3.863 ± 1.698
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.004 ± 1.111
LysCys: 1.288 ± 0.643
LysAsp: 2.146 ± 0.849
LysGlu: 3.004 ± 0.955
LysPhe: 2.146 ± 1.053
LysGly: 2.575 ± 1.505
LysHis: 0.429 ± 0.34
LysIle: 2.146 ± 1.016
LysLys: 2.575 ± 1.098
LysLeu: 2.575 ± 1.479
LysMet: 2.146 ± 0.732
LysAsn: 2.575 ± 0.939
LysPro: 2.575 ± 1.087
LysGln: 3.433 ± 1.94
LysArg: 5.579 ± 1.515
LysSer: 5.15 ± 2.305
LysThr: 2.146 ± 0.413
LysVal: 3.004 ± 1.267
LysTrp: 0.858 ± 0.451
LysTyr: 3.433 ± 1.27
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.292 ± 1.361
LeuCys: 3.004 ± 1.915
LeuAsp: 5.579 ± 0.834
LeuGlu: 3.433 ± 0.655
LeuPhe: 3.004 ± 1.388
LeuGly: 6.009 ± 2.293
LeuHis: 2.146 ± 1.423
LeuIle: 3.004 ± 0.969
LeuLys: 5.15 ± 1.625
LeuLeu: 6.438 ± 3.123
LeuMet: 3.863 ± 1.995
LeuAsn: 3.004 ± 1.036
LeuPro: 2.575 ± 0.873
LeuGln: 8.155 ± 2.134
LeuArg: 1.717 ± 1.052
LeuSer: 5.579 ± 1.755
LeuThr: 5.15 ± 1.252
LeuVal: 5.579 ± 1.051
LeuTrp: 0.429 ± 0.363
LeuTyr: 5.15 ± 0.739
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.288 ± 0.716
MetCys: 0.429 ± 0.34
MetAsp: 1.717 ± 1.105
MetGlu: 1.288 ± 0.643
MetPhe: 1.717 ± 1.06
MetGly: 1.717 ± 0.73
MetHis: 0.0 ± 0.0
MetIle: 1.288 ± 1.118
MetLys: 0.429 ± 0.363
MetLeu: 1.288 ± 0.857
MetMet: 0.858 ± 1.348
MetAsn: 0.858 ± 0.427
MetPro: 0.429 ± 0.34
MetGln: 0.429 ± 0.372
MetArg: 1.288 ± 1.019
MetSer: 1.288 ± 0.681
MetThr: 0.858 ± 0.463
MetVal: 1.717 ± 1.105
MetTrp: 0.0 ± 0.0
MetTyr: 0.858 ± 0.705
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.433 ± 1.12
AsnCys: 0.429 ± 0.34
AsnAsp: 2.575 ± 1.4
AsnGlu: 1.717 ± 0.843
AsnPhe: 3.433 ± 1.976
AsnGly: 3.433 ± 1.562
AsnHis: 0.0 ± 0.0
AsnIle: 3.004 ± 0.471
AsnLys: 2.575 ± 1.326
AsnLeu: 3.004 ± 1.019
AsnMet: 1.288 ± 0.857
AsnAsn: 3.433 ± 0.549
AsnPro: 3.004 ± 1.598
AsnGln: 1.288 ± 0.716
AsnArg: 2.575 ± 0.464
AsnSer: 3.433 ± 1.27
AsnThr: 4.292 ± 1.727
AsnVal: 3.004 ± 1.152
AsnTrp: 0.429 ± 0.34
AsnTyr: 0.858 ± 0.591
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.15 ± 1.732
ProCys: 2.146 ± 1.232
ProAsp: 6.438 ± 1.926
ProGlu: 3.863 ± 0.821
ProPhe: 1.717 ± 0.931
ProGly: 2.146 ± 0.999
ProHis: 0.429 ± 0.372
ProIle: 5.579 ± 1.723
ProLys: 3.863 ± 0.597
ProLeu: 3.004 ± 0.863
ProMet: 0.429 ± 0.372
ProAsn: 2.146 ± 0.761
ProPro: 6.438 ± 1.858
ProGln: 1.288 ± 0.421
ProArg: 2.146 ± 1.053
ProSer: 4.292 ± 1.332
ProThr: 4.292 ± 1.104
ProVal: 2.146 ± 1.611
ProTrp: 0.429 ± 0.363
ProTyr: 2.575 ± 1.31
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.575 ± 0.751
GlnCys: 0.858 ± 0.492
GlnAsp: 1.717 ± 0.93
GlnGlu: 2.575 ± 0.978
GlnPhe: 3.863 ± 1.114
GlnGly: 2.575 ± 1.31
GlnHis: 1.288 ± 0.762
GlnIle: 2.575 ± 1.572
GlnLys: 2.146 ± 1.292
GlnLeu: 3.863 ± 0.581
GlnMet: 3.004 ± 1.333
GlnAsn: 1.717 ± 0.656
GlnPro: 2.146 ± 0.899
GlnGln: 3.433 ± 1.27
GlnArg: 0.858 ± 0.518
GlnSer: 2.146 ± 1.278
GlnThr: 1.717 ± 0.793
GlnVal: 3.004 ± 1.036
GlnTrp: 1.288 ± 0.906
GlnTyr: 2.575 ± 1.09
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.004 ± 1.055
ArgCys: 1.717 ± 0.722
ArgAsp: 3.863 ± 2.28
ArgGlu: 3.433 ± 1.623
ArgPhe: 3.433 ± 0.605
ArgGly: 3.433 ± 1.162
ArgHis: 1.288 ± 0.759
ArgIle: 0.429 ± 0.372
ArgLys: 5.15 ± 1.143
ArgLeu: 5.15 ± 1.36
ArgMet: 1.288 ± 1.019
ArgAsn: 2.575 ± 1.336
ArgPro: 1.717 ± 1.489
ArgGln: 2.146 ± 0.852
ArgArg: 4.721 ± 3.027
ArgSer: 4.721 ± 2.092
ArgThr: 1.288 ± 0.926
ArgVal: 5.15 ± 1.268
ArgTrp: 0.429 ± 0.475
ArgTyr: 1.288 ± 0.437
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.863 ± 1.234
SerCys: 0.858 ± 0.427
SerAsp: 4.292 ± 1.16
SerGlu: 3.863 ± 1.135
SerPhe: 1.717 ± 1.098
SerGly: 6.438 ± 0.724
SerHis: 0.858 ± 0.463
SerIle: 3.004 ± 1.228
SerLys: 3.004 ± 1.313
SerLeu: 7.725 ± 2.022
SerMet: 0.858 ± 0.451
SerAsn: 2.575 ± 1.178
SerPro: 3.433 ± 1.881
SerGln: 3.863 ± 0.954
SerArg: 6.009 ± 2.089
SerSer: 5.579 ± 3.355
SerThr: 5.15 ± 1.757
SerVal: 3.863 ± 1.168
SerTrp: 1.288 ± 0.47
SerTyr: 2.146 ± 1.008
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.15 ± 1.065
ThrCys: 0.858 ± 0.42
ThrAsp: 5.15 ± 1.418
ThrGlu: 4.292 ± 1.483
ThrPhe: 2.146 ± 0.759
ThrGly: 1.288 ± 1.117
ThrHis: 1.288 ± 0.693
ThrIle: 4.292 ± 1.604
ThrLys: 2.146 ± 1.407
ThrLeu: 3.004 ± 1.055
ThrMet: 1.288 ± 0.904
ThrAsn: 3.433 ± 1.708
ThrPro: 4.292 ± 0.848
ThrGln: 1.288 ± 0.865
ThrArg: 3.004 ± 1.039
ThrSer: 5.579 ± 1.852
ThrThr: 4.292 ± 1.09
ThrVal: 4.721 ± 1.167
ThrTrp: 1.288 ± 0.666
ThrTyr: 2.146 ± 0.695
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.717 ± 0.771
ValCys: 0.858 ± 0.567
ValAsp: 4.292 ± 1.717
ValGlu: 2.575 ± 1.17
ValPhe: 3.433 ± 1.352
ValGly: 4.292 ± 0.915
ValHis: 0.429 ± 0.372
ValIle: 4.721 ± 1.042
ValLys: 4.721 ± 1.262
ValLeu: 6.009 ± 1.032
ValMet: 0.429 ± 0.363
ValAsn: 2.575 ± 0.825
ValPro: 3.433 ± 1.154
ValGln: 3.433 ± 1.288
ValArg: 5.579 ± 1.2
ValSer: 5.579 ± 0.921
ValThr: 5.579 ± 1.088
ValVal: 3.004 ± 0.694
ValTrp: 1.288 ± 0.926
ValTyr: 1.288 ± 0.729
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.288 ± 0.421
TrpCys: 0.0 ± 0.0
TrpAsp: 2.146 ± 0.984
TrpGlu: 0.429 ± 0.475
TrpPhe: 1.288 ± 0.392
TrpGly: 0.858 ± 0.726
TrpHis: 0.0 ± 0.0
TrpIle: 0.858 ± 0.492
TrpLys: 0.0 ± 0.0
TrpLeu: 1.288 ± 0.716
TrpMet: 0.0 ± 0.0
TrpAsn: 0.429 ± 0.475
TrpPro: 0.429 ± 0.363
TrpGln: 1.717 ± 0.985
TrpArg: 2.146 ± 1.253
TrpSer: 0.429 ± 0.475
TrpThr: 1.288 ± 0.847
TrpVal: 0.858 ± 0.492
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.717 ± 0.255
TyrCys: 0.858 ± 1.068
TyrAsp: 0.0 ± 0.0
TyrGlu: 1.717 ± 0.535
TyrPhe: 2.575 ± 1.131
TyrGly: 1.717 ± 0.781
TyrHis: 0.429 ± 0.34
TyrIle: 2.146 ± 1.507
TyrLys: 3.863 ± 0.698
TyrLeu: 3.433 ± 0.814
TyrMet: 0.858 ± 0.68
TyrAsn: 3.433 ± 0.856
TyrPro: 1.288 ± 0.437
TyrGln: 3.004 ± 0.631
TyrArg: 2.575 ± 0.766
TyrSer: 1.717 ± 1.086
TyrThr: 0.429 ± 0.372
TyrVal: 2.575 ± 0.939
TyrTrp: 0.429 ± 0.363
TyrTyr: 1.288 ± 1.021
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2331 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
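One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of those estimates is reported as the error. This is a hedged illustration only. The function names, the use of the sample standard deviation as the error, the fixed seed, and the normalization by pair count are assumptions, since the page does not state these details.

```python
import random
import statistics


def count_pair(seq, pair):
    """Count overlapping occurrences of a two-letter dipeptide in seq."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == pair)


def bootstrap_dipeptide(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of the per-mille frequency of `pair`.

    Each replicate draws len(proteins) whole proteins with replacement.
    Assumption: the reported error is the standard deviation over replicates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        sample = [rng.choice(proteins) for _ in proteins]
        total_pairs = sum(len(s) - 1 for s in sample if len(s) > 1)
        hits = sum(count_pair(s, pair) for s in sample)
        estimates.append(1000.0 * hits / total_pairs if total_pairs else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical one-letter sequences, not taken from this proteome.
toy_proteins = ["MKKLAAD", "AADLLK", "MDLDLS"]
toy_mean, toy_sd = bootstrap_dipeptide(toy_proteins, "DL")
```

A dipeptide that never occurs yields the same estimate in every replicate, which is consistent with the `0.0 ± 0.0` entries in the table.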

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski