Amino acid dipepetide frequency for Gammapapillomavirus 9

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.051AlaAla: 5.051 ± 2.364
0.421AlaCys: 0.421 ± 0.501
3.788AlaAsp: 3.788 ± 1.063
3.788AlaGlu: 3.788 ± 1.48
2.946AlaPhe: 2.946 ± 0.782
2.104AlaGly: 2.104 ± 0.828
0.842AlaHis: 0.842 ± 0.578
2.946AlaIle: 2.946 ± 1.515
3.367AlaLys: 3.367 ± 1.568
4.63AlaLeu: 4.63 ± 1.593
0.421AlaMet: 0.421 ± 0.39
3.788AlaAsn: 3.788 ± 1.551
3.367AlaPro: 3.367 ± 1.239
3.788AlaGln: 3.788 ± 0.537
2.104AlaArg: 2.104 ± 0.798
5.051AlaSer: 5.051 ± 0.724
3.788AlaThr: 3.788 ± 1.324
1.684AlaVal: 1.684 ± 0.639
1.263AlaTrp: 1.263 ± 0.652
2.104AlaTyr: 2.104 ± 0.892
0.0AlaXaa: 0.0 ± 0.0
Cys
0.842CysAla: 0.842 ± 0.574
1.684CysCys: 1.684 ± 1.223
1.684CysAsp: 1.684 ± 0.833
1.684CysGlu: 1.684 ± 1.223
1.684CysPhe: 1.684 ± 0.995
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
2.525CysIle: 2.525 ± 1.288
2.104CysLys: 2.104 ± 1.0
2.104CysLeu: 2.104 ± 2.505
0.0CysMet: 0.0 ± 0.0
1.684CysAsn: 1.684 ± 0.876
1.263CysPro: 1.263 ± 0.842
2.104CysGln: 2.104 ± 1.064
0.842CysArg: 0.842 ± 0.611
1.263CysSer: 1.263 ± 0.877
1.263CysThr: 1.263 ± 0.85
2.104CysVal: 2.104 ± 0.769
0.842CysTrp: 0.842 ± 0.484
0.842CysTyr: 0.842 ± 0.803
0.0CysXaa: 0.0 ± 0.0
Asp
5.471AspAla: 5.471 ± 1.579
2.104AspCys: 2.104 ± 1.223
5.471AspAsp: 5.471 ± 1.676
5.051AspGlu: 5.051 ± 1.58
2.946AspPhe: 2.946 ± 1.094
3.788AspGly: 3.788 ± 1.126
0.421AspHis: 0.421 ± 0.337
2.525AspIle: 2.525 ± 1.542
2.104AspLys: 2.104 ± 0.82
6.313AspLeu: 6.313 ± 2.069
0.842AspMet: 0.842 ± 0.472
2.104AspAsn: 2.104 ± 0.452
5.051AspPro: 5.051 ± 1.186
2.946AspGln: 2.946 ± 1.412
4.209AspArg: 4.209 ± 1.483
4.63AspSer: 4.63 ± 1.308
5.051AspThr: 5.051 ± 1.772
5.892AspVal: 5.892 ± 2.578
0.842AspTrp: 0.842 ± 0.387
2.946AspTyr: 2.946 ± 0.906
0.0AspXaa: 0.0 ± 0.0
Glu
2.946GluAla: 2.946 ± 0.782
1.684GluCys: 1.684 ± 1.346
2.946GluAsp: 2.946 ± 0.632
6.313GluGlu: 6.313 ± 2.42
2.525GluPhe: 2.525 ± 0.601
2.525GluGly: 2.525 ± 1.542
2.104GluHis: 2.104 ± 1.266
2.525GluIle: 2.525 ± 0.769
2.525GluLys: 2.525 ± 1.118
7.997GluLeu: 7.997 ± 1.878
0.842GluMet: 0.842 ± 0.385
2.946GluAsn: 2.946 ± 1.119
1.684GluPro: 1.684 ± 0.671
2.525GluGln: 2.525 ± 1.625
4.63GluArg: 4.63 ± 1.426
5.471GluSer: 5.471 ± 1.93
2.946GluThr: 2.946 ± 0.488
4.209GluVal: 4.209 ± 1.296
0.842GluTrp: 0.842 ± 0.484
1.263GluTyr: 1.263 ± 0.877
0.0GluXaa: 0.0 ± 0.0
Phe
2.104PheAla: 2.104 ± 0.835
1.263PheCys: 1.263 ± 0.85
4.63PheAsp: 4.63 ± 0.949
3.788PheGlu: 3.788 ± 1.257
2.104PhePhe: 2.104 ± 1.012
2.946PheGly: 2.946 ± 0.847
1.263PheHis: 1.263 ± 0.729
0.842PheIle: 0.842 ± 0.387
2.946PheLys: 2.946 ± 1.453
3.367PheLeu: 3.367 ± 1.334
0.421PheMet: 0.421 ± 0.55
2.104PheAsn: 2.104 ± 1.265
1.263PhePro: 1.263 ± 0.844
1.684PheGln: 1.684 ± 0.576
2.104PheArg: 2.104 ± 0.872
3.367PheSer: 3.367 ± 0.679
1.684PheThr: 1.684 ± 0.602
4.63PheVal: 4.63 ± 1.629
0.842PheTrp: 0.842 ± 0.387
2.104PheTyr: 2.104 ± 0.388
0.0PheXaa: 0.0 ± 0.0
Gly
2.104GlyAla: 2.104 ± 0.913
0.842GlyCys: 0.842 ± 0.634
4.209GlyAsp: 4.209 ± 1.506
5.051GlyGlu: 5.051 ± 1.528
0.421GlyPhe: 0.421 ± 0.443
2.525GlyGly: 2.525 ± 1.194
0.842GlyHis: 0.842 ± 0.691
2.525GlyIle: 2.525 ± 0.747
3.788GlyLys: 3.788 ± 0.998
6.313GlyLeu: 6.313 ± 1.898
0.421GlyMet: 0.421 ± 0.39
3.367GlyAsn: 3.367 ± 0.771
2.946GlyPro: 2.946 ± 0.976
2.104GlyGln: 2.104 ± 0.803
3.367GlyArg: 3.367 ± 1.591
3.788GlySer: 3.788 ± 0.537
2.946GlyThr: 2.946 ± 1.709
2.525GlyVal: 2.525 ± 1.587
0.0GlyTrp: 0.0 ± 0.0
0.842GlyTyr: 0.842 ± 0.514
0.0GlyXaa: 0.0 ± 0.0
His
2.104HisAla: 2.104 ± 0.692
0.842HisCys: 0.842 ± 0.669
0.0HisAsp: 0.0 ± 0.0
0.842HisGlu: 0.842 ± 0.385
0.421HisPhe: 0.421 ± 0.39
0.421HisGly: 0.421 ± 0.39
0.421HisHis: 0.421 ± 0.443
0.842HisIle: 0.842 ± 0.472
1.263HisLys: 1.263 ± 0.667
2.946HisLeu: 2.946 ± 1.895
0.421HisMet: 0.421 ± 0.346
0.842HisAsn: 0.842 ± 0.472
2.525HisPro: 2.525 ± 1.261
0.421HisGln: 0.421 ± 0.337
1.684HisArg: 1.684 ± 0.555
1.684HisSer: 1.684 ± 0.783
1.684HisThr: 1.684 ± 0.671
1.263HisVal: 1.263 ± 0.843
0.421HisTrp: 0.421 ± 0.346
0.842HisTyr: 0.842 ± 0.484
0.0HisXaa: 0.0 ± 0.0
Ile
2.525IleAla: 2.525 ± 0.836
1.263IleCys: 1.263 ± 1.162
8.418IleAsp: 8.418 ± 1.935
2.104IleGlu: 2.104 ± 1.195
2.104IlePhe: 2.104 ± 1.141
2.104IleGly: 2.104 ± 1.009
1.684IleHis: 1.684 ± 1.158
1.684IleIle: 1.684 ± 0.524
1.684IleLys: 1.684 ± 0.946
4.209IleLeu: 4.209 ± 1.506
0.842IleMet: 0.842 ± 0.581
1.684IleAsn: 1.684 ± 1.158
3.788IlePro: 3.788 ± 1.492
1.684IleGln: 1.684 ± 0.596
1.684IleArg: 1.684 ± 0.768
2.525IleSer: 2.525 ± 0.655
2.104IleThr: 2.104 ± 0.801
2.525IleVal: 2.525 ± 0.841
0.842IleTrp: 0.842 ± 0.387
2.104IleTyr: 2.104 ± 0.684
0.0IleXaa: 0.0 ± 0.0
Lys
2.104LysAla: 2.104 ± 1.167
4.209LysCys: 4.209 ± 2.651
2.525LysAsp: 2.525 ± 1.113
3.788LysGlu: 3.788 ± 1.444
2.525LysPhe: 2.525 ± 0.886
1.684LysGly: 1.684 ± 0.769
2.104LysHis: 2.104 ± 1.053
2.104LysIle: 2.104 ± 1.367
2.104LysLys: 2.104 ± 0.373
3.367LysLeu: 3.367 ± 1.514
1.263LysMet: 1.263 ± 0.706
2.525LysAsn: 2.525 ± 0.63
1.263LysPro: 1.263 ± 0.603
2.104LysGln: 2.104 ± 1.184
6.313LysArg: 6.313 ± 0.804
4.63LysSer: 4.63 ± 1.157
1.263LysThr: 1.263 ± 0.638
6.313LysVal: 6.313 ± 0.877
0.842LysTrp: 0.842 ± 0.468
1.684LysTyr: 1.684 ± 0.602
0.0LysXaa: 0.0 ± 0.0
Leu
4.63LeuAla: 4.63 ± 1.125
2.104LeuCys: 2.104 ± 1.653
6.734LeuAsp: 6.734 ± 1.027
5.471LeuGlu: 5.471 ± 0.896
4.63LeuPhe: 4.63 ± 2.194
4.63LeuGly: 4.63 ± 1.941
2.946LeuHis: 2.946 ± 1.342
3.788LeuIle: 3.788 ± 0.561
4.63LeuLys: 4.63 ± 1.571
10.101LeuLeu: 10.101 ± 1.834
2.104LeuMet: 2.104 ± 0.824
1.263LeuAsn: 1.263 ± 0.417
5.051LeuPro: 5.051 ± 1.002
7.997LeuGln: 7.997 ± 1.527
5.892LeuArg: 5.892 ± 1.407
7.576LeuSer: 7.576 ± 2.402
5.892LeuThr: 5.892 ± 0.819
3.788LeuVal: 3.788 ± 2.093
0.0LeuTrp: 0.0 ± 0.0
4.63LeuTyr: 4.63 ± 1.221
0.0LeuXaa: 0.0 ± 0.0
Met
1.263MetAla: 1.263 ± 0.638
1.263MetCys: 1.263 ± 0.638
0.421MetAsp: 0.421 ± 0.346
0.421MetGlu: 0.421 ± 0.501
0.421MetPhe: 0.421 ± 0.346
1.684MetGly: 1.684 ± 0.671
0.0MetHis: 0.0 ± 0.0
0.421MetIle: 0.421 ± 0.39
0.842MetLys: 0.842 ± 0.673
0.421MetLeu: 0.421 ± 0.39
0.0MetMet: 0.0 ± 0.3
1.263MetAsn: 1.263 ± 0.652
1.263MetPro: 1.263 ± 0.646
0.421MetGln: 0.421 ± 0.346
1.263MetArg: 1.263 ± 0.889
1.263MetSer: 1.263 ± 0.417
0.421MetThr: 0.421 ± 0.346
0.842MetVal: 0.842 ± 0.673
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
0.842AsnAla: 0.842 ± 0.385
2.104AsnCys: 2.104 ± 1.093
0.421AsnAsp: 0.421 ± 0.497
2.104AsnGlu: 2.104 ± 0.874
2.946AsnPhe: 2.946 ± 1.013
3.367AsnGly: 3.367 ± 1.59
0.0AsnHis: 0.0 ± 0.0
2.946AsnIle: 2.946 ± 0.51
2.525AsnLys: 2.525 ± 1.231
4.63AsnLeu: 4.63 ± 1.295
0.842AsnMet: 0.842 ± 0.673
2.946AsnAsn: 2.946 ± 0.961
3.367AsnPro: 3.367 ± 0.679
1.684AsnGln: 1.684 ± 0.814
2.946AsnArg: 2.946 ± 0.474
3.788AsnSer: 3.788 ± 0.995
4.209AsnThr: 4.209 ± 0.9
4.63AsnVal: 4.63 ± 0.761
1.263AsnTrp: 1.263 ± 0.649
1.263AsnTyr: 1.263 ± 0.43
0.0AsnXaa: 0.0 ± 0.0
Pro
4.63ProAla: 4.63 ± 1.683
1.263ProCys: 1.263 ± 0.611
2.946ProAsp: 2.946 ± 0.892
2.525ProGlu: 2.525 ± 1.176
3.788ProPhe: 3.788 ± 1.428
2.104ProGly: 2.104 ± 1.183
0.0ProHis: 0.0 ± 0.0
2.946ProIle: 2.946 ± 1.355
2.104ProLys: 2.104 ± 0.452
6.734ProLeu: 6.734 ± 2.02
0.421ProMet: 0.421 ± 0.346
2.946ProAsn: 2.946 ± 1.12
9.259ProPro: 9.259 ± 2.467
1.263ProGln: 1.263 ± 0.595
3.367ProArg: 3.367 ± 1.366
5.051ProSer: 5.051 ± 1.448
5.892ProThr: 5.892 ± 1.853
3.367ProVal: 3.367 ± 1.404
0.0ProTrp: 0.0 ± 0.0
2.525ProTyr: 2.525 ± 1.022
0.0ProXaa: 0.0 ± 0.0
Gln
2.525GlnAla: 2.525 ± 1.377
2.104GlnCys: 2.104 ± 1.643
2.946GlnAsp: 2.946 ± 1.105
2.104GlnGlu: 2.104 ± 0.659
2.946GlnPhe: 2.946 ± 1.082
2.104GlnGly: 2.104 ± 1.015
0.842GlnHis: 0.842 ± 0.387
1.684GlnIle: 1.684 ± 0.274
2.525GlnLys: 2.525 ± 1.197
5.471GlnLeu: 5.471 ± 1.528
0.842GlnMet: 0.842 ± 0.691
1.684GlnAsn: 1.684 ± 0.969
2.946GlnPro: 2.946 ± 1.0
0.421GlnGln: 0.421 ± 0.497
0.421GlnArg: 0.421 ± 0.39
1.684GlnSer: 1.684 ± 0.784
2.525GlnThr: 2.525 ± 1.015
3.788GlnVal: 3.788 ± 1.288
0.842GlnTrp: 0.842 ± 0.673
2.104GlnTyr: 2.104 ± 0.828
0.0GlnXaa: 0.0 ± 0.0
Arg
1.684ArgAla: 1.684 ± 0.816
1.263ArgCys: 1.263 ± 0.649
2.946ArgAsp: 2.946 ± 1.456
2.525ArgGlu: 2.525 ± 1.021
0.842ArgPhe: 0.842 ± 0.78
4.63ArgGly: 4.63 ± 1.126
2.525ArgHis: 2.525 ± 1.052
1.263ArgIle: 1.263 ± 0.889
6.313ArgLys: 6.313 ± 1.539
7.155ArgLeu: 7.155 ± 1.068
0.421ArgMet: 0.421 ± 0.346
2.946ArgAsn: 2.946 ± 1.315
4.209ArgPro: 4.209 ± 1.625
2.104ArgGln: 2.104 ± 1.265
3.367ArgArg: 3.367 ± 1.525
5.892ArgSer: 5.892 ± 1.694
2.946ArgThr: 2.946 ± 1.027
2.946ArgVal: 2.946 ± 0.597
0.842ArgTrp: 0.842 ± 0.887
1.684ArgTyr: 1.684 ± 0.769
0.0ArgXaa: 0.0 ± 0.0
Ser
6.734SerAla: 6.734 ± 1.56
0.842SerCys: 0.842 ± 0.472
4.63SerAsp: 4.63 ± 0.791
4.209SerGlu: 4.209 ± 0.98
2.946SerPhe: 2.946 ± 0.805
2.946SerGly: 2.946 ± 0.976
2.104SerHis: 2.104 ± 1.513
5.471SerIle: 5.471 ± 1.663
3.788SerLys: 3.788 ± 1.763
6.734SerLeu: 6.734 ± 1.147
0.842SerMet: 0.842 ± 0.527
5.051SerAsn: 5.051 ± 1.57
5.892SerPro: 5.892 ± 1.506
1.263SerGln: 1.263 ± 0.706
5.892SerArg: 5.892 ± 1.212
4.63SerSer: 4.63 ± 1.078
3.367SerThr: 3.367 ± 1.508
4.209SerVal: 4.209 ± 0.803
0.421SerTrp: 0.421 ± 0.443
2.104SerTyr: 2.104 ± 0.892
0.0SerXaa: 0.0 ± 0.0
Thr
2.946ThrAla: 2.946 ± 0.782
0.421ThrCys: 0.421 ± 0.337
6.734ThrAsp: 6.734 ± 0.524
4.63ThrGlu: 4.63 ± 1.192
1.263ThrPhe: 1.263 ± 0.638
4.63ThrGly: 4.63 ± 1.035
1.263ThrHis: 1.263 ± 0.729
2.104ThrIle: 2.104 ± 0.929
3.367ThrLys: 3.367 ± 0.672
3.788ThrLeu: 3.788 ± 0.911
0.842ThrMet: 0.842 ± 0.691
3.367ThrAsn: 3.367 ± 1.284
2.946ThrPro: 2.946 ± 2.301
2.946ThrGln: 2.946 ± 1.023
2.525ThrArg: 2.525 ± 0.888
6.313ThrSer: 6.313 ± 1.315
2.946ThrThr: 2.946 ± 1.831
5.051ThrVal: 5.051 ± 1.328
0.0ThrTrp: 0.0 ± 0.0
0.842ThrTyr: 0.842 ± 0.472
0.0ThrXaa: 0.0 ± 0.0
Val
5.051ValAla: 5.051 ± 1.238
0.0ValCys: 0.0 ± 0.0
6.313ValAsp: 6.313 ± 1.594
2.946ValGlu: 2.946 ± 0.632
4.209ValPhe: 4.209 ± 1.461
4.209ValGly: 4.209 ± 1.442
0.842ValHis: 0.842 ± 0.385
5.051ValIle: 5.051 ± 2.302
3.788ValLys: 3.788 ± 0.663
3.367ValLeu: 3.367 ± 0.735
1.263ValMet: 1.263 ± 0.659
3.788ValAsn: 3.788 ± 0.812
4.209ValPro: 4.209 ± 1.753
3.367ValGln: 3.367 ± 0.898
3.367ValArg: 3.367 ± 1.559
3.788ValSer: 3.788 ± 1.028
3.788ValThr: 3.788 ± 2.013
2.525ValVal: 2.525 ± 0.846
2.104ValTrp: 2.104 ± 0.816
0.842ValTyr: 0.842 ± 0.468
0.0ValXaa: 0.0 ± 0.0
Trp
0.421TrpAla: 0.421 ± 0.337
0.0TrpCys: 0.0 ± 0.0
0.842TrpAsp: 0.842 ± 0.691
0.421TrpGlu: 0.421 ± 0.346
1.263TrpPhe: 1.263 ± 0.417
0.421TrpGly: 0.421 ± 0.346
0.421TrpHis: 0.421 ± 0.443
0.842TrpIle: 0.842 ± 0.484
1.684TrpLys: 1.684 ± 0.943
0.842TrpLeu: 0.842 ± 0.387
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.421TrpPro: 0.421 ± 0.346
0.421TrpGln: 0.421 ± 0.337
1.684TrpArg: 1.684 ± 1.222
0.842TrpSer: 0.842 ± 0.887
0.842TrpThr: 0.842 ± 0.468
1.263TrpVal: 1.263 ± 0.706
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.263TyrAla: 1.263 ± 0.652
0.842TyrCys: 0.842 ± 0.611
2.104TyrAsp: 2.104 ± 0.998
1.684TyrGlu: 1.684 ± 1.292
2.525TyrPhe: 2.525 ± 0.757
2.104TyrGly: 2.104 ± 0.774
1.263TyrHis: 1.263 ± 0.695
2.525TyrIle: 2.525 ± 0.886
1.263TyrLys: 1.263 ± 0.368
3.367TyrLeu: 3.367 ± 1.212
0.842TyrMet: 0.842 ± 0.484
2.525TyrAsn: 2.525 ± 0.431
0.421TyrPro: 0.421 ± 0.39
1.263TyrGln: 1.263 ± 0.417
0.842TyrArg: 0.842 ± 0.468
0.842TyrSer: 0.842 ± 0.514
3.367TyrThr: 3.367 ± 0.746
1.263TyrVal: 1.263 ± 0.638
0.421TyrTrp: 0.421 ± 0.346
1.684TyrTyr: 1.684 ± 0.803
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2377 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski