Amino acid dipepetide frequency for Gammapapillomavirus 10

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.209AlaAla: 6.209 ± 1.944
0.414AlaCys: 0.414 ± 0.491
1.242AlaAsp: 1.242 ± 0.34
4.967AlaGlu: 4.967 ± 1.422
2.897AlaPhe: 2.897 ± 0.905
1.242AlaGly: 1.242 ± 0.698
1.656AlaHis: 1.656 ± 0.847
2.897AlaIle: 2.897 ± 0.498
4.967AlaLys: 4.967 ± 0.764
5.381AlaLeu: 5.381 ± 0.515
0.414AlaMet: 0.414 ± 0.373
0.414AlaAsn: 0.414 ± 0.517
2.897AlaPro: 2.897 ± 0.872
0.828AlaGln: 0.828 ± 0.431
4.553AlaArg: 4.553 ± 1.452
3.311AlaSer: 3.311 ± 1.163
4.967AlaThr: 4.967 ± 1.438
3.311AlaVal: 3.311 ± 0.541
0.414AlaTrp: 0.414 ± 0.323
2.483AlaTyr: 2.483 ± 0.676
0.0AlaXaa: 0.0 ± 0.0
Cys
0.828CysAla: 0.828 ± 0.647
0.828CysCys: 0.828 ± 0.518
0.828CysAsp: 0.828 ± 0.647
1.242CysGlu: 1.242 ± 0.71
0.414CysPhe: 0.414 ± 0.424
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
2.07CysIle: 2.07 ± 0.996
2.483CysLys: 2.483 ± 1.214
2.07CysLeu: 2.07 ± 1.658
0.0CysMet: 0.0 ± 0.0
1.242CysAsn: 1.242 ± 0.71
0.828CysPro: 0.828 ± 0.423
0.0CysGln: 0.0 ± 0.0
2.483CysArg: 2.483 ± 1.189
3.725CysSer: 3.725 ± 1.812
1.656CysThr: 1.656 ± 0.945
0.414CysVal: 0.414 ± 0.373
0.414CysTrp: 0.414 ± 0.424
0.828CysTyr: 0.828 ± 0.574
0.0CysXaa: 0.0 ± 0.0
Asp
2.897AspAla: 2.897 ± 1.45
2.483AspCys: 2.483 ± 1.208
3.725AspAsp: 3.725 ± 0.89
4.553AspGlu: 4.553 ± 1.643
4.139AspPhe: 4.139 ± 0.667
1.242AspGly: 1.242 ± 0.459
0.828AspHis: 0.828 ± 0.431
4.553AspIle: 4.553 ± 1.146
1.242AspLys: 1.242 ± 1.273
6.209AspLeu: 6.209 ± 1.747
0.828AspMet: 0.828 ± 0.431
2.897AspAsn: 2.897 ± 0.632
5.381AspPro: 5.381 ± 1.05
2.483AspGln: 2.483 ± 1.178
2.483AspArg: 2.483 ± 0.509
5.381AspSer: 5.381 ± 1.961
2.897AspThr: 2.897 ± 0.828
7.036AspVal: 7.036 ± 2.542
1.656AspTrp: 1.656 ± 0.638
1.656AspTyr: 1.656 ± 0.725
0.0AspXaa: 0.0 ± 0.0
Glu
4.553GluAla: 4.553 ± 0.712
0.414GluCys: 0.414 ± 0.323
7.036GluAsp: 7.036 ± 2.105
9.52GluGlu: 9.52 ± 1.769
2.07GluPhe: 2.07 ± 1.306
4.139GluGly: 4.139 ± 1.328
2.07GluHis: 2.07 ± 0.836
1.656GluIle: 1.656 ± 0.761
2.483GluLys: 2.483 ± 0.782
7.45GluLeu: 7.45 ± 1.909
1.656GluMet: 1.656 ± 0.971
2.897GluAsn: 2.897 ± 0.892
1.656GluPro: 1.656 ± 0.586
2.897GluGln: 2.897 ± 0.615
2.07GluArg: 2.07 ± 1.179
2.483GluSer: 2.483 ± 1.067
4.139GluThr: 4.139 ± 1.052
4.553GluVal: 4.553 ± 1.286
0.414GluTrp: 0.414 ± 0.424
1.656GluTyr: 1.656 ± 0.638
0.0GluXaa: 0.0 ± 0.0
Phe
2.07PheAla: 2.07 ± 0.835
0.414PheCys: 0.414 ± 0.491
3.311PheAsp: 3.311 ± 1.017
1.656PheGlu: 1.656 ± 0.908
1.242PhePhe: 1.242 ± 0.738
2.483PheGly: 2.483 ± 0.859
2.07PheHis: 2.07 ± 0.893
2.483PheIle: 2.483 ± 0.789
3.725PheLys: 3.725 ± 1.214
3.311PheLeu: 3.311 ± 1.102
1.242PheMet: 1.242 ± 0.664
2.07PheAsn: 2.07 ± 0.946
1.242PhePro: 1.242 ± 0.721
2.07PheGln: 2.07 ± 0.402
0.414PheArg: 0.414 ± 0.424
2.483PheSer: 2.483 ± 0.892
1.242PheThr: 1.242 ± 0.34
5.381PheVal: 5.381 ± 0.922
0.828PheTrp: 0.828 ± 0.431
2.483PheTyr: 2.483 ± 0.968
0.0PheXaa: 0.0 ± 0.0
Gly
0.828GlyAla: 0.828 ± 0.746
1.242GlyCys: 1.242 ± 0.459
4.967GlyAsp: 4.967 ± 1.82
3.725GlyGlu: 3.725 ± 1.244
0.414GlyPhe: 0.414 ± 0.373
2.483GlyGly: 2.483 ± 1.302
2.483GlyHis: 2.483 ± 1.046
3.725GlyIle: 3.725 ± 0.793
3.311GlyLys: 3.311 ± 1.511
4.553GlyLeu: 4.553 ± 1.38
0.0GlyMet: 0.0 ± 0.0
4.139GlyAsn: 4.139 ± 1.341
4.139GlyPro: 4.139 ± 1.41
1.656GlyGln: 1.656 ± 1.252
3.725GlyArg: 3.725 ± 2.088
6.209GlySer: 6.209 ± 2.314
4.139GlyThr: 4.139 ± 1.545
3.725GlyVal: 3.725 ± 0.609
0.828GlyTrp: 0.828 ± 0.474
1.656GlyTyr: 1.656 ± 0.949
0.0GlyXaa: 0.0 ± 0.0
His
0.414HisAla: 0.414 ± 0.373
0.414HisCys: 0.414 ± 0.323
0.0HisAsp: 0.0 ± 0.0
0.828HisGlu: 0.828 ± 0.474
1.656HisPhe: 1.656 ± 0.586
0.0HisGly: 0.0 ± 0.0
0.828HisHis: 0.828 ± 0.849
0.828HisIle: 0.828 ± 0.423
1.656HisLys: 1.656 ± 0.949
2.07HisLeu: 2.07 ± 1.218
0.414HisMet: 0.414 ± 0.323
1.656HisAsn: 1.656 ± 0.862
1.656HisPro: 1.656 ± 0.847
0.828HisGln: 0.828 ± 0.379
0.414HisArg: 0.414 ± 0.323
0.828HisSer: 0.828 ± 0.746
2.07HisThr: 2.07 ± 0.489
2.07HisVal: 2.07 ± 1.026
0.828HisTrp: 0.828 ± 0.473
0.414HisTyr: 0.414 ± 0.373
0.0HisXaa: 0.0 ± 0.0
Ile
1.656IleAla: 1.656 ± 1.057
0.0IleCys: 0.0 ± 0.0
3.725IleAsp: 3.725 ± 1.688
7.45IleGlu: 7.45 ± 0.833
0.414IlePhe: 0.414 ± 0.323
2.483IleGly: 2.483 ± 0.719
0.828IleHis: 0.828 ± 0.423
2.07IleIle: 2.07 ± 1.52
0.828IleLys: 0.828 ± 0.518
3.725IleLeu: 3.725 ± 0.673
0.414IleMet: 0.414 ± 0.603
0.414IleAsn: 0.414 ± 0.373
3.725IlePro: 3.725 ± 1.364
1.242IleGln: 1.242 ± 0.459
2.07IleArg: 2.07 ± 1.004
4.553IleSer: 4.553 ± 1.443
2.483IleThr: 2.483 ± 1.144
3.725IleVal: 3.725 ± 1.642
0.828IleTrp: 0.828 ± 0.518
1.656IleTyr: 1.656 ± 0.276
0.0IleXaa: 0.0 ± 0.0
Lys
3.725LysAla: 3.725 ± 0.969
2.483LysCys: 2.483 ± 1.23
3.311LysAsp: 3.311 ± 1.391
2.483LysGlu: 2.483 ± 1.122
2.483LysPhe: 2.483 ± 0.859
2.483LysGly: 2.483 ± 1.236
2.07LysHis: 2.07 ± 0.673
1.242LysIle: 1.242 ± 0.546
2.07LysLys: 2.07 ± 0.64
7.45LysLeu: 7.45 ± 2.001
0.828LysMet: 0.828 ± 0.431
2.483LysAsn: 2.483 ± 1.085
2.07LysPro: 2.07 ± 0.402
2.07LysGln: 2.07 ± 0.529
5.381LysArg: 5.381 ± 1.203
3.725LysSer: 3.725 ± 1.206
4.139LysThr: 4.139 ± 1.807
4.553LysVal: 4.553 ± 1.033
1.242LysTrp: 1.242 ± 0.488
2.07LysTyr: 2.07 ± 1.15
0.0LysXaa: 0.0 ± 0.0
Leu
7.036LeuAla: 7.036 ± 1.192
1.242LeuCys: 1.242 ± 0.546
4.967LeuAsp: 4.967 ± 0.716
6.623LeuGlu: 6.623 ± 1.658
4.139LeuPhe: 4.139 ± 0.799
7.45LeuGly: 7.45 ± 4.198
0.414LeuHis: 0.414 ± 0.424
4.139LeuIle: 4.139 ± 1.243
6.623LeuLys: 6.623 ± 1.035
12.831LeuLeu: 12.831 ± 2.233
2.07LeuMet: 2.07 ± 1.06
3.311LeuAsn: 3.311 ± 1.08
6.209LeuPro: 6.209 ± 2.255
5.795LeuGln: 5.795 ± 1.029
3.725LeuArg: 3.725 ± 0.976
9.106LeuSer: 9.106 ± 3.164
4.967LeuThr: 4.967 ± 0.969
4.553LeuVal: 4.553 ± 1.377
0.828LeuTrp: 0.828 ± 0.746
5.381LeuTyr: 5.381 ± 1.659
0.0LeuXaa: 0.0 ± 0.0
Met
0.414MetAla: 0.414 ± 0.373
1.242MetCys: 1.242 ± 0.97
1.656MetAsp: 1.656 ± 0.69
0.414MetGlu: 0.414 ± 0.424
0.414MetPhe: 0.414 ± 0.373
1.242MetGly: 1.242 ± 0.738
0.0MetHis: 0.0 ± 0.0
0.414MetIle: 0.414 ± 0.517
1.242MetLys: 1.242 ± 0.692
0.414MetLeu: 0.414 ± 0.424
0.414MetMet: 0.414 ± 0.424
1.656MetAsn: 1.656 ± 0.605
1.242MetPro: 1.242 ± 0.553
1.242MetGln: 1.242 ± 0.664
0.828MetArg: 0.828 ± 0.518
0.828MetSer: 0.828 ± 0.647
1.242MetThr: 1.242 ± 0.459
0.414MetVal: 0.414 ± 0.323
0.0MetTrp: 0.0 ± 0.0
0.414MetTyr: 0.414 ± 0.373
0.0MetXaa: 0.0 ± 0.0
Asn
2.483AsnAla: 2.483 ± 0.919
1.242AsnCys: 1.242 ± 0.597
1.242AsnAsp: 1.242 ± 0.84
1.656AsnGlu: 1.656 ± 0.85
2.483AsnPhe: 2.483 ± 1.435
4.553AsnGly: 4.553 ± 1.28
0.828AsnHis: 0.828 ± 0.379
2.07AsnIle: 2.07 ± 1.602
4.553AsnLys: 4.553 ± 1.245
2.483AsnLeu: 2.483 ± 0.637
0.0AsnMet: 0.0 ± 0.0
4.139AsnAsn: 4.139 ± 1.341
3.311AsnPro: 3.311 ± 2.117
2.07AsnGln: 2.07 ± 1.009
2.483AsnArg: 2.483 ± 0.65
2.483AsnSer: 2.483 ± 0.865
3.311AsnThr: 3.311 ± 1.616
2.483AsnVal: 2.483 ± 0.789
1.242AsnTrp: 1.242 ± 0.97
0.828AsnTyr: 0.828 ± 0.647
0.0AsnXaa: 0.0 ± 0.0
Pro
4.967ProAla: 4.967 ± 2.008
0.828ProCys: 0.828 ± 0.574
6.209ProAsp: 6.209 ± 1.857
3.311ProGlu: 3.311 ± 0.588
2.483ProPhe: 2.483 ± 0.7
2.897ProGly: 2.897 ± 1.041
0.0ProHis: 0.0 ± 0.0
1.242ProIle: 1.242 ± 0.72
3.725ProLys: 3.725 ± 0.689
6.209ProLeu: 6.209 ± 1.95
0.414ProMet: 0.414 ± 0.491
2.483ProAsn: 2.483 ± 0.752
5.795ProPro: 5.795 ± 2.329
3.311ProGln: 3.311 ± 1.581
3.311ProArg: 3.311 ± 1.065
5.795ProSer: 5.795 ± 1.817
2.897ProThr: 2.897 ± 1.415
2.07ProVal: 2.07 ± 0.402
0.414ProTrp: 0.414 ± 0.491
2.07ProTyr: 2.07 ± 1.139
0.0ProXaa: 0.0 ± 0.0
Gln
1.242GlnAla: 1.242 ± 0.57
0.828GlnCys: 0.828 ± 0.612
2.897GlnAsp: 2.897 ± 0.857
0.828GlnGlu: 0.828 ± 0.474
2.483GlnPhe: 2.483 ± 0.617
2.897GlnGly: 2.897 ± 0.987
0.828GlnHis: 0.828 ± 0.474
1.242GlnIle: 1.242 ± 0.86
1.242GlnLys: 1.242 ± 0.79
4.967GlnLeu: 4.967 ± 1.294
1.242GlnMet: 1.242 ± 0.607
2.897GlnAsn: 2.897 ± 1.072
2.07GlnPro: 2.07 ± 0.481
3.311GlnGln: 3.311 ± 1.231
2.483GlnArg: 2.483 ± 1.046
1.656GlnSer: 1.656 ± 0.51
4.139GlnThr: 4.139 ± 1.379
1.656GlnVal: 1.656 ± 1.057
1.242GlnTrp: 1.242 ± 0.589
2.07GlnTyr: 2.07 ± 0.668
0.0GlnXaa: 0.0 ± 0.0
Arg
2.897ArgAla: 2.897 ± 0.931
1.656ArgCys: 1.656 ± 0.69
2.897ArgAsp: 2.897 ± 0.99
1.242ArgGlu: 1.242 ± 0.818
1.656ArgPhe: 1.656 ± 0.925
3.311ArgGly: 3.311 ± 1.139
0.828ArgHis: 0.828 ± 0.746
2.07ArgIle: 2.07 ± 0.834
3.311ArgLys: 3.311 ± 0.8
5.795ArgLeu: 5.795 ± 1.673
0.414ArgMet: 0.414 ± 0.424
2.07ArgAsn: 2.07 ± 0.83
3.311ArgPro: 3.311 ± 0.813
3.725ArgGln: 3.725 ± 1.648
6.209ArgArg: 6.209 ± 1.544
6.209ArgSer: 6.209 ± 1.414
2.483ArgThr: 2.483 ± 1.067
2.483ArgVal: 2.483 ± 1.116
0.0ArgTrp: 0.0 ± 0.0
1.242ArgTyr: 1.242 ± 0.739
0.0ArgXaa: 0.0 ± 0.0
Ser
2.483SerAla: 2.483 ± 0.494
1.656SerCys: 1.656 ± 1.21
4.139SerAsp: 4.139 ± 1.078
4.553SerGlu: 4.553 ± 0.696
5.381SerPhe: 5.381 ± 2.194
5.795SerGly: 5.795 ± 1.968
1.242SerHis: 1.242 ± 0.34
3.311SerIle: 3.311 ± 1.83
2.07SerLys: 2.07 ± 1.346
10.762SerLeu: 10.762 ± 1.727
0.828SerMet: 0.828 ± 0.709
4.553SerAsn: 4.553 ± 1.406
3.311SerPro: 3.311 ± 1.476
1.242SerGln: 1.242 ± 0.459
3.311SerArg: 3.311 ± 1.168
8.278SerSer: 8.278 ± 1.693
6.209SerThr: 6.209 ± 2.227
4.553SerVal: 4.553 ± 1.456
1.242SerTrp: 1.242 ± 0.597
2.07SerTyr: 2.07 ± 0.984
0.0SerXaa: 0.0 ± 0.0
Thr
3.311ThrAla: 3.311 ± 1.348
2.07ThrCys: 2.07 ± 0.955
3.311ThrAsp: 3.311 ± 1.011
4.139ThrGlu: 4.139 ± 0.879
1.656ThrPhe: 1.656 ± 0.779
5.381ThrGly: 5.381 ± 1.294
0.828ThrHis: 0.828 ± 0.423
2.897ThrIle: 2.897 ± 1.757
3.311ThrLys: 3.311 ± 1.457
5.795ThrLeu: 5.795 ± 1.118
1.242ThrMet: 1.242 ± 0.34
2.897ThrAsn: 2.897 ± 1.049
5.795ThrPro: 5.795 ± 1.306
2.897ThrGln: 2.897 ± 1.264
3.311ThrArg: 3.311 ± 1.189
5.381ThrSer: 5.381 ± 1.59
4.967ThrThr: 4.967 ± 2.064
2.897ThrVal: 2.897 ± 0.891
0.414ThrTrp: 0.414 ± 0.424
1.242ThrTyr: 1.242 ± 0.84
0.0ThrXaa: 0.0 ± 0.0
Val
4.139ValAla: 4.139 ± 1.485
1.242ValCys: 1.242 ± 1.277
6.623ValAsp: 6.623 ± 1.368
4.553ValGlu: 4.553 ± 0.929
1.242ValPhe: 1.242 ± 0.805
3.725ValGly: 3.725 ± 0.498
1.656ValHis: 1.656 ± 0.586
3.311ValIle: 3.311 ± 0.8
3.725ValLys: 3.725 ± 0.768
4.967ValLeu: 4.967 ± 0.891
0.828ValMet: 0.828 ± 0.613
2.07ValAsn: 2.07 ± 0.673
3.311ValPro: 3.311 ± 1.266
2.897ValGln: 2.897 ± 1.755
2.483ValArg: 2.483 ± 0.655
3.311ValSer: 3.311 ± 1.021
3.725ValThr: 3.725 ± 0.589
2.897ValVal: 2.897 ± 1.451
0.414ValTrp: 0.414 ± 0.373
2.897ValTyr: 2.897 ± 1.232
0.0ValXaa: 0.0 ± 0.0
Trp
0.414TrpAla: 0.414 ± 0.323
0.414TrpCys: 0.414 ± 0.491
0.414TrpAsp: 0.414 ± 0.373
0.828TrpGlu: 0.828 ± 0.849
0.0TrpPhe: 0.0 ± 0.0
0.828TrpGly: 0.828 ± 0.525
0.414TrpHis: 0.414 ± 0.424
1.656TrpIle: 1.656 ± 0.952
2.897TrpLys: 2.897 ± 1.07
1.656TrpLeu: 1.656 ± 0.51
0.414TrpMet: 0.414 ± 0.373
0.414TrpAsn: 0.414 ± 0.373
0.414TrpPro: 0.414 ± 0.373
0.414TrpGln: 0.414 ± 0.323
0.828TrpArg: 0.828 ± 0.849
0.414TrpSer: 0.414 ± 0.424
0.0TrpThr: 0.0 ± 0.0
0.828TrpVal: 0.828 ± 0.474
0.0TrpTrp: 0.0 ± 0.0
0.414TrpTyr: 0.414 ± 0.323
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.897TyrAla: 2.897 ± 0.933
1.242TyrCys: 1.242 ± 0.956
2.07TyrAsp: 2.07 ± 1.153
1.242TyrGlu: 1.242 ± 0.446
4.553TyrPhe: 4.553 ± 1.39
3.311TyrGly: 3.311 ± 1.638
0.0TyrHis: 0.0 ± 0.0
0.828TyrIle: 0.828 ± 0.473
2.897TyrLys: 2.897 ± 1.308
3.311TyrLeu: 3.311 ± 1.24
1.656TyrMet: 1.656 ± 1.016
1.242TyrAsn: 1.242 ± 0.698
2.07TyrPro: 2.07 ± 1.149
1.242TyrGln: 1.242 ± 0.618
1.242TyrArg: 1.242 ± 0.873
0.828TyrSer: 0.828 ± 0.431
2.07TyrThr: 2.07 ± 0.836
0.414TyrVal: 0.414 ± 0.373
0.414TyrTrp: 0.414 ± 0.373
1.656TyrTyr: 1.656 ± 1.214
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2417 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski