Amino acid dipepetide frequency for Gammapapillomavirus 25

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.302AlaAla: 3.302 ± 0.761
0.413AlaCys: 0.413 ± 0.439
3.302AlaAsp: 3.302 ± 0.978
2.889AlaGlu: 2.889 ± 1.207
2.476AlaPhe: 2.476 ± 0.901
2.476AlaGly: 2.476 ± 1.147
0.825AlaHis: 0.825 ± 0.436
2.476AlaIle: 2.476 ± 0.902
2.889AlaLys: 2.889 ± 0.476
4.953AlaLeu: 4.953 ± 1.602
0.825AlaMet: 0.825 ± 0.348
3.302AlaAsn: 3.302 ± 1.112
1.651AlaPro: 1.651 ± 0.599
3.714AlaGln: 3.714 ± 1.326
4.127AlaArg: 4.127 ± 2.159
2.889AlaSer: 2.889 ± 1.632
3.302AlaThr: 3.302 ± 0.526
3.714AlaVal: 3.714 ± 1.253
0.0AlaTrp: 0.0 ± 0.0
0.413AlaTyr: 0.413 ± 0.32
0.0AlaXaa: 0.0 ± 0.0
Cys
2.064CysAla: 2.064 ± 0.762
1.238CysCys: 1.238 ± 0.739
1.651CysAsp: 1.651 ± 0.843
2.064CysGlu: 2.064 ± 0.876
1.238CysPhe: 1.238 ± 0.684
0.413CysGly: 0.413 ± 0.539
0.0CysHis: 0.0 ± 0.0
0.413CysIle: 0.413 ± 0.439
2.889CysLys: 2.889 ± 1.315
2.889CysLeu: 2.889 ± 2.216
0.413CysMet: 0.413 ± 0.32
1.238CysAsn: 1.238 ± 0.628
1.651CysPro: 1.651 ± 0.642
0.413CysGln: 0.413 ± 0.439
0.413CysArg: 0.413 ± 0.439
0.825CysSer: 0.825 ± 0.562
0.413CysThr: 0.413 ± 0.32
0.0CysVal: 0.0 ± 0.0
0.825CysTrp: 0.825 ± 0.473
1.238CysTyr: 1.238 ± 0.905
0.0CysXaa: 0.0 ± 0.0
Asp
1.651AspAla: 1.651 ± 0.724
2.064AspCys: 2.064 ± 0.973
4.953AspAsp: 4.953 ± 1.935
7.842AspGlu: 7.842 ± 1.411
4.54AspPhe: 4.54 ± 1.668
2.476AspGly: 2.476 ± 0.618
1.238AspHis: 1.238 ± 0.656
3.714AspIle: 3.714 ± 1.321
2.889AspLys: 2.889 ± 1.21
4.54AspLeu: 4.54 ± 1.098
1.651AspMet: 1.651 ± 0.843
2.476AspAsn: 2.476 ± 0.527
4.953AspPro: 4.953 ± 1.411
0.0AspGln: 0.0 ± 0.0
0.825AspArg: 0.825 ± 0.473
5.365AspSer: 5.365 ± 1.34
4.54AspThr: 4.54 ± 0.793
5.365AspVal: 5.365 ± 1.923
1.238AspTrp: 1.238 ± 0.684
2.064AspTyr: 2.064 ± 0.631
0.0AspXaa: 0.0 ± 0.0
Glu
2.889GluAla: 2.889 ± 1.107
2.064GluCys: 2.064 ± 1.236
6.191GluAsp: 6.191 ± 0.959
5.778GluGlu: 5.778 ± 2.0
1.238GluPhe: 1.238 ± 0.608
3.302GluGly: 3.302 ± 0.433
0.825GluHis: 0.825 ± 0.436
2.889GluIle: 2.889 ± 1.288
2.889GluLys: 2.889 ± 1.169
6.603GluLeu: 6.603 ± 1.452
0.825GluMet: 0.825 ± 0.639
4.54GluAsn: 4.54 ± 1.006
4.127GluPro: 4.127 ± 1.547
2.064GluGln: 2.064 ± 0.703
3.302GluArg: 3.302 ± 1.604
4.953GluSer: 4.953 ± 0.887
4.54GluThr: 4.54 ± 1.821
2.889GluVal: 2.889 ± 0.925
0.413GluTrp: 0.413 ± 0.32
2.064GluTyr: 2.064 ± 0.73
0.0GluXaa: 0.0 ± 0.0
Phe
2.889PheAla: 2.889 ± 0.541
0.413PheCys: 0.413 ± 0.439
2.889PheAsp: 2.889 ± 0.702
2.476PheGlu: 2.476 ± 0.822
1.651PhePhe: 1.651 ± 0.696
2.064PheGly: 2.064 ± 0.687
0.825PheHis: 0.825 ± 0.639
2.476PheIle: 2.476 ± 0.996
4.54PheLys: 4.54 ± 1.62
3.714PheLeu: 3.714 ± 0.579
2.064PheMet: 2.064 ± 0.83
1.238PheAsn: 1.238 ± 0.739
2.064PhePro: 2.064 ± 0.697
1.651PheGln: 1.651 ± 0.909
2.476PheArg: 2.476 ± 0.421
2.064PheSer: 2.064 ± 1.04
2.889PheThr: 2.889 ± 0.928
2.889PheVal: 2.889 ± 1.2
0.825PheTrp: 0.825 ± 0.348
3.302PheTyr: 3.302 ± 0.963
0.0PheXaa: 0.0 ± 0.0
Gly
2.889GlyAla: 2.889 ± 1.115
0.413GlyCys: 0.413 ± 0.365
4.953GlyAsp: 4.953 ± 1.553
4.127GlyGlu: 4.127 ± 0.931
1.651GlyPhe: 1.651 ± 0.518
4.127GlyGly: 4.127 ± 1.749
1.651GlyHis: 1.651 ± 0.923
4.127GlyIle: 4.127 ± 0.778
4.127GlyLys: 4.127 ± 0.945
5.365GlyLeu: 5.365 ± 1.514
0.825GlyMet: 0.825 ± 0.857
2.476GlyAsn: 2.476 ± 1.313
2.476GlyPro: 2.476 ± 0.917
0.413GlyGln: 0.413 ± 0.318
2.889GlyArg: 2.889 ± 0.646
5.778GlySer: 5.778 ± 1.543
4.54GlyThr: 4.54 ± 1.523
3.302GlyVal: 3.302 ± 0.893
0.413GlyTrp: 0.413 ± 0.455
1.651GlyTyr: 1.651 ± 0.557
0.0GlyXaa: 0.0 ± 0.0
His
0.413HisAla: 0.413 ± 0.539
0.825HisCys: 0.825 ± 0.436
0.413HisAsp: 0.413 ± 0.539
0.0HisGlu: 0.0 ± 0.0
2.064HisPhe: 2.064 ± 0.597
1.238HisGly: 1.238 ± 0.738
0.413HisHis: 0.413 ± 0.439
0.413HisIle: 0.413 ± 0.365
1.651HisLys: 1.651 ± 0.713
1.651HisLeu: 1.651 ± 1.044
0.825HisMet: 0.825 ± 0.617
0.413HisAsn: 0.413 ± 0.32
2.064HisPro: 2.064 ± 1.158
0.413HisGln: 0.413 ± 0.32
0.825HisArg: 0.825 ± 0.582
0.825HisSer: 0.825 ± 0.436
1.651HisThr: 1.651 ± 0.874
0.0HisVal: 0.0 ± 0.0
1.238HisTrp: 1.238 ± 0.655
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.476IleAla: 2.476 ± 1.184
1.651IleCys: 1.651 ± 0.681
2.476IleAsp: 2.476 ± 1.116
3.302IleGlu: 3.302 ± 0.985
1.238IlePhe: 1.238 ± 0.654
5.365IleGly: 5.365 ± 2.614
0.413IleHis: 0.413 ± 0.455
2.476IleIle: 2.476 ± 1.116
0.825IleLys: 0.825 ± 0.91
4.54IleLeu: 4.54 ± 1.028
0.413IleMet: 0.413 ± 0.32
3.714IleAsn: 3.714 ± 1.136
2.889IlePro: 2.889 ± 1.007
2.889IleGln: 2.889 ± 0.795
2.064IleArg: 2.064 ± 0.715
4.127IleSer: 4.127 ± 1.097
3.714IleThr: 3.714 ± 1.406
2.889IleVal: 2.889 ± 0.52
0.413IleTrp: 0.413 ± 0.32
1.238IleTyr: 1.238 ± 0.401
0.0IleXaa: 0.0 ± 0.0
Lys
2.064LysAla: 2.064 ± 0.775
1.238LysCys: 1.238 ± 0.684
2.476LysAsp: 2.476 ± 0.421
2.064LysGlu: 2.064 ± 0.756
2.476LysPhe: 2.476 ± 1.172
2.064LysGly: 2.064 ± 0.568
0.825LysHis: 0.825 ± 0.473
2.476LysIle: 2.476 ± 0.957
2.476LysLys: 2.476 ± 0.802
6.191LysLeu: 6.191 ± 1.88
0.0LysMet: 0.0 ± 0.0
2.476LysAsn: 2.476 ± 1.4
3.302LysPro: 3.302 ± 1.593
2.889LysGln: 2.889 ± 1.374
4.953LysArg: 4.953 ± 0.801
6.191LysSer: 6.191 ± 1.601
4.127LysThr: 4.127 ± 1.567
2.889LysVal: 2.889 ± 0.872
0.413LysTrp: 0.413 ± 0.429
4.953LysTyr: 4.953 ± 1.565
0.0LysXaa: 0.0 ± 0.0
Leu
4.127LeuAla: 4.127 ± 1.107
0.413LeuCys: 0.413 ± 0.365
6.191LeuAsp: 6.191 ± 1.397
5.778LeuGlu: 5.778 ± 0.421
4.953LeuPhe: 4.953 ± 1.314
6.191LeuGly: 6.191 ± 2.042
2.064LeuHis: 2.064 ± 1.042
4.127LeuIle: 4.127 ± 1.809
3.302LeuLys: 3.302 ± 1.66
5.778LeuLeu: 5.778 ± 1.307
2.064LeuMet: 2.064 ± 0.779
2.064LeuAsn: 2.064 ± 0.884
4.953LeuPro: 4.953 ± 1.153
9.905LeuGln: 9.905 ± 1.116
4.127LeuArg: 4.127 ± 1.665
9.08LeuSer: 9.08 ± 2.286
4.953LeuThr: 4.953 ± 0.856
7.842LeuVal: 7.842 ± 0.966
0.825LeuTrp: 0.825 ± 0.552
4.54LeuTyr: 4.54 ± 1.625
0.0LeuXaa: 0.0 ± 0.0
Met
2.064MetAla: 2.064 ± 0.805
1.238MetCys: 1.238 ± 0.56
2.889MetAsp: 2.889 ± 1.296
0.0MetGlu: 0.0 ± 0.0
0.825MetPhe: 0.825 ± 0.348
0.413MetGly: 0.413 ± 0.32
0.413MetHis: 0.413 ± 0.439
0.825MetIle: 0.825 ± 0.562
1.238MetLys: 1.238 ± 0.56
1.238MetLeu: 1.238 ± 0.684
0.0MetMet: 0.0 ± 0.0
1.238MetAsn: 1.238 ± 0.372
0.0MetPro: 0.0 ± 0.0
1.238MetGln: 1.238 ± 0.372
0.825MetArg: 0.825 ± 0.348
1.238MetSer: 1.238 ± 0.372
0.825MetThr: 0.825 ± 0.473
0.825MetVal: 0.825 ± 0.473
0.0MetTrp: 0.0 ± 0.0
1.651MetTyr: 1.651 ± 0.706
0.0MetXaa: 0.0 ± 0.0
Asn
2.064AsnAla: 2.064 ± 0.777
0.825AsnCys: 0.825 ± 0.473
1.651AsnAsp: 1.651 ± 0.569
3.302AsnGlu: 3.302 ± 0.463
0.825AsnPhe: 0.825 ± 0.436
2.476AsnGly: 2.476 ± 0.689
0.413AsnHis: 0.413 ± 0.365
4.54AsnIle: 4.54 ± 1.309
2.476AsnLys: 2.476 ± 1.01
3.714AsnLeu: 3.714 ± 0.86
1.651AsnMet: 1.651 ± 0.843
2.889AsnAsn: 2.889 ± 1.077
3.714AsnPro: 3.714 ± 0.872
2.064AsnGln: 2.064 ± 0.769
2.889AsnArg: 2.889 ± 0.781
1.651AsnSer: 1.651 ± 0.957
2.064AsnThr: 2.064 ± 0.746
3.302AsnVal: 3.302 ± 0.835
1.238AsnTrp: 1.238 ± 0.372
1.238AsnTyr: 1.238 ± 1.094
0.0AsnXaa: 0.0 ± 0.0
Pro
5.365ProAla: 5.365 ± 1.911
1.238ProCys: 1.238 ± 0.905
5.365ProAsp: 5.365 ± 1.022
2.889ProGlu: 2.889 ± 1.171
2.476ProPhe: 2.476 ± 0.844
1.651ProGly: 1.651 ± 0.55
2.064ProHis: 2.064 ± 1.509
2.889ProIle: 2.889 ± 1.513
3.714ProLys: 3.714 ± 0.824
7.016ProLeu: 7.016 ± 1.441
0.413ProMet: 0.413 ± 0.365
2.889ProAsn: 2.889 ± 0.646
8.667ProPro: 8.667 ± 2.498
2.476ProGln: 2.476 ± 0.745
1.651ProArg: 1.651 ± 0.996
7.016ProSer: 7.016 ± 1.873
2.476ProThr: 2.476 ± 0.67
1.238ProVal: 1.238 ± 0.77
0.0ProTrp: 0.0 ± 0.0
2.064ProTyr: 2.064 ± 1.377
0.0ProXaa: 0.0 ± 0.0
Gln
1.238GlnAla: 1.238 ± 0.419
0.413GlnCys: 0.413 ± 0.439
2.889GlnAsp: 2.889 ± 0.879
3.302GlnGlu: 3.302 ± 0.632
2.889GlnPhe: 2.889 ± 1.065
2.889GlnGly: 2.889 ± 1.234
0.413GlnHis: 0.413 ± 0.539
2.889GlnIle: 2.889 ± 1.086
1.651GlnLys: 1.651 ± 0.616
6.191GlnLeu: 6.191 ± 1.302
1.651GlnMet: 1.651 ± 0.745
1.238GlnAsn: 1.238 ± 1.094
2.064GlnPro: 2.064 ± 0.397
1.651GlnGln: 1.651 ± 0.531
0.825GlnArg: 0.825 ± 0.419
1.651GlnSer: 1.651 ± 0.745
2.476GlnThr: 2.476 ± 1.342
2.064GlnVal: 2.064 ± 1.2
0.825GlnTrp: 0.825 ± 0.508
1.651GlnTyr: 1.651 ± 0.724
0.0GlnXaa: 0.0 ± 0.0
Arg
2.476ArgAla: 2.476 ± 1.308
1.238ArgCys: 1.238 ± 0.68
2.476ArgAsp: 2.476 ± 0.841
1.651ArgGlu: 1.651 ± 0.644
2.889ArgPhe: 2.889 ± 1.119
4.127ArgGly: 4.127 ± 1.317
2.476ArgHis: 2.476 ± 0.501
0.0ArgIle: 0.0 ± 0.0
4.127ArgLys: 4.127 ± 1.236
4.953ArgLeu: 4.953 ± 1.318
1.651ArgMet: 1.651 ± 0.724
2.476ArgAsn: 2.476 ± 0.456
3.714ArgPro: 3.714 ± 1.662
1.238ArgGln: 1.238 ± 0.91
5.778ArgArg: 5.778 ± 3.004
4.953ArgSer: 4.953 ± 2.471
2.889ArgThr: 2.889 ± 1.606
2.064ArgVal: 2.064 ± 0.648
0.825ArgTrp: 0.825 ± 0.857
2.889ArgTyr: 2.889 ± 0.558
0.0ArgXaa: 0.0 ± 0.0
Ser
1.651SerAla: 1.651 ± 0.706
2.064SerCys: 2.064 ± 1.195
2.476SerAsp: 2.476 ± 1.568
5.778SerGlu: 5.778 ± 0.721
3.302SerPhe: 3.302 ± 1.328
5.778SerGly: 5.778 ± 1.189
0.825SerHis: 0.825 ± 0.508
2.476SerIle: 2.476 ± 0.889
2.889SerLys: 2.889 ± 0.887
10.73SerLeu: 10.73 ± 1.611
0.825SerMet: 0.825 ± 0.545
3.714SerAsn: 3.714 ± 1.836
3.714SerPro: 3.714 ± 1.501
3.302SerGln: 3.302 ± 0.646
5.365SerArg: 5.365 ± 1.19
6.603SerSer: 6.603 ± 0.876
9.492SerThr: 9.492 ± 2.133
5.365SerVal: 5.365 ± 1.657
0.413SerTrp: 0.413 ± 0.429
2.064SerTyr: 2.064 ± 1.236
0.0SerXaa: 0.0 ± 0.0
Thr
5.365ThrAla: 5.365 ± 1.685
0.413ThrCys: 0.413 ± 0.365
4.127ThrAsp: 4.127 ± 0.755
2.889ThrGlu: 2.889 ± 0.558
1.651ThrPhe: 1.651 ± 0.668
4.127ThrGly: 4.127 ± 1.321
0.413ThrHis: 0.413 ± 0.539
4.953ThrIle: 4.953 ± 1.518
3.714ThrLys: 3.714 ± 1.553
4.953ThrLeu: 4.953 ± 1.437
0.825ThrMet: 0.825 ± 0.473
2.476ThrAsn: 2.476 ± 0.804
3.714ThrPro: 3.714 ± 0.958
1.238ThrGln: 1.238 ± 0.713
4.54ThrArg: 4.54 ± 1.106
4.953ThrSer: 4.953 ± 1.141
4.127ThrThr: 4.127 ± 1.456
7.016ThrVal: 7.016 ± 1.37
0.825ThrTrp: 0.825 ± 0.348
2.889ThrTyr: 2.889 ± 1.082
0.0ThrXaa: 0.0 ± 0.0
Val
2.064ValAla: 2.064 ± 1.043
2.064ValCys: 2.064 ± 0.861
4.127ValAsp: 4.127 ± 1.156
4.953ValGlu: 4.953 ± 0.916
4.127ValPhe: 4.127 ± 0.977
3.714ValGly: 3.714 ± 0.635
0.825ValHis: 0.825 ± 0.436
2.476ValIle: 2.476 ± 1.214
2.064ValLys: 2.064 ± 0.648
3.714ValLeu: 3.714 ± 0.974
1.238ValMet: 1.238 ± 1.094
1.238ValAsn: 1.238 ± 0.56
4.54ValPro: 4.54 ± 1.687
1.238ValGln: 1.238 ± 0.419
4.54ValArg: 4.54 ± 1.015
4.953ValSer: 4.953 ± 1.388
4.54ValThr: 4.54 ± 0.768
2.064ValVal: 2.064 ± 0.973
1.238ValTrp: 1.238 ± 0.655
2.476ValTyr: 2.476 ± 0.521
0.0ValXaa: 0.0 ± 0.0
Trp
0.413TrpAla: 0.413 ± 0.429
0.0TrpCys: 0.0 ± 0.0
0.825TrpAsp: 0.825 ± 0.461
1.238TrpGlu: 1.238 ± 0.604
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.413TrpHis: 0.413 ± 0.429
0.825TrpIle: 0.825 ± 0.639
2.064TrpLys: 2.064 ± 0.894
1.238TrpLeu: 1.238 ± 0.515
0.413TrpMet: 0.413 ± 0.365
0.825TrpAsn: 0.825 ± 0.729
0.0TrpPro: 0.0 ± 0.0
0.413TrpGln: 0.413 ± 0.32
1.238TrpArg: 1.238 ± 1.009
1.238TrpSer: 1.238 ± 0.916
0.413TrpThr: 0.413 ± 0.429
0.413TrpVal: 0.413 ± 0.32
0.0TrpTrp: 0.0 ± 0.0
0.413TrpTyr: 0.413 ± 0.32
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.476TyrAla: 2.476 ± 0.901
2.064TyrCys: 2.064 ± 1.236
2.064TyrAsp: 2.064 ± 0.644
2.064TyrGlu: 2.064 ± 0.697
3.302TyrPhe: 3.302 ± 1.313
3.302TyrGly: 3.302 ± 1.105
0.0TyrHis: 0.0 ± 0.0
1.651TyrIle: 1.651 ± 0.843
4.127TyrLys: 4.127 ± 0.818
3.302TyrLeu: 3.302 ± 0.912
0.0TyrMet: 0.0 ± 0.0
2.064TyrAsn: 2.064 ± 0.942
3.302TyrPro: 3.302 ± 0.565
1.651TyrGln: 1.651 ± 0.489
1.238TyrArg: 1.238 ± 0.739
2.476TyrSer: 2.476 ± 0.909
1.238TyrThr: 1.238 ± 0.844
2.064TyrVal: 2.064 ± 0.95
0.413TyrTrp: 0.413 ± 0.365
2.476TyrTyr: 2.476 ± 1.112
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2424 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski