Amino acid dipepetide frequency for Gammapapillomavirus 14

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.628AlaAla: 4.628 ± 1.617
0.841AlaCys: 0.841 ± 0.915
5.048AlaAsp: 5.048 ± 1.328
3.366AlaGlu: 3.366 ± 1.187
4.628AlaPhe: 4.628 ± 0.85
3.786AlaGly: 3.786 ± 1.558
2.103AlaHis: 2.103 ± 0.852
2.945AlaIle: 2.945 ± 1.204
2.103AlaLys: 2.103 ± 0.895
5.048AlaLeu: 5.048 ± 1.057
0.841AlaMet: 0.841 ± 0.707
2.524AlaAsn: 2.524 ± 1.002
3.366AlaPro: 3.366 ± 1.214
2.524AlaGln: 2.524 ± 1.043
3.786AlaArg: 3.786 ± 1.716
4.628AlaSer: 4.628 ± 1.3
4.207AlaThr: 4.207 ± 1.114
2.103AlaVal: 2.103 ± 0.441
0.0AlaTrp: 0.0 ± 0.0
2.103AlaTyr: 2.103 ± 0.741
0.0AlaXaa: 0.0 ± 0.0
Cys
1.262CysAla: 1.262 ± 0.686
2.103CysCys: 2.103 ± 0.994
0.841CysAsp: 0.841 ± 0.653
0.421CysGlu: 0.421 ± 0.359
1.683CysPhe: 1.683 ± 0.804
0.421CysGly: 0.421 ± 0.39
0.0CysHis: 0.0 ± 0.0
1.262CysIle: 1.262 ± 0.625
2.103CysLys: 2.103 ± 0.81
1.683CysLeu: 1.683 ± 2.156
0.0CysMet: 0.0 ± 0.0
1.262CysAsn: 1.262 ± 0.631
0.841CysPro: 0.841 ± 0.78
0.0CysGln: 0.0 ± 0.0
0.841CysArg: 0.841 ± 1.308
2.103CysSer: 2.103 ± 1.55
1.683CysThr: 1.683 ± 0.785
0.841CysVal: 0.841 ± 0.643
1.683CysTrp: 1.683 ± 0.59
1.262CysTyr: 1.262 ± 0.809
0.0CysXaa: 0.0 ± 0.0
Asp
5.048AspAla: 5.048 ± 1.184
1.683AspCys: 1.683 ± 0.647
4.207AspAsp: 4.207 ± 2.111
4.207AspGlu: 4.207 ± 1.365
1.683AspPhe: 1.683 ± 0.681
1.683AspGly: 1.683 ± 0.547
1.262AspHis: 1.262 ± 0.849
4.628AspIle: 4.628 ± 2.042
2.103AspLys: 2.103 ± 1.115
7.573AspLeu: 7.573 ± 2.403
0.841AspMet: 0.841 ± 0.444
4.207AspAsn: 4.207 ± 1.264
4.628AspPro: 4.628 ± 1.535
1.262AspGln: 1.262 ± 0.672
0.421AspArg: 0.421 ± 0.39
4.207AspSer: 4.207 ± 0.584
6.731AspThr: 6.731 ± 1.525
5.469AspVal: 5.469 ± 1.664
1.683AspTrp: 1.683 ± 1.036
2.103AspTyr: 2.103 ± 1.827
0.0AspXaa: 0.0 ± 0.0
Glu
4.628GluAla: 4.628 ± 1.845
1.683GluCys: 1.683 ± 0.804
4.207GluAsp: 4.207 ± 1.66
7.152GluGlu: 7.152 ± 2.84
2.103GluPhe: 2.103 ± 0.778
2.945GluGly: 2.945 ± 1.251
1.262GluHis: 1.262 ± 1.247
1.683GluIle: 1.683 ± 0.633
3.366GluLys: 3.366 ± 1.636
5.89GluLeu: 5.89 ± 2.23
1.262GluMet: 1.262 ± 0.656
4.628GluAsn: 4.628 ± 0.765
5.048GluPro: 5.048 ± 1.232
2.103GluGln: 2.103 ± 0.856
4.207GluArg: 4.207 ± 1.198
3.786GluSer: 3.786 ± 1.509
4.207GluThr: 4.207 ± 0.982
2.945GluVal: 2.945 ± 1.38
1.262GluTrp: 1.262 ± 0.869
2.524GluTyr: 2.524 ± 1.17
0.0GluXaa: 0.0 ± 0.0
Phe
2.103PheAla: 2.103 ± 0.956
1.683PheCys: 1.683 ± 0.744
2.945PheAsp: 2.945 ± 0.655
2.945PheGlu: 2.945 ± 1.327
2.524PhePhe: 2.524 ± 1.279
2.945PheGly: 2.945 ± 1.275
0.421PheHis: 0.421 ± 0.39
2.103PheIle: 2.103 ± 0.726
3.786PheLys: 3.786 ± 1.969
4.628PheLeu: 4.628 ± 2.215
1.683PheMet: 1.683 ± 0.611
2.103PheAsn: 2.103 ± 0.755
2.945PhePro: 2.945 ± 0.536
1.262PheGln: 1.262 ± 0.656
1.683PheArg: 1.683 ± 0.697
4.628PheSer: 4.628 ± 1.309
0.841PheThr: 0.841 ± 0.447
2.524PheVal: 2.524 ± 0.927
1.262PheTrp: 1.262 ± 0.617
2.103PheTyr: 2.103 ± 0.715
0.0PheXaa: 0.0 ± 0.0
Gly
1.262GlyAla: 1.262 ± 0.686
0.421GlyCys: 0.421 ± 0.39
3.786GlyAsp: 3.786 ± 0.643
3.786GlyGlu: 3.786 ± 0.886
1.683GlyPhe: 1.683 ± 0.632
4.628GlyGly: 4.628 ± 1.757
1.262GlyHis: 1.262 ± 0.75
4.207GlyIle: 4.207 ± 1.101
3.366GlyLys: 3.366 ± 1.08
4.207GlyLeu: 4.207 ± 2.291
0.0GlyMet: 0.0 ± 0.0
3.366GlyAsn: 3.366 ± 1.266
3.786GlyPro: 3.786 ± 1.255
1.262GlyGln: 1.262 ± 0.421
5.048GlyArg: 5.048 ± 1.978
3.366GlySer: 3.366 ± 1.644
5.048GlyThr: 5.048 ± 1.522
2.103GlyVal: 2.103 ± 0.605
0.0GlyTrp: 0.0 ± 0.0
0.421GlyTyr: 0.421 ± 0.354
0.0GlyXaa: 0.0 ± 0.0
His
1.262HisAla: 1.262 ± 0.421
0.421HisCys: 0.421 ± 0.416
0.841HisAsp: 0.841 ± 0.685
0.421HisGlu: 0.421 ± 0.39
0.421HisPhe: 0.421 ± 0.359
0.421HisGly: 0.421 ± 0.416
0.0HisHis: 0.0 ± 0.0
1.683HisIle: 1.683 ± 0.973
0.841HisLys: 0.841 ± 0.643
2.103HisLeu: 2.103 ± 1.035
0.421HisMet: 0.421 ± 0.416
0.0HisAsn: 0.0 ± 0.0
2.103HisPro: 2.103 ± 0.804
1.262HisGln: 1.262 ± 1.098
0.841HisArg: 0.841 ± 0.685
0.421HisSer: 0.421 ± 0.359
1.262HisThr: 1.262 ± 0.5
1.262HisVal: 1.262 ± 0.705
0.421HisTrp: 0.421 ± 0.39
1.262HisTyr: 1.262 ± 0.699
0.0HisXaa: 0.0 ± 0.0
Ile
3.786IleAla: 3.786 ± 1.476
0.421IleCys: 0.421 ± 0.39
4.207IleAsp: 4.207 ± 1.172
5.89IleGlu: 5.89 ± 1.062
2.524IlePhe: 2.524 ± 1.351
3.366IleGly: 3.366 ± 1.425
0.421IleHis: 0.421 ± 0.528
1.683IleIle: 1.683 ± 0.595
1.262IleLys: 1.262 ± 0.672
2.103IleLeu: 2.103 ± 0.441
0.841IleMet: 0.841 ± 0.514
2.524IleAsn: 2.524 ± 0.743
3.366IlePro: 3.366 ± 1.333
2.524IleGln: 2.524 ± 0.572
2.524IleArg: 2.524 ± 1.357
2.945IleSer: 2.945 ± 1.186
2.945IleThr: 2.945 ± 1.275
5.469IleVal: 5.469 ± 1.641
0.0IleTrp: 0.0 ± 0.0
3.366IleTyr: 3.366 ± 1.361
0.0IleXaa: 0.0 ± 0.0
Lys
1.683LysAla: 1.683 ± 0.681
2.524LysCys: 2.524 ± 0.772
2.524LysAsp: 2.524 ± 0.921
4.628LysGlu: 4.628 ± 2.416
4.207LysPhe: 4.207 ± 1.448
2.524LysGly: 2.524 ± 1.279
1.262LysHis: 1.262 ± 0.672
2.524LysIle: 2.524 ± 0.487
2.945LysLys: 2.945 ± 0.751
5.048LysLeu: 5.048 ± 2.132
1.683LysMet: 1.683 ± 1.025
2.945LysAsn: 2.945 ± 1.441
1.262LysPro: 1.262 ± 0.421
1.262LysGln: 1.262 ± 0.414
4.207LysArg: 4.207 ± 0.593
3.366LysSer: 3.366 ± 2.417
3.366LysThr: 3.366 ± 1.166
2.524LysVal: 2.524 ± 1.09
0.841LysTrp: 0.841 ± 0.685
2.103LysTyr: 2.103 ± 0.894
0.0LysXaa: 0.0 ± 0.0
Leu
5.469LeuAla: 5.469 ± 1.278
2.103LeuCys: 2.103 ± 1.08
5.469LeuAsp: 5.469 ± 1.662
4.207LeuGlu: 4.207 ± 1.056
5.89LeuPhe: 5.89 ± 2.046
4.628LeuGly: 4.628 ± 1.627
2.103LeuHis: 2.103 ± 0.937
5.89LeuIle: 5.89 ± 1.01
6.731LeuLys: 6.731 ± 1.703
6.731LeuLeu: 6.731 ± 3.063
2.103LeuMet: 2.103 ± 0.954
3.786LeuAsn: 3.786 ± 1.017
5.89LeuPro: 5.89 ± 1.218
3.786LeuGln: 3.786 ± 0.757
4.628LeuArg: 4.628 ± 2.569
8.835LeuSer: 8.835 ± 1.155
4.207LeuThr: 4.207 ± 0.737
5.89LeuVal: 5.89 ± 1.726
0.0LeuTrp: 0.0 ± 0.0
5.469LeuTyr: 5.469 ± 1.47
0.0LeuXaa: 0.0 ± 0.0
Met
0.421MetAla: 0.421 ± 0.39
1.262MetCys: 1.262 ± 1.405
1.262MetAsp: 1.262 ± 0.414
0.841MetGlu: 0.841 ± 0.938
0.841MetPhe: 0.841 ± 0.416
1.262MetGly: 1.262 ± 0.421
0.421MetHis: 0.421 ± 0.359
0.421MetIle: 0.421 ± 0.39
0.0MetLys: 0.0 ± 0.0
1.262MetLeu: 1.262 ± 0.421
0.0MetMet: 0.0 ± 0.0
0.841MetAsn: 0.841 ± 0.44
0.0MetPro: 0.0 ± 0.0
0.841MetGln: 0.841 ± 0.831
1.262MetArg: 1.262 ± 0.891
1.683MetSer: 1.683 ± 0.633
1.262MetThr: 1.262 ± 0.414
1.683MetVal: 1.683 ± 0.973
0.0MetTrp: 0.0 ± 0.0
0.421MetTyr: 0.421 ± 0.359
0.0MetXaa: 0.0 ± 0.0
Asn
2.945AsnAla: 2.945 ± 1.081
1.262AsnCys: 1.262 ± 0.809
2.945AsnAsp: 2.945 ± 0.956
2.945AsnGlu: 2.945 ± 1.121
2.103AsnPhe: 2.103 ± 1.115
1.683AsnGly: 1.683 ± 0.56
0.421AsnHis: 0.421 ± 0.359
1.683AsnIle: 1.683 ± 0.608
3.366AsnLys: 3.366 ± 1.228
5.048AsnLeu: 5.048 ± 1.488
0.841AsnMet: 0.841 ± 0.717
1.683AsnAsn: 1.683 ± 1.123
4.628AsnPro: 4.628 ± 1.518
1.262AsnGln: 1.262 ± 0.812
2.103AsnArg: 2.103 ± 0.394
4.628AsnSer: 4.628 ± 1.613
2.945AsnThr: 2.945 ± 1.266
3.786AsnVal: 3.786 ± 1.008
0.421AsnTrp: 0.421 ± 0.359
1.262AsnTyr: 1.262 ± 0.937
0.0AsnXaa: 0.0 ± 0.0
Pro
4.628ProAla: 4.628 ± 1.873
0.841ProCys: 0.841 ± 0.707
6.731ProAsp: 6.731 ± 1.32
3.786ProGlu: 3.786 ± 1.426
1.683ProPhe: 1.683 ± 0.857
2.103ProGly: 2.103 ± 0.804
0.421ProHis: 0.421 ± 0.528
3.366ProIle: 3.366 ± 0.786
2.524ProLys: 2.524 ± 0.927
5.89ProLeu: 5.89 ± 1.211
0.421ProMet: 0.421 ± 0.359
2.524ProAsn: 2.524 ± 0.983
6.31ProPro: 6.31 ± 1.497
2.524ProGln: 2.524 ± 1.857
5.89ProArg: 5.89 ± 1.72
5.048ProSer: 5.048 ± 1.666
5.469ProThr: 5.469 ± 1.842
2.103ProVal: 2.103 ± 0.851
0.421ProTrp: 0.421 ± 0.416
2.103ProTyr: 2.103 ± 1.13
0.0ProXaa: 0.0 ± 0.0
Gln
0.841GlnAla: 0.841 ± 0.514
0.0GlnCys: 0.0 ± 0.0
2.103GlnAsp: 2.103 ± 0.768
2.945GlnGlu: 2.945 ± 1.041
0.841GlnPhe: 0.841 ± 0.429
2.103GlnGly: 2.103 ± 0.601
0.841GlnHis: 0.841 ± 0.717
0.421GlnIle: 0.421 ± 0.416
0.0GlnLys: 0.0 ± 0.0
5.89GlnLeu: 5.89 ± 1.356
1.262GlnMet: 1.262 ± 0.496
2.945GlnAsn: 2.945 ± 1.045
2.103GlnPro: 2.103 ± 1.053
2.945GlnGln: 2.945 ± 0.685
1.683GlnArg: 1.683 ± 1.654
2.103GlnSer: 2.103 ± 0.394
1.262GlnThr: 1.262 ± 0.5
2.945GlnVal: 2.945 ± 1.523
1.262GlnTrp: 1.262 ± 0.421
1.262GlnTyr: 1.262 ± 0.75
0.0GlnXaa: 0.0 ± 0.0
Arg
4.207ArgAla: 4.207 ± 1.287
1.683ArgCys: 1.683 ± 0.752
2.524ArgAsp: 2.524 ± 0.907
2.524ArgGlu: 2.524 ± 0.767
1.683ArgPhe: 1.683 ± 0.618
3.786ArgGly: 3.786 ± 1.824
2.103ArgHis: 2.103 ± 1.278
1.683ArgIle: 1.683 ± 1.108
3.786ArgLys: 3.786 ± 0.918
8.835ArgLeu: 8.835 ± 1.246
0.0ArgMet: 0.0 ± 0.611
2.524ArgAsn: 2.524 ± 0.772
2.524ArgPro: 2.524 ± 0.822
2.524ArgGln: 2.524 ± 0.876
4.207ArgArg: 4.207 ± 1.825
3.786ArgSer: 3.786 ± 2.57
3.366ArgThr: 3.366 ± 1.836
2.524ArgVal: 2.524 ± 0.86
0.0ArgTrp: 0.0 ± 0.0
0.421ArgTyr: 0.421 ± 0.39
0.0ArgXaa: 0.0 ± 0.0
Ser
3.786SerAla: 3.786 ± 0.783
0.841SerCys: 0.841 ± 0.717
2.524SerAsp: 2.524 ± 1.144
5.469SerGlu: 5.469 ± 2.182
5.048SerPhe: 5.048 ± 0.63
4.207SerGly: 4.207 ± 0.722
0.841SerHis: 0.841 ± 0.416
3.366SerIle: 3.366 ± 1.384
4.207SerLys: 4.207 ± 1.7
8.414SerLeu: 8.414 ± 1.728
1.262SerMet: 1.262 ± 0.702
3.786SerAsn: 3.786 ± 1.434
4.207SerPro: 4.207 ± 0.932
2.524SerGln: 2.524 ± 0.927
2.945SerArg: 2.945 ± 1.167
5.048SerSer: 5.048 ± 1.62
6.31SerThr: 6.31 ± 1.735
6.731SerVal: 6.731 ± 1.213
0.0SerTrp: 0.0 ± 0.0
1.683SerTyr: 1.683 ± 0.306
0.0SerXaa: 0.0 ± 0.0
Thr
4.628ThrAla: 4.628 ± 1.273
1.262ThrCys: 1.262 ± 0.891
4.628ThrAsp: 4.628 ± 2.426
5.89ThrGlu: 5.89 ± 1.035
1.683ThrPhe: 1.683 ± 0.857
3.786ThrGly: 3.786 ± 1.36
0.421ThrHis: 0.421 ± 0.354
4.628ThrIle: 4.628 ± 1.485
2.945ThrLys: 2.945 ± 1.257
6.31ThrLeu: 6.31 ± 0.886
0.841ThrMet: 0.841 ± 0.429
2.945ThrAsn: 2.945 ± 0.892
6.731ThrPro: 6.731 ± 1.263
0.841ThrGln: 0.841 ± 0.707
4.207ThrArg: 4.207 ± 1.572
4.207ThrSer: 4.207 ± 1.47
5.89ThrThr: 5.89 ± 1.824
3.786ThrVal: 3.786 ± 1.676
0.421ThrTrp: 0.421 ± 0.654
1.262ThrTyr: 1.262 ± 0.937
0.0ThrXaa: 0.0 ± 0.0
Val
5.048ValAla: 5.048 ± 1.449
0.421ValCys: 0.421 ± 0.654
4.628ValAsp: 4.628 ± 1.453
2.945ValGlu: 2.945 ± 0.88
2.524ValPhe: 2.524 ± 1.014
4.628ValGly: 4.628 ± 1.835
0.841ValHis: 0.841 ± 0.429
4.628ValIle: 4.628 ± 1.843
4.207ValLys: 4.207 ± 1.711
2.524ValLeu: 2.524 ± 1.115
0.841ValMet: 0.841 ± 0.44
2.524ValAsn: 2.524 ± 0.828
4.628ValPro: 4.628 ± 1.215
2.524ValGln: 2.524 ± 0.975
0.841ValArg: 0.841 ± 0.707
5.89ValSer: 5.89 ± 1.516
3.366ValThr: 3.366 ± 0.978
2.103ValVal: 2.103 ± 0.441
1.683ValTrp: 1.683 ± 1.235
2.524ValTyr: 2.524 ± 1.581
0.0ValXaa: 0.0 ± 0.0
Trp
0.841TrpAla: 0.841 ± 0.717
0.0TrpCys: 0.0 ± 0.0
1.683TrpAsp: 1.683 ± 0.735
0.421TrpGlu: 0.421 ± 0.416
0.841TrpPhe: 0.841 ± 0.416
0.421TrpGly: 0.421 ± 0.528
0.0TrpHis: 0.0 ± 0.0
1.683TrpIle: 1.683 ± 1.024
0.841TrpLys: 0.841 ± 0.416
1.683TrpLeu: 1.683 ± 0.306
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.841TrpGln: 0.841 ± 0.78
0.841TrpArg: 0.841 ± 0.803
0.421TrpSer: 0.421 ± 0.39
0.841TrpThr: 0.841 ± 0.831
0.841TrpVal: 0.841 ± 0.653
0.0TrpTrp: 0.0 ± 0.0
0.421TrpTyr: 0.421 ± 0.416
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.103TyrAla: 2.103 ± 0.755
0.841TyrCys: 0.841 ± 1.308
2.524TyrAsp: 2.524 ± 1.286
2.103TyrGlu: 2.103 ± 1.089
2.524TyrPhe: 2.524 ± 1.128
1.683TyrGly: 1.683 ± 0.712
1.262TyrHis: 1.262 ± 0.92
1.683TyrIle: 1.683 ± 0.986
2.945TyrLys: 2.945 ± 0.961
2.945TyrLeu: 2.945 ± 1.169
0.421TyrMet: 0.421 ± 0.359
0.841TyrAsn: 0.841 ± 0.447
0.421TyrPro: 0.421 ± 0.39
1.683TyrGln: 1.683 ± 0.59
2.945TyrArg: 2.945 ± 1.14
2.103TyrSer: 2.103 ± 0.516
2.103TyrThr: 2.103 ± 0.441
1.683TyrVal: 1.683 ± 0.633
1.262TyrTrp: 1.262 ± 0.812
1.683TyrTyr: 1.683 ± 1.235
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2378 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski