Amino acid dipepetide frequency for Ursus maritimus papillomavirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.378AlaAla: 6.378 ± 1.844
1.822AlaCys: 1.822 ± 0.998
3.645AlaAsp: 3.645 ± 1.26
6.378AlaGlu: 6.378 ± 1.558
1.367AlaPhe: 1.367 ± 0.914
3.645AlaGly: 3.645 ± 1.866
0.456AlaHis: 0.456 ± 0.419
1.822AlaIle: 1.822 ± 0.608
5.467AlaLys: 5.467 ± 1.024
5.923AlaLeu: 5.923 ± 1.104
2.278AlaMet: 2.278 ± 0.816
0.456AlaAsn: 0.456 ± 0.371
1.822AlaPro: 1.822 ± 1.207
0.911AlaGln: 0.911 ± 0.742
3.645AlaArg: 3.645 ± 0.975
6.834AlaSer: 6.834 ± 0.73
5.011AlaThr: 5.011 ± 1.03
2.278AlaVal: 2.278 ± 0.368
0.0AlaTrp: 0.0 ± 0.0
2.278AlaTyr: 2.278 ± 0.694
0.0AlaXaa: 0.0 ± 0.0
Cys
0.911CysAla: 0.911 ± 0.776
0.911CysCys: 0.911 ± 0.839
0.911CysAsp: 0.911 ± 0.473
1.822CysGlu: 1.822 ± 0.834
0.911CysPhe: 0.911 ± 0.417
1.367CysGly: 1.367 ± 1.634
0.456CysHis: 0.456 ± 0.371
0.456CysIle: 0.456 ± 0.344
1.822CysLys: 1.822 ± 0.786
2.278CysLeu: 2.278 ± 3.161
1.367CysMet: 1.367 ± 0.914
0.911CysAsn: 0.911 ± 0.687
1.367CysPro: 1.367 ± 0.422
0.911CysGln: 0.911 ± 0.687
0.0CysArg: 0.0 ± 0.0
2.733CysSer: 2.733 ± 1.432
1.367CysThr: 1.367 ± 0.697
0.911CysVal: 0.911 ± 0.776
1.367CysTrp: 1.367 ± 0.423
0.456CysTyr: 0.456 ± 0.419
0.0CysXaa: 0.0 ± 0.0
Asp
2.733AspAla: 2.733 ± 1.65
1.367AspCys: 1.367 ± 0.668
2.278AspAsp: 2.278 ± 1.303
2.733AspGlu: 2.733 ± 0.676
2.733AspPhe: 2.733 ± 0.488
5.467AspGly: 5.467 ± 1.516
0.0AspHis: 0.0 ± 0.0
3.189AspIle: 3.189 ± 0.965
3.189AspLys: 3.189 ± 1.517
5.467AspLeu: 5.467 ± 1.763
0.911AspMet: 0.911 ± 0.742
3.189AspAsn: 3.189 ± 1.671
5.467AspPro: 5.467 ± 1.732
2.278AspGln: 2.278 ± 0.482
1.822AspArg: 1.822 ± 0.943
3.645AspSer: 3.645 ± 0.998
5.011AspThr: 5.011 ± 1.433
2.733AspVal: 2.733 ± 1.073
0.911AspTrp: 0.911 ± 0.687
0.456AspTyr: 0.456 ± 0.371
0.0AspXaa: 0.0 ± 0.0
Glu
4.1GluAla: 4.1 ± 0.879
0.911GluCys: 0.911 ± 0.687
2.733GluAsp: 2.733 ± 0.831
5.011GluGlu: 5.011 ± 1.881
1.367GluPhe: 1.367 ± 0.655
4.1GluGly: 4.1 ± 1.783
1.822GluHis: 1.822 ± 0.608
0.911GluIle: 0.911 ± 0.821
4.556GluLys: 4.556 ± 2.04
2.733GluLeu: 2.733 ± 0.559
1.367GluMet: 1.367 ± 0.668
2.733GluAsn: 2.733 ± 0.982
1.367GluPro: 1.367 ± 0.812
2.733GluGln: 2.733 ± 0.843
3.189GluArg: 3.189 ± 0.879
3.189GluSer: 3.189 ± 0.823
2.733GluThr: 2.733 ± 1.189
3.645GluVal: 3.645 ± 0.975
0.0GluTrp: 0.0 ± 0.0
2.733GluTyr: 2.733 ± 0.846
0.0GluXaa: 0.0 ± 0.0
Phe
1.367PheAla: 1.367 ± 0.668
1.367PheCys: 1.367 ± 0.812
1.822PheAsp: 1.822 ± 0.842
1.367PheGlu: 1.367 ± 0.764
2.733PhePhe: 2.733 ± 1.0
2.278PheGly: 2.278 ± 0.731
0.911PheHis: 0.911 ± 0.829
0.911PheIle: 0.911 ± 0.417
4.1PheLys: 4.1 ± 1.289
3.645PheLeu: 3.645 ± 0.448
1.822PheMet: 1.822 ± 1.111
2.278PheAsn: 2.278 ± 1.31
2.733PhePro: 2.733 ± 0.795
0.0PheGln: 0.0 ± 0.0
1.367PheArg: 1.367 ± 0.718
2.733PheSer: 2.733 ± 1.146
3.189PheThr: 3.189 ± 1.159
2.278PheVal: 2.278 ± 0.368
1.367PheTrp: 1.367 ± 0.422
1.367PheTyr: 1.367 ± 0.764
0.0PheXaa: 0.0 ± 0.0
Gly
3.645GlyAla: 3.645 ± 1.386
0.456GlyCys: 0.456 ± 0.371
4.1GlyAsp: 4.1 ± 0.949
6.834GlyGlu: 6.834 ± 1.707
2.733GlyPhe: 2.733 ± 0.686
7.745GlyGly: 7.745 ± 4.004
1.822GlyHis: 1.822 ± 0.84
2.278GlyIle: 2.278 ± 0.832
2.278GlyLys: 2.278 ± 1.435
5.923GlyLeu: 5.923 ± 0.588
0.0GlyMet: 0.0 ± 0.0
2.733GlyAsn: 2.733 ± 1.336
2.733GlyPro: 2.733 ± 1.06
2.733GlyGln: 2.733 ± 1.425
4.1GlyArg: 4.1 ± 0.959
7.745GlySer: 7.745 ± 2.839
4.556GlyThr: 4.556 ± 1.78
7.289GlyVal: 7.289 ± 1.925
0.0GlyTrp: 0.0 ± 0.0
2.733GlyTyr: 2.733 ± 0.676
0.0GlyXaa: 0.0 ± 0.0
His
2.733HisAla: 2.733 ± 0.591
0.911HisCys: 0.911 ± 0.417
0.911HisAsp: 0.911 ± 0.46
0.911HisGlu: 0.911 ± 0.687
1.822HisPhe: 1.822 ± 0.946
1.367HisGly: 1.367 ± 0.779
0.456HisHis: 0.456 ± 0.41
0.911HisIle: 0.911 ± 0.417
1.367HisLys: 1.367 ± 0.812
2.278HisLeu: 2.278 ± 1.131
0.0HisMet: 0.0 ± 0.0
1.367HisAsn: 1.367 ± 0.851
2.733HisPro: 2.733 ± 0.896
0.911HisGln: 0.911 ± 0.839
0.456HisArg: 0.456 ± 0.41
0.911HisSer: 0.911 ± 0.425
0.911HisThr: 0.911 ± 0.497
0.911HisVal: 0.911 ± 0.461
0.456HisTrp: 0.456 ± 0.371
0.911HisTyr: 0.911 ± 0.473
0.0HisXaa: 0.0 ± 0.0
Ile
2.278IleAla: 2.278 ± 1.546
0.456IleCys: 0.456 ± 0.371
2.733IleAsp: 2.733 ± 0.925
3.189IleGlu: 3.189 ± 0.789
0.456IlePhe: 0.456 ± 0.371
2.733IleGly: 2.733 ± 1.949
0.0IleHis: 0.0 ± 0.0
0.456IleIle: 0.456 ± 0.41
1.822IleLys: 1.822 ± 1.07
2.733IleLeu: 2.733 ± 0.874
1.367IleMet: 1.367 ± 0.422
0.0IleAsn: 0.0 ± 0.0
3.645IlePro: 3.645 ± 1.888
2.278IleGln: 2.278 ± 1.303
1.822IleArg: 1.822 ± 1.366
4.1IleSer: 4.1 ± 1.064
1.367IleThr: 1.367 ± 0.378
1.822IleVal: 1.822 ± 0.64
0.456IleTrp: 0.456 ± 0.777
1.367IleTyr: 1.367 ± 0.448
0.0IleXaa: 0.0 ± 0.0
Lys
5.467LysAla: 5.467 ± 1.915
1.822LysCys: 1.822 ± 1.105
2.733LysAsp: 2.733 ± 0.996
3.645LysGlu: 3.645 ± 1.228
4.1LysPhe: 4.1 ± 1.079
3.645LysGly: 3.645 ± 1.435
2.278LysHis: 2.278 ± 1.253
0.911LysIle: 0.911 ± 0.829
4.1LysLys: 4.1 ± 2.053
3.189LysLeu: 3.189 ± 0.717
0.911LysMet: 0.911 ± 0.417
2.278LysAsn: 2.278 ± 0.804
1.367LysPro: 1.367 ± 0.812
1.822LysGln: 1.822 ± 0.224
5.467LysArg: 5.467 ± 0.746
2.733LysSer: 2.733 ± 1.336
2.733LysThr: 2.733 ± 0.922
4.1LysVal: 4.1 ± 1.533
1.367LysTrp: 1.367 ± 0.858
3.645LysTyr: 3.645 ± 1.309
0.0LysXaa: 0.0 ± 0.0
Leu
4.1LeuAla: 4.1 ± 1.528
2.278LeuCys: 2.278 ± 1.581
4.1LeuAsp: 4.1 ± 0.949
3.189LeuGlu: 3.189 ± 1.323
3.645LeuPhe: 3.645 ± 1.342
5.467LeuGly: 5.467 ± 1.811
2.278LeuHis: 2.278 ± 0.769
1.822LeuIle: 1.822 ± 1.56
4.556LeuLys: 4.556 ± 1.592
9.112LeuLeu: 9.112 ± 3.008
3.189LeuMet: 3.189 ± 1.18
3.645LeuAsn: 3.645 ± 1.385
5.011LeuPro: 5.011 ± 1.521
8.2LeuGln: 8.2 ± 2.37
1.822LeuArg: 1.822 ± 0.608
8.2LeuSer: 8.2 ± 0.629
3.189LeuThr: 3.189 ± 0.924
2.733LeuVal: 2.733 ± 0.925
1.367LeuTrp: 1.367 ± 0.718
3.189LeuTyr: 3.189 ± 0.775
0.0LeuXaa: 0.0 ± 0.0
Met
3.189MetAla: 3.189 ± 1.176
0.456MetCys: 0.456 ± 0.777
1.367MetAsp: 1.367 ± 0.739
0.911MetGlu: 0.911 ± 0.839
0.911MetPhe: 0.911 ± 0.473
0.456MetGly: 0.456 ± 0.344
0.911MetHis: 0.911 ± 0.46
1.367MetIle: 1.367 ± 0.423
0.456MetLys: 0.456 ± 0.344
1.367MetLeu: 1.367 ± 0.668
0.456MetMet: 0.456 ± 0.419
0.456MetAsn: 0.456 ± 0.344
0.456MetPro: 0.456 ± 0.41
2.278MetGln: 2.278 ± 1.028
0.911MetArg: 0.911 ± 0.687
1.367MetSer: 1.367 ± 0.762
0.456MetThr: 0.456 ± 0.371
2.733MetVal: 2.733 ± 0.795
0.0MetTrp: 0.0 ± 0.0
0.456MetTyr: 0.456 ± 0.419
0.0MetXaa: 0.0 ± 0.0
Asn
2.278AsnAla: 2.278 ± 0.845
1.367AsnCys: 1.367 ± 0.914
0.456AsnAsp: 0.456 ± 0.344
2.733AsnGlu: 2.733 ± 0.982
0.911AsnPhe: 0.911 ± 0.473
2.733AsnGly: 2.733 ± 0.846
0.456AsnHis: 0.456 ± 0.344
0.911AsnIle: 0.911 ± 0.497
3.189AsnLys: 3.189 ± 1.645
2.733AsnLeu: 2.733 ± 1.437
0.911AsnMet: 0.911 ± 0.417
2.278AsnAsn: 2.278 ± 0.999
3.645AsnPro: 3.645 ± 1.316
1.822AsnGln: 1.822 ± 0.648
2.278AsnArg: 2.278 ± 0.731
3.645AsnSer: 3.645 ± 1.198
2.733AsnThr: 2.733 ± 0.982
4.556AsnVal: 4.556 ± 1.079
0.911AsnTrp: 0.911 ± 0.417
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
3.189ProAla: 3.189 ± 1.639
0.456ProCys: 0.456 ± 0.419
5.467ProAsp: 5.467 ± 2.085
0.0ProGlu: 0.0 ± 0.0
3.645ProPhe: 3.645 ± 1.26
3.189ProGly: 3.189 ± 1.107
0.911ProHis: 0.911 ± 0.821
4.556ProIle: 4.556 ± 1.871
4.1ProLys: 4.1 ± 0.918
7.745ProLeu: 7.745 ± 1.744
0.456ProMet: 0.456 ± 0.41
2.733ProAsn: 2.733 ± 0.846
6.378ProPro: 6.378 ± 2.326
2.733ProGln: 2.733 ± 0.922
2.733ProArg: 2.733 ± 0.992
4.556ProSer: 4.556 ± 0.786
5.011ProThr: 5.011 ± 2.137
4.1ProVal: 4.1 ± 2.107
0.456ProTrp: 0.456 ± 0.419
1.822ProTyr: 1.822 ± 1.19
0.0ProXaa: 0.0 ± 0.0
Gln
1.822GlnAla: 1.822 ± 0.557
1.367GlnCys: 1.367 ± 1.514
2.278GlnAsp: 2.278 ± 0.804
1.822GlnGlu: 1.822 ± 0.732
0.911GlnPhe: 0.911 ± 0.473
2.278GlnGly: 2.278 ± 0.829
1.822GlnHis: 1.822 ± 0.557
1.367GlnIle: 1.367 ± 0.422
2.733GlnLys: 2.733 ± 0.996
5.923GlnLeu: 5.923 ± 1.618
0.911GlnMet: 0.911 ± 0.461
2.733GlnAsn: 2.733 ± 0.982
4.556GlnPro: 4.556 ± 0.767
10.023GlnGln: 10.023 ± 7.741
0.911GlnArg: 0.911 ± 0.839
1.367GlnSer: 1.367 ± 0.764
3.189GlnThr: 3.189 ± 1.057
2.278GlnVal: 2.278 ± 0.832
1.367GlnTrp: 1.367 ± 1.031
0.911GlnTyr: 0.911 ± 0.473
0.0GlnXaa: 0.0 ± 0.0
Arg
2.733ArgAla: 2.733 ± 1.398
1.367ArgCys: 1.367 ± 0.718
0.911ArgAsp: 0.911 ± 0.905
1.367ArgGlu: 1.367 ± 0.697
2.733ArgPhe: 2.733 ± 1.134
4.556ArgGly: 4.556 ± 1.88
2.733ArgHis: 2.733 ± 1.288
1.367ArgIle: 1.367 ± 0.833
4.1ArgLys: 4.1 ± 0.634
5.923ArgLeu: 5.923 ± 2.387
0.0ArgMet: 0.0 ± 0.356
1.822ArgAsn: 1.822 ± 0.956
3.645ArgPro: 3.645 ± 1.206
2.733ArgGln: 2.733 ± 0.528
4.556ArgArg: 4.556 ± 0.708
5.011ArgSer: 5.011 ± 1.563
1.822ArgThr: 1.822 ± 0.608
3.645ArgVal: 3.645 ± 0.773
0.456ArgTrp: 0.456 ± 0.344
0.911ArgTyr: 0.911 ± 0.46
0.0ArgXaa: 0.0 ± 0.0
Ser
6.378SerAla: 6.378 ± 1.824
0.456SerCys: 0.456 ± 0.371
6.834SerAsp: 6.834 ± 1.021
1.822SerGlu: 1.822 ± 1.207
2.733SerPhe: 2.733 ± 1.275
5.923SerGly: 5.923 ± 1.165
0.911SerHis: 0.911 ± 0.461
4.1SerIle: 4.1 ± 2.211
2.733SerLys: 2.733 ± 1.062
4.1SerLeu: 4.1 ± 1.998
1.822SerMet: 1.822 ± 0.224
2.733SerAsn: 2.733 ± 1.0
5.011SerPro: 5.011 ± 1.382
1.367SerGln: 1.367 ± 0.655
5.011SerArg: 5.011 ± 1.191
13.212SerSer: 13.212 ± 4.563
8.2SerThr: 8.2 ± 1.988
8.2SerVal: 8.2 ± 1.118
0.911SerTrp: 0.911 ± 0.461
0.911SerTyr: 0.911 ± 0.821
0.0SerXaa: 0.0 ± 0.0
Thr
3.189ThrAla: 3.189 ± 0.529
2.733ThrCys: 2.733 ± 0.591
4.556ThrAsp: 4.556 ± 1.522
1.367ThrGlu: 1.367 ± 0.762
2.733ThrPhe: 2.733 ± 0.922
5.923ThrGly: 5.923 ± 1.858
1.367ThrHis: 1.367 ± 0.764
2.733ThrIle: 2.733 ± 1.487
1.822ThrLys: 1.822 ± 1.054
2.733ThrLeu: 2.733 ± 0.781
1.367ThrMet: 1.367 ± 0.71
3.189ThrAsn: 3.189 ± 1.007
5.467ThrPro: 5.467 ± 1.974
2.733ThrGln: 2.733 ± 0.528
5.011ThrArg: 5.011 ± 1.545
3.189ThrSer: 3.189 ± 1.112
5.467ThrThr: 5.467 ± 1.791
6.834ThrVal: 6.834 ± 2.364
0.911ThrTrp: 0.911 ± 0.839
2.278ThrTyr: 2.278 ± 0.967
0.0ThrXaa: 0.0 ± 0.0
Val
2.733ValAla: 2.733 ± 0.843
2.278ValCys: 2.278 ± 1.466
3.645ValAsp: 3.645 ± 0.813
5.011ValGlu: 5.011 ± 1.175
2.278ValPhe: 2.278 ± 0.761
5.011ValGly: 5.011 ± 1.258
2.733ValHis: 2.733 ± 0.528
2.733ValIle: 2.733 ± 0.896
1.367ValLys: 1.367 ± 0.422
3.189ValLeu: 3.189 ± 1.029
0.911ValMet: 0.911 ± 0.776
2.278ValAsn: 2.278 ± 0.482
5.923ValPro: 5.923 ± 1.153
2.733ValGln: 2.733 ± 1.063
4.1ValArg: 4.1 ± 0.87
5.923ValSer: 5.923 ± 1.867
6.378ValThr: 6.378 ± 1.063
4.1ValVal: 4.1 ± 1.173
0.911ValTrp: 0.911 ± 0.774
5.011ValTyr: 5.011 ± 1.311
0.0ValXaa: 0.0 ± 0.0
Trp
1.822TrpAla: 1.822 ± 0.608
0.0TrpCys: 0.0 ± 0.0
0.911TrpAsp: 0.911 ± 0.461
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
1.367TrpGly: 1.367 ± 0.697
0.0TrpHis: 0.0 ± 0.0
1.822TrpIle: 1.822 ± 0.977
1.367TrpLys: 1.367 ± 0.914
1.367TrpLeu: 1.367 ± 0.791
0.0TrpMet: 0.0 ± 0.0
1.367TrpAsn: 1.367 ± 1.113
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.367TrpArg: 1.367 ± 1.514
0.456TrpSer: 0.456 ± 0.419
1.822TrpThr: 1.822 ± 1.204
0.456TrpVal: 0.456 ± 0.344
0.0TrpTrp: 0.0 ± 0.0
0.911TrpTyr: 0.911 ± 0.461
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.367TyrAla: 1.367 ± 0.668
0.456TyrCys: 0.456 ± 0.777
3.189TyrAsp: 3.189 ± 0.538
1.822TyrGlu: 1.822 ± 1.207
0.911TyrPhe: 0.911 ± 0.774
3.189TyrGly: 3.189 ± 1.467
0.911TyrHis: 0.911 ± 0.497
0.456TyrIle: 0.456 ± 0.344
3.189TyrLys: 3.189 ± 1.159
2.733TyrLeu: 2.733 ± 0.982
0.456TyrMet: 0.456 ± 0.344
1.367TyrAsn: 1.367 ± 0.933
0.911TyrPro: 0.911 ± 0.742
1.367TyrGln: 1.367 ± 0.655
2.278TyrArg: 2.278 ± 1.184
1.822TyrSer: 1.822 ± 0.92
0.456TyrThr: 0.456 ± 0.41
3.645TyrVal: 3.645 ± 0.448
1.822TyrTrp: 1.822 ± 0.732
2.733TyrTyr: 2.733 ± 1.066
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2196 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski