Amino acid dipepetide frequency for Betapapillomavirus 5

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.31AlaAla: 4.31 ± 0.878
1.176AlaCys: 1.176 ± 0.526
2.743AlaAsp: 2.743 ± 0.905
4.31AlaGlu: 4.31 ± 1.221
3.527AlaPhe: 3.527 ± 1.346
2.743AlaGly: 2.743 ± 1.595
0.784AlaHis: 0.784 ± 0.616
1.959AlaIle: 1.959 ± 0.612
1.959AlaLys: 1.959 ± 0.911
3.918AlaLeu: 3.918 ± 0.96
1.176AlaMet: 1.176 ± 0.343
1.959AlaAsn: 1.959 ± 0.824
2.743AlaPro: 2.743 ± 1.216
2.351AlaGln: 2.351 ± 0.741
5.094AlaArg: 5.094 ± 1.404
2.743AlaSer: 2.743 ± 0.644
3.527AlaThr: 3.527 ± 1.257
4.31AlaVal: 4.31 ± 0.958
0.784AlaTrp: 0.784 ± 0.39
1.176AlaTyr: 1.176 ± 0.596
0.0AlaXaa: 0.0 ± 0.0
Cys
1.176CysAla: 1.176 ± 0.526
1.567CysCys: 1.567 ± 1.254
0.392CysAsp: 0.392 ± 0.476
0.392CysGlu: 0.392 ± 0.476
1.567CysPhe: 1.567 ± 0.604
2.351CysGly: 2.351 ± 0.985
0.0CysHis: 0.0 ± 0.0
1.567CysIle: 1.567 ± 0.716
1.959CysLys: 1.959 ± 0.824
1.959CysLeu: 1.959 ± 1.179
0.392CysMet: 0.392 ± 0.308
0.392CysAsn: 0.392 ± 0.41
1.959CysPro: 1.959 ± 0.638
0.0CysGln: 0.0 ± 0.0
2.351CysArg: 2.351 ± 1.112
1.959CysSer: 1.959 ± 1.206
0.784CysThr: 0.784 ± 0.616
0.392CysVal: 0.392 ± 0.324
1.176CysTrp: 1.176 ± 0.526
1.176CysTyr: 1.176 ± 0.499
0.0CysXaa: 0.0 ± 0.0
Asp
3.527AspAla: 3.527 ± 0.755
1.176AspCys: 1.176 ± 0.924
2.743AspAsp: 2.743 ± 1.405
2.351AspGlu: 2.351 ± 0.693
1.176AspPhe: 1.176 ± 0.62
3.527AspGly: 3.527 ± 0.905
0.392AspHis: 0.392 ± 0.324
4.31AspIle: 4.31 ± 0.731
1.567AspLys: 1.567 ± 0.91
7.053AspLeu: 7.053 ± 1.851
0.784AspMet: 0.784 ± 0.381
3.918AspAsn: 3.918 ± 1.171
3.527AspPro: 3.527 ± 0.699
3.527AspGln: 3.527 ± 0.697
1.959AspArg: 1.959 ± 0.646
0.392AspSer: 0.392 ± 0.308
6.27AspThr: 6.27 ± 1.574
5.094AspVal: 5.094 ± 1.349
1.176AspTrp: 1.176 ± 0.466
1.959AspTyr: 1.959 ± 0.556
0.0AspXaa: 0.0 ± 0.0
Glu
4.702GluAla: 4.702 ± 1.063
0.784GluCys: 0.784 ± 0.616
3.918GluAsp: 3.918 ± 0.856
6.27GluGlu: 6.27 ± 0.942
1.567GluPhe: 1.567 ± 0.834
5.486GluGly: 5.486 ± 1.744
1.176GluHis: 1.176 ± 0.375
4.31GluIle: 4.31 ± 1.78
2.743GluLys: 2.743 ± 0.9
6.661GluLeu: 6.661 ± 2.101
0.784GluMet: 0.784 ± 0.616
3.135GluAsn: 3.135 ± 0.726
3.527GluPro: 3.527 ± 0.944
4.31GluGln: 4.31 ± 0.953
4.702GluArg: 4.702 ± 2.35
7.053GluSer: 7.053 ± 1.762
3.918GluThr: 3.918 ± 1.281
6.27GluVal: 6.27 ± 1.474
1.567GluTrp: 1.567 ± 0.517
2.351GluTyr: 2.351 ± 0.95
0.0GluXaa: 0.0 ± 0.0
Phe
1.567PheAla: 1.567 ± 0.604
1.176PheCys: 1.176 ± 0.526
2.351PheAsp: 2.351 ± 0.461
2.743PheGlu: 2.743 ± 1.069
1.176PhePhe: 1.176 ± 0.375
3.135PheGly: 3.135 ± 0.567
0.392PheHis: 0.392 ± 0.296
1.959PheIle: 1.959 ± 0.675
2.351PheLys: 2.351 ± 0.867
5.486PheLeu: 5.486 ± 1.466
0.392PheMet: 0.392 ± 0.308
3.135PheAsn: 3.135 ± 0.836
1.176PhePro: 1.176 ± 0.612
1.959PheGln: 1.959 ± 0.656
1.959PheArg: 1.959 ± 0.824
2.351PheSer: 2.351 ± 0.684
0.392PheThr: 0.392 ± 0.371
1.567PheVal: 1.567 ± 0.432
1.176PheTrp: 1.176 ± 0.637
2.351PheTyr: 2.351 ± 0.902
0.0PheXaa: 0.0 ± 0.0
Gly
3.527GlyAla: 3.527 ± 2.085
2.351GlyCys: 2.351 ± 0.951
4.702GlyAsp: 4.702 ± 1.6
5.878GlyGlu: 5.878 ± 1.137
1.959GlyPhe: 1.959 ± 0.446
7.445GlyGly: 7.445 ± 1.81
3.527GlyHis: 3.527 ± 0.993
2.743GlyIle: 2.743 ± 0.949
4.702GlyLys: 4.702 ± 1.996
3.527GlyLeu: 3.527 ± 1.154
0.392GlyMet: 0.392 ± 0.324
3.135GlyAsn: 3.135 ± 1.024
3.527GlyPro: 3.527 ± 1.353
2.743GlyGln: 2.743 ± 1.179
6.27GlyArg: 6.27 ± 2.868
4.31GlySer: 4.31 ± 0.898
4.702GlyThr: 4.702 ± 1.235
3.527GlyVal: 3.527 ± 0.592
0.0GlyTrp: 0.0 ± 0.0
1.959GlyTyr: 1.959 ± 0.711
0.0GlyXaa: 0.0 ± 0.0
His
0.784HisAla: 0.784 ± 0.499
1.176HisCys: 1.176 ± 0.731
0.392HisAsp: 0.392 ± 0.407
0.784HisGlu: 0.784 ± 0.381
1.176HisPhe: 1.176 ± 0.509
0.784HisGly: 0.784 ± 0.427
0.392HisHis: 0.392 ± 0.308
1.176HisIle: 1.176 ± 0.607
1.959HisLys: 1.959 ± 1.023
1.567HisLeu: 1.567 ± 0.444
0.392HisMet: 0.392 ± 0.324
1.567HisAsn: 1.567 ± 0.48
1.959HisPro: 1.959 ± 0.698
0.0HisGln: 0.0 ± 0.0
0.0HisArg: 0.0 ± 0.0
1.176HisSer: 1.176 ± 0.53
1.959HisThr: 1.959 ± 0.69
0.784HisVal: 0.784 ± 0.486
1.176HisTrp: 1.176 ± 0.343
1.959HisTyr: 1.959 ± 0.961
0.0HisXaa: 0.0 ± 0.0
Ile
2.743IleAla: 2.743 ± 0.75
1.176IleCys: 1.176 ± 0.499
2.743IleAsp: 2.743 ± 0.807
4.31IleGlu: 4.31 ± 1.407
0.784IlePhe: 0.784 ± 0.396
2.743IleGly: 2.743 ± 0.759
1.176IleHis: 1.176 ± 0.521
1.959IleIle: 1.959 ± 0.822
1.959IleLys: 1.959 ± 0.659
6.27IleLeu: 6.27 ± 0.687
1.176IleMet: 1.176 ± 0.745
2.351IleAsn: 2.351 ± 0.661
3.527IlePro: 3.527 ± 1.721
0.784IleGln: 0.784 ± 0.356
2.351IleArg: 2.351 ± 0.494
3.135IleSer: 3.135 ± 1.175
1.567IleThr: 1.567 ± 0.561
2.743IleVal: 2.743 ± 0.728
0.784IleTrp: 0.784 ± 0.501
3.527IleTyr: 3.527 ± 0.619
0.0IleXaa: 0.0 ± 0.0
Lys
3.135LysAla: 3.135 ± 1.019
0.784LysCys: 0.784 ± 0.39
1.176LysAsp: 1.176 ± 0.696
3.918LysGlu: 3.918 ± 1.282
3.135LysPhe: 3.135 ± 0.953
3.527LysGly: 3.527 ± 1.912
1.959LysHis: 1.959 ± 0.805
1.959LysIle: 1.959 ± 1.227
3.527LysLys: 3.527 ± 1.566
3.135LysLeu: 3.135 ± 1.076
0.784LysMet: 0.784 ± 0.397
1.567LysAsn: 1.567 ± 0.891
0.784LysPro: 0.784 ± 0.813
1.959LysGln: 1.959 ± 0.61
4.31LysArg: 4.31 ± 1.456
4.31LysSer: 4.31 ± 1.532
2.351LysThr: 2.351 ± 1.281
3.135LysVal: 3.135 ± 1.187
0.784LysTrp: 0.784 ± 0.427
2.743LysTyr: 2.743 ± 0.551
0.0LysXaa: 0.0 ± 0.0
Leu
3.135LeuAla: 3.135 ± 0.392
3.527LeuCys: 3.527 ± 1.477
6.661LeuAsp: 6.661 ± 2.214
8.229LeuGlu: 8.229 ± 1.927
3.527LeuPhe: 3.527 ± 0.69
5.486LeuGly: 5.486 ± 1.722
2.743LeuHis: 2.743 ± 1.009
3.527LeuIle: 3.527 ± 0.974
6.27LeuLys: 6.27 ± 1.141
10.972LeuLeu: 10.972 ± 3.215
1.176LeuMet: 1.176 ± 0.366
2.743LeuAsn: 2.743 ± 0.796
3.527LeuPro: 3.527 ± 1.146
7.837LeuGln: 7.837 ± 1.058
2.351LeuArg: 2.351 ± 0.992
7.053LeuSer: 7.053 ± 2.414
3.918LeuThr: 3.918 ± 1.064
3.527LeuVal: 3.527 ± 1.292
0.784LeuTrp: 0.784 ± 0.381
2.351LeuTyr: 2.351 ± 0.687
0.0LeuXaa: 0.0 ± 0.0
Met
1.959MetAla: 1.959 ± 0.696
0.0MetCys: 0.0 ± 0.0
1.176MetAsp: 1.176 ± 0.494
1.176MetGlu: 1.176 ± 0.343
1.176MetPhe: 1.176 ± 0.612
0.0MetGly: 0.0 ± 0.0
0.392MetHis: 0.392 ± 0.407
1.176MetIle: 1.176 ± 0.755
0.0MetLys: 0.0 ± 0.0
1.567MetLeu: 1.567 ± 0.78
0.0MetMet: 0.0 ± 0.0
0.784MetAsn: 0.784 ± 0.381
0.0MetPro: 0.0 ± 0.0
0.784MetGln: 0.784 ± 0.616
1.176MetArg: 1.176 ± 0.696
1.959MetSer: 1.959 ± 1.185
0.0MetThr: 0.0 ± 0.0
1.176MetVal: 1.176 ± 0.343
0.0MetTrp: 0.0 ± 0.0
0.784MetTyr: 0.784 ± 0.381
0.0MetXaa: 0.0 ± 0.0
Asn
2.351AsnAla: 2.351 ± 0.82
0.784AsnCys: 0.784 ± 0.39
2.351AsnAsp: 2.351 ± 0.898
2.743AsnGlu: 2.743 ± 0.965
2.351AsnPhe: 2.351 ± 0.86
3.135AsnGly: 3.135 ± 1.133
0.0AsnHis: 0.0 ± 0.0
1.567AsnIle: 1.567 ± 0.559
1.567AsnLys: 1.567 ± 0.48
3.527AsnLeu: 3.527 ± 1.26
0.392AsnMet: 0.392 ± 0.296
1.567AsnAsn: 1.567 ± 0.936
3.135AsnPro: 3.135 ± 1.397
1.567AsnGln: 1.567 ± 0.68
2.351AsnArg: 2.351 ± 1.202
3.527AsnSer: 3.527 ± 1.375
3.527AsnThr: 3.527 ± 1.709
3.135AsnVal: 3.135 ± 1.268
0.392AsnTrp: 0.392 ± 0.308
1.567AsnTyr: 1.567 ± 0.761
0.0AsnXaa: 0.0 ± 0.0
Pro
2.743ProAla: 2.743 ± 0.806
1.176ProCys: 1.176 ± 0.732
5.878ProAsp: 5.878 ± 2.145
5.486ProGlu: 5.486 ± 1.664
1.567ProPhe: 1.567 ± 0.651
1.567ProGly: 1.567 ± 0.965
0.392ProHis: 0.392 ± 0.407
1.567ProIle: 1.567 ± 0.664
3.135ProLys: 3.135 ± 1.012
5.094ProLeu: 5.094 ± 1.057
1.567ProMet: 1.567 ± 1.232
2.743ProAsn: 2.743 ± 0.745
7.053ProPro: 7.053 ± 1.796
1.567ProGln: 1.567 ± 0.782
3.135ProArg: 3.135 ± 0.952
5.878ProSer: 5.878 ± 2.081
5.486ProThr: 5.486 ± 2.187
3.527ProVal: 3.527 ± 1.312
0.392ProTrp: 0.392 ± 0.371
1.176ProTyr: 1.176 ± 0.516
0.0ProXaa: 0.0 ± 0.0
Gln
2.743GlnAla: 2.743 ± 0.778
1.176GlnCys: 1.176 ± 0.343
3.135GlnAsp: 3.135 ± 0.771
3.135GlnGlu: 3.135 ± 0.952
2.351GlnPhe: 2.351 ± 0.682
2.743GlnGly: 2.743 ± 0.888
0.784GlnHis: 0.784 ± 0.616
4.31GlnIle: 4.31 ± 0.521
1.176GlnLys: 1.176 ± 0.818
3.135GlnLeu: 3.135 ± 0.66
1.567GlnMet: 1.567 ± 0.543
1.567GlnAsn: 1.567 ± 0.68
3.135GlnPro: 3.135 ± 1.137
2.743GlnGln: 2.743 ± 1.471
3.135GlnArg: 3.135 ± 1.641
1.567GlnSer: 1.567 ± 0.604
1.176GlnThr: 1.176 ± 0.596
3.527GlnVal: 3.527 ± 0.924
0.784GlnTrp: 0.784 ± 0.39
1.176GlnTyr: 1.176 ± 0.661
0.0GlnXaa: 0.0 ± 0.0
Arg
4.702ArgAla: 4.702 ± 0.856
2.351ArgCys: 2.351 ± 0.972
3.135ArgAsp: 3.135 ± 0.92
4.31ArgGlu: 4.31 ± 0.527
3.918ArgPhe: 3.918 ± 1.618
7.837ArgGly: 7.837 ± 2.503
1.959ArgHis: 1.959 ± 0.59
1.176ArgIle: 1.176 ± 0.527
4.31ArgLys: 4.31 ± 0.636
6.661ArgLeu: 6.661 ± 1.45
0.392ArgMet: 0.392 ± 0.394
1.176ArgAsn: 1.176 ± 0.546
3.135ArgPro: 3.135 ± 0.795
2.351ArgGln: 2.351 ± 0.836
6.661ArgArg: 6.661 ± 2.642
7.053ArgSer: 7.053 ± 4.227
2.743ArgThr: 2.743 ± 1.144
3.527ArgVal: 3.527 ± 1.142
0.0ArgTrp: 0.0 ± 0.0
1.959ArgTyr: 1.959 ± 0.502
0.0ArgXaa: 0.0 ± 0.0
Ser
1.959SerAla: 1.959 ± 0.698
0.392SerCys: 0.392 ± 0.324
3.527SerAsp: 3.527 ± 1.034
4.31SerGlu: 4.31 ± 1.516
2.743SerPhe: 2.743 ± 0.904
6.27SerGly: 6.27 ± 1.752
0.784SerHis: 0.784 ± 0.628
2.743SerIle: 2.743 ± 0.389
2.743SerLys: 2.743 ± 1.365
6.27SerLeu: 6.27 ± 0.898
0.784SerMet: 0.784 ± 0.616
3.135SerAsn: 3.135 ± 1.705
6.27SerPro: 6.27 ± 1.862
3.918SerGln: 3.918 ± 1.568
9.796SerArg: 9.796 ± 3.699
5.094SerSer: 5.094 ± 1.715
6.661SerThr: 6.661 ± 1.94
3.918SerVal: 3.918 ± 1.027
0.784SerTrp: 0.784 ± 0.39
0.784SerTyr: 0.784 ± 0.413
0.0SerXaa: 0.0 ± 0.0
Thr
1.176ThrAla: 1.176 ± 0.727
1.176ThrCys: 1.176 ± 0.375
3.918ThrAsp: 3.918 ± 0.959
5.486ThrGlu: 5.486 ± 0.664
1.959ThrPhe: 1.959 ± 0.549
5.486ThrGly: 5.486 ± 1.311
0.392ThrHis: 0.392 ± 0.308
3.135ThrIle: 3.135 ± 1.444
1.959ThrLys: 1.959 ± 0.991
3.918ThrLeu: 3.918 ± 2.825
1.567ThrMet: 1.567 ± 0.891
2.351ThrAsn: 2.351 ± 0.848
5.486ThrPro: 5.486 ± 2.972
1.176ThrGln: 1.176 ± 0.887
4.702ThrArg: 4.702 ± 1.365
5.486ThrSer: 5.486 ± 1.689
4.31ThrThr: 4.31 ± 1.023
4.702ThrVal: 4.702 ± 1.448
0.392ThrTrp: 0.392 ± 0.371
1.176ThrTyr: 1.176 ± 0.661
0.0ThrXaa: 0.0 ± 0.0
Val
3.135ValAla: 3.135 ± 1.364
0.784ValCys: 0.784 ± 0.685
3.918ValAsp: 3.918 ± 1.01
5.878ValGlu: 5.878 ± 1.397
1.567ValPhe: 1.567 ± 0.559
4.31ValGly: 4.31 ± 0.935
1.959ValHis: 1.959 ± 0.592
3.135ValIle: 3.135 ± 1.007
1.567ValLys: 1.567 ± 0.561
3.527ValLeu: 3.527 ± 1.064
0.784ValMet: 0.784 ± 0.39
2.351ValAsn: 2.351 ± 1.322
5.486ValPro: 5.486 ± 1.027
2.743ValGln: 2.743 ± 0.597
4.702ValArg: 4.702 ± 0.98
5.094ValSer: 5.094 ± 1.141
4.702ValThr: 4.702 ± 1.389
3.527ValVal: 3.527 ± 1.361
0.392ValTrp: 0.392 ± 0.324
1.567ValTyr: 1.567 ± 0.761
0.0ValXaa: 0.0 ± 0.0
Trp
1.176TrpAla: 1.176 ± 0.445
0.0TrpCys: 0.0 ± 0.0
0.784TrpAsp: 0.784 ± 0.648
0.784TrpGlu: 0.784 ± 0.529
0.392TrpPhe: 0.392 ± 0.308
0.392TrpGly: 0.392 ± 0.324
0.0TrpHis: 0.0 ± 0.0
0.392TrpIle: 0.392 ± 0.308
1.567TrpLys: 1.567 ± 0.834
1.567TrpLeu: 1.567 ± 0.761
0.392TrpMet: 0.392 ± 0.308
0.392TrpAsn: 0.392 ± 0.308
0.0TrpPro: 0.0 ± 0.0
1.567TrpGln: 1.567 ± 0.385
0.0TrpArg: 0.0 ± 0.0
1.176TrpSer: 1.176 ± 0.696
0.784TrpThr: 0.784 ± 0.741
1.567TrpVal: 1.567 ± 0.867
0.0TrpTrp: 0.0 ± 0.0
0.392TrpTyr: 0.392 ± 0.308
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.743TyrAla: 2.743 ± 0.516
0.784TyrCys: 0.784 ± 0.499
0.784TyrAsp: 0.784 ± 0.39
2.743TyrGlu: 2.743 ± 0.916
1.176TyrPhe: 1.176 ± 0.623
1.959TyrGly: 1.959 ± 0.391
1.959TyrHis: 1.959 ± 0.446
3.135TyrIle: 3.135 ± 1.065
1.959TyrLys: 1.959 ± 0.911
3.527TyrLeu: 3.527 ± 0.851
0.0TyrMet: 0.0 ± 0.0
1.567TyrAsn: 1.567 ± 0.432
1.176TyrPro: 1.176 ± 0.375
1.176TyrGln: 1.176 ± 0.775
2.743TyrArg: 2.743 ± 0.522
1.567TyrSer: 1.567 ± 0.592
1.176TyrThr: 1.176 ± 0.615
1.176TyrVal: 1.176 ± 0.727
0.784TyrTrp: 0.784 ± 0.427
2.743TyrTyr: 2.743 ± 0.831
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2553 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski