Amino acid dipepetide frequency for Erinaceus europaeus papillomavirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.754AlaAla: 1.754 ± 0.993
2.63AlaCys: 2.63 ± 1.058
4.822AlaAsp: 4.822 ± 1.268
4.384AlaGlu: 4.384 ± 1.297
2.192AlaPhe: 2.192 ± 0.819
2.63AlaGly: 2.63 ± 0.647
0.438AlaHis: 0.438 ± 0.392
1.315AlaIle: 1.315 ± 0.646
5.261AlaLys: 5.261 ± 1.384
6.576AlaLeu: 6.576 ± 1.352
1.754AlaMet: 1.754 ± 1.052
2.63AlaAsn: 2.63 ± 0.647
2.192AlaPro: 2.192 ± 0.758
3.946AlaGln: 3.946 ± 1.192
3.507AlaArg: 3.507 ± 0.752
3.946AlaSer: 3.946 ± 1.257
6.138AlaThr: 6.138 ± 1.607
2.192AlaVal: 2.192 ± 1.169
0.877AlaTrp: 0.877 ± 0.686
2.192AlaTyr: 2.192 ± 0.759
0.0AlaXaa: 0.0 ± 0.0
Cys
2.192CysAla: 2.192 ± 0.644
1.315CysCys: 1.315 ± 1.038
0.877CysAsp: 0.877 ± 0.443
0.877CysGlu: 0.877 ± 0.628
0.438CysPhe: 0.438 ± 0.431
1.315CysGly: 1.315 ± 1.05
0.0CysHis: 0.0 ± 0.0
0.877CysIle: 0.877 ± 0.562
2.63CysLys: 2.63 ± 1.142
2.63CysLeu: 2.63 ± 1.26
1.315CysMet: 1.315 ± 1.038
0.438CysAsn: 0.438 ± 0.343
1.754CysPro: 1.754 ± 0.649
0.0CysGln: 0.0 ± 0.0
2.63CysArg: 2.63 ± 1.317
0.877CysSer: 0.877 ± 1.111
0.877CysThr: 0.877 ± 0.466
0.0CysVal: 0.0 ± 0.0
1.754CysTrp: 1.754 ± 0.554
0.438CysTyr: 0.438 ± 0.53
0.0CysXaa: 0.0 ± 0.0
Asp
6.576AspAla: 6.576 ± 0.966
1.754AspCys: 1.754 ± 0.563
5.261AspAsp: 5.261 ± 2.057
2.63AspGlu: 2.63 ± 1.293
3.946AspPhe: 3.946 ± 1.203
5.261AspGly: 5.261 ± 1.483
0.877AspHis: 0.877 ± 0.628
3.946AspIle: 3.946 ± 1.377
3.069AspLys: 3.069 ± 0.82
4.822AspLeu: 4.822 ± 1.194
0.0AspMet: 0.0 ± 0.0
2.63AspAsn: 2.63 ± 1.018
4.384AspPro: 4.384 ± 1.103
3.069AspGln: 3.069 ± 1.501
2.63AspArg: 2.63 ± 1.042
6.138AspSer: 6.138 ± 1.061
3.507AspThr: 3.507 ± 1.3
1.315AspVal: 1.315 ± 0.704
1.315AspTrp: 1.315 ± 0.696
1.315AspTyr: 1.315 ± 0.79
0.0AspXaa: 0.0 ± 0.0
Glu
4.384GluAla: 4.384 ± 1.302
0.877GluCys: 0.877 ± 0.562
6.138GluAsp: 6.138 ± 1.517
5.699GluGlu: 5.699 ± 0.955
1.315GluPhe: 1.315 ± 0.932
4.384GluGly: 4.384 ± 0.864
0.438GluHis: 0.438 ± 0.392
3.507GluIle: 3.507 ± 1.552
2.63GluLys: 2.63 ± 1.215
4.384GluLeu: 4.384 ± 1.354
1.315GluMet: 1.315 ± 0.696
3.069GluAsn: 3.069 ± 1.167
1.315GluPro: 1.315 ± 0.642
3.507GluGln: 3.507 ± 0.686
2.192GluArg: 2.192 ± 0.758
3.507GluSer: 3.507 ± 1.113
2.63GluThr: 2.63 ± 1.263
6.138GluVal: 6.138 ± 0.885
0.877GluTrp: 0.877 ± 0.443
1.315GluTyr: 1.315 ± 0.745
0.0GluXaa: 0.0 ± 0.0
Phe
3.507PheAla: 3.507 ± 0.523
0.877PheCys: 0.877 ± 0.562
3.946PheAsp: 3.946 ± 1.257
1.754PheGlu: 1.754 ± 0.786
1.754PhePhe: 1.754 ± 0.646
1.754PheGly: 1.754 ± 0.668
0.877PheHis: 0.877 ± 0.642
3.946PheIle: 3.946 ± 0.74
2.63PheLys: 2.63 ± 1.207
4.384PheLeu: 4.384 ± 1.016
0.877PheMet: 0.877 ± 0.626
1.754PheAsn: 1.754 ± 1.129
2.192PhePro: 2.192 ± 1.107
1.754PheGln: 1.754 ± 0.668
2.192PheArg: 2.192 ± 0.874
3.069PheSer: 3.069 ± 0.571
1.754PheThr: 1.754 ± 0.802
2.192PheVal: 2.192 ± 0.454
1.315PheTrp: 1.315 ± 0.763
1.754PheTyr: 1.754 ± 0.836
0.0PheXaa: 0.0 ± 0.0
Gly
2.63GlyAla: 2.63 ± 1.536
0.877GlyCys: 0.877 ± 0.606
3.946GlyAsp: 3.946 ± 0.871
4.384GlyGlu: 4.384 ± 1.239
3.946GlyPhe: 3.946 ± 0.63
4.822GlyGly: 4.822 ± 2.516
3.069GlyHis: 3.069 ± 0.784
4.384GlyIle: 4.384 ± 1.156
3.946GlyLys: 3.946 ± 1.402
5.261GlyLeu: 5.261 ± 1.524
0.0GlyMet: 0.0 ± 0.0
5.261GlyAsn: 5.261 ± 1.344
3.507GlyPro: 3.507 ± 1.353
2.192GlyGln: 2.192 ± 0.668
4.384GlyArg: 4.384 ± 2.52
4.384GlySer: 4.384 ± 1.336
4.384GlyThr: 4.384 ± 1.427
6.576GlyVal: 6.576 ± 1.443
0.877GlyTrp: 0.877 ± 0.562
0.877GlyTyr: 0.877 ± 0.758
0.0GlyXaa: 0.0 ± 0.0
His
0.877HisAla: 0.877 ± 0.466
1.754HisCys: 1.754 ± 1.238
0.438HisAsp: 0.438 ± 0.343
0.0HisGlu: 0.0 ± 0.0
1.315HisPhe: 1.315 ± 0.37
0.438HisGly: 0.438 ± 0.343
0.0HisHis: 0.0 ± 0.0
1.754HisIle: 1.754 ± 0.878
0.877HisLys: 0.877 ± 0.686
0.877HisLeu: 0.877 ± 0.721
0.0HisMet: 0.0 ± 0.0
0.877HisAsn: 0.877 ± 0.439
3.069HisPro: 3.069 ± 1.778
0.0HisGln: 0.0 ± 0.0
1.754HisArg: 1.754 ± 1.435
0.877HisSer: 0.877 ± 0.758
0.877HisThr: 0.877 ± 0.863
1.315HisVal: 1.315 ± 0.652
0.877HisTrp: 0.877 ± 0.466
0.877HisTyr: 0.877 ± 0.466
0.0HisXaa: 0.0 ± 0.0
Ile
3.069IleAla: 3.069 ± 0.512
0.877IleCys: 0.877 ± 0.606
2.63IleAsp: 2.63 ± 1.293
3.507IleGlu: 3.507 ± 0.687
0.438IlePhe: 0.438 ± 0.392
3.946IleGly: 3.946 ± 1.742
1.315IleHis: 1.315 ± 0.704
2.63IleIle: 2.63 ± 0.894
0.877IleLys: 0.877 ± 0.404
4.384IleLeu: 4.384 ± 0.994
0.438IleMet: 0.438 ± 0.304
1.315IleAsn: 1.315 ± 0.721
1.754IlePro: 1.754 ± 0.622
2.192IleGln: 2.192 ± 0.896
3.507IleArg: 3.507 ± 0.991
3.946IleSer: 3.946 ± 1.61
2.192IleThr: 2.192 ± 0.668
3.069IleVal: 3.069 ± 1.323
0.0IleTrp: 0.0 ± 0.0
2.192IleTyr: 2.192 ± 0.812
0.0IleXaa: 0.0 ± 0.0
Lys
3.946LysAla: 3.946 ± 0.609
0.877LysCys: 0.877 ± 0.686
3.507LysAsp: 3.507 ± 1.267
2.63LysGlu: 2.63 ± 1.073
3.069LysPhe: 3.069 ± 0.784
3.069LysGly: 3.069 ± 1.223
0.877LysHis: 0.877 ± 0.686
2.192LysIle: 2.192 ± 0.883
3.946LysLys: 3.946 ± 0.794
3.946LysLeu: 3.946 ± 1.008
1.754LysMet: 1.754 ± 0.926
2.63LysAsn: 2.63 ± 0.929
0.877LysPro: 0.877 ± 0.466
2.192LysGln: 2.192 ± 0.745
4.822LysArg: 4.822 ± 1.243
3.946LysSer: 3.946 ± 1.265
3.507LysThr: 3.507 ± 1.106
5.699LysVal: 5.699 ± 1.718
0.0LysTrp: 0.0 ± 0.0
3.946LysTyr: 3.946 ± 1.441
0.0LysXaa: 0.0 ± 0.0
Leu
3.946LeuAla: 3.946 ± 1.236
1.754LeuCys: 1.754 ± 1.575
4.384LeuAsp: 4.384 ± 0.795
3.946LeuGlu: 3.946 ± 1.246
5.261LeuPhe: 5.261 ± 1.563
8.33LeuGly: 8.33 ± 1.762
1.315LeuHis: 1.315 ± 0.79
3.507LeuIle: 3.507 ± 1.842
6.576LeuLys: 6.576 ± 1.573
6.576LeuLeu: 6.576 ± 1.528
0.877LeuMet: 0.877 ± 0.718
1.754LeuAsn: 1.754 ± 0.863
2.192LeuPro: 2.192 ± 1.002
7.014LeuGln: 7.014 ± 2.222
6.576LeuArg: 6.576 ± 1.413
7.014LeuSer: 7.014 ± 2.198
4.384LeuThr: 4.384 ± 1.001
7.014LeuVal: 7.014 ± 1.56
1.315LeuTrp: 1.315 ± 0.417
5.261LeuTyr: 5.261 ± 1.821
0.0LeuXaa: 0.0 ± 0.0
Met
1.754MetAla: 1.754 ± 0.863
0.438MetCys: 0.438 ± 0.392
1.315MetAsp: 1.315 ± 0.745
0.877MetGlu: 0.877 ± 0.863
0.877MetPhe: 0.877 ± 0.404
0.438MetGly: 0.438 ± 0.392
0.438MetHis: 0.438 ± 0.431
0.438MetIle: 0.438 ± 0.555
0.877MetLys: 0.877 ± 0.562
0.877MetLeu: 0.877 ± 0.721
0.0MetMet: 0.0 ± 0.0
0.438MetAsn: 0.438 ± 0.392
0.438MetPro: 0.438 ± 0.392
0.877MetGln: 0.877 ± 0.657
0.877MetArg: 0.877 ± 0.562
1.754MetSer: 1.754 ± 0.995
1.754MetThr: 1.754 ± 0.724
1.754MetVal: 1.754 ± 1.045
0.877MetTrp: 0.877 ± 0.466
0.438MetTyr: 0.438 ± 0.392
0.0MetXaa: 0.0 ± 0.0
Asn
3.507AsnAla: 3.507 ± 1.541
1.315AsnCys: 1.315 ± 0.696
2.63AsnAsp: 2.63 ± 0.603
3.069AsnGlu: 3.069 ± 0.888
0.877AsnPhe: 0.877 ± 0.783
1.315AsnGly: 1.315 ± 0.37
0.0AsnHis: 0.0 ± 0.0
0.438AsnIle: 0.438 ± 0.632
2.63AsnLys: 2.63 ± 1.378
1.754AsnLeu: 1.754 ± 0.836
0.877AsnMet: 0.877 ± 0.783
1.315AsnAsn: 1.315 ± 1.175
3.507AsnPro: 3.507 ± 1.079
1.754AsnGln: 1.754 ± 0.646
3.507AsnArg: 3.507 ± 1.089
2.192AsnSer: 2.192 ± 0.745
4.384AsnThr: 4.384 ± 1.326
2.63AsnVal: 2.63 ± 1.018
0.0AsnTrp: 0.0 ± 0.0
0.438AsnTyr: 0.438 ± 0.343
0.0AsnXaa: 0.0 ± 0.0
Pro
3.946ProAla: 3.946 ± 0.871
0.877ProCys: 0.877 ± 0.628
5.261ProAsp: 5.261 ± 2.687
2.192ProGlu: 2.192 ± 1.022
2.192ProPhe: 2.192 ± 0.644
2.63ProGly: 2.63 ± 0.808
1.315ProHis: 1.315 ± 0.808
0.877ProIle: 0.877 ± 0.439
3.946ProLys: 3.946 ± 0.778
5.261ProLeu: 5.261 ± 1.451
0.877ProMet: 0.877 ± 0.758
3.069ProAsn: 3.069 ± 1.141
6.138ProPro: 6.138 ± 1.962
1.754ProGln: 1.754 ± 0.661
2.63ProArg: 2.63 ± 1.334
4.384ProSer: 4.384 ± 1.438
1.754ProThr: 1.754 ± 0.745
4.384ProVal: 4.384 ± 1.142
0.0ProTrp: 0.0 ± 0.0
0.877ProTyr: 0.877 ± 0.783
0.0ProXaa: 0.0 ± 0.0
Gln
3.069GlnAla: 3.069 ± 0.584
0.0GlnCys: 0.0 ± 0.0
1.754GlnAsp: 1.754 ± 1.517
3.507GlnGlu: 3.507 ± 1.523
1.315GlnPhe: 1.315 ± 0.417
2.192GlnGly: 2.192 ± 1.138
0.438GlnHis: 0.438 ± 0.379
1.754GlnIle: 1.754 ± 0.96
2.63GlnLys: 2.63 ± 1.176
7.014GlnLeu: 7.014 ± 1.851
2.192GlnMet: 2.192 ± 0.43
1.315GlnAsn: 1.315 ± 0.74
1.315GlnPro: 1.315 ± 0.417
1.754GlnGln: 1.754 ± 0.704
1.754GlnArg: 1.754 ± 1.077
1.315GlnSer: 1.315 ± 0.696
3.507GlnThr: 3.507 ± 1.135
2.63GlnVal: 2.63 ± 0.711
1.754GlnTrp: 1.754 ± 0.872
1.754GlnTyr: 1.754 ± 1.1
0.0GlnXaa: 0.0 ± 0.0
Arg
3.069ArgAla: 3.069 ± 0.976
2.192ArgCys: 2.192 ± 0.812
2.192ArgAsp: 2.192 ± 1.373
3.507ArgGlu: 3.507 ± 1.095
3.069ArgPhe: 3.069 ± 0.691
4.822ArgGly: 4.822 ± 1.297
2.192ArgHis: 2.192 ± 1.247
0.877ArgIle: 0.877 ± 0.404
5.261ArgLys: 5.261 ± 0.612
7.014ArgLeu: 7.014 ± 1.966
0.438ArgMet: 0.438 ± 0.465
1.315ArgAsn: 1.315 ± 0.745
3.507ArgPro: 3.507 ± 1.203
1.315ArgGln: 1.315 ± 0.417
3.946ArgArg: 3.946 ± 1.567
5.261ArgSer: 5.261 ± 1.971
4.384ArgThr: 4.384 ± 1.502
4.384ArgVal: 4.384 ± 1.672
0.0ArgTrp: 0.0 ± 0.0
1.315ArgTyr: 1.315 ± 0.696
0.0ArgXaa: 0.0 ± 0.0
Ser
5.261SerAla: 5.261 ± 1.362
1.315SerCys: 1.315 ± 0.646
3.507SerAsp: 3.507 ± 1.105
3.507SerGlu: 3.507 ± 1.359
3.507SerPhe: 3.507 ± 1.022
6.138SerGly: 6.138 ± 1.502
2.63SerHis: 2.63 ± 1.019
3.069SerIle: 3.069 ± 0.889
2.192SerLys: 2.192 ± 0.875
5.699SerLeu: 5.699 ± 0.847
2.192SerMet: 2.192 ± 1.004
3.507SerAsn: 3.507 ± 0.681
4.822SerPro: 4.822 ± 1.735
3.507SerGln: 3.507 ± 1.027
3.946SerArg: 3.946 ± 1.382
7.453SerSer: 7.453 ± 2.09
7.014SerThr: 7.014 ± 1.455
5.261SerVal: 5.261 ± 1.312
0.877SerTrp: 0.877 ± 0.466
0.438SerTyr: 0.438 ± 0.343
0.0SerXaa: 0.0 ± 0.0
Thr
3.507ThrAla: 3.507 ± 0.863
1.315ThrCys: 1.315 ± 0.658
4.384ThrAsp: 4.384 ± 1.396
4.822ThrGlu: 4.822 ± 1.43
2.192ThrPhe: 2.192 ± 0.793
6.138ThrGly: 6.138 ± 0.876
0.438ThrHis: 0.438 ± 0.431
4.384ThrIle: 4.384 ± 1.701
2.192ThrLys: 2.192 ± 0.668
3.946ThrLeu: 3.946 ± 1.382
1.315ThrMet: 1.315 ± 0.696
1.754ThrAsn: 1.754 ± 0.886
5.699ThrPro: 5.699 ± 1.647
2.192ThrGln: 2.192 ± 0.454
3.069ThrArg: 3.069 ± 0.888
5.699ThrSer: 5.699 ± 1.512
6.138ThrThr: 6.138 ± 1.619
3.946ThrVal: 3.946 ± 1.364
0.877ThrTrp: 0.877 ± 0.863
1.754ThrTyr: 1.754 ± 0.995
0.0ThrXaa: 0.0 ± 0.0
Val
2.192ValAla: 2.192 ± 0.572
1.315ValCys: 1.315 ± 1.666
6.138ValAsp: 6.138 ± 1.028
4.822ValGlu: 4.822 ± 1.364
3.946ValPhe: 3.946 ± 1.332
4.822ValGly: 4.822 ± 1.781
1.315ValHis: 1.315 ± 0.44
2.192ValIle: 2.192 ± 1.022
3.507ValLys: 3.507 ± 0.752
7.453ValLeu: 7.453 ± 1.6
0.438ValMet: 0.438 ± 0.431
0.877ValAsn: 0.877 ± 0.783
3.507ValPro: 3.507 ± 0.768
1.754ValGln: 1.754 ± 0.831
3.946ValArg: 3.946 ± 1.224
7.453ValSer: 7.453 ± 0.94
3.507ValThr: 3.507 ± 0.759
3.069ValVal: 3.069 ± 0.77
0.438ValTrp: 0.438 ± 0.392
2.63ValTyr: 2.63 ± 0.592
0.0ValXaa: 0.0 ± 0.0
Trp
1.315TrpAla: 1.315 ± 0.417
0.438TrpCys: 0.438 ± 0.431
0.438TrpAsp: 0.438 ± 0.392
0.877TrpGlu: 0.877 ± 0.886
0.877TrpPhe: 0.877 ± 0.466
1.315TrpGly: 1.315 ± 0.808
0.0TrpHis: 0.0 ± 0.0
0.877TrpIle: 0.877 ± 0.686
0.877TrpLys: 0.877 ± 0.562
2.192TrpLeu: 2.192 ± 1.107
0.0TrpMet: 0.0 ± 0.0
0.877TrpAsn: 0.877 ± 0.783
0.0TrpPro: 0.0 ± 0.0
0.877TrpGln: 0.877 ± 0.466
0.438TrpArg: 0.438 ± 0.431
0.877TrpSer: 0.877 ± 0.466
1.315TrpThr: 1.315 ± 0.696
0.877TrpVal: 0.877 ± 0.686
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.877TyrAla: 0.877 ± 0.443
0.438TyrCys: 0.438 ± 0.343
0.877TyrAsp: 0.877 ± 0.404
2.63TyrGlu: 2.63 ± 1.005
1.754TyrPhe: 1.754 ± 1.065
3.946TyrGly: 3.946 ± 0.89
0.877TyrHis: 0.877 ± 0.439
1.315TyrIle: 1.315 ± 0.37
0.877TyrLys: 0.877 ± 0.404
4.384TyrLeu: 4.384 ± 1.511
0.438TyrMet: 0.438 ± 0.53
1.315TyrAsn: 1.315 ± 0.44
2.192TyrPro: 2.192 ± 1.18
1.315TyrGln: 1.315 ± 0.417
1.754TyrArg: 1.754 ± 0.849
1.754TyrSer: 1.754 ± 1.093
1.754TyrThr: 1.754 ± 0.215
0.877TyrVal: 0.877 ± 0.466
0.438TyrTrp: 0.438 ± 0.343
3.507TyrTyr: 3.507 ± 1.892
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2282 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski