Amino acid dipeptide frequency for Boa constrictor papillomavirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
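The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the page does not say how Proteome-pI counts dipeptides, so the overlapping-window count within each protein, the normalisation by the total number of dipeptides, and the function names `dipeptide_counts` and `dipeptide_per_mille` are all hypothetical. The input sequences are toy data, not the Boa constrictor papillomavirus 1 proteome.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids (one-letter code); any other symbol is mapped to "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_counts(proteins):
    """Count overlapping dipeptides inside each protein, never across protein boundaries."""
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    return counts


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input (hypothetical sequences chosen only to keep the arithmetic easy to follow).
toy = ["MALLKA", "LLAB"]
freqs = dipeptide_per_mille(toy)

# Expected frequency if all 400 standard dipeptides were equally likely.
uniform_400 = 1000.0 / 400

print(len(freqs), freqs["LL"], uniform_400)
```

In the toy data "LL" occurs in 2 of 8 dipeptides, far above the uniform expectation of 2.5‰; this is the kind of over-representation the table highlights in red.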

All values are presented in per mille (‰); multiply them by 10⁻³ to obtain fractions.
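As a small worked example of the unit conversion, the sketch below turns the LeuLeu value from the table (9.944‰) into a fraction and a percentage, and compares it with the uniform expectation for 400 dipeptides. The helper names are hypothetical and exist only for this illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1%)."""
    return value * 0.1


def fold_over_uniform(value, n_dipeptides=400):
    """Ratio of an observed per mille value to the uniform expectation."""
    return value / (1000.0 / n_dipeptides)


leu_leu = 9.944  # LeuLeu frequency in per mille, taken from the table

print(round(per_mille_to_fraction(leu_leu), 6))  # fraction of all dipeptides
print(round(per_mille_to_percent(leu_leu), 4))   # same value in percent
print(round(fold_over_uniform(leu_leu), 2))      # times the uniform 2.5 per mille
```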

For more information, see the sequence space article on Wikipedia.

Rows: first residue; columns: second residue; cells: frequency ± error (‰)

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 5.188 ± 0.724 | 0.0 ± 0.0 | 2.594 ± 0.63 | 6.053 ± 0.885 | 3.891 ± 0.646 | 2.162 ± 0.784 | 0.432 ± 0.5 | 3.026 ± 0.888 | 3.459 ± 1.301 | 4.756 ± 1.764 | 0.865 ± 0.677 | 0.0 ± 0.0 | 3.891 ± 1.701 | 3.891 ± 1.296 | 2.594 ± 0.509 | 5.62 ± 1.02 | 2.162 ± 0.508 | 2.594 ± 1.581 | 0.432 ± 0.463 | 3.459 ± 0.983 | 0.0 ± 0.0
Cys | 0.865 ± 0.535 | 1.297 ± 1.323 | 0.865 ± 0.66 | 0.865 ± 0.882 | 0.865 ± 0.634 | 0.865 ± 0.7 | 0.432 ± 0.527 | 1.297 ± 0.99 | 2.162 ± 0.618 | 2.162 ± 1.017 | 0.432 ± 0.33 | 1.729 ± 0.834 | 1.729 ± 0.604 | 0.432 ± 0.441 | 2.162 ± 1.312 | 2.162 ± 1.221 | 1.729 ± 0.95 | 0.432 ± 0.362 | 0.432 ± 0.362 | 0.432 ± 0.441 | 0.0 ± 0.0
Asp | 1.729 ± 0.704 | 3.026 ± 1.049 | 3.891 ± 0.781 | 6.053 ± 1.306 | 2.162 ± 0.79 | 3.026 ± 1.023 | 0.0 ± 0.0 | 4.756 ± 1.532 | 1.729 ± 0.722 | 3.026 ± 1.494 | 0.432 ± 0.362 | 2.162 ± 0.808 | 3.891 ± 1.33 | 1.729 ± 0.607 | 1.297 ± 0.455 | 4.756 ± 1.356 | 4.323 ± 1.272 | 5.62 ± 0.996 | 0.865 ± 0.421 | 1.729 ± 0.686 | 0.0 ± 0.0
Glu | 5.188 ± 1.684 | 0.432 ± 0.33 | 5.62 ± 1.484 | 6.917 ± 2.328 | 3.026 ± 1.007 | 3.891 ± 1.386 | 1.297 ± 0.455 | 6.053 ± 1.189 | 1.297 ± 1.018 | 5.62 ± 0.915 | 1.729 ± 1.274 | 3.459 ± 1.163 | 4.323 ± 1.135 | 3.459 ± 1.395 | 4.323 ± 1.742 | 3.459 ± 1.111 | 3.459 ± 1.027 | 4.323 ± 1.613 | 0.865 ± 0.66 | 2.162 ± 0.653 | 0.0 ± 0.0
Phe | 2.162 ± 1.063 | 2.162 ± 1.125 | 1.729 ± 0.464 | 2.594 ± 1.202 | 2.162 ± 0.745 | 1.729 ± 0.675 | 0.0 ± 0.0 | 2.162 ± 0.956 | 3.459 ± 1.388 | 3.026 ± 1.243 | 0.865 ± 0.558 | 1.729 ± 1.05 | 0.865 ± 0.419 | 1.729 ± 0.496 | 2.162 ± 0.749 | 1.729 ± 0.961 | 3.459 ± 0.888 | 2.162 ± 1.085 | 2.162 ± 0.745 | 2.594 ± 0.951 | 0.0 ± 0.0
Gly | 4.323 ± 1.349 | 1.297 ± 0.728 | 2.594 ± 0.953 | 3.459 ± 1.31 | 1.297 ± 0.748 | 6.485 ± 3.073 | 0.865 ± 0.53 | 3.459 ± 0.639 | 2.594 ± 0.543 | 2.594 ± 0.882 | 0.432 ± 0.478 | 3.459 ± 0.864 | 3.026 ± 1.016 | 3.026 ± 1.17 | 4.756 ± 1.502 | 5.62 ± 1.819 | 6.917 ± 2.38 | 3.459 ± 0.886 | 0.0 ± 0.0 | 0.432 ± 0.399 | 0.0 ± 0.0
His | 0.865 ± 0.421 | 0.0 ± 0.0 | 0.432 ± 0.399 | 0.432 ± 0.527 | 0.432 ± 0.33 | 0.865 ± 0.736 | 0.0 ± 0.0 | 2.594 ± 0.899 | 0.865 ± 0.66 | 1.297 ± 0.612 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.729 ± 0.755 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.729 ± 0.532 | 0.432 ± 0.463 | 2.162 ± 0.365 | 1.729 ± 0.577 | 2.162 ± 0.508 | 0.0 ± 0.0
Ile | 1.729 ± 0.959 | 1.729 ± 0.904 | 4.756 ± 2.128 | 4.756 ± 2.004 | 2.162 ± 1.062 | 4.323 ± 1.781 | 0.865 ± 0.553 | 3.891 ± 1.258 | 6.485 ± 1.787 | 7.35 ± 2.097 | 1.297 ± 1.045 | 3.026 ± 1.258 | 6.485 ± 1.479 | 3.891 ± 1.443 | 2.162 ± 1.16 | 3.891 ± 1.038 | 2.162 ± 0.735 | 1.729 ± 0.662 | 0.0 ± 0.0 | 3.026 ± 0.803 | 0.0 ± 0.0
Lys | 3.459 ± 0.687 | 2.594 ± 1.025 | 2.594 ± 1.13 | 3.026 ± 1.561 | 2.162 ± 1.114 | 3.026 ± 1.605 | 1.297 ± 0.669 | 2.594 ± 0.7 | 2.162 ± 1.061 | 1.729 ± 0.737 | 0.865 ± 0.66 | 1.729 ± 1.274 | 2.594 ± 0.634 | 1.729 ± 0.686 | 6.485 ± 0.673 | 3.891 ± 1.383 | 3.891 ± 0.935 | 1.729 ± 0.604 | 0.432 ± 0.441 | 3.459 ± 0.74 | 0.0 ± 0.0
Leu | 4.756 ± 1.372 | 2.162 ± 1.437 | 6.485 ± 1.489 | 6.485 ± 1.033 | 3.891 ± 1.356 | 5.62 ± 1.935 | 1.297 ± 0.408 | 3.026 ± 1.305 | 4.756 ± 2.232 | 9.944 ± 2.848 | 0.0 ± 0.0 | 5.62 ± 1.434 | 1.729 ± 0.57 | 4.756 ± 1.319 | 5.188 ± 1.984 | 6.053 ± 0.783 | 5.62 ± 1.31 | 2.594 ± 1.066 | 0.865 ± 0.53 | 3.026 ± 0.99 | 0.0 ± 0.0
Met | 2.162 ± 0.827 | 0.0 ± 0.0 | 0.865 ± 0.419 | 2.162 ± 1.181 | 1.729 ± 0.577 | 0.0 ± 0.0 | 0.432 ± 0.33 | 0.432 ± 0.527 | 0.865 ± 0.553 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.865 ± 0.419 | 0.432 ± 0.362 | 0.865 ± 0.66 | 0.432 ± 0.33 | 1.729 ± 0.961 | 1.729 ± 0.962 | 0.865 ± 0.66 | 0.432 ± 0.463 | 0.865 ± 0.421 | 0.0 ± 0.0
Asn | 2.594 ± 1.284 | 1.297 ± 0.553 | 2.162 ± 0.79 | 1.297 ± 0.612 | 0.865 ± 0.431 | 3.459 ± 1.563 | 0.865 ± 0.535 | 3.459 ± 1.003 | 1.729 ± 0.999 | 0.865 ± 0.546 | 0.432 ± 0.399 | 2.594 ± 1.461 | 3.459 ± 0.888 | 0.865 ± 0.556 | 4.323 ± 1.137 | 3.026 ± 1.138 | 1.729 ± 0.604 | 8.214 ± 2.745 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Pro | 3.891 ± 1.352 | 1.297 ± 0.713 | 3.026 ± 1.134 | 4.756 ± 1.455 | 2.162 ± 0.963 | 1.729 ± 0.704 | 1.297 ± 0.408 | 3.891 ± 1.453 | 3.026 ± 0.863 | 3.891 ± 1.603 | 0.865 ± 0.441 | 3.026 ± 1.021 | 4.323 ± 1.069 | 0.865 ± 0.619 | 3.026 ± 0.912 | 4.756 ± 2.137 | 4.756 ± 1.607 | 4.323 ± 1.601 | 0.432 ± 0.33 | 1.729 ± 0.707 | 0.0 ± 0.0
Gln | 3.459 ± 1.262 | 1.297 ± 0.99 | 2.594 ± 0.995 | 1.729 ± 0.271 | 0.865 ± 0.7 | 1.729 ± 0.582 | 0.432 ± 0.33 | 3.026 ± 1.144 | 1.729 ± 0.732 | 4.756 ± 2.697 | 1.729 ± 0.864 | 2.162 ± 0.749 | 2.162 ± 1.055 | 1.729 ± 0.655 | 2.594 ± 1.134 | 2.162 ± 0.909 | 3.459 ± 1.155 | 1.729 ± 0.532 | 0.432 ± 0.33 | 1.729 ± 1.05 | 0.0 ± 0.0
Arg | 2.162 ± 0.508 | 2.162 ± 1.429 | 0.865 ± 0.66 | 3.026 ± 1.052 | 2.162 ± 0.801 | 3.026 ± 1.161 | 1.729 ± 0.767 | 3.026 ± 1.197 | 3.891 ± 0.765 | 7.782 ± 2.34 | 0.432 ± 0.33 | 3.459 ± 0.762 | 3.026 ± 1.039 | 3.026 ± 1.484 | 7.35 ± 1.466 | 3.891 ± 0.829 | 4.756 ± 0.88 | 4.756 ± 0.976 | 0.432 ± 0.5 | 3.026 ± 1.734 | 0.0 ± 0.0
Ser | 3.891 ± 0.878 | 0.0 ± 0.0 | 5.188 ± 1.433 | 4.756 ± 1.63 | 3.891 ± 1.238 | 3.891 ± 1.223 | 2.162 ± 0.909 | 5.62 ± 1.714 | 2.162 ± 0.823 | 9.079 ± 1.597 | 1.729 ± 0.842 | 3.026 ± 0.852 | 4.323 ± 0.939 | 1.297 ± 0.763 | 3.459 ± 0.821 | 5.188 ± 1.084 | 6.053 ± 1.233 | 3.459 ± 1.904 | 0.432 ± 0.463 | 0.865 ± 0.695 | 0.0 ± 0.0
Thr | 2.594 ± 1.063 | 0.865 ± 0.441 | 4.323 ± 2.099 | 3.459 ± 0.809 | 2.594 ± 0.63 | 2.162 ± 0.508 | 1.297 ± 0.408 | 3.459 ± 1.879 | 2.594 ± 1.806 | 7.782 ± 3.588 | 2.594 ± 1.244 | 1.729 ± 0.271 | 3.459 ± 1.29 | 3.459 ± 0.614 | 5.62 ± 0.556 | 3.459 ± 0.623 | 5.188 ± 3.487 | 8.647 ± 3.425 | 0.0 ± 0.0 | 2.162 ± 0.898 | 0.0 ± 0.0
Val | 3.891 ± 1.16 | 0.865 ± 0.53 | 4.323 ± 0.767 | 6.485 ± 1.583 | 2.162 ± 0.735 | 6.053 ± 1.97 | 2.162 ± 1.536 | 4.323 ± 1.523 | 1.729 ± 0.737 | 4.323 ± 1.208 | 1.297 ± 0.612 | 2.162 ± 1.489 | 4.756 ± 0.646 | 3.026 ± 0.629 | 3.459 ± 1.282 | 3.891 ± 0.526 | 3.026 ± 1.088 | 4.323 ± 1.582 | 0.865 ± 0.546 | 2.162 ± 0.436 | 0.0 ± 0.0
Trp | 0.432 ± 0.362 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.432 ± 0.441 | 0.432 ± 0.33 | 0.865 ± 0.725 | 0.0 ± 0.0 | 1.729 ± 0.577 | 1.729 ± 0.884 | 1.297 ± 0.664 | 0.0 ± 0.0 | 0.432 ± 0.362 | 0.0 ± 0.0 | 0.432 ± 0.463 | 0.432 ± 0.441 | 0.0 ± 0.0 | 1.297 ± 0.945 | 1.297 ± 0.633 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Tyr | 2.162 ± 0.749 | 0.865 ± 0.712 | 1.297 ± 0.713 | 2.162 ± 0.618 | 1.729 ± 0.639 | 5.188 ± 1.147 | 0.865 ± 0.546 | 3.891 ± 1.037 | 3.026 ± 0.518 | 2.594 ± 0.7 | 0.432 ± 0.33 | 0.865 ± 0.421 | 0.865 ± 0.546 | 1.297 ± 0.455 | 2.162 ± 0.956 | 3.459 ± 0.879 | 1.297 ± 0.488 | 0.865 ± 0.529 | 0.0 ± 0.0 | 1.297 ± 1.388 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 7 proteins (2314 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level
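A protein-level bootstrap of this kind can be sketched as follows. The note only says that 100 bootstrap replicates were drawn at the protein level; that the reported ± value is the standard deviation across replicates, the overlapping-window dipeptide count, and the names `pair_per_mille` and `bootstrap_error` are assumptions made for this illustration, and the sequences are toy data.

```python
import random
from collections import Counter
from statistics import stdev


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (in per mille) over a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Resample whole proteins with replacement and return the spread of the estimate.

    Assumption: the error is taken as the sample standard deviation
    of the per mille frequency across the bootstrap replicates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome contains, with replacement.
        resampled = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(resampled, pair))
    return stdev(estimates)


# Toy proteome (hypothetical sequences, not the 7 proteins behind the table).
toy = ["MALLKA", "LLAG", "GGSLL"]
print(pair_per_mille(toy, "LL"), bootstrap_error(toy, "LL"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins happen to be in the proteome. With only a handful of proteins, this spread can be large relative to the frequency itself.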

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski