Amino acid dipepetide frequency for Baja California bark scorpion polyomavirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.448AlaAla: 6.448 ± 6.686
0.0AlaCys: 0.0 ± 0.0
4.836AlaAsp: 4.836 ± 0.736
5.911AlaGlu: 5.911 ± 1.488
3.224AlaPhe: 3.224 ± 0.685
3.224AlaGly: 3.224 ± 2.835
2.149AlaHis: 2.149 ± 0.804
4.299AlaIle: 4.299 ± 2.719
3.761AlaLys: 3.761 ± 1.143
6.985AlaLeu: 6.985 ± 3.483
1.612AlaMet: 1.612 ± 0.758
2.149AlaAsn: 2.149 ± 1.482
1.612AlaPro: 1.612 ± 1.616
0.537AlaGln: 0.537 ± 0.37
1.612AlaArg: 1.612 ± 0.758
3.761AlaSer: 3.761 ± 0.685
2.149AlaThr: 2.149 ± 1.065
4.836AlaVal: 4.836 ± 2.167
0.537AlaTrp: 0.537 ± 0.767
3.761AlaTyr: 3.761 ± 1.38
0.0AlaXaa: 0.0 ± 0.0
Cys
1.612CysAla: 1.612 ± 0.672
1.075CysCys: 1.075 ± 0.74
1.612CysAsp: 1.612 ± 0.635
0.537CysGlu: 0.537 ± 0.441
1.075CysPhe: 1.075 ± 0.402
1.075CysGly: 1.075 ± 0.807
0.537CysHis: 0.537 ± 0.37
1.612CysIle: 1.612 ± 1.322
1.075CysLys: 1.075 ± 0.402
3.761CysLeu: 3.761 ± 1.49
0.0CysMet: 0.0 ± 0.0
1.612CysAsn: 1.612 ± 1.11
0.0CysPro: 0.0 ± 0.0
1.612CysGln: 1.612 ± 0.833
2.149CysArg: 2.149 ± 1.666
5.373CysSer: 5.373 ± 2.303
2.149CysThr: 2.149 ± 0.958
3.761CysVal: 3.761 ± 1.091
0.537CysTrp: 0.537 ± 0.85
1.075CysTyr: 1.075 ± 1.701
0.0CysXaa: 0.0 ± 0.0
Asp
2.687AspAla: 2.687 ± 0.687
1.612AspCys: 1.612 ± 0.758
3.761AspAsp: 3.761 ± 1.38
2.687AspGlu: 2.687 ± 0.996
2.149AspPhe: 2.149 ± 0.958
1.612AspGly: 1.612 ± 0.635
2.687AspHis: 2.687 ± 1.602
3.761AspIle: 3.761 ± 1.882
2.687AspLys: 2.687 ± 0.879
2.149AspLeu: 2.149 ± 0.779
0.537AspMet: 0.537 ± 0.37
3.224AspAsn: 3.224 ± 0.928
2.687AspPro: 2.687 ± 0.888
0.537AspGln: 0.537 ± 0.767
4.836AspArg: 4.836 ± 1.256
4.836AspSer: 4.836 ± 1.311
3.224AspThr: 3.224 ± 1.416
2.687AspVal: 2.687 ± 1.602
1.612AspTrp: 1.612 ± 0.758
1.612AspTyr: 1.612 ± 0.94
0.0AspXaa: 0.0 ± 0.0
Glu
3.761GluAla: 3.761 ± 1.882
0.537GluCys: 0.537 ± 0.679
3.761GluAsp: 3.761 ± 1.518
5.911GluGlu: 5.911 ± 2.945
2.149GluPhe: 2.149 ± 1.311
1.612GluGly: 1.612 ± 0.607
0.537GluHis: 0.537 ± 0.441
4.299GluIle: 4.299 ± 0.607
6.985GluLys: 6.985 ± 1.311
3.761GluLeu: 3.761 ± 1.073
2.149GluMet: 2.149 ± 1.225
1.612GluAsn: 1.612 ± 0.986
0.537GluPro: 0.537 ± 0.37
2.687GluGln: 2.687 ± 0.879
0.537GluArg: 0.537 ± 0.37
5.911GluSer: 5.911 ± 2.041
5.373GluThr: 5.373 ± 1.503
4.299GluVal: 4.299 ± 1.058
0.0GluTrp: 0.0 ± 0.0
2.149GluTyr: 2.149 ± 0.958
0.0GluXaa: 0.0 ± 0.0
Phe
3.761PheAla: 3.761 ± 1.211
0.537PheCys: 0.537 ± 0.441
4.836PheAsp: 4.836 ± 1.588
2.687PheGlu: 2.687 ± 1.082
3.224PhePhe: 3.224 ± 0.947
0.537PheGly: 0.537 ± 0.37
2.149PheHis: 2.149 ± 1.173
3.761PheIle: 3.761 ± 1.814
4.299PheLys: 4.299 ± 1.83
3.224PheLeu: 3.224 ± 1.834
2.687PheMet: 2.687 ± 0.705
1.612PheAsn: 1.612 ± 1.026
1.075PhePro: 1.075 ± 0.402
1.075PheGln: 1.075 ± 0.748
2.149PheArg: 2.149 ± 1.139
4.836PheSer: 4.836 ± 2.02
1.612PheThr: 1.612 ± 0.635
1.612PheVal: 1.612 ± 1.11
0.0PheTrp: 0.0 ± 0.0
2.149PheTyr: 2.149 ± 1.105
0.0PheXaa: 0.0 ± 0.0
Gly
3.761GlyAla: 3.761 ± 4.499
2.687GlyCys: 2.687 ± 1.468
1.075GlyAsp: 1.075 ± 0.741
3.761GlyGlu: 3.761 ± 1.211
1.075GlyPhe: 1.075 ± 0.74
4.299GlyGly: 4.299 ± 1.098
0.537GlyHis: 0.537 ± 0.37
3.224GlyIle: 3.224 ± 2.262
4.299GlyLys: 4.299 ± 1.608
3.761GlyLeu: 3.761 ± 2.355
1.075GlyMet: 1.075 ± 0.741
4.299GlyAsn: 4.299 ± 2.304
1.612GlyPro: 1.612 ± 1.11
0.537GlyGln: 0.537 ± 0.37
3.761GlyArg: 3.761 ± 1.574
5.373GlySer: 5.373 ± 4.468
1.075GlyThr: 1.075 ± 0.741
2.149GlyVal: 2.149 ± 0.658
1.075GlyTrp: 1.075 ± 0.833
3.224GlyTyr: 3.224 ± 1.77
0.0GlyXaa: 0.0 ± 0.0
His
3.761HisAla: 3.761 ± 1.092
0.537HisCys: 0.537 ± 0.85
0.0HisAsp: 0.0 ± 0.0
1.612HisGlu: 1.612 ± 1.322
1.075HisPhe: 1.075 ± 0.402
0.0HisGly: 0.0 ± 0.0
0.537HisHis: 0.537 ± 0.679
1.612HisIle: 1.612 ± 1.023
1.612HisLys: 1.612 ± 0.758
2.687HisLeu: 2.687 ± 1.738
0.0HisMet: 0.0 ± 0.0
1.075HisAsn: 1.075 ± 1.127
1.075HisPro: 1.075 ± 0.402
1.075HisGln: 1.075 ± 0.402
1.075HisArg: 1.075 ± 0.402
1.075HisSer: 1.075 ± 0.881
1.612HisThr: 1.612 ± 0.833
1.075HisVal: 1.075 ± 0.664
0.0HisTrp: 0.0 ± 0.0
1.075HisTyr: 1.075 ± 0.402
0.0HisXaa: 0.0 ± 0.0
Ile
3.761IleAla: 3.761 ± 1.326
3.761IleCys: 3.761 ± 2.342
5.373IleAsp: 5.373 ± 1.875
3.224IleGlu: 3.224 ± 1.726
5.373IlePhe: 5.373 ± 2.215
4.836IleGly: 4.836 ± 2.619
1.075IleHis: 1.075 ± 0.833
3.761IleIle: 3.761 ± 1.481
5.911IleLys: 5.911 ± 1.519
4.299IleLeu: 4.299 ± 1.208
1.075IleMet: 1.075 ± 0.737
2.687IleAsn: 2.687 ± 0.687
2.687IlePro: 2.687 ± 0.83
3.224IleGln: 3.224 ± 1.043
3.761IleArg: 3.761 ± 1.517
3.224IleSer: 3.224 ± 1.479
2.149IleThr: 2.149 ± 0.804
2.687IleVal: 2.687 ± 1.55
0.0IleTrp: 0.0 ± 0.0
3.224IleTyr: 3.224 ± 1.201
0.0IleXaa: 0.0 ± 0.0
Lys
2.149LysAla: 2.149 ± 1.152
3.224LysCys: 3.224 ± 0.8
2.687LysAsp: 2.687 ± 1.13
3.761LysGlu: 3.761 ± 1.518
3.224LysPhe: 3.224 ± 1.308
3.224LysGly: 3.224 ± 1.17
0.537LysHis: 0.537 ± 0.679
4.299LysIle: 4.299 ± 1.457
14.508LysLys: 14.508 ± 3.338
8.06LysLeu: 8.06 ± 1.521
2.149LysMet: 2.149 ± 0.832
10.747LysAsn: 10.747 ± 2.288
4.299LysPro: 4.299 ± 1.066
2.149LysGln: 2.149 ± 0.801
4.299LysArg: 4.299 ± 1.317
3.761LysSer: 3.761 ± 2.08
4.299LysThr: 4.299 ± 1.154
2.149LysVal: 2.149 ± 1.129
1.612LysTrp: 1.612 ± 1.291
1.612LysTyr: 1.612 ± 0.758
0.0LysXaa: 0.0 ± 0.0
Leu
3.224LeuAla: 3.224 ± 1.213
1.075LeuCys: 1.075 ± 0.807
4.299LeuAsp: 4.299 ± 1.232
5.911LeuGlu: 5.911 ± 0.942
3.761LeuPhe: 3.761 ± 1.191
3.224LeuGly: 3.224 ± 1.343
2.687LeuHis: 2.687 ± 1.313
4.836LeuIle: 4.836 ± 1.453
6.985LeuLys: 6.985 ± 2.901
6.985LeuLeu: 6.985 ± 3.221
3.761LeuMet: 3.761 ± 1.122
5.373LeuAsn: 5.373 ± 2.012
3.224LeuPro: 3.224 ± 0.835
0.0LeuGln: 0.0 ± 0.0
3.224LeuArg: 3.224 ± 1.539
3.224LeuSer: 3.224 ± 0.834
5.911LeuThr: 5.911 ± 1.574
4.299LeuVal: 4.299 ± 0.794
0.0LeuTrp: 0.0 ± 0.0
4.836LeuTyr: 4.836 ± 2.016
0.0LeuXaa: 0.0 ± 0.0
Met
0.537MetAla: 0.537 ± 0.441
1.612MetCys: 1.612 ± 1.11
0.537MetAsp: 0.537 ± 0.441
1.612MetGlu: 1.612 ± 0.833
1.075MetPhe: 1.075 ± 0.807
1.075MetGly: 1.075 ± 1.039
2.149MetHis: 2.149 ± 1.386
0.537MetIle: 0.537 ± 0.441
0.537MetLys: 0.537 ± 0.441
2.149MetLeu: 2.149 ± 0.658
0.0MetMet: 0.0 ± 0.0
1.612MetAsn: 1.612 ± 0.886
1.612MetPro: 1.612 ± 0.635
0.537MetGln: 0.537 ± 0.37
1.612MetArg: 1.612 ± 0.833
2.149MetSer: 2.149 ± 0.889
3.224MetThr: 3.224 ± 1.111
1.075MetVal: 1.075 ± 0.402
0.537MetTrp: 0.537 ± 0.441
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
5.911AsnAla: 5.911 ± 1.246
2.687AsnCys: 2.687 ± 0.888
1.612AsnAsp: 1.612 ± 0.635
1.612AsnGlu: 1.612 ± 0.608
3.224AsnPhe: 3.224 ± 0.929
3.224AsnGly: 3.224 ± 1.416
0.537AsnHis: 0.537 ± 0.679
3.761AsnIle: 3.761 ± 0.682
4.836AsnLys: 4.836 ± 1.278
5.373AsnLeu: 5.373 ± 2.589
0.537AsnMet: 0.537 ± 0.37
4.299AsnAsn: 4.299 ± 1.373
4.836AsnPro: 4.836 ± 1.573
2.149AsnGln: 2.149 ± 0.658
3.224AsnArg: 3.224 ± 0.968
2.687AsnSer: 2.687 ± 1.602
1.612AsnThr: 1.612 ± 0.635
3.224AsnVal: 3.224 ± 1.206
1.075AsnTrp: 1.075 ± 1.127
5.911AsnTyr: 5.911 ± 1.325
0.0AsnXaa: 0.0 ± 0.0
Pro
2.149ProAla: 2.149 ± 1.48
2.687ProCys: 2.687 ± 1.85
4.836ProAsp: 4.836 ± 1.278
3.224ProGlu: 3.224 ± 1.452
2.149ProPhe: 2.149 ± 0.958
2.149ProGly: 2.149 ± 0.697
0.537ProHis: 0.537 ± 0.679
3.224ProIle: 3.224 ± 1.201
4.836ProLys: 4.836 ± 1.891
2.687ProLeu: 2.687 ± 0.824
1.075ProMet: 1.075 ± 0.807
3.224ProAsn: 3.224 ± 1.716
5.373ProPro: 5.373 ± 1.45
0.0ProGln: 0.0 ± 0.0
1.612ProArg: 1.612 ± 0.635
2.687ProSer: 2.687 ± 1.306
3.224ProThr: 3.224 ± 0.676
2.149ProVal: 2.149 ± 0.958
0.537ProTrp: 0.537 ± 0.441
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
1.075GlnAla: 1.075 ± 0.402
0.537GlnCys: 0.537 ± 0.679
2.149GlnAsp: 2.149 ± 0.877
1.612GlnGlu: 1.612 ± 0.635
2.149GlnPhe: 2.149 ± 0.804
2.149GlnGly: 2.149 ± 1.173
0.537GlnHis: 0.537 ± 0.37
1.612GlnIle: 1.612 ± 1.11
2.687GlnLys: 2.687 ± 1.441
1.612GlnLeu: 1.612 ± 1.026
0.537GlnMet: 0.537 ± 0.679
1.612GlnAsn: 1.612 ± 1.11
1.612GlnPro: 1.612 ± 0.635
1.612GlnGln: 1.612 ± 0.758
1.612GlnArg: 1.612 ± 0.646
1.612GlnSer: 1.612 ± 1.462
0.537GlnThr: 0.537 ± 0.441
2.687GlnVal: 2.687 ± 1.281
1.075GlnTrp: 1.075 ± 0.74
1.075GlnTyr: 1.075 ± 0.402
0.0GlnXaa: 0.0 ± 0.0
Arg
1.075ArgAla: 1.075 ± 1.036
2.149ArgCys: 2.149 ± 1.613
1.612ArgAsp: 1.612 ± 0.758
2.149ArgGlu: 2.149 ± 1.48
2.149ArgPhe: 2.149 ± 1.342
5.911ArgGly: 5.911 ± 2.675
1.075ArgHis: 1.075 ± 0.759
4.836ArgIle: 4.836 ± 1.002
4.299ArgLys: 4.299 ± 1.83
4.299ArgLeu: 4.299 ± 1.778
1.612ArgMet: 1.612 ± 0.758
2.149ArgAsn: 2.149 ± 0.674
2.687ArgPro: 2.687 ± 1.306
1.612ArgGln: 1.612 ± 0.758
5.911ArgArg: 5.911 ± 1.626
2.687ArgSer: 2.687 ± 1.526
2.687ArgThr: 2.687 ± 1.84
1.612ArgVal: 1.612 ± 0.819
2.149ArgTrp: 2.149 ± 1.613
1.612ArgTyr: 1.612 ± 0.672
0.0ArgXaa: 0.0 ± 0.0
Ser
6.985SerAla: 6.985 ± 5.028
1.612SerCys: 1.612 ± 0.923
2.149SerAsp: 2.149 ± 0.958
3.224SerGlu: 3.224 ± 1.213
2.149SerPhe: 2.149 ± 1.48
5.373SerGly: 5.373 ± 2.647
1.075SerHis: 1.075 ± 0.748
4.836SerIle: 4.836 ± 2.53
4.299SerLys: 4.299 ± 1.776
5.373SerLeu: 5.373 ± 1.769
1.612SerMet: 1.612 ± 0.635
5.911SerAsn: 5.911 ± 1.325
4.299SerPro: 4.299 ± 1.29
2.687SerGln: 2.687 ± 1.13
5.373SerArg: 5.373 ± 2.383
2.687SerSer: 2.687 ± 1.418
2.149SerThr: 2.149 ± 1.6
3.224SerVal: 3.224 ± 2.025
1.075SerTrp: 1.075 ± 1.258
1.612SerTyr: 1.612 ± 1.059
0.0SerXaa: 0.0 ± 0.0
Thr
3.224ThrAla: 3.224 ± 1.213
0.0ThrCys: 0.0 ± 0.0
2.687ThrAsp: 2.687 ± 1.422
3.224ThrGlu: 3.224 ± 1.372
3.224ThrPhe: 3.224 ± 0.877
2.149ThrGly: 2.149 ± 0.761
0.537ThrHis: 0.537 ± 0.85
4.836ThrIle: 4.836 ± 1.283
3.761ThrLys: 3.761 ± 1.229
2.687ThrLeu: 2.687 ± 1.306
0.537ThrMet: 0.537 ± 0.37
4.299ThrAsn: 4.299 ± 1.916
2.149ThrPro: 2.149 ± 0.958
2.687ThrGln: 2.687 ± 1.023
2.687ThrArg: 2.687 ± 1.429
3.761ThrSer: 3.761 ± 2.04
3.224ThrThr: 3.224 ± 1.823
3.224ThrVal: 3.224 ± 0.852
1.075ThrTrp: 1.075 ± 0.807
2.687ThrTyr: 2.687 ± 0.717
0.0ThrXaa: 0.0 ± 0.0
Val
5.373ValAla: 5.373 ± 1.062
3.224ValCys: 3.224 ± 1.269
2.149ValAsp: 2.149 ± 1.137
3.224ValGlu: 3.224 ± 0.947
1.612ValPhe: 1.612 ± 0.758
2.687ValGly: 2.687 ± 2.973
1.612ValHis: 1.612 ± 0.672
3.761ValIle: 3.761 ± 1.433
3.761ValLys: 3.761 ± 1.229
2.687ValLeu: 2.687 ± 0.748
0.537ValMet: 0.537 ± 0.441
1.612ValAsn: 1.612 ± 0.758
4.836ValPro: 4.836 ± 0.885
4.299ValGln: 4.299 ± 1.853
3.224ValArg: 3.224 ± 0.928
3.761ValSer: 3.761 ± 2.385
1.612ValThr: 1.612 ± 0.833
3.761ValVal: 3.761 ± 1.211
0.0ValTrp: 0.0 ± 0.0
0.0ValTyr: 0.0 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.537TrpAla: 0.537 ± 0.37
0.537TrpCys: 0.537 ± 0.37
0.0TrpAsp: 0.0 ± 0.0
0.537TrpGlu: 0.537 ± 0.37
1.075TrpPhe: 1.075 ± 0.881
2.149TrpGly: 2.149 ± 3.402
0.0TrpHis: 0.0 ± 0.0
0.537TrpIle: 0.537 ± 0.767
0.537TrpLys: 0.537 ± 0.679
1.075TrpLeu: 1.075 ± 1.127
0.537TrpMet: 0.537 ± 0.37
0.537TrpAsn: 0.537 ± 0.441
1.075TrpPro: 1.075 ± 0.79
0.0TrpGln: 0.0 ± 0.0
0.537TrpArg: 0.537 ± 0.85
1.075TrpSer: 1.075 ± 0.833
1.612TrpThr: 1.612 ± 0.833
1.075TrpVal: 1.075 ± 0.74
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.612TyrAla: 1.612 ± 1.023
2.149TyrCys: 2.149 ± 0.804
1.612TyrAsp: 1.612 ± 0.886
2.149TyrGlu: 2.149 ± 0.674
2.687TyrPhe: 2.687 ± 0.829
2.149TyrGly: 2.149 ± 1.311
1.075TyrHis: 1.075 ± 0.402
2.687TyrIle: 2.687 ± 0.996
1.612TyrLys: 1.612 ± 0.607
3.224TyrLeu: 3.224 ± 0.956
1.612TyrMet: 1.612 ± 0.499
2.687TyrAsn: 2.687 ± 0.888
1.075TyrPro: 1.075 ± 0.74
1.075TyrGln: 1.075 ± 0.74
1.075TyrArg: 1.075 ± 0.881
3.224TyrSer: 3.224 ± 2.723
3.224TyrThr: 3.224 ± 0.887
2.149TyrVal: 2.149 ± 0.958
0.537TyrTrp: 0.537 ± 0.37
1.075TyrTyr: 1.075 ± 0.833
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1862 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski