Amino acid dipepetide frequency for Santa barbara virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.798AlaAla: 2.798 ± 1.009
0.763AlaCys: 0.763 ± 0.342
2.544AlaAsp: 2.544 ± 0.588
1.018AlaGlu: 1.018 ± 0.407
1.272AlaPhe: 1.272 ± 0.656
1.781AlaGly: 1.781 ± 1.046
1.272AlaHis: 1.272 ± 0.557
2.035AlaIle: 2.035 ± 0.515
2.289AlaLys: 2.289 ± 0.823
4.07AlaLeu: 4.07 ± 1.067
1.018AlaMet: 1.018 ± 0.558
3.561AlaAsn: 3.561 ± 0.698
1.272AlaPro: 1.272 ± 1.066
2.289AlaGln: 2.289 ± 0.734
1.526AlaArg: 1.526 ± 0.965
3.307AlaSer: 3.307 ± 1.484
3.561AlaThr: 3.561 ± 0.715
1.526AlaVal: 1.526 ± 0.745
0.509AlaTrp: 0.509 ± 0.219
1.272AlaTyr: 1.272 ± 0.474
0.0AlaXaa: 0.0 ± 0.0
Cys
0.509CysAla: 0.509 ± 0.219
0.254CysCys: 0.254 ± 0.323
1.018CysAsp: 1.018 ± 0.585
0.254CysGlu: 0.254 ± 0.296
1.018CysPhe: 1.018 ± 0.31
0.509CysGly: 0.509 ± 0.219
0.254CysHis: 0.254 ± 0.146
1.526CysIle: 1.526 ± 0.389
1.272CysLys: 1.272 ± 1.412
1.272CysLeu: 1.272 ± 0.308
0.0CysMet: 0.0 ± 0.0
2.035CysAsn: 2.035 ± 0.801
0.763CysPro: 0.763 ± 0.685
1.526CysGln: 1.526 ± 0.449
0.254CysArg: 0.254 ± 0.296
1.526CysSer: 1.526 ± 0.409
1.018CysThr: 1.018 ± 0.31
0.763CysVal: 0.763 ± 0.225
0.254CysTrp: 0.254 ± 0.146
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.781AspAla: 1.781 ± 0.944
1.781AspCys: 1.781 ± 0.731
3.561AspAsp: 3.561 ± 0.636
2.289AspGlu: 2.289 ± 0.538
2.289AspPhe: 2.289 ± 0.843
1.526AspGly: 1.526 ± 0.409
1.526AspHis: 1.526 ± 1.369
3.561AspIle: 3.561 ± 0.78
3.307AspLys: 3.307 ± 0.572
6.105AspLeu: 6.105 ± 1.757
1.781AspMet: 1.781 ± 0.731
2.289AspAsn: 2.289 ± 0.893
3.307AspPro: 3.307 ± 0.727
4.325AspGln: 4.325 ± 1.39
1.781AspArg: 1.781 ± 0.431
5.342AspSer: 5.342 ± 1.102
1.781AspThr: 1.781 ± 1.444
2.035AspVal: 2.035 ± 0.425
1.781AspTrp: 1.781 ± 0.805
2.289AspTyr: 2.289 ± 0.876
0.0AspXaa: 0.0 ± 0.0
Glu
2.544GluAla: 2.544 ± 1.068
0.763GluCys: 0.763 ± 0.5
3.561GluAsp: 3.561 ± 0.65
6.105GluGlu: 6.105 ± 1.477
5.088GluPhe: 5.088 ± 1.332
4.07GluGly: 4.07 ± 0.551
1.018GluHis: 1.018 ± 0.437
6.36GluIle: 6.36 ± 1.713
4.579GluLys: 4.579 ± 0.794
6.614GluLeu: 6.614 ± 1.768
1.526GluMet: 1.526 ± 0.562
2.544GluAsn: 2.544 ± 0.379
2.289GluPro: 2.289 ± 1.021
1.526GluGln: 1.526 ± 0.65
2.544GluArg: 2.544 ± 0.628
4.325GluSer: 4.325 ± 0.676
3.307GluThr: 3.307 ± 0.874
4.833GluVal: 4.833 ± 0.805
1.018GluTrp: 1.018 ± 0.975
1.526GluTyr: 1.526 ± 0.45
0.0GluXaa: 0.0 ± 0.0
Phe
1.018PheAla: 1.018 ± 0.585
0.509PheCys: 0.509 ± 0.219
3.816PheAsp: 3.816 ± 0.982
4.07PheGlu: 4.07 ± 1.821
2.798PhePhe: 2.798 ± 1.001
4.325PheGly: 4.325 ± 0.632
1.272PheHis: 1.272 ± 0.557
2.544PheIle: 2.544 ± 1.138
3.816PheLys: 3.816 ± 0.765
4.07PheLeu: 4.07 ± 1.214
0.763PheMet: 0.763 ± 0.566
1.272PheAsn: 1.272 ± 0.342
3.561PhePro: 3.561 ± 0.965
2.798PheGln: 2.798 ± 0.701
1.018PheArg: 1.018 ± 0.585
3.816PheSer: 3.816 ± 0.666
1.018PheThr: 1.018 ± 0.792
3.053PheVal: 3.053 ± 0.904
1.526PheTrp: 1.526 ± 0.493
1.272PheTyr: 1.272 ± 1.054
0.0PheXaa: 0.0 ± 0.0
Gly
1.272GlyAla: 1.272 ± 0.429
0.763GlyCys: 0.763 ± 0.225
2.544GlyAsp: 2.544 ± 0.444
2.798GlyGlu: 2.798 ± 0.964
2.289GlyPhe: 2.289 ± 1.085
3.053GlyGly: 3.053 ± 1.185
0.509GlyHis: 0.509 ± 0.219
6.36GlyIle: 6.36 ± 1.027
3.561GlyLys: 3.561 ± 1.257
9.158GlyLeu: 9.158 ± 1.253
0.763GlyMet: 0.763 ± 0.225
1.526GlyAsn: 1.526 ± 0.417
1.781GlyPro: 1.781 ± 0.643
1.272GlyGln: 1.272 ± 0.6
2.035GlyArg: 2.035 ± 0.303
3.816GlySer: 3.816 ± 1.175
2.289GlyThr: 2.289 ± 0.942
1.526GlyVal: 1.526 ± 0.752
1.018GlyTrp: 1.018 ± 0.571
2.035GlyTyr: 2.035 ± 0.595
0.0GlyXaa: 0.0 ± 0.0
His
1.018HisAla: 1.018 ± 0.321
0.0HisCys: 0.0 ± 0.0
0.763HisAsp: 0.763 ± 0.517
1.018HisGlu: 1.018 ± 0.792
1.018HisPhe: 1.018 ± 0.448
0.509HisGly: 0.509 ± 0.292
1.018HisHis: 1.018 ± 0.509
2.289HisIle: 2.289 ± 0.509
0.763HisLys: 0.763 ± 0.454
1.272HisLeu: 1.272 ± 0.418
0.509HisMet: 0.509 ± 0.555
1.272HisAsn: 1.272 ± 0.418
1.526HisPro: 1.526 ± 0.656
0.763HisGln: 0.763 ± 0.225
1.781HisArg: 1.781 ± 0.464
2.035HisSer: 2.035 ± 0.943
1.018HisThr: 1.018 ± 0.756
0.509HisVal: 0.509 ± 0.588
0.509HisTrp: 0.509 ± 0.219
2.035HisTyr: 2.035 ± 0.57
0.0HisXaa: 0.0 ± 0.0
Ile
0.763IleAla: 0.763 ± 0.517
1.272IleCys: 1.272 ± 0.474
3.561IleAsp: 3.561 ± 0.498
6.105IleGlu: 6.105 ± 1.521
3.053IlePhe: 3.053 ± 0.978
4.579IleGly: 4.579 ± 1.268
1.526IleHis: 1.526 ± 0.516
8.14IleIle: 8.14 ± 0.854
7.377IleLys: 7.377 ± 1.207
9.158IleLeu: 9.158 ± 1.758
1.018IleMet: 1.018 ± 0.734
5.851IleAsn: 5.851 ± 1.087
4.833IlePro: 4.833 ± 0.664
2.289IleGln: 2.289 ± 1.081
4.833IleArg: 4.833 ± 1.258
5.342IleSer: 5.342 ± 1.565
5.342IleThr: 5.342 ± 2.148
3.816IleVal: 3.816 ± 1.246
1.526IleTrp: 1.526 ± 0.513
2.544IleTyr: 2.544 ± 0.652
0.0IleXaa: 0.0 ± 0.0
Lys
3.053LysAla: 3.053 ± 1.454
0.763LysCys: 0.763 ± 0.535
2.289LysAsp: 2.289 ± 0.638
6.36LysGlu: 6.36 ± 1.385
3.816LysPhe: 3.816 ± 1.541
4.833LysGly: 4.833 ± 1.052
1.526LysHis: 1.526 ± 0.545
6.614LysIle: 6.614 ± 1.787
7.632LysLys: 7.632 ± 2.093
6.105LysLeu: 6.105 ± 0.964
2.798LysMet: 2.798 ± 0.881
5.088LysAsn: 5.088 ± 1.082
2.544LysPro: 2.544 ± 1.079
4.579LysGln: 4.579 ± 0.974
5.597LysArg: 5.597 ± 0.962
4.325LysSer: 4.325 ± 0.335
3.816LysThr: 3.816 ± 0.95
2.798LysVal: 2.798 ± 1.118
0.763LysTrp: 0.763 ± 0.439
2.035LysTyr: 2.035 ± 0.502
0.0LysXaa: 0.0 ± 0.0
Leu
4.833LeuAla: 4.833 ± 0.712
1.018LeuCys: 1.018 ± 0.31
5.851LeuAsp: 5.851 ± 1.835
5.597LeuGlu: 5.597 ± 0.665
4.833LeuPhe: 4.833 ± 0.672
5.088LeuGly: 5.088 ± 0.515
1.018LeuHis: 1.018 ± 0.407
9.412LeuIle: 9.412 ± 2.054
6.868LeuLys: 6.868 ± 1.09
7.632LeuLeu: 7.632 ± 2.027
3.053LeuMet: 3.053 ± 1.089
9.158LeuAsn: 9.158 ± 1.093
3.561LeuPro: 3.561 ± 1.14
2.289LeuGln: 2.289 ± 0.835
7.123LeuArg: 7.123 ± 2.092
8.395LeuSer: 8.395 ± 1.267
6.105LeuThr: 6.105 ± 1.382
4.579LeuVal: 4.579 ± 1.601
1.526LeuTrp: 1.526 ± 0.417
4.325LeuTyr: 4.325 ± 0.902
0.0LeuXaa: 0.0 ± 0.0
Met
1.018MetAla: 1.018 ± 0.698
0.254MetCys: 0.254 ± 0.146
0.763MetAsp: 0.763 ± 0.539
0.763MetGlu: 0.763 ± 0.668
1.526MetPhe: 1.526 ± 0.259
1.272MetGly: 1.272 ± 0.286
0.254MetHis: 0.254 ± 0.146
2.035MetIle: 2.035 ± 0.424
1.526MetLys: 1.526 ± 0.389
1.272MetLeu: 1.272 ± 0.769
0.509MetMet: 0.509 ± 0.708
1.018MetAsn: 1.018 ± 0.58
0.763MetPro: 0.763 ± 0.342
0.509MetGln: 0.509 ± 0.287
0.763MetArg: 0.763 ± 0.439
1.272MetSer: 1.272 ± 0.597
1.781MetThr: 1.781 ± 0.466
1.526MetVal: 1.526 ± 0.389
0.0MetTrp: 0.0 ± 0.0
1.526MetTyr: 1.526 ± 0.888
0.0MetXaa: 0.0 ± 0.0
Asn
3.816AsnAla: 3.816 ± 1.24
1.018AsnCys: 1.018 ± 0.7
3.561AsnAsp: 3.561 ± 1.004
4.07AsnGlu: 4.07 ± 1.04
2.035AsnPhe: 2.035 ± 0.946
2.289AsnGly: 2.289 ± 0.843
1.526AsnHis: 1.526 ± 0.468
4.833AsnIle: 4.833 ± 0.946
5.088AsnLys: 5.088 ± 0.768
6.868AsnLeu: 6.868 ± 1.74
0.254AsnMet: 0.254 ± 0.146
4.07AsnAsn: 4.07 ± 1.568
3.053AsnPro: 3.053 ± 1.083
3.561AsnGln: 3.561 ± 0.715
1.526AsnArg: 1.526 ± 0.515
5.597AsnSer: 5.597 ± 0.711
3.307AsnThr: 3.307 ± 0.8
2.798AsnVal: 2.798 ± 0.715
2.289AsnTrp: 2.289 ± 0.662
2.289AsnTyr: 2.289 ± 0.509
0.0AsnXaa: 0.0 ± 0.0
Pro
1.526ProAla: 1.526 ± 0.87
0.763ProCys: 0.763 ± 0.326
3.307ProAsp: 3.307 ± 0.749
3.053ProGlu: 3.053 ± 0.407
2.798ProPhe: 2.798 ± 1.154
1.272ProGly: 1.272 ± 0.474
1.781ProHis: 1.781 ± 0.53
2.289ProIle: 2.289 ± 0.684
3.053ProLys: 3.053 ± 0.735
3.816ProLeu: 3.816 ± 1.411
0.509ProMet: 0.509 ± 0.441
2.544ProAsn: 2.544 ± 0.866
2.035ProPro: 2.035 ± 0.425
1.018ProGln: 1.018 ± 0.914
1.781ProArg: 1.781 ± 0.567
4.325ProSer: 4.325 ± 1.008
2.798ProThr: 2.798 ± 0.489
2.544ProVal: 2.544 ± 0.848
0.254ProTrp: 0.254 ± 0.146
2.544ProTyr: 2.544 ± 1.303
0.0ProXaa: 0.0 ± 0.0
Gln
1.526GlnAla: 1.526 ± 0.862
0.763GlnCys: 0.763 ± 0.363
1.018GlnAsp: 1.018 ± 0.597
3.561GlnGlu: 3.561 ± 0.703
1.272GlnPhe: 1.272 ± 0.656
1.781GlnGly: 1.781 ± 0.449
0.763GlnHis: 0.763 ± 0.342
2.798GlnIle: 2.798 ± 1.653
3.307GlnLys: 3.307 ± 0.736
3.816GlnLeu: 3.816 ± 1.495
0.509GlnMet: 0.509 ± 0.29
3.307GlnAsn: 3.307 ± 0.706
0.254GlnPro: 0.254 ± 0.146
0.254GlnGln: 0.254 ± 0.146
1.018GlnArg: 1.018 ± 0.31
3.053GlnSer: 3.053 ± 0.721
3.053GlnThr: 3.053 ± 1.232
1.018GlnVal: 1.018 ± 0.474
0.254GlnTrp: 0.254 ± 0.323
1.526GlnTyr: 1.526 ± 1.026
0.0GlnXaa: 0.0 ± 0.0
Arg
1.526ArgAla: 1.526 ± 0.561
1.018ArgCys: 1.018 ± 0.31
2.798ArgAsp: 2.798 ± 0.775
4.07ArgGlu: 4.07 ± 1.061
2.289ArgPhe: 2.289 ± 0.982
3.307ArgGly: 3.307 ± 1.21
0.763ArgHis: 0.763 ± 0.439
1.781ArgIle: 1.781 ± 0.82
2.798ArgLys: 2.798 ± 0.853
3.816ArgLeu: 3.816 ± 0.942
0.254ArgMet: 0.254 ± 0.296
3.307ArgAsn: 3.307 ± 0.688
2.035ArgPro: 2.035 ± 0.593
1.272ArgGln: 1.272 ± 0.448
1.526ArgArg: 1.526 ± 1.348
3.053ArgSer: 3.053 ± 0.591
3.816ArgThr: 3.816 ± 0.579
1.781ArgVal: 1.781 ± 0.687
1.272ArgTrp: 1.272 ± 0.656
1.272ArgTyr: 1.272 ± 0.754
0.0ArgXaa: 0.0 ± 0.0
Ser
3.307SerAla: 3.307 ± 1.051
0.763SerCys: 0.763 ± 0.439
4.833SerAsp: 4.833 ± 1.514
6.36SerGlu: 6.36 ± 1.267
4.579SerPhe: 4.579 ± 0.983
3.307SerGly: 3.307 ± 0.818
2.035SerHis: 2.035 ± 0.882
7.123SerIle: 7.123 ± 0.307
5.342SerLys: 5.342 ± 0.766
10.43SerLeu: 10.43 ± 1.855
1.272SerMet: 1.272 ± 0.561
6.105SerAsn: 6.105 ± 0.984
3.816SerPro: 3.816 ± 0.455
1.526SerGln: 1.526 ± 0.598
3.816SerArg: 3.816 ± 0.893
6.105SerSer: 6.105 ± 1.488
2.544SerThr: 2.544 ± 0.653
2.035SerVal: 2.035 ± 1.064
1.272SerTrp: 1.272 ± 0.597
2.289SerTyr: 2.289 ± 0.379
0.0SerXaa: 0.0 ± 0.0
Thr
1.526ThrAla: 1.526 ± 1.044
1.526ThrCys: 1.526 ± 0.656
3.561ThrAsp: 3.561 ± 1.076
3.053ThrGlu: 3.053 ± 0.721
1.781ThrPhe: 1.781 ± 0.79
2.798ThrGly: 2.798 ± 0.853
1.781ThrHis: 1.781 ± 1.11
5.342ThrIle: 5.342 ± 0.975
5.088ThrLys: 5.088 ± 1.659
5.088ThrLeu: 5.088 ± 1.242
1.526ThrMet: 1.526 ± 0.468
3.053ThrAsn: 3.053 ± 0.639
2.544ThrPro: 2.544 ± 1.34
2.035ThrGln: 2.035 ± 1.17
0.254ThrArg: 0.254 ± 0.146
4.579ThrSer: 4.579 ± 0.631
2.798ThrThr: 2.798 ± 0.497
3.053ThrVal: 3.053 ± 1.027
2.289ThrTrp: 2.289 ± 0.385
1.272ThrTyr: 1.272 ± 0.429
0.0ThrXaa: 0.0 ± 0.0
Val
2.035ValAla: 2.035 ± 0.641
1.018ValCys: 1.018 ± 0.658
2.798ValAsp: 2.798 ± 1.392
1.781ValGlu: 1.781 ± 0.618
2.544ValPhe: 2.544 ± 0.58
2.289ValGly: 2.289 ± 0.596
0.509ValHis: 0.509 ± 0.287
3.561ValIle: 3.561 ± 0.598
5.088ValLys: 5.088 ± 1.384
5.597ValLeu: 5.597 ± 1.107
0.763ValMet: 0.763 ± 0.655
2.289ValAsn: 2.289 ± 1.058
1.781ValPro: 1.781 ± 0.79
0.254ValGln: 0.254 ± 0.146
1.781ValArg: 1.781 ± 0.927
3.307ValSer: 3.307 ± 0.564
4.325ValThr: 4.325 ± 1.233
1.018ValVal: 1.018 ± 0.437
0.254ValTrp: 0.254 ± 0.296
1.781ValTyr: 1.781 ± 1.004
0.0ValXaa: 0.0 ± 0.0
Trp
0.763TrpAla: 0.763 ± 0.326
0.254TrpCys: 0.254 ± 0.323
0.0TrpAsp: 0.0 ± 0.0
2.035TrpGlu: 2.035 ± 0.573
0.763TrpPhe: 0.763 ± 0.326
0.509TrpGly: 0.509 ± 0.292
0.509TrpHis: 0.509 ± 0.219
2.035TrpIle: 2.035 ± 0.939
1.526TrpLys: 1.526 ± 0.45
1.781TrpLeu: 1.781 ± 0.985
0.509TrpMet: 0.509 ± 0.29
1.272TrpAsn: 1.272 ± 0.516
0.509TrpPro: 0.509 ± 0.219
0.0TrpGln: 0.0 ± 0.0
0.509TrpArg: 0.509 ± 0.292
2.544TrpSer: 2.544 ± 0.837
0.509TrpThr: 0.509 ± 0.287
1.781TrpVal: 1.781 ± 1.09
0.509TrpTrp: 0.509 ± 0.593
1.272TrpTyr: 1.272 ± 1.087
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.053TyrAla: 3.053 ± 0.757
0.763TyrCys: 0.763 ± 0.528
2.289TyrAsp: 2.289 ± 0.805
1.781TyrGlu: 1.781 ± 0.822
1.526TyrPhe: 1.526 ± 0.734
1.272TyrGly: 1.272 ± 0.486
0.763TyrHis: 0.763 ± 0.225
2.289TyrIle: 2.289 ± 0.31
3.053TyrLys: 3.053 ± 1.248
4.325TyrLeu: 4.325 ± 0.779
0.763TyrMet: 0.763 ± 0.225
2.289TyrAsn: 2.289 ± 0.577
1.781TyrPro: 1.781 ± 0.287
0.763TyrGln: 0.763 ± 0.225
2.289TyrArg: 2.289 ± 1.026
2.798TyrSer: 2.798 ± 0.77
0.763TyrThr: 0.763 ± 0.326
1.781TyrVal: 1.781 ± 0.466
0.763TyrTrp: 0.763 ± 0.733
0.763TyrTyr: 0.763 ± 0.505
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (3932 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski