Amino acid dipepetide frequency for Sierra Nevada virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.977AlaAla: 5.977 ± 2.128
1.358AlaCys: 1.358 ± 0.459
2.988AlaAsp: 2.988 ± 0.806
6.52AlaGlu: 6.52 ± 4.082
1.63AlaPhe: 1.63 ± 0.644
6.248AlaGly: 6.248 ± 1.107
1.63AlaHis: 1.63 ± 0.627
2.988AlaIle: 2.988 ± 0.806
2.717AlaLys: 2.717 ± 1.32
8.693AlaLeu: 8.693 ± 1.769
1.358AlaMet: 1.358 ± 0.574
1.358AlaAsn: 1.358 ± 0.607
4.618AlaPro: 4.618 ± 1.421
3.26AlaGln: 3.26 ± 0.697
5.162AlaArg: 5.162 ± 1.639
4.618AlaSer: 4.618 ± 1.493
7.878AlaThr: 7.878 ± 1.936
2.988AlaVal: 2.988 ± 0.604
1.358AlaTrp: 1.358 ± 0.561
2.717AlaTyr: 2.717 ± 0.827
0.0AlaXaa: 0.0 ± 0.0
Cys
0.543CysAla: 0.543 ± 0.273
0.815CysCys: 0.815 ± 0.457
0.543CysAsp: 0.543 ± 0.273
0.815CysGlu: 0.815 ± 0.322
0.543CysPhe: 0.543 ± 0.444
0.815CysGly: 0.815 ± 0.322
1.63CysHis: 1.63 ± 0.914
1.087CysIle: 1.087 ± 0.526
1.358CysLys: 1.358 ± 0.805
2.717CysLeu: 2.717 ± 0.626
0.272CysMet: 0.272 ± 0.152
0.815CysAsn: 0.815 ± 0.322
0.815CysPro: 0.815 ± 0.322
1.087CysGln: 1.087 ± 0.591
0.272CysArg: 0.272 ± 0.281
2.445CysSer: 2.445 ± 0.967
2.173CysThr: 2.173 ± 0.884
0.543CysVal: 0.543 ± 0.273
0.272CysTrp: 0.272 ± 0.152
1.902CysTyr: 1.902 ± 0.795
0.0CysXaa: 0.0 ± 0.0
Asp
1.358AspAla: 1.358 ± 1.277
0.543AspCys: 0.543 ± 0.261
1.358AspAsp: 1.358 ± 0.334
4.075AspGlu: 4.075 ± 1.346
2.173AspPhe: 2.173 ± 0.765
0.815AspGly: 0.815 ± 0.447
1.902AspHis: 1.902 ± 0.572
1.358AspIle: 1.358 ± 0.334
3.532AspLys: 3.532 ± 0.855
8.15AspLeu: 8.15 ± 2.053
0.815AspMet: 0.815 ± 0.596
1.358AspAsn: 1.358 ± 0.48
3.532AspPro: 3.532 ± 0.624
1.087AspGln: 1.087 ± 0.459
2.988AspArg: 2.988 ± 1.524
4.075AspSer: 4.075 ± 1.394
3.26AspThr: 3.26 ± 1.205
1.902AspVal: 1.902 ± 0.512
0.815AspTrp: 0.815 ± 0.365
2.988AspTyr: 2.988 ± 0.914
0.0AspXaa: 0.0 ± 0.0
Glu
8.15GluAla: 8.15 ± 2.031
0.815GluCys: 0.815 ± 0.423
5.162GluAsp: 5.162 ± 1.593
11.41GluGlu: 11.41 ± 6.853
1.63GluPhe: 1.63 ± 0.765
5.705GluGly: 5.705 ± 1.02
1.358GluHis: 1.358 ± 0.561
2.445GluIle: 2.445 ± 0.758
5.977GluLys: 5.977 ± 2.958
7.607GluLeu: 7.607 ± 1.515
1.902GluMet: 1.902 ± 0.503
2.445GluAsn: 2.445 ± 1.2
3.803GluPro: 3.803 ± 1.349
2.445GluGln: 2.445 ± 0.381
5.977GluArg: 5.977 ± 3.629
6.248GluSer: 6.248 ± 2.408
4.618GluThr: 4.618 ± 1.801
3.26GluVal: 3.26 ± 1.532
0.815GluTrp: 0.815 ± 0.376
1.902GluTyr: 1.902 ± 0.569
0.0GluXaa: 0.0 ± 0.0
Phe
1.902PheAla: 1.902 ± 0.523
0.815PheCys: 0.815 ± 0.457
0.815PheAsp: 0.815 ± 0.436
3.26PheGlu: 3.26 ± 0.838
1.902PhePhe: 1.902 ± 0.844
1.63PheGly: 1.63 ± 0.528
0.815PheHis: 0.815 ± 0.295
2.717PheIle: 2.717 ± 0.852
2.717PheLys: 2.717 ± 0.956
4.618PheLeu: 4.618 ± 1.551
0.272PheMet: 0.272 ± 0.152
0.272PheAsn: 0.272 ± 0.152
0.815PhePro: 0.815 ± 0.584
0.272PheGln: 0.272 ± 0.281
0.543PheArg: 0.543 ± 0.305
3.26PheSer: 3.26 ± 1.096
1.358PheThr: 1.358 ± 0.502
0.815PheVal: 0.815 ± 0.322
1.087PheTrp: 1.087 ± 0.522
0.815PheTyr: 0.815 ± 0.423
0.0PheXaa: 0.0 ± 0.0
Gly
4.347GlyAla: 4.347 ± 1.022
0.543GlyCys: 0.543 ± 0.305
3.803GlyAsp: 3.803 ± 0.653
4.89GlyGlu: 4.89 ± 1.081
2.445GlyPhe: 2.445 ± 0.797
4.618GlyGly: 4.618 ± 0.894
1.63GlyHis: 1.63 ± 0.675
1.087GlyIle: 1.087 ± 0.329
2.717GlyLys: 2.717 ± 0.667
6.792GlyLeu: 6.792 ± 1.983
2.173GlyMet: 2.173 ± 0.884
1.087GlyAsn: 1.087 ± 0.492
3.803GlyPro: 3.803 ± 0.826
1.902GlyGln: 1.902 ± 0.772
4.347GlyArg: 4.347 ± 0.827
3.803GlySer: 3.803 ± 1.455
3.26GlyThr: 3.26 ± 0.937
3.26GlyVal: 3.26 ± 0.671
1.087GlyTrp: 1.087 ± 0.526
1.087GlyTyr: 1.087 ± 0.522
0.0GlyXaa: 0.0 ± 0.0
His
2.173HisAla: 2.173 ± 0.958
1.358HisCys: 1.358 ± 0.566
1.358HisAsp: 1.358 ± 0.899
1.358HisGlu: 1.358 ± 0.773
0.815HisPhe: 0.815 ± 0.457
1.902HisGly: 1.902 ± 0.673
1.087HisHis: 1.087 ± 0.492
0.543HisIle: 0.543 ± 0.305
2.445HisLys: 2.445 ± 0.508
3.532HisLeu: 3.532 ± 1.589
0.272HisMet: 0.272 ± 0.152
1.902HisAsn: 1.902 ± 0.518
1.63HisPro: 1.63 ± 0.915
0.543HisGln: 0.543 ± 0.305
2.173HisArg: 2.173 ± 0.745
1.902HisSer: 1.902 ± 0.694
0.543HisThr: 0.543 ± 0.305
1.087HisVal: 1.087 ± 0.5
1.087HisTrp: 1.087 ± 0.39
0.272HisTyr: 0.272 ± 0.152
0.0HisXaa: 0.0 ± 0.0
Ile
1.358IleAla: 1.358 ± 0.551
1.087IleCys: 1.087 ± 0.431
1.902IleAsp: 1.902 ± 0.683
3.803IleGlu: 3.803 ± 0.348
2.445IlePhe: 2.445 ± 1.144
2.173IleGly: 2.173 ± 0.884
1.63IleHis: 1.63 ± 0.538
0.815IleIle: 0.815 ± 0.436
2.445IleLys: 2.445 ± 0.322
5.162IleLeu: 5.162 ± 0.63
1.087IleMet: 1.087 ± 0.291
1.087IleAsn: 1.087 ± 0.522
1.087IlePro: 1.087 ± 0.609
1.63IleGln: 1.63 ± 1.039
2.988IleArg: 2.988 ± 0.457
6.52IleSer: 6.52 ± 1.794
3.26IleThr: 3.26 ± 0.861
1.358IleVal: 1.358 ± 0.547
0.272IleTrp: 0.272 ± 0.463
0.272IleTyr: 0.272 ± 0.281
0.0IleXaa: 0.0 ± 0.0
Lys
5.705LysAla: 5.705 ± 2.262
0.815LysCys: 0.815 ± 0.322
1.902LysAsp: 1.902 ± 0.843
5.705LysGlu: 5.705 ± 2.553
0.815LysPhe: 0.815 ± 0.524
4.347LysGly: 4.347 ± 1.01
1.63LysHis: 1.63 ± 0.265
2.173LysIle: 2.173 ± 1.23
7.335LysLys: 7.335 ± 3.569
3.803LysLeu: 3.803 ± 0.887
0.543LysMet: 0.543 ± 0.261
3.532LysAsn: 3.532 ± 0.605
2.445LysPro: 2.445 ± 0.954
2.988LysGln: 2.988 ± 0.628
5.705LysArg: 5.705 ± 2.721
1.358LysSer: 1.358 ± 0.965
3.532LysThr: 3.532 ± 0.678
2.445LysVal: 2.445 ± 0.657
0.0LysTrp: 0.0 ± 0.0
1.63LysTyr: 1.63 ± 0.549
0.0LysXaa: 0.0 ± 0.0
Leu
9.237LeuAla: 9.237 ± 1.037
1.358LeuCys: 1.358 ± 0.574
3.532LeuAsp: 3.532 ± 0.874
9.237LeuGlu: 9.237 ± 1.793
3.532LeuPhe: 3.532 ± 1.148
5.977LeuGly: 5.977 ± 1.085
3.803LeuHis: 3.803 ± 1.314
5.162LeuIle: 5.162 ± 1.392
7.335LeuLys: 7.335 ± 1.467
12.497LeuLeu: 12.497 ± 4.698
2.988LeuMet: 2.988 ± 0.902
2.717LeuAsn: 2.717 ± 1.019
7.607LeuPro: 7.607 ± 2.44
4.347LeuGln: 4.347 ± 0.774
8.965LeuArg: 8.965 ± 0.873
9.508LeuSer: 9.508 ± 2.628
4.075LeuThr: 4.075 ± 1.23
4.347LeuVal: 4.347 ± 1.008
2.445LeuTrp: 2.445 ± 0.967
5.433LeuTyr: 5.433 ± 1.021
0.0LeuXaa: 0.0 ± 0.0
Met
3.532MetAla: 3.532 ± 0.599
0.0MetCys: 0.0 ± 0.0
2.173MetAsp: 2.173 ± 0.553
1.358MetGlu: 1.358 ± 0.656
0.272MetPhe: 0.272 ± 0.281
0.543MetGly: 0.543 ± 0.261
0.543MetHis: 0.543 ± 0.561
0.815MetIle: 0.815 ± 0.436
0.543MetLys: 0.543 ± 0.305
1.087MetLeu: 1.087 ± 0.528
0.543MetMet: 0.543 ± 0.595
0.543MetAsn: 0.543 ± 0.434
0.272MetPro: 0.272 ± 0.152
0.815MetGln: 0.815 ± 0.447
1.087MetArg: 1.087 ± 0.673
1.358MetSer: 1.358 ± 0.506
2.445MetThr: 2.445 ± 0.781
2.445MetVal: 2.445 ± 0.721
0.272MetTrp: 0.272 ± 0.281
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.173AsnAla: 2.173 ± 0.6
1.087AsnCys: 1.087 ± 0.794
0.815AsnAsp: 0.815 ± 0.866
1.902AsnGlu: 1.902 ± 0.483
1.087AsnPhe: 1.087 ± 0.382
1.087AsnGly: 1.087 ± 0.522
0.543AsnHis: 0.543 ± 0.305
1.63AsnIle: 1.63 ± 0.739
1.63AsnLys: 1.63 ± 0.765
3.26AsnLeu: 3.26 ± 0.655
0.272AsnMet: 0.272 ± 0.152
0.543AsnAsn: 0.543 ± 0.261
1.63AsnPro: 1.63 ± 0.528
2.717AsnGln: 2.717 ± 0.888
2.173AsnArg: 2.173 ± 0.611
1.63AsnSer: 1.63 ± 0.535
2.445AsnThr: 2.445 ± 0.714
0.815AsnVal: 0.815 ± 0.457
0.815AsnTrp: 0.815 ± 0.457
1.63AsnTyr: 1.63 ± 0.644
0.0AsnXaa: 0.0 ± 0.0
Pro
4.347ProAla: 4.347 ± 1.592
2.445ProCys: 2.445 ± 0.666
2.445ProAsp: 2.445 ± 1.126
5.162ProGlu: 5.162 ± 1.105
1.087ProPhe: 1.087 ± 0.431
1.902ProGly: 1.902 ± 0.676
1.087ProHis: 1.087 ± 0.526
1.902ProIle: 1.902 ± 0.635
1.63ProLys: 1.63 ± 0.691
5.977ProLeu: 5.977 ± 1.223
0.815ProMet: 0.815 ± 0.854
2.173ProAsn: 2.173 ± 0.639
3.532ProPro: 3.532 ± 0.841
1.902ProGln: 1.902 ± 1.359
3.532ProArg: 3.532 ± 1.997
4.618ProSer: 4.618 ± 1.532
3.803ProThr: 3.803 ± 0.591
2.717ProVal: 2.717 ± 0.723
1.087ProTrp: 1.087 ± 0.522
2.173ProTyr: 2.173 ± 0.863
0.0ProXaa: 0.0 ± 0.0
Gln
3.803GlnAla: 3.803 ± 1.246
1.087GlnCys: 1.087 ± 0.5
3.532GlnAsp: 3.532 ± 0.748
3.803GlnGlu: 3.803 ± 1.886
1.087GlnPhe: 1.087 ± 0.545
2.717GlnGly: 2.717 ± 1.064
0.272GlnHis: 0.272 ± 0.42
1.63GlnIle: 1.63 ± 0.7
2.173GlnLys: 2.173 ± 0.796
4.89GlnLeu: 4.89 ± 0.824
0.543GlnMet: 0.543 ± 0.5
0.543GlnAsn: 0.543 ± 0.371
2.445GlnPro: 2.445 ± 0.605
1.358GlnGln: 1.358 ± 0.844
2.445GlnArg: 2.445 ± 0.58
1.358GlnSer: 1.358 ± 0.547
2.173GlnThr: 2.173 ± 0.698
2.717GlnVal: 2.717 ± 0.376
0.815GlnTrp: 0.815 ± 0.322
0.543GlnTyr: 0.543 ± 0.5
0.0GlnXaa: 0.0 ± 0.0
Arg
5.162ArgAla: 5.162 ± 2.162
1.087ArgCys: 1.087 ± 0.585
4.618ArgAsp: 4.618 ± 0.623
5.162ArgGlu: 5.162 ± 2.949
1.358ArgPhe: 1.358 ± 0.744
3.532ArgGly: 3.532 ± 0.809
2.988ArgHis: 2.988 ± 0.751
3.532ArgIle: 3.532 ± 0.879
4.347ArgLys: 4.347 ± 2.483
7.607ArgLeu: 7.607 ± 1.276
1.087ArgMet: 1.087 ± 0.634
2.445ArgAsn: 2.445 ± 0.786
4.075ArgPro: 4.075 ± 0.679
2.988ArgGln: 2.988 ± 0.52
6.248ArgArg: 6.248 ± 2.947
2.717ArgSer: 2.717 ± 0.892
4.347ArgThr: 4.347 ± 1.561
2.988ArgVal: 2.988 ± 0.724
1.358ArgTrp: 1.358 ± 0.519
1.087ArgTyr: 1.087 ± 0.421
0.0ArgXaa: 0.0 ± 0.0
Ser
5.705SerAla: 5.705 ± 0.991
1.63SerCys: 1.63 ± 0.528
3.532SerAsp: 3.532 ± 0.871
7.063SerGlu: 7.063 ± 3.389
2.445SerPhe: 2.445 ± 0.781
3.26SerGly: 3.26 ± 0.817
2.173SerHis: 2.173 ± 0.863
2.988SerIle: 2.988 ± 1.331
2.445SerLys: 2.445 ± 1.16
9.237SerLeu: 9.237 ± 1.468
1.902SerMet: 1.902 ± 1.067
2.173SerAsn: 2.173 ± 0.78
4.075SerPro: 4.075 ± 1.135
2.445SerGln: 2.445 ± 0.679
4.075SerArg: 4.075 ± 1.361
6.248SerSer: 6.248 ± 1.194
3.532SerThr: 3.532 ± 0.983
3.532SerVal: 3.532 ± 0.995
1.087SerTrp: 1.087 ± 0.794
1.358SerTyr: 1.358 ± 0.366
0.0SerXaa: 0.0 ± 0.0
Thr
4.89ThrAla: 4.89 ± 1.123
1.087ThrCys: 1.087 ± 0.794
2.717ThrAsp: 2.717 ± 1.367
4.89ThrGlu: 4.89 ± 1.043
1.902ThrPhe: 1.902 ± 0.589
4.89ThrGly: 4.89 ± 1.488
1.358ThrHis: 1.358 ± 0.48
4.618ThrIle: 4.618 ± 1.166
1.63ThrLys: 1.63 ± 0.728
7.607ThrLeu: 7.607 ± 2.079
1.358ThrMet: 1.358 ± 0.614
1.902ThrAsn: 1.902 ± 1.03
3.803ThrPro: 3.803 ± 1.455
1.358ThrGln: 1.358 ± 0.487
4.075ThrArg: 4.075 ± 0.648
3.803ThrSer: 3.803 ± 1.313
4.618ThrThr: 4.618 ± 1.563
4.075ThrVal: 4.075 ± 1.513
1.358ThrTrp: 1.358 ± 0.334
0.543ThrTyr: 0.543 ± 0.305
0.0ThrXaa: 0.0 ± 0.0
Val
3.532ValAla: 3.532 ± 0.869
1.63ValCys: 1.63 ± 0.545
2.717ValAsp: 2.717 ± 0.455
1.902ValGlu: 1.902 ± 0.383
1.63ValPhe: 1.63 ± 0.7
2.988ValGly: 2.988 ± 0.435
0.815ValHis: 0.815 ± 0.402
1.63ValIle: 1.63 ± 0.635
1.902ValLys: 1.902 ± 0.894
4.89ValLeu: 4.89 ± 1.698
1.63ValMet: 1.63 ± 0.535
0.815ValAsn: 0.815 ± 0.376
1.902ValPro: 1.902 ± 0.59
3.26ValGln: 3.26 ± 0.659
2.445ValArg: 2.445 ± 0.416
2.173ValSer: 2.173 ± 0.542
3.26ValThr: 3.26 ± 1.387
2.445ValVal: 2.445 ± 0.593
0.272ValTrp: 0.272 ± 0.152
2.717ValTyr: 2.717 ± 0.976
0.0ValXaa: 0.0 ± 0.0
Trp
0.543TrpAla: 0.543 ± 0.261
1.087TrpCys: 1.087 ± 0.609
0.815TrpAsp: 0.815 ± 0.866
0.815TrpGlu: 0.815 ± 0.322
1.087TrpPhe: 1.087 ± 0.431
0.815TrpGly: 0.815 ± 0.457
0.543TrpHis: 0.543 ± 0.261
0.815TrpIle: 0.815 ± 0.457
1.63TrpLys: 1.63 ± 0.549
1.63TrpLeu: 1.63 ± 0.783
0.543TrpMet: 0.543 ± 0.305
0.815TrpAsn: 0.815 ± 0.52
1.087TrpPro: 1.087 ± 0.322
1.358TrpGln: 1.358 ± 0.475
1.358TrpArg: 1.358 ± 0.435
0.815TrpSer: 0.815 ± 0.365
1.087TrpThr: 1.087 ± 0.431
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.272TrpTyr: 0.272 ± 0.463
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.63TyrAla: 1.63 ± 0.591
0.815TyrCys: 0.815 ± 0.436
1.358TyrAsp: 1.358 ± 0.435
0.543TyrGlu: 0.543 ± 0.475
0.815TyrPhe: 0.815 ± 0.322
2.717TyrGly: 2.717 ± 1.216
0.543TyrHis: 0.543 ± 0.393
2.445TyrIle: 2.445 ± 0.967
1.902TyrLys: 1.902 ± 0.817
4.347TyrLeu: 4.347 ± 1.217
0.0TyrMet: 0.0 ± 0.0
1.358TyrAsn: 1.358 ± 0.435
1.358TyrPro: 1.358 ± 0.366
2.173TyrGln: 2.173 ± 0.524
2.445TyrArg: 2.445 ± 0.758
2.445TyrSer: 2.445 ± 0.781
0.815TyrThr: 0.815 ± 0.322
0.815TyrVal: 0.815 ± 0.836
0.815TyrTrp: 0.815 ± 0.457
0.543TyrTyr: 0.543 ± 0.273
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3682 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski