Amino acid dipepetide frequency for Muir Springs virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.118AlaAla: 3.118 ± 1.077
1.039AlaCys: 1.039 ± 0.624
2.338AlaAsp: 2.338 ± 1.005
3.637AlaGlu: 3.637 ± 2.123
1.819AlaPhe: 1.819 ± 0.412
2.338AlaGly: 2.338 ± 0.775
1.039AlaHis: 1.039 ± 0.347
3.118AlaIle: 3.118 ± 1.339
1.819AlaLys: 1.819 ± 0.864
4.417AlaLeu: 4.417 ± 1.302
1.299AlaMet: 1.299 ± 0.921
2.338AlaAsn: 2.338 ± 1.524
2.078AlaPro: 2.078 ± 0.766
1.819AlaGln: 1.819 ± 0.366
4.417AlaArg: 4.417 ± 1.547
2.338AlaSer: 2.338 ± 0.379
2.858AlaThr: 2.858 ± 0.812
1.819AlaVal: 1.819 ± 1.731
0.0AlaTrp: 0.0 ± 0.0
2.078AlaTyr: 2.078 ± 0.811
0.0AlaXaa: 0.0 ± 0.0
Cys
0.26CysAla: 0.26 ± 0.368
0.52CysCys: 0.52 ± 0.312
0.779CysAsp: 0.779 ± 0.468
0.52CysGlu: 0.52 ± 0.266
0.52CysPhe: 0.52 ± 0.312
1.559CysGly: 1.559 ± 0.55
0.0CysHis: 0.0 ± 0.0
1.559CysIle: 1.559 ± 0.4
0.779CysLys: 0.779 ± 0.773
1.299CysLeu: 1.299 ± 0.518
0.26CysMet: 0.26 ± 0.156
0.52CysAsn: 0.52 ± 0.571
0.779CysPro: 0.779 ± 0.811
0.779CysGln: 0.779 ± 0.482
0.0CysArg: 0.0 ± 0.0
1.299CysSer: 1.299 ± 0.518
0.779CysThr: 0.779 ± 0.401
0.26CysVal: 0.26 ± 0.338
0.779CysTrp: 0.779 ± 0.468
0.779CysTyr: 0.779 ± 0.477
0.0CysXaa: 0.0 ± 0.0
Asp
2.338AspAla: 2.338 ± 0.638
0.26AspCys: 0.26 ± 0.338
4.936AspAsp: 4.936 ± 2.249
3.897AspGlu: 3.897 ± 1.394
2.858AspPhe: 2.858 ± 0.382
3.637AspGly: 3.637 ± 0.371
2.078AspHis: 2.078 ± 0.289
3.897AspIle: 3.897 ± 1.099
3.378AspLys: 3.378 ± 0.95
6.235AspLeu: 6.235 ± 2.221
2.078AspMet: 2.078 ± 0.674
2.598AspAsn: 2.598 ± 0.837
3.378AspPro: 3.378 ± 0.679
2.078AspGln: 2.078 ± 0.593
2.598AspArg: 2.598 ± 0.533
3.637AspSer: 3.637 ± 1.31
1.819AspThr: 1.819 ± 0.507
2.338AspVal: 2.338 ± 0.346
0.779AspTrp: 0.779 ± 0.277
3.118AspTyr: 3.118 ± 1.122
0.0AspXaa: 0.0 ± 0.0
Glu
1.559GluAla: 1.559 ± 0.958
1.039GluCys: 1.039 ± 0.36
5.456GluAsp: 5.456 ± 0.881
7.794GluGlu: 7.794 ± 2.512
2.598GluPhe: 2.598 ± 1.172
5.456GluGly: 5.456 ± 1.038
2.078GluHis: 2.078 ± 1.262
6.235GluIle: 6.235 ± 1.01
5.456GluLys: 5.456 ± 0.761
8.054GluLeu: 8.054 ± 0.749
2.598GluMet: 2.598 ± 0.834
4.677GluAsn: 4.677 ± 0.927
1.819GluPro: 1.819 ± 1.221
1.819GluGln: 1.819 ± 0.82
2.858GluArg: 2.858 ± 0.594
2.858GluSer: 2.858 ± 0.711
6.495GluThr: 6.495 ± 2.075
4.677GluVal: 4.677 ± 1.32
1.559GluTrp: 1.559 ± 1.038
2.338GluTyr: 2.338 ± 0.699
0.0GluXaa: 0.0 ± 0.0
Phe
2.078PheAla: 2.078 ± 0.885
0.779PheCys: 0.779 ± 0.275
1.299PheAsp: 1.299 ± 0.461
2.858PheGlu: 2.858 ± 0.788
3.118PhePhe: 3.118 ± 0.664
1.819PheGly: 1.819 ± 0.446
0.779PheHis: 0.779 ± 0.275
2.338PheIle: 2.338 ± 0.623
1.559PheLys: 1.559 ± 0.887
5.716PheLeu: 5.716 ± 1.267
0.52PheMet: 0.52 ± 0.286
3.378PheAsn: 3.378 ± 1.862
2.338PhePro: 2.338 ± 0.504
1.039PheGln: 1.039 ± 0.572
2.858PheArg: 2.858 ± 0.443
4.417PheSer: 4.417 ± 1.891
1.819PheThr: 1.819 ± 0.608
2.338PheVal: 2.338 ± 0.501
0.52PheTrp: 0.52 ± 0.676
1.819PheTyr: 1.819 ± 1.109
0.0PheXaa: 0.0 ± 0.0
Gly
2.078GlyAla: 2.078 ± 0.81
1.039GlyCys: 1.039 ± 0.297
4.157GlyAsp: 4.157 ± 0.786
2.598GlyGlu: 2.598 ± 1.551
2.598GlyPhe: 2.598 ± 0.891
2.598GlyGly: 2.598 ± 1.082
0.779GlyHis: 0.779 ± 0.477
3.637GlyIle: 3.637 ± 0.607
4.417GlyLys: 4.417 ± 0.917
7.275GlyLeu: 7.275 ± 1.722
1.299GlyMet: 1.299 ± 0.477
2.078GlyAsn: 2.078 ± 0.622
1.819GlyPro: 1.819 ± 0.402
2.598GlyGln: 2.598 ± 1.219
2.858GlyArg: 2.858 ± 1.123
3.118GlySer: 3.118 ± 0.833
3.378GlyThr: 3.378 ± 0.666
3.637GlyVal: 3.637 ± 1.353
0.779GlyTrp: 0.779 ± 0.275
1.299GlyTyr: 1.299 ± 0.252
0.0GlyXaa: 0.0 ± 0.0
His
0.26HisAla: 0.26 ± 0.156
0.0HisCys: 0.0 ± 0.0
0.52HisAsp: 0.52 ± 0.532
1.819HisGlu: 1.819 ± 0.708
1.299HisPhe: 1.299 ± 0.586
0.52HisGly: 0.52 ± 0.312
1.559HisHis: 1.559 ± 0.431
1.819HisIle: 1.819 ± 0.764
2.338HisLys: 2.338 ± 0.813
2.858HisLeu: 2.858 ± 0.36
0.0HisMet: 0.0 ± 0.0
1.299HisAsn: 1.299 ± 0.963
1.299HisPro: 1.299 ± 0.482
1.039HisGln: 1.039 ± 0.889
1.559HisArg: 1.559 ± 0.297
1.819HisSer: 1.819 ± 0.708
0.52HisThr: 0.52 ± 0.635
0.779HisVal: 0.779 ± 0.482
0.26HisTrp: 0.26 ± 0.156
0.52HisTyr: 0.52 ± 0.312
0.0HisXaa: 0.0 ± 0.0
Ile
3.897IleAla: 3.897 ± 1.384
1.299IleCys: 1.299 ± 1.257
4.157IleAsp: 4.157 ± 0.364
4.677IleGlu: 4.677 ± 0.991
2.078IlePhe: 2.078 ± 0.913
4.157IleGly: 4.157 ± 0.842
0.26IleHis: 0.26 ± 0.338
6.495IleIle: 6.495 ± 2.711
4.417IleLys: 4.417 ± 1.384
6.495IleLeu: 6.495 ± 1.191
0.779IleMet: 0.779 ± 0.468
3.897IleAsn: 3.897 ± 1.01
4.417IlePro: 4.417 ± 1.326
1.559IleGln: 1.559 ± 0.556
4.936IleArg: 4.936 ± 1.109
7.275IleSer: 7.275 ± 1.573
5.716IleThr: 5.716 ± 0.626
4.157IleVal: 4.157 ± 0.949
1.819IleTrp: 1.819 ± 0.586
3.118IleTyr: 3.118 ± 0.935
0.0IleXaa: 0.0 ± 0.0
Lys
4.157LysAla: 4.157 ± 1.457
0.52LysCys: 0.52 ± 0.266
6.755LysAsp: 6.755 ± 1.315
7.534LysGlu: 7.534 ± 1.649
2.598LysPhe: 2.598 ± 0.595
2.598LysGly: 2.598 ± 1.085
2.338LysHis: 2.338 ± 0.826
4.936LysIle: 4.936 ± 1.045
7.275LysLys: 7.275 ± 1.957
5.716LysLeu: 5.716 ± 1.639
1.039LysMet: 1.039 ± 0.36
5.976LysAsn: 5.976 ± 1.826
1.559LysPro: 1.559 ± 1.3
1.039LysGln: 1.039 ± 0.903
2.858LysArg: 2.858 ± 0.798
5.196LysSer: 5.196 ± 1.77
4.157LysThr: 4.157 ± 0.9
3.118LysVal: 3.118 ± 0.711
1.039LysTrp: 1.039 ± 0.547
2.858LysTyr: 2.858 ± 1.105
0.0LysXaa: 0.0 ± 0.0
Leu
5.976LeuAla: 5.976 ± 1.41
1.819LeuCys: 1.819 ± 0.559
5.976LeuAsp: 5.976 ± 0.94
7.794LeuGlu: 7.794 ± 1.924
3.637LeuPhe: 3.637 ± 0.488
5.456LeuGly: 5.456 ± 1.046
2.598LeuHis: 2.598 ± 1.323
7.794LeuIle: 7.794 ± 1.006
6.495LeuLys: 6.495 ± 2.016
6.755LeuLeu: 6.755 ± 1.267
1.559LeuMet: 1.559 ± 0.43
5.456LeuAsn: 5.456 ± 1.287
4.157LeuPro: 4.157 ± 1.25
2.858LeuGln: 2.858 ± 0.477
4.936LeuArg: 4.936 ± 0.547
6.755LeuSer: 6.755 ± 1.686
5.456LeuThr: 5.456 ± 1.864
3.637LeuVal: 3.637 ± 0.534
0.779LeuTrp: 0.779 ± 0.277
3.378LeuTyr: 3.378 ± 0.853
0.0LeuXaa: 0.0 ± 0.0
Met
1.819MetAla: 1.819 ± 0.534
0.0MetCys: 0.0 ± 0.0
1.299MetAsp: 1.299 ± 0.648
1.559MetGlu: 1.559 ± 0.797
1.299MetPhe: 1.299 ± 0.763
1.039MetGly: 1.039 ± 0.493
0.52MetHis: 0.52 ± 0.286
2.598MetIle: 2.598 ± 0.707
1.039MetLys: 1.039 ± 0.432
1.299MetLeu: 1.299 ± 0.461
1.299MetMet: 1.299 ± 0.847
0.52MetAsn: 0.52 ± 0.321
0.26MetPro: 0.26 ± 0.156
1.299MetGln: 1.299 ± 0.541
1.039MetArg: 1.039 ± 1.005
3.897MetSer: 3.897 ± 1.053
1.819MetThr: 1.819 ± 0.507
1.559MetVal: 1.559 ± 0.767
0.0MetTrp: 0.0 ± 0.0
0.52MetTyr: 0.52 ± 0.312
0.0MetXaa: 0.0 ± 0.0
Asn
1.299AsnAla: 1.299 ± 0.923
0.779AsnCys: 0.779 ± 0.644
2.078AsnAsp: 2.078 ± 0.289
3.637AsnGlu: 3.637 ± 0.719
2.338AsnPhe: 2.338 ± 0.379
3.118AsnGly: 3.118 ± 1.681
1.299AsnHis: 1.299 ± 0.643
3.378AsnIle: 3.378 ± 0.401
3.378AsnLys: 3.378 ± 0.762
4.936AsnLeu: 4.936 ± 1.072
1.559AsnMet: 1.559 ± 0.955
3.637AsnAsn: 3.637 ± 0.804
3.118AsnPro: 3.118 ± 1.367
3.637AsnGln: 3.637 ± 0.719
2.598AsnArg: 2.598 ± 0.381
5.456AsnSer: 5.456 ± 1.672
2.598AsnThr: 2.598 ± 0.707
2.598AsnVal: 2.598 ± 0.701
0.779AsnTrp: 0.779 ± 0.587
3.378AsnTyr: 3.378 ± 0.703
0.0AsnXaa: 0.0 ± 0.0
Pro
2.338ProAla: 2.338 ± 0.728
0.0ProCys: 0.0 ± 0.0
3.378ProAsp: 3.378 ± 1.306
2.338ProGlu: 2.338 ± 0.679
1.819ProPhe: 1.819 ± 0.709
1.559ProGly: 1.559 ± 0.528
0.779ProHis: 0.779 ± 0.275
4.417ProIle: 4.417 ± 1.342
3.378ProLys: 3.378 ± 0.772
3.118ProLeu: 3.118 ± 0.608
0.779ProMet: 0.779 ± 0.367
1.819ProAsn: 1.819 ± 1.093
1.299ProPro: 1.299 ± 0.672
0.52ProGln: 0.52 ± 0.571
1.559ProArg: 1.559 ± 0.297
4.417ProSer: 4.417 ± 0.667
1.299ProThr: 1.299 ± 0.643
2.598ProVal: 2.598 ± 1.215
0.52ProTrp: 0.52 ± 0.312
1.559ProTyr: 1.559 ± 0.675
0.0ProXaa: 0.0 ± 0.0
Gln
1.819GlnAla: 1.819 ± 0.82
0.26GlnCys: 0.26 ± 0.156
2.078GlnAsp: 2.078 ± 1.141
4.677GlnGlu: 4.677 ± 2.161
1.299GlnPhe: 1.299 ± 0.541
2.598GlnGly: 2.598 ± 0.42
0.779GlnHis: 0.779 ± 0.275
1.559GlnIle: 1.559 ± 0.602
1.559GlnLys: 1.559 ± 0.869
3.378GlnLeu: 3.378 ± 1.257
1.039GlnMet: 1.039 ± 0.297
1.819GlnAsn: 1.819 ± 0.634
1.039GlnPro: 1.039 ± 0.493
1.039GlnGln: 1.039 ± 0.815
1.819GlnArg: 1.819 ± 0.525
1.299GlnSer: 1.299 ± 0.461
1.039GlnThr: 1.039 ± 1.003
1.559GlnVal: 1.559 ± 1.205
0.52GlnTrp: 0.52 ± 0.532
1.819GlnTyr: 1.819 ± 0.608
0.0GlnXaa: 0.0 ± 0.0
Arg
2.598ArgAla: 2.598 ± 1.219
1.819ArgCys: 1.819 ± 0.899
1.559ArgAsp: 1.559 ± 0.982
3.637ArgGlu: 3.637 ± 0.707
2.858ArgPhe: 2.858 ± 0.36
2.078ArgGly: 2.078 ± 0.622
1.559ArgHis: 1.559 ± 0.431
3.378ArgIle: 3.378 ± 0.964
4.677ArgLys: 4.677 ± 1.648
5.456ArgLeu: 5.456 ± 0.775
1.559ArgMet: 1.559 ± 0.383
3.118ArgAsn: 3.118 ± 0.857
1.819ArgPro: 1.819 ± 0.229
2.338ArgGln: 2.338 ± 1.064
3.378ArgArg: 3.378 ± 1.625
1.819ArgSer: 1.819 ± 1.504
2.858ArgThr: 2.858 ± 0.72
3.897ArgVal: 3.897 ± 0.747
1.559ArgTrp: 1.559 ± 0.937
1.819ArgTyr: 1.819 ± 0.438
0.0ArgXaa: 0.0 ± 0.0
Ser
3.118SerAla: 3.118 ± 1.474
0.52SerCys: 0.52 ± 0.266
5.716SerAsp: 5.716 ± 1.696
5.196SerGlu: 5.196 ± 1.246
2.858SerPhe: 2.858 ± 1.356
3.637SerGly: 3.637 ± 1.083
1.039SerHis: 1.039 ± 0.686
5.716SerIle: 5.716 ± 0.568
7.275SerLys: 7.275 ± 2.326
4.936SerLeu: 4.936 ± 1.228
2.598SerMet: 2.598 ± 0.573
3.897SerAsn: 3.897 ± 0.628
2.078SerPro: 2.078 ± 0.904
1.559SerGln: 1.559 ± 0.611
4.677SerArg: 4.677 ± 0.669
5.456SerSer: 5.456 ± 1.246
4.677SerThr: 4.677 ± 1.175
3.637SerVal: 3.637 ± 0.718
1.299SerTrp: 1.299 ± 0.781
3.378SerTyr: 3.378 ± 0.469
0.0SerXaa: 0.0 ± 0.0
Thr
2.858ThrAla: 2.858 ± 1.978
0.52ThrCys: 0.52 ± 0.266
1.039ThrAsp: 1.039 ± 0.531
5.196ThrGlu: 5.196 ± 1.163
2.078ThrPhe: 2.078 ± 0.567
3.897ThrGly: 3.897 ± 0.961
1.299ThrHis: 1.299 ± 0.355
4.677ThrIle: 4.677 ± 1.708
5.716ThrLys: 5.716 ± 1.151
4.677ThrLeu: 4.677 ± 0.771
1.299ThrMet: 1.299 ± 0.728
2.078ThrAsn: 2.078 ± 0.954
1.819ThrPro: 1.819 ± 0.919
1.299ThrGln: 1.299 ± 0.648
3.637ThrArg: 3.637 ± 0.558
4.677ThrSer: 4.677 ± 1.175
4.157ThrThr: 4.157 ± 1.108
4.157ThrVal: 4.157 ± 0.572
1.039ThrTrp: 1.039 ± 0.347
2.338ThrTyr: 2.338 ± 0.813
0.0ThrXaa: 0.0 ± 0.0
Val
3.118ValAla: 3.118 ± 1.913
0.779ValCys: 0.779 ± 0.468
2.338ValAsp: 2.338 ± 1.189
3.378ValGlu: 3.378 ± 0.576
2.078ValPhe: 2.078 ± 0.72
2.078ValGly: 2.078 ± 0.396
0.52ValHis: 0.52 ± 0.676
3.637ValIle: 3.637 ± 0.671
2.598ValLys: 2.598 ± 0.932
6.235ValLeu: 6.235 ± 1.12
1.299ValMet: 1.299 ± 0.765
3.378ValAsn: 3.378 ± 0.576
2.338ValPro: 2.338 ± 0.779
2.338ValGln: 2.338 ± 0.885
3.118ValArg: 3.118 ± 1.367
3.637ValSer: 3.637 ± 0.947
3.897ValThr: 3.897 ± 0.233
2.338ValVal: 2.338 ± 0.416
0.26ValTrp: 0.26 ± 0.156
2.078ValTyr: 2.078 ± 1.11
0.0ValXaa: 0.0 ± 0.0
Trp
0.52TrpAla: 0.52 ± 0.312
0.52TrpCys: 0.52 ± 0.453
0.26TrpAsp: 0.26 ± 0.368
2.338TrpGlu: 2.338 ± 0.674
2.078TrpPhe: 2.078 ± 0.459
1.039TrpGly: 1.039 ± 0.36
0.52TrpHis: 0.52 ± 0.312
1.039TrpIle: 1.039 ± 0.432
1.559TrpLys: 1.559 ± 0.619
0.26TrpLeu: 0.26 ± 0.156
0.52TrpMet: 0.52 ± 0.532
1.039TrpAsn: 1.039 ± 0.36
0.26TrpPro: 0.26 ± 0.156
0.26TrpGln: 0.26 ± 0.156
0.0TrpArg: 0.0 ± 0.0
1.039TrpSer: 1.039 ± 0.493
0.779TrpThr: 0.779 ± 0.587
0.779TrpVal: 0.779 ± 0.275
0.26TrpTrp: 0.26 ± 0.156
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.52TyrAla: 0.52 ± 0.321
0.52TyrCys: 0.52 ± 0.485
1.819TyrAsp: 1.819 ± 0.507
2.338TyrGlu: 2.338 ± 0.639
1.559TyrPhe: 1.559 ± 0.297
2.858TyrGly: 2.858 ± 0.667
0.26TyrHis: 0.26 ± 0.338
3.378TyrIle: 3.378 ± 2.03
4.157TyrLys: 4.157 ± 1.381
4.157TyrLeu: 4.157 ± 0.637
1.039TyrMet: 1.039 ± 0.36
2.078TyrAsn: 2.078 ± 1.249
1.819TyrPro: 1.819 ± 0.229
2.078TyrGln: 2.078 ± 0.783
2.078TyrArg: 2.078 ± 0.879
2.858TyrSer: 2.858 ± 0.714
2.338TyrThr: 2.338 ± 0.813
1.819TyrVal: 1.819 ± 1.03
0.52TyrTrp: 0.52 ± 0.266
0.26TyrTyr: 0.26 ± 0.156
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3850 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski