Amino acid dipeptide frequency for Salem virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
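As a rough illustration of the calculation described above, the following Python sketch counts overlapping residue pairs and compares each frequency with the uniform expectation of 1/400. The function names, the use of one-letter codes, and the choice to count pairs only within each protein are assumptions made for this example; the page does not state how the database handles these details.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes

# Uniform expectation: 1 of 400 dipeptides, i.e. 0.25% or 2.5 per mille.
EXPECTED = 1000.0 / len(AMINO_ACIDS) ** 2


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: pairs are counted within each protein only. Pairs containing
    non-standard letters count toward the total but are not reported.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs: positions (0, 1), (1, 2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(AMINO_ACIDS, repeat=2)}


def classify(freq):
    """Label a per-mille frequency relative to the uniform expectation."""
    if freq > EXPECTED:
        return "over"   # marked in red in the table
    if freq < EXPECTED:
        return "under"  # marked in blue in the table
    return "expected"
```

For example, `classify(11.73)` labels the LeuLeu value from the table as overrepresented, while `classify(0.196)` labels AlaTrp as underrepresented.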

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
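The unit conversion can be sketched in a few lines of Python. The helper names are hypothetical and introduced only for this example; the LeuLeu value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


leu_leu = 11.73  # LeuLeu frequency from the table, in per mille
print(per_mille_to_fraction(leu_leu))  # fraction of all dipeptides
print(per_mille_to_percent(leu_leu))   # the same value in percent
```

On this scale, the uniform expectation of 0.25% corresponds to 2.5‰.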

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
2.151AlaAla: 2.151 ± 0.435
1.564AlaCys: 1.564 ± 0.587
3.715AlaAsp: 3.715 ± 1.031
3.519AlaGlu: 3.519 ± 0.659
1.76AlaPhe: 1.76 ± 0.897
4.301AlaGly: 4.301 ± 1.356
2.346AlaHis: 2.346 ± 0.615
5.279AlaIle: 5.279 ± 0.484
2.151AlaLys: 2.151 ± 0.815
5.67AlaLeu: 5.67 ± 1.0
1.173AlaMet: 1.173 ± 0.66
2.542AlaAsn: 2.542 ± 0.824
2.151AlaPro: 2.151 ± 0.576
2.151AlaGln: 2.151 ± 0.816
2.346AlaArg: 2.346 ± 0.561
3.91AlaSer: 3.91 ± 0.514
2.933AlaThr: 2.933 ± 0.948
3.91AlaVal: 3.91 ± 1.076
0.196AlaTrp: 0.196 ± 0.201
0.391AlaTyr: 0.391 ± 0.175
0.0AlaXaa: 0.0 ± 0.0
Cys
0.782CysAla: 0.782 ± 0.339
0.196CysCys: 0.196 ± 0.119
0.782CysAsp: 0.782 ± 0.308
0.391CysGlu: 0.391 ± 0.248
0.391CysPhe: 0.391 ± 0.175
1.173CysGly: 1.173 ± 0.817
0.391CysHis: 0.391 ± 0.274
0.391CysIle: 0.391 ± 0.224
0.782CysLys: 0.782 ± 0.433
1.173CysLeu: 1.173 ± 0.391
0.782CysMet: 0.782 ± 0.439
0.782CysAsn: 0.782 ± 0.547
1.369CysPro: 1.369 ± 0.42
0.391CysGln: 0.391 ± 0.237
0.587CysArg: 0.587 ± 0.294
2.151CysSer: 2.151 ± 0.574
1.564CysThr: 1.564 ± 0.947
0.978CysVal: 0.978 ± 0.413
0.196CysTrp: 0.196 ± 0.286
0.782CysTyr: 0.782 ± 0.383
0.0CysXaa: 0.0 ± 0.0
Asp
1.955AspAla: 1.955 ± 0.636
0.587AspCys: 0.587 ± 0.357
4.106AspAsp: 4.106 ± 1.597
2.737AspGlu: 2.737 ± 0.635
1.76AspPhe: 1.76 ± 0.525
2.737AspGly: 2.737 ± 0.511
1.564AspHis: 1.564 ± 0.463
2.737AspIle: 2.737 ± 0.597
3.128AspLys: 3.128 ± 0.602
8.407AspLeu: 8.407 ± 0.943
0.978AspMet: 0.978 ± 0.395
1.564AspAsn: 1.564 ± 0.355
4.497AspPro: 4.497 ± 0.548
3.519AspGln: 3.519 ± 0.741
2.151AspArg: 2.151 ± 0.89
3.128AspSer: 3.128 ± 0.798
2.933AspThr: 2.933 ± 0.53
3.324AspVal: 3.324 ± 0.611
0.782AspTrp: 0.782 ± 0.351
1.955AspTyr: 1.955 ± 0.364
0.0AspXaa: 0.0 ± 0.0
Glu
2.933GluAla: 2.933 ± 0.855
0.978GluCys: 0.978 ± 0.577
3.324GluAsp: 3.324 ± 0.807
4.106GluGlu: 4.106 ± 1.263
2.542GluPhe: 2.542 ± 0.458
4.106GluGly: 4.106 ± 0.841
2.151GluHis: 2.151 ± 0.461
4.497GluIle: 4.497 ± 1.12
2.737GluLys: 2.737 ± 0.591
5.083GluLeu: 5.083 ± 1.017
1.173GluMet: 1.173 ± 0.58
2.346GluAsn: 2.346 ± 0.7
1.955GluPro: 1.955 ± 0.442
1.173GluGln: 1.173 ± 0.307
2.542GluArg: 2.542 ± 0.485
3.715GluSer: 3.715 ± 0.983
2.737GluThr: 2.737 ± 0.628
4.692GluVal: 4.692 ± 0.865
0.391GluTrp: 0.391 ± 0.32
1.955GluTyr: 1.955 ± 0.419
0.0GluXaa: 0.0 ± 0.0
Phe
2.933PheAla: 2.933 ± 0.77
0.978PheCys: 0.978 ± 0.335
0.978PheAsp: 0.978 ± 0.502
1.564PheGlu: 1.564 ± 0.553
1.369PhePhe: 1.369 ± 0.83
2.346PheGly: 2.346 ± 0.663
0.978PheHis: 0.978 ± 0.384
2.151PheIle: 2.151 ± 0.419
1.564PheLys: 1.564 ± 0.522
4.106PheLeu: 4.106 ± 0.754
0.587PheMet: 0.587 ± 0.262
2.542PheAsn: 2.542 ± 0.453
1.564PhePro: 1.564 ± 0.464
0.782PheGln: 0.782 ± 0.459
2.933PheArg: 2.933 ± 0.76
2.346PheSer: 2.346 ± 0.796
0.978PheThr: 0.978 ± 0.478
1.369PheVal: 1.369 ± 0.359
0.391PheTrp: 0.391 ± 0.175
1.369PheTyr: 1.369 ± 0.699
0.0PheXaa: 0.0 ± 0.0
Gly
3.91GlyAla: 3.91 ± 1.248
0.391GlyCys: 0.391 ± 0.237
3.715GlyAsp: 3.715 ± 1.256
2.737GlyGlu: 2.737 ± 1.086
2.542GlyPhe: 2.542 ± 0.623
2.346GlyGly: 2.346 ± 0.389
1.76GlyHis: 1.76 ± 0.257
4.692GlyIle: 4.692 ± 0.697
4.106GlyLys: 4.106 ± 1.434
6.647GlyLeu: 6.647 ± 1.165
1.173GlyMet: 1.173 ± 0.247
3.128GlyAsn: 3.128 ± 0.596
1.76GlyPro: 1.76 ± 0.49
2.151GlyGln: 2.151 ± 0.744
2.737GlyArg: 2.737 ± 0.622
4.106GlySer: 4.106 ± 1.205
2.737GlyThr: 2.737 ± 0.862
5.279GlyVal: 5.279 ± 0.999
0.587GlyTrp: 0.587 ± 0.221
2.151GlyTyr: 2.151 ± 0.564
0.0GlyXaa: 0.0 ± 0.0
His
0.978HisAla: 0.978 ± 0.322
0.391HisCys: 0.391 ± 0.237
0.978HisAsp: 0.978 ± 0.329
0.978HisGlu: 0.978 ± 0.341
0.978HisPhe: 0.978 ± 0.411
1.173HisGly: 1.173 ± 0.349
0.0HisHis: 0.0 ± 0.0
1.76HisIle: 1.76 ± 0.828
1.173HisLys: 1.173 ± 0.417
4.106HisLeu: 4.106 ± 0.512
0.587HisMet: 0.587 ± 0.309
0.587HisAsn: 0.587 ± 0.267
1.369HisPro: 1.369 ± 0.531
1.76HisGln: 1.76 ± 0.72
1.564HisArg: 1.564 ± 0.322
2.542HisSer: 2.542 ± 0.435
0.978HisThr: 0.978 ± 0.516
0.782HisVal: 0.782 ± 0.317
0.587HisTrp: 0.587 ± 0.221
2.151HisTyr: 2.151 ± 0.51
0.0HisXaa: 0.0 ± 0.0
Ile
3.715IleAla: 3.715 ± 1.22
1.369IleCys: 1.369 ± 0.505
4.888IleAsp: 4.888 ± 1.193
3.715IleGlu: 3.715 ± 0.839
1.955IlePhe: 1.955 ± 0.466
4.301IleGly: 4.301 ± 0.796
1.76IleHis: 1.76 ± 0.482
4.497IleIle: 4.497 ± 0.937
4.692IleLys: 4.692 ± 0.958
7.429IleLeu: 7.429 ± 0.791
1.564IleMet: 1.564 ± 0.522
3.324IleAsn: 3.324 ± 0.621
4.106IlePro: 4.106 ± 1.42
2.933IleGln: 2.933 ± 0.807
3.128IleArg: 3.128 ± 0.791
5.67IleSer: 5.67 ± 1.444
4.106IleThr: 4.106 ± 1.659
4.888IleVal: 4.888 ± 1.195
0.978IleTrp: 0.978 ± 0.457
2.151IleTyr: 2.151 ± 0.764
0.0IleXaa: 0.0 ± 0.0
Lys
2.542LysAla: 2.542 ± 0.601
1.173LysCys: 1.173 ± 0.482
2.346LysAsp: 2.346 ± 0.704
2.933LysGlu: 2.933 ± 0.83
1.564LysPhe: 1.564 ± 0.424
2.933LysGly: 2.933 ± 0.611
1.173LysHis: 1.173 ± 0.285
3.324LysIle: 3.324 ± 1.161
2.346LysLys: 2.346 ± 0.644
7.234LysLeu: 7.234 ± 1.127
1.173LysMet: 1.173 ± 0.583
2.346LysAsn: 2.346 ± 1.313
2.346LysPro: 2.346 ± 0.617
2.346LysGln: 2.346 ± 0.504
2.737LysArg: 2.737 ± 1.254
4.888LysSer: 4.888 ± 1.657
3.128LysThr: 3.128 ± 0.966
3.715LysVal: 3.715 ± 1.089
0.782LysTrp: 0.782 ± 0.351
2.542LysTyr: 2.542 ± 0.752
0.0LysXaa: 0.0 ± 0.0
Leu
8.016LeuAla: 8.016 ± 1.326
1.369LeuCys: 1.369 ± 0.417
4.692LeuAsp: 4.692 ± 0.65
7.234LeuGlu: 7.234 ± 0.831
3.715LeuPhe: 3.715 ± 0.798
7.234LeuGly: 7.234 ± 1.177
2.346LeuHis: 2.346 ± 0.483
8.016LeuIle: 8.016 ± 0.913
7.038LeuLys: 7.038 ± 1.163
11.73LeuLeu: 11.73 ± 1.131
3.128LeuMet: 3.128 ± 0.959
6.256LeuAsn: 6.256 ± 1.367
3.324LeuPro: 3.324 ± 0.674
5.67LeuGln: 5.67 ± 1.046
5.865LeuArg: 5.865 ± 0.814
8.993LeuSer: 8.993 ± 1.151
9.189LeuThr: 9.189 ± 1.213
6.061LeuVal: 6.061 ± 0.503
1.564LeuTrp: 1.564 ± 0.587
3.324LeuTyr: 3.324 ± 1.219
0.0LeuXaa: 0.0 ± 0.0
Met
0.978MetAla: 0.978 ± 0.301
0.782MetCys: 0.782 ± 0.291
0.782MetAsp: 0.782 ± 0.313
0.978MetGlu: 0.978 ± 0.362
0.587MetPhe: 0.587 ± 0.495
0.587MetGly: 0.587 ± 0.262
0.391MetHis: 0.391 ± 0.175
1.564MetIle: 1.564 ± 0.629
1.564MetLys: 1.564 ± 0.346
3.128MetLeu: 3.128 ± 0.845
0.782MetMet: 0.782 ± 0.228
1.564MetAsn: 1.564 ± 0.488
0.782MetPro: 0.782 ± 0.351
0.196MetGln: 0.196 ± 0.201
1.955MetArg: 1.955 ± 0.187
1.955MetSer: 1.955 ± 0.392
1.564MetThr: 1.564 ± 0.293
1.173MetVal: 1.173 ± 0.876
0.0MetTrp: 0.0 ± 0.0
0.587MetTyr: 0.587 ± 0.234
0.0MetXaa: 0.0 ± 0.0
Asn
1.173AsnAla: 1.173 ± 0.307
0.391AsnCys: 0.391 ± 0.478
2.933AsnAsp: 2.933 ± 0.563
1.955AsnGlu: 1.955 ± 0.605
1.955AsnPhe: 1.955 ± 0.515
2.151AsnGly: 2.151 ± 0.413
0.978AsnHis: 0.978 ± 0.343
2.346AsnIle: 2.346 ± 0.614
2.542AsnLys: 2.542 ± 0.218
5.67AsnLeu: 5.67 ± 0.508
0.587AsnMet: 0.587 ± 0.511
1.564AsnAsn: 1.564 ± 0.567
3.324AsnPro: 3.324 ± 1.001
1.76AsnGln: 1.76 ± 0.56
1.76AsnArg: 1.76 ± 0.562
5.279AsnSer: 5.279 ± 0.584
1.955AsnThr: 1.955 ± 0.754
1.955AsnVal: 1.955 ± 0.56
1.173AsnTrp: 1.173 ± 0.711
2.151AsnTyr: 2.151 ± 0.717
0.0AsnXaa: 0.0 ± 0.0
Pro
3.324ProAla: 3.324 ± 0.731
0.391ProCys: 0.391 ± 0.213
4.106ProAsp: 4.106 ± 0.701
2.542ProGlu: 2.542 ± 0.745
1.173ProPhe: 1.173 ± 0.645
2.346ProGly: 2.346 ± 0.757
1.173ProHis: 1.173 ± 0.622
3.91ProIle: 3.91 ± 1.009
2.542ProLys: 2.542 ± 0.69
4.888ProLeu: 4.888 ± 1.467
0.391ProMet: 0.391 ± 0.224
1.369ProAsn: 1.369 ± 0.628
1.369ProPro: 1.369 ± 0.365
1.955ProGln: 1.955 ± 0.812
4.106ProArg: 4.106 ± 1.036
4.497ProSer: 4.497 ± 0.781
3.128ProThr: 3.128 ± 0.488
2.346ProVal: 2.346 ± 0.817
0.391ProTrp: 0.391 ± 0.274
1.955ProTyr: 1.955 ± 0.65
0.0ProXaa: 0.0 ± 0.0
Gln
2.542GlnAla: 2.542 ± 0.426
0.587GlnCys: 0.587 ± 0.291
1.76GlnAsp: 1.76 ± 0.805
2.933GlnGlu: 2.933 ± 1.497
0.587GlnPhe: 0.587 ± 0.276
2.737GlnGly: 2.737 ± 0.432
0.587GlnHis: 0.587 ± 0.256
3.519GlnIle: 3.519 ± 0.642
1.955GlnLys: 1.955 ± 0.626
2.933GlnLeu: 2.933 ± 0.678
0.978GlnMet: 0.978 ± 0.389
1.564GlnAsn: 1.564 ± 0.279
1.369GlnPro: 1.369 ± 0.817
1.369GlnGln: 1.369 ± 0.367
1.76GlnArg: 1.76 ± 0.46
4.692GlnSer: 4.692 ± 0.894
1.955GlnThr: 1.955 ± 0.773
3.128GlnVal: 3.128 ± 0.836
0.0GlnTrp: 0.0 ± 0.0
1.76GlnTyr: 1.76 ± 0.397
0.0GlnXaa: 0.0 ± 0.0
Arg
2.933ArgAla: 2.933 ± 0.604
1.173ArgCys: 1.173 ± 0.492
2.737ArgAsp: 2.737 ± 0.445
3.128ArgGlu: 3.128 ± 0.924
1.564ArgPhe: 1.564 ± 0.429
4.106ArgGly: 4.106 ± 0.83
1.564ArgHis: 1.564 ± 0.455
3.519ArgIle: 3.519 ± 0.743
2.542ArgLys: 2.542 ± 1.235
5.279ArgLeu: 5.279 ± 1.107
1.564ArgMet: 1.564 ± 0.64
2.346ArgAsn: 2.346 ± 0.65
1.955ArgPro: 1.955 ± 0.47
1.369ArgGln: 1.369 ± 1.004
3.91ArgArg: 3.91 ± 1.112
4.301ArgSer: 4.301 ± 1.242
3.128ArgThr: 3.128 ± 0.943
4.301ArgVal: 4.301 ± 0.998
0.587ArgTrp: 0.587 ± 0.418
1.955ArgTyr: 1.955 ± 0.412
0.0ArgXaa: 0.0 ± 0.0
Ser
3.128SerAla: 3.128 ± 0.672
0.978SerCys: 0.978 ± 0.495
4.301SerAsp: 4.301 ± 0.975
3.91SerGlu: 3.91 ± 0.98
2.737SerPhe: 2.737 ± 0.55
4.497SerGly: 4.497 ± 0.843
2.346SerHis: 2.346 ± 0.483
6.061SerIle: 6.061 ± 0.477
5.279SerLys: 5.279 ± 1.228
9.971SerLeu: 9.971 ± 1.118
1.564SerMet: 1.564 ± 0.503
3.715SerAsn: 3.715 ± 0.409
4.888SerPro: 4.888 ± 0.876
3.324SerGln: 3.324 ± 1.017
5.083SerArg: 5.083 ± 0.969
9.384SerSer: 9.384 ± 2.488
6.256SerThr: 6.256 ± 1.222
5.67SerVal: 5.67 ± 1.11
0.978SerTrp: 0.978 ± 0.331
2.346SerTyr: 2.346 ± 0.622
0.0SerXaa: 0.0 ± 0.0
Thr
4.692ThrAla: 4.692 ± 0.621
0.978ThrCys: 0.978 ± 0.702
2.737ThrAsp: 2.737 ± 0.732
3.519ThrGlu: 3.519 ± 0.644
2.151ThrPhe: 2.151 ± 0.83
3.519ThrGly: 3.519 ± 0.837
1.173ThrHis: 1.173 ± 0.43
5.279ThrIle: 5.279 ± 1.516
1.955ThrLys: 1.955 ± 0.84
7.625ThrLeu: 7.625 ± 1.22
0.978ThrMet: 0.978 ± 0.573
2.346ThrAsn: 2.346 ± 0.29
2.346ThrPro: 2.346 ± 0.549
2.346ThrGln: 2.346 ± 0.914
2.737ThrArg: 2.737 ± 0.82
4.301ThrSer: 4.301 ± 0.638
3.715ThrThr: 3.715 ± 0.673
4.497ThrVal: 4.497 ± 0.54
0.782ThrTrp: 0.782 ± 0.308
2.737ThrTyr: 2.737 ± 0.573
0.0ThrXaa: 0.0 ± 0.0
Val
2.542ValAla: 2.542 ± 0.793
0.391ValCys: 0.391 ± 0.358
3.324ValAsp: 3.324 ± 0.754
4.301ValGlu: 4.301 ± 0.715
1.369ValPhe: 1.369 ± 0.46
4.301ValGly: 4.301 ± 0.661
1.76ValHis: 1.76 ± 0.519
4.888ValIle: 4.888 ± 0.862
3.519ValLys: 3.519 ± 0.61
7.429ValLeu: 7.429 ± 1.187
1.955ValMet: 1.955 ± 1.066
1.369ValAsn: 1.369 ± 0.386
3.715ValPro: 3.715 ± 1.335
1.369ValGln: 1.369 ± 0.461
3.91ValArg: 3.91 ± 0.727
6.843ValSer: 6.843 ± 1.555
4.497ValThr: 4.497 ± 0.89
3.715ValVal: 3.715 ± 0.486
0.0ValTrp: 0.0 ± 0.0
1.76ValTyr: 1.76 ± 0.584
0.0ValXaa: 0.0 ± 0.0
Trp
0.587TrpAla: 0.587 ± 0.356
0.587TrpCys: 0.587 ± 0.495
0.978TrpAsp: 0.978 ± 0.458
0.782TrpGlu: 0.782 ± 0.383
0.587TrpPhe: 0.587 ± 0.356
0.196TrpGly: 0.196 ± 0.119
0.196TrpHis: 0.196 ± 0.119
0.782TrpIle: 0.782 ± 0.351
0.782TrpLys: 0.782 ± 0.543
0.978TrpLeu: 0.978 ± 0.38
0.0TrpMet: 0.0 ± 0.0
0.391TrpAsn: 0.391 ± 0.175
0.391TrpPro: 0.391 ± 0.237
0.587TrpGln: 0.587 ± 0.28
0.196TrpArg: 0.196 ± 0.119
0.587TrpSer: 0.587 ± 0.262
0.782TrpThr: 0.782 ± 0.645
0.587TrpVal: 0.587 ± 0.234
0.196TrpTrp: 0.196 ± 0.119
0.587TrpTyr: 0.587 ± 0.356
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.346TyrAla: 2.346 ± 0.808
0.782TyrCys: 0.782 ± 0.383
1.564TyrAsp: 1.564 ± 0.553
1.173TyrGlu: 1.173 ± 0.356
2.542TyrPhe: 2.542 ± 0.603
1.564TyrGly: 1.564 ± 0.424
0.782TyrHis: 0.782 ± 0.296
1.955TyrIle: 1.955 ± 0.632
1.173TyrLys: 1.173 ± 0.379
5.279TyrLeu: 5.279 ± 1.369
0.587TyrMet: 0.587 ± 0.237
1.955TyrAsn: 1.955 ± 0.627
3.324TyrPro: 3.324 ± 0.427
1.564TyrGln: 1.564 ± 0.339
1.955TyrArg: 1.955 ± 0.387
3.128TyrSer: 3.128 ± 0.991
2.151TyrThr: 2.151 ± 0.511
0.587TyrVal: 0.587 ± 0.356
0.196TyrTrp: 0.196 ± 0.119
1.564TyrTyr: 1.564 ± 0.711
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (5116 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
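A protein-level bootstrap of this kind could look roughly like the Python sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values is reported. The function names, the fixed random seed, and the use of the standard deviation as the error measure are assumptions for illustration; the page does not specify the exact procedure.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(proteins):
    """Count overlapping residue pairs within each protein sequence."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    return counts


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Spread (per mille) of one dipeptide's frequency across bootstrap replicates.

    Assumption: the error is the standard deviation over the replicates.
    """
    rng = random.Random(seed)  # seeded so the sketch is reproducible
    values = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        counts = dipeptide_counts(sample)
        total = sum(counts.values())
        values.append(1000.0 * counts[dipeptide] / total if total else 0.0)
    return statistics.pstdev(values)
```

With only 8 proteins in this proteome, each replicate draws 8 proteins with replacement, so a dipeptide concentrated in one or two proteins will tend to show a large error relative to its frequency.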

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski