Amino acid dipepetide frequency for Barley stripe mosaic virus (BSMV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.495AlaAla: 3.495 ± 0.712
1.907AlaCys: 1.907 ± 0.447
4.449AlaAsp: 4.449 ± 1.244
4.131AlaGlu: 4.131 ± 1.505
3.178AlaPhe: 3.178 ± 0.779
3.178AlaGly: 3.178 ± 0.877
1.271AlaHis: 1.271 ± 0.759
2.86AlaIle: 2.86 ± 0.849
3.178AlaLys: 3.178 ± 1.859
8.58AlaLeu: 8.58 ± 2.623
2.224AlaMet: 2.224 ± 0.701
1.907AlaAsn: 1.907 ± 0.757
2.224AlaPro: 2.224 ± 0.89
2.86AlaGln: 2.86 ± 0.78
2.86AlaArg: 2.86 ± 0.819
4.766AlaSer: 4.766 ± 1.341
3.495AlaThr: 3.495 ± 0.707
5.72AlaVal: 5.72 ± 1.497
0.318AlaTrp: 0.318 ± 0.389
1.589AlaTyr: 1.589 ± 0.886
0.0AlaXaa: 0.0 ± 0.0
Cys
1.907CysAla: 1.907 ± 0.74
1.589CysCys: 1.589 ± 0.859
3.178CysAsp: 3.178 ± 1.083
2.542CysGlu: 2.542 ± 1.011
0.953CysPhe: 0.953 ± 0.602
2.86CysGly: 2.86 ± 2.007
0.318CysHis: 0.318 ± 0.233
0.636CysIle: 0.636 ± 0.269
0.318CysLys: 0.318 ± 0.279
1.271CysLeu: 1.271 ± 0.678
0.318CysMet: 0.318 ± 0.353
0.318CysAsn: 0.318 ± 0.233
0.636CysPro: 0.636 ± 0.391
0.636CysGln: 0.636 ± 0.487
1.589CysArg: 1.589 ± 0.665
3.495CysSer: 3.495 ± 0.696
0.953CysThr: 0.953 ± 0.256
1.271CysVal: 1.271 ± 0.465
0.0CysTrp: 0.0 ± 0.0
0.318CysTyr: 0.318 ± 0.279
0.0CysXaa: 0.0 ± 0.0
Asp
1.907AspAla: 1.907 ± 1.103
2.224AspCys: 2.224 ± 0.48
2.224AspAsp: 2.224 ± 0.598
2.542AspGlu: 2.542 ± 0.811
3.813AspPhe: 3.813 ± 1.549
3.495AspGly: 3.495 ± 0.592
1.589AspHis: 1.589 ± 0.847
4.131AspIle: 4.131 ± 0.749
5.084AspLys: 5.084 ± 1.045
6.673AspLeu: 6.673 ± 1.273
0.953AspMet: 0.953 ± 0.411
2.542AspAsn: 2.542 ± 0.462
1.271AspPro: 1.271 ± 0.622
2.86AspGln: 2.86 ± 1.397
4.131AspArg: 4.131 ± 0.795
5.402AspSer: 5.402 ± 1.514
2.224AspThr: 2.224 ± 0.845
6.673AspVal: 6.673 ± 0.991
0.636AspTrp: 0.636 ± 0.566
1.271AspTyr: 1.271 ± 0.609
0.0AspXaa: 0.0 ± 0.0
Glu
3.178GluAla: 3.178 ± 1.033
1.907GluCys: 1.907 ± 0.479
4.131GluAsp: 4.131 ± 0.605
3.178GluGlu: 3.178 ± 0.973
2.86GluPhe: 2.86 ± 0.78
2.542GluGly: 2.542 ± 0.82
0.953GluHis: 0.953 ± 0.496
5.084GluIle: 5.084 ± 1.136
4.449GluLys: 4.449 ± 0.91
4.766GluLeu: 4.766 ± 1.573
0.953GluMet: 0.953 ± 0.617
3.813GluAsn: 3.813 ± 0.675
0.636GluPro: 0.636 ± 0.673
2.224GluGln: 2.224 ± 0.578
3.813GluArg: 3.813 ± 0.516
4.766GluSer: 4.766 ± 0.869
5.084GluThr: 5.084 ± 1.875
3.495GluVal: 3.495 ± 0.889
1.271GluTrp: 1.271 ± 0.536
2.542GluTyr: 2.542 ± 0.807
0.0GluXaa: 0.0 ± 0.0
Phe
3.178PheAla: 3.178 ± 1.346
1.907PheCys: 1.907 ± 0.619
4.131PheAsp: 4.131 ± 1.241
3.813PheGlu: 3.813 ± 0.669
3.178PhePhe: 3.178 ± 0.565
3.495PheGly: 3.495 ± 1.994
0.318PheHis: 0.318 ± 0.233
0.953PheIle: 0.953 ± 0.361
3.495PheLys: 3.495 ± 0.826
4.449PheLeu: 4.449 ± 0.717
0.636PheMet: 0.636 ± 0.269
1.589PheAsn: 1.589 ± 0.778
1.589PhePro: 1.589 ± 0.314
1.907PheGln: 1.907 ± 0.664
2.224PheArg: 2.224 ± 0.43
2.542PheSer: 2.542 ± 0.611
0.953PheThr: 0.953 ± 0.256
1.907PheVal: 1.907 ± 0.466
0.318PheTrp: 0.318 ± 0.457
1.589PheTyr: 1.589 ± 0.826
0.0PheXaa: 0.0 ± 0.0
Gly
2.542GlyAla: 2.542 ± 0.596
1.589GlyCys: 1.589 ± 0.545
3.813GlyAsp: 3.813 ± 1.316
3.813GlyGlu: 3.813 ± 1.166
0.318GlyPhe: 0.318 ± 0.233
4.766GlyGly: 4.766 ± 0.857
1.907GlyHis: 1.907 ± 1.042
4.449GlyIle: 4.449 ± 1.58
3.178GlyLys: 3.178 ± 1.055
3.495GlyLeu: 3.495 ± 1.691
1.589GlyMet: 1.589 ± 0.594
2.224GlyAsn: 2.224 ± 0.764
2.86GlyPro: 2.86 ± 0.506
1.589GlyGln: 1.589 ± 0.869
2.224GlyArg: 2.224 ± 0.513
4.766GlySer: 4.766 ± 1.009
3.813GlyThr: 3.813 ± 0.714
3.495GlyVal: 3.495 ± 1.211
0.318GlyTrp: 0.318 ± 0.233
2.224GlyTyr: 2.224 ± 0.439
0.318GlyXaa: 0.318 ± 0.457
His
0.318HisAla: 0.318 ± 0.453
0.953HisCys: 0.953 ± 0.447
1.589HisAsp: 1.589 ± 0.665
1.907HisGlu: 1.907 ± 0.466
1.271HisPhe: 1.271 ± 0.622
0.318HisGly: 0.318 ± 0.233
0.318HisHis: 0.318 ± 0.381
1.271HisIle: 1.271 ± 0.564
0.953HisLys: 0.953 ± 0.526
0.953HisLeu: 0.953 ± 0.617
0.636HisMet: 0.636 ± 0.466
0.318HisAsn: 0.318 ± 0.389
0.953HisPro: 0.953 ± 1.062
0.0HisGln: 0.0 ± 0.0
0.636HisArg: 0.636 ± 0.304
3.495HisSer: 3.495 ± 1.19
0.953HisThr: 0.953 ± 0.419
1.271HisVal: 1.271 ± 0.521
0.318HisTrp: 0.318 ± 0.381
2.542HisTyr: 2.542 ± 0.871
0.0HisXaa: 0.0 ± 0.0
Ile
5.084IleAla: 5.084 ± 0.906
1.271IleCys: 1.271 ± 0.931
2.86IleAsp: 2.86 ± 0.871
3.178IleGlu: 3.178 ± 0.712
1.589IlePhe: 1.589 ± 0.58
3.495IleGly: 3.495 ± 1.341
1.271IleHis: 1.271 ± 0.699
4.449IleIle: 4.449 ± 1.065
2.224IleLys: 2.224 ± 0.618
4.766IleLeu: 4.766 ± 1.37
0.636IleMet: 0.636 ± 0.466
2.224IleAsn: 2.224 ± 0.594
3.813IlePro: 3.813 ± 0.976
0.953IleGln: 0.953 ± 0.256
2.224IleArg: 2.224 ± 0.594
5.402IleSer: 5.402 ± 1.637
2.224IleThr: 2.224 ± 0.652
4.449IleVal: 4.449 ± 1.338
0.0IleTrp: 0.0 ± 0.0
2.542IleTyr: 2.542 ± 0.993
0.0IleXaa: 0.0 ± 0.0
Lys
3.178LysAla: 3.178 ± 1.462
0.636LysCys: 0.636 ± 0.441
3.495LysAsp: 3.495 ± 0.925
4.449LysGlu: 4.449 ± 1.303
4.766LysPhe: 4.766 ± 1.791
3.178LysGly: 3.178 ± 1.517
1.907LysHis: 1.907 ± 1.189
2.224LysIle: 2.224 ± 0.762
4.131LysLys: 4.131 ± 1.321
6.991LysLeu: 6.991 ± 1.859
1.589LysMet: 1.589 ± 1.087
2.542LysAsn: 2.542 ± 1.11
2.542LysPro: 2.542 ± 1.254
1.907LysGln: 1.907 ± 1.074
5.084LysArg: 5.084 ± 1.312
5.402LysSer: 5.402 ± 1.827
4.766LysThr: 4.766 ± 0.801
5.72LysVal: 5.72 ± 1.741
0.0LysTrp: 0.0 ± 0.0
3.178LysTyr: 3.178 ± 0.78
0.0LysXaa: 0.0 ± 0.0
Leu
6.037LeuAla: 6.037 ± 1.359
3.178LeuCys: 3.178 ± 0.839
5.72LeuAsp: 5.72 ± 1.069
4.449LeuGlu: 4.449 ± 1.684
2.542LeuPhe: 2.542 ± 0.586
1.907LeuGly: 1.907 ± 0.483
1.271LeuHis: 1.271 ± 0.456
6.355LeuIle: 6.355 ± 0.791
8.897LeuLys: 8.897 ± 1.995
12.075LeuLeu: 12.075 ± 2.694
1.271LeuMet: 1.271 ± 0.612
3.178LeuAsn: 3.178 ± 0.749
3.495LeuPro: 3.495 ± 1.215
3.813LeuGln: 3.813 ± 0.913
6.037LeuArg: 6.037 ± 1.113
7.309LeuSer: 7.309 ± 1.41
5.402LeuThr: 5.402 ± 1.685
5.72LeuVal: 5.72 ± 1.143
1.589LeuTrp: 1.589 ± 0.975
4.131LeuTyr: 4.131 ± 1.016
0.0LeuXaa: 0.0 ± 0.0
Met
3.178MetAla: 3.178 ± 1.199
0.0MetCys: 0.0 ± 0.0
1.271MetAsp: 1.271 ± 0.831
0.636MetGlu: 0.636 ± 0.269
0.953MetPhe: 0.953 ± 0.698
0.636MetGly: 0.636 ± 0.304
0.318MetHis: 0.318 ± 0.279
1.271MetIle: 1.271 ± 0.287
1.907MetLys: 1.907 ± 1.186
1.271MetLeu: 1.271 ± 0.962
0.636MetMet: 0.636 ± 0.441
0.636MetAsn: 0.636 ± 0.304
1.907MetPro: 1.907 ± 0.627
1.271MetGln: 1.271 ± 0.534
1.271MetArg: 1.271 ± 0.931
1.271MetSer: 1.271 ± 0.538
0.953MetThr: 0.953 ± 0.256
1.271MetVal: 1.271 ± 0.609
0.318MetTrp: 0.318 ± 0.381
0.318MetTyr: 0.318 ± 0.233
0.0MetXaa: 0.0 ± 0.0
Asn
2.86AsnAla: 2.86 ± 1.209
0.636AsnCys: 0.636 ± 0.558
2.224AsnAsp: 2.224 ± 0.79
1.907AsnGlu: 1.907 ± 0.959
1.271AsnPhe: 1.271 ± 0.664
2.542AsnGly: 2.542 ± 1.254
0.636AsnHis: 0.636 ± 0.481
1.907AsnIle: 1.907 ± 0.398
2.224AsnLys: 2.224 ± 0.533
4.131AsnLeu: 4.131 ± 1.02
0.953AsnMet: 0.953 ± 0.419
1.271AsnAsn: 1.271 ± 0.624
1.271AsnPro: 1.271 ± 0.637
0.636AsnGln: 0.636 ± 0.914
1.907AsnArg: 1.907 ± 1.083
3.813AsnSer: 3.813 ± 0.736
1.271AsnThr: 1.271 ± 0.622
2.224AsnVal: 2.224 ± 0.637
0.636AsnTrp: 0.636 ± 0.466
1.589AsnTyr: 1.589 ± 0.356
0.0AsnXaa: 0.0 ± 0.0
Pro
3.178ProAla: 3.178 ± 1.635
1.271ProCys: 1.271 ± 0.512
2.224ProAsp: 2.224 ± 0.594
2.542ProGlu: 2.542 ± 0.523
0.953ProPhe: 0.953 ± 1.062
3.813ProGly: 3.813 ± 0.606
1.589ProHis: 1.589 ± 0.627
3.495ProIle: 3.495 ± 0.509
2.224ProLys: 2.224 ± 0.54
5.084ProLeu: 5.084 ± 1.52
0.953ProMet: 0.953 ± 0.518
3.178ProAsn: 3.178 ± 1.055
1.271ProPro: 1.271 ± 0.624
1.589ProGln: 1.589 ± 0.504
2.542ProArg: 2.542 ± 0.887
3.178ProSer: 3.178 ± 0.787
1.271ProThr: 1.271 ± 0.475
3.178ProVal: 3.178 ± 1.07
0.0ProTrp: 0.0 ± 0.0
1.271ProTyr: 1.271 ± 0.514
0.0ProXaa: 0.0 ± 0.0
Gln
2.224GlnAla: 2.224 ± 0.832
1.271GlnCys: 1.271 ± 0.747
1.271GlnAsp: 1.271 ± 0.514
2.542GlnGlu: 2.542 ± 1.091
1.589GlnPhe: 1.589 ± 0.692
3.178GlnGly: 3.178 ± 0.662
0.0GlnHis: 0.0 ± 0.0
0.636GlnIle: 0.636 ± 0.404
4.766GlnLys: 4.766 ± 1.272
1.907GlnLeu: 1.907 ± 0.619
0.636GlnMet: 0.636 ± 0.466
0.636GlnAsn: 0.636 ± 0.466
1.907GlnPro: 1.907 ± 0.777
0.953GlnGln: 0.953 ± 0.588
1.589GlnArg: 1.589 ± 0.455
1.589GlnSer: 1.589 ± 0.356
1.589GlnThr: 1.589 ± 0.964
2.86GlnVal: 2.86 ± 1.193
0.318GlnTrp: 0.318 ± 0.389
1.271GlnTyr: 1.271 ± 0.606
0.0GlnXaa: 0.0 ± 0.0
Arg
5.084ArgAla: 5.084 ± 1.239
0.636ArgCys: 0.636 ± 0.499
4.766ArgAsp: 4.766 ± 1.169
3.178ArgGlu: 3.178 ± 1.651
3.178ArgPhe: 3.178 ± 1.142
2.86ArgGly: 2.86 ± 0.858
0.953ArgHis: 0.953 ± 0.411
1.907ArgIle: 1.907 ± 0.664
4.449ArgLys: 4.449 ± 0.791
5.402ArgLeu: 5.402 ± 1.761
1.589ArgMet: 1.589 ± 0.297
3.178ArgAsn: 3.178 ± 0.732
2.224ArgPro: 2.224 ± 0.828
3.178ArgGln: 3.178 ± 1.252
3.495ArgArg: 3.495 ± 0.33
4.766ArgSer: 4.766 ± 1.318
4.131ArgThr: 4.131 ± 1.007
1.589ArgVal: 1.589 ± 0.665
0.318ArgTrp: 0.318 ± 0.279
1.271ArgTyr: 1.271 ± 0.287
0.0ArgXaa: 0.0 ± 0.0
Ser
5.084SerAla: 5.084 ± 1.109
2.224SerCys: 2.224 ± 0.917
3.178SerAsp: 3.178 ± 0.948
7.309SerGlu: 7.309 ± 1.573
1.589SerPhe: 1.589 ± 1.048
3.178SerGly: 3.178 ± 1.182
2.86SerHis: 2.86 ± 1.402
3.178SerIle: 3.178 ± 0.874
5.72SerLys: 5.72 ± 2.772
7.309SerLeu: 7.309 ± 1.102
1.907SerMet: 1.907 ± 0.947
1.907SerAsn: 1.907 ± 0.9
4.131SerPro: 4.131 ± 1.145
1.271SerGln: 1.271 ± 0.391
5.084SerArg: 5.084 ± 1.118
12.393SerSer: 12.393 ± 2.047
3.495SerThr: 3.495 ± 1.065
8.897SerVal: 8.897 ± 0.9
0.953SerTrp: 0.953 ± 0.411
3.178SerTyr: 3.178 ± 1.91
0.0SerXaa: 0.0 ± 0.0
Thr
4.131ThrAla: 4.131 ± 0.888
0.318ThrCys: 0.318 ± 0.353
3.495ThrAsp: 3.495 ± 0.67
2.224ThrGlu: 2.224 ± 0.526
4.449ThrPhe: 4.449 ± 1.431
2.86ThrGly: 2.86 ± 1.497
0.636ThrHis: 0.636 ± 0.466
3.495ThrIle: 3.495 ± 0.504
1.907ThrLys: 1.907 ± 0.527
4.449ThrLeu: 4.449 ± 1.601
0.953ThrMet: 0.953 ± 0.419
0.953ThrAsn: 0.953 ± 0.726
3.813ThrPro: 3.813 ± 0.803
0.953ThrGln: 0.953 ± 0.592
2.86ThrArg: 2.86 ± 0.726
3.495ThrSer: 3.495 ± 0.852
3.495ThrThr: 3.495 ± 0.693
3.813ThrVal: 3.813 ± 1.487
1.271ThrTrp: 1.271 ± 0.759
3.178ThrTyr: 3.178 ± 0.732
0.0ThrXaa: 0.0 ± 0.0
Val
5.402ValAla: 5.402 ± 0.984
0.636ValCys: 0.636 ± 0.609
4.449ValAsp: 4.449 ± 0.896
5.084ValGlu: 5.084 ± 1.209
3.178ValPhe: 3.178 ± 0.735
3.813ValGly: 3.813 ± 1.476
1.589ValHis: 1.589 ± 0.624
3.495ValIle: 3.495 ± 0.572
6.355ValLys: 6.355 ± 1.72
5.72ValLeu: 5.72 ± 1.531
1.907ValMet: 1.907 ± 0.807
1.271ValAsn: 1.271 ± 0.465
5.72ValPro: 5.72 ± 1.577
1.589ValGln: 1.589 ± 0.356
5.084ValArg: 5.084 ± 1.554
3.495ValSer: 3.495 ± 0.857
4.449ValThr: 4.449 ± 1.598
6.037ValVal: 6.037 ± 2.818
0.318ValTrp: 0.318 ± 0.233
2.542ValTyr: 2.542 ± 0.52
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
1.271TrpAsp: 1.271 ± 0.456
0.636TrpGlu: 0.636 ± 0.269
1.589TrpPhe: 1.589 ± 0.455
0.636TrpGly: 0.636 ± 0.654
0.318TrpHis: 0.318 ± 0.279
0.318TrpIle: 0.318 ± 0.389
0.318TrpLys: 0.318 ± 0.233
0.636TrpLeu: 0.636 ± 0.391
0.318TrpMet: 0.318 ± 0.254
0.953TrpAsn: 0.953 ± 0.561
0.318TrpPro: 0.318 ± 0.457
0.318TrpGln: 0.318 ± 0.381
0.318TrpArg: 0.318 ± 0.457
0.636TrpSer: 0.636 ± 0.487
0.0TrpThr: 0.0 ± 0.0
0.636TrpVal: 0.636 ± 0.404
0.636TrpTrp: 0.636 ± 0.442
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.224TyrAla: 2.224 ± 1.003
0.953TyrCys: 0.953 ± 0.641
1.907TyrAsp: 1.907 ± 1.011
1.589TyrGlu: 1.589 ± 0.58
0.953TyrPhe: 0.953 ± 0.443
2.542TyrGly: 2.542 ± 1.11
0.953TyrHis: 0.953 ± 0.447
2.542TyrIle: 2.542 ± 0.679
1.907TyrLys: 1.907 ± 0.73
4.766TyrLeu: 4.766 ± 1.351
0.318TyrMet: 0.318 ± 0.233
1.271TyrAsn: 1.271 ± 0.521
2.224TyrPro: 2.224 ± 0.936
2.224TyrGln: 2.224 ± 0.737
3.178TyrArg: 3.178 ± 0.689
2.542TyrSer: 2.542 ± 0.6
2.224TyrThr: 2.224 ± 0.939
1.907TyrVal: 1.907 ± 0.809
0.318TyrTrp: 0.318 ± 0.457
0.953TyrTyr: 0.953 ± 0.518
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.318XaaSer: 0.318 ± 0.457
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (3148 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski