Amino acid dipepetide frequency for High Plains wheat mosaic emaravirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.198AlaAla: 0.198 ± 0.284
0.198AlaCys: 0.198 ± 0.117
0.992AlaAsp: 0.992 ± 0.298
1.983AlaGlu: 1.983 ± 0.703
1.388AlaPhe: 1.388 ± 0.401
0.992AlaGly: 0.992 ± 0.352
0.595AlaHis: 0.595 ± 0.232
2.975AlaIle: 2.975 ± 1.121
2.38AlaLys: 2.38 ± 0.639
2.777AlaLeu: 2.777 ± 0.734
1.388AlaMet: 1.388 ± 0.637
1.587AlaAsn: 1.587 ± 0.388
0.397AlaPro: 0.397 ± 0.286
0.397AlaGln: 0.397 ± 0.368
1.19AlaArg: 1.19 ± 0.442
1.785AlaSer: 1.785 ± 0.672
1.785AlaThr: 1.785 ± 0.817
2.182AlaVal: 2.182 ± 0.772
0.0AlaTrp: 0.0 ± 0.0
1.19AlaTyr: 1.19 ± 0.799
0.0AlaXaa: 0.0 ± 0.0
Cys
0.397CysAla: 0.397 ± 0.193
0.397CysCys: 0.397 ± 0.324
1.983CysAsp: 1.983 ± 0.823
0.992CysGlu: 0.992 ± 0.512
0.992CysPhe: 0.992 ± 0.452
0.992CysGly: 0.992 ± 0.653
0.595CysHis: 0.595 ± 0.307
1.388CysIle: 1.388 ± 0.674
1.388CysLys: 1.388 ± 0.583
2.578CysLeu: 2.578 ± 0.749
0.397CysMet: 0.397 ± 0.299
1.388CysAsn: 1.388 ± 0.583
0.198CysPro: 0.198 ± 0.117
0.198CysGln: 0.198 ± 0.219
0.793CysArg: 0.793 ± 0.547
0.397CysSer: 0.397 ± 0.299
0.793CysThr: 0.793 ± 0.406
0.595CysVal: 0.595 ± 0.221
0.0CysTrp: 0.0 ± 0.0
1.19CysTyr: 1.19 ± 0.597
0.0CysXaa: 0.0 ± 0.0
Asp
1.785AspAla: 1.785 ± 0.388
0.793AspCys: 0.793 ± 0.282
6.347AspAsp: 6.347 ± 0.807
3.967AspGlu: 3.967 ± 0.77
3.57AspPhe: 3.57 ± 0.621
3.57AspGly: 3.57 ± 1.395
1.587AspHis: 1.587 ± 0.333
6.942AspIle: 6.942 ± 2.332
4.165AspLys: 4.165 ± 0.956
5.553AspLeu: 5.553 ± 1.353
3.768AspMet: 3.768 ± 0.738
3.967AspAsn: 3.967 ± 0.511
1.587AspPro: 1.587 ± 0.664
3.173AspGln: 3.173 ± 0.941
2.38AspArg: 2.38 ± 0.653
4.958AspSer: 4.958 ± 0.597
2.38AspThr: 2.38 ± 0.747
5.355AspVal: 5.355 ± 0.931
0.0AspTrp: 0.0 ± 0.0
4.76AspTyr: 4.76 ± 0.721
0.0AspXaa: 0.0 ± 0.0
Glu
1.983GluAla: 1.983 ± 0.729
0.992GluCys: 0.992 ± 0.412
3.57GluAsp: 3.57 ± 0.738
5.752GluGlu: 5.752 ± 1.345
4.76GluPhe: 4.76 ± 1.218
1.388GluGly: 1.388 ± 0.346
1.983GluHis: 1.983 ± 0.797
5.355GluIle: 5.355 ± 1.273
4.562GluLys: 4.562 ± 0.61
7.14GluLeu: 7.14 ± 1.131
1.785GluMet: 1.785 ± 0.824
3.768GluAsn: 3.768 ± 0.628
2.182GluPro: 2.182 ± 0.743
1.388GluGln: 1.388 ± 0.82
1.587GluArg: 1.587 ± 0.504
5.553GluSer: 5.553 ± 1.62
3.768GluThr: 3.768 ± 0.574
2.578GluVal: 2.578 ± 0.894
0.198GluTrp: 0.198 ± 0.217
2.182GluTyr: 2.182 ± 0.653
0.0GluXaa: 0.0 ± 0.0
Phe
2.38PheAla: 2.38 ± 0.523
0.397PheCys: 0.397 ± 0.234
2.975PheAsp: 2.975 ± 0.546
3.57PheGlu: 3.57 ± 0.89
1.587PhePhe: 1.587 ± 0.654
1.587PheGly: 1.587 ± 0.662
0.793PheHis: 0.793 ± 0.427
4.76PheIle: 4.76 ± 1.171
3.768PheLys: 3.768 ± 0.899
4.562PheLeu: 4.562 ± 0.967
0.992PheMet: 0.992 ± 0.633
4.76PheAsn: 4.76 ± 1.284
1.785PhePro: 1.785 ± 0.31
2.38PheGln: 2.38 ± 0.895
1.785PheArg: 1.785 ± 0.618
4.562PheSer: 4.562 ± 1.108
2.777PheThr: 2.777 ± 0.689
2.975PheVal: 2.975 ± 1.206
0.0PheTrp: 0.0 ± 0.0
5.157PheTyr: 5.157 ± 0.805
0.0PheXaa: 0.0 ± 0.0
Gly
0.397GlyAla: 0.397 ± 0.282
0.793GlyCys: 0.793 ± 0.601
1.587GlyAsp: 1.587 ± 0.851
1.983GlyGlu: 1.983 ± 0.324
4.165GlyPhe: 4.165 ± 1.537
0.397GlyGly: 0.397 ± 0.368
0.397GlyHis: 0.397 ± 0.437
2.38GlyIle: 2.38 ± 0.409
2.578GlyLys: 2.578 ± 0.812
3.768GlyLeu: 3.768 ± 0.338
0.793GlyMet: 0.793 ± 0.534
2.578GlyAsn: 2.578 ± 0.608
0.397GlyPro: 0.397 ± 0.33
1.388GlyGln: 1.388 ± 0.655
0.595GlyArg: 0.595 ± 0.857
2.975GlySer: 2.975 ± 0.548
1.388GlyThr: 1.388 ± 0.423
1.19GlyVal: 1.19 ± 0.762
0.595GlyTrp: 0.595 ± 0.493
1.587GlyTyr: 1.587 ± 0.683
0.0GlyXaa: 0.0 ± 0.0
His
0.595HisAla: 0.595 ± 0.351
0.397HisCys: 0.397 ± 0.234
1.19HisAsp: 1.19 ± 0.461
0.992HisGlu: 0.992 ± 0.337
1.19HisPhe: 1.19 ± 0.511
1.388HisGly: 1.388 ± 0.582
0.198HisHis: 0.198 ± 0.117
2.578HisIle: 2.578 ± 0.826
2.578HisLys: 2.578 ± 0.736
1.19HisLeu: 1.19 ± 0.457
0.198HisMet: 0.198 ± 0.219
1.785HisAsn: 1.785 ± 0.488
0.595HisPro: 0.595 ± 0.221
0.0HisGln: 0.0 ± 0.0
0.595HisArg: 0.595 ± 0.314
1.785HisSer: 1.785 ± 0.404
1.785HisThr: 1.785 ± 0.497
1.19HisVal: 1.19 ± 0.488
0.397HisTrp: 0.397 ± 0.368
1.19HisTyr: 1.19 ± 0.646
0.0HisXaa: 0.0 ± 0.0
Ile
2.578IleAla: 2.578 ± 0.676
1.19IleCys: 1.19 ± 0.424
7.537IleAsp: 7.537 ± 1.471
4.958IleGlu: 4.958 ± 1.484
2.777IlePhe: 2.777 ± 0.895
3.967IleGly: 3.967 ± 0.992
0.992IleHis: 0.992 ± 0.399
5.752IleIle: 5.752 ± 1.073
10.313IleLys: 10.313 ± 0.986
5.355IleLeu: 5.355 ± 0.937
2.182IleMet: 2.182 ± 0.462
8.528IleAsn: 8.528 ± 0.943
3.57IlePro: 3.57 ± 1.088
2.182IleGln: 2.182 ± 0.843
3.173IleArg: 3.173 ± 0.764
7.933IleSer: 7.933 ± 1.37
3.372IleThr: 3.372 ± 0.684
4.562IleVal: 4.562 ± 0.967
0.0IleTrp: 0.0 ± 0.0
3.768IleTyr: 3.768 ± 0.617
0.0IleXaa: 0.0 ± 0.0
Lys
2.975LysAla: 2.975 ± 0.871
1.785LysCys: 1.785 ± 0.576
5.95LysAsp: 5.95 ± 1.081
3.768LysGlu: 3.768 ± 0.832
5.355LysPhe: 5.355 ± 1.047
1.983LysGly: 1.983 ± 0.808
2.38LysHis: 2.38 ± 0.569
5.355LysIle: 5.355 ± 1.035
10.908LysLys: 10.908 ± 2.031
10.115LysLeu: 10.115 ± 0.944
2.777LysMet: 2.777 ± 0.94
7.933LysAsn: 7.933 ± 1.222
3.57LysPro: 3.57 ± 0.744
2.578LysGln: 2.578 ± 0.885
2.578LysArg: 2.578 ± 0.997
7.735LysSer: 7.735 ± 1.025
5.752LysThr: 5.752 ± 1.461
3.173LysVal: 3.173 ± 0.839
1.388LysTrp: 1.388 ± 0.568
6.743LysTyr: 6.743 ± 1.473
0.0LysXaa: 0.0 ± 0.0
Leu
2.38LeuAla: 2.38 ± 1.067
1.587LeuCys: 1.587 ± 0.652
5.752LeuAsp: 5.752 ± 0.976
5.355LeuGlu: 5.355 ± 1.055
4.76LeuPhe: 4.76 ± 0.726
1.388LeuGly: 1.388 ± 0.788
2.182LeuHis: 2.182 ± 0.451
9.718LeuIle: 9.718 ± 1.89
8.528LeuLys: 8.528 ± 0.803
8.528LeuLeu: 8.528 ± 2.294
1.983LeuMet: 1.983 ± 0.518
6.347LeuAsn: 6.347 ± 0.698
2.38LeuPro: 2.38 ± 0.633
2.578LeuGln: 2.578 ± 0.499
2.182LeuArg: 2.182 ± 0.704
8.33LeuSer: 8.33 ± 1.918
4.363LeuThr: 4.363 ± 0.738
5.752LeuVal: 5.752 ± 1.401
0.0LeuTrp: 0.0 ± 0.0
3.372LeuTyr: 3.372 ± 1.361
0.0LeuXaa: 0.0 ± 0.0
Met
0.793MetAla: 0.793 ± 0.68
0.198MetCys: 0.198 ± 0.117
2.182MetAsp: 2.182 ± 0.495
1.785MetGlu: 1.785 ± 0.446
1.19MetPhe: 1.19 ± 0.478
0.198MetGly: 0.198 ± 0.286
0.793MetHis: 0.793 ± 0.461
1.785MetIle: 1.785 ± 0.598
2.777MetLys: 2.777 ± 1.236
3.768MetLeu: 3.768 ± 0.776
0.397MetMet: 0.397 ± 0.282
3.967MetAsn: 3.967 ± 0.582
0.595MetPro: 0.595 ± 0.3
0.397MetGln: 0.397 ± 0.193
0.793MetArg: 0.793 ± 0.468
2.38MetSer: 2.38 ± 0.691
0.992MetThr: 0.992 ± 0.445
2.182MetVal: 2.182 ± 0.65
0.397MetTrp: 0.397 ± 0.228
1.19MetTyr: 1.19 ± 0.506
0.0MetXaa: 0.0 ± 0.0
Asn
1.785AsnAla: 1.785 ± 0.693
1.785AsnCys: 1.785 ± 0.964
5.355AsnAsp: 5.355 ± 1.011
3.967AsnGlu: 3.967 ± 1.566
4.958AsnPhe: 4.958 ± 1.409
1.785AsnGly: 1.785 ± 0.85
1.785AsnHis: 1.785 ± 0.413
5.157AsnIle: 5.157 ± 0.959
7.735AsnLys: 7.735 ± 1.88
5.355AsnLeu: 5.355 ± 0.986
2.578AsnMet: 2.578 ± 0.797
4.958AsnAsn: 4.958 ± 1.057
2.975AsnPro: 2.975 ± 0.749
2.975AsnGln: 2.975 ± 0.749
2.975AsnArg: 2.975 ± 0.425
4.562AsnSer: 4.562 ± 0.68
4.363AsnThr: 4.363 ± 0.606
3.967AsnVal: 3.967 ± 0.757
0.397AsnTrp: 0.397 ± 0.286
5.355AsnTyr: 5.355 ± 1.618
0.0AsnXaa: 0.0 ± 0.0
Pro
0.595ProAla: 0.595 ± 0.357
0.397ProCys: 0.397 ± 0.234
1.983ProAsp: 1.983 ± 0.567
2.38ProGlu: 2.38 ± 0.954
0.595ProPhe: 0.595 ± 0.297
0.992ProGly: 0.992 ± 0.553
0.793ProHis: 0.793 ± 0.557
2.578ProIle: 2.578 ± 0.818
3.57ProLys: 3.57 ± 0.914
1.388ProLeu: 1.388 ± 0.619
1.19ProMet: 1.19 ± 0.437
0.793ProAsn: 0.793 ± 0.378
0.397ProPro: 0.397 ± 0.437
0.793ProGln: 0.793 ± 0.468
1.388ProArg: 1.388 ± 0.486
1.785ProSer: 1.785 ± 0.469
1.785ProThr: 1.785 ± 0.488
2.182ProVal: 2.182 ± 1.067
0.0ProTrp: 0.0 ± 0.0
1.19ProTyr: 1.19 ± 0.588
0.0ProXaa: 0.0 ± 0.0
Gln
0.595GlnAla: 0.595 ± 0.307
0.595GlnCys: 0.595 ± 0.264
1.983GlnAsp: 1.983 ± 0.689
2.975GlnGlu: 2.975 ± 0.641
0.992GlnPhe: 0.992 ± 0.326
0.595GlnGly: 0.595 ± 0.232
0.992GlnHis: 0.992 ± 0.277
2.182GlnIle: 2.182 ± 0.767
2.578GlnLys: 2.578 ± 1.092
2.38GlnLeu: 2.38 ± 0.766
0.397GlnMet: 0.397 ± 0.395
1.587GlnAsn: 1.587 ± 0.476
0.595GlnPro: 0.595 ± 0.33
0.992GlnGln: 0.992 ± 0.887
1.388GlnArg: 1.388 ± 0.48
1.587GlnSer: 1.587 ± 0.614
1.983GlnThr: 1.983 ± 0.803
1.19GlnVal: 1.19 ± 0.593
0.397GlnTrp: 0.397 ± 0.228
1.983GlnTyr: 1.983 ± 0.751
0.0GlnXaa: 0.0 ± 0.0
Arg
0.397ArgAla: 0.397 ± 0.327
0.793ArgCys: 0.793 ± 0.468
1.587ArgAsp: 1.587 ± 0.437
1.587ArgGlu: 1.587 ± 0.332
2.578ArgPhe: 2.578 ± 0.824
0.595ArgGly: 0.595 ± 0.37
0.595ArgHis: 0.595 ± 0.305
2.182ArgIle: 2.182 ± 0.592
2.182ArgLys: 2.182 ± 0.553
3.768ArgLeu: 3.768 ± 0.925
1.388ArgMet: 1.388 ± 0.55
2.182ArgAsn: 2.182 ± 1.109
0.397ArgPro: 0.397 ± 0.282
0.793ArgGln: 0.793 ± 0.429
2.182ArgArg: 2.182 ± 0.713
2.777ArgSer: 2.777 ± 0.414
1.785ArgThr: 1.785 ± 0.515
2.38ArgVal: 2.38 ± 0.533
0.198ArgTrp: 0.198 ± 0.304
3.768ArgTyr: 3.768 ± 1.015
0.0ArgXaa: 0.0 ± 0.0
Ser
1.983SerAla: 1.983 ± 0.434
2.182SerCys: 2.182 ± 0.783
7.14SerAsp: 7.14 ± 1.017
3.768SerGlu: 3.768 ± 1.039
3.57SerPhe: 3.57 ± 0.333
2.182SerGly: 2.182 ± 0.37
1.388SerHis: 1.388 ± 0.635
6.545SerIle: 6.545 ± 0.569
8.132SerLys: 8.132 ± 1.231
6.545SerLeu: 6.545 ± 1.706
2.38SerMet: 2.38 ± 0.326
6.148SerAsn: 6.148 ± 2.103
1.785SerPro: 1.785 ± 0.618
2.182SerGln: 2.182 ± 0.899
1.587SerArg: 1.587 ± 0.426
7.933SerSer: 7.933 ± 1.607
4.165SerThr: 4.165 ± 0.621
5.95SerVal: 5.95 ± 1.246
0.397SerTrp: 0.397 ± 0.327
4.363SerTyr: 4.363 ± 0.621
0.0SerXaa: 0.0 ± 0.0
Thr
1.785ThrAla: 1.785 ± 0.725
1.19ThrCys: 1.19 ± 0.853
4.165ThrAsp: 4.165 ± 0.684
4.165ThrGlu: 4.165 ± 0.859
2.38ThrPhe: 2.38 ± 0.569
3.57ThrGly: 3.57 ± 0.916
1.19ThrHis: 1.19 ± 0.496
7.14ThrIle: 7.14 ± 1.871
4.363ThrLys: 4.363 ± 0.75
3.768ThrLeu: 3.768 ± 1.018
0.992ThrMet: 0.992 ± 0.53
2.182ThrAsn: 2.182 ± 0.512
1.388ThrPro: 1.388 ± 0.698
0.992ThrGln: 0.992 ± 0.594
2.182ThrArg: 2.182 ± 0.586
3.173ThrSer: 3.173 ± 0.729
2.777ThrThr: 2.777 ± 0.726
3.967ThrVal: 3.967 ± 1.563
0.397ThrTrp: 0.397 ± 0.282
3.372ThrTyr: 3.372 ± 1.064
0.0ThrXaa: 0.0 ± 0.0
Val
1.388ValAla: 1.388 ± 0.808
0.595ValCys: 0.595 ± 0.251
3.768ValAsp: 3.768 ± 0.8
2.975ValGlu: 2.975 ± 1.189
3.967ValPhe: 3.967 ± 0.546
1.785ValGly: 1.785 ± 0.949
1.388ValHis: 1.388 ± 0.485
4.76ValIle: 4.76 ± 1.045
5.157ValLys: 5.157 ± 0.68
3.967ValLeu: 3.967 ± 0.996
0.595ValMet: 0.595 ± 0.345
4.165ValAsn: 4.165 ± 0.794
1.388ValPro: 1.388 ± 0.753
1.388ValGln: 1.388 ± 0.714
2.777ValArg: 2.777 ± 1.342
5.553ValSer: 5.553 ± 1.163
5.157ValThr: 5.157 ± 1.07
3.967ValVal: 3.967 ± 1.084
0.198ValTrp: 0.198 ± 0.217
2.38ValTyr: 2.38 ± 0.65
0.0ValXaa: 0.0 ± 0.0
Trp
0.198TrpAla: 0.198 ± 0.117
0.198TrpCys: 0.198 ± 0.219
0.198TrpAsp: 0.198 ± 0.117
0.397TrpGlu: 0.397 ± 0.282
0.595TrpPhe: 0.595 ± 0.37
0.0TrpGly: 0.0 ± 0.0
0.198TrpHis: 0.198 ± 0.304
0.397TrpIle: 0.397 ± 0.37
0.595TrpLys: 0.595 ± 0.504
0.198TrpLeu: 0.198 ± 0.117
0.198TrpMet: 0.198 ± 0.215
0.397TrpAsn: 0.397 ± 0.37
0.198TrpPro: 0.198 ± 0.217
0.0TrpGln: 0.0 ± 0.0
0.198TrpArg: 0.198 ± 0.217
0.992TrpSer: 0.992 ± 0.563
0.198TrpThr: 0.198 ± 0.217
0.198TrpVal: 0.198 ± 0.117
0.0TrpTrp: 0.0 ± 0.0
0.198TrpTyr: 0.198 ± 0.217
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.19TyrAla: 1.19 ± 0.426
1.19TyrCys: 1.19 ± 0.377
4.562TyrAsp: 4.562 ± 1.556
5.157TyrGlu: 5.157 ± 1.245
2.182TyrPhe: 2.182 ± 0.753
2.578TyrGly: 2.578 ± 0.817
0.992TyrHis: 0.992 ± 0.577
4.76TyrIle: 4.76 ± 1.288
6.545TyrLys: 6.545 ± 1.023
4.76TyrLeu: 4.76 ± 1.447
2.182TyrMet: 2.182 ± 0.47
5.752TyrAsn: 5.752 ± 0.77
0.397TyrPro: 0.397 ± 0.37
1.19TyrGln: 1.19 ± 0.544
1.785TyrArg: 1.785 ± 0.831
3.57TyrSer: 3.57 ± 1.02
3.768TyrThr: 3.768 ± 0.901
1.785TyrVal: 1.785 ± 0.647
0.595TyrTrp: 0.595 ± 0.429
2.975TyrTyr: 2.975 ± 0.748
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (5043 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski