Amino acid dipepetide frequency for Magnaporthe oryzae chrysovirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.048AlaAla: 17.048 ± 2.273
1.635AlaCys: 1.635 ± 0.366
9.575AlaAsp: 9.575 ± 1.063
7.707AlaGlu: 7.707 ± 1.015
3.036AlaPhe: 3.036 ± 0.398
11.677AlaGly: 11.677 ± 1.947
2.102AlaHis: 2.102 ± 0.631
3.27AlaIle: 3.27 ± 0.525
2.569AlaLys: 2.569 ± 0.567
12.377AlaLeu: 12.377 ± 1.729
4.904AlaMet: 4.904 ± 0.697
2.802AlaAsn: 2.802 ± 0.85
9.108AlaPro: 9.108 ± 1.709
3.737AlaGln: 3.737 ± 0.468
11.677AlaArg: 11.677 ± 1.8
9.341AlaSer: 9.341 ± 0.718
6.072AlaThr: 6.072 ± 0.511
10.976AlaVal: 10.976 ± 1.558
1.868AlaTrp: 1.868 ± 0.441
4.204AlaTyr: 4.204 ± 1.185
0.0AlaXaa: 0.0 ± 0.0
Cys
2.569CysAla: 2.569 ± 0.585
0.234CysCys: 0.234 ± 0.241
0.934CysAsp: 0.934 ± 0.358
0.701CysGlu: 0.701 ± 0.221
0.0CysPhe: 0.0 ± 0.0
2.569CysGly: 2.569 ± 0.668
0.234CysHis: 0.234 ± 0.241
0.234CysIle: 0.234 ± 0.23
0.467CysLys: 0.467 ± 0.312
0.934CysLeu: 0.934 ± 0.169
0.467CysMet: 0.467 ± 0.225
0.234CysAsn: 0.234 ± 0.197
0.467CysPro: 0.467 ± 0.281
0.234CysGln: 0.234 ± 0.241
0.701CysArg: 0.701 ± 0.391
0.934CysSer: 0.934 ± 0.419
0.934CysThr: 0.934 ± 0.254
0.701CysVal: 0.701 ± 0.27
0.234CysTrp: 0.234 ± 0.241
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
8.174AspAla: 8.174 ± 1.896
0.467AspCys: 0.467 ± 0.259
3.503AspAsp: 3.503 ± 0.745
2.802AspGlu: 2.802 ± 0.276
0.0AspPhe: 0.0 ± 0.0
6.072AspGly: 6.072 ± 0.44
1.168AspHis: 1.168 ± 0.108
1.868AspIle: 1.868 ± 0.538
1.168AspLys: 1.168 ± 0.655
2.569AspLeu: 2.569 ± 0.913
1.401AspMet: 1.401 ± 0.685
1.401AspAsn: 1.401 ± 0.623
2.569AspPro: 2.569 ± 0.702
2.335AspGln: 2.335 ± 0.675
3.27AspArg: 3.27 ± 0.741
1.868AspSer: 1.868 ± 0.678
3.737AspThr: 3.737 ± 0.692
6.072AspVal: 6.072 ± 1.019
0.701AspTrp: 0.701 ± 0.391
3.503AspTyr: 3.503 ± 0.83
0.0AspXaa: 0.0 ± 0.0
Glu
6.539GluAla: 6.539 ± 0.854
0.701GluCys: 0.701 ± 0.428
1.635GluAsp: 1.635 ± 0.399
2.802GluGlu: 2.802 ± 0.72
2.802GluPhe: 2.802 ± 1.159
2.802GluGly: 2.802 ± 0.81
0.701GluHis: 0.701 ± 0.479
1.168GluIle: 1.168 ± 0.405
1.635GluLys: 1.635 ± 0.64
4.671GluLeu: 4.671 ± 0.877
0.467GluMet: 0.467 ± 0.259
0.701GluAsn: 0.701 ± 0.266
3.737GluPro: 3.737 ± 0.461
2.569GluGln: 2.569 ± 0.768
4.437GluArg: 4.437 ± 0.883
3.27GluSer: 3.27 ± 0.493
2.335GluThr: 2.335 ± 0.767
5.605GluVal: 5.605 ± 0.771
1.168GluTrp: 1.168 ± 0.535
1.168GluTyr: 1.168 ± 0.451
0.0GluXaa: 0.0 ± 0.0
Phe
2.102PheAla: 2.102 ± 0.818
0.0PheCys: 0.0 ± 0.0
1.635PheAsp: 1.635 ± 0.435
0.701PheGlu: 0.701 ± 0.365
0.467PhePhe: 0.467 ± 0.259
2.335PheGly: 2.335 ± 0.786
0.0PheHis: 0.0 ± 0.0
0.467PheIle: 0.467 ± 0.393
1.401PheLys: 1.401 ± 0.923
1.635PheLeu: 1.635 ± 0.392
0.234PheMet: 0.234 ± 0.233
0.234PheAsn: 0.234 ± 0.197
0.701PhePro: 0.701 ± 0.59
0.701PheGln: 0.701 ± 0.391
1.168PheArg: 1.168 ± 0.787
1.168PheSer: 1.168 ± 0.899
1.401PheThr: 1.401 ± 0.464
2.102PheVal: 2.102 ± 0.312
0.467PheTrp: 0.467 ± 0.225
0.234PheTyr: 0.234 ± 0.213
0.0PheXaa: 0.0 ± 0.0
Gly
11.21GlyAla: 11.21 ± 2.136
1.868GlyCys: 1.868 ± 0.644
5.138GlyAsp: 5.138 ± 0.312
4.437GlyGlu: 4.437 ± 1.102
1.168GlyPhe: 1.168 ± 0.41
8.874GlyGly: 8.874 ± 2.269
2.569GlyHis: 2.569 ± 0.953
2.102GlyIle: 2.102 ± 0.761
2.802GlyLys: 2.802 ± 0.78
8.407GlyLeu: 8.407 ± 2.662
2.335GlyMet: 2.335 ± 0.46
2.335GlyAsn: 2.335 ± 0.858
6.072GlyPro: 6.072 ± 1.924
3.27GlyGln: 3.27 ± 1.201
7.006GlyArg: 7.006 ± 1.151
8.641GlySer: 8.641 ± 0.68
5.138GlyThr: 5.138 ± 0.677
6.773GlyVal: 6.773 ± 0.794
1.635GlyTrp: 1.635 ± 0.716
1.635GlyTyr: 1.635 ± 0.586
0.0GlyXaa: 0.0 ± 0.0
His
2.335HisAla: 2.335 ± 0.837
0.467HisCys: 0.467 ± 0.466
0.467HisAsp: 0.467 ± 0.259
0.467HisGlu: 0.467 ± 0.393
0.934HisPhe: 0.934 ± 0.254
1.868HisGly: 1.868 ± 0.204
0.701HisHis: 0.701 ± 0.639
0.467HisIle: 0.467 ± 0.323
0.934HisLys: 0.934 ± 0.254
1.168HisLeu: 1.168 ± 0.385
0.467HisMet: 0.467 ± 0.426
0.234HisAsn: 0.234 ± 0.197
1.635HisPro: 1.635 ± 0.653
1.168HisGln: 1.168 ± 0.108
2.335HisArg: 2.335 ± 0.504
2.802HisSer: 2.802 ± 0.739
1.168HisThr: 1.168 ± 0.55
1.168HisVal: 1.168 ± 0.657
0.467HisTrp: 0.467 ± 0.259
1.168HisTyr: 1.168 ± 0.451
0.0HisXaa: 0.0 ± 0.0
Ile
2.569IleAla: 2.569 ± 0.405
0.701IleCys: 0.701 ± 0.427
2.102IleAsp: 2.102 ± 0.757
0.701IleGlu: 0.701 ± 0.479
0.467IlePhe: 0.467 ± 0.393
2.802IleGly: 2.802 ± 0.758
0.934IleHis: 0.934 ± 0.602
0.467IleIle: 0.467 ± 0.393
0.701IleLys: 0.701 ± 0.242
0.701IleLeu: 0.701 ± 0.458
1.168IleMet: 1.168 ± 0.537
0.934IleAsn: 0.934 ± 0.374
0.934IlePro: 0.934 ± 0.482
0.467IleGln: 0.467 ± 0.393
1.868IleArg: 1.868 ± 0.36
0.467IleSer: 0.467 ± 0.241
1.401IleThr: 1.401 ± 0.722
2.102IleVal: 2.102 ± 0.559
0.467IleTrp: 0.467 ± 0.241
0.467IleTyr: 0.467 ± 0.241
0.0IleXaa: 0.0 ± 0.0
Lys
3.27LysAla: 3.27 ± 1.009
0.467LysCys: 0.467 ± 0.282
0.701LysAsp: 0.701 ± 0.242
1.401LysGlu: 1.401 ± 0.464
0.701LysPhe: 0.701 ± 0.479
2.569LysGly: 2.569 ± 0.404
0.934LysHis: 0.934 ± 0.787
1.401LysIle: 1.401 ± 0.421
0.934LysLys: 0.934 ± 0.548
3.737LysLeu: 3.737 ± 0.668
0.0LysMet: 0.0 ± 0.0
0.467LysAsn: 0.467 ± 0.241
2.102LysPro: 2.102 ± 0.553
0.467LysGln: 0.467 ± 0.259
0.934LysArg: 0.934 ± 0.541
1.168LysSer: 1.168 ± 0.278
0.701LysThr: 0.701 ± 0.365
1.168LysVal: 1.168 ± 0.443
0.701LysTrp: 0.701 ± 0.375
0.234LysTyr: 0.234 ± 0.213
0.0LysXaa: 0.0 ± 0.0
Leu
15.18LeuAla: 15.18 ± 0.869
1.868LeuCys: 1.868 ± 0.277
4.437LeuAsp: 4.437 ± 0.597
3.27LeuGlu: 3.27 ± 0.692
1.168LeuPhe: 1.168 ± 0.324
7.473LeuGly: 7.473 ± 0.804
2.569LeuHis: 2.569 ± 1.032
1.401LeuIle: 1.401 ± 0.526
1.868LeuLys: 1.868 ± 0.857
7.94LeuLeu: 7.94 ± 1.375
2.102LeuMet: 2.102 ± 0.483
1.868LeuAsn: 1.868 ± 0.192
4.204LeuPro: 4.204 ± 0.368
1.868LeuGln: 1.868 ± 1.085
6.773LeuArg: 6.773 ± 1.112
9.809LeuSer: 9.809 ± 1.084
4.437LeuThr: 4.437 ± 0.42
5.605LeuVal: 5.605 ± 0.771
0.467LeuTrp: 0.467 ± 0.259
2.569LeuTyr: 2.569 ± 0.639
0.0LeuXaa: 0.0 ± 0.0
Met
2.335MetAla: 2.335 ± 0.263
0.234MetCys: 0.234 ± 0.23
0.934MetAsp: 0.934 ± 0.254
1.168MetGlu: 1.168 ± 0.289
0.467MetPhe: 0.467 ± 0.323
3.97MetGly: 3.97 ± 0.773
0.467MetHis: 0.467 ± 0.259
0.701MetIle: 0.701 ± 0.59
0.467MetLys: 0.467 ± 0.225
2.569MetLeu: 2.569 ± 1.102
0.467MetMet: 0.467 ± 0.281
0.234MetAsn: 0.234 ± 0.23
0.467MetPro: 0.467 ± 0.312
1.635MetGln: 1.635 ± 0.75
2.569MetArg: 2.569 ± 0.907
1.401MetSer: 1.401 ± 0.461
0.0MetThr: 0.0 ± 0.0
1.635MetVal: 1.635 ± 0.912
0.467MetTrp: 0.467 ± 0.282
1.401MetTyr: 1.401 ± 0.561
0.0MetXaa: 0.0 ± 0.0
Asn
2.802AsnAla: 2.802 ± 0.781
0.234AsnCys: 0.234 ± 0.23
0.934AsnAsp: 0.934 ± 0.22
1.868AsnGlu: 1.868 ± 0.626
0.934AsnPhe: 0.934 ± 0.482
2.802AsnGly: 2.802 ± 0.943
0.701AsnHis: 0.701 ± 0.69
0.934AsnIle: 0.934 ± 0.357
0.934AsnLys: 0.934 ± 0.355
1.168AsnLeu: 1.168 ± 0.657
0.234AsnMet: 0.234 ± 0.23
1.401AsnAsn: 1.401 ± 0.535
0.934AsnPro: 0.934 ± 0.254
0.234AsnGln: 0.234 ± 0.213
1.635AsnArg: 1.635 ± 0.639
2.102AsnSer: 2.102 ± 0.527
1.168AsnThr: 1.168 ± 0.499
1.635AsnVal: 1.635 ± 0.318
0.234AsnTrp: 0.234 ± 0.197
1.168AsnTyr: 1.168 ± 0.253
0.0AsnXaa: 0.0 ± 0.0
Pro
8.407ProAla: 8.407 ± 1.17
0.701ProCys: 0.701 ± 0.27
2.569ProAsp: 2.569 ± 0.364
2.569ProGlu: 2.569 ± 0.542
0.701ProPhe: 0.701 ± 0.221
3.737ProGly: 3.737 ± 1.44
1.635ProHis: 1.635 ± 0.869
1.635ProIle: 1.635 ± 0.686
1.401ProLys: 1.401 ± 0.722
3.97ProLeu: 3.97 ± 0.686
0.934ProMet: 0.934 ± 0.313
2.102ProAsn: 2.102 ± 0.347
5.371ProPro: 5.371 ± 1.615
2.569ProGln: 2.569 ± 0.809
5.371ProArg: 5.371 ± 0.732
6.072ProSer: 6.072 ± 1.839
3.036ProThr: 3.036 ± 1.162
4.437ProVal: 4.437 ± 0.851
0.467ProTrp: 0.467 ± 0.259
1.168ProTyr: 1.168 ± 0.324
0.0ProXaa: 0.0 ± 0.0
Gln
5.605GlnAla: 5.605 ± 0.444
0.934GlnCys: 0.934 ± 0.624
1.168GlnAsp: 1.168 ± 0.491
1.868GlnGlu: 1.868 ± 0.441
0.701GlnPhe: 0.701 ± 0.479
2.102GlnGly: 2.102 ± 1.073
0.467GlnHis: 0.467 ± 0.225
0.234GlnIle: 0.234 ± 0.23
0.467GlnLys: 0.467 ± 0.393
3.503GlnLeu: 3.503 ± 0.934
0.234GlnMet: 0.234 ± 0.197
0.467GlnAsn: 0.467 ± 0.281
2.569GlnPro: 2.569 ± 0.714
0.467GlnGln: 0.467 ± 0.426
1.168GlnArg: 1.168 ± 0.393
2.335GlnSer: 2.335 ± 0.531
1.868GlnThr: 1.868 ± 0.599
4.204GlnVal: 4.204 ± 0.74
1.168GlnTrp: 1.168 ± 0.289
0.701GlnTyr: 0.701 ± 0.266
0.0GlnXaa: 0.0 ± 0.0
Arg
10.042ArgAla: 10.042 ± 2.081
0.467ArgCys: 0.467 ± 0.46
4.671ArgAsp: 4.671 ± 0.883
3.97ArgGlu: 3.97 ± 0.785
0.934ArgPhe: 0.934 ± 0.602
7.24ArgGly: 7.24 ± 1.965
1.868ArgHis: 1.868 ± 0.301
1.401ArgIle: 1.401 ± 0.484
0.234ArgLys: 0.234 ± 0.197
8.407ArgLeu: 8.407 ± 1.226
1.868ArgMet: 1.868 ± 0.533
1.168ArgAsn: 1.168 ± 0.571
3.97ArgPro: 3.97 ± 0.69
4.204ArgGln: 4.204 ± 0.557
7.006ArgArg: 7.006 ± 1.1
5.605ArgSer: 5.605 ± 1.056
4.671ArgThr: 4.671 ± 1.371
7.24ArgVal: 7.24 ± 0.577
0.701ArgTrp: 0.701 ± 0.35
2.335ArgTyr: 2.335 ± 0.951
0.0ArgXaa: 0.0 ± 0.0
Ser
9.575SerAla: 9.575 ± 2.035
1.401SerCys: 1.401 ± 0.681
5.371SerAsp: 5.371 ± 0.708
3.27SerGlu: 3.27 ± 0.768
1.401SerPhe: 1.401 ± 0.535
9.108SerGly: 9.108 ± 1.216
1.868SerHis: 1.868 ± 0.556
1.635SerIle: 1.635 ± 0.505
0.701SerLys: 0.701 ± 0.458
4.904SerLeu: 4.904 ± 1.393
2.102SerMet: 2.102 ± 0.758
2.102SerAsn: 2.102 ± 0.892
3.97SerPro: 3.97 ± 1.08
0.934SerGln: 0.934 ± 0.676
6.539SerArg: 6.539 ± 0.875
5.371SerSer: 5.371 ± 1.476
3.503SerThr: 3.503 ± 1.201
8.407SerVal: 8.407 ± 0.991
1.401SerTrp: 1.401 ± 0.465
2.802SerTyr: 2.802 ± 0.682
0.0SerXaa: 0.0 ± 0.0
Thr
7.24ThrAla: 7.24 ± 0.938
0.234ThrCys: 0.234 ± 0.213
1.868ThrAsp: 1.868 ± 0.441
2.569ThrGlu: 2.569 ± 0.916
0.934ThrPhe: 0.934 ± 0.563
6.305ThrGly: 6.305 ± 1.434
0.934ThrHis: 0.934 ± 0.624
0.701ThrIle: 0.701 ± 0.266
0.934ThrLys: 0.934 ± 0.169
5.371ThrLeu: 5.371 ± 1.619
1.401ThrMet: 1.401 ± 0.133
0.934ThrAsn: 0.934 ± 0.169
3.737ThrPro: 3.737 ± 1.037
2.102ThrGln: 2.102 ± 1.129
2.802ThrArg: 2.802 ± 0.728
4.204ThrSer: 4.204 ± 0.186
4.437ThrThr: 4.437 ± 1.441
4.204ThrVal: 4.204 ± 0.845
0.234ThrTrp: 0.234 ± 0.241
2.335ThrTyr: 2.335 ± 1.203
0.0ThrXaa: 0.0 ± 0.0
Val
13.312ValAla: 13.312 ± 1.121
1.168ValCys: 1.168 ± 0.55
3.27ValAsp: 3.27 ± 1.209
6.305ValGlu: 6.305 ± 0.664
1.401ValPhe: 1.401 ± 0.706
6.539ValGly: 6.539 ± 0.429
2.102ValHis: 2.102 ± 0.724
1.868ValIle: 1.868 ± 0.591
2.802ValLys: 2.802 ± 1.276
8.641ValLeu: 8.641 ± 0.673
0.934ValMet: 0.934 ± 0.355
2.569ValAsn: 2.569 ± 0.492
4.437ValPro: 4.437 ± 1.025
2.569ValGln: 2.569 ± 0.879
5.838ValArg: 5.838 ± 0.729
6.305ValSer: 6.305 ± 0.952
5.371ValThr: 5.371 ± 1.212
7.24ValVal: 7.24 ± 0.752
0.934ValTrp: 0.934 ± 0.412
1.868ValTyr: 1.868 ± 0.756
0.0ValXaa: 0.0 ± 0.0
Trp
2.335TrpAla: 2.335 ± 0.407
0.0TrpCys: 0.0 ± 0.0
0.234TrpAsp: 0.234 ± 0.197
0.934TrpGlu: 0.934 ± 0.432
0.467TrpPhe: 0.467 ± 0.259
0.234TrpGly: 0.234 ± 0.23
0.234TrpHis: 0.234 ± 0.197
0.467TrpIle: 0.467 ± 0.225
0.467TrpLys: 0.467 ± 0.259
1.868TrpLeu: 1.868 ± 0.356
0.701TrpMet: 0.701 ± 0.391
0.467TrpAsn: 0.467 ± 0.323
0.934TrpPro: 0.934 ± 0.412
0.467TrpGln: 0.467 ± 0.426
1.635TrpArg: 1.635 ± 0.506
0.934TrpSer: 0.934 ± 0.449
0.467TrpThr: 0.467 ± 0.278
1.401TrpVal: 1.401 ± 0.464
0.234TrpTrp: 0.234 ± 0.241
0.234TrpTyr: 0.234 ± 0.23
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.036TyrAla: 3.036 ± 0.734
0.0TyrCys: 0.0 ± 0.0
3.503TyrAsp: 3.503 ± 0.857
1.401TyrGlu: 1.401 ± 0.654
0.467TyrPhe: 0.467 ± 0.225
2.335TyrGly: 2.335 ± 0.717
0.234TyrHis: 0.234 ± 0.213
0.0TyrIle: 0.0 ± 0.0
1.168TyrLys: 1.168 ± 0.342
2.335TyrLeu: 2.335 ± 0.574
0.934TyrMet: 0.934 ± 0.437
1.401TyrAsn: 1.401 ± 0.333
1.168TyrPro: 1.168 ± 0.511
0.234TyrGln: 0.234 ± 0.213
3.27TyrArg: 3.27 ± 0.834
2.569TyrSer: 2.569 ± 0.375
1.635TyrThr: 1.635 ± 0.509
2.802TyrVal: 2.802 ± 0.675
0.701TyrTrp: 0.701 ± 0.221
0.701TyrTyr: 0.701 ± 0.375
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (4283 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski