Amino acid dipeptide frequency for Maize yellow dwarf virus RMV

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
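As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping residue pairs in a set of protein sequences and reports them in per mille. The function name, the toy sequences, and the choice to count pairs only within each protein are assumptions made for this example; the counting convention used by the database is not stated on this page.

```python
from collections import Counter


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: sequences use one-letter codes and pairs are counted
    within each protein only (no pair spans two proteins).
    """
    counts = Counter()
    for seq in sequences:
        # Slide a window of length 2 along the sequence.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Hypothetical toy input, not taken from the proteome described here.
toy = ["MALLS", "GGSL"]
freqs = dipeptide_permille(toy)
for dp in sorted(freqs):
    print(dp, round(freqs[dp], 3))
```

The toy input yields seven dipeptides in total, so each observed pair contributes 1000/7 per mille.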

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
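The comparison against the uniform expectation can be sketched as follows. This is only an illustration: the page does not say which threshold the colouring uses, so the code assumes a plain comparison with 1/400 (2.5‰) and ignores the error estimates. The `classify` helper is hypothetical; the example values are taken from the table.

```python
# Uniform expectation for 400 dipeptides, expressed in per mille.
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5


def classify(observed_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"    # would correspond to red in the table
    if observed_permille < expected:
        return "under"   # would correspond to blue in the table
    return "expected"


# A few values copied from the table (per mille).
examples = {"LeuLeu": 12.487, "AlaAla": 4.725, "GluLys": 1.687, "CysPhe": 0.0}
labels = {dp: classify(value) for dp, value in examples.items()}
for dp, label in labels.items():
    print(dp, examples[dp], label)
```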

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second; each cell is the frequency ± error (‰).

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 4.725 ± 1.593 | 2.025 ± 0.834 | 3.712 ± 0.409 | 3.712 ± 1.024 | 2.362 ± 0.922 | 5.4 ± 2.107 | 1.012 ± 0.269 | 1.687 ± 0.328 | 3.037 ± 0.654 | 8.437 ± 2.016 | 0.675 ± 0.508 | 1.35 ± 0.584 | 4.725 ± 0.89 | 2.362 ± 0.854 | 5.062 ± 0.75 | 9.45 ± 1.308 | 2.025 ± 1.464 | 3.712 ± 0.71 | 0.337 ± 0.314 | 4.387 ± 1.206 | 0.0 ± 0.0
Cys | 1.35 ± 0.341 | 1.012 ± 0.412 | 0.337 ± 0.254 | 1.35 ± 0.341 | 0.0 ± 0.0 | 1.35 ± 0.711 | 0.0 ± 0.0 | 1.012 ± 0.495 | 1.35 ± 0.482 | 1.687 ± 0.804 | 0.337 ± 0.254 | 0.337 ± 0.434 | 0.337 ± 0.254 | 0.337 ± 0.254 | 1.012 ± 0.412 | 2.7 ± 0.823 | 0.675 ± 0.332 | 1.687 ± 0.61 | 0.337 ± 0.254 | 0.337 ± 0.434 | 0.0 ± 0.0
Asp | 4.05 ± 0.644 | 1.012 ± 0.495 | 4.05 ± 0.922 | 3.712 ± 0.666 | 1.687 ± 0.756 | 4.387 ± 0.815 | 0.675 ± 0.521 | 2.7 ± 1.068 | 0.675 ± 0.629 | 4.387 ± 0.65 | 1.012 ± 0.551 | 0.0 ± 0.0 | 2.7 ± 0.425 | 2.025 ± 0.977 | 1.687 ± 0.932 | 2.362 ± 1.515 | 2.7 ± 1.067 | 2.025 ± 0.584 | 1.012 ± 0.52 | 0.0 ± 0.0 | 0.0 ± 0.0
Glu | 6.75 ± 1.059 | 1.35 ± 0.976 | 4.725 ± 1.433 | 3.037 ± 0.28 | 3.712 ± 0.734 | 3.712 ± 0.782 | 0.337 ± 0.418 | 5.4 ± 0.93 | 1.687 ± 0.459 | 4.725 ± 2.28 | 0.0 ± 0.0 | 1.012 ± 0.526 | 1.012 ± 0.448 | 2.025 ± 0.876 | 1.35 ± 0.482 | 3.037 ± 0.87 | 4.387 ± 0.639 | 3.037 ± 0.869 | 0.675 ± 0.292 | 1.687 ± 0.514 | 0.0 ± 0.0
Phe | 3.037 ± 1.11 | 1.012 ± 0.495 | 1.012 ± 0.52 | 4.05 ± 1.979 | 0.0 ± 0.0 | 2.7 ± 0.603 | 1.35 ± 0.341 | 2.362 ± 1.48 | 2.7 ± 0.623 | 1.687 ± 0.505 | 0.0 ± 0.0 | 1.35 ± 0.665 | 0.337 ± 0.434 | 3.375 ± 0.807 | 2.025 ± 0.439 | 3.712 ± 1.157 | 2.025 ± 0.646 | 3.037 ± 0.669 | 1.012 ± 0.412 | 0.675 ± 0.629 | 0.0 ± 0.0
Gly | 2.362 ± 0.914 | 2.025 ± 0.389 | 2.362 ± 0.759 | 1.012 ± 0.799 | 4.05 ± 1.132 | 10.462 ± 2.774 | 0.0 ± 0.0 | 1.35 ± 0.46 | 3.375 ± 1.283 | 6.412 ± 1.639 | 0.337 ± 0.434 | 4.05 ± 1.158 | 2.7 ± 0.701 | 0.337 ± 0.314 | 6.75 ± 1.792 | 7.087 ± 1.887 | 4.725 ± 0.602 | 4.05 ± 0.792 | 3.037 ± 1.098 | 4.387 ± 0.519 | 0.0 ± 0.0
His | 0.675 ± 0.868 | 0.337 ± 0.254 | 1.012 ± 0.495 | 1.012 ± 0.709 | 1.687 ± 0.61 | 1.687 ± 0.505 | 0.0 ± 0.0 | 0.675 ± 0.406 | 1.35 ± 0.713 | 1.687 ± 0.732 | 0.337 ± 0.434 | 2.362 ± 0.444 | 0.675 ± 0.292 | 1.35 ± 0.355 | 0.675 ± 0.332 | 1.687 ± 0.595 | 0.675 ± 0.49 | 1.35 ± 0.665 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Ile | 5.062 ± 1.393 | 0.675 ± 0.521 | 2.362 ± 0.801 | 3.712 ± 0.734 | 1.35 ± 0.465 | 1.35 ± 0.341 | 1.687 ± 0.61 | 0.675 ± 0.332 | 1.012 ± 0.269 | 5.737 ± 1.027 | 0.337 ± 0.254 | 1.687 ± 0.487 | 5.4 ± 1.093 | 1.687 ± 0.569 | 3.712 ± 1.771 | 6.075 ± 1.91 | 1.35 ± 0.979 | 1.687 ± 0.487 | 0.0 ± 0.0 | 2.362 ± 0.47 | 0.0 ± 0.0
Lys | 3.712 ± 1.041 | 0.337 ± 0.254 | 3.375 ± 1.204 | 1.012 ± 0.52 | 1.687 ± 0.328 | 2.7 ± 1.531 | 2.7 ± 1.282 | 3.712 ± 0.936 | 2.025 ± 0.848 | 5.4 ± 0.85 | 1.35 ± 0.469 | 1.35 ± 0.52 | 2.362 ± 0.685 | 3.375 ± 1.452 | 3.037 ± 0.532 | 4.387 ± 0.907 | 3.712 ± 1.453 | 3.375 ± 0.565 | 0.0 ± 0.0 | 1.35 ± 0.359 | 0.337 ± 0.314
Leu | 8.775 ± 1.635 | 1.012 ± 0.448 | 2.7 ± 0.736 | 5.062 ± 1.085 | 4.725 ± 1.544 | 4.387 ± 1.073 | 2.025 ± 0.427 | 4.05 ± 0.9 | 2.362 ± 0.524 | 12.487 ± 3.763 | 1.687 ± 0.829 | 3.037 ± 0.794 | 4.05 ± 1.026 | 3.037 ± 0.637 | 6.75 ± 1.572 | 10.8 ± 0.789 | 7.087 ± 0.468 | 7.425 ± 1.707 | 1.35 ± 0.475 | 3.375 ± 0.707 | 0.0 ± 0.0
Met | 1.35 ± 0.713 | 0.0 ± 0.0 | 0.337 ± 0.418 | 1.35 ± 0.58 | 0.0 ± 0.0 | 0.337 ± 0.254 | 0.0 ± 0.0 | 1.012 ± 0.495 | 0.337 ± 0.314 | 2.025 ± 0.699 | 0.0 ± 0.0 | 0.675 ± 0.49 | 0.0 ± 0.0 | 0.337 ± 0.434 | 1.687 ± 0.61 | 2.7 ± 0.844 | 1.012 ± 0.551 | 1.687 ± 0.53 | 0.0 ± 0.0 | 0.337 ± 0.434 | 0.0 ± 0.0
Asn | 2.362 ± 0.891 | 0.0 ± 0.0 | 2.025 ± 0.848 | 1.35 ± 0.713 | 1.35 ± 0.611 | 3.037 ± 0.541 | 0.0 ± 0.0 | 0.675 ± 0.332 | 3.375 ± 0.86 | 4.05 ± 0.768 | 0.337 ± 0.298 | 2.025 ± 0.785 | 2.025 ± 0.621 | 2.025 ± 0.366 | 2.7 ± 0.457 | 5.737 ± 1.397 | 3.375 ± 1.608 | 0.337 ± 0.314 | 2.025 ± 0.628 | 0.0 ± 0.0 | 0.0 ± 0.0
Pro | 4.05 ± 0.831 | 0.0 ± 0.0 | 1.687 ± 0.88 | 2.025 ± 0.872 | 0.675 ± 0.406 | 3.712 ± 1.442 | 1.012 ± 0.402 | 1.35 ± 0.475 | 4.725 ± 0.775 | 1.687 ± 0.477 | 1.012 ± 0.709 | 2.362 ± 0.926 | 5.062 ± 1.009 | 2.7 ± 0.681 | 2.025 ± 0.675 | 4.387 ± 1.508 | 8.1 ± 3.328 | 4.725 ± 0.907 | 0.675 ± 0.332 | 2.025 ± 0.388 | 0.0 ± 0.0
Gln | 2.362 ± 0.744 | 0.675 ± 0.508 | 0.337 ± 0.254 | 1.687 ± 0.544 | 1.012 ± 0.402 | 2.025 ± 0.99 | 1.012 ± 0.392 | 1.687 ± 0.708 | 4.387 ± 0.7 | 4.05 ± 0.74 | 0.675 ± 0.332 | 4.725 ± 1.391 | 2.025 ± 1.039 | 1.687 ± 0.666 | 3.375 ± 0.672 | 3.037 ± 0.752 | 1.35 ± 0.58 | 2.025 ± 1.264 | 1.012 ± 0.456 | 0.675 ± 0.533 | 0.0 ± 0.0
Arg | 4.387 ± 1.041 | 0.675 ± 0.868 | 3.712 ± 1.77 | 3.375 ± 0.671 | 0.675 ± 0.649 | 7.425 ± 2.112 | 1.35 ± 0.713 | 4.725 ± 1.006 | 2.362 ± 0.472 | 9.112 ± 1.544 | 1.012 ± 0.317 | 2.362 ± 0.501 | 3.375 ± 1.541 | 1.012 ± 0.495 | 10.125 ± 4.148 | 5.062 ± 1.134 | 4.387 ± 0.919 | 3.037 ± 1.337 | 1.35 ± 0.475 | 3.712 ± 1.442 | 0.0 ± 0.0
Ser | 3.712 ± 1.592 | 2.025 ± 0.997 | 2.025 ± 0.426 | 5.062 ± 1.551 | 4.05 ± 0.751 | 8.775 ± 2.217 | 2.025 ± 0.607 | 7.087 ± 1.161 | 6.075 ± 1.284 | 10.125 ± 0.973 | 1.35 ± 0.665 | 4.05 ± 1.979 | 3.712 ± 1.6 | 3.037 ± 0.602 | 7.425 ± 2.024 | 11.475 ± 1.393 | 3.712 ± 0.922 | 7.425 ± 0.591 | 0.675 ± 0.521 | 3.375 ± 0.769 | 0.0 ± 0.0
Thr | 3.375 ± 0.6 | 1.012 ± 0.495 | 2.025 ± 0.99 | 2.362 ± 0.983 | 4.05 ± 0.881 | 2.362 ± 0.715 | 1.35 ± 0.665 | 3.712 ± 1.196 | 3.712 ± 0.736 | 5.4 ± 0.632 | 1.687 ± 0.514 | 2.362 ± 0.679 | 7.762 ± 2.988 | 4.387 ± 1.595 | 4.725 ± 0.879 | 4.725 ± 0.978 | 3.712 ± 0.574 | 2.7 ± 1.016 | 1.35 ± 0.465 | 1.012 ± 0.551 | 0.0 ± 0.0
Val | 5.062 ± 0.963 | 1.687 ± 0.61 | 2.7 ± 0.457 | 6.412 ± 1.783 | 2.362 ± 0.524 | 3.375 ± 1.288 | 1.687 ± 0.5 | 3.037 ± 0.696 | 2.7 ± 0.91 | 3.037 ± 1.269 | 1.35 ± 0.675 | 1.687 ± 0.561 | 4.387 ± 0.983 | 3.037 ± 1.098 | 4.387 ± 0.74 | 4.387 ± 1.266 | 3.712 ± 0.865 | 4.725 ± 1.753 | 1.012 ± 0.402 | 1.012 ± 0.551 | 0.0 ± 0.0
Trp | 1.687 ± 0.459 | 0.675 ± 0.332 | 0.337 ± 0.434 | 1.687 ± 0.626 | 0.0 ± 0.0 | 0.675 ± 0.292 | 0.0 ± 0.0 | 0.337 ± 0.254 | 0.675 ± 0.332 | 1.687 ± 0.666 | 1.012 ± 0.495 | 0.675 ± 0.332 | 1.35 ± 0.713 | 0.675 ± 0.381 | 2.025 ± 0.887 | 2.362 ± 0.73 | 0.0 ± 0.0 | 0.675 ± 0.332 | 0.0 ± 0.0 | 0.337 ± 0.314 | 0.0 ± 0.0
Tyr | 1.35 ± 0.739 | 0.0 ± 0.0 | 2.025 ± 1.098 | 0.675 ± 0.49 | 2.025 ± 0.87 | 1.012 ± 0.768 | 1.012 ± 0.402 | 0.675 ± 0.629 | 3.375 ± 0.692 | 2.362 ± 0.827 | 0.337 ± 0.304 | 1.35 ± 0.812 | 0.0 ± 0.0 | 1.012 ± 0.495 | 3.037 ± 0.709 | 2.025 ± 1.642 | 5.062 ± 0.901 | 2.7 ± 1.145 | 1.012 ± 0.768 | 0.675 ± 0.292 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.337 ± 0.314 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 6 proteins (2964 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
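A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values is taken as the error. The function names, the use of the sample standard deviation as the error, and the toy sequences are assumptions for illustration; the page only states that bootstrapping (×100) was done at the protein level.

```python
import random
from statistics import stdev


def pair_permille(sequences, dipeptide):
    """Frequency (per mille) of one dipeptide over overlapping residue pairs."""
    hits = total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Spread of the frequency over protein-level resamples (assumed: std. dev.)."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, dipeptide))
    return stdev(estimates)


# Hypothetical toy proteome, not the six proteins described on this page.
toy = ["MALLS", "GGSL", "LLLL"]
err = bootstrap_error(toy, "LL")
print(round(pair_permille(toy, "LL"), 3), "±", round(err, 3))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the error reflects how much the estimate depends on which proteins are included.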

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski