Amino acid dipepetide frequency for Maize associated rhabdovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.306AlaAla: 3.306 ± 1.125
0.275AlaCys: 0.275 ± 0.422
2.755AlaAsp: 2.755 ± 0.741
3.581AlaGlu: 3.581 ± 1.762
2.204AlaPhe: 2.204 ± 1.92
3.03AlaGly: 3.03 ± 0.896
1.377AlaHis: 1.377 ± 0.426
3.03AlaIle: 3.03 ± 1.163
5.51AlaLys: 5.51 ± 1.233
4.408AlaLeu: 4.408 ± 1.067
2.755AlaMet: 2.755 ± 1.413
1.653AlaAsn: 1.653 ± 0.636
1.653AlaPro: 1.653 ± 0.729
1.653AlaGln: 1.653 ± 0.739
3.306AlaArg: 3.306 ± 0.626
6.887AlaSer: 6.887 ± 1.849
3.03AlaThr: 3.03 ± 0.904
2.479AlaVal: 2.479 ± 0.946
0.275AlaTrp: 0.275 ± 0.17
1.102AlaTyr: 1.102 ± 0.505
0.0AlaXaa: 0.0 ± 0.0
Cys
0.551CysAla: 0.551 ± 0.383
0.275CysCys: 0.275 ± 0.488
0.275CysAsp: 0.275 ± 0.17
0.551CysGlu: 0.551 ± 0.353
0.826CysPhe: 0.826 ± 0.425
0.0CysGly: 0.0 ± 0.0
0.826CysHis: 0.826 ± 0.425
1.102CysIle: 1.102 ± 0.707
1.102CysLys: 1.102 ± 0.557
2.204CysLeu: 2.204 ± 1.173
0.275CysMet: 0.275 ± 0.527
1.102CysAsn: 1.102 ± 0.895
1.928CysPro: 1.928 ± 1.039
0.826CysGln: 0.826 ± 0.361
1.102CysArg: 1.102 ± 1.176
1.653CysSer: 1.653 ± 1.06
1.377CysThr: 1.377 ± 0.802
0.275CysVal: 0.275 ± 0.422
0.551CysTrp: 0.551 ± 0.341
0.275CysTyr: 0.275 ± 0.17
0.0CysXaa: 0.0 ± 0.0
Asp
3.306AspAla: 3.306 ± 2.132
1.377AspCys: 1.377 ± 0.613
3.581AspAsp: 3.581 ± 1.488
4.959AspGlu: 4.959 ± 1.909
2.204AspPhe: 2.204 ± 1.394
1.928AspGly: 1.928 ± 1.04
0.826AspHis: 0.826 ± 0.414
5.234AspIle: 5.234 ± 1.481
3.03AspLys: 3.03 ± 0.726
4.683AspLeu: 4.683 ± 0.624
0.551AspMet: 0.551 ± 0.341
2.755AspAsn: 2.755 ± 1.339
2.479AspPro: 2.479 ± 0.885
0.551AspGln: 0.551 ± 0.341
1.653AspArg: 1.653 ± 0.703
2.755AspSer: 2.755 ± 1.122
1.928AspThr: 1.928 ± 1.193
4.408AspVal: 4.408 ± 1.097
0.826AspTrp: 0.826 ± 0.361
1.928AspTyr: 1.928 ± 1.295
0.0AspXaa: 0.0 ± 0.0
Glu
4.408GluAla: 4.408 ± 1.71
1.102GluCys: 1.102 ± 0.895
3.581GluAsp: 3.581 ± 1.388
4.959GluGlu: 4.959 ± 3.171
2.479GluPhe: 2.479 ± 0.843
3.306GluGly: 3.306 ± 1.502
2.479GluHis: 2.479 ± 0.738
7.163GluIle: 7.163 ± 0.704
4.959GluLys: 4.959 ± 1.707
6.061GluLeu: 6.061 ± 1.053
2.204GluMet: 2.204 ± 0.724
3.581GluAsn: 3.581 ± 1.072
2.204GluPro: 2.204 ± 1.427
3.306GluGln: 3.306 ± 0.96
2.755GluArg: 2.755 ± 1.352
3.857GluSer: 3.857 ± 1.375
3.306GluThr: 3.306 ± 0.995
3.306GluVal: 3.306 ± 1.085
1.653GluTrp: 1.653 ± 0.818
3.306GluTyr: 3.306 ± 0.957
0.0GluXaa: 0.0 ± 0.0
Phe
1.928PheAla: 1.928 ± 0.573
0.551PheCys: 0.551 ± 0.341
0.826PheAsp: 0.826 ± 0.511
3.306PheGlu: 3.306 ± 1.443
1.928PhePhe: 1.928 ± 0.799
3.03PheGly: 3.03 ± 1.521
0.826PheHis: 0.826 ± 0.511
1.928PheIle: 1.928 ± 0.681
3.306PheLys: 3.306 ± 1.301
5.234PheLeu: 5.234 ± 2.026
0.0PheMet: 0.0 ± 0.0
2.479PheAsn: 2.479 ± 1.006
1.928PhePro: 1.928 ± 0.588
2.204PheGln: 2.204 ± 0.884
2.204PheArg: 2.204 ± 0.504
5.785PheSer: 5.785 ± 1.939
2.204PheThr: 2.204 ± 1.474
1.377PheVal: 1.377 ± 1.111
0.275PheTrp: 0.275 ± 0.17
1.102PheTyr: 1.102 ± 0.44
0.0PheXaa: 0.0 ± 0.0
Gly
1.653GlyAla: 1.653 ± 0.739
0.826GlyCys: 0.826 ± 0.425
3.306GlyAsp: 3.306 ± 1.15
4.959GlyGlu: 4.959 ± 0.377
1.928GlyPhe: 1.928 ± 0.558
3.03GlyGly: 3.03 ± 0.846
0.826GlyHis: 0.826 ± 0.511
5.785GlyIle: 5.785 ± 1.293
3.03GlyLys: 3.03 ± 2.135
6.061GlyLeu: 6.061 ± 1.111
1.928GlyMet: 1.928 ± 1.084
1.377GlyAsn: 1.377 ± 0.608
1.928GlyPro: 1.928 ± 0.615
1.102GlyGln: 1.102 ± 0.464
3.581GlyArg: 3.581 ± 0.967
3.03GlySer: 3.03 ± 0.991
2.755GlyThr: 2.755 ± 0.665
2.204GlyVal: 2.204 ± 1.035
1.653GlyTrp: 1.653 ± 0.586
1.928GlyTyr: 1.928 ± 0.549
0.0GlyXaa: 0.0 ± 0.0
His
1.377HisAla: 1.377 ± 0.779
0.551HisCys: 0.551 ± 0.353
0.275HisAsp: 0.275 ± 0.17
0.826HisGlu: 0.826 ± 0.361
0.551HisPhe: 0.551 ± 0.341
1.102HisGly: 1.102 ± 0.351
1.102HisHis: 1.102 ± 0.464
2.755HisIle: 2.755 ± 0.707
1.928HisLys: 1.928 ± 1.085
2.755HisLeu: 2.755 ± 0.665
0.551HisMet: 0.551 ± 0.477
0.826HisAsn: 0.826 ± 0.759
1.377HisPro: 1.377 ± 0.852
0.826HisGln: 0.826 ± 0.887
1.377HisArg: 1.377 ± 0.561
2.204HisSer: 2.204 ± 0.631
1.102HisThr: 1.102 ± 0.549
1.102HisVal: 1.102 ± 0.489
0.275HisTrp: 0.275 ± 0.422
0.826HisTyr: 0.826 ± 0.511
0.0HisXaa: 0.0 ± 0.0
Ile
5.51IleAla: 5.51 ± 0.432
1.377IleCys: 1.377 ± 1.282
4.959IleAsp: 4.959 ± 2.033
4.408IleGlu: 4.408 ± 1.633
4.683IlePhe: 4.683 ± 1.24
4.408IleGly: 4.408 ± 0.947
1.653IleHis: 1.653 ± 0.721
4.959IleIle: 4.959 ± 0.643
4.959IleLys: 4.959 ± 1.216
7.163IleLeu: 7.163 ± 0.934
0.826IleMet: 0.826 ± 0.713
4.132IleAsn: 4.132 ± 1.236
4.408IlePro: 4.408 ± 1.219
2.479IleGln: 2.479 ± 0.993
3.03IleArg: 3.03 ± 1.524
8.264IleSer: 8.264 ± 2.74
6.336IleThr: 6.336 ± 1.11
4.132IleVal: 4.132 ± 0.839
0.275IleTrp: 0.275 ± 0.17
2.204IleTyr: 2.204 ± 0.824
0.0IleXaa: 0.0 ± 0.0
Lys
3.857LysAla: 3.857 ± 1.719
1.377LysCys: 1.377 ± 0.561
4.408LysAsp: 4.408 ± 1.894
5.785LysGlu: 5.785 ± 0.657
2.204LysPhe: 2.204 ± 0.748
4.683LysGly: 4.683 ± 1.471
0.551LysHis: 0.551 ± 0.341
6.061LysIle: 6.061 ± 1.269
7.163LysLys: 7.163 ± 4.223
5.785LysLeu: 5.785 ± 1.821
1.928LysMet: 1.928 ± 0.918
3.306LysAsn: 3.306 ± 1.418
2.204LysPro: 2.204 ± 1.068
2.479LysGln: 2.479 ± 0.785
3.306LysArg: 3.306 ± 0.486
1.653LysSer: 1.653 ± 1.536
4.683LysThr: 4.683 ± 1.782
4.408LysVal: 4.408 ± 1.135
1.928LysTrp: 1.928 ± 0.923
1.928LysTyr: 1.928 ± 1.665
0.0LysXaa: 0.0 ± 0.0
Leu
3.857LeuAla: 3.857 ± 1.396
1.653LeuCys: 1.653 ± 0.311
4.683LeuAsp: 4.683 ± 0.98
6.612LeuGlu: 6.612 ± 1.731
3.306LeuPhe: 3.306 ± 1.581
4.959LeuGly: 4.959 ± 1.732
2.755LeuHis: 2.755 ± 1.259
7.438LeuIle: 7.438 ± 1.758
7.989LeuLys: 7.989 ± 0.474
8.264LeuLeu: 8.264 ± 1.509
4.683LeuMet: 4.683 ± 1.948
3.306LeuAsn: 3.306 ± 1.045
3.581LeuPro: 3.581 ± 1.271
1.928LeuGln: 1.928 ± 0.923
6.336LeuArg: 6.336 ± 1.688
7.989LeuSer: 7.989 ± 0.792
4.959LeuThr: 4.959 ± 1.94
2.755LeuVal: 2.755 ± 0.431
1.653LeuTrp: 1.653 ± 0.721
3.581LeuTyr: 3.581 ± 1.084
0.0LeuXaa: 0.0 ± 0.0
Met
3.03MetAla: 3.03 ± 0.626
0.551MetCys: 0.551 ± 0.43
1.653MetAsp: 1.653 ± 0.739
1.102MetGlu: 1.102 ± 0.877
0.826MetPhe: 0.826 ± 0.437
2.204MetGly: 2.204 ± 0.63
0.551MetHis: 0.551 ± 0.353
3.306MetIle: 3.306 ± 0.672
2.204MetLys: 2.204 ± 0.813
2.204MetLeu: 2.204 ± 0.759
1.653MetMet: 1.653 ± 1.029
1.377MetAsn: 1.377 ± 0.862
1.377MetPro: 1.377 ± 0.531
0.0MetGln: 0.0 ± 0.0
2.479MetArg: 2.479 ± 1.183
1.928MetSer: 1.928 ± 1.39
2.479MetThr: 2.479 ± 0.828
1.653MetVal: 1.653 ± 1.0
0.275MetTrp: 0.275 ± 0.17
0.826MetTyr: 0.826 ± 0.511
0.0MetXaa: 0.0 ± 0.0
Asn
2.755AsnAla: 2.755 ± 1.126
0.551AsnCys: 0.551 ± 0.543
1.928AsnAsp: 1.928 ± 1.121
2.479AsnGlu: 2.479 ± 1.169
2.204AsnPhe: 2.204 ± 0.927
0.826AsnGly: 0.826 ± 0.759
0.826AsnHis: 0.826 ± 0.361
5.51AsnIle: 5.51 ± 0.878
2.479AsnLys: 2.479 ± 0.942
4.408AsnLeu: 4.408 ± 1.295
1.653AsnMet: 1.653 ± 0.429
1.653AsnAsn: 1.653 ± 0.484
3.306AsnPro: 3.306 ± 0.486
0.826AsnGln: 0.826 ± 0.44
1.377AsnArg: 1.377 ± 0.446
3.581AsnSer: 3.581 ± 0.73
2.204AsnThr: 2.204 ± 0.691
2.755AsnVal: 2.755 ± 1.074
1.377AsnTrp: 1.377 ± 0.561
0.826AsnTyr: 0.826 ± 0.437
0.0AsnXaa: 0.0 ± 0.0
Pro
2.479ProAla: 2.479 ± 0.828
0.0ProCys: 0.0 ± 0.0
2.755ProAsp: 2.755 ± 1.389
4.132ProGlu: 4.132 ± 2.264
1.377ProPhe: 1.377 ± 0.561
3.306ProGly: 3.306 ± 0.957
0.826ProHis: 0.826 ± 0.437
2.479ProIle: 2.479 ± 1.082
4.132ProLys: 4.132 ± 0.888
3.581ProLeu: 3.581 ± 1.854
1.928ProMet: 1.928 ± 0.642
3.03ProAsn: 3.03 ± 0.96
3.03ProPro: 3.03 ± 2.024
1.102ProGln: 1.102 ± 0.351
1.653ProArg: 1.653 ± 0.311
5.51ProSer: 5.51 ± 1.592
3.581ProThr: 3.581 ± 1.274
1.377ProVal: 1.377 ± 0.469
0.275ProTrp: 0.275 ± 0.422
2.204ProTyr: 2.204 ± 0.801
0.0ProXaa: 0.0 ± 0.0
Gln
1.928GlnAla: 1.928 ± 0.573
0.826GlnCys: 0.826 ± 0.361
1.102GlnAsp: 1.102 ± 0.505
1.653GlnGlu: 1.653 ± 0.591
1.377GlnPhe: 1.377 ± 0.629
1.928GlnGly: 1.928 ± 0.923
1.653GlnHis: 1.653 ± 0.729
1.377GlnIle: 1.377 ± 0.852
1.653GlnLys: 1.653 ± 0.518
2.204GlnLeu: 2.204 ± 1.035
1.102GlnMet: 1.102 ± 0.785
0.826GlnAsn: 0.826 ± 0.699
0.551GlnPro: 0.551 ± 0.353
0.551GlnGln: 0.551 ± 0.847
1.377GlnArg: 1.377 ± 0.629
2.204GlnSer: 2.204 ± 1.015
1.928GlnThr: 1.928 ± 0.573
2.204GlnVal: 2.204 ± 1.52
0.275GlnTrp: 0.275 ± 0.17
1.377GlnTyr: 1.377 ± 0.852
0.0GlnXaa: 0.0 ± 0.0
Arg
2.204ArgAla: 2.204 ± 0.504
0.826ArgCys: 0.826 ± 0.759
1.928ArgAsp: 1.928 ± 0.55
3.857ArgGlu: 3.857 ± 1.235
1.653ArgPhe: 1.653 ± 0.989
2.755ArgGly: 2.755 ± 1.001
0.551ArgHis: 0.551 ± 0.671
3.581ArgIle: 3.581 ± 0.967
3.03ArgLys: 3.03 ± 0.731
5.234ArgLeu: 5.234 ± 1.609
3.03ArgMet: 3.03 ± 0.704
2.479ArgAsn: 2.479 ± 0.828
3.03ArgPro: 3.03 ± 1.139
1.928ArgGln: 1.928 ± 0.855
2.204ArgArg: 2.204 ± 1.363
3.581ArgSer: 3.581 ± 0.935
2.755ArgThr: 2.755 ± 0.665
2.755ArgVal: 2.755 ± 0.403
0.275ArgTrp: 0.275 ± 0.422
3.306ArgTyr: 3.306 ± 0.96
0.0ArgXaa: 0.0 ± 0.0
Ser
4.132SerAla: 4.132 ± 1.479
3.03SerCys: 3.03 ± 1.944
4.683SerAsp: 4.683 ± 0.869
5.785SerGlu: 5.785 ± 2.09
1.928SerPhe: 1.928 ± 0.265
4.959SerGly: 4.959 ± 1.296
2.479SerHis: 2.479 ± 0.618
6.061SerIle: 6.061 ± 1.5
4.408SerLys: 4.408 ± 0.931
6.061SerLeu: 6.061 ± 1.309
1.928SerMet: 1.928 ± 0.618
4.683SerAsn: 4.683 ± 1.209
3.581SerPro: 3.581 ± 0.749
2.204SerGln: 2.204 ± 0.813
4.683SerArg: 4.683 ± 1.728
6.612SerSer: 6.612 ± 1.435
4.683SerThr: 4.683 ± 1.131
5.234SerVal: 5.234 ± 1.84
1.928SerTrp: 1.928 ± 0.573
3.03SerTyr: 3.03 ± 0.896
0.0SerXaa: 0.0 ± 0.0
Thr
3.581ThrAla: 3.581 ± 0.635
0.826ThrCys: 0.826 ± 0.794
2.479ThrAsp: 2.479 ± 0.885
4.408ThrGlu: 4.408 ± 0.753
4.683ThrPhe: 4.683 ± 2.082
1.928ThrGly: 1.928 ± 0.977
1.377ThrHis: 1.377 ± 0.564
5.234ThrIle: 5.234 ± 1.191
3.03ThrLys: 3.03 ± 0.44
6.061ThrLeu: 6.061 ± 1.815
1.102ThrMet: 1.102 ± 0.681
1.377ThrAsn: 1.377 ± 0.676
4.132ThrPro: 4.132 ± 1.145
1.377ThrGln: 1.377 ± 0.446
2.479ThrArg: 2.479 ± 0.944
4.959ThrSer: 4.959 ± 1.573
1.928ThrThr: 1.928 ± 1.658
3.857ThrVal: 3.857 ± 0.659
1.377ThrTrp: 1.377 ± 0.353
2.204ThrTyr: 2.204 ± 0.603
0.0ThrXaa: 0.0 ± 0.0
Val
2.204ValAla: 2.204 ± 0.927
0.826ValCys: 0.826 ± 0.759
3.306ValAsp: 3.306 ± 1.443
2.755ValGlu: 2.755 ± 1.497
3.306ValPhe: 3.306 ± 0.547
3.03ValGly: 3.03 ± 1.322
0.826ValHis: 0.826 ± 0.414
4.132ValIle: 4.132 ± 1.994
3.306ValLys: 3.306 ± 1.631
2.755ValLeu: 2.755 ± 1.339
1.377ValMet: 1.377 ± 0.852
1.653ValAsn: 1.653 ± 1.571
3.581ValPro: 3.581 ± 1.019
0.826ValGln: 0.826 ± 0.361
2.479ValArg: 2.479 ± 1.679
5.51ValSer: 5.51 ± 1.421
4.132ValThr: 4.132 ± 1.161
2.204ValVal: 2.204 ± 0.909
0.826ValTrp: 0.826 ± 0.361
1.928ValTyr: 1.928 ± 0.594
0.0ValXaa: 0.0 ± 0.0
Trp
1.377TrpAla: 1.377 ± 0.778
0.0TrpCys: 0.0 ± 0.0
1.102TrpAsp: 1.102 ± 0.505
1.102TrpGlu: 1.102 ± 0.681
1.102TrpPhe: 1.102 ± 0.681
0.551TrpGly: 0.551 ± 0.341
0.275TrpHis: 0.275 ± 0.499
0.826TrpIle: 0.826 ± 0.639
1.102TrpLys: 1.102 ± 0.681
1.377TrpLeu: 1.377 ± 0.779
0.826TrpMet: 0.826 ± 0.44
1.102TrpAsn: 1.102 ± 0.44
0.0TrpPro: 0.0 ± 0.0
0.826TrpGln: 0.826 ± 0.361
1.653TrpArg: 1.653 ± 1.022
0.0TrpSer: 0.0 ± 0.0
1.377TrpThr: 1.377 ± 0.694
1.102TrpVal: 1.102 ± 0.351
0.0TrpTrp: 0.0 ± 0.0
0.826TrpTyr: 0.826 ± 0.759
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.826TyrAla: 0.826 ± 0.525
0.275TyrCys: 0.275 ± 0.17
1.377TyrAsp: 1.377 ± 0.608
2.755TyrGlu: 2.755 ± 0.9
1.653TyrPhe: 1.653 ± 0.772
1.928TyrGly: 1.928 ± 0.598
1.102TyrHis: 1.102 ± 0.578
2.204TyrIle: 2.204 ± 1.139
1.377TyrLys: 1.377 ± 0.69
5.785TyrLeu: 5.785 ± 1.499
1.102TyrMet: 1.102 ± 1.067
0.826TyrAsn: 0.826 ± 0.511
2.479TyrPro: 2.479 ± 0.568
1.102TyrGln: 1.102 ± 0.681
1.928TyrArg: 1.928 ± 0.573
4.132TyrSer: 4.132 ± 1.166
1.653TyrThr: 1.653 ± 0.919
1.377TyrVal: 1.377 ± 0.802
0.551TyrTrp: 0.551 ± 0.341
1.102TyrTyr: 1.102 ± 0.44
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3631 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski