Amino acid dipepetide frequency for Fig mosaic emaravirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.93AlaAla: 0.93 ± 0.48
0.465AlaCys: 0.465 ± 0.347
3.718AlaAsp: 3.718 ± 0.83
1.859AlaGlu: 1.859 ± 0.677
1.627AlaPhe: 1.627 ± 0.535
2.324AlaGly: 2.324 ± 1.673
1.394AlaHis: 1.394 ± 0.846
3.718AlaIle: 3.718 ± 0.757
2.789AlaLys: 2.789 ± 0.702
3.486AlaLeu: 3.486 ± 1.69
1.162AlaMet: 1.162 ± 0.636
1.627AlaAsn: 1.627 ± 0.268
1.162AlaPro: 1.162 ± 0.33
1.162AlaGln: 1.162 ± 0.344
1.859AlaArg: 1.859 ± 0.892
3.021AlaSer: 3.021 ± 1.465
2.556AlaThr: 2.556 ± 1.26
1.627AlaVal: 1.627 ± 2.04
0.0AlaTrp: 0.0 ± 0.0
2.324AlaTyr: 2.324 ± 0.685
0.0AlaXaa: 0.0 ± 0.0
Cys
0.465CysAla: 0.465 ± 0.804
0.0CysCys: 0.0 ± 0.0
0.93CysAsp: 0.93 ± 0.789
1.627CysGlu: 1.627 ± 0.558
0.93CysPhe: 0.93 ± 0.743
0.697CysGly: 0.697 ± 0.482
0.465CysHis: 0.465 ± 0.347
2.092CysIle: 2.092 ± 0.545
2.324CysLys: 2.324 ± 0.73
1.394CysLeu: 1.394 ± 0.596
0.697CysMet: 0.697 ± 0.307
2.092CysAsn: 2.092 ± 0.932
0.465CysPro: 0.465 ± 0.233
1.859CysGln: 1.859 ± 0.881
0.232CysArg: 0.232 ± 0.136
0.93CysSer: 0.93 ± 0.365
0.465CysThr: 0.465 ± 0.272
0.465CysVal: 0.465 ± 0.233
0.0CysTrp: 0.0 ± 0.0
0.93CysTyr: 0.93 ± 0.731
0.0CysXaa: 0.0 ± 0.0
Asp
3.021AspAla: 3.021 ± 0.667
1.394AspCys: 1.394 ± 0.7
3.718AspAsp: 3.718 ± 1.147
4.648AspGlu: 4.648 ± 1.053
3.951AspPhe: 3.951 ± 1.015
2.556AspGly: 2.556 ± 0.899
1.162AspHis: 1.162 ± 0.446
7.901AspIle: 7.901 ± 2.3
4.88AspLys: 4.88 ± 1.054
6.042AspLeu: 6.042 ± 1.755
1.162AspMet: 1.162 ± 0.428
4.88AspAsn: 4.88 ± 0.789
2.789AspPro: 2.789 ± 0.419
1.162AspGln: 1.162 ± 0.314
1.394AspArg: 1.394 ± 0.452
3.021AspSer: 3.021 ± 0.932
4.88AspThr: 4.88 ± 0.706
5.113AspVal: 5.113 ± 0.304
0.465AspTrp: 0.465 ± 0.489
3.951AspTyr: 3.951 ± 1.494
0.0AspXaa: 0.0 ± 0.0
Glu
2.789GluAla: 2.789 ± 1.568
0.93GluCys: 0.93 ± 0.625
4.416GluAsp: 4.416 ± 1.634
2.789GluGlu: 2.789 ± 0.479
4.183GluPhe: 4.183 ± 0.991
1.394GluGly: 1.394 ± 0.494
1.162GluHis: 1.162 ± 0.709
6.739GluIle: 6.739 ± 1.784
4.648GluLys: 4.648 ± 1.062
6.507GluLeu: 6.507 ± 1.229
2.324GluMet: 2.324 ± 0.704
2.556GluAsn: 2.556 ± 0.764
1.627GluPro: 1.627 ± 0.494
0.697GluGln: 0.697 ± 0.34
1.859GluArg: 1.859 ± 0.352
3.254GluSer: 3.254 ± 0.48
3.718GluThr: 3.718 ± 0.794
3.021GluVal: 3.021 ± 1.176
0.232GluTrp: 0.232 ± 0.136
3.486GluTyr: 3.486 ± 1.248
0.0GluXaa: 0.0 ± 0.0
Phe
0.93PheAla: 0.93 ± 0.542
0.697PheCys: 0.697 ± 0.298
2.789PheAsp: 2.789 ± 1.169
1.627PheGlu: 1.627 ± 0.457
1.859PhePhe: 1.859 ± 0.672
2.324PheGly: 2.324 ± 0.894
1.394PheHis: 1.394 ± 0.585
2.556PheIle: 2.556 ± 1.317
1.859PheLys: 1.859 ± 0.443
3.951PheLeu: 3.951 ± 1.462
2.092PheMet: 2.092 ± 0.473
4.648PheAsn: 4.648 ± 1.263
1.394PhePro: 1.394 ± 0.75
0.93PheGln: 0.93 ± 0.631
2.092PheArg: 2.092 ± 1.218
3.021PheSer: 3.021 ± 0.76
3.254PheThr: 3.254 ± 0.654
3.021PheVal: 3.021 ± 1.115
0.0PheTrp: 0.0 ± 0.0
3.021PheTyr: 3.021 ± 0.642
0.0PheXaa: 0.0 ± 0.0
Gly
0.465GlyAla: 0.465 ± 0.272
1.162GlyCys: 1.162 ± 1.172
2.789GlyAsp: 2.789 ± 1.2
1.162GlyGlu: 1.162 ± 0.635
2.556GlyPhe: 2.556 ± 0.919
0.232GlyGly: 0.232 ± 0.136
0.465GlyHis: 0.465 ± 0.272
1.627GlyIle: 1.627 ± 0.331
3.718GlyLys: 3.718 ± 1.141
2.789GlyLeu: 2.789 ± 0.507
1.162GlyMet: 1.162 ± 0.344
4.416GlyAsn: 4.416 ± 1.535
0.465GlyPro: 0.465 ± 0.233
0.93GlyGln: 0.93 ± 0.744
1.394GlyArg: 1.394 ± 0.548
3.021GlySer: 3.021 ± 1.147
1.627GlyThr: 1.627 ± 0.649
0.93GlyVal: 0.93 ± 0.293
0.0GlyTrp: 0.0 ± 0.0
3.254GlyTyr: 3.254 ± 0.378
0.0GlyXaa: 0.0 ± 0.0
His
0.697HisAla: 0.697 ± 0.352
0.697HisCys: 0.697 ± 0.274
1.627HisAsp: 1.627 ± 0.899
2.092HisGlu: 2.092 ± 0.703
1.162HisPhe: 1.162 ± 0.36
1.394HisGly: 1.394 ± 0.7
0.93HisHis: 0.93 ± 0.301
2.092HisIle: 2.092 ± 0.595
1.627HisLys: 1.627 ± 0.606
2.789HisLeu: 2.789 ± 1.025
0.697HisMet: 0.697 ± 0.373
0.93HisAsn: 0.93 ± 0.59
0.93HisPro: 0.93 ± 0.385
0.232HisGln: 0.232 ± 0.136
0.697HisArg: 0.697 ± 0.298
0.93HisSer: 0.93 ± 0.365
0.93HisThr: 0.93 ± 0.321
2.092HisVal: 2.092 ± 0.849
0.465HisTrp: 0.465 ± 0.316
1.394HisTyr: 1.394 ± 0.451
0.0HisXaa: 0.0 ± 0.0
Ile
4.183IleAla: 4.183 ± 1.131
2.324IleCys: 2.324 ± 0.721
6.739IleAsp: 6.739 ± 1.686
5.81IleGlu: 5.81 ± 0.836
3.486IlePhe: 3.486 ± 1.08
3.254IleGly: 3.254 ± 1.094
3.254IleHis: 3.254 ± 0.738
6.042IleIle: 6.042 ± 1.424
7.204IleLys: 7.204 ± 1.119
7.901IleLeu: 7.901 ± 1.579
2.324IleMet: 2.324 ± 1.06
6.972IleAsn: 6.972 ± 1.844
3.254IlePro: 3.254 ± 1.356
3.718IleGln: 3.718 ± 1.783
2.789IleArg: 2.789 ± 0.597
9.761IleSer: 9.761 ± 1.532
5.578IleThr: 5.578 ± 1.54
5.113IleVal: 5.113 ± 0.934
0.465IleTrp: 0.465 ± 0.489
3.951IleTyr: 3.951 ± 1.372
0.0IleXaa: 0.0 ± 0.0
Lys
2.092LysAla: 2.092 ± 1.33
0.465LysCys: 0.465 ± 0.352
6.042LysAsp: 6.042 ± 0.509
3.718LysGlu: 3.718 ± 1.596
3.486LysPhe: 3.486 ± 0.913
2.324LysGly: 2.324 ± 0.741
2.789LysHis: 2.789 ± 0.751
7.437LysIle: 7.437 ± 0.855
9.761LysLys: 9.761 ± 2.567
8.599LysLeu: 8.599 ± 1.521
0.697LysMet: 0.697 ± 0.544
5.345LysAsn: 5.345 ± 1.369
2.324LysPro: 2.324 ± 0.523
3.021LysGln: 3.021 ± 0.832
3.254LysArg: 3.254 ± 1.217
5.113LysSer: 5.113 ± 0.688
5.578LysThr: 5.578 ± 1.031
6.275LysVal: 6.275 ± 1.429
0.93LysTrp: 0.93 ± 0.545
5.113LysTyr: 5.113 ± 0.466
0.0LysXaa: 0.0 ± 0.0
Leu
3.951LeuAla: 3.951 ± 1.009
2.789LeuCys: 2.789 ± 1.097
4.183LeuAsp: 4.183 ± 1.325
5.345LeuGlu: 5.345 ± 0.838
3.021LeuPhe: 3.021 ± 1.173
2.789LeuGly: 2.789 ± 1.096
2.324LeuHis: 2.324 ± 0.696
10.69LeuIle: 10.69 ± 2.452
7.204LeuLys: 7.204 ± 0.905
9.528LeuLeu: 9.528 ± 1.724
1.859LeuMet: 1.859 ± 0.479
6.042LeuAsn: 6.042 ± 1.227
4.416LeuPro: 4.416 ± 1.251
4.183LeuGln: 4.183 ± 1.062
3.486LeuArg: 3.486 ± 0.456
6.042LeuSer: 6.042 ± 1.087
3.951LeuThr: 3.951 ± 0.44
3.951LeuVal: 3.951 ± 1.009
0.465LeuTrp: 0.465 ± 0.754
4.183LeuTyr: 4.183 ± 0.536
0.0LeuXaa: 0.0 ± 0.0
Met
2.556MetAla: 2.556 ± 0.532
0.697MetCys: 0.697 ± 0.298
1.162MetAsp: 1.162 ± 0.281
1.394MetGlu: 1.394 ± 0.779
1.162MetPhe: 1.162 ± 1.143
0.93MetGly: 0.93 ± 0.385
0.0MetHis: 0.0 ± 0.0
2.092MetIle: 2.092 ± 0.497
3.021MetLys: 3.021 ± 0.919
1.859MetLeu: 1.859 ± 0.857
0.697MetMet: 0.697 ± 0.962
1.162MetAsn: 1.162 ± 0.45
0.697MetPro: 0.697 ± 0.55
0.697MetGln: 0.697 ± 0.34
1.394MetArg: 1.394 ± 1.281
3.021MetSer: 3.021 ± 0.99
2.092MetThr: 2.092 ± 0.638
1.162MetVal: 1.162 ± 0.681
0.0MetTrp: 0.0 ± 0.0
0.93MetTyr: 0.93 ± 0.301
0.0MetXaa: 0.0 ± 0.0
Asn
2.556AsnAla: 2.556 ± 0.812
1.162AsnCys: 1.162 ± 0.681
3.951AsnAsp: 3.951 ± 0.535
4.416AsnGlu: 4.416 ± 0.769
1.627AsnPhe: 1.627 ± 0.747
1.394AsnGly: 1.394 ± 0.859
1.394AsnHis: 1.394 ± 0.385
9.296AsnIle: 9.296 ± 2.346
6.042AsnLys: 6.042 ± 0.928
5.113AsnLeu: 5.113 ± 0.714
1.627AsnMet: 1.627 ± 0.417
3.718AsnAsn: 3.718 ± 0.919
2.092AsnPro: 2.092 ± 0.68
1.627AsnGln: 1.627 ± 0.583
2.324AsnArg: 2.324 ± 0.717
6.275AsnSer: 6.275 ± 1.351
3.486AsnThr: 3.486 ± 0.711
4.648AsnVal: 4.648 ± 1.787
0.465AsnTrp: 0.465 ± 0.295
4.416AsnTyr: 4.416 ± 1.802
0.0AsnXaa: 0.0 ± 0.0
Pro
1.394ProAla: 1.394 ± 0.776
0.465ProCys: 0.465 ± 0.233
2.556ProAsp: 2.556 ± 1.0
3.021ProGlu: 3.021 ± 1.172
0.697ProPhe: 0.697 ± 0.612
1.162ProGly: 1.162 ± 0.674
0.465ProHis: 0.465 ± 0.403
3.254ProIle: 3.254 ± 0.391
1.627ProLys: 1.627 ± 0.442
2.324ProLeu: 2.324 ± 0.846
0.697ProMet: 0.697 ± 0.482
2.092ProAsn: 2.092 ± 0.6
0.465ProPro: 0.465 ± 0.272
0.232ProGln: 0.232 ± 0.136
1.394ProArg: 1.394 ± 0.676
2.556ProSer: 2.556 ± 0.588
2.556ProThr: 2.556 ± 1.263
1.627ProVal: 1.627 ± 1.024
0.0ProTrp: 0.0 ± 0.0
2.092ProTyr: 2.092 ± 0.384
0.0ProXaa: 0.0 ± 0.0
Gln
1.394GlnAla: 1.394 ± 0.831
0.232GlnCys: 0.232 ± 0.136
2.092GlnAsp: 2.092 ± 1.073
1.394GlnGlu: 1.394 ± 0.57
0.93GlnPhe: 0.93 ± 0.528
1.162GlnGly: 1.162 ± 0.344
0.697GlnHis: 0.697 ± 0.574
1.859GlnIle: 1.859 ± 0.564
3.254GlnLys: 3.254 ± 0.875
1.859GlnLeu: 1.859 ± 0.568
1.162GlnMet: 1.162 ± 0.887
2.092GlnAsn: 2.092 ± 0.58
0.465GlnPro: 0.465 ± 0.233
0.465GlnGln: 0.465 ± 0.678
1.627GlnArg: 1.627 ± 0.953
1.859GlnSer: 1.859 ± 0.37
2.324GlnThr: 2.324 ± 0.408
1.162GlnVal: 1.162 ± 0.36
0.232GlnTrp: 0.232 ± 0.295
0.93GlnTyr: 0.93 ± 0.525
0.0GlnXaa: 0.0 ± 0.0
Arg
0.93ArgAla: 0.93 ± 0.498
0.465ArgCys: 0.465 ± 0.233
3.254ArgAsp: 3.254 ± 0.917
2.789ArgGlu: 2.789 ± 1.1
2.324ArgPhe: 2.324 ± 0.758
0.93ArgGly: 0.93 ± 0.545
0.697ArgHis: 0.697 ± 0.408
2.556ArgIle: 2.556 ± 0.532
1.859ArgLys: 1.859 ± 1.308
4.183ArgLeu: 4.183 ± 1.176
0.465ArgMet: 0.465 ± 0.489
1.859ArgAsn: 1.859 ± 1.008
0.93ArgPro: 0.93 ± 0.301
0.93ArgGln: 0.93 ± 0.355
0.465ArgArg: 0.465 ± 0.272
2.092ArgSer: 2.092 ± 0.81
1.627ArgThr: 1.627 ± 0.494
1.627ArgVal: 1.627 ± 0.485
0.232ArgTrp: 0.232 ± 0.377
4.648ArgTyr: 4.648 ± 1.631
0.0ArgXaa: 0.0 ± 0.0
Ser
4.648SerAla: 4.648 ± 2.121
1.394SerCys: 1.394 ± 0.762
4.648SerAsp: 4.648 ± 1.274
4.416SerGlu: 4.416 ± 1.14
3.021SerPhe: 3.021 ± 1.386
2.324SerGly: 2.324 ± 0.628
2.092SerHis: 2.092 ± 0.691
5.578SerIle: 5.578 ± 1.11
6.042SerLys: 6.042 ± 2.602
7.437SerLeu: 7.437 ± 1.858
2.789SerMet: 2.789 ± 0.838
6.507SerAsn: 6.507 ± 1.512
1.859SerPro: 1.859 ± 0.461
2.556SerGln: 2.556 ± 1.292
2.092SerArg: 2.092 ± 1.225
6.275SerSer: 6.275 ± 1.946
5.578SerThr: 5.578 ± 0.901
3.951SerVal: 3.951 ± 0.858
0.232SerTrp: 0.232 ± 0.377
3.021SerTyr: 3.021 ± 0.488
0.0SerXaa: 0.0 ± 0.0
Thr
1.627ThrAla: 1.627 ± 0.6
1.394ThrCys: 1.394 ± 0.996
5.113ThrAsp: 5.113 ± 1.024
3.254ThrGlu: 3.254 ± 0.921
2.324ThrPhe: 2.324 ± 0.408
2.556ThrGly: 2.556 ± 0.899
0.93ThrHis: 0.93 ± 0.654
7.669ThrIle: 7.669 ± 1.348
4.416ThrLys: 4.416 ± 0.501
5.81ThrLeu: 5.81 ± 0.32
1.162ThrMet: 1.162 ± 0.344
2.556ThrAsn: 2.556 ± 0.657
2.324ThrPro: 2.324 ± 0.294
0.697ThrGln: 0.697 ± 0.274
1.859ThrArg: 1.859 ± 0.71
6.275ThrSer: 6.275 ± 1.515
2.092ThrThr: 2.092 ± 1.241
3.718ThrVal: 3.718 ± 0.665
0.232ThrTrp: 0.232 ± 0.377
4.648ThrTyr: 4.648 ± 2.243
0.0ThrXaa: 0.0 ± 0.0
Val
2.324ValAla: 2.324 ± 1.536
1.162ValCys: 1.162 ± 0.636
4.88ValAsp: 4.88 ± 0.993
3.718ValGlu: 3.718 ± 0.604
3.021ValPhe: 3.021 ± 0.988
1.394ValGly: 1.394 ± 0.787
1.394ValHis: 1.394 ± 0.503
5.345ValIle: 5.345 ± 1.509
4.88ValLys: 4.88 ± 1.596
3.021ValLeu: 3.021 ± 0.945
1.394ValMet: 1.394 ± 0.632
2.556ValAsn: 2.556 ± 0.26
1.394ValPro: 1.394 ± 0.425
1.394ValGln: 1.394 ± 1.098
2.556ValArg: 2.556 ± 0.365
6.275ValSer: 6.275 ± 2.104
3.486ValThr: 3.486 ± 1.292
2.556ValVal: 2.556 ± 1.803
0.93ValTrp: 0.93 ± 0.625
2.324ValTyr: 2.324 ± 0.519
0.0ValXaa: 0.0 ± 0.0
Trp
0.465TrpAla: 0.465 ± 0.403
0.232TrpCys: 0.232 ± 0.136
0.465TrpAsp: 0.465 ± 0.413
0.232TrpGlu: 0.232 ± 0.295
0.232TrpPhe: 0.232 ± 0.136
0.232TrpGly: 0.232 ± 0.377
0.0TrpHis: 0.0 ± 0.0
0.232TrpIle: 0.232 ± 0.136
0.93TrpLys: 0.93 ± 1.014
0.93TrpLeu: 0.93 ± 0.301
0.465TrpMet: 0.465 ± 0.631
0.465TrpAsn: 0.465 ± 0.316
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.232TrpSer: 0.232 ± 0.136
0.232TrpThr: 0.232 ± 0.377
0.232TrpVal: 0.232 ± 0.266
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.394TyrAla: 1.394 ± 0.817
0.93TyrCys: 0.93 ± 0.365
3.021TyrAsp: 3.021 ± 1.02
3.021TyrGlu: 3.021 ± 0.504
1.859TyrPhe: 1.859 ± 0.831
2.789TyrGly: 2.789 ± 1.096
1.627TyrHis: 1.627 ± 0.899
5.345TyrIle: 5.345 ± 1.494
6.275TyrLys: 6.275 ± 2.224
4.88TyrLeu: 4.88 ± 1.215
1.859TyrMet: 1.859 ± 0.66
5.113TyrAsn: 5.113 ± 0.861
1.627TyrPro: 1.627 ± 0.556
0.697TyrGln: 0.697 ± 0.489
2.092TyrArg: 2.092 ± 0.742
3.486TyrSer: 3.486 ± 1.451
4.648TyrThr: 4.648 ± 1.671
3.718TyrVal: 3.718 ± 1.016
0.232TyrTrp: 0.232 ± 0.402
4.416TyrTyr: 4.416 ± 1.106
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (4304 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski