Amino acid dipepetide frequency for Japanese soil-borne wheat mosaic virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.472AlaAla: 6.472 ± 1.288
0.324AlaCys: 0.324 ± 0.453
3.236AlaAsp: 3.236 ± 2.147
3.883AlaGlu: 3.883 ± 0.522
2.589AlaPhe: 2.589 ± 1.528
1.618AlaGly: 1.618 ± 0.902
0.971AlaHis: 0.971 ± 0.41
3.236AlaIle: 3.236 ± 1.564
3.883AlaLys: 3.883 ± 1.445
8.091AlaLeu: 8.091 ± 3.133
1.294AlaMet: 1.294 ± 0.422
1.294AlaAsn: 1.294 ± 0.679
2.589AlaPro: 2.589 ± 1.112
1.942AlaGln: 1.942 ± 0.964
4.531AlaArg: 4.531 ± 1.05
5.178AlaSer: 5.178 ± 1.8
4.531AlaThr: 4.531 ± 1.158
6.472AlaVal: 6.472 ± 2.056
0.324AlaTrp: 0.324 ± 0.453
1.618AlaTyr: 1.618 ± 0.655
0.0AlaXaa: 0.0 ± 0.0
Cys
0.647CysAla: 0.647 ± 0.431
0.647CysCys: 0.647 ± 0.458
1.942CysAsp: 1.942 ± 0.934
1.942CysGlu: 1.942 ± 1.038
0.971CysPhe: 0.971 ± 0.421
2.589CysGly: 2.589 ± 1.355
0.324CysHis: 0.324 ± 0.229
0.324CysIle: 0.324 ± 0.229
0.971CysLys: 0.971 ± 0.435
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.324CysAsn: 0.324 ± 0.229
0.324CysPro: 0.324 ± 0.229
0.971CysGln: 0.971 ± 0.435
0.971CysArg: 0.971 ± 0.659
0.647CysSer: 0.647 ± 0.698
0.971CysThr: 0.971 ± 0.659
1.618CysVal: 1.618 ± 0.977
0.0CysTrp: 0.0 ± 0.0
0.971CysTyr: 0.971 ± 0.63
0.0CysXaa: 0.0 ± 0.0
Asp
3.236AspAla: 3.236 ± 0.974
1.942AspCys: 1.942 ± 0.429
4.854AspAsp: 4.854 ± 1.409
5.502AspGlu: 5.502 ± 1.604
3.236AspPhe: 3.236 ± 1.265
4.207AspGly: 4.207 ± 0.838
0.647AspHis: 0.647 ± 0.458
1.618AspIle: 1.618 ± 0.508
3.883AspLys: 3.883 ± 1.189
8.738AspLeu: 8.738 ± 2.172
0.647AspMet: 0.647 ± 0.698
0.647AspAsn: 0.647 ± 0.433
1.618AspPro: 1.618 ± 1.375
1.294AspGln: 1.294 ± 1.028
3.883AspArg: 3.883 ± 1.625
5.825AspSer: 5.825 ± 1.065
3.883AspThr: 3.883 ± 1.04
6.149AspVal: 6.149 ± 1.196
0.0AspTrp: 0.0 ± 0.0
1.294AspTyr: 1.294 ± 0.281
0.0AspXaa: 0.0 ± 0.0
Glu
5.502GluAla: 5.502 ± 1.21
0.647GluCys: 0.647 ± 0.6
7.12GluAsp: 7.12 ± 1.997
6.472GluGlu: 6.472 ± 0.892
2.913GluPhe: 2.913 ± 0.939
2.913GluGly: 2.913 ± 2.849
0.971GluHis: 0.971 ± 0.415
3.883GluIle: 3.883 ± 0.899
9.385GluLys: 9.385 ± 1.698
6.149GluLeu: 6.149 ± 1.55
2.265GluMet: 2.265 ± 0.575
3.56GluAsn: 3.56 ± 0.786
0.971GluPro: 0.971 ± 0.421
3.236GluGln: 3.236 ± 1.006
4.854GluArg: 4.854 ± 0.747
6.149GluSer: 6.149 ± 1.313
3.56GluThr: 3.56 ± 0.838
6.472GluVal: 6.472 ± 1.684
1.618GluTrp: 1.618 ± 0.46
1.942GluTyr: 1.942 ± 0.87
0.0GluXaa: 0.0 ± 0.0
Phe
2.913PheAla: 2.913 ± 0.698
1.294PheCys: 1.294 ± 0.596
3.236PheAsp: 3.236 ± 1.017
3.56PheGlu: 3.56 ± 1.1
1.618PhePhe: 1.618 ± 0.696
2.265PheGly: 2.265 ± 1.352
0.324PheHis: 0.324 ± 0.453
1.294PheIle: 1.294 ± 1.447
1.942PheLys: 1.942 ± 0.35
4.531PheLeu: 4.531 ± 0.968
1.942PheMet: 1.942 ± 0.751
2.265PheAsn: 2.265 ± 0.726
1.294PhePro: 1.294 ± 0.581
1.294PheGln: 1.294 ± 0.544
1.294PheArg: 1.294 ± 0.607
3.56PheSer: 3.56 ± 0.615
1.618PheThr: 1.618 ± 0.696
3.883PheVal: 3.883 ± 0.734
0.324PheTrp: 0.324 ± 0.349
2.265PheTyr: 2.265 ± 1.036
0.0PheXaa: 0.0 ± 0.0
Gly
2.589GlyAla: 2.589 ± 0.715
0.971GlyCys: 0.971 ± 0.41
3.236GlyAsp: 3.236 ± 1.077
4.207GlyGlu: 4.207 ± 2.336
2.265GlyPhe: 2.265 ± 0.989
5.502GlyGly: 5.502 ± 1.758
0.971GlyHis: 0.971 ± 0.52
1.294GlyIle: 1.294 ± 0.544
4.531GlyLys: 4.531 ± 2.033
2.589GlyLeu: 2.589 ± 0.838
1.942GlyMet: 1.942 ± 0.94
3.56GlyAsn: 3.56 ± 1.331
1.942GlyPro: 1.942 ± 0.741
1.294GlyGln: 1.294 ± 0.612
3.236GlyArg: 3.236 ± 0.751
5.178GlySer: 5.178 ± 1.139
2.913GlyThr: 2.913 ± 1.78
4.207GlyVal: 4.207 ± 1.661
0.324GlyTrp: 0.324 ± 0.636
2.589GlyTyr: 2.589 ± 1.668
0.0GlyXaa: 0.0 ± 0.0
His
1.618HisAla: 1.618 ± 0.561
0.971HisCys: 0.971 ± 0.687
0.647HisAsp: 0.647 ± 0.458
0.971HisGlu: 0.971 ± 0.415
0.971HisPhe: 0.971 ± 0.41
0.971HisGly: 0.971 ± 0.687
0.324HisHis: 0.324 ± 0.349
0.971HisIle: 0.971 ± 0.435
0.647HisLys: 0.647 ± 0.552
0.971HisLeu: 0.971 ± 0.648
0.0HisMet: 0.0 ± 0.0
0.647HisAsn: 0.647 ± 0.331
0.971HisPro: 0.971 ± 0.871
0.324HisGln: 0.324 ± 0.365
0.647HisArg: 0.647 ± 0.458
1.618HisSer: 1.618 ± 0.81
1.942HisThr: 1.942 ± 1.495
2.265HisVal: 2.265 ± 0.808
0.0HisTrp: 0.0 ± 0.0
0.647HisTyr: 0.647 ± 0.335
0.0HisXaa: 0.0 ± 0.0
Ile
2.589IleAla: 2.589 ± 1.173
0.324IleCys: 0.324 ± 0.229
4.854IleAsp: 4.854 ± 1.284
4.207IleGlu: 4.207 ± 1.591
1.942IlePhe: 1.942 ± 1.441
1.618IleGly: 1.618 ± 0.426
1.618IleHis: 1.618 ± 1.337
2.589IleIle: 2.589 ± 0.501
2.589IleLys: 2.589 ± 0.556
2.589IleLeu: 2.589 ± 0.82
0.0IleMet: 0.0 ± 0.0
1.618IleAsn: 1.618 ± 0.822
1.618IlePro: 1.618 ± 0.752
2.265IleGln: 2.265 ± 0.49
3.56IleArg: 3.56 ± 0.558
5.502IleSer: 5.502 ± 1.334
0.971IleThr: 0.971 ± 0.435
3.883IleVal: 3.883 ± 1.031
0.0IleTrp: 0.0 ± 0.0
1.618IleTyr: 1.618 ± 0.684
0.0IleXaa: 0.0 ± 0.0
Lys
4.854LysAla: 4.854 ± 0.983
1.618LysCys: 1.618 ± 0.928
5.178LysAsp: 5.178 ± 0.67
5.825LysGlu: 5.825 ± 2.082
2.589LysPhe: 2.589 ± 1.688
2.913LysGly: 2.913 ± 0.987
1.294LysHis: 1.294 ± 0.906
4.207LysIle: 4.207 ± 0.758
4.207LysLys: 4.207 ± 1.132
6.796LysLeu: 6.796 ± 2.138
3.56LysMet: 3.56 ± 0.875
3.56LysAsn: 3.56 ± 0.846
2.265LysPro: 2.265 ± 0.895
3.56LysGln: 3.56 ± 1.316
3.883LysArg: 3.883 ± 1.196
4.207LysSer: 4.207 ± 0.874
3.236LysThr: 3.236 ± 1.239
5.178LysVal: 5.178 ± 1.356
1.294LysTrp: 1.294 ± 1.028
3.236LysTyr: 3.236 ± 0.619
0.0LysXaa: 0.0 ± 0.0
Leu
4.207LeuAla: 4.207 ± 1.906
0.324LeuCys: 0.324 ± 0.366
5.502LeuAsp: 5.502 ± 1.201
6.796LeuGlu: 6.796 ± 1.541
3.883LeuPhe: 3.883 ± 1.621
5.825LeuGly: 5.825 ± 2.77
1.294LeuHis: 1.294 ± 0.581
3.883LeuIle: 3.883 ± 0.663
6.149LeuLys: 6.149 ± 1.681
8.738LeuLeu: 8.738 ± 2.376
3.236LeuMet: 3.236 ± 0.797
6.149LeuAsn: 6.149 ± 1.119
2.913LeuPro: 2.913 ± 0.987
4.207LeuGln: 4.207 ± 0.559
4.531LeuArg: 4.531 ± 1.482
5.825LeuSer: 5.825 ± 1.499
6.472LeuThr: 6.472 ± 1.464
4.531LeuVal: 4.531 ± 0.809
1.618LeuTrp: 1.618 ± 0.696
3.236LeuTyr: 3.236 ± 0.819
0.0LeuXaa: 0.0 ± 0.0
Met
3.56MetAla: 3.56 ± 1.768
0.324MetCys: 0.324 ± 0.365
0.971MetAsp: 0.971 ± 0.659
1.942MetGlu: 1.942 ± 1.038
1.618MetPhe: 1.618 ± 0.426
0.324MetGly: 0.324 ± 0.366
0.971MetHis: 0.971 ± 0.443
1.294MetIle: 1.294 ± 0.661
2.265MetLys: 2.265 ± 0.566
2.913MetLeu: 2.913 ± 1.259
0.324MetMet: 0.324 ± 0.365
1.942MetAsn: 1.942 ± 0.598
1.618MetPro: 1.618 ± 0.791
1.618MetGln: 1.618 ± 0.88
1.618MetArg: 1.618 ± 0.655
0.971MetSer: 0.971 ± 0.484
1.942MetThr: 1.942 ± 0.35
0.324MetVal: 0.324 ± 0.366
0.647MetTrp: 0.647 ± 0.458
0.647MetTyr: 0.647 ± 0.698
0.0MetXaa: 0.0 ± 0.0
Asn
3.236AsnAla: 3.236 ± 1.158
1.618AsnCys: 1.618 ± 0.791
1.618AsnAsp: 1.618 ± 0.576
3.56AsnGlu: 3.56 ± 0.868
2.589AsnPhe: 2.589 ± 1.212
3.236AsnGly: 3.236 ± 1.119
0.647AsnHis: 0.647 ± 0.458
1.294AsnIle: 1.294 ± 1.017
3.236AsnLys: 3.236 ± 0.643
3.236AsnLeu: 3.236 ± 1.56
1.294AsnMet: 1.294 ± 1.162
1.942AsnAsn: 1.942 ± 0.701
0.324AsnPro: 0.324 ± 0.229
1.618AsnGln: 1.618 ± 0.795
2.265AsnArg: 2.265 ± 0.566
2.265AsnSer: 2.265 ± 1.036
2.913AsnThr: 2.913 ± 0.774
4.531AsnVal: 4.531 ± 1.358
0.647AsnTrp: 0.647 ± 0.458
1.618AsnTyr: 1.618 ± 0.508
0.0AsnXaa: 0.0 ± 0.0
Pro
0.971ProAla: 0.971 ± 0.933
0.647ProCys: 0.647 ± 0.433
2.265ProAsp: 2.265 ± 1.09
3.236ProGlu: 3.236 ± 0.616
1.618ProPhe: 1.618 ± 0.757
1.942ProGly: 1.942 ± 0.624
0.324ProHis: 0.324 ± 0.366
1.618ProIle: 1.618 ± 0.824
4.207ProLys: 4.207 ± 1.046
2.589ProLeu: 2.589 ± 0.796
0.971ProMet: 0.971 ± 0.41
0.324ProAsn: 0.324 ± 0.229
0.324ProPro: 0.324 ± 0.229
1.294ProGln: 1.294 ± 0.708
1.618ProArg: 1.618 ± 0.981
1.942ProSer: 1.942 ± 1.045
0.971ProThr: 0.971 ± 0.795
1.618ProVal: 1.618 ± 0.791
0.0ProTrp: 0.0 ± 0.0
0.971ProTyr: 0.971 ± 0.443
0.0ProXaa: 0.0 ± 0.0
Gln
1.618GlnAla: 1.618 ± 0.611
0.324GlnCys: 0.324 ± 0.366
1.618GlnAsp: 1.618 ± 0.821
1.618GlnGlu: 1.618 ± 0.738
1.618GlnPhe: 1.618 ± 0.796
1.942GlnGly: 1.942 ± 0.506
1.294GlnHis: 1.294 ± 0.411
2.589GlnIle: 2.589 ± 0.755
3.883GlnLys: 3.883 ± 0.91
3.56GlnLeu: 3.56 ± 0.885
1.294GlnMet: 1.294 ± 0.697
0.647GlnAsn: 0.647 ± 0.433
0.647GlnPro: 0.647 ± 0.458
1.618GlnGln: 1.618 ± 0.538
4.531GlnArg: 4.531 ± 2.443
1.618GlnSer: 1.618 ± 1.626
2.913GlnThr: 2.913 ± 2.414
0.971GlnVal: 0.971 ± 0.63
0.0GlnTrp: 0.0 ± 0.0
0.324GlnTyr: 0.324 ± 0.229
0.0GlnXaa: 0.0 ± 0.0
Arg
4.854ArgAla: 4.854 ± 1.847
0.324ArgCys: 0.324 ± 0.229
3.236ArgAsp: 3.236 ± 0.616
3.236ArgGlu: 3.236 ± 1.375
1.618ArgPhe: 1.618 ± 0.655
2.913ArgGly: 2.913 ± 1.401
1.942ArgHis: 1.942 ± 0.832
2.589ArgIle: 2.589 ± 0.806
5.825ArgLys: 5.825 ± 0.952
4.854ArgLeu: 4.854 ± 1.717
2.589ArgMet: 2.589 ± 0.715
3.883ArgAsn: 3.883 ± 1.573
0.647ArgPro: 0.647 ± 0.458
1.618ArgGln: 1.618 ± 0.867
4.531ArgArg: 4.531 ± 1.558
5.502ArgSer: 5.502 ± 1.538
3.883ArgThr: 3.883 ± 1.736
2.589ArgVal: 2.589 ± 0.501
0.971ArgTrp: 0.971 ± 0.435
1.294ArgTyr: 1.294 ± 0.866
0.0ArgXaa: 0.0 ± 0.0
Ser
5.502SerAla: 5.502 ± 1.687
1.618SerCys: 1.618 ± 0.455
3.883SerAsp: 3.883 ± 1.13
5.825SerGlu: 5.825 ± 1.935
5.178SerPhe: 5.178 ± 0.686
3.883SerGly: 3.883 ± 1.687
0.0SerHis: 0.0 ± 0.0
3.883SerIle: 3.883 ± 1.046
4.854SerLys: 4.854 ± 1.563
6.796SerLeu: 6.796 ± 1.204
1.942SerMet: 1.942 ± 0.63
2.589SerAsn: 2.589 ± 0.512
2.589SerPro: 2.589 ± 1.119
1.294SerGln: 1.294 ± 1.214
2.589SerArg: 2.589 ± 0.705
4.854SerSer: 4.854 ± 1.228
3.56SerThr: 3.56 ± 0.989
7.12SerVal: 7.12 ± 2.467
0.324SerTrp: 0.324 ± 0.229
3.56SerTyr: 3.56 ± 0.753
0.0SerXaa: 0.0 ± 0.0
Thr
3.236ThrAla: 3.236 ± 1.551
0.324ThrCys: 0.324 ± 0.366
2.913ThrAsp: 2.913 ± 1.362
4.207ThrGlu: 4.207 ± 1.882
2.265ThrPhe: 2.265 ± 1.232
2.589ThrGly: 2.589 ± 0.703
1.618ThrHis: 1.618 ± 1.228
2.913ThrIle: 2.913 ± 0.778
4.207ThrLys: 4.207 ± 1.352
3.56ThrLeu: 3.56 ± 1.305
0.971ThrMet: 0.971 ± 0.621
2.265ThrAsn: 2.265 ± 0.972
1.294ThrPro: 1.294 ± 0.828
2.589ThrGln: 2.589 ± 1.272
3.236ThrArg: 3.236 ± 0.996
3.56ThrSer: 3.56 ± 1.364
4.207ThrThr: 4.207 ± 1.334
8.091ThrVal: 8.091 ± 1.39
0.647ThrTrp: 0.647 ± 0.331
1.618ThrTyr: 1.618 ± 0.912
0.0ThrXaa: 0.0 ± 0.0
Val
4.207ValAla: 4.207 ± 1.97
1.618ValCys: 1.618 ± 0.425
4.531ValAsp: 4.531 ± 1.155
11.003ValGlu: 11.003 ± 1.845
1.942ValPhe: 1.942 ± 0.482
4.531ValGly: 4.531 ± 1.254
1.618ValHis: 1.618 ± 0.815
4.207ValIle: 4.207 ± 0.548
4.531ValLys: 4.531 ± 0.521
6.472ValLeu: 6.472 ± 1.853
2.589ValMet: 2.589 ± 1.159
3.56ValAsn: 3.56 ± 0.856
3.883ValPro: 3.883 ± 0.869
0.971ValGln: 0.971 ± 0.687
3.883ValArg: 3.883 ± 0.874
5.178ValSer: 5.178 ± 0.519
1.942ValThr: 1.942 ± 0.568
10.356ValVal: 10.356 ± 1.664
1.294ValTrp: 1.294 ± 0.56
3.883ValTyr: 3.883 ± 0.924
0.0ValXaa: 0.0 ± 0.0
Trp
0.324TrpAla: 0.324 ± 0.229
0.324TrpCys: 0.324 ± 0.229
0.647TrpAsp: 0.647 ± 0.431
1.618TrpGlu: 1.618 ± 0.762
0.0TrpPhe: 0.0 ± 0.0
0.647TrpGly: 0.647 ± 0.304
0.0TrpHis: 0.0 ± 0.0
1.294TrpIle: 1.294 ± 0.424
0.324TrpLys: 0.324 ± 0.229
1.942TrpLeu: 1.942 ± 1.226
0.324TrpMet: 0.324 ± 0.252
0.324TrpAsn: 0.324 ± 0.366
0.0TrpPro: 0.0 ± 0.0
0.647TrpGln: 0.647 ± 0.713
0.324TrpArg: 0.324 ± 0.511
0.324TrpSer: 0.324 ± 0.366
0.647TrpThr: 0.647 ± 0.335
0.0TrpVal: 0.0 ± 0.0
0.324TrpTrp: 0.324 ± 0.366
0.647TrpTyr: 0.647 ± 0.458
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.942TyrAla: 1.942 ± 0.533
0.971TyrCys: 0.971 ± 0.52
1.618TyrAsp: 1.618 ± 0.538
0.971TyrGlu: 0.971 ± 0.822
1.294TyrPhe: 1.294 ± 0.544
2.913TyrGly: 2.913 ± 0.458
0.971TyrHis: 0.971 ± 0.613
0.971TyrIle: 0.971 ± 0.435
2.265TyrLys: 2.265 ± 0.49
4.207TyrLeu: 4.207 ± 1.444
0.324TyrMet: 0.324 ± 0.229
2.589TyrAsn: 2.589 ± 1.094
1.618TyrPro: 1.618 ± 0.46
0.971TyrGln: 0.971 ± 0.659
2.913TyrArg: 2.913 ± 1.386
1.942TyrSer: 1.942 ± 1.006
3.236TyrThr: 3.236 ± 1.023
1.942TyrVal: 1.942 ± 0.803
0.324TyrTrp: 0.324 ± 0.365
0.647TyrTyr: 0.647 ± 0.331
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3091 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski