Amino acid dipepetide frequency for Watermelon chlorotic stunt virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.108AlaAla: 3.108 ± 0.945
1.865AlaCys: 1.865 ± 0.701
0.622AlaAsp: 0.622 ± 0.738
3.729AlaGlu: 3.729 ± 1.599
1.865AlaPhe: 1.865 ± 0.828
1.243AlaGly: 1.243 ± 0.658
0.622AlaHis: 0.622 ± 0.675
0.622AlaIle: 0.622 ± 0.626
3.729AlaLys: 3.729 ± 1.158
3.729AlaLeu: 3.729 ± 1.115
1.243AlaMet: 1.243 ± 0.65
2.486AlaAsn: 2.486 ± 0.991
3.108AlaPro: 3.108 ± 1.176
2.486AlaGln: 2.486 ± 1.077
3.108AlaArg: 3.108 ± 1.236
3.729AlaSer: 3.729 ± 1.195
5.594AlaThr: 5.594 ± 2.418
3.729AlaVal: 3.729 ± 1.907
1.243AlaTrp: 1.243 ± 0.822
1.243AlaTyr: 1.243 ± 0.658
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
1.243CysCys: 1.243 ± 1.35
0.622CysAsp: 0.622 ± 0.626
1.243CysGlu: 1.243 ± 0.717
0.622CysPhe: 0.622 ± 0.485
1.865CysGly: 1.865 ± 1.511
0.0CysHis: 0.0 ± 0.0
0.622CysIle: 0.622 ± 0.558
0.622CysLys: 0.622 ± 0.551
1.243CysLeu: 1.243 ± 0.691
1.243CysMet: 1.243 ± 0.8
1.865CysAsn: 1.865 ± 0.701
3.108CysPro: 3.108 ± 1.807
0.622CysGln: 0.622 ± 0.78
1.243CysArg: 1.243 ± 0.822
3.108CysSer: 3.108 ± 1.869
0.0CysThr: 0.0 ± 0.0
1.865CysVal: 1.865 ± 1.116
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.865AspAla: 1.865 ± 1.013
0.622AspCys: 0.622 ± 0.558
3.729AspAsp: 3.729 ± 0.729
1.243AspGlu: 1.243 ± 0.717
3.729AspPhe: 3.729 ± 1.351
1.865AspGly: 1.865 ± 1.056
1.865AspHis: 1.865 ± 1.12
3.108AspIle: 3.108 ± 0.603
1.865AspLys: 1.865 ± 0.789
4.351AspLeu: 4.351 ± 1.145
0.622AspMet: 0.622 ± 0.738
2.486AspAsn: 2.486 ± 0.771
1.865AspPro: 1.865 ± 0.683
1.865AspGln: 1.865 ± 0.865
3.108AspArg: 3.108 ± 1.357
5.594AspSer: 5.594 ± 1.199
1.243AspThr: 1.243 ± 1.116
6.215AspVal: 6.215 ± 1.801
1.865AspTrp: 1.865 ± 1.079
1.865AspTyr: 1.865 ± 0.85
0.0AspXaa: 0.0 ± 0.0
Glu
2.486GluAla: 2.486 ± 0.729
0.622GluCys: 0.622 ± 0.626
3.108GluAsp: 3.108 ± 0.967
3.108GluGlu: 3.108 ± 1.904
1.865GluPhe: 1.865 ± 1.455
4.351GluGly: 4.351 ± 1.06
1.243GluHis: 1.243 ± 0.822
1.243GluIle: 1.243 ± 1.116
3.108GluLys: 3.108 ± 1.41
4.972GluLeu: 4.972 ± 1.796
0.0GluMet: 0.0 ± 0.0
4.351GluAsn: 4.351 ± 1.563
4.351GluPro: 4.351 ± 1.43
2.486GluGln: 2.486 ± 1.435
1.243GluArg: 1.243 ± 0.923
3.729GluSer: 3.729 ± 1.78
1.243GluThr: 1.243 ± 1.116
0.622GluVal: 0.622 ± 0.626
1.865GluTrp: 1.865 ± 1.079
2.486GluTyr: 2.486 ± 1.127
0.0GluXaa: 0.0 ± 0.0
Phe
1.865PheAla: 1.865 ± 0.701
0.622PheCys: 0.622 ± 0.551
2.486PheAsp: 2.486 ± 1.175
1.243PheGlu: 1.243 ± 0.628
1.865PhePhe: 1.865 ± 1.152
1.243PheGly: 1.243 ± 0.717
2.486PheHis: 2.486 ± 0.991
0.622PheIle: 0.622 ± 0.626
3.729PheLys: 3.729 ± 1.762
6.215PheLeu: 6.215 ± 2.915
0.622PheMet: 0.622 ± 0.485
1.865PheAsn: 1.865 ± 0.893
3.108PhePro: 3.108 ± 1.116
1.243PheGln: 1.243 ± 0.65
3.729PheArg: 3.729 ± 1.419
4.351PheSer: 4.351 ± 1.231
4.351PheThr: 4.351 ± 1.508
3.108PheVal: 3.108 ± 1.954
0.622PheTrp: 0.622 ± 0.558
0.622PheTyr: 0.622 ± 0.551
0.0PheXaa: 0.0 ± 0.0
Gly
1.243GlyAla: 1.243 ± 0.658
1.243GlyCys: 1.243 ± 0.893
3.108GlyAsp: 3.108 ± 0.597
4.972GlyGlu: 4.972 ± 1.206
1.243GlyPhe: 1.243 ± 1.089
3.108GlyGly: 3.108 ± 1.335
1.865GlyHis: 1.865 ± 1.21
4.972GlyIle: 4.972 ± 1.731
3.729GlyLys: 3.729 ± 2.011
2.486GlyLeu: 2.486 ± 1.043
1.865GlyMet: 1.865 ± 1.173
1.865GlyAsn: 1.865 ± 1.44
3.729GlyPro: 3.729 ± 1.448
1.865GlyGln: 1.865 ± 0.748
3.108GlyArg: 3.108 ± 1.108
3.108GlySer: 3.108 ± 1.374
2.486GlyThr: 2.486 ± 1.193
3.729GlyVal: 3.729 ± 1.472
0.0GlyTrp: 0.0 ± 0.0
0.622GlyTyr: 0.622 ± 0.558
0.0GlyXaa: 0.0 ± 0.0
His
0.622HisAla: 0.622 ± 0.551
0.622HisCys: 0.622 ± 0.675
1.243HisAsp: 1.243 ± 1.139
1.243HisGlu: 1.243 ± 1.35
1.865HisPhe: 1.865 ± 0.748
1.243HisGly: 1.243 ± 1.089
0.622HisHis: 0.622 ± 0.738
1.243HisIle: 1.243 ± 0.833
1.243HisLys: 1.243 ± 1.009
1.243HisLeu: 1.243 ± 0.97
0.0HisMet: 0.0 ± 0.0
2.486HisAsn: 2.486 ± 1.283
1.243HisPro: 1.243 ± 0.97
2.486HisGln: 2.486 ± 1.1
3.108HisArg: 3.108 ± 1.421
1.865HisSer: 1.865 ± 0.923
3.108HisThr: 3.108 ± 1.838
4.351HisVal: 4.351 ± 1.41
0.0HisTrp: 0.0 ± 0.0
2.486HisTyr: 2.486 ± 1.012
0.0HisXaa: 0.0 ± 0.0
Ile
1.243IleAla: 1.243 ± 0.923
0.0IleCys: 0.0 ± 0.0
3.729IleAsp: 3.729 ± 1.157
1.865IleGlu: 1.865 ± 1.673
3.108IlePhe: 3.108 ± 1.449
0.622IleGly: 0.622 ± 0.738
1.865IleHis: 1.865 ± 1.121
2.486IleIle: 2.486 ± 1.725
4.972IleLys: 4.972 ± 0.755
5.594IleLeu: 5.594 ± 1.998
1.243IleMet: 1.243 ± 0.796
1.865IleAsn: 1.865 ± 0.828
1.243IlePro: 1.243 ± 0.72
3.108IleGln: 3.108 ± 1.589
4.351IleArg: 4.351 ± 1.62
3.108IleSer: 3.108 ± 1.281
4.972IleThr: 4.972 ± 1.651
3.108IleVal: 3.108 ± 0.875
1.243IleTrp: 1.243 ± 0.842
1.865IleTyr: 1.865 ± 1.103
0.0IleXaa: 0.0 ± 0.0
Lys
3.108LysAla: 3.108 ± 1.546
1.865LysCys: 1.865 ± 0.863
2.486LysAsp: 2.486 ± 1.44
6.215LysGlu: 6.215 ± 2.657
1.865LysPhe: 1.865 ± 0.851
3.108LysGly: 3.108 ± 0.937
1.865LysHis: 1.865 ± 0.851
4.972LysIle: 4.972 ± 1.256
1.865LysLys: 1.865 ± 1.584
2.486LysLeu: 2.486 ± 1.126
0.0LysMet: 0.0 ± 0.0
4.972LysAsn: 4.972 ± 1.446
3.108LysPro: 3.108 ± 1.28
3.108LysGln: 3.108 ± 1.555
4.351LysArg: 4.351 ± 2.126
4.351LysSer: 4.351 ± 1.368
1.865LysThr: 1.865 ± 0.965
1.865LysVal: 1.865 ± 1.103
0.0LysTrp: 0.0 ± 0.0
3.729LysTyr: 3.729 ± 0.897
0.0LysXaa: 0.0 ± 0.0
Leu
2.486LeuAla: 2.486 ± 1.07
1.865LeuCys: 1.865 ± 1.131
5.594LeuAsp: 5.594 ± 1.0
4.351LeuGlu: 4.351 ± 1.162
3.108LeuPhe: 3.108 ± 1.954
4.972LeuGly: 4.972 ± 1.339
4.351LeuHis: 4.351 ± 1.303
4.351LeuIle: 4.351 ± 1.771
4.351LeuLys: 4.351 ± 1.871
4.972LeuLeu: 4.972 ± 1.451
1.865LeuMet: 1.865 ± 1.22
4.351LeuAsn: 4.351 ± 1.537
4.972LeuPro: 4.972 ± 1.505
2.486LeuGln: 2.486 ± 1.077
4.972LeuArg: 4.972 ± 2.388
6.837LeuSer: 6.837 ± 1.853
2.486LeuThr: 2.486 ± 1.317
4.351LeuVal: 4.351 ± 1.648
0.0LeuTrp: 0.0 ± 0.0
1.243LeuTyr: 1.243 ± 1.476
0.0LeuXaa: 0.0 ± 0.0
Met
0.622MetAla: 0.622 ± 0.551
0.622MetCys: 0.622 ± 0.808
1.243MetAsp: 1.243 ± 0.842
1.243MetGlu: 1.243 ± 0.833
1.865MetPhe: 1.865 ± 1.152
1.865MetGly: 1.865 ± 1.119
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
1.243MetLys: 1.243 ± 0.717
1.865MetLeu: 1.865 ± 1.039
0.0MetMet: 0.0 ± 0.0
1.865MetAsn: 1.865 ± 1.877
0.622MetPro: 0.622 ± 0.78
0.622MetGln: 0.622 ± 0.738
2.486MetArg: 2.486 ± 0.63
3.729MetSer: 3.729 ± 1.62
0.0MetThr: 0.0 ± 0.0
0.0MetVal: 0.0 ± 0.0
1.865MetTrp: 1.865 ± 0.85
2.486MetTyr: 2.486 ± 2.206
0.0MetXaa: 0.0 ± 0.0
Asn
3.108AsnAla: 3.108 ± 0.597
0.0AsnCys: 0.0 ± 0.0
2.486AsnAsp: 2.486 ± 0.923
0.622AsnGlu: 0.622 ± 0.551
1.865AsnPhe: 1.865 ± 0.893
3.729AsnGly: 3.729 ± 0.901
3.729AsnHis: 3.729 ± 2.268
2.486AsnIle: 2.486 ± 1.439
2.486AsnLys: 2.486 ± 1.642
4.972AsnLeu: 4.972 ± 2.179
1.865AsnMet: 1.865 ± 1.058
0.622AsnAsn: 0.622 ± 0.738
2.486AsnPro: 2.486 ± 0.836
3.108AsnGln: 3.108 ± 1.285
3.729AsnArg: 3.729 ± 1.046
0.622AsnSer: 0.622 ± 0.78
3.108AsnThr: 3.108 ± 1.518
6.837AsnVal: 6.837 ± 3.264
0.0AsnTrp: 0.0 ± 0.0
1.865AsnTyr: 1.865 ± 1.056
0.0AsnXaa: 0.0 ± 0.0
Pro
0.622ProAla: 0.622 ± 0.626
1.243ProCys: 1.243 ± 0.859
0.622ProAsp: 0.622 ± 0.551
3.108ProGlu: 3.108 ± 1.385
3.729ProPhe: 3.729 ± 1.225
2.486ProGly: 2.486 ± 1.077
2.486ProHis: 2.486 ± 1.297
3.729ProIle: 3.729 ± 1.388
2.486ProLys: 2.486 ± 1.439
4.351ProLeu: 4.351 ± 1.581
2.486ProMet: 2.486 ± 1.284
3.108ProAsn: 3.108 ± 1.619
2.486ProPro: 2.486 ± 1.614
1.865ProGln: 1.865 ± 1.166
4.351ProArg: 4.351 ± 1.822
8.08ProSer: 8.08 ± 1.725
3.729ProThr: 3.729 ± 1.191
3.108ProVal: 3.108 ± 1.898
1.243ProTrp: 1.243 ± 0.658
1.243ProTyr: 1.243 ± 0.717
0.0ProXaa: 0.0 ± 0.0
Gln
4.972GlnAla: 4.972 ± 2.182
1.243GlnCys: 1.243 ± 0.691
2.486GlnAsp: 2.486 ± 1.187
1.243GlnGlu: 1.243 ± 0.842
2.486GlnPhe: 2.486 ± 1.155
0.0GlnGly: 0.0 ± 0.0
0.0GlnHis: 0.0 ± 0.0
1.865GlnIle: 1.865 ± 1.131
1.243GlnLys: 1.243 ± 0.72
0.0GlnLeu: 0.0 ± 0.0
1.865GlnMet: 1.865 ± 0.828
2.486GlnAsn: 2.486 ± 1.497
3.108GlnPro: 3.108 ± 3.268
1.243GlnGln: 1.243 ± 0.628
2.486GlnArg: 2.486 ± 0.784
5.594GlnSer: 5.594 ± 1.243
5.594GlnThr: 5.594 ± 2.274
3.729GlnVal: 3.729 ± 2.154
0.622GlnTrp: 0.622 ± 0.551
0.622GlnTyr: 0.622 ± 0.626
0.0GlnXaa: 0.0 ± 0.0
Arg
3.108ArgAla: 3.108 ± 1.585
3.729ArgCys: 3.729 ± 1.192
5.594ArgAsp: 5.594 ± 1.736
1.243ArgGlu: 1.243 ± 0.72
4.972ArgPhe: 4.972 ± 1.848
3.108ArgGly: 3.108 ± 0.597
3.108ArgHis: 3.108 ± 1.65
3.729ArgIle: 3.729 ± 0.793
3.729ArgLys: 3.729 ± 2.151
4.972ArgLeu: 4.972 ± 2.403
1.865ArgMet: 1.865 ± 1.117
1.243ArgAsn: 1.243 ± 0.691
5.594ArgPro: 5.594 ± 1.488
3.108ArgGln: 3.108 ± 0.673
9.944ArgArg: 9.944 ± 3.514
7.458ArgSer: 7.458 ± 2.053
3.729ArgThr: 3.729 ± 1.526
5.594ArgVal: 5.594 ± 1.577
0.0ArgTrp: 0.0 ± 0.0
2.486ArgTyr: 2.486 ± 2.029
0.0ArgXaa: 0.0 ± 0.0
Ser
5.594SerAla: 5.594 ± 2.766
1.243SerCys: 1.243 ± 1.118
4.351SerAsp: 4.351 ± 1.914
1.865SerGlu: 1.865 ± 1.183
4.972SerPhe: 4.972 ± 1.624
3.729SerGly: 3.729 ± 2.456
2.486SerHis: 2.486 ± 1.314
3.729SerIle: 3.729 ± 1.191
5.594SerLys: 5.594 ± 2.089
3.108SerLeu: 3.108 ± 1.589
1.865SerMet: 1.865 ± 1.106
4.972SerAsn: 4.972 ± 2.349
4.351SerPro: 4.351 ± 2.133
5.594SerGln: 5.594 ± 2.709
7.458SerArg: 7.458 ± 1.682
11.187SerSer: 11.187 ± 3.704
6.837SerThr: 6.837 ± 1.985
4.972SerVal: 4.972 ± 1.851
0.0SerTrp: 0.0 ± 0.0
4.351SerTyr: 4.351 ± 0.984
0.0SerXaa: 0.0 ± 0.0
Thr
3.729ThrAla: 3.729 ± 1.047
0.0ThrCys: 0.0 ± 0.0
2.486ThrAsp: 2.486 ± 1.317
3.108ThrGlu: 3.108 ± 0.854
0.622ThrPhe: 0.622 ± 0.558
4.972ThrGly: 4.972 ± 1.343
1.243ThrHis: 1.243 ± 1.103
6.837ThrIle: 6.837 ± 1.939
2.486ThrLys: 2.486 ± 1.642
4.972ThrLeu: 4.972 ± 1.246
1.865ThrMet: 1.865 ± 1.1
3.108ThrAsn: 3.108 ± 1.084
5.594ThrPro: 5.594 ± 1.779
1.865ThrGln: 1.865 ± 1.091
4.972ThrArg: 4.972 ± 1.944
1.865ThrSer: 1.865 ± 0.707
1.865ThrThr: 1.865 ± 0.867
3.108ThrVal: 3.108 ± 1.539
1.243ThrTrp: 1.243 ± 0.856
3.729ThrTyr: 3.729 ± 1.795
0.0ThrXaa: 0.0 ± 0.0
Val
2.486ValAla: 2.486 ± 0.986
1.865ValCys: 1.865 ± 1.455
3.108ValAsp: 3.108 ± 0.926
3.108ValGlu: 3.108 ± 1.159
2.486ValPhe: 2.486 ± 1.455
3.729ValGly: 3.729 ± 0.868
1.865ValHis: 1.865 ± 0.85
3.108ValIle: 3.108 ± 1.595
4.972ValLys: 4.972 ± 2.035
8.08ValLeu: 8.08 ± 2.255
1.243ValMet: 1.243 ± 1.103
0.622ValAsn: 0.622 ± 0.626
1.865ValPro: 1.865 ± 0.707
3.108ValGln: 3.108 ± 1.399
4.972ValArg: 4.972 ± 1.553
6.837ValSer: 6.837 ± 2.662
4.972ValThr: 4.972 ± 2.334
2.486ValVal: 2.486 ± 1.175
1.243ValTrp: 1.243 ± 1.252
4.972ValTyr: 4.972 ± 2.032
0.0ValXaa: 0.0 ± 0.0
Trp
3.729TrpAla: 3.729 ± 1.266
0.0TrpCys: 0.0 ± 0.0
0.622TrpAsp: 0.622 ± 0.675
0.622TrpGlu: 0.622 ± 0.558
0.0TrpPhe: 0.0 ± 0.0
0.622TrpGly: 0.622 ± 0.485
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
1.243TrpLys: 1.243 ± 0.851
0.0TrpLeu: 0.0 ± 0.0
0.622TrpMet: 0.622 ± 0.551
0.622TrpAsn: 0.622 ± 0.738
0.0TrpPro: 0.0 ± 0.0
0.622TrpGln: 0.622 ± 0.485
1.243TrpArg: 1.243 ± 0.807
1.243TrpSer: 1.243 ± 0.833
0.622TrpThr: 0.622 ± 0.738
0.622TrpVal: 0.622 ± 0.626
0.0TrpTrp: 0.0 ± 0.0
0.622TrpTyr: 0.622 ± 0.485
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.108TyrAla: 3.108 ± 0.875
0.622TyrCys: 0.622 ± 0.78
1.243TyrAsp: 1.243 ± 0.717
3.108TyrGlu: 3.108 ± 1.209
1.865TyrPhe: 1.865 ± 0.893
2.486TyrGly: 2.486 ± 0.841
0.622TyrHis: 0.622 ± 0.485
1.865TyrIle: 1.865 ± 0.893
2.486TyrLys: 2.486 ± 0.946
4.351TyrLeu: 4.351 ± 1.935
1.243TyrMet: 1.243 ± 0.8
2.486TyrAsn: 2.486 ± 1.067
1.243TyrPro: 1.243 ± 0.72
0.0TyrGln: 0.0 ± 0.0
4.351TyrArg: 4.351 ± 1.951
1.865TyrSer: 1.865 ± 0.85
1.865TyrThr: 1.865 ± 1.294
3.729TyrVal: 3.729 ± 1.828
0.0TyrTrp: 0.0 ± 0.0
0.622TyrTyr: 0.622 ± 0.78
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (1610 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski