Amino acid dipeptide frequency for Northern cereal mosaic virus (NCMV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
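As a minimal sketch of such a calculation, the snippet below counts overlapping residue pairs and reports them in per mille. It assumes one-letter sequence strings (the table uses three-letter codes) and counts pairs only within each protein. How the database treats protein boundaries is not stated here, so that choice is an assumption. The function name and the toy sequences are hypothetical.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a window never spans two proteins
        # (an assumption of this sketch, not a documented database rule).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up toy sequences, not NCMV proteins.
toy = ["MKLSV", "MLSA"]
freqs = dipeptide_per_mille(toy)
```

Dipeptides that never occur are simply absent from the returned dictionary; a full 441-cell table would fill those in with 0.0.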

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
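The comparison against the uniform expectation can be sketched as follows. This assumes a plain comparison with 1/400 (2.5‰) and ignores the error estimates; whether the page's colouring uses exactly this rule is an assumption. The `classify` helper is hypothetical, and the sample values are taken from the table below.

```python
# Uniform expectation over the 20 x 20 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / (20 * 20)  # 2.5 per mille, i.e. 0.25 %


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency as over- or under-represented (hypothetical rule)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# A few values copied from the table (per mille).
sample = {"LeuSer": 13.116, "AlaAla": 3.148, "AlaPhe": 2.099, "TrpTrp": 0.0}
labels = {pair: classify(value) for pair, value in sample.items()}
```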

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
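For illustration, the unit conversion amounts to the following; the helper names are hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1 %)."""
    return value * 0.1


leu_ser_fraction = per_mille_to_fraction(13.116)  # LeuSer value from the table
expected_percent = per_mille_to_percent(2.5)      # uniform expectation
```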

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.148AlaAla: 3.148 ± 1.737
0.0AlaCys: 0.0 ± 0.0
4.197AlaAsp: 4.197 ± 1.707
3.148AlaGlu: 3.148 ± 1.392
2.099AlaPhe: 2.099 ± 1.007
6.296AlaGly: 6.296 ± 2.862
0.0AlaHis: 0.0 ± 0.0
2.623AlaIle: 2.623 ± 1.059
4.197AlaLys: 4.197 ± 1.301
5.247AlaLeu: 5.247 ± 1.176
0.525AlaMet: 0.525 ± 0.42
3.148AlaAsn: 3.148 ± 0.944
0.525AlaPro: 0.525 ± 0.434
1.049AlaGln: 1.049 ± 0.729
1.574AlaArg: 1.574 ± 0.819
6.296AlaSer: 6.296 ± 1.939
3.673AlaThr: 3.673 ± 1.945
6.296AlaVal: 6.296 ± 1.282
0.0AlaTrp: 0.0 ± 0.0
2.099AlaTyr: 2.099 ± 0.861
0.0AlaXaa: 0.0 ± 0.0
Cys
0.525CysAla: 0.525 ± 0.434
1.049CysCys: 1.049 ± 0.917
0.0CysAsp: 0.0 ± 0.0
0.525CysGlu: 0.525 ± 0.42
0.525CysPhe: 0.525 ± 0.42
1.574CysGly: 1.574 ± 0.667
0.525CysHis: 0.525 ± 0.42
2.099CysIle: 2.099 ± 1.303
1.049CysLys: 1.049 ± 0.695
1.049CysLeu: 1.049 ± 0.504
0.0CysMet: 0.0 ± 0.0
1.574CysAsn: 1.574 ± 1.259
3.148CysPro: 3.148 ± 1.345
0.525CysGln: 0.525 ± 0.429
1.574CysArg: 1.574 ± 1.328
1.049CysSer: 1.049 ± 0.839
0.525CysThr: 0.525 ± 0.429
1.049CysVal: 1.049 ± 0.722
0.0CysTrp: 0.0 ± 0.0
0.525CysTyr: 0.525 ± 0.42
0.0CysXaa: 0.0 ± 0.0
Asp
3.673AspAla: 3.673 ± 2.074
0.525AspCys: 0.525 ± 0.42
4.197AspAsp: 4.197 ± 1.441
5.247AspGlu: 5.247 ± 1.524
3.673AspPhe: 3.673 ± 1.536
3.148AspGly: 3.148 ± 1.071
1.574AspHis: 1.574 ± 0.781
3.148AspIle: 3.148 ± 0.75
3.148AspLys: 3.148 ± 0.751
5.771AspLeu: 5.771 ± 2.092
1.049AspMet: 1.049 ± 0.7
2.623AspAsn: 2.623 ± 0.865
3.148AspPro: 3.148 ± 1.258
1.574AspGln: 1.574 ± 1.402
0.525AspArg: 0.525 ± 0.557
3.673AspSer: 3.673 ± 1.19
3.148AspThr: 3.148 ± 1.118
5.771AspVal: 5.771 ± 1.639
1.049AspTrp: 1.049 ± 0.504
0.525AspTyr: 0.525 ± 0.434
0.0AspXaa: 0.0 ± 0.0
Glu
1.049GluAla: 1.049 ± 0.867
1.574GluCys: 1.574 ± 0.761
2.099GluAsp: 2.099 ± 1.188
2.099GluGlu: 2.099 ± 0.877
3.148GluPhe: 3.148 ± 1.096
6.821GluGly: 6.821 ± 2.524
1.574GluHis: 1.574 ± 0.401
4.722GluIle: 4.722 ± 2.184
4.722GluLys: 4.722 ± 1.179
5.247GluLeu: 5.247 ± 2.156
3.148GluMet: 3.148 ± 1.379
1.574GluAsn: 1.574 ± 0.781
2.099GluPro: 2.099 ± 0.815
1.574GluGln: 1.574 ± 0.807
3.673GluArg: 3.673 ± 1.29
5.771GluSer: 5.771 ± 1.739
5.247GluThr: 5.247 ± 1.087
7.345GluVal: 7.345 ± 1.578
1.049GluTrp: 1.049 ± 0.831
0.525GluTyr: 0.525 ± 0.522
0.0GluXaa: 0.0 ± 0.0
Phe
0.525PheAla: 0.525 ± 0.42
0.0PheCys: 0.0 ± 0.0
1.049PheAsp: 1.049 ± 0.831
2.623PheGlu: 2.623 ± 1.006
0.525PhePhe: 0.525 ± 0.429
3.673PheGly: 3.673 ± 1.438
1.574PheHis: 1.574 ± 0.807
2.099PheIle: 2.099 ± 1.55
4.722PheLys: 4.722 ± 0.918
4.722PheLeu: 4.722 ± 1.377
1.049PheMet: 1.049 ± 0.786
1.049PheAsn: 1.049 ± 0.491
1.049PhePro: 1.049 ± 1.044
2.099PheGln: 2.099 ± 1.184
2.623PheArg: 2.623 ± 0.909
7.345PheSer: 7.345 ± 1.733
1.574PheThr: 1.574 ± 0.863
1.049PheVal: 1.049 ± 0.985
0.0PheTrp: 0.0 ± 0.0
1.049PheTyr: 1.049 ± 0.504
0.0PheXaa: 0.0 ± 0.0
Gly
3.148GlyAla: 3.148 ± 0.731
1.574GlyCys: 1.574 ± 0.89
5.771GlyAsp: 5.771 ± 0.906
3.673GlyGlu: 3.673 ± 1.163
2.099GlyPhe: 2.099 ± 0.756
3.148GlyGly: 3.148 ± 1.089
0.525GlyHis: 0.525 ± 0.434
7.87GlyIle: 7.87 ± 2.101
4.722GlyLys: 4.722 ± 1.038
3.148GlyLeu: 3.148 ± 0.673
3.673GlyMet: 3.673 ± 2.054
2.099GlyAsn: 2.099 ± 1.24
0.525GlyPro: 0.525 ± 0.557
2.099GlyGln: 2.099 ± 0.991
3.148GlyArg: 3.148 ± 1.605
5.771GlySer: 5.771 ± 1.477
3.148GlyThr: 3.148 ± 1.035
4.197GlyVal: 4.197 ± 1.695
0.525GlyTrp: 0.525 ± 0.434
3.673GlyTyr: 3.673 ± 0.831
0.0GlyXaa: 0.0 ± 0.0
His
0.525HisAla: 0.525 ± 0.434
0.525HisCys: 0.525 ± 0.42
0.525HisAsp: 0.525 ± 0.434
0.525HisGlu: 0.525 ± 0.42
0.525HisPhe: 0.525 ± 0.522
0.525HisGly: 0.525 ± 0.429
0.525HisHis: 0.525 ± 0.429
0.525HisIle: 0.525 ± 0.429
0.525HisLys: 0.525 ± 0.434
3.673HisLeu: 3.673 ± 0.992
0.525HisMet: 0.525 ± 0.429
1.049HisAsn: 1.049 ± 0.839
0.525HisPro: 0.525 ± 0.42
1.574HisGln: 1.574 ± 0.819
0.0HisArg: 0.0 ± 0.0
0.525HisSer: 0.525 ± 0.42
0.525HisThr: 0.525 ± 0.434
1.574HisVal: 1.574 ± 1.083
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.623IleAla: 2.623 ± 0.674
0.0IleCys: 0.0 ± 0.0
2.099IleAsp: 2.099 ± 1.153
5.247IleGlu: 5.247 ± 1.283
3.673IlePhe: 3.673 ± 1.32
4.722IleGly: 4.722 ± 1.507
0.525IleHis: 0.525 ± 0.42
3.148IleIle: 3.148 ± 1.038
4.722IleLys: 4.722 ± 1.375
3.673IleLeu: 3.673 ± 1.914
2.099IleMet: 2.099 ± 0.819
2.623IleAsn: 2.623 ± 1.15
3.148IlePro: 3.148 ± 1.657
0.525IleGln: 0.525 ± 0.429
3.673IleArg: 3.673 ± 1.537
11.542IleSer: 11.542 ± 2.596
8.395IleThr: 8.395 ± 2.132
5.247IleVal: 5.247 ± 0.758
0.525IleTrp: 0.525 ± 0.42
3.673IleTyr: 3.673 ± 0.766
0.0IleXaa: 0.0 ± 0.0
Lys
5.247LysAla: 5.247 ± 1.153
1.574LysCys: 1.574 ± 0.401
3.673LysAsp: 3.673 ± 1.005
5.771LysGlu: 5.771 ± 1.617
3.673LysPhe: 3.673 ± 1.279
4.197LysGly: 4.197 ± 1.995
1.049LysHis: 1.049 ± 0.839
4.722LysIle: 4.722 ± 1.794
6.821LysLys: 6.821 ± 2.87
6.821LysLeu: 6.821 ± 1.665
2.623LysMet: 2.623 ± 1.089
4.197LysAsn: 4.197 ± 1.569
2.099LysPro: 2.099 ± 0.877
2.623LysGln: 2.623 ± 1.207
3.673LysArg: 3.673 ± 1.697
5.247LysSer: 5.247 ± 1.316
3.148LysThr: 3.148 ± 1.21
5.247LysVal: 5.247 ± 2.01
0.0LysTrp: 0.0 ± 0.0
2.099LysTyr: 2.099 ± 1.233
0.0LysXaa: 0.0 ± 0.0
Leu
2.623LeuAla: 2.623 ± 1.279
0.525LeuCys: 0.525 ± 0.42
6.296LeuAsp: 6.296 ± 2.214
6.296LeuGlu: 6.296 ± 1.258
5.247LeuPhe: 5.247 ± 1.234
4.197LeuGly: 4.197 ± 1.094
1.049LeuHis: 1.049 ± 0.576
7.87LeuIle: 7.87 ± 1.336
5.247LeuLys: 5.247 ± 1.132
5.771LeuLeu: 5.771 ± 2.34
2.099LeuMet: 2.099 ± 0.917
2.099LeuAsn: 2.099 ± 0.932
3.148LeuPro: 3.148 ± 1.131
2.099LeuGln: 2.099 ± 1.33
3.148LeuArg: 3.148 ± 1.414
13.116LeuSer: 13.116 ± 2.907
4.197LeuThr: 4.197 ± 1.546
7.345LeuVal: 7.345 ± 1.33
0.0LeuTrp: 0.0 ± 0.0
0.0LeuTyr: 0.0 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
3.148MetAla: 3.148 ± 1.263
0.0MetCys: 0.0 ± 0.0
1.574MetAsp: 1.574 ± 0.777
0.0MetGlu: 0.0 ± 0.0
0.525MetPhe: 0.525 ± 0.67
2.099MetGly: 2.099 ± 1.172
0.525MetHis: 0.525 ± 0.434
4.197MetIle: 4.197 ± 1.698
3.148MetLys: 3.148 ± 0.8
1.574MetLeu: 1.574 ± 0.665
0.0MetMet: 0.0 ± 0.41
0.525MetAsn: 0.525 ± 0.522
0.525MetPro: 0.525 ± 0.67
1.049MetGln: 1.049 ± 0.504
1.574MetArg: 1.574 ± 0.678
4.197MetSer: 4.197 ± 0.973
1.049MetThr: 1.049 ± 0.794
1.574MetVal: 1.574 ± 0.994
0.525MetTrp: 0.525 ± 0.67
1.049MetTyr: 1.049 ± 0.463
0.0MetXaa: 0.0 ± 0.0
Asn
2.623AsnAla: 2.623 ± 0.809
1.049AsnCys: 1.049 ± 0.504
2.623AsnAsp: 2.623 ± 0.853
5.247AsnGlu: 5.247 ± 1.269
2.099AsnPhe: 2.099 ± 1.007
3.673AsnGly: 3.673 ± 0.94
0.0AsnHis: 0.0 ± 0.0
2.099AsnIle: 2.099 ± 0.861
2.099AsnLys: 2.099 ± 0.678
2.623AsnLeu: 2.623 ± 0.599
1.574AsnMet: 1.574 ± 0.641
1.574AsnAsn: 1.574 ± 1.259
1.574AsnPro: 1.574 ± 1.206
1.574AsnGln: 1.574 ± 0.89
0.0AsnArg: 0.0 ± 0.0
1.049AsnSer: 1.049 ± 0.605
1.049AsnThr: 1.049 ± 0.463
2.099AsnVal: 2.099 ± 0.521
0.0AsnTrp: 0.0 ± 0.0
2.099AsnTyr: 2.099 ± 0.536
0.0AsnXaa: 0.0 ± 0.0
Pro
4.197ProAla: 4.197 ± 1.351
0.525ProCys: 0.525 ± 0.67
1.574ProAsp: 1.574 ± 1.024
4.197ProGlu: 4.197 ± 1.419
0.0ProPhe: 0.0 ± 0.0
0.525ProGly: 0.525 ± 0.522
0.0ProHis: 0.0 ± 0.0
3.148ProIle: 3.148 ± 0.999
3.148ProLys: 3.148 ± 1.378
2.623ProLeu: 2.623 ± 0.711
1.049ProMet: 1.049 ± 0.538
1.574ProAsn: 1.574 ± 0.679
1.574ProPro: 1.574 ± 0.673
1.574ProGln: 1.574 ± 1.301
2.099ProArg: 2.099 ± 2.683
1.049ProSer: 1.049 ± 0.605
3.148ProThr: 3.148 ± 0.887
2.623ProVal: 2.623 ± 1.09
0.525ProTrp: 0.525 ± 0.42
2.623ProTyr: 2.623 ± 0.672
0.0ProXaa: 0.0 ± 0.0
Gln
1.049GlnAla: 1.049 ± 0.867
1.574GlnCys: 1.574 ± 0.763
2.099GlnAsp: 2.099 ± 0.661
3.148GlnGlu: 3.148 ± 1.682
0.525GlnPhe: 0.525 ± 0.42
2.099GlnGly: 2.099 ± 0.815
0.525GlnHis: 0.525 ± 0.522
1.574GlnIle: 1.574 ± 0.761
3.673GlnLys: 3.673 ± 1.177
1.049GlnLeu: 1.049 ± 0.695
1.049GlnMet: 1.049 ± 0.648
0.0GlnAsn: 0.0 ± 0.0
0.0GlnPro: 0.0 ± 0.0
1.049GlnGln: 1.049 ± 0.867
1.049GlnArg: 1.049 ± 0.867
3.673GlnSer: 3.673 ± 1.29
1.574GlnThr: 1.574 ± 0.788
1.574GlnVal: 1.574 ± 0.674
0.525GlnTrp: 0.525 ± 0.434
0.525GlnTyr: 0.525 ± 0.434
0.0GlnXaa: 0.0 ± 0.0
Arg
3.148ArgAla: 3.148 ± 1.17
1.049ArgCys: 1.049 ± 0.695
2.099ArgAsp: 2.099 ± 1.304
3.148ArgGlu: 3.148 ± 1.129
2.623ArgPhe: 2.623 ± 1.09
2.623ArgGly: 2.623 ± 2.058
1.049ArgHis: 1.049 ± 0.792
2.623ArgIle: 2.623 ± 1.197
2.099ArgLys: 2.099 ± 0.926
4.722ArgLeu: 4.722 ± 1.976
1.049ArgMet: 1.049 ± 0.798
1.574ArgAsn: 1.574 ± 0.801
1.574ArgPro: 1.574 ± 0.777
0.525ArgGln: 0.525 ± 0.42
3.148ArgArg: 3.148 ± 1.159
4.197ArgSer: 4.197 ± 1.114
2.099ArgThr: 2.099 ± 0.853
4.197ArgVal: 4.197 ± 1.258
0.0ArgTrp: 0.0 ± 0.0
2.623ArgTyr: 2.623 ± 1.52
0.0ArgXaa: 0.0 ± 0.0
Ser
4.722SerAla: 4.722 ± 0.981
3.673SerCys: 3.673 ± 1.6
5.247SerAsp: 5.247 ± 1.199
6.296SerGlu: 6.296 ± 1.4
4.197SerPhe: 4.197 ± 1.464
4.722SerGly: 4.722 ± 1.5
1.049SerHis: 1.049 ± 0.491
8.395SerIle: 8.395 ± 2.591
5.247SerLys: 5.247 ± 1.565
8.395SerLeu: 8.395 ± 1.745
2.623SerMet: 2.623 ± 0.912
4.722SerAsn: 4.722 ± 2.007
5.771SerPro: 5.771 ± 1.842
2.099SerGln: 2.099 ± 1.007
4.722SerArg: 4.722 ± 1.361
3.673SerSer: 3.673 ± 1.37
5.247SerThr: 5.247 ± 2.448
7.87SerVal: 7.87 ± 2.134
2.623SerTrp: 2.623 ± 1.192
1.049SerTyr: 1.049 ± 0.702
0.0SerXaa: 0.0 ± 0.0
Thr
2.099ThrAla: 2.099 ± 1.184
1.049ThrCys: 1.049 ± 0.839
5.247ThrAsp: 5.247 ± 1.033
3.673ThrGlu: 3.673 ± 1.028
2.099ThrPhe: 2.099 ± 0.677
2.623ThrGly: 2.623 ± 1.181
1.049ThrHis: 1.049 ± 0.504
3.148ThrIle: 3.148 ± 1.073
4.722ThrLys: 4.722 ± 1.636
5.247ThrLeu: 5.247 ± 1.101
1.574ThrMet: 1.574 ± 0.948
1.574ThrAsn: 1.574 ± 1.025
2.099ThrPro: 2.099 ± 1.254
1.574ThrGln: 1.574 ± 0.881
3.673ThrArg: 3.673 ± 1.807
5.771ThrSer: 5.771 ± 1.291
5.247ThrThr: 5.247 ± 1.518
4.197ThrVal: 4.197 ± 1.094
2.099ThrTrp: 2.099 ± 0.795
3.673ThrTyr: 3.673 ± 1.396
0.0ThrXaa: 0.0 ± 0.0
Val
6.296ValAla: 6.296 ± 1.566
2.099ValCys: 2.099 ± 0.853
4.197ValAsp: 4.197 ± 1.599
2.623ValGlu: 2.623 ± 1.792
2.099ValPhe: 2.099 ± 0.894
5.771ValGly: 5.771 ± 1.093
1.574ValHis: 1.574 ± 0.841
4.197ValIle: 4.197 ± 0.938
5.771ValLys: 5.771 ± 1.404
4.722ValLeu: 4.722 ± 2.116
2.099ValMet: 2.099 ± 1.216
2.099ValAsn: 2.099 ± 0.946
4.722ValPro: 4.722 ± 1.318
1.574ValGln: 1.574 ± 0.898
5.247ValArg: 5.247 ± 1.16
7.87ValSer: 7.87 ± 1.922
5.771ValThr: 5.771 ± 1.224
6.296ValVal: 6.296 ± 2.144
2.623ValTrp: 2.623 ± 1.625
1.574ValTyr: 1.574 ± 0.931
0.0ValXaa: 0.0 ± 0.0
Trp
2.099TrpAla: 2.099 ± 0.901
0.0TrpCys: 0.0 ± 0.0
1.574TrpAsp: 1.574 ± 0.948
0.525TrpGlu: 0.525 ± 0.42
0.0TrpPhe: 0.0 ± 0.0
0.525TrpGly: 0.525 ± 0.67
0.0TrpHis: 0.0 ± 0.0
1.049TrpIle: 1.049 ± 0.839
1.574TrpLys: 1.574 ± 0.896
1.574TrpLeu: 1.574 ± 0.931
0.0TrpMet: 0.0 ± 0.0
0.525TrpAsn: 0.525 ± 0.434
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.525TrpArg: 0.525 ± 0.42
0.0TrpSer: 0.0 ± 0.0
0.525TrpThr: 0.525 ± 0.42
1.049TrpVal: 1.049 ± 0.463
0.0TrpTrp: 0.0 ± 0.0
0.525TrpTyr: 0.525 ± 0.42
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.673TyrAla: 3.673 ± 1.467
0.525TyrCys: 0.525 ± 0.42
1.049TyrAsp: 1.049 ± 0.491
1.049TyrGlu: 1.049 ± 0.463
1.049TyrPhe: 1.049 ± 0.867
1.574TyrGly: 1.574 ± 0.679
0.0TyrHis: 0.0 ± 0.0
2.623TyrIle: 2.623 ± 0.876
3.148TyrLys: 3.148 ± 0.831
4.722TyrLeu: 4.722 ± 1.432
1.049TyrMet: 1.049 ± 0.75
0.525TyrAsn: 0.525 ± 0.429
0.525TyrPro: 0.525 ± 0.42
1.574TyrGln: 1.574 ± 1.127
0.525TyrArg: 0.525 ± 0.522
0.525TyrSer: 0.525 ± 0.434
3.148TyrThr: 3.148 ± 1.21
2.623TyrVal: 2.623 ± 1.134
0.0TyrTrp: 0.0 ± 0.0
0.525TyrTyr: 0.525 ± 0.42
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (1907 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
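A protein-level bootstrap of this kind can be sketched as below. This is a minimal illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the sample standard deviation of the replicates is reported as the error. The database's exact procedure is not described here, and the function names, seed and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(resample, pair))
    return statistics.stdev(estimates)


# Made-up toy sequences, not NCMV proteins.
toy = ["MKLSVLSA", "MLSAKK", "MAAGLS", "MKKV"]
err = bootstrap_error(toy, "LS")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects variation between proteins.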

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski