Amino acid dipepetide frequency for Indian cassava mosaic virus (ICMV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.388AlaAla: 4.388 ± 0.92
0.549AlaCys: 0.549 ± 0.553
1.646AlaAsp: 1.646 ± 0.627
0.549AlaGlu: 0.549 ± 0.478
2.194AlaPhe: 2.194 ± 0.884
1.097AlaGly: 1.097 ± 0.613
1.097AlaHis: 1.097 ± 0.708
1.646AlaIle: 1.646 ± 1.075
2.743AlaLys: 2.743 ± 1.254
6.583AlaLeu: 6.583 ± 2.159
0.549AlaMet: 0.549 ± 0.539
1.097AlaAsn: 1.097 ± 0.728
1.646AlaPro: 1.646 ± 0.743
2.194AlaGln: 2.194 ± 0.863
2.743AlaArg: 2.743 ± 1.242
3.291AlaSer: 3.291 ± 0.986
4.388AlaThr: 4.388 ± 1.354
4.388AlaVal: 4.388 ± 1.175
1.646AlaTrp: 1.646 ± 1.037
1.646AlaTyr: 1.646 ± 0.652
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.549CysCys: 0.549 ± 0.478
0.549CysAsp: 0.549 ± 0.555
0.549CysGlu: 0.549 ± 0.553
0.0CysPhe: 0.0 ± 0.0
1.646CysGly: 1.646 ± 0.967
0.549CysHis: 0.549 ± 0.667
1.097CysIle: 1.097 ± 0.613
2.743CysLys: 2.743 ± 0.95
0.549CysLeu: 0.549 ± 0.478
0.549CysMet: 0.549 ± 0.684
1.646CysAsn: 1.646 ± 0.949
1.646CysPro: 1.646 ± 1.343
0.549CysGln: 0.549 ± 0.437
1.646CysArg: 1.646 ± 0.652
2.743CysSer: 2.743 ± 1.247
2.194CysThr: 2.194 ± 0.997
1.097CysVal: 1.097 ± 0.613
1.097CysTrp: 1.097 ± 0.746
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.743AspAla: 2.743 ± 1.347
0.0AspCys: 0.0 ± 0.0
2.743AspAsp: 2.743 ± 1.129
2.194AspGlu: 2.194 ± 0.964
2.743AspPhe: 2.743 ± 0.706
4.388AspGly: 4.388 ± 1.078
3.291AspHis: 3.291 ± 1.517
4.388AspIle: 4.388 ± 1.913
2.194AspLys: 2.194 ± 0.786
6.034AspLeu: 6.034 ± 2.459
0.0AspMet: 0.0 ± 0.0
2.194AspAsn: 2.194 ± 0.861
2.194AspPro: 2.194 ± 0.787
2.194AspGln: 2.194 ± 1.128
3.84AspArg: 3.84 ± 1.169
6.034AspSer: 6.034 ± 1.261
2.194AspThr: 2.194 ± 0.887
4.937AspVal: 4.937 ± 1.296
0.549AspTrp: 0.549 ± 0.437
1.097AspTyr: 1.097 ± 0.542
0.0AspXaa: 0.0 ± 0.0
Glu
2.743GluAla: 2.743 ± 1.393
0.549GluCys: 0.549 ± 0.649
1.646GluAsp: 1.646 ± 0.652
2.743GluGlu: 2.743 ± 1.498
1.646GluPhe: 1.646 ± 0.71
3.84GluGly: 3.84 ± 1.416
1.646GluHis: 1.646 ± 0.848
1.097GluIle: 1.097 ± 0.955
1.097GluLys: 1.097 ± 0.542
4.388GluLeu: 4.388 ± 1.284
0.549GluMet: 0.549 ± 0.477
4.937GluAsn: 4.937 ± 1.738
2.743GluPro: 2.743 ± 0.534
1.097GluGln: 1.097 ± 0.742
2.743GluArg: 2.743 ± 1.877
2.743GluSer: 2.743 ± 1.028
0.0GluThr: 0.0 ± 0.0
1.646GluVal: 1.646 ± 0.824
1.097GluTrp: 1.097 ± 0.722
1.097GluTyr: 1.097 ± 0.955
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
1.646PheCys: 1.646 ± 0.704
2.194PheAsp: 2.194 ± 1.231
1.097PheGlu: 1.097 ± 0.615
1.646PhePhe: 1.646 ± 1.085
2.743PheGly: 2.743 ± 1.822
1.646PheHis: 1.646 ± 1.31
2.194PheIle: 2.194 ± 1.07
3.291PheLys: 3.291 ± 1.825
3.291PheLeu: 3.291 ± 1.278
0.0PheMet: 0.0 ± 0.0
1.097PheAsn: 1.097 ± 0.742
2.743PhePro: 2.743 ± 1.091
2.743PheGln: 2.743 ± 1.676
3.291PheArg: 3.291 ± 1.36
1.097PheSer: 1.097 ± 0.752
6.583PheThr: 6.583 ± 2.382
0.549PheVal: 0.549 ± 0.539
0.549PheTrp: 0.549 ± 0.478
1.097PheTyr: 1.097 ± 0.861
0.0PheXaa: 0.0 ± 0.0
Gly
1.097GlyAla: 1.097 ± 0.542
1.646GlyCys: 1.646 ± 1.099
4.937GlyAsp: 4.937 ± 1.821
2.194GlyGlu: 2.194 ± 0.897
1.646GlyPhe: 1.646 ± 1.577
3.84GlyGly: 3.84 ± 1.149
2.194GlyHis: 2.194 ± 0.919
1.646GlyIle: 1.646 ± 0.767
5.485GlyLys: 5.485 ± 1.985
6.583GlyLeu: 6.583 ± 2.589
0.549GlyMet: 0.549 ± 0.482
0.0GlyAsn: 0.0 ± 0.0
5.485GlyPro: 5.485 ± 1.886
1.646GlyGln: 1.646 ± 0.851
2.743GlyArg: 2.743 ± 1.1
6.034GlySer: 6.034 ± 1.268
4.388GlyThr: 4.388 ± 2.033
2.194GlyVal: 2.194 ± 1.176
0.549GlyTrp: 0.549 ± 0.478
0.549GlyTyr: 0.549 ± 0.649
0.0GlyXaa: 0.0 ± 0.0
His
1.646HisAla: 1.646 ± 1.066
1.097HisCys: 1.097 ± 0.693
2.743HisAsp: 2.743 ± 1.049
1.646HisGlu: 1.646 ± 0.887
2.194HisPhe: 2.194 ± 1.019
3.291HisGly: 3.291 ± 1.088
1.097HisHis: 1.097 ± 0.722
3.84HisIle: 3.84 ± 1.172
0.549HisLys: 0.549 ± 0.649
1.646HisLeu: 1.646 ± 1.057
0.549HisMet: 0.549 ± 0.477
3.84HisAsn: 3.84 ± 1.605
2.194HisPro: 2.194 ± 1.724
1.646HisGln: 1.646 ± 0.87
2.194HisArg: 2.194 ± 1.586
0.549HisSer: 0.549 ± 0.667
1.097HisThr: 1.097 ± 1.105
2.743HisVal: 2.743 ± 1.037
0.0HisTrp: 0.0 ± 0.0
2.194HisTyr: 2.194 ± 0.877
0.0HisXaa: 0.0 ± 0.0
Ile
0.549IleAla: 0.549 ± 0.555
2.194IleCys: 2.194 ± 0.817
3.291IleAsp: 3.291 ± 1.456
2.194IleGlu: 2.194 ± 1.392
1.097IlePhe: 1.097 ± 0.873
2.743IleGly: 2.743 ± 1.289
3.291IleHis: 3.291 ± 1.115
3.84IleIle: 3.84 ± 1.516
4.937IleLys: 4.937 ± 1.209
4.388IleLeu: 4.388 ± 0.909
0.549IleMet: 0.549 ± 0.586
4.937IleAsn: 4.937 ± 1.081
2.743IlePro: 2.743 ± 1.069
4.388IleGln: 4.388 ± 1.072
4.937IleArg: 4.937 ± 1.578
6.034IleSer: 6.034 ± 1.504
1.646IleThr: 1.646 ± 1.177
2.194IleVal: 2.194 ± 0.8
2.194IleTrp: 2.194 ± 0.788
2.194IleTyr: 2.194 ± 1.166
0.0IleXaa: 0.0 ± 0.0
Lys
3.291LysAla: 3.291 ± 1.307
0.549LysCys: 0.549 ± 0.437
2.743LysAsp: 2.743 ± 0.974
2.743LysGlu: 2.743 ± 0.909
0.549LysPhe: 0.549 ± 0.437
3.291LysGly: 3.291 ± 1.275
1.097LysHis: 1.097 ± 0.708
4.937LysIle: 4.937 ± 1.393
0.549LysLys: 0.549 ± 0.649
3.291LysLeu: 3.291 ± 2.014
0.0LysMet: 0.0 ± 0.0
4.937LysAsn: 4.937 ± 1.198
3.291LysPro: 3.291 ± 1.484
1.097LysGln: 1.097 ± 0.752
4.937LysArg: 4.937 ± 2.045
4.388LysSer: 4.388 ± 1.331
4.388LysThr: 4.388 ± 1.378
3.291LysVal: 3.291 ± 1.621
0.549LysTrp: 0.549 ± 0.553
3.84LysTyr: 3.84 ± 0.797
0.0LysXaa: 0.0 ± 0.0
Leu
2.743LeuAla: 2.743 ± 1.064
1.097LeuCys: 1.097 ± 0.873
6.034LeuAsp: 6.034 ± 1.762
3.84LeuGlu: 3.84 ± 0.942
2.743LeuPhe: 2.743 ± 1.677
5.485LeuGly: 5.485 ± 0.974
4.388LeuHis: 4.388 ± 1.419
5.485LeuIle: 5.485 ± 1.59
4.388LeuLys: 4.388 ± 1.612
2.194LeuLeu: 2.194 ± 1.103
0.549LeuMet: 0.549 ± 0.553
6.583LeuAsn: 6.583 ± 1.487
1.097LeuPro: 1.097 ± 1.11
1.646LeuGln: 1.646 ± 0.87
7.131LeuArg: 7.131 ± 2.133
5.485LeuSer: 5.485 ± 1.922
5.485LeuThr: 5.485 ± 1.876
4.937LeuVal: 4.937 ± 1.553
0.0LeuTrp: 0.0 ± 0.0
3.291LeuTyr: 3.291 ± 1.182
0.0LeuXaa: 0.0 ± 0.0
Met
0.549MetAla: 0.549 ± 0.553
1.646MetCys: 1.646 ± 0.627
2.743MetAsp: 2.743 ± 1.248
1.097MetGlu: 1.097 ± 0.679
1.097MetPhe: 1.097 ± 1.105
1.097MetGly: 1.097 ± 0.728
1.097MetHis: 1.097 ± 0.689
0.0MetIle: 0.0 ± 0.0
0.549MetLys: 0.549 ± 0.478
2.194MetLeu: 2.194 ± 1.095
0.0MetMet: 0.0 ± 0.0
1.097MetAsn: 1.097 ± 1.077
1.097MetPro: 1.097 ± 0.619
0.549MetGln: 0.549 ± 0.478
2.194MetArg: 2.194 ± 0.877
1.097MetSer: 1.097 ± 0.615
0.0MetThr: 0.0 ± 0.0
0.0MetVal: 0.0 ± 0.0
1.097MetTrp: 1.097 ± 0.678
1.646MetTyr: 1.646 ± 0.714
0.0MetXaa: 0.0 ± 0.0
Asn
5.485AsnAla: 5.485 ± 1.26
2.194AsnCys: 2.194 ± 1.386
2.743AsnAsp: 2.743 ± 1.485
2.194AsnGlu: 2.194 ± 1.031
1.646AsnPhe: 1.646 ± 0.814
2.194AsnGly: 2.194 ± 0.8
2.194AsnHis: 2.194 ± 1.277
3.291AsnIle: 3.291 ± 0.607
0.549AsnLys: 0.549 ± 0.478
4.388AsnLeu: 4.388 ± 1.358
2.194AsnMet: 2.194 ± 1.705
3.84AsnAsn: 3.84 ± 0.881
4.388AsnPro: 4.388 ± 1.329
3.84AsnGln: 3.84 ± 1.099
3.84AsnArg: 3.84 ± 1.06
4.388AsnSer: 4.388 ± 1.283
3.291AsnThr: 3.291 ± 1.17
3.291AsnVal: 3.291 ± 1.165
0.0AsnTrp: 0.0 ± 0.0
3.291AsnTyr: 3.291 ± 0.851
0.0AsnXaa: 0.0 ± 0.0
Pro
1.097ProAla: 1.097 ± 0.797
1.646ProCys: 1.646 ± 0.887
2.743ProAsp: 2.743 ± 1.028
3.291ProGlu: 3.291 ± 1.301
2.194ProPhe: 2.194 ± 1.048
3.84ProGly: 3.84 ± 1.172
1.646ProHis: 1.646 ± 1.31
3.84ProIle: 3.84 ± 1.235
3.291ProLys: 3.291 ± 1.615
5.485ProLeu: 5.485 ± 1.567
1.646ProMet: 1.646 ± 1.031
3.291ProAsn: 3.291 ± 1.165
2.743ProPro: 2.743 ± 1.835
2.194ProGln: 2.194 ± 0.941
4.388ProArg: 4.388 ± 1.461
3.291ProSer: 3.291 ± 1.531
4.388ProThr: 4.388 ± 2.079
3.84ProVal: 3.84 ± 1.362
1.097ProTrp: 1.097 ± 0.955
3.84ProTyr: 3.84 ± 0.905
0.0ProXaa: 0.0 ± 0.0
Gln
1.646GlnAla: 1.646 ± 0.801
1.097GlnCys: 1.097 ± 0.693
2.743GlnAsp: 2.743 ± 0.986
0.549GlnGlu: 0.549 ± 0.553
2.194GlnPhe: 2.194 ± 1.006
2.743GlnGly: 2.743 ± 1.049
1.646GlnHis: 1.646 ± 0.954
2.743GlnIle: 2.743 ± 1.727
0.0GlnLys: 0.0 ± 0.0
2.194GlnLeu: 2.194 ± 1.225
0.549GlnMet: 0.549 ± 0.477
0.549GlnAsn: 0.549 ± 0.437
3.84GlnPro: 3.84 ± 2.151
2.743GlnGln: 2.743 ± 0.877
2.194GlnArg: 2.194 ± 1.015
4.388GlnSer: 4.388 ± 1.277
3.84GlnThr: 3.84 ± 1.415
3.84GlnVal: 3.84 ± 1.197
0.0GlnTrp: 0.0 ± 0.0
0.549GlnTyr: 0.549 ± 0.553
0.0GlnXaa: 0.0 ± 0.0
Arg
4.388ArgAla: 4.388 ± 1.014
1.097ArgCys: 1.097 ± 0.678
5.485ArgAsp: 5.485 ± 0.945
2.194ArgGlu: 2.194 ± 1.019
4.937ArgPhe: 4.937 ± 1.624
3.291ArgGly: 3.291 ± 0.776
2.194ArgHis: 2.194 ± 0.997
3.84ArgIle: 3.84 ± 0.762
3.84ArgLys: 3.84 ± 1.549
4.937ArgLeu: 4.937 ± 1.687
1.097ArgMet: 1.097 ± 0.797
3.291ArgAsn: 3.291 ± 1.188
6.034ArgPro: 6.034 ± 1.368
2.194ArgGln: 2.194 ± 1.095
6.034ArgArg: 6.034 ± 2.957
7.68ArgSer: 7.68 ± 2.743
2.194ArgThr: 2.194 ± 0.916
6.583ArgVal: 6.583 ± 1.543
1.097ArgTrp: 1.097 ± 0.615
2.743ArgTyr: 2.743 ± 1.187
0.0ArgXaa: 0.0 ± 0.0
Ser
2.743SerAla: 2.743 ± 1.667
1.097SerCys: 1.097 ± 0.708
5.485SerAsp: 5.485 ± 1.18
3.291SerGlu: 3.291 ± 1.441
4.388SerPhe: 4.388 ± 1.274
2.743SerGly: 2.743 ± 1.101
0.549SerHis: 0.549 ± 0.478
4.937SerIle: 4.937 ± 0.99
5.485SerLys: 5.485 ± 1.922
4.937SerLeu: 4.937 ± 1.599
2.743SerMet: 2.743 ± 1.078
3.84SerAsn: 3.84 ± 1.009
7.68SerPro: 7.68 ± 1.397
2.194SerGln: 2.194 ± 0.994
6.583SerArg: 6.583 ± 2.125
11.519SerSer: 11.519 ± 2.815
4.937SerThr: 4.937 ± 1.32
4.388SerVal: 4.388 ± 1.73
1.097SerTrp: 1.097 ± 0.659
4.388SerTyr: 4.388 ± 2.231
0.0SerXaa: 0.0 ± 0.0
Thr
3.84ThrAla: 3.84 ± 1.833
1.097ThrCys: 1.097 ± 0.747
1.097ThrAsp: 1.097 ± 0.693
2.194ThrGlu: 2.194 ± 1.021
1.646ThrPhe: 1.646 ± 1.057
3.291ThrGly: 3.291 ± 0.607
3.291ThrHis: 3.291 ± 1.847
3.291ThrIle: 3.291 ± 0.763
3.84ThrLys: 3.84 ± 1.497
2.743ThrLeu: 2.743 ± 1.071
1.646ThrMet: 1.646 ± 0.721
4.388ThrAsn: 4.388 ± 1.231
3.291ThrPro: 3.291 ± 1.395
1.646ThrGln: 1.646 ± 0.854
4.388ThrArg: 4.388 ± 1.247
3.84ThrSer: 3.84 ± 1.346
1.646ThrThr: 1.646 ± 0.954
5.485ThrVal: 5.485 ± 1.778
2.194ThrTrp: 2.194 ± 0.916
2.743ThrTyr: 2.743 ± 1.024
0.0ThrXaa: 0.0 ± 0.0
Val
0.0ValAla: 0.0 ± 0.0
1.097ValCys: 1.097 ± 0.619
2.194ValAsp: 2.194 ± 0.65
3.291ValGlu: 3.291 ± 1.486
2.194ValPhe: 2.194 ± 1.21
2.743ValGly: 2.743 ± 0.975
2.194ValHis: 2.194 ± 1.031
5.485ValIle: 5.485 ± 1.12
4.388ValLys: 4.388 ± 1.162
3.84ValLeu: 3.84 ± 1.023
3.84ValMet: 3.84 ± 1.203
4.388ValAsn: 4.388 ± 1.425
3.291ValPro: 3.291 ± 1.04
4.388ValGln: 4.388 ± 1.287
4.388ValArg: 4.388 ± 1.993
4.388ValSer: 4.388 ± 0.974
2.194ValThr: 2.194 ± 1.583
2.194ValVal: 2.194 ± 1.272
1.646ValTrp: 1.646 ± 0.809
2.194ValTyr: 2.194 ± 0.983
0.0ValXaa: 0.0 ± 0.0
Trp
3.84TrpAla: 3.84 ± 1.936
0.0TrpCys: 0.0 ± 0.0
0.549TrpAsp: 0.549 ± 0.649
0.549TrpGlu: 0.549 ± 0.539
0.549TrpPhe: 0.549 ± 0.437
0.549TrpGly: 0.549 ± 0.437
0.549TrpHis: 0.549 ± 0.553
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
0.549TrpLeu: 0.549 ± 0.477
1.097TrpMet: 1.097 ± 0.742
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.549TrpGln: 0.549 ± 0.437
2.194TrpArg: 2.194 ± 0.79
2.194TrpSer: 2.194 ± 1.157
1.646TrpThr: 1.646 ± 0.803
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
1.646TrpTyr: 1.646 ± 0.851
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.291TyrAla: 3.291 ± 0.821
1.097TyrCys: 1.097 ± 0.81
1.646TyrAsp: 1.646 ± 0.904
1.646TyrGlu: 1.646 ± 0.637
2.194TyrPhe: 2.194 ± 0.816
0.549TyrGly: 0.549 ± 0.437
1.097TyrHis: 1.097 ± 0.824
2.743TyrIle: 2.743 ± 1.258
3.84TyrLys: 3.84 ± 1.763
4.388TyrLeu: 4.388 ± 1.271
1.646TyrMet: 1.646 ± 0.818
3.291TyrAsn: 3.291 ± 0.798
1.646TyrPro: 1.646 ± 0.923
0.549TyrGln: 0.549 ± 0.553
2.743TyrArg: 2.743 ± 1.289
3.84TyrSer: 3.84 ± 1.428
1.097TyrThr: 1.097 ± 0.763
2.743TyrVal: 2.743 ± 1.282
0.0TyrTrp: 0.0 ± 0.0
1.646TyrTyr: 1.646 ± 0.79
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (1824 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski