Amino acid dipepetide frequency for Mojiang virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.446AlaAla: 3.446 ± 0.997
1.149AlaCys: 1.149 ± 0.629
3.61AlaAsp: 3.61 ± 1.228
3.938AlaGlu: 3.938 ± 0.448
1.477AlaPhe: 1.477 ± 0.402
4.431AlaGly: 4.431 ± 1.508
1.149AlaHis: 1.149 ± 0.464
5.251AlaIle: 5.251 ± 1.015
2.954AlaLys: 2.954 ± 0.866
5.415AlaLeu: 5.415 ± 1.357
1.313AlaMet: 1.313 ± 0.78
2.297AlaAsn: 2.297 ± 0.547
1.477AlaPro: 1.477 ± 0.737
2.133AlaGln: 2.133 ± 0.689
2.133AlaArg: 2.133 ± 0.409
3.774AlaSer: 3.774 ± 0.908
3.446AlaThr: 3.446 ± 1.238
3.774AlaVal: 3.774 ± 1.106
0.985AlaTrp: 0.985 ± 0.295
1.805AlaTyr: 1.805 ± 0.858
0.0AlaXaa: 0.0 ± 0.0
Cys
0.82CysAla: 0.82 ± 0.29
0.492CysCys: 0.492 ± 0.322
1.149CysAsp: 1.149 ± 0.425
0.328CysGlu: 0.328 ± 0.287
0.82CysPhe: 0.82 ± 0.315
0.492CysGly: 0.492 ± 0.322
0.164CysHis: 0.164 ± 0.107
0.656CysIle: 0.656 ± 0.311
0.328CysLys: 0.328 ± 0.325
1.313CysLeu: 1.313 ± 0.39
0.328CysMet: 0.328 ± 0.185
1.641CysAsn: 1.641 ± 0.604
1.149CysPro: 1.149 ± 0.421
0.656CysGln: 0.656 ± 0.329
0.164CysArg: 0.164 ± 0.191
1.805CysSer: 1.805 ± 0.327
1.805CysThr: 1.805 ± 0.593
0.492CysVal: 0.492 ± 0.322
0.328CysTrp: 0.328 ± 0.179
0.82CysTyr: 0.82 ± 0.418
0.0CysXaa: 0.0 ± 0.0
Asp
3.282AspAla: 3.282 ± 1.11
0.492AspCys: 0.492 ± 0.236
5.087AspAsp: 5.087 ± 0.964
5.087AspGlu: 5.087 ± 0.72
1.149AspPhe: 1.149 ± 0.619
3.118AspGly: 3.118 ± 0.702
2.297AspHis: 2.297 ± 0.812
5.251AspIle: 5.251 ± 0.69
4.102AspLys: 4.102 ± 0.868
6.564AspLeu: 6.564 ± 0.828
0.82AspMet: 0.82 ± 0.308
2.461AspAsn: 2.461 ± 0.643
2.626AspPro: 2.626 ± 0.496
2.297AspGln: 2.297 ± 1.167
3.938AspArg: 3.938 ± 0.665
6.4AspSer: 6.4 ± 1.392
4.431AspThr: 4.431 ± 0.957
3.938AspVal: 3.938 ± 0.737
0.164AspTrp: 0.164 ± 0.107
1.477AspTyr: 1.477 ± 0.439
0.0AspXaa: 0.0 ± 0.0
Glu
3.446GluAla: 3.446 ± 0.812
1.149GluCys: 1.149 ± 0.476
5.415GluAsp: 5.415 ± 1.407
4.595GluGlu: 4.595 ± 1.332
1.805GluPhe: 1.805 ± 0.526
3.118GluGly: 3.118 ± 0.654
1.149GluHis: 1.149 ± 0.382
4.759GluIle: 4.759 ± 1.13
3.938GluLys: 3.938 ± 1.079
4.266GluLeu: 4.266 ± 0.937
1.969GluMet: 1.969 ± 0.737
3.774GluAsn: 3.774 ± 0.88
3.282GluPro: 3.282 ± 0.953
1.805GluGln: 1.805 ± 0.467
1.477GluArg: 1.477 ± 0.578
5.087GluSer: 5.087 ± 0.834
3.774GluThr: 3.774 ± 0.751
3.118GluVal: 3.118 ± 0.79
1.477GluTrp: 1.477 ± 0.392
2.461GluTyr: 2.461 ± 0.52
0.0GluXaa: 0.0 ± 0.0
Phe
1.313PheAla: 1.313 ± 0.7
0.656PheCys: 0.656 ± 0.43
2.626PheAsp: 2.626 ± 0.4
1.313PheGlu: 1.313 ± 0.389
1.313PhePhe: 1.313 ± 0.576
0.985PheGly: 0.985 ± 0.373
1.149PheHis: 1.149 ± 0.393
1.477PheIle: 1.477 ± 0.387
2.133PheLys: 2.133 ± 0.73
3.118PheLeu: 3.118 ± 0.891
0.656PheMet: 0.656 ± 0.274
2.79PheAsn: 2.79 ± 0.96
1.149PhePro: 1.149 ± 0.588
1.477PheGln: 1.477 ± 0.519
1.477PheArg: 1.477 ± 0.319
1.641PheSer: 1.641 ± 0.521
0.985PheThr: 0.985 ± 0.286
1.969PheVal: 1.969 ± 0.637
0.656PheTrp: 0.656 ± 0.311
0.82PheTyr: 0.82 ± 0.308
0.0PheXaa: 0.0 ± 0.0
Gly
3.118GlyAla: 3.118 ± 0.973
0.985GlyCys: 0.985 ± 0.433
3.774GlyAsp: 3.774 ± 0.927
3.282GlyGlu: 3.282 ± 0.919
1.313GlyPhe: 1.313 ± 0.436
3.282GlyGly: 3.282 ± 1.143
1.149GlyHis: 1.149 ± 0.314
5.251GlyIle: 5.251 ± 0.314
3.938GlyLys: 3.938 ± 0.604
6.564GlyLeu: 6.564 ± 0.957
1.313GlyMet: 1.313 ± 0.303
3.282GlyAsn: 3.282 ± 0.885
2.79GlyPro: 2.79 ± 0.578
1.313GlyGln: 1.313 ± 0.669
3.774GlyArg: 3.774 ± 0.641
4.102GlySer: 4.102 ± 0.511
1.149GlyThr: 1.149 ± 0.647
2.954GlyVal: 2.954 ± 0.954
0.492GlyTrp: 0.492 ± 0.27
2.461GlyTyr: 2.461 ± 0.621
0.0GlyXaa: 0.0 ± 0.0
His
1.149HisAla: 1.149 ± 0.256
0.492HisCys: 0.492 ± 0.242
0.492HisAsp: 0.492 ± 0.319
1.313HisGlu: 1.313 ± 0.392
0.492HisPhe: 0.492 ± 0.236
0.656HisGly: 0.656 ± 0.263
0.164HisHis: 0.164 ± 0.17
1.313HisIle: 1.313 ± 0.57
1.477HisLys: 1.477 ± 0.444
1.805HisLeu: 1.805 ± 0.615
0.656HisMet: 0.656 ± 0.233
2.297HisAsn: 2.297 ± 0.722
0.82HisPro: 0.82 ± 0.537
0.82HisGln: 0.82 ± 0.394
0.656HisArg: 0.656 ± 0.33
1.477HisSer: 1.477 ± 0.267
1.149HisThr: 1.149 ± 0.424
0.82HisVal: 0.82 ± 0.334
0.0HisTrp: 0.0 ± 0.0
0.492HisTyr: 0.492 ± 0.235
0.0HisXaa: 0.0 ± 0.0
Ile
3.446IleAla: 3.446 ± 0.737
1.805IleCys: 1.805 ± 0.55
5.579IleAsp: 5.579 ± 1.189
6.4IleGlu: 6.4 ± 0.878
2.954IlePhe: 2.954 ± 0.622
5.251IleGly: 5.251 ± 0.591
0.656IleHis: 0.656 ± 0.315
6.892IleIle: 6.892 ± 1.42
4.431IleLys: 4.431 ± 0.478
6.564IleLeu: 6.564 ± 0.73
2.133IleMet: 2.133 ± 0.377
4.595IleAsn: 4.595 ± 1.13
4.431IlePro: 4.431 ± 1.163
3.938IleGln: 3.938 ± 0.872
3.61IleArg: 3.61 ± 0.654
6.236IleSer: 6.236 ± 1.692
6.564IleThr: 6.564 ± 1.904
3.446IleVal: 3.446 ± 0.398
0.492IleTrp: 0.492 ± 0.259
2.954IleTyr: 2.954 ± 0.698
0.0IleXaa: 0.0 ± 0.0
Lys
4.266LysAla: 4.266 ± 0.788
0.656LysCys: 0.656 ± 0.249
3.61LysAsp: 3.61 ± 0.676
4.759LysGlu: 4.759 ± 1.096
2.461LysPhe: 2.461 ± 0.85
3.938LysGly: 3.938 ± 1.018
0.492LysHis: 0.492 ± 0.219
5.907LysIle: 5.907 ± 1.318
3.61LysLys: 3.61 ± 0.802
4.431LysLeu: 4.431 ± 0.448
0.492LysMet: 0.492 ± 0.235
3.282LysAsn: 3.282 ± 0.523
5.415LysPro: 5.415 ± 1.371
3.774LysGln: 3.774 ± 0.805
2.626LysArg: 2.626 ± 0.693
7.384LysSer: 7.384 ± 1.284
3.446LysThr: 3.446 ± 0.736
4.266LysVal: 4.266 ± 1.06
0.492LysTrp: 0.492 ± 0.322
2.461LysTyr: 2.461 ± 0.695
0.0LysXaa: 0.0 ± 0.0
Leu
4.266LeuAla: 4.266 ± 0.811
1.149LeuCys: 1.149 ± 0.57
4.759LeuAsp: 4.759 ± 0.738
5.743LeuGlu: 5.743 ± 0.849
2.79LeuPhe: 2.79 ± 0.593
4.923LeuGly: 4.923 ± 0.994
2.461LeuHis: 2.461 ± 0.624
6.236LeuIle: 6.236 ± 1.535
5.743LeuLys: 5.743 ± 0.844
7.384LeuLeu: 7.384 ± 1.041
2.954LeuMet: 2.954 ± 0.599
5.907LeuAsn: 5.907 ± 0.988
2.954LeuPro: 2.954 ± 0.771
3.282LeuGln: 3.282 ± 0.662
4.431LeuArg: 4.431 ± 0.74
8.041LeuSer: 8.041 ± 1.464
7.384LeuThr: 7.384 ± 0.768
7.22LeuVal: 7.22 ± 1.094
0.656LeuTrp: 0.656 ± 0.3
2.133LeuTyr: 2.133 ± 0.776
0.0LeuXaa: 0.0 ± 0.0
Met
1.969MetAla: 1.969 ± 0.557
0.82MetCys: 0.82 ± 0.423
1.805MetAsp: 1.805 ± 0.487
1.149MetGlu: 1.149 ± 0.364
1.149MetPhe: 1.149 ± 0.588
1.313MetGly: 1.313 ± 0.503
0.164MetHis: 0.164 ± 0.107
2.297MetIle: 2.297 ± 0.355
1.641MetLys: 1.641 ± 0.333
1.969MetLeu: 1.969 ± 0.533
0.328MetMet: 0.328 ± 0.171
1.969MetAsn: 1.969 ± 0.697
0.82MetPro: 0.82 ± 0.407
0.492MetGln: 0.492 ± 0.219
1.641MetArg: 1.641 ± 0.517
2.133MetSer: 2.133 ± 0.304
0.985MetThr: 0.985 ± 0.52
1.313MetVal: 1.313 ± 0.401
0.164MetTrp: 0.164 ± 0.107
1.805MetTyr: 1.805 ± 0.583
0.0MetXaa: 0.0 ± 0.0
Asn
2.79AsnAla: 2.79 ± 0.855
0.656AsnCys: 0.656 ± 0.401
2.79AsnAsp: 2.79 ± 0.278
2.297AsnGlu: 2.297 ± 0.626
0.985AsnPhe: 0.985 ± 0.398
3.446AsnGly: 3.446 ± 0.516
0.82AsnHis: 0.82 ± 0.487
6.072AsnIle: 6.072 ± 0.735
4.923AsnLys: 4.923 ± 0.562
5.087AsnLeu: 5.087 ± 0.886
2.626AsnMet: 2.626 ± 0.459
2.954AsnAsn: 2.954 ± 0.834
4.266AsnPro: 4.266 ± 0.679
3.118AsnGln: 3.118 ± 1.01
3.118AsnArg: 3.118 ± 0.77
2.79AsnSer: 2.79 ± 1.344
2.79AsnThr: 2.79 ± 0.854
2.626AsnVal: 2.626 ± 0.579
1.477AsnTrp: 1.477 ± 0.535
1.805AsnTyr: 1.805 ± 0.514
0.0AsnXaa: 0.0 ± 0.0
Pro
3.61ProAla: 3.61 ± 1.082
0.164ProCys: 0.164 ± 0.107
2.133ProAsp: 2.133 ± 0.872
3.774ProGlu: 3.774 ± 0.73
1.477ProPhe: 1.477 ± 0.444
2.297ProGly: 2.297 ± 0.559
0.985ProHis: 0.985 ± 0.498
4.923ProIle: 4.923 ± 0.899
4.759ProLys: 4.759 ± 1.311
3.938ProLeu: 3.938 ± 0.726
0.328ProMet: 0.328 ± 0.185
3.118ProAsn: 3.118 ± 0.824
1.641ProPro: 1.641 ± 0.496
0.985ProGln: 0.985 ± 0.415
2.954ProArg: 2.954 ± 0.552
3.446ProSer: 3.446 ± 0.71
3.282ProThr: 3.282 ± 0.761
1.641ProVal: 1.641 ± 0.557
0.164ProTrp: 0.164 ± 0.234
1.805ProTyr: 1.805 ± 0.876
0.0ProXaa: 0.0 ± 0.0
Gln
2.626GlnAla: 2.626 ± 0.781
0.492GlnCys: 0.492 ± 0.229
2.297GlnAsp: 2.297 ± 0.535
2.954GlnGlu: 2.954 ± 1.081
0.492GlnPhe: 0.492 ± 0.235
2.79GlnGly: 2.79 ± 0.466
0.656GlnHis: 0.656 ± 0.266
1.969GlnIle: 1.969 ± 0.646
3.774GlnLys: 3.774 ± 0.569
3.446GlnLeu: 3.446 ± 0.789
1.641GlnMet: 1.641 ± 0.614
1.969GlnAsn: 1.969 ± 0.401
1.969GlnPro: 1.969 ± 0.659
1.805GlnGln: 1.805 ± 0.559
1.477GlnArg: 1.477 ± 0.24
2.297GlnSer: 2.297 ± 0.528
2.461GlnThr: 2.461 ± 0.758
1.641GlnVal: 1.641 ± 0.594
0.0GlnTrp: 0.0 ± 0.0
0.656GlnTyr: 0.656 ± 0.459
0.0GlnXaa: 0.0 ± 0.0
Arg
2.954ArgAla: 2.954 ± 0.54
0.82ArgCys: 0.82 ± 0.539
2.461ArgAsp: 2.461 ± 0.619
2.133ArgGlu: 2.133 ± 0.535
1.477ArgPhe: 1.477 ± 0.499
1.641ArgGly: 1.641 ± 0.511
0.656ArgHis: 0.656 ± 0.3
4.102ArgIle: 4.102 ± 0.558
3.446ArgLys: 3.446 ± 1.003
6.072ArgLeu: 6.072 ± 1.475
2.133ArgMet: 2.133 ± 0.756
2.297ArgAsn: 2.297 ± 0.576
1.969ArgPro: 1.969 ± 0.485
1.641ArgGln: 1.641 ± 0.405
2.79ArgArg: 2.79 ± 0.705
3.446ArgSer: 3.446 ± 0.596
3.282ArgThr: 3.282 ± 0.407
1.969ArgVal: 1.969 ± 0.979
0.492ArgTrp: 0.492 ± 0.548
2.133ArgTyr: 2.133 ± 0.945
0.0ArgXaa: 0.0 ± 0.0
Ser
4.102SerAla: 4.102 ± 0.776
1.805SerCys: 1.805 ± 0.563
6.4SerAsp: 6.4 ± 1.117
5.251SerGlu: 5.251 ± 0.722
3.118SerPhe: 3.118 ± 0.79
5.743SerGly: 5.743 ± 0.755
1.805SerHis: 1.805 ± 0.635
6.4SerIle: 6.4 ± 1.06
3.938SerLys: 3.938 ± 0.941
7.548SerLeu: 7.548 ± 1.408
1.805SerMet: 1.805 ± 0.617
4.102SerAsn: 4.102 ± 0.498
2.461SerPro: 2.461 ± 0.347
2.133SerGln: 2.133 ± 0.694
4.102SerArg: 4.102 ± 0.645
7.22SerSer: 7.22 ± 0.654
3.61SerThr: 3.61 ± 0.628
5.251SerVal: 5.251 ± 1.112
0.656SerTrp: 0.656 ± 0.252
2.79SerTyr: 2.79 ± 0.523
0.0SerXaa: 0.0 ± 0.0
Thr
4.595ThrAla: 4.595 ± 1.102
0.656ThrCys: 0.656 ± 0.397
3.446ThrAsp: 3.446 ± 0.805
2.79ThrGlu: 2.79 ± 0.407
0.985ThrPhe: 0.985 ± 0.308
2.79ThrGly: 2.79 ± 0.862
0.985ThrHis: 0.985 ± 0.495
5.415ThrIle: 5.415 ± 1.411
3.61ThrLys: 3.61 ± 0.594
5.415ThrLeu: 5.415 ± 0.94
1.149ThrMet: 1.149 ± 0.261
3.774ThrAsn: 3.774 ± 0.748
2.79ThrPro: 2.79 ± 0.219
2.79ThrGln: 2.79 ± 0.662
2.954ThrArg: 2.954 ± 0.337
3.774ThrSer: 3.774 ± 0.449
3.282ThrThr: 3.282 ± 0.649
3.774ThrVal: 3.774 ± 0.566
0.985ThrTrp: 0.985 ± 0.462
2.954ThrTyr: 2.954 ± 0.682
0.0ThrXaa: 0.0 ± 0.0
Val
2.297ValAla: 2.297 ± 0.592
0.328ValCys: 0.328 ± 0.215
4.759ValAsp: 4.759 ± 0.7
2.297ValGlu: 2.297 ± 0.42
1.969ValPhe: 1.969 ± 0.279
3.446ValGly: 3.446 ± 0.677
0.82ValHis: 0.82 ± 0.537
4.923ValIle: 4.923 ± 1.383
4.923ValLys: 4.923 ± 1.152
4.431ValLeu: 4.431 ± 1.047
1.313ValMet: 1.313 ± 0.266
2.297ValAsn: 2.297 ± 0.673
3.61ValPro: 3.61 ± 0.655
1.805ValGln: 1.805 ± 0.421
3.118ValArg: 3.118 ± 0.716
5.415ValSer: 5.415 ± 0.649
2.626ValThr: 2.626 ± 0.311
2.461ValVal: 2.461 ± 0.759
0.328ValTrp: 0.328 ± 0.296
1.969ValTyr: 1.969 ± 0.547
0.0ValXaa: 0.0 ± 0.0
Trp
0.985TrpAla: 0.985 ± 0.307
0.328TrpCys: 0.328 ± 0.256
1.149TrpAsp: 1.149 ± 0.369
0.985TrpGlu: 0.985 ± 0.247
0.328TrpPhe: 0.328 ± 0.215
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.656TrpIle: 0.656 ± 0.368
0.656TrpLys: 0.656 ± 0.43
1.149TrpLeu: 1.149 ± 0.457
0.164TrpMet: 0.164 ± 0.107
0.492TrpAsn: 0.492 ± 0.255
0.164TrpPro: 0.164 ± 0.107
0.328TrpGln: 0.328 ± 0.232
0.328TrpArg: 0.328 ± 0.215
1.313TrpSer: 1.313 ± 0.492
0.164TrpThr: 0.164 ± 0.107
0.328TrpVal: 0.328 ± 0.267
0.164TrpTrp: 0.164 ± 0.107
0.492TrpTyr: 0.492 ± 0.322
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.477TyrAla: 1.477 ± 0.633
0.656TyrCys: 0.656 ± 0.348
1.641TyrAsp: 1.641 ± 0.529
1.641TyrGlu: 1.641 ± 0.31
1.149TyrPhe: 1.149 ± 0.321
2.626TyrGly: 2.626 ± 0.499
0.82TyrHis: 0.82 ± 0.537
2.626TyrIle: 2.626 ± 0.531
3.446TyrLys: 3.446 ± 0.622
3.938TyrLeu: 3.938 ± 0.709
1.641TyrMet: 1.641 ± 0.854
2.133TyrAsn: 2.133 ± 0.635
1.641TyrPro: 1.641 ± 0.545
0.82TyrGln: 0.82 ± 0.436
1.149TyrArg: 1.149 ± 0.248
2.461TyrSer: 2.461 ± 0.614
2.297TyrThr: 2.297 ± 0.691
2.133TyrVal: 2.133 ± 0.315
0.0TyrTrp: 0.0 ± 0.0
1.477TyrTyr: 1.477 ± 0.606
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (6095 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski