Amino acid dipepetide frequency for African horse sickness virus 1 (AHSV-1)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.029AlaAla: 6.029 ± 1.747
0.464AlaCys: 0.464 ± 0.247
4.483AlaAsp: 4.483 ± 1.077
3.71AlaGlu: 3.71 ± 0.8
2.164AlaPhe: 2.164 ± 0.697
3.401AlaGly: 3.401 ± 1.084
0.928AlaHis: 0.928 ± 0.419
3.865AlaIle: 3.865 ± 1.112
3.092AlaLys: 3.092 ± 1.072
8.811AlaLeu: 8.811 ± 1.432
2.319AlaMet: 2.319 ± 0.334
3.555AlaAsn: 3.555 ± 0.798
4.019AlaPro: 4.019 ± 1.094
1.237AlaGln: 1.237 ± 0.537
4.792AlaArg: 4.792 ± 0.599
3.555AlaSer: 3.555 ± 0.648
3.865AlaThr: 3.865 ± 1.009
4.792AlaVal: 4.792 ± 0.856
0.928AlaTrp: 0.928 ± 0.379
2.628AlaTyr: 2.628 ± 0.674
0.0AlaXaa: 0.0 ± 0.0
Cys
1.082CysAla: 1.082 ± 0.488
0.309CysCys: 0.309 ± 0.232
1.082CysAsp: 1.082 ± 0.277
0.464CysGlu: 0.464 ± 0.237
0.618CysPhe: 0.618 ± 0.526
1.237CysGly: 1.237 ± 0.663
0.309CysHis: 0.309 ± 0.17
0.773CysIle: 0.773 ± 0.37
0.464CysLys: 0.464 ± 0.25
1.082CysLeu: 1.082 ± 0.288
0.155CysMet: 0.155 ± 0.168
0.309CysAsn: 0.309 ± 0.17
0.309CysPro: 0.309 ± 0.164
0.618CysGln: 0.618 ± 0.321
0.773CysArg: 0.773 ± 0.338
1.082CysSer: 1.082 ± 0.599
0.773CysThr: 0.773 ± 0.354
0.773CysVal: 0.773 ± 0.269
0.309CysTrp: 0.309 ± 0.163
0.618CysTyr: 0.618 ± 0.213
0.0CysXaa: 0.0 ± 0.0
Asp
2.783AspAla: 2.783 ± 0.698
0.928AspCys: 0.928 ± 0.476
3.71AspAsp: 3.71 ± 0.832
4.947AspGlu: 4.947 ± 0.792
2.628AspPhe: 2.628 ± 0.46
6.338AspGly: 6.338 ± 0.905
1.082AspHis: 1.082 ± 0.365
3.71AspIle: 3.71 ± 0.582
2.319AspLys: 2.319 ± 0.473
6.183AspLeu: 6.183 ± 0.956
1.082AspMet: 1.082 ± 0.429
1.082AspAsn: 1.082 ± 0.319
3.246AspPro: 3.246 ± 0.822
1.237AspGln: 1.237 ± 0.31
4.483AspArg: 4.483 ± 0.924
2.628AspSer: 2.628 ± 0.494
2.783AspThr: 2.783 ± 0.385
6.183AspVal: 6.183 ± 0.825
0.928AspTrp: 0.928 ± 0.411
2.783AspTyr: 2.783 ± 0.612
0.0AspXaa: 0.0 ± 0.0
Glu
4.638GluAla: 4.638 ± 0.572
0.618GluCys: 0.618 ± 0.265
4.792GluAsp: 4.792 ± 0.724
5.565GluGlu: 5.565 ± 0.725
3.246GluPhe: 3.246 ± 1.029
3.71GluGly: 3.71 ± 1.192
0.618GluHis: 0.618 ± 0.392
6.029GluIle: 6.029 ± 0.736
5.41GluLys: 5.41 ± 0.84
4.792GluLeu: 4.792 ± 0.364
2.319GluMet: 2.319 ± 0.522
3.246GluAsn: 3.246 ± 0.522
2.01GluPro: 2.01 ± 0.433
1.855GluGln: 1.855 ± 0.519
5.41GluArg: 5.41 ± 0.713
2.783GluSer: 2.783 ± 0.42
3.865GluThr: 3.865 ± 0.683
4.328GluVal: 4.328 ± 0.806
1.082GluTrp: 1.082 ± 0.447
2.783GluTyr: 2.783 ± 0.685
0.0GluXaa: 0.0 ± 0.0
Phe
1.855PheAla: 1.855 ± 0.554
0.464PheCys: 0.464 ± 0.275
2.319PheAsp: 2.319 ± 0.368
2.783PheGlu: 2.783 ± 0.653
1.391PhePhe: 1.391 ± 0.623
4.019PheGly: 4.019 ± 0.626
0.464PheHis: 0.464 ± 0.287
2.319PheIle: 2.319 ± 0.851
2.783PheLys: 2.783 ± 0.965
2.783PheLeu: 2.783 ± 0.517
1.7PheMet: 1.7 ± 0.644
0.928PheAsn: 0.928 ± 0.507
1.082PhePro: 1.082 ± 0.321
0.928PheGln: 0.928 ± 0.237
3.092PheArg: 3.092 ± 0.88
3.71PheSer: 3.71 ± 1.221
1.7PheThr: 1.7 ± 0.437
2.319PheVal: 2.319 ± 0.565
0.0PheTrp: 0.0 ± 0.0
1.7PheTyr: 1.7 ± 0.467
0.0PheXaa: 0.0 ± 0.0
Gly
5.41GlyAla: 5.41 ± 1.687
0.773GlyCys: 0.773 ± 0.233
4.328GlyAsp: 4.328 ± 1.102
4.328GlyGlu: 4.328 ± 0.705
1.7GlyPhe: 1.7 ± 0.577
6.183GlyGly: 6.183 ± 3.025
1.546GlyHis: 1.546 ± 0.417
3.71GlyIle: 3.71 ± 0.536
3.246GlyLys: 3.246 ± 0.966
5.101GlyLeu: 5.101 ± 0.831
2.01GlyMet: 2.01 ± 0.617
2.164GlyAsn: 2.164 ± 0.508
1.7GlyPro: 1.7 ± 0.564
2.01GlyGln: 2.01 ± 0.659
4.019GlyArg: 4.019 ± 0.741
4.328GlySer: 4.328 ± 1.184
2.473GlyThr: 2.473 ± 0.484
4.483GlyVal: 4.483 ± 1.109
1.237GlyTrp: 1.237 ± 0.401
2.01GlyTyr: 2.01 ± 0.838
0.0GlyXaa: 0.0 ± 0.0
His
1.546HisAla: 1.546 ± 0.302
0.309HisCys: 0.309 ± 0.25
0.309HisAsp: 0.309 ± 0.177
1.391HisGlu: 1.391 ± 0.505
0.618HisPhe: 0.618 ± 0.285
1.391HisGly: 1.391 ± 0.375
0.464HisHis: 0.464 ± 0.236
1.391HisIle: 1.391 ± 0.469
0.773HisLys: 0.773 ± 0.29
2.319HisLeu: 2.319 ± 0.67
0.464HisMet: 0.464 ± 0.309
1.082HisAsn: 1.082 ± 0.351
1.546HisPro: 1.546 ± 0.509
0.773HisGln: 0.773 ± 0.31
0.773HisArg: 0.773 ± 0.341
0.928HisSer: 0.928 ± 0.24
0.773HisThr: 0.773 ± 0.259
1.7HisVal: 1.7 ± 0.578
0.309HisTrp: 0.309 ± 0.178
0.773HisTyr: 0.773 ± 0.268
0.0HisXaa: 0.0 ± 0.0
Ile
4.792IleAla: 4.792 ± 0.958
1.855IleCys: 1.855 ± 0.554
4.328IleAsp: 4.328 ± 0.756
4.483IleGlu: 4.483 ± 1.267
3.246IlePhe: 3.246 ± 0.742
4.019IleGly: 4.019 ± 0.616
1.237IleHis: 1.237 ± 0.462
3.71IleIle: 3.71 ± 1.064
4.947IleLys: 4.947 ± 0.529
5.565IleLeu: 5.565 ± 0.577
2.319IleMet: 2.319 ± 0.517
3.555IleAsn: 3.555 ± 1.077
2.628IlePro: 2.628 ± 0.585
4.019IleGln: 4.019 ± 0.992
2.783IleArg: 2.783 ± 0.42
4.947IleSer: 4.947 ± 0.722
4.483IleThr: 4.483 ± 0.69
3.555IleVal: 3.555 ± 0.636
0.773IleTrp: 0.773 ± 0.418
1.855IleTyr: 1.855 ± 0.334
0.0IleXaa: 0.0 ± 0.0
Lys
3.246LysAla: 3.246 ± 0.66
0.618LysCys: 0.618 ± 0.251
3.246LysAsp: 3.246 ± 0.796
4.947LysGlu: 4.947 ± 1.222
2.164LysPhe: 2.164 ± 0.651
2.628LysGly: 2.628 ± 0.423
1.391LysHis: 1.391 ± 0.631
6.183LysIle: 6.183 ± 0.898
4.328LysLys: 4.328 ± 0.986
4.792LysLeu: 4.792 ± 0.906
1.7LysMet: 1.7 ± 0.611
3.555LysAsn: 3.555 ± 0.59
1.546LysPro: 1.546 ± 0.49
1.391LysGln: 1.391 ± 0.483
5.874LysArg: 5.874 ± 1.088
3.865LysSer: 3.865 ± 0.958
3.246LysThr: 3.246 ± 0.56
3.865LysVal: 3.865 ± 0.855
0.928LysTrp: 0.928 ± 0.421
2.01LysTyr: 2.01 ± 0.623
0.0LysXaa: 0.0 ± 0.0
Leu
6.183LeuAla: 6.183 ± 1.162
1.082LeuCys: 1.082 ± 0.339
5.41LeuAsp: 5.41 ± 0.824
5.256LeuGlu: 5.256 ± 0.564
2.783LeuPhe: 2.783 ± 0.836
3.555LeuGly: 3.555 ± 0.489
1.546LeuHis: 1.546 ± 0.449
5.874LeuIle: 5.874 ± 1.221
8.348LeuLys: 8.348 ± 1.391
7.111LeuLeu: 7.111 ± 1.241
2.937LeuMet: 2.937 ± 0.874
2.937LeuAsn: 2.937 ± 0.456
3.092LeuPro: 3.092 ± 0.557
2.473LeuGln: 2.473 ± 0.837
7.265LeuArg: 7.265 ± 0.762
6.183LeuSer: 6.183 ± 0.842
4.947LeuThr: 4.947 ± 0.879
4.638LeuVal: 4.638 ± 0.565
1.237LeuTrp: 1.237 ± 0.442
2.01LeuTyr: 2.01 ± 0.438
0.0LeuXaa: 0.0 ± 0.0
Met
2.319MetAla: 2.319 ± 0.558
0.309MetCys: 0.309 ± 0.197
1.7MetAsp: 1.7 ± 0.404
2.473MetGlu: 2.473 ± 0.49
1.546MetPhe: 1.546 ± 0.323
0.773MetGly: 0.773 ± 0.381
0.928MetHis: 0.928 ± 0.403
2.164MetIle: 2.164 ± 0.614
1.7MetLys: 1.7 ± 0.557
3.246MetLeu: 3.246 ± 0.74
1.237MetMet: 1.237 ± 0.383
2.783MetAsn: 2.783 ± 0.484
1.237MetPro: 1.237 ± 0.382
1.391MetGln: 1.391 ± 0.659
3.401MetArg: 3.401 ± 0.734
3.401MetSer: 3.401 ± 0.971
1.237MetThr: 1.237 ± 0.538
1.237MetVal: 1.237 ± 0.346
0.464MetTrp: 0.464 ± 0.227
1.855MetTyr: 1.855 ± 0.453
0.0MetXaa: 0.0 ± 0.0
Asn
2.937AsnAla: 2.937 ± 0.783
0.464AsnCys: 0.464 ± 0.272
2.01AsnAsp: 2.01 ± 0.609
4.019AsnGlu: 4.019 ± 0.716
1.7AsnPhe: 1.7 ± 0.48
3.092AsnGly: 3.092 ± 0.66
0.773AsnHis: 0.773 ± 0.273
2.473AsnIle: 2.473 ± 0.339
2.164AsnLys: 2.164 ± 0.754
3.246AsnLeu: 3.246 ± 0.948
1.855AsnMet: 1.855 ± 0.456
0.618AsnAsn: 0.618 ± 0.257
1.082AsnPro: 1.082 ± 0.48
2.164AsnGln: 2.164 ± 0.662
2.473AsnArg: 2.473 ± 0.617
1.7AsnSer: 1.7 ± 0.456
2.164AsnThr: 2.164 ± 0.946
4.174AsnVal: 4.174 ± 0.83
0.309AsnTrp: 0.309 ± 0.174
1.7AsnTyr: 1.7 ± 0.538
0.0AsnXaa: 0.0 ± 0.0
Pro
1.391ProAla: 1.391 ± 0.444
0.0ProCys: 0.0 ± 0.0
2.164ProAsp: 2.164 ± 1.021
2.628ProGlu: 2.628 ± 0.45
1.082ProPhe: 1.082 ± 0.521
1.391ProGly: 1.391 ± 0.654
0.773ProHis: 0.773 ± 0.355
4.174ProIle: 4.174 ± 1.283
2.01ProLys: 2.01 ± 0.731
3.71ProLeu: 3.71 ± 0.943
0.773ProMet: 0.773 ± 0.328
1.237ProAsn: 1.237 ± 0.431
1.855ProPro: 1.855 ± 0.638
1.237ProGln: 1.237 ± 0.5
1.855ProArg: 1.855 ± 0.475
1.855ProSer: 1.855 ± 0.326
3.092ProThr: 3.092 ± 0.699
2.473ProVal: 2.473 ± 0.509
0.309ProTrp: 0.309 ± 0.225
2.628ProTyr: 2.628 ± 0.534
0.0ProXaa: 0.0 ± 0.0
Gln
2.01GlnAla: 2.01 ± 0.86
0.309GlnCys: 0.309 ± 0.224
0.928GlnAsp: 0.928 ± 0.28
1.855GlnGlu: 1.855 ± 0.476
1.391GlnPhe: 1.391 ± 0.26
2.628GlnGly: 2.628 ± 0.767
0.928GlnHis: 0.928 ± 0.336
2.319GlnIle: 2.319 ± 0.433
1.391GlnLys: 1.391 ± 0.393
2.01GlnLeu: 2.01 ± 0.504
2.01GlnMet: 2.01 ± 0.444
1.237GlnAsn: 1.237 ± 0.497
1.082GlnPro: 1.082 ± 0.379
1.391GlnGln: 1.391 ± 0.639
3.555GlnArg: 3.555 ± 0.98
2.937GlnSer: 2.937 ± 0.813
3.246GlnThr: 3.246 ± 0.624
2.164GlnVal: 2.164 ± 0.398
0.309GlnTrp: 0.309 ± 0.191
0.773GlnTyr: 0.773 ± 0.161
0.0GlnXaa: 0.0 ± 0.0
Arg
6.647ArgAla: 6.647 ± 0.65
1.082ArgCys: 1.082 ± 0.464
4.328ArgAsp: 4.328 ± 0.794
4.483ArgGlu: 4.483 ± 0.844
3.246ArgPhe: 3.246 ± 0.835
4.328ArgGly: 4.328 ± 0.607
0.464ArgHis: 0.464 ± 0.218
4.328ArgIle: 4.328 ± 0.804
4.019ArgLys: 4.019 ± 0.655
4.947ArgLeu: 4.947 ± 0.929
3.71ArgMet: 3.71 ± 0.594
3.092ArgAsn: 3.092 ± 0.644
1.546ArgPro: 1.546 ± 0.569
2.937ArgGln: 2.937 ± 0.688
4.947ArgArg: 4.947 ± 0.487
3.401ArgSer: 3.401 ± 0.575
3.865ArgThr: 3.865 ± 0.641
4.638ArgVal: 4.638 ± 0.601
0.773ArgTrp: 0.773 ± 0.244
2.783ArgTyr: 2.783 ± 0.559
0.0ArgXaa: 0.0 ± 0.0
Ser
4.328SerAla: 4.328 ± 0.998
0.618SerCys: 0.618 ± 0.313
4.328SerAsp: 4.328 ± 0.747
3.71SerGlu: 3.71 ± 0.668
2.473SerPhe: 2.473 ± 0.666
4.019SerGly: 4.019 ± 1.003
1.7SerHis: 1.7 ± 0.475
5.101SerIle: 5.101 ± 1.054
4.483SerLys: 4.483 ± 0.894
4.947SerLeu: 4.947 ± 0.989
2.01SerMet: 2.01 ± 0.566
1.7SerAsn: 1.7 ± 0.461
2.164SerPro: 2.164 ± 0.854
2.01SerGln: 2.01 ± 0.574
3.865SerArg: 3.865 ± 0.902
4.483SerSer: 4.483 ± 1.083
3.092SerThr: 3.092 ± 0.561
3.401SerVal: 3.401 ± 0.651
1.237SerTrp: 1.237 ± 0.512
2.164SerTyr: 2.164 ± 0.627
0.0SerXaa: 0.0 ± 0.0
Thr
2.783ThrAla: 2.783 ± 0.661
0.618ThrCys: 0.618 ± 0.526
2.783ThrAsp: 2.783 ± 0.458
5.256ThrGlu: 5.256 ± 1.139
1.7ThrPhe: 1.7 ± 0.565
2.937ThrGly: 2.937 ± 0.701
1.237ThrHis: 1.237 ± 0.641
3.71ThrIle: 3.71 ± 0.672
3.555ThrLys: 3.555 ± 0.987
5.874ThrLeu: 5.874 ± 1.411
2.783ThrMet: 2.783 ± 0.788
2.319ThrAsn: 2.319 ± 0.445
2.628ThrPro: 2.628 ± 0.902
2.01ThrGln: 2.01 ± 0.519
2.783ThrArg: 2.783 ± 0.59
2.937ThrSer: 2.937 ± 0.694
2.783ThrThr: 2.783 ± 0.607
3.71ThrVal: 3.71 ± 0.816
0.464ThrTrp: 0.464 ± 0.232
2.01ThrTyr: 2.01 ± 0.668
0.0ThrXaa: 0.0 ± 0.0
Val
5.256ValAla: 5.256 ± 0.992
1.237ValCys: 1.237 ± 0.512
5.41ValAsp: 5.41 ± 1.153
4.174ValGlu: 4.174 ± 0.741
2.319ValPhe: 2.319 ± 0.498
3.865ValGly: 3.865 ± 0.979
1.7ValHis: 1.7 ± 0.551
3.555ValIle: 3.555 ± 0.804
4.019ValLys: 4.019 ± 0.754
4.792ValLeu: 4.792 ± 0.998
2.628ValMet: 2.628 ± 0.57
2.937ValAsn: 2.937 ± 0.646
2.937ValPro: 2.937 ± 0.741
3.71ValGln: 3.71 ± 0.851
5.101ValArg: 5.101 ± 1.026
4.328ValSer: 4.328 ± 0.946
3.401ValThr: 3.401 ± 0.79
3.71ValVal: 3.71 ± 0.724
0.618ValTrp: 0.618 ± 0.374
2.164ValTyr: 2.164 ± 0.275
0.0ValXaa: 0.0 ± 0.0
Trp
0.773TrpAla: 0.773 ± 0.322
0.155TrpCys: 0.155 ± 0.178
0.928TrpAsp: 0.928 ± 0.446
0.928TrpGlu: 0.928 ± 0.396
0.928TrpPhe: 0.928 ± 0.309
0.618TrpGly: 0.618 ± 0.159
0.464TrpHis: 0.464 ± 0.247
1.237TrpIle: 1.237 ± 0.387
1.082TrpLys: 1.082 ± 0.415
0.928TrpLeu: 0.928 ± 0.341
0.309TrpMet: 0.309 ± 0.156
0.773TrpAsn: 0.773 ± 0.4
0.155TrpPro: 0.155 ± 0.116
0.155TrpGln: 0.155 ± 0.153
0.618TrpArg: 0.618 ± 0.248
0.773TrpSer: 0.773 ± 0.362
0.309TrpThr: 0.309 ± 0.263
1.082TrpVal: 1.082 ± 0.342
0.309TrpTrp: 0.309 ± 0.305
0.309TrpTyr: 0.309 ± 0.197
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.628TyrAla: 2.628 ± 1.068
0.928TyrCys: 0.928 ± 0.325
2.628TyrAsp: 2.628 ± 0.508
1.7TyrGlu: 1.7 ± 0.586
1.391TyrPhe: 1.391 ± 0.352
2.628TyrGly: 2.628 ± 0.57
1.082TyrHis: 1.082 ± 0.446
2.473TyrIle: 2.473 ± 0.343
1.391TyrLys: 1.391 ± 0.554
2.628TyrLeu: 2.628 ± 0.783
0.928TyrMet: 0.928 ± 0.273
2.01TyrAsn: 2.01 ± 0.402
0.773TyrPro: 0.773 ± 0.224
0.928TyrGln: 0.928 ± 0.36
1.7TyrArg: 1.7 ± 0.495
2.01TyrSer: 2.01 ± 0.861
2.783TyrThr: 2.783 ± 0.699
4.638TyrVal: 4.638 ± 0.51
0.309TyrTrp: 0.309 ± 0.178
1.7TyrTyr: 1.7 ± 0.379
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 11 proteins (6470 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski