Amino acid dipepetide frequency for African horse sickness virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.111AlaAla: 5.111 ± 1.456
0.319AlaCys: 0.319 ± 0.232
3.674AlaAsp: 3.674 ± 1.152
3.993AlaGlu: 3.993 ± 0.867
1.757AlaPhe: 1.757 ± 0.625
4.312AlaGly: 4.312 ± 1.222
0.639AlaHis: 0.639 ± 0.357
5.271AlaIle: 5.271 ± 1.702
3.354AlaLys: 3.354 ± 1.033
7.667AlaLeu: 7.667 ± 1.43
2.875AlaMet: 2.875 ± 0.675
2.236AlaAsn: 2.236 ± 0.725
3.833AlaPro: 3.833 ± 1.294
1.278AlaGln: 1.278 ± 0.511
4.312AlaArg: 4.312 ± 0.731
3.194AlaSer: 3.194 ± 0.595
3.674AlaThr: 3.674 ± 1.23
3.993AlaVal: 3.993 ± 0.684
1.118AlaTrp: 1.118 ± 0.343
2.875AlaTyr: 2.875 ± 0.67
0.0AlaXaa: 0.0 ± 0.0
Cys
0.639CysAla: 0.639 ± 0.386
0.319CysCys: 0.319 ± 0.251
1.118CysAsp: 1.118 ± 0.272
0.479CysGlu: 0.479 ± 0.276
0.639CysPhe: 0.639 ± 0.686
1.278CysGly: 1.278 ± 0.778
0.319CysHis: 0.319 ± 0.218
0.319CysIle: 0.319 ± 0.205
0.319CysLys: 0.319 ± 0.251
1.118CysLeu: 1.118 ± 0.264
0.16CysMet: 0.16 ± 0.18
0.319CysAsn: 0.319 ± 0.218
0.319CysPro: 0.319 ± 0.185
0.479CysGln: 0.479 ± 0.272
0.799CysArg: 0.799 ± 0.421
0.639CysSer: 0.639 ± 0.356
0.639CysThr: 0.639 ± 0.257
0.958CysVal: 0.958 ± 0.306
0.319CysTrp: 0.319 ± 0.181
0.958CysTyr: 0.958 ± 0.402
0.0CysXaa: 0.0 ± 0.0
Asp
3.194AspAla: 3.194 ± 0.833
0.639AspCys: 0.639 ± 0.381
3.833AspAsp: 3.833 ± 0.727
4.792AspGlu: 4.792 ± 0.89
3.035AspPhe: 3.035 ± 0.787
5.91AspGly: 5.91 ± 1.352
1.278AspHis: 1.278 ± 0.372
3.035AspIle: 3.035 ± 0.545
2.076AspLys: 2.076 ± 0.486
6.548AspLeu: 6.548 ± 1.19
1.757AspMet: 1.757 ± 0.483
1.278AspAsn: 1.278 ± 0.363
3.354AspPro: 3.354 ± 0.801
1.118AspGln: 1.118 ± 0.318
4.153AspArg: 4.153 ± 0.811
3.194AspSer: 3.194 ± 0.714
3.354AspThr: 3.354 ± 0.362
6.069AspVal: 6.069 ± 0.879
0.799AspTrp: 0.799 ± 0.384
2.556AspTyr: 2.556 ± 0.347
0.0AspXaa: 0.0 ± 0.0
Glu
4.153GluAla: 4.153 ± 0.738
0.799GluCys: 0.799 ± 0.315
4.632GluAsp: 4.632 ± 0.76
4.951GluGlu: 4.951 ± 0.655
3.194GluPhe: 3.194 ± 1.061
3.514GluGly: 3.514 ± 0.828
0.479GluHis: 0.479 ± 0.305
6.069GluIle: 6.069 ± 0.742
6.069GluLys: 6.069 ± 1.309
4.951GluLeu: 4.951 ± 0.65
2.556GluMet: 2.556 ± 0.548
2.236GluAsn: 2.236 ± 0.566
2.556GluPro: 2.556 ± 0.56
1.917GluGln: 1.917 ± 0.566
5.111GluArg: 5.111 ± 0.717
3.993GluSer: 3.993 ± 0.785
3.833GluThr: 3.833 ± 0.798
5.59GluVal: 5.59 ± 1.109
1.118GluTrp: 1.118 ± 0.527
3.354GluTyr: 3.354 ± 0.762
0.0GluXaa: 0.0 ± 0.0
Phe
2.236PheAla: 2.236 ± 0.65
0.16PheCys: 0.16 ± 0.18
2.076PheAsp: 2.076 ± 0.361
3.035PheGlu: 3.035 ± 0.522
1.757PhePhe: 1.757 ± 0.544
3.674PheGly: 3.674 ± 0.551
0.639PheHis: 0.639 ± 0.319
2.556PheIle: 2.556 ± 0.944
2.396PheLys: 2.396 ± 0.777
3.035PheLeu: 3.035 ± 0.541
1.917PheMet: 1.917 ± 0.694
1.278PheAsn: 1.278 ± 0.665
1.118PhePro: 1.118 ± 0.372
0.958PheGln: 0.958 ± 0.285
3.035PheArg: 3.035 ± 0.939
2.556PheSer: 2.556 ± 0.698
1.917PheThr: 1.917 ± 0.44
2.236PheVal: 2.236 ± 0.559
0.0PheTrp: 0.0 ± 0.0
2.236PheTyr: 2.236 ± 0.477
0.0PheXaa: 0.0 ± 0.0
Gly
5.59GlyAla: 5.59 ± 1.436
0.639GlyCys: 0.639 ± 0.256
3.993GlyAsp: 3.993 ± 1.288
5.43GlyGlu: 5.43 ± 1.197
2.076GlyPhe: 2.076 ± 0.518
6.389GlyGly: 6.389 ± 3.353
1.437GlyHis: 1.437 ± 0.489
3.354GlyIle: 3.354 ± 0.546
3.035GlyLys: 3.035 ± 0.896
4.951GlyLeu: 4.951 ± 0.936
1.757GlyMet: 1.757 ± 0.487
2.396GlyAsn: 2.396 ± 0.454
1.757GlyPro: 1.757 ± 0.559
1.757GlyGln: 1.757 ± 0.812
4.632GlyArg: 4.632 ± 0.905
4.312GlySer: 4.312 ± 1.367
2.715GlyThr: 2.715 ± 0.501
4.632GlyVal: 4.632 ± 1.102
1.118GlyTrp: 1.118 ± 0.329
2.076GlyTyr: 2.076 ± 0.79
0.0GlyXaa: 0.0 ± 0.0
His
1.757HisAla: 1.757 ± 0.423
0.319HisCys: 0.319 ± 0.274
0.958HisAsp: 0.958 ± 0.509
0.958HisGlu: 0.958 ± 0.57
0.479HisPhe: 0.479 ± 0.293
1.118HisGly: 1.118 ± 0.402
0.319HisHis: 0.319 ± 0.24
1.917HisIle: 1.917 ± 0.67
0.799HisLys: 0.799 ± 0.321
2.076HisLeu: 2.076 ± 0.83
0.479HisMet: 0.479 ± 0.336
0.958HisAsn: 0.958 ± 0.315
1.278HisPro: 1.278 ± 0.539
0.799HisGln: 0.799 ± 0.306
0.958HisArg: 0.958 ± 0.465
0.958HisSer: 0.958 ± 0.256
0.799HisThr: 0.799 ± 0.268
1.917HisVal: 1.917 ± 0.806
0.319HisTrp: 0.319 ± 0.202
0.799HisTyr: 0.799 ± 0.333
0.0HisXaa: 0.0 ± 0.0
Ile
3.833IleAla: 3.833 ± 1.015
1.437IleCys: 1.437 ± 0.563
4.312IleAsp: 4.312 ± 0.601
4.951IleGlu: 4.951 ± 1.437
3.514IlePhe: 3.514 ± 0.713
3.674IleGly: 3.674 ± 0.506
1.437IleHis: 1.437 ± 0.564
3.833IleIle: 3.833 ± 1.081
4.153IleLys: 4.153 ± 0.783
5.271IleLeu: 5.271 ± 0.825
1.757IleMet: 1.757 ± 0.56
3.674IleAsn: 3.674 ± 1.094
2.396IlePro: 2.396 ± 0.7
3.833IleGln: 3.833 ± 0.839
3.194IleArg: 3.194 ± 0.516
4.951IleSer: 4.951 ± 0.866
4.312IleThr: 4.312 ± 0.651
3.993IleVal: 3.993 ± 0.685
0.799IleTrp: 0.799 ± 0.444
2.236IleTyr: 2.236 ± 0.408
0.0IleXaa: 0.0 ± 0.0
Lys
3.833LysAla: 3.833 ± 0.959
0.479LysCys: 0.479 ± 0.274
3.354LysAsp: 3.354 ± 1.055
4.951LysGlu: 4.951 ± 1.04
2.875LysPhe: 2.875 ± 1.005
3.035LysGly: 3.035 ± 0.78
1.118LysHis: 1.118 ± 0.711
4.792LysIle: 4.792 ± 0.65
3.833LysLys: 3.833 ± 0.892
5.43LysLeu: 5.43 ± 1.16
2.076LysMet: 2.076 ± 0.723
3.194LysAsn: 3.194 ± 0.928
1.917LysPro: 1.917 ± 0.665
1.597LysGln: 1.597 ± 0.475
5.43LysArg: 5.43 ± 1.172
3.035LysSer: 3.035 ± 0.792
2.556LysThr: 2.556 ± 0.672
3.194LysVal: 3.194 ± 0.537
0.799LysTrp: 0.799 ± 0.426
2.076LysTyr: 2.076 ± 0.64
0.0LysXaa: 0.0 ± 0.0
Leu
6.069LeuAla: 6.069 ± 1.104
0.958LeuCys: 0.958 ± 0.34
5.91LeuAsp: 5.91 ± 0.737
5.111LeuGlu: 5.111 ± 0.625
2.076LeuPhe: 2.076 ± 0.86
3.194LeuGly: 3.194 ± 0.431
1.917LeuHis: 1.917 ± 0.565
4.951LeuIle: 4.951 ± 1.168
7.826LeuLys: 7.826 ± 1.5
7.187LeuLeu: 7.187 ± 1.13
3.354LeuMet: 3.354 ± 0.97
2.715LeuAsn: 2.715 ± 0.408
3.035LeuPro: 3.035 ± 0.524
2.875LeuGln: 2.875 ± 0.722
7.826LeuArg: 7.826 ± 0.768
6.708LeuSer: 6.708 ± 0.774
4.792LeuThr: 4.792 ± 0.852
4.632LeuVal: 4.632 ± 0.711
0.958LeuTrp: 0.958 ± 0.366
2.556LeuTyr: 2.556 ± 0.467
0.0LeuXaa: 0.0 ± 0.0
Met
2.076MetAla: 2.076 ± 0.567
0.639MetCys: 0.639 ± 0.237
1.917MetAsp: 1.917 ± 0.537
3.035MetGlu: 3.035 ± 0.615
1.278MetPhe: 1.278 ± 0.339
0.958MetGly: 0.958 ± 0.55
0.958MetHis: 0.958 ± 0.383
1.597MetIle: 1.597 ± 0.511
1.757MetLys: 1.757 ± 0.535
3.514MetLeu: 3.514 ± 0.655
1.437MetMet: 1.437 ± 0.388
2.715MetAsn: 2.715 ± 0.669
1.278MetPro: 1.278 ± 0.366
1.597MetGln: 1.597 ± 0.651
3.674MetArg: 3.674 ± 0.56
2.875MetSer: 2.875 ± 0.74
1.278MetThr: 1.278 ± 0.511
1.757MetVal: 1.757 ± 0.382
0.479MetTrp: 0.479 ± 0.253
2.076MetTyr: 2.076 ± 0.668
0.0MetXaa: 0.0 ± 0.0
Asn
3.833AsnAla: 3.833 ± 1.106
0.319AsnCys: 0.319 ± 0.239
2.236AsnAsp: 2.236 ± 0.642
4.153AsnGlu: 4.153 ± 0.886
1.597AsnPhe: 1.597 ± 0.413
3.194AsnGly: 3.194 ± 0.644
0.958AsnHis: 0.958 ± 0.333
2.076AsnIle: 2.076 ± 0.425
1.597AsnLys: 1.597 ± 0.631
3.514AsnLeu: 3.514 ± 0.9
1.597AsnMet: 1.597 ± 0.407
0.958AsnAsn: 0.958 ± 0.297
0.958AsnPro: 0.958 ± 0.598
1.278AsnGln: 1.278 ± 0.565
2.556AsnArg: 2.556 ± 0.455
1.917AsnSer: 1.917 ± 0.474
2.076AsnThr: 2.076 ± 0.952
4.153AsnVal: 4.153 ± 1.008
0.319AsnTrp: 0.319 ± 0.159
1.757AsnTyr: 1.757 ± 0.614
0.0AsnXaa: 0.0 ± 0.0
Pro
0.958ProAla: 0.958 ± 0.439
0.319ProCys: 0.319 ± 0.283
2.396ProAsp: 2.396 ± 0.974
2.715ProGlu: 2.715 ± 0.299
1.118ProPhe: 1.118 ± 0.505
1.437ProGly: 1.437 ± 0.648
0.958ProHis: 0.958 ± 0.399
4.312ProIle: 4.312 ± 1.165
2.556ProLys: 2.556 ± 0.889
3.514ProLeu: 3.514 ± 1.071
0.958ProMet: 0.958 ± 0.378
1.278ProAsn: 1.278 ± 0.415
1.597ProPro: 1.597 ± 0.59
1.118ProGln: 1.118 ± 0.532
2.076ProArg: 2.076 ± 0.415
2.076ProSer: 2.076 ± 0.394
3.354ProThr: 3.354 ± 0.714
2.236ProVal: 2.236 ± 0.656
0.319ProTrp: 0.319 ± 0.262
2.715ProTyr: 2.715 ± 0.594
0.0ProXaa: 0.0 ± 0.0
Gln
1.757GlnAla: 1.757 ± 0.768
0.319GlnCys: 0.319 ± 0.224
0.799GlnAsp: 0.799 ± 0.312
1.757GlnGlu: 1.757 ± 0.41
1.437GlnPhe: 1.437 ± 0.239
2.875GlnGly: 2.875 ± 0.697
0.958GlnHis: 0.958 ± 0.422
2.236GlnIle: 2.236 ± 0.615
1.597GlnLys: 1.597 ± 0.521
1.437GlnLeu: 1.437 ± 0.314
1.917GlnMet: 1.917 ± 0.563
1.757GlnAsn: 1.757 ± 0.361
1.437GlnPro: 1.437 ± 0.453
1.437GlnGln: 1.437 ± 0.671
3.354GlnArg: 3.354 ± 0.914
2.236GlnSer: 2.236 ± 0.723
2.715GlnThr: 2.715 ± 0.628
1.917GlnVal: 1.917 ± 0.458
0.639GlnTrp: 0.639 ± 0.31
0.799GlnTyr: 0.799 ± 0.293
0.0GlnXaa: 0.0 ± 0.0
Arg
5.75ArgAla: 5.75 ± 1.031
0.958ArgCys: 0.958 ± 0.577
4.792ArgAsp: 4.792 ± 0.562
4.472ArgGlu: 4.472 ± 0.999
3.674ArgPhe: 3.674 ± 0.917
4.632ArgGly: 4.632 ± 0.995
0.799ArgHis: 0.799 ± 0.453
4.632ArgIle: 4.632 ± 0.673
4.472ArgLys: 4.472 ± 0.747
5.111ArgLeu: 5.111 ± 1.091
3.833ArgMet: 3.833 ± 0.522
3.674ArgAsn: 3.674 ± 0.719
1.597ArgPro: 1.597 ± 0.598
2.236ArgGln: 2.236 ± 0.755
4.951ArgArg: 4.951 ± 0.54
3.833ArgSer: 3.833 ± 0.602
3.833ArgThr: 3.833 ± 0.771
4.312ArgVal: 4.312 ± 0.729
0.799ArgTrp: 0.799 ± 0.275
2.236ArgTyr: 2.236 ± 0.566
0.0ArgXaa: 0.0 ± 0.0
Ser
4.153SerAla: 4.153 ± 0.815
0.319SerCys: 0.319 ± 0.251
3.674SerAsp: 3.674 ± 0.924
3.674SerGlu: 3.674 ± 0.881
2.396SerPhe: 2.396 ± 0.623
5.271SerGly: 5.271 ± 1.816
1.437SerHis: 1.437 ± 0.306
4.951SerIle: 4.951 ± 1.103
3.833SerLys: 3.833 ± 1.026
4.312SerLeu: 4.312 ± 0.863
2.715SerMet: 2.715 ± 0.744
1.757SerAsn: 1.757 ± 0.376
2.396SerPro: 2.396 ± 0.808
2.556SerGln: 2.556 ± 0.715
3.035SerArg: 3.035 ± 0.709
3.833SerSer: 3.833 ± 1.302
3.354SerThr: 3.354 ± 0.714
3.514SerVal: 3.514 ± 0.635
1.278SerTrp: 1.278 ± 0.503
2.076SerTyr: 2.076 ± 0.574
0.0SerXaa: 0.0 ± 0.0
Thr
2.236ThrAla: 2.236 ± 0.587
0.639ThrCys: 0.639 ± 0.686
3.194ThrAsp: 3.194 ± 0.496
4.792ThrGlu: 4.792 ± 1.293
1.757ThrPhe: 1.757 ± 0.62
2.556ThrGly: 2.556 ± 0.743
1.757ThrHis: 1.757 ± 0.719
3.833ThrIle: 3.833 ± 0.576
3.993ThrLys: 3.993 ± 0.943
6.229ThrLeu: 6.229 ± 1.267
1.757ThrMet: 1.757 ± 0.64
2.556ThrAsn: 2.556 ± 0.605
2.556ThrPro: 2.556 ± 0.936
2.076ThrGln: 2.076 ± 0.488
3.194ThrArg: 3.194 ± 0.684
3.514ThrSer: 3.514 ± 0.963
3.035ThrThr: 3.035 ± 0.656
3.354ThrVal: 3.354 ± 0.769
0.479ThrTrp: 0.479 ± 0.267
1.757ThrTyr: 1.757 ± 0.623
0.0ThrXaa: 0.0 ± 0.0
Val
4.632ValAla: 4.632 ± 0.88
1.437ValCys: 1.437 ± 0.515
4.951ValAsp: 4.951 ± 0.93
4.632ValGlu: 4.632 ± 1.241
2.076ValPhe: 2.076 ± 0.52
3.674ValGly: 3.674 ± 0.792
1.118ValHis: 1.118 ± 0.298
3.993ValIle: 3.993 ± 0.912
3.993ValLys: 3.993 ± 0.951
5.59ValLeu: 5.59 ± 0.955
2.396ValMet: 2.396 ± 0.61
3.354ValAsn: 3.354 ± 0.779
3.674ValPro: 3.674 ± 0.915
3.514ValGln: 3.514 ± 0.943
4.951ValArg: 4.951 ± 1.026
3.833ValSer: 3.833 ± 0.69
3.674ValThr: 3.674 ± 0.532
3.194ValVal: 3.194 ± 0.715
0.319ValTrp: 0.319 ± 0.217
2.076ValTyr: 2.076 ± 0.28
0.0ValXaa: 0.0 ± 0.0
Trp
0.479TrpAla: 0.479 ± 0.303
0.16TrpCys: 0.16 ± 0.194
0.799TrpAsp: 0.799 ± 0.379
0.958TrpGlu: 0.958 ± 0.46
0.639TrpPhe: 0.639 ± 0.189
0.639TrpGly: 0.639 ± 0.223
0.479TrpHis: 0.479 ± 0.241
1.757TrpIle: 1.757 ± 0.783
0.639TrpLys: 0.639 ± 0.242
0.958TrpLeu: 0.958 ± 0.383
0.319TrpMet: 0.319 ± 0.166
0.799TrpAsn: 0.799 ± 0.397
0.0TrpPro: 0.0 ± 0.0
0.16TrpGln: 0.16 ± 0.181
1.118TrpArg: 1.118 ± 0.352
0.799TrpSer: 0.799 ± 0.399
0.319TrpThr: 0.319 ± 0.343
0.799TrpVal: 0.799 ± 0.263
0.319TrpTrp: 0.319 ± 0.362
0.479TrpTyr: 0.479 ± 0.207
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.194TyrAla: 3.194 ± 1.097
0.958TyrCys: 0.958 ± 0.421
3.354TyrAsp: 3.354 ± 0.648
2.236TyrGlu: 2.236 ± 0.643
1.437TyrPhe: 1.437 ± 0.407
2.715TyrGly: 2.715 ± 0.711
0.958TyrHis: 0.958 ± 0.41
2.236TyrIle: 2.236 ± 0.331
1.278TyrLys: 1.278 ± 0.414
2.236TyrLeu: 2.236 ± 0.802
1.118TyrMet: 1.118 ± 0.408
1.917TyrAsn: 1.917 ± 0.48
0.958TyrPro: 0.958 ± 0.378
0.799TyrGln: 0.799 ± 0.338
2.236TyrArg: 2.236 ± 0.444
2.076TyrSer: 2.076 ± 0.857
3.035TyrThr: 3.035 ± 0.688
4.632TyrVal: 4.632 ± 0.469
0.319TyrTrp: 0.319 ± 0.202
1.597TyrTyr: 1.597 ± 0.385
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (6262 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski