Amino acid dipeptide frequency for Guangxi orbivirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
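The counting behind these numbers can be sketched in a few lines of Python. This is a minimal illustration, not the code used to build the table: the function names are hypothetical, and it assumes overlapping residue pairs within each protein, one-letter amino acid codes, and "X" standing in for Xaa. The exact counting and normalization conventions used for the table are not stated here.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard one-letter codes

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: pairs are counted within each protein and never span two proteins.
    """
    counts = Counter()
    for seq in sequences:
        for a, b in zip(seq, seq[1:]):
            # Map anything outside the 20 standard residues to "X" (Xaa).
            a = a if a in AMINO_ACIDS else "X"
            b = b if b in AMINO_ACIDS else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

n_standard = len(AMINO_ACIDS) ** 2            # 400 dipeptides
n_with_xaa = (len(AMINO_ACIDS) + 1) ** 2      # 441 dipeptides when Xaa is included
expected_per_mille = 1000.0 / n_standard      # 2.5 per mille, i.e. 0.25%

def representation(observed_per_mille, expected=expected_per_mille):
    """Classify a value against the uniform expectation (red/blue in the table)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"
```

With the values listed in the table, representation(8.461) for AlaLeu gives "over" and representation(0.16) for CysCys gives "under".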

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
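As a quick worked example of the unit conversion, the short Python sketch below converts one table value (AlaLeu, 8.461‰) to a fraction and to a percentage. The helper names are hypothetical and only illustrate the arithmetic.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3

def per_mille_to_percent(value_per_mille):
    """Convert a per-mille value to percent (1% = 10 per mille)."""
    return value_per_mille / 10.0

ala_leu = 8.461  # AlaLeu value from the table, in per mille
print(f"{per_mille_to_fraction(ala_leu):.6f}")   # 0.008461
print(f"{per_mille_to_percent(ala_leu):.4f}%")   # 0.8461%
```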

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second; each cell is the frequency in ‰ ± its estimated error.

    | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 4.63 ± 0.843 | 1.437 ± 0.459 | 3.033 ± 0.762 | 3.352 ± 0.801 | 1.756 ± 0.516 | 4.31 ± 1.287 | 1.596 ± 0.382 | 4.63 ± 0.948 | 3.193 ± 0.786 | 8.461 ± 1.188 | 2.075 ± 0.578 | 3.352 ± 0.733 | 3.193 ± 0.787 | 3.193 ± 0.875 | 4.47 ± 0.732 | 2.874 ± 0.543 | 2.874 ± 0.347 | 3.672 ± 0.524 | 0.639 ± 0.223 | 2.235 ± 0.512 | 0.0 ± 0.0
Cys | 0.958 ± 0.334 | 0.16 ± 0.163 | 0.798 ± 0.289 | 0.798 ± 0.249 | 1.596 ± 0.405 | 0.0 ± 0.0 | 0.16 ± 0.135 | 1.756 ± 0.453 | 0.639 ± 0.286 | 0.639 ± 0.222 | 0.639 ± 0.235 | 1.277 ± 0.274 | 0.0 ± 0.0 | 0.479 ± 0.174 | 0.319 ± 0.247 | 1.277 ± 0.683 | 1.117 ± 0.294 | 0.479 ± 0.194 | 0.319 ± 0.157 | 1.277 ± 0.642 | 0.0 ± 0.0
Asp | 4.949 ± 0.92 | 0.319 ± 0.227 | 2.874 ± 0.587 | 4.151 ± 0.697 | 1.596 ± 0.403 | 2.714 ± 1.013 | 0.319 ± 0.326 | 4.31 ± 0.784 | 3.033 ± 0.895 | 7.184 ± 1.171 | 1.437 ± 0.342 | 1.117 ± 0.374 | 2.075 ± 0.54 | 0.958 ± 0.25 | 2.554 ± 0.443 | 3.512 ± 0.705 | 3.831 ± 0.867 | 5.747 ± 0.969 | 0.479 ± 0.26 | 1.277 ± 0.321 | 0.0 ± 0.0
Glu | 5.268 ± 1.158 | 1.117 ± 0.386 | 3.831 ± 0.73 | 7.503 ± 1.259 | 1.596 ± 0.733 | 3.033 ± 0.43 | 1.596 ± 0.629 | 6.705 ± 0.785 | 3.831 ± 0.59 | 3.033 ± 0.315 | 1.596 ± 0.423 | 3.672 ± 0.683 | 2.235 ± 0.547 | 1.277 ± 0.438 | 3.831 ± 0.633 | 5.907 ± 1.493 | 3.831 ± 0.518 | 5.747 ± 0.863 | 0.639 ± 0.445 | 2.395 ± 0.643 | 0.0 ± 0.0
Phe | 1.756 ± 0.676 | 0.958 ± 0.378 | 0.958 ± 0.32 | 2.554 ± 0.47 | 1.596 ± 0.572 | 1.756 ± 0.295 | 1.277 ± 0.373 | 3.512 ± 0.707 | 2.714 ± 0.562 | 2.714 ± 0.676 | 1.596 ± 0.424 | 2.235 ± 0.421 | 1.437 ± 0.371 | 0.639 ± 0.352 | 2.235 ± 0.405 | 3.672 ± 1.054 | 1.596 ± 0.429 | 3.672 ± 0.712 | 0.319 ± 0.21 | 1.277 ± 0.357 | 0.0 ± 0.0
Gly | 4.789 ± 0.966 | 0.639 ± 0.318 | 3.193 ± 1.129 | 4.63 ± 1.002 | 2.554 ± 0.386 | 2.235 ± 0.751 | 0.958 ± 0.339 | 2.395 ± 0.503 | 2.874 ± 0.806 | 3.672 ± 0.832 | 1.916 ± 0.605 | 2.554 ± 0.536 | 1.916 ± 0.444 | 2.554 ± 0.585 | 2.554 ± 0.314 | 4.31 ± 0.597 | 3.033 ± 0.735 | 4.31 ± 0.996 | 0.319 ± 0.24 | 2.395 ± 0.975 | 0.0 ± 0.0
His | 1.916 ± 0.577 | 0.479 ± 0.257 | 0.798 ± 0.368 | 1.437 ± 0.384 | 0.639 ± 0.364 | 1.277 ± 0.525 | 0.639 ± 0.326 | 2.075 ± 0.525 | 0.798 ± 0.196 | 3.033 ± 0.518 | 0.798 ± 0.316 | 0.479 ± 0.258 | 0.479 ± 0.274 | 0.319 ± 0.183 | 1.756 ± 0.679 | 2.075 ± 0.427 | 0.639 ± 0.184 | 1.437 ± 0.516 | 0.479 ± 0.231 | 0.479 ± 0.291 | 0.0 ± 0.0
Ile | 3.193 ± 0.49 | 1.117 ± 0.358 | 4.31 ± 0.702 | 3.831 ± 1.163 | 2.395 ± 0.465 | 4.47 ± 0.632 | 2.714 ± 0.452 | 4.31 ± 0.762 | 3.991 ± 0.823 | 4.949 ± 1.011 | 2.395 ± 0.441 | 3.352 ± 0.972 | 2.395 ± 0.459 | 2.714 ± 0.565 | 3.831 ± 0.798 | 5.587 ± 0.717 | 4.47 ± 0.547 | 4.151 ± 0.256 | 0.639 ± 0.281 | 2.554 ± 0.811 | 0.0 ± 0.0
Lys | 2.714 ± 0.404 | 0.639 ± 0.398 | 3.831 ± 0.592 | 3.831 ± 0.75 | 2.235 ± 0.519 | 2.714 ± 0.334 | 0.958 ± 0.334 | 4.31 ± 1.047 | 4.151 ± 0.723 | 5.907 ± 0.852 | 1.437 ± 0.464 | 3.193 ± 0.83 | 1.756 ± 0.557 | 1.117 ± 0.385 | 5.268 ± 1.307 | 3.512 ± 0.666 | 3.672 ± 0.515 | 3.831 ± 0.593 | 0.798 ± 0.324 | 1.117 ± 0.292 | 0.0 ± 0.0
Leu | 7.822 ± 0.838 | 1.916 ± 0.683 | 5.109 ± 0.861 | 5.907 ± 0.648 | 3.033 ± 0.272 | 4.31 ± 0.53 | 0.958 ± 0.262 | 5.109 ± 0.651 | 4.47 ± 0.604 | 8.301 ± 0.807 | 3.831 ± 0.67 | 5.587 ± 1.187 | 5.109 ± 0.719 | 3.352 ± 0.762 | 7.822 ± 1.165 | 8.142 ± 0.709 | 3.352 ± 0.708 | 5.109 ± 0.619 | 0.639 ± 0.184 | 2.235 ± 0.547 | 0.0 ± 0.0
Met | 1.916 ± 0.622 | 0.479 ± 0.22 | 1.596 ± 0.553 | 1.277 ± 0.715 | 0.958 ± 0.448 | 1.596 ± 0.362 | 1.117 ± 0.344 | 1.277 ± 0.481 | 2.554 ± 0.391 | 3.512 ± 0.593 | 1.756 ± 0.39 | 1.756 ± 0.519 | 1.916 ± 0.777 | 1.117 ± 0.372 | 3.033 ± 0.313 | 3.193 ± 0.606 | 0.958 ± 0.558 | 1.756 ± 0.2 | 0.319 ± 0.227 | 1.756 ± 0.573 | 0.0 ± 0.0
Asn | 2.874 ± 0.562 | 0.958 ± 0.299 | 1.756 ± 0.263 | 2.874 ± 0.441 | 2.874 ± 0.668 | 2.874 ± 0.393 | 1.117 ± 0.365 | 2.235 ± 0.333 | 3.033 ± 0.499 | 4.63 ± 0.765 | 0.639 ± 0.288 | 1.117 ± 0.268 | 2.075 ± 0.582 | 1.596 ± 0.671 | 2.874 ± 0.446 | 3.033 ± 0.424 | 4.63 ± 0.606 | 5.428 ± 0.669 | 0.639 ± 0.368 | 2.395 ± 0.444 | 0.0 ± 0.0
Pro | 2.395 ± 0.804 | 0.319 ± 0.159 | 3.193 ± 0.921 | 3.352 ± 0.598 | 1.596 ± 0.565 | 1.437 ± 0.297 | 0.639 ± 0.216 | 1.596 ± 0.577 | 1.916 ± 0.619 | 2.395 ± 0.867 | 1.117 ± 0.451 | 2.395 ± 0.444 | 2.075 ± 0.462 | 1.756 ± 0.534 | 2.395 ± 0.711 | 2.395 ± 0.725 | 2.874 ± 0.438 | 3.033 ± 0.45 | 0.479 ± 0.296 | 1.756 ± 0.545 | 0.0 ± 0.0
Gln | 1.117 ± 0.298 | 0.479 ± 0.302 | 1.437 ± 0.369 | 1.916 ± 0.43 | 1.437 ± 0.474 | 2.395 ± 0.692 | 0.479 ± 0.188 | 3.033 ± 0.507 | 1.916 ± 0.543 | 3.193 ± 0.574 | 1.117 ± 0.366 | 1.437 ± 0.49 | 1.117 ± 0.55 | 1.277 ± 0.412 | 3.033 ± 0.54 | 2.714 ± 0.636 | 1.756 ± 0.355 | 1.437 ± 0.499 | 0.798 ± 0.26 | 0.639 ± 0.309 | 0.0 ± 0.0
Arg | 4.47 ± 0.669 | 0.639 ± 0.339 | 3.672 ± 0.64 | 4.47 ± 0.494 | 3.352 ± 0.614 | 4.31 ± 0.558 | 1.437 ± 0.494 | 4.949 ± 0.427 | 3.352 ± 0.829 | 5.428 ± 0.963 | 2.395 ± 0.612 | 2.874 ± 0.826 | 1.916 ± 0.369 | 1.437 ± 0.314 | 5.109 ± 0.931 | 4.63 ± 0.829 | 3.991 ± 0.689 | 5.747 ± 0.916 | 0.479 ± 0.314 | 2.075 ± 0.452 | 0.0 ± 0.0
Ser | 4.949 ± 0.594 | 0.16 ± 0.166 | 5.587 ± 0.706 | 6.226 ± 0.749 | 3.352 ± 0.518 | 3.352 ± 0.514 | 1.916 ± 0.67 | 4.151 ± 0.876 | 4.151 ± 0.986 | 6.705 ± 1.287 | 3.033 ± 0.675 | 3.672 ± 0.595 | 3.033 ± 0.784 | 2.554 ± 0.491 | 5.747 ± 0.587 | 4.789 ± 0.893 | 3.991 ± 0.912 | 3.831 ± 0.841 | 0.958 ± 0.341 | 1.756 ± 0.454 | 0.0 ± 0.0
Thr | 3.193 ± 0.7 | 0.798 ± 0.409 | 2.395 ± 0.587 | 5.268 ± 0.589 | 2.235 ± 0.674 | 3.991 ± 0.793 | 2.075 ± 0.395 | 3.512 ± 0.786 | 3.193 ± 0.572 | 5.747 ± 0.895 | 1.596 ± 0.33 | 2.714 ± 0.63 | 2.874 ± 0.871 | 2.235 ± 0.628 | 3.831 ± 0.79 | 3.193 ± 0.787 | 2.235 ± 0.475 | 3.512 ± 0.403 | 0.319 ± 0.248 | 1.756 ± 0.436 | 0.0 ± 0.0
Val | 4.31 ± 0.458 | 0.639 ± 0.3 | 3.352 ± 0.685 | 3.033 ± 0.393 | 1.916 ± 0.431 | 4.63 ± 1.023 | 1.277 ± 0.58 | 3.831 ± 0.795 | 4.47 ± 0.73 | 8.142 ± 0.901 | 3.033 ± 0.537 | 3.672 ± 0.85 | 2.235 ± 0.615 | 2.874 ± 0.997 | 3.193 ± 0.979 | 5.907 ± 1.161 | 4.789 ± 0.551 | 4.789 ± 1.006 | 0.798 ± 0.282 | 2.874 ± 0.646 | 0.0 ± 0.0
Trp | 0.319 ± 0.174 | 0.16 ± 0.163 | 0.639 ± 0.248 | 0.639 ± 0.269 | 0.639 ± 0.235 | 0.479 ± 0.343 | 0.16 ± 0.156 | 0.958 ± 0.341 | 1.117 ± 0.54 | 0.958 ± 0.351 | 0.16 ± 0.163 | 0.798 ± 0.557 | 0.319 ± 0.261 | 0.639 ± 0.271 | 1.117 ± 0.375 | 0.16 ± 0.156 | 0.319 ± 0.312 | 0.16 ± 0.156 | 0.16 ± 0.156 | 0.479 ± 0.421 | 0.0 ± 0.0
Tyr | 1.437 ± 0.524 | 1.277 ± 0.365 | 2.075 ± 0.478 | 1.596 ± 0.552 | 1.117 ± 0.481 | 1.916 ± 0.575 | 0.639 ± 0.275 | 2.714 ± 0.74 | 1.596 ± 0.389 | 3.512 ± 0.967 | 1.117 ± 0.365 | 2.395 ± 0.369 | 0.958 ± 0.398 | 0.639 ± 0.277 | 1.756 ± 0.31 | 3.033 ± 0.474 | 2.714 ± 0.685 | 2.075 ± 0.432 | 0.16 ± 0.163 | 0.958 ± 0.331 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 10 proteins (6265 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
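One plausible reading of "bootstrapping at the protein level" is sketched below in Python: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is taken as the error. This is an assumption-laden illustration, not the database's actual procedure: the function names, the use of the standard deviation as the error measure, and the overlapping-pair counting are all assumptions.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping residue pairs in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))

def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Spread (population std dev) of the per-mille frequency of `dipeptide`
    over `n_boot` resamples of whole proteins drawn with replacement."""
    if not proteins:
        return 0.0
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample proteins (not residues), keeping the proteome size fixed.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[dipeptide] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        if total:
            estimates.append(1000.0 * hits / total)
    return statistics.pstdev(estimates) if estimates else 0.0
```

Resampling whole proteins rather than individual residue pairs keeps the pairs from one protein together, so the estimate reflects variation between proteins. With only 10 proteins in this proteome, such an error estimate should be read as rough.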

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski