Amino acid dipeptide frequency for Bluetongue virus 3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each pair of adjacent amino acids occurs.
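As a rough illustration of what such a calculation involves, the sketch below counts overlapping adjacent pairs within each protein and reports them in per mille. The function name dipeptide_frequencies and the toy sequences are hypothetical, and dividing by the total number of counted pairs is an assumption; the exact normalisation used by the database is not stated on this page.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumptions (not taken from the page): pairs overlap, pairs never span
    two proteins, and the denominator is the total number of counted pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy one-letter sequences, purely illustrative (not Bluetongue virus data).
toy = ["MKKLA", "AKK"]
freqs = dipeptide_frequencies(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

The sketch works on one-letter codes; mapping to the three-letter names used in the table (for example K to Lys) would be a separate lookup step.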

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each one would have a frequency of 0.25% (1/400). This is not the case in nature, so for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
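The arithmetic behind the uniform expectation and the per mille convention can be checked with a few lines. This is only a sketch: the helper names are hypothetical, and the simple "above or below the uniform expectation" rule is an assumption, since the page does not state the exact criterion behind the red and blue marking.

```python
def expected_uniform_permille(alphabet_size=20):
    """Frequency of each dipeptide, in per mille, if all pairs were equally likely."""
    return 1000.0 / (alphabet_size ** 2)


def classify(value_permille, expected_permille):
    """Assumed rule: compare the observed value with the uniform expectation."""
    if value_permille > expected_permille:
        return "over"
    if value_permille < expected_permille:
        return "under"
    return "as expected"


expected_20 = expected_uniform_permille(20)  # 2.5 per mille, i.e. 0.25 %
expected_21 = expected_uniform_permille(21)  # with Xaa as a 21st residue

# Two values taken from the table, in per mille.
glu_glu = 8.521
trp_tyr = 0.203

# Per mille to plain fraction: multiply by 10^-3.
glu_glu_fraction = glu_glu * 1e-3

print(expected_20, round(expected_21, 3))
print("GluGlu:", classify(glu_glu, expected_20), glu_glu_fraction)
print("TrpTyr:", classify(trp_tyr, expected_20))
```

Under this assumed rule GluGlu counts as overrepresented and TrpTyr as underrepresented relative to 2.5‰.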

For more information, see the sequence space article on Wikipedia.

Rows: first residue of the dipeptide. Columns: second residue. Values: frequency in ‰ ± estimated error.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 6.289 ± 1.316 | 0.812 ± 0.275 | 3.855 ± 0.812 | 5.275 ± 1.121 | 2.029 ± 0.916 | 2.637 ± 0.897 | 0.812 ± 0.353 | 4.666 ± 1.199 | 5.275 ± 1.357 | 6.289 ± 1.087 | 1.623 ± 0.621 | 1.826 ± 0.527 | 3.652 ± 0.659 | 2.84 ± 0.491 | 4.463 ± 1.251 | 2.637 ± 0.266 | 4.869 ± 0.951 | 3.855 ± 1.114 | 1.014 ± 0.404 | 3.652 ± 0.522 | 0.0 ± 0.0
Cys | 0.406 ± 0.24 | 0.203 ± 0.151 | 0.609 ± 0.453 | 0.406 ± 0.276 | 0.609 ± 0.41 | 2.029 ± 0.537 | 0.406 ± 0.23 | 0.609 ± 0.249 | 0.812 ± 0.33 | 0.812 ± 0.329 | 0.0 ± 0.0 | 0.406 ± 0.212 | 0.406 ± 0.23 | 0.406 ± 0.276 | 0.406 ± 0.216 | 0.609 ± 0.392 | 0.406 ± 0.288 | 0.812 ± 0.251 | 0.203 ± 0.189 | 1.623 ± 0.76 | 0.0 ± 0.0
Asp | 4.666 ± 0.924 | 0.406 ± 0.2 | 5.275 ± 0.998 | 7.709 ± 2.364 | 2.029 ± 0.634 | 2.637 ± 0.289 | 1.014 ± 0.501 | 3.855 ± 0.719 | 1.826 ± 0.643 | 5.884 ± 1.335 | 2.84 ± 0.659 | 0.812 ± 0.604 | 3.246 ± 0.55 | 0.609 ± 0.237 | 4.869 ± 1.035 | 3.246 ± 0.528 | 1.623 ± 0.597 | 8.115 ± 0.845 | 1.014 ± 0.376 | 1.42 ± 0.538 | 0.0 ± 0.0
Glu | 5.478 ± 1.21 | 0.203 ± 0.151 | 3.855 ± 0.727 | 8.521 ± 1.343 | 2.029 ± 0.322 | 3.652 ± 0.938 | 1.014 ± 0.283 | 4.666 ± 1.568 | 4.666 ± 1.139 | 6.695 ± 0.791 | 2.029 ± 0.369 | 1.623 ± 0.388 | 2.84 ± 0.914 | 2.232 ± 0.778 | 7.101 ± 1.34 | 4.869 ± 1.272 | 2.029 ± 0.637 | 5.072 ± 0.823 | 1.42 ± 0.538 | 2.435 ± 0.422 | 0.0 ± 0.0
Phe | 2.435 ± 0.775 | 0.406 ± 0.26 | 3.043 ± 1.029 | 2.232 ± 0.279 | 2.232 ± 0.742 | 1.826 ± 0.561 | 1.42 ± 0.493 | 2.232 ± 0.769 | 3.043 ± 0.779 | 3.855 ± 0.931 | 1.014 ± 0.376 | 1.623 ± 0.562 | 2.232 ± 0.741 | 1.014 ± 0.308 | 2.232 ± 0.764 | 2.232 ± 0.499 | 1.826 ± 0.633 | 1.42 ± 0.476 | 0.406 ± 0.212 | 1.623 ± 0.324 | 0.0 ± 0.0
Gly | 5.681 ± 0.94 | 0.812 ± 0.411 | 4.26 ± 1.011 | 3.652 ± 0.622 | 2.232 ± 0.676 | 3.246 ± 1.599 | 0.812 ± 0.365 | 3.652 ± 0.747 | 2.84 ± 1.099 | 2.637 ± 0.641 | 1.826 ± 0.642 | 1.623 ± 0.367 | 1.826 ± 0.669 | 1.826 ± 0.772 | 4.666 ± 0.746 | 2.84 ± 1.077 | 2.435 ± 0.984 | 4.869 ± 1.369 | 0.812 ± 0.352 | 2.435 ± 0.622 | 0.0 ± 0.0
His | 1.623 ± 0.369 | 0.812 ± 0.261 | 0.609 ± 0.392 | 0.609 ± 0.399 | 1.217 ± 0.65 | 1.623 ± 0.705 | 0.0 ± 0.0 | 2.029 ± 0.602 | 0.812 ± 0.433 | 2.435 ± 0.493 | 1.217 ± 0.406 | 1.014 ± 0.354 | 1.42 ± 0.403 | 0.609 ± 0.243 | 1.826 ± 0.513 | 1.623 ± 0.473 | 1.217 ± 0.431 | 1.623 ± 0.555 | 0.609 ± 0.286 | 1.014 ± 0.313 | 0.0 ± 0.0
Ile | 5.884 ± 0.738 | 0.203 ± 0.151 | 5.275 ± 1.074 | 4.463 ± 0.738 | 1.826 ± 0.599 | 3.855 ± 0.728 | 2.029 ± 0.655 | 3.449 ± 0.425 | 4.058 ± 0.831 | 5.884 ± 0.899 | 1.826 ± 0.676 | 2.232 ± 1.051 | 2.84 ± 0.714 | 3.246 ± 0.387 | 4.26 ± 1.011 | 4.26 ± 1.083 | 2.84 ± 0.627 | 4.666 ± 0.985 | 0.812 ± 0.359 | 2.435 ± 0.953 | 0.0 ± 0.0
Lys | 3.246 ± 0.784 | 0.609 ± 0.34 | 2.84 ± 0.87 | 5.478 ± 1.733 | 2.232 ± 0.716 | 2.84 ± 0.369 | 1.826 ± 0.624 | 4.058 ± 1.349 | 6.086 ± 1.746 | 5.681 ± 1.074 | 3.246 ± 0.809 | 2.029 ± 0.78 | 1.217 ± 0.632 | 2.232 ± 0.632 | 4.26 ± 0.923 | 2.435 ± 0.904 | 3.652 ± 1.023 | 4.666 ± 0.672 | 0.609 ± 0.32 | 1.42 ± 0.419 | 0.0 ± 0.0
Leu | 4.666 ± 0.987 | 1.217 ± 0.365 | 4.666 ± 0.873 | 5.681 ± 1.188 | 3.043 ± 1.048 | 4.26 ± 1.454 | 2.029 ± 0.613 | 5.275 ± 1.037 | 6.695 ± 1.175 | 6.695 ± 1.239 | 2.029 ± 0.694 | 4.869 ± 0.964 | 5.275 ± 0.934 | 3.246 ± 0.937 | 7.709 ± 1.137 | 5.681 ± 0.589 | 4.26 ± 0.645 | 3.855 ± 0.643 | 0.812 ± 0.345 | 2.84 ± 0.974 | 0.0 ± 0.0
Met | 1.826 ± 0.408 | 0.203 ± 0.192 | 2.232 ± 0.571 | 1.217 ± 0.472 | 1.826 ± 0.641 | 1.623 ± 0.787 | 1.217 ± 0.611 | 3.043 ± 0.762 | 2.232 ± 0.49 | 3.449 ± 1.03 | 2.232 ± 0.771 | 1.42 ± 0.483 | 1.42 ± 0.718 | 0.812 ± 0.261 | 3.246 ± 0.638 | 2.435 ± 0.708 | 1.826 ± 0.822 | 2.637 ± 0.499 | 0.406 ± 0.241 | 1.42 ± 0.396 | 0.0 ± 0.0
Asn | 2.637 ± 1.033 | 0.406 ± 0.383 | 3.855 ± 0.45 | 3.449 ± 0.712 | 2.029 ± 0.594 | 2.84 ± 0.526 | 1.217 ± 0.76 | 1.826 ± 0.425 | 0.609 ± 0.31 | 2.84 ± 0.63 | 1.217 ± 0.461 | 0.609 ± 0.242 | 3.043 ± 0.543 | 1.217 ± 0.442 | 1.623 ± 0.427 | 2.029 ± 0.322 | 1.623 ± 0.438 | 2.84 ± 0.721 | 0.609 ± 0.285 | 1.014 ± 0.348 | 0.0 ± 0.0
Pro | 2.232 ± 0.492 | 0.812 ± 0.473 | 3.652 ± 0.69 | 2.232 ± 0.539 | 1.217 ± 0.522 | 2.637 ± 0.94 | 0.812 ± 0.501 | 3.043 ± 0.488 | 1.826 ± 0.577 | 3.449 ± 0.564 | 1.42 ± 0.457 | 1.826 ± 0.469 | 1.42 ± 0.538 | 2.232 ± 0.491 | 3.043 ± 0.678 | 1.826 ± 0.609 | 3.652 ± 1.107 | 1.826 ± 0.549 | 0.406 ± 0.273 | 2.232 ± 0.784 | 0.0 ± 0.0
Gln | 1.623 ± 0.469 | 0.406 ± 0.276 | 1.826 ± 0.607 | 1.826 ± 0.893 | 1.623 ± 0.44 | 2.435 ± 0.601 | 0.609 ± 0.274 | 4.869 ± 1.268 | 2.435 ± 0.997 | 2.84 ± 0.748 | 1.623 ± 0.48 | 1.42 ± 0.526 | 1.623 ± 0.405 | 1.826 ± 0.626 | 2.84 ± 0.826 | 1.826 ± 0.428 | 2.029 ± 0.932 | 2.232 ± 0.726 | 0.609 ± 0.284 | 1.217 ± 0.395 | 0.0 ± 0.0
Arg | 4.26 ± 1.391 | 0.812 ± 0.388 | 5.275 ± 0.522 | 4.666 ± 1.216 | 4.869 ± 1.064 | 3.043 ± 0.605 | 1.014 ± 0.382 | 5.072 ± 0.944 | 3.855 ± 0.766 | 4.463 ± 0.776 | 3.043 ± 0.308 | 3.652 ± 0.635 | 1.42 ± 0.334 | 2.637 ± 0.411 | 4.666 ± 0.42 | 4.058 ± 0.493 | 3.652 ± 0.935 | 6.086 ± 0.862 | 2.029 ± 0.692 | 2.029 ± 0.61 | 0.0 ± 0.0
Ser | 4.26 ± 0.745 | 0.406 ± 0.302 | 2.435 ± 0.734 | 3.449 ± 0.633 | 1.623 ± 0.44 | 3.855 ± 1.207 | 2.029 ± 0.534 | 3.855 ± 0.86 | 3.246 ± 0.658 | 5.275 ± 1.156 | 1.826 ± 0.414 | 2.435 ± 0.695 | 2.435 ± 0.538 | 2.637 ± 0.441 | 3.043 ± 0.575 | 3.652 ± 1.052 | 2.637 ± 0.637 | 4.869 ± 0.588 | 0.812 ± 0.249 | 2.637 ± 0.655 | 0.0 ± 0.0
Thr | 2.84 ± 0.809 | 1.42 ± 0.765 | 1.623 ± 0.549 | 2.84 ± 0.961 | 1.217 ± 0.482 | 3.652 ± 1.252 | 1.623 ± 0.512 | 3.449 ± 0.633 | 3.246 ± 0.949 | 5.275 ± 1.048 | 1.826 ± 0.69 | 2.435 ± 0.512 | 1.826 ± 0.453 | 2.232 ± 0.846 | 2.029 ± 0.325 | 3.043 ± 0.813 | 3.652 ± 0.745 | 2.435 ± 0.535 | 0.406 ± 0.394 | 2.637 ± 0.721 | 0.0 ± 0.0
Val | 4.26 ± 0.944 | 1.014 ± 0.475 | 3.449 ± 0.777 | 4.058 ± 0.564 | 2.637 ± 0.593 | 4.058 ± 0.588 | 2.232 ± 0.525 | 4.058 ± 1.223 | 3.652 ± 0.691 | 6.289 ± 1.428 | 3.855 ± 0.827 | 2.637 ± 0.677 | 2.637 ± 0.656 | 4.463 ± 0.808 | 5.884 ± 0.867 | 4.869 ± 1.0 | 2.637 ± 0.884 | 3.855 ± 0.878 | 1.014 ± 0.35 | 2.84 ± 0.716 | 0.0 ± 0.0
Trp | 0.609 ± 0.335 | 0.0 ± 0.0 | 1.014 ± 0.283 | 1.217 ± 0.45 | 0.812 ± 0.329 | 0.203 ± 0.197 | 0.812 ± 0.462 | 1.217 ± 0.442 | 1.217 ± 0.425 | 1.42 ± 0.553 | 0.609 ± 0.285 | 1.217 ± 0.515 | 0.0 ± 0.0 | 0.609 ± 0.41 | 0.406 ± 0.27 | 0.812 ± 0.452 | 0.812 ± 0.334 | 1.014 ± 0.517 | 0.406 ± 0.216 | 0.203 ± 0.151 | 0.0 ± 0.0
Tyr | 3.449 ± 1.054 | 1.014 ± 0.713 | 3.449 ± 0.928 | 3.449 ± 0.967 | 1.014 ± 0.332 | 2.029 ± 0.305 | 1.014 ± 0.419 | 1.42 ± 0.478 | 2.232 ± 0.524 | 2.637 ± 0.414 | 1.42 ± 0.445 | 1.826 ± 0.811 | 1.014 ± 0.354 | 0.812 ± 0.441 | 2.029 ± 0.309 | 2.435 ± 0.571 | 2.029 ± 0.729 | 3.652 ± 1.003 | 0.203 ± 0.151 | 1.217 ± 0.531 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 10 proteins (4930 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
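To make the idea of protein-level bootstrapping concrete, here is a minimal sketch: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values is reported. The function name bootstrap_error, the toy sequences, the fixed seed, and the use of the sample standard deviation as the "error" are all assumptions; the page does not specify these details.

```python
import random
import statistics
from collections import Counter


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of one dipeptide's per mille frequency over protein-level resamples.

    Assumption: the reported error is the standard deviation across replicates.
    """
    rng = random.Random(seed)
    # Count dipeptides once per protein so that resampling is cheap.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1)) for seq in proteins
    ]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Toy one-letter sequences, purely illustrative.
toy = ["MKKLA", "AKK", "GGGG"]
err_kk = bootstrap_error(toy, "KK")
err_absent = bootstrap_error(toy, "WW")
print(err_kk, err_absent)
```

A dipeptide that never occurs has zero frequency in every resample, so its estimated error is zero, which is consistent with the 0.0 ± 0.0 entries in the table.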

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski