Amino acid dipepetide frequency for Bluetongue virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.02AlaAla: 6.02 ± 1.266
0.814AlaCys: 0.814 ± 0.291
3.417AlaAsp: 3.417 ± 0.581
5.369AlaGlu: 5.369 ± 0.948
1.952AlaPhe: 1.952 ± 0.707
2.929AlaGly: 2.929 ± 0.837
0.814AlaHis: 0.814 ± 0.242
4.556AlaIle: 4.556 ± 1.166
3.742AlaLys: 3.742 ± 0.963
6.834AlaLeu: 6.834 ± 1.149
2.441AlaMet: 2.441 ± 0.547
2.929AlaAsn: 2.929 ± 0.388
3.58AlaPro: 3.58 ± 0.665
3.742AlaGln: 3.742 ± 0.765
4.068AlaArg: 4.068 ± 1.129
3.254AlaSer: 3.254 ± 0.68
4.881AlaThr: 4.881 ± 0.814
3.905AlaVal: 3.905 ± 0.753
0.976AlaTrp: 0.976 ± 0.361
3.742AlaTyr: 3.742 ± 0.442
0.0AlaXaa: 0.0 ± 0.0
Cys
0.325CysAla: 0.325 ± 0.257
0.325CysCys: 0.325 ± 0.249
0.488CysAsp: 0.488 ± 0.376
0.325CysGlu: 0.325 ± 0.251
0.814CysPhe: 0.814 ± 0.353
1.79CysGly: 1.79 ± 0.422
0.163CysHis: 0.163 ± 0.125
0.651CysIle: 0.651 ± 0.234
0.651CysLys: 0.651 ± 0.324
0.976CysLeu: 0.976 ± 0.265
0.163CysMet: 0.163 ± 0.125
0.488CysAsn: 0.488 ± 0.195
0.325CysPro: 0.325 ± 0.192
0.325CysGln: 0.325 ± 0.251
0.163CysArg: 0.163 ± 0.157
0.488CysSer: 0.488 ± 0.329
0.325CysThr: 0.325 ± 0.182
0.488CysVal: 0.488 ± 0.249
0.163CysTrp: 0.163 ± 0.125
1.139CysTyr: 1.139 ± 0.612
0.0CysXaa: 0.0 ± 0.0
Asp
3.905AspAla: 3.905 ± 0.961
0.488AspCys: 0.488 ± 0.175
5.207AspAsp: 5.207 ± 1.119
7.485AspGlu: 7.485 ± 1.967
2.278AspPhe: 2.278 ± 0.657
2.766AspGly: 2.766 ± 0.413
0.976AspHis: 0.976 ± 0.403
4.556AspIle: 4.556 ± 0.914
2.278AspLys: 2.278 ± 0.838
5.207AspLeu: 5.207 ± 1.265
2.115AspMet: 2.115 ± 0.771
0.488AspAsn: 0.488 ± 0.288
3.417AspPro: 3.417 ± 0.522
0.814AspGln: 0.814 ± 0.361
4.23AspArg: 4.23 ± 0.669
2.603AspSer: 2.603 ± 0.532
2.115AspThr: 2.115 ± 0.68
7.159AspVal: 7.159 ± 0.829
0.976AspTrp: 0.976 ± 0.382
2.115AspTyr: 2.115 ± 0.525
0.0AspXaa: 0.0 ± 0.0
Glu
5.532GluAla: 5.532 ± 0.988
0.0GluCys: 0.0 ± 0.0
4.23GluAsp: 4.23 ± 0.766
7.322GluGlu: 7.322 ± 1.373
3.254GluPhe: 3.254 ± 0.82
4.068GluGly: 4.068 ± 1.381
1.139GluHis: 1.139 ± 0.383
5.695GluIle: 5.695 ± 1.314
4.068GluLys: 4.068 ± 0.744
6.02GluLeu: 6.02 ± 0.719
2.603GluMet: 2.603 ± 0.445
1.464GluAsn: 1.464 ± 0.314
3.091GluPro: 3.091 ± 0.834
2.603GluGln: 2.603 ± 0.513
6.671GluArg: 6.671 ± 1.193
4.881GluSer: 4.881 ± 1.363
2.278GluThr: 2.278 ± 0.47
4.719GluVal: 4.719 ± 0.519
0.814GluTrp: 0.814 ± 0.379
2.603GluTyr: 2.603 ± 0.418
0.0GluXaa: 0.0 ± 0.0
Phe
2.278PheAla: 2.278 ± 0.82
0.325PheCys: 0.325 ± 0.178
3.091PheAsp: 3.091 ± 0.93
2.603PheGlu: 2.603 ± 0.442
1.79PhePhe: 1.79 ± 0.59
2.278PheGly: 2.278 ± 0.597
0.976PheHis: 0.976 ± 0.242
2.766PheIle: 2.766 ± 0.804
2.766PheLys: 2.766 ± 0.6
3.254PheLeu: 3.254 ± 0.836
0.976PheMet: 0.976 ± 0.484
1.79PheAsn: 1.79 ± 0.443
1.627PhePro: 1.627 ± 0.51
1.302PheGln: 1.302 ± 0.372
2.766PheArg: 2.766 ± 0.843
2.115PheSer: 2.115 ± 0.492
2.278PheThr: 2.278 ± 0.586
1.627PheVal: 1.627 ± 0.376
0.325PheTrp: 0.325 ± 0.187
2.278PheTyr: 2.278 ± 0.637
0.0PheXaa: 0.0 ± 0.0
Gly
4.556GlyAla: 4.556 ± 1.09
0.976GlyCys: 0.976 ± 0.299
4.719GlyAsp: 4.719 ± 1.443
4.068GlyGlu: 4.068 ± 0.866
2.603GlyPhe: 2.603 ± 0.459
3.254GlyGly: 3.254 ± 1.462
0.976GlyHis: 0.976 ± 0.342
4.23GlyIle: 4.23 ± 0.636
2.766GlyLys: 2.766 ± 0.582
3.091GlyLeu: 3.091 ± 0.588
1.952GlyMet: 1.952 ± 0.509
1.627GlyAsn: 1.627 ± 0.363
2.115GlyPro: 2.115 ± 0.778
1.952GlyGln: 1.952 ± 0.565
4.393GlyArg: 4.393 ± 0.741
3.091GlySer: 3.091 ± 0.573
2.441GlyThr: 2.441 ± 0.545
4.23GlyVal: 4.23 ± 1.38
0.814GlyTrp: 0.814 ± 0.274
2.441GlyTyr: 2.441 ± 0.613
0.0GlyXaa: 0.0 ± 0.0
His
1.464HisAla: 1.464 ± 0.476
0.488HisCys: 0.488 ± 0.221
0.976HisAsp: 0.976 ± 0.389
0.976HisGlu: 0.976 ± 0.401
0.651HisPhe: 0.651 ± 0.338
1.302HisGly: 1.302 ± 0.589
0.325HisHis: 0.325 ± 0.349
1.464HisIle: 1.464 ± 0.393
1.139HisLys: 1.139 ± 0.374
2.441HisLeu: 2.441 ± 0.526
0.814HisMet: 0.814 ± 0.365
1.627HisAsn: 1.627 ± 0.241
1.464HisPro: 1.464 ± 0.336
0.488HisGln: 0.488 ± 0.195
1.464HisArg: 1.464 ± 0.57
1.302HisSer: 1.302 ± 0.298
1.302HisThr: 1.302 ± 0.342
1.79HisVal: 1.79 ± 0.509
0.488HisTrp: 0.488 ± 0.291
0.651HisTyr: 0.651 ± 0.354
0.0HisXaa: 0.0 ± 0.0
Ile
6.346IleAla: 6.346 ± 0.782
0.325IleCys: 0.325 ± 0.202
4.719IleAsp: 4.719 ± 0.783
4.23IleGlu: 4.23 ± 0.724
2.115IlePhe: 2.115 ± 0.46
3.742IleGly: 3.742 ± 0.542
1.302IleHis: 1.302 ± 0.69
3.58IleIle: 3.58 ± 0.634
4.881IleLys: 4.881 ± 0.62
6.183IleLeu: 6.183 ± 0.833
1.464IleMet: 1.464 ± 0.64
3.58IleAsn: 3.58 ± 0.679
3.417IlePro: 3.417 ± 0.548
3.091IleGln: 3.091 ± 0.427
4.556IleArg: 4.556 ± 1.164
4.393IleSer: 4.393 ± 0.48
3.091IleThr: 3.091 ± 0.59
5.044IleVal: 5.044 ± 0.877
0.651IleTrp: 0.651 ± 0.329
2.603IleTyr: 2.603 ± 0.474
0.0IleXaa: 0.0 ± 0.0
Lys
3.742LysAla: 3.742 ± 0.768
0.325LysCys: 0.325 ± 0.159
2.441LysAsp: 2.441 ± 0.804
5.044LysGlu: 5.044 ± 1.851
2.115LysPhe: 2.115 ± 0.568
2.929LysGly: 2.929 ± 0.548
1.464LysHis: 1.464 ± 0.518
4.881LysIle: 4.881 ± 1.08
5.207LysLys: 5.207 ± 1.3
4.393LysLeu: 4.393 ± 0.657
3.091LysMet: 3.091 ± 0.878
2.115LysAsn: 2.115 ± 0.737
0.976LysPro: 0.976 ± 0.335
2.603LysGln: 2.603 ± 0.681
3.905LysArg: 3.905 ± 0.859
1.952LysSer: 1.952 ± 0.723
3.091LysThr: 3.091 ± 0.607
4.556LysVal: 4.556 ± 0.898
0.488LysTrp: 0.488 ± 0.267
2.278LysTyr: 2.278 ± 0.348
0.0LysXaa: 0.0 ± 0.0
Leu
4.393LeuAla: 4.393 ± 0.682
1.139LeuCys: 1.139 ± 0.356
4.719LeuAsp: 4.719 ± 0.687
4.719LeuGlu: 4.719 ± 0.676
3.091LeuPhe: 3.091 ± 0.888
3.905LeuGly: 3.905 ± 0.974
2.278LeuHis: 2.278 ± 0.447
5.532LeuIle: 5.532 ± 0.892
6.671LeuLys: 6.671 ± 0.942
5.695LeuLeu: 5.695 ± 0.852
2.278LeuMet: 2.278 ± 0.537
4.881LeuAsn: 4.881 ± 0.879
5.044LeuPro: 5.044 ± 0.921
3.254LeuGln: 3.254 ± 0.878
7.485LeuArg: 7.485 ± 0.795
6.183LeuSer: 6.183 ± 0.597
5.532LeuThr: 5.532 ± 0.95
3.58LeuVal: 3.58 ± 0.55
0.814LeuTrp: 0.814 ± 0.302
2.603LeuTyr: 2.603 ± 0.876
0.0LeuXaa: 0.0 ± 0.0
Met
2.441MetAla: 2.441 ± 0.501
0.651MetCys: 0.651 ± 0.233
2.278MetAsp: 2.278 ± 0.542
1.139MetGlu: 1.139 ± 0.381
1.139MetPhe: 1.139 ± 0.379
1.302MetGly: 1.302 ± 0.669
1.302MetHis: 1.302 ± 0.465
3.091MetIle: 3.091 ± 0.468
2.115MetLys: 2.115 ± 0.511
4.068MetLeu: 4.068 ± 0.702
2.278MetMet: 2.278 ± 0.512
1.627MetAsn: 1.627 ± 0.382
1.464MetPro: 1.464 ± 0.634
1.139MetGln: 1.139 ± 0.409
3.254MetArg: 3.254 ± 0.729
2.115MetSer: 2.115 ± 0.669
1.627MetThr: 1.627 ± 0.615
2.441MetVal: 2.441 ± 0.346
0.651MetTrp: 0.651 ± 0.216
0.976MetTyr: 0.976 ± 0.239
0.0MetXaa: 0.0 ± 0.0
Asn
2.603AsnAla: 2.603 ± 1.418
0.488AsnCys: 0.488 ± 0.329
3.091AsnAsp: 3.091 ± 0.576
4.068AsnGlu: 4.068 ± 0.553
1.79AsnPhe: 1.79 ± 0.424
2.766AsnGly: 2.766 ± 0.55
0.814AsnHis: 0.814 ± 0.38
1.952AsnIle: 1.952 ± 0.461
0.976AsnLys: 0.976 ± 0.468
3.091AsnLeu: 3.091 ± 0.513
1.302AsnMet: 1.302 ± 0.389
0.325AsnAsn: 0.325 ± 0.178
2.766AsnPro: 2.766 ± 0.502
1.464AsnGln: 1.464 ± 0.441
2.441AsnArg: 2.441 ± 0.556
2.603AsnSer: 2.603 ± 0.948
1.79AsnThr: 1.79 ± 0.414
3.742AsnVal: 3.742 ± 0.604
0.325AsnTrp: 0.325 ± 0.251
0.814AsnTyr: 0.814 ± 0.237
0.0AsnXaa: 0.0 ± 0.0
Pro
2.115ProAla: 2.115 ± 0.508
0.651ProCys: 0.651 ± 0.373
3.254ProAsp: 3.254 ± 0.886
3.58ProGlu: 3.58 ± 0.85
1.302ProPhe: 1.302 ± 0.406
3.091ProGly: 3.091 ± 0.772
0.976ProHis: 0.976 ± 0.484
3.417ProIle: 3.417 ± 0.533
1.952ProLys: 1.952 ± 0.563
3.905ProLeu: 3.905 ± 0.538
1.302ProMet: 1.302 ± 0.585
1.627ProAsn: 1.627 ± 0.426
1.302ProPro: 1.302 ± 0.459
1.627ProGln: 1.627 ± 0.373
2.603ProArg: 2.603 ± 0.648
1.79ProSer: 1.79 ± 0.512
3.58ProThr: 3.58 ± 1.112
1.627ProVal: 1.627 ± 0.517
0.488ProTrp: 0.488 ± 0.214
2.441ProTyr: 2.441 ± 0.543
0.0ProXaa: 0.0 ± 0.0
Gln
2.278GlnAla: 2.278 ± 0.609
0.325GlnCys: 0.325 ± 0.251
1.302GlnAsp: 1.302 ± 0.427
2.441GlnGlu: 2.441 ± 0.825
0.976GlnPhe: 0.976 ± 0.234
2.278GlnGly: 2.278 ± 0.591
0.814GlnHis: 0.814 ± 0.326
4.393GlnIle: 4.393 ± 1.08
2.278GlnLys: 2.278 ± 0.642
2.603GlnLeu: 2.603 ± 0.745
1.464GlnMet: 1.464 ± 0.514
1.302GlnAsn: 1.302 ± 0.579
2.115GlnPro: 2.115 ± 0.5
1.627GlnGln: 1.627 ± 0.671
3.254GlnArg: 3.254 ± 0.704
2.278GlnSer: 2.278 ± 0.444
1.627GlnThr: 1.627 ± 0.558
2.603GlnVal: 2.603 ± 0.839
0.814GlnTrp: 0.814 ± 0.446
0.976GlnTyr: 0.976 ± 0.296
0.0GlnXaa: 0.0 ± 0.0
Arg
4.881ArgAla: 4.881 ± 1.27
0.651ArgCys: 0.651 ± 0.341
5.695ArgAsp: 5.695 ± 0.382
4.719ArgGlu: 4.719 ± 1.075
4.556ArgPhe: 4.556 ± 0.821
4.068ArgGly: 4.068 ± 0.902
1.627ArgHis: 1.627 ± 0.522
4.719ArgIle: 4.719 ± 0.909
2.766ArgLys: 2.766 ± 0.811
5.532ArgLeu: 5.532 ± 0.695
2.441ArgMet: 2.441 ± 0.5
4.068ArgAsn: 4.068 ± 0.689
1.627ArgPro: 1.627 ± 0.47
2.603ArgGln: 2.603 ± 0.555
5.369ArgArg: 5.369 ± 0.67
3.417ArgSer: 3.417 ± 0.645
2.929ArgThr: 2.929 ± 0.74
6.183ArgVal: 6.183 ± 0.943
1.464ArgTrp: 1.464 ± 0.656
2.115ArgTyr: 2.115 ± 0.448
0.0ArgXaa: 0.0 ± 0.0
Ser
4.068SerAla: 4.068 ± 0.835
0.0SerCys: 0.0 ± 0.0
2.929SerAsp: 2.929 ± 0.635
3.742SerGlu: 3.742 ± 0.806
2.278SerPhe: 2.278 ± 0.494
3.091SerGly: 3.091 ± 0.627
1.627SerHis: 1.627 ± 0.363
3.905SerIle: 3.905 ± 0.626
3.417SerLys: 3.417 ± 0.96
4.719SerLeu: 4.719 ± 0.848
1.79SerMet: 1.79 ± 0.313
1.952SerAsn: 1.952 ± 0.501
1.952SerPro: 1.952 ± 0.497
2.441SerGln: 2.441 ± 0.453
3.417SerArg: 3.417 ± 0.846
4.068SerSer: 4.068 ± 1.115
3.254SerThr: 3.254 ± 0.561
4.068SerVal: 4.068 ± 0.672
0.976SerTrp: 0.976 ± 0.239
2.115SerTyr: 2.115 ± 0.5
0.0SerXaa: 0.0 ± 0.0
Thr
4.068ThrAla: 4.068 ± 1.025
0.814ThrCys: 0.814 ± 0.632
1.627ThrAsp: 1.627 ± 0.59
3.091ThrGlu: 3.091 ± 1.087
1.627ThrPhe: 1.627 ± 0.514
3.417ThrGly: 3.417 ± 0.884
1.952ThrHis: 1.952 ± 0.448
3.254ThrIle: 3.254 ± 0.532
3.742ThrLys: 3.742 ± 0.857
5.532ThrLeu: 5.532 ± 1.133
1.627ThrMet: 1.627 ± 0.471
1.464ThrAsn: 1.464 ± 0.413
1.952ThrPro: 1.952 ± 0.47
2.278ThrGln: 2.278 ± 0.688
3.254ThrArg: 3.254 ± 0.432
2.278ThrSer: 2.278 ± 0.746
2.929ThrThr: 2.929 ± 0.663
3.58ThrVal: 3.58 ± 0.563
0.325ThrTrp: 0.325 ± 0.354
2.441ThrTyr: 2.441 ± 0.93
0.0ThrXaa: 0.0 ± 0.0
Val
4.881ValAla: 4.881 ± 0.874
0.814ValCys: 0.814 ± 0.448
3.091ValAsp: 3.091 ± 0.618
4.556ValGlu: 4.556 ± 0.617
2.929ValPhe: 2.929 ± 0.602
4.393ValGly: 4.393 ± 0.703
1.79ValHis: 1.79 ± 0.462
3.58ValIle: 3.58 ± 0.612
3.254ValLys: 3.254 ± 0.387
6.834ValLeu: 6.834 ± 1.627
4.556ValMet: 4.556 ± 0.639
1.79ValAsn: 1.79 ± 0.468
3.254ValPro: 3.254 ± 0.39
3.417ValGln: 3.417 ± 0.688
5.369ValArg: 5.369 ± 1.106
3.742ValSer: 3.742 ± 0.932
2.929ValThr: 2.929 ± 0.764
3.742ValVal: 3.742 ± 0.781
0.976ValTrp: 0.976 ± 0.265
3.417ValTyr: 3.417 ± 0.583
0.0ValXaa: 0.0 ± 0.0
Trp
0.488TrpAla: 0.488 ± 0.291
0.163TrpCys: 0.163 ± 0.125
0.651TrpAsp: 0.651 ± 0.276
0.814TrpGlu: 0.814 ± 0.377
0.814TrpPhe: 0.814 ± 0.254
0.325TrpGly: 0.325 ± 0.196
0.651TrpHis: 0.651 ± 0.36
0.976TrpIle: 0.976 ± 0.344
1.139TrpLys: 1.139 ± 0.389
0.976TrpLeu: 0.976 ± 0.525
0.488TrpMet: 0.488 ± 0.249
1.302TrpAsn: 1.302 ± 0.389
0.0TrpPro: 0.0 ± 0.0
0.488TrpGln: 0.488 ± 0.365
0.325TrpArg: 0.325 ± 0.222
0.814TrpSer: 0.814 ± 0.378
0.488TrpThr: 0.488 ± 0.302
1.139TrpVal: 1.139 ± 0.366
0.325TrpTrp: 0.325 ± 0.186
0.488TrpTyr: 0.488 ± 0.376
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.58TyrAla: 3.58 ± 1.043
0.814TyrCys: 0.814 ± 0.607
3.254TyrAsp: 3.254 ± 0.872
2.766TyrGlu: 2.766 ± 0.728
1.139TyrPhe: 1.139 ± 0.319
2.603TyrGly: 2.603 ± 0.389
0.651TyrHis: 0.651 ± 0.312
2.115TyrIle: 2.115 ± 0.353
1.79TyrLys: 1.79 ± 0.403
2.441TyrLeu: 2.441 ± 0.523
1.79TyrMet: 1.79 ± 0.379
2.766TyrAsn: 2.766 ± 0.702
1.139TyrPro: 1.139 ± 0.381
0.488TyrGln: 0.488 ± 0.374
2.441TyrArg: 2.441 ± 0.41
2.441TyrSer: 2.441 ± 0.517
2.929TyrThr: 2.929 ± 0.648
3.091TyrVal: 3.091 ± 1.171
0.0TyrTrp: 0.0 ± 0.0
0.814TyrTyr: 0.814 ± 0.385
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (6147 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski