Amino acid dipepetide frequency for Rotavirus H

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.392AlaAla: 4.392 ± 0.986
0.176AlaCys: 0.176 ± 0.186
4.568AlaAsp: 4.568 ± 0.826
4.919AlaGlu: 4.919 ± 1.053
2.811AlaPhe: 2.811 ± 0.954
2.108AlaGly: 2.108 ± 0.698
0.878AlaHis: 0.878 ± 0.282
4.919AlaIle: 4.919 ± 0.574
4.919AlaLys: 4.919 ± 1.086
5.973AlaLeu: 5.973 ± 1.033
1.405AlaMet: 1.405 ± 0.321
5.622AlaAsn: 5.622 ± 1.035
1.581AlaPro: 1.581 ± 0.628
2.987AlaGln: 2.987 ± 0.608
3.865AlaArg: 3.865 ± 0.814
3.865AlaSer: 3.865 ± 0.724
5.271AlaThr: 5.271 ± 0.822
3.162AlaVal: 3.162 ± 1.161
0.703AlaTrp: 0.703 ± 0.274
1.757AlaTyr: 1.757 ± 0.634
0.351AlaXaa: 0.351 ± 0.243
Cys
0.527CysAla: 0.527 ± 0.302
0.176CysCys: 0.176 ± 0.199
0.703CysAsp: 0.703 ± 0.369
1.23CysGlu: 1.23 ± 0.422
0.351CysPhe: 0.351 ± 0.177
1.054CysGly: 1.054 ± 0.843
0.176CysHis: 0.176 ± 0.155
0.878CysIle: 0.878 ± 0.369
0.703CysLys: 0.703 ± 0.203
0.878CysLeu: 0.878 ± 0.326
0.351CysMet: 0.351 ± 0.397
1.23CysAsn: 1.23 ± 0.549
0.0CysPro: 0.0 ± 0.0
0.703CysGln: 0.703 ± 0.465
0.527CysArg: 0.527 ± 0.322
1.054CysSer: 1.054 ± 0.608
0.703CysThr: 0.703 ± 0.4
0.703CysVal: 0.703 ± 0.362
0.0CysTrp: 0.0 ± 0.0
1.054CysTyr: 1.054 ± 0.41
0.0CysXaa: 0.0 ± 0.0
Asp
3.514AspAla: 3.514 ± 0.775
0.351AspCys: 0.351 ± 0.284
4.041AspAsp: 4.041 ± 0.599
3.865AspGlu: 3.865 ± 0.838
2.284AspPhe: 2.284 ± 0.382
3.162AspGly: 3.162 ± 0.546
0.703AspHis: 0.703 ± 0.488
5.271AspIle: 5.271 ± 1.013
4.216AspLys: 4.216 ± 0.935
6.5AspLeu: 6.5 ± 0.831
1.405AspMet: 1.405 ± 0.542
2.811AspAsn: 2.811 ± 0.586
2.46AspPro: 2.46 ± 0.524
1.933AspGln: 1.933 ± 0.644
3.514AspArg: 3.514 ± 0.936
5.095AspSer: 5.095 ± 1.372
2.46AspThr: 2.46 ± 0.83
4.041AspVal: 4.041 ± 0.708
1.054AspTrp: 1.054 ± 0.451
1.405AspTyr: 1.405 ± 0.734
0.351AspXaa: 0.351 ± 0.272
Glu
4.041GluAla: 4.041 ± 1.064
0.176GluCys: 0.176 ± 0.153
3.338GluAsp: 3.338 ± 0.8
2.635GluGlu: 2.635 ± 0.877
3.162GluPhe: 3.162 ± 0.518
2.46GluGly: 2.46 ± 0.76
0.703GluHis: 0.703 ± 0.51
5.622GluIle: 5.622 ± 1.055
5.095GluLys: 5.095 ± 0.843
5.446GluLeu: 5.446 ± 0.715
1.933GluMet: 1.933 ± 0.846
3.338GluAsn: 3.338 ± 0.902
2.108GluPro: 2.108 ± 0.609
2.635GluGln: 2.635 ± 0.713
3.162GluArg: 3.162 ± 0.656
4.743GluSer: 4.743 ± 1.034
3.514GluThr: 3.514 ± 0.757
2.635GluVal: 2.635 ± 0.262
0.703GluTrp: 0.703 ± 0.274
3.162GluTyr: 3.162 ± 0.626
0.0GluXaa: 0.0 ± 0.0
Phe
3.162PheAla: 3.162 ± 0.697
1.405PheCys: 1.405 ± 0.559
2.987PheAsp: 2.987 ± 1.054
2.811PheGlu: 2.811 ± 0.556
1.054PhePhe: 1.054 ± 0.4
2.46PheGly: 2.46 ± 0.564
0.878PheHis: 0.878 ± 0.448
2.811PheIle: 2.811 ± 0.569
3.162PheLys: 3.162 ± 0.748
2.811PheLeu: 2.811 ± 0.7
0.527PheMet: 0.527 ± 0.459
3.689PheAsn: 3.689 ± 0.745
1.757PhePro: 1.757 ± 0.698
1.405PheGln: 1.405 ± 0.374
2.635PheArg: 2.635 ± 0.875
2.108PheSer: 2.108 ± 0.5
2.635PheThr: 2.635 ± 0.577
3.514PheVal: 3.514 ± 0.758
0.527PheTrp: 0.527 ± 0.252
1.581PheTyr: 1.581 ± 0.478
0.0PheXaa: 0.0 ± 0.0
Gly
2.46GlyAla: 2.46 ± 0.608
0.0GlyCys: 0.0 ± 0.0
1.757GlyAsp: 1.757 ± 0.325
2.46GlyGlu: 2.46 ± 0.393
2.284GlyPhe: 2.284 ± 0.824
2.284GlyGly: 2.284 ± 0.745
1.581GlyHis: 1.581 ± 0.501
3.514GlyIle: 3.514 ± 0.416
2.284GlyLys: 2.284 ± 0.517
2.811GlyLeu: 2.811 ± 0.835
0.527GlyMet: 0.527 ± 0.224
1.581GlyAsn: 1.581 ± 0.525
1.054GlyPro: 1.054 ± 0.414
1.405GlyGln: 1.405 ± 0.505
2.987GlyArg: 2.987 ± 0.721
2.46GlySer: 2.46 ± 0.53
2.987GlyThr: 2.987 ± 0.73
2.284GlyVal: 2.284 ± 0.459
0.351GlyTrp: 0.351 ± 0.264
1.405GlyTyr: 1.405 ± 0.298
0.0GlyXaa: 0.0 ± 0.0
His
1.405HisAla: 1.405 ± 0.46
0.0HisCys: 0.0 ± 0.0
0.527HisAsp: 0.527 ± 0.297
1.054HisGlu: 1.054 ± 0.463
0.878HisPhe: 0.878 ± 0.419
1.23HisGly: 1.23 ± 0.516
0.176HisHis: 0.176 ± 0.199
1.405HisIle: 1.405 ± 0.533
0.176HisLys: 0.176 ± 0.146
2.108HisLeu: 2.108 ± 0.512
0.703HisMet: 0.703 ± 0.349
0.878HisAsn: 0.878 ± 0.291
0.878HisPro: 0.878 ± 0.324
0.0HisGln: 0.0 ± 0.0
0.878HisArg: 0.878 ± 0.603
1.23HisSer: 1.23 ± 0.308
1.581HisThr: 1.581 ± 0.61
0.351HisVal: 0.351 ± 0.252
0.176HisTrp: 0.176 ± 0.186
1.23HisTyr: 1.23 ± 0.47
0.176HisXaa: 0.176 ± 0.197
Ile
5.271IleAla: 5.271 ± 0.922
1.757IleCys: 1.757 ± 0.506
4.568IleAsp: 4.568 ± 0.696
5.798IleGlu: 5.798 ± 0.724
3.162IlePhe: 3.162 ± 0.664
3.689IleGly: 3.689 ± 0.79
1.23IleHis: 1.23 ± 0.278
5.622IleIle: 5.622 ± 0.603
5.271IleLys: 5.271 ± 1.289
5.271IleLeu: 5.271 ± 1.002
1.933IleMet: 1.933 ± 0.664
5.622IleAsn: 5.622 ± 0.728
3.865IlePro: 3.865 ± 0.637
3.689IleGln: 3.689 ± 1.097
4.743IleArg: 4.743 ± 0.984
8.96IleSer: 8.96 ± 2.105
3.865IleThr: 3.865 ± 0.7
4.216IleVal: 4.216 ± 0.949
0.527IleTrp: 0.527 ± 0.402
2.284IleTyr: 2.284 ± 0.585
0.0IleXaa: 0.0 ± 0.0
Lys
4.216LysAla: 4.216 ± 0.906
1.405LysCys: 1.405 ± 0.479
3.162LysAsp: 3.162 ± 0.921
4.216LysGlu: 4.216 ± 0.896
2.284LysPhe: 2.284 ± 0.831
1.581LysGly: 1.581 ± 0.269
1.405LysHis: 1.405 ± 0.591
6.852LysIle: 6.852 ± 1.719
5.446LysLys: 5.446 ± 1.036
3.514LysLeu: 3.514 ± 0.715
3.338LysMet: 3.338 ± 0.732
4.041LysAsn: 4.041 ± 0.451
1.933LysPro: 1.933 ± 0.332
3.689LysGln: 3.689 ± 0.739
2.987LysArg: 2.987 ± 1.03
3.514LysSer: 3.514 ± 0.561
5.622LysThr: 5.622 ± 0.919
4.568LysVal: 4.568 ± 0.854
0.176LysTrp: 0.176 ± 0.153
2.811LysTyr: 2.811 ± 0.864
0.878LysXaa: 0.878 ± 0.417
Leu
6.5LeuAla: 6.5 ± 0.692
1.054LeuCys: 1.054 ± 0.498
4.392LeuAsp: 4.392 ± 0.793
4.392LeuGlu: 4.392 ± 0.75
3.338LeuPhe: 3.338 ± 0.517
2.635LeuGly: 2.635 ± 0.505
1.757LeuHis: 1.757 ± 0.312
5.798LeuIle: 5.798 ± 0.816
4.919LeuLys: 4.919 ± 0.917
6.676LeuLeu: 6.676 ± 0.891
1.757LeuMet: 1.757 ± 0.415
4.568LeuAsn: 4.568 ± 0.834
3.162LeuPro: 3.162 ± 0.665
4.743LeuGln: 4.743 ± 0.695
4.216LeuArg: 4.216 ± 0.851
7.027LeuSer: 7.027 ± 0.624
5.798LeuThr: 5.798 ± 0.805
5.446LeuVal: 5.446 ± 0.637
0.351LeuTrp: 0.351 ± 0.207
2.635LeuTyr: 2.635 ± 0.727
0.0LeuXaa: 0.0 ± 0.0
Met
2.284MetAla: 2.284 ± 0.644
0.703MetCys: 0.703 ± 0.451
1.933MetAsp: 1.933 ± 0.675
1.405MetGlu: 1.405 ± 0.475
1.23MetPhe: 1.23 ± 0.517
0.703MetGly: 0.703 ± 0.176
0.176MetHis: 0.176 ± 0.153
2.987MetIle: 2.987 ± 0.611
2.108MetLys: 2.108 ± 0.52
3.162MetLeu: 3.162 ± 0.48
1.405MetMet: 1.405 ± 0.423
1.933MetAsn: 1.933 ± 0.47
1.581MetPro: 1.581 ± 0.501
0.527MetGln: 0.527 ± 0.234
1.405MetArg: 1.405 ± 0.584
2.635MetSer: 2.635 ± 0.587
0.878MetThr: 0.878 ± 0.376
1.581MetVal: 1.581 ± 0.5
0.0MetTrp: 0.0 ± 0.0
1.405MetTyr: 1.405 ± 0.478
0.0MetXaa: 0.0 ± 0.0
Asn
4.216AsnAla: 4.216 ± 0.955
1.054AsnCys: 1.054 ± 0.602
5.095AsnAsp: 5.095 ± 0.917
3.689AsnGlu: 3.689 ± 0.886
2.635AsnPhe: 2.635 ± 0.633
2.46AsnGly: 2.46 ± 0.523
1.405AsnHis: 1.405 ± 0.494
4.743AsnIle: 4.743 ± 0.93
3.338AsnLys: 3.338 ± 0.698
5.271AsnLeu: 5.271 ± 0.882
1.933AsnMet: 1.933 ± 0.433
4.216AsnAsn: 4.216 ± 1.016
3.514AsnPro: 3.514 ± 1.049
1.757AsnGln: 1.757 ± 0.541
3.338AsnArg: 3.338 ± 0.433
6.325AsnSer: 6.325 ± 0.845
3.689AsnThr: 3.689 ± 0.866
4.743AsnVal: 4.743 ± 0.627
0.527AsnTrp: 0.527 ± 0.272
2.46AsnTyr: 2.46 ± 0.622
0.0AsnXaa: 0.0 ± 0.0
Pro
3.162ProAla: 3.162 ± 0.791
0.351ProCys: 0.351 ± 0.27
1.405ProAsp: 1.405 ± 0.399
2.108ProGlu: 2.108 ± 0.275
1.405ProPhe: 1.405 ± 0.449
1.581ProGly: 1.581 ± 0.629
0.703ProHis: 0.703 ± 0.354
2.987ProIle: 2.987 ± 0.712
2.108ProLys: 2.108 ± 0.714
2.635ProLeu: 2.635 ± 0.944
1.581ProMet: 1.581 ± 0.405
3.514ProAsn: 3.514 ± 0.57
1.405ProPro: 1.405 ± 0.491
2.46ProGln: 2.46 ± 0.375
0.703ProArg: 0.703 ± 0.316
2.284ProSer: 2.284 ± 0.485
3.338ProThr: 3.338 ± 0.738
3.162ProVal: 3.162 ± 0.667
0.527ProTrp: 0.527 ± 0.193
2.108ProTyr: 2.108 ± 0.522
0.0ProXaa: 0.0 ± 0.0
Gln
1.054GlnAla: 1.054 ± 0.397
0.703GlnCys: 0.703 ± 0.36
1.581GlnAsp: 1.581 ± 0.491
2.46GlnGlu: 2.46 ± 0.574
1.581GlnPhe: 1.581 ± 0.543
1.23GlnGly: 1.23 ± 0.39
1.054GlnHis: 1.054 ± 0.358
4.392GlnIle: 4.392 ± 0.486
3.338GlnLys: 3.338 ± 1.197
5.271GlnLeu: 5.271 ± 0.662
0.527GlnMet: 0.527 ± 0.321
3.162GlnAsn: 3.162 ± 0.837
1.405GlnPro: 1.405 ± 0.494
1.933GlnGln: 1.933 ± 0.689
2.284GlnArg: 2.284 ± 0.383
2.284GlnSer: 2.284 ± 0.694
1.581GlnThr: 1.581 ± 0.394
2.46GlnVal: 2.46 ± 0.597
0.351GlnTrp: 0.351 ± 0.192
1.581GlnTyr: 1.581 ± 0.632
0.0GlnXaa: 0.0 ± 0.0
Arg
3.162ArgAla: 3.162 ± 0.891
1.054ArgCys: 1.054 ± 0.674
4.041ArgAsp: 4.041 ± 0.95
3.338ArgGlu: 3.338 ± 0.652
2.635ArgPhe: 2.635 ± 0.362
1.054ArgGly: 1.054 ± 0.415
0.176ArgHis: 0.176 ± 0.149
4.568ArgIle: 4.568 ± 0.699
3.338ArgLys: 3.338 ± 0.68
3.338ArgLeu: 3.338 ± 0.845
3.162ArgMet: 3.162 ± 0.667
3.338ArgAsn: 3.338 ± 0.575
1.757ArgPro: 1.757 ± 0.484
2.46ArgGln: 2.46 ± 0.522
2.811ArgArg: 2.811 ± 1.218
2.635ArgSer: 2.635 ± 0.42
4.216ArgThr: 4.216 ± 0.56
2.987ArgVal: 2.987 ± 0.663
0.527ArgTrp: 0.527 ± 0.254
2.284ArgTyr: 2.284 ± 0.449
0.0ArgXaa: 0.0 ± 0.0
Ser
6.676SerAla: 6.676 ± 1.081
0.527SerCys: 0.527 ± 0.272
4.743SerAsp: 4.743 ± 0.86
3.865SerGlu: 3.865 ± 0.866
4.568SerPhe: 4.568 ± 0.452
3.338SerGly: 3.338 ± 0.882
0.527SerHis: 0.527 ± 0.296
6.149SerIle: 6.149 ± 1.192
6.325SerLys: 6.325 ± 0.684
5.798SerLeu: 5.798 ± 1.063
1.581SerMet: 1.581 ± 0.304
4.216SerAsn: 4.216 ± 0.703
2.284SerPro: 2.284 ± 0.464
1.933SerGln: 1.933 ± 0.645
3.162SerArg: 3.162 ± 0.568
3.338SerSer: 3.338 ± 0.911
3.865SerThr: 3.865 ± 0.729
4.392SerVal: 4.392 ± 0.809
0.703SerTrp: 0.703 ± 0.325
2.108SerTyr: 2.108 ± 0.382
0.0SerXaa: 0.0 ± 0.0
Thr
4.041ThrAla: 4.041 ± 0.821
0.878ThrCys: 0.878 ± 0.288
4.568ThrAsp: 4.568 ± 0.579
2.635ThrGlu: 2.635 ± 0.696
2.987ThrPhe: 2.987 ± 0.548
2.108ThrGly: 2.108 ± 0.885
1.405ThrHis: 1.405 ± 0.418
5.622ThrIle: 5.622 ± 1.049
3.689ThrLys: 3.689 ± 0.525
5.798ThrLeu: 5.798 ± 1.154
1.933ThrMet: 1.933 ± 0.414
4.041ThrAsn: 4.041 ± 0.743
3.865ThrPro: 3.865 ± 0.651
2.108ThrGln: 2.108 ± 0.469
2.811ThrArg: 2.811 ± 0.774
3.865ThrSer: 3.865 ± 1.004
5.798ThrThr: 5.798 ± 0.777
4.216ThrVal: 4.216 ± 1.253
0.176ThrTrp: 0.176 ± 0.197
2.46ThrTyr: 2.46 ± 0.665
0.0ThrXaa: 0.0 ± 0.0
Val
3.689ValAla: 3.689 ± 0.847
0.703ValCys: 0.703 ± 0.327
3.689ValAsp: 3.689 ± 0.828
4.041ValGlu: 4.041 ± 0.931
3.338ValPhe: 3.338 ± 0.821
1.405ValGly: 1.405 ± 0.563
0.703ValHis: 0.703 ± 0.586
4.392ValIle: 4.392 ± 1.203
3.865ValLys: 3.865 ± 0.653
4.392ValLeu: 4.392 ± 0.782
2.635ValMet: 2.635 ± 0.506
4.568ValAsn: 4.568 ± 0.631
3.865ValPro: 3.865 ± 0.763
2.46ValGln: 2.46 ± 0.596
4.392ValArg: 4.392 ± 0.551
2.635ValSer: 2.635 ± 0.61
4.041ValThr: 4.041 ± 0.834
2.46ValVal: 2.46 ± 0.81
0.0ValTrp: 0.0 ± 0.0
1.933ValTyr: 1.933 ± 0.381
0.0ValXaa: 0.0 ± 0.0
Trp
0.527TrpAla: 0.527 ± 0.343
0.0TrpCys: 0.0 ± 0.0
0.351TrpAsp: 0.351 ± 0.209
0.351TrpGlu: 0.351 ± 0.284
0.176TrpPhe: 0.176 ± 0.146
0.0TrpGly: 0.0 ± 0.0
0.176TrpHis: 0.176 ± 0.146
0.527TrpIle: 0.527 ± 0.327
1.23TrpLys: 1.23 ± 0.282
0.878TrpLeu: 0.878 ± 0.319
0.176TrpMet: 0.176 ± 0.149
0.351TrpAsn: 0.351 ± 0.21
0.176TrpPro: 0.176 ± 0.186
0.176TrpGln: 0.176 ± 0.153
0.878TrpArg: 0.878 ± 0.265
0.703TrpSer: 0.703 ± 0.28
0.351TrpThr: 0.351 ± 0.222
0.176TrpVal: 0.176 ± 0.199
0.0TrpTrp: 0.0 ± 0.0
0.176TrpTyr: 0.176 ± 0.197
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.933TyrAla: 1.933 ± 0.507
0.527TyrCys: 0.527 ± 0.319
2.811TyrAsp: 2.811 ± 0.577
3.338TyrGlu: 3.338 ± 0.836
1.757TyrPhe: 1.757 ± 0.277
1.757TyrGly: 1.757 ± 0.439
1.23TyrHis: 1.23 ± 0.427
2.284TyrIle: 2.284 ± 0.685
1.933TyrLys: 1.933 ± 0.57
2.284TyrLeu: 2.284 ± 0.673
1.054TyrMet: 1.054 ± 0.419
3.689TyrAsn: 3.689 ± 0.818
0.878TyrPro: 0.878 ± 0.338
1.23TyrGln: 1.23 ± 0.367
1.405TyrArg: 1.405 ± 0.489
3.338TyrSer: 3.338 ± 0.513
2.46TyrThr: 2.46 ± 0.635
1.933TyrVal: 1.933 ± 0.327
0.0TyrTrp: 0.0 ± 0.0
1.405TyrTyr: 1.405 ± 0.402
0.176TyrXaa: 0.176 ± 0.149
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.176XaaAsp: 0.176 ± 0.19
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.176XaaGly: 0.176 ± 0.163
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.176XaaLys: 0.176 ± 0.197
0.351XaaLeu: 0.351 ± 0.259
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.176XaaGln: 0.176 ± 0.155
0.176XaaArg: 0.176 ± 0.155
0.0XaaSer: 0.0 ± 0.0
0.351XaaThr: 0.351 ± 0.244
0.176XaaVal: 0.176 ± 0.199
0.0XaaTrp: 0.0 ± 0.0
0.176XaaTyr: 0.176 ± 0.153
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 11 proteins (5693 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski