Amino acid dipepetide frequency for Rotavirus B

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.546AlaAla: 3.546 ± 1.13
1.773AlaCys: 1.773 ± 0.433
3.191AlaAsp: 3.191 ± 1.106
5.142AlaGlu: 5.142 ± 0.665
2.837AlaPhe: 2.837 ± 0.58
2.305AlaGly: 2.305 ± 0.517
0.532AlaHis: 0.532 ± 0.211
4.61AlaIle: 4.61 ± 0.862
4.255AlaLys: 4.255 ± 1.393
4.61AlaLeu: 4.61 ± 0.69
1.241AlaMet: 1.241 ± 0.31
2.482AlaAsn: 2.482 ± 0.451
2.128AlaPro: 2.128 ± 0.582
2.837AlaGln: 2.837 ± 0.687
2.837AlaArg: 2.837 ± 0.895
3.191AlaSer: 3.191 ± 0.842
3.901AlaThr: 3.901 ± 1.046
3.369AlaVal: 3.369 ± 0.604
1.064AlaTrp: 1.064 ± 0.395
2.482AlaTyr: 2.482 ± 0.42
0.0AlaXaa: 0.0 ± 0.0
Cys
0.709CysAla: 0.709 ± 0.349
0.532CysCys: 0.532 ± 0.26
0.709CysAsp: 0.709 ± 0.417
1.064CysGlu: 1.064 ± 0.467
0.887CysPhe: 0.887 ± 0.282
1.596CysGly: 1.596 ± 0.712
0.177CysHis: 0.177 ± 0.137
0.709CysIle: 0.709 ± 0.362
0.177CysLys: 0.177 ± 0.153
0.709CysLeu: 0.709 ± 0.292
0.355CysMet: 0.355 ± 0.237
1.064CysAsn: 1.064 ± 0.437
0.532CysPro: 0.532 ± 0.452
0.887CysGln: 0.887 ± 0.26
0.532CysArg: 0.532 ± 0.376
1.241CysSer: 1.241 ± 0.51
0.709CysThr: 0.709 ± 0.536
0.709CysVal: 0.709 ± 0.506
0.0CysTrp: 0.0 ± 0.0
0.532CysTyr: 0.532 ± 0.271
0.0CysXaa: 0.0 ± 0.0
Asp
3.901AspAla: 3.901 ± 0.714
0.355AspCys: 0.355 ± 0.224
3.723AspAsp: 3.723 ± 1.002
4.433AspGlu: 4.433 ± 0.628
2.837AspPhe: 2.837 ± 0.706
2.66AspGly: 2.66 ± 0.785
0.532AspHis: 0.532 ± 0.348
6.383AspIle: 6.383 ± 1.3
2.837AspLys: 2.837 ± 0.562
5.319AspLeu: 5.319 ± 0.897
1.064AspMet: 1.064 ± 0.415
4.255AspAsn: 4.255 ± 0.702
2.305AspPro: 2.305 ± 0.582
3.014AspGln: 3.014 ± 0.607
2.66AspArg: 2.66 ± 0.668
3.901AspSer: 3.901 ± 0.842
3.546AspThr: 3.546 ± 0.564
4.255AspVal: 4.255 ± 0.677
1.064AspTrp: 1.064 ± 0.424
2.66AspTyr: 2.66 ± 0.564
0.0AspXaa: 0.0 ± 0.0
Glu
2.482GluAla: 2.482 ± 0.59
1.064GluCys: 1.064 ± 0.317
4.255GluAsp: 4.255 ± 0.652
4.433GluGlu: 4.433 ± 1.058
3.723GluPhe: 3.723 ± 0.742
2.128GluGly: 2.128 ± 0.767
1.773GluHis: 1.773 ± 0.491
5.142GluIle: 5.142 ± 0.928
4.787GluLys: 4.787 ± 0.804
4.965GluLeu: 4.965 ± 0.96
1.596GluMet: 1.596 ± 0.501
2.305GluAsn: 2.305 ± 0.923
1.241GluPro: 1.241 ± 0.523
1.95GluGln: 1.95 ± 0.564
4.61GluArg: 4.61 ± 0.623
3.723GluSer: 3.723 ± 0.532
2.66GluThr: 2.66 ± 0.892
3.369GluVal: 3.369 ± 0.682
1.064GluTrp: 1.064 ± 0.372
1.596GluTyr: 1.596 ± 0.354
0.0GluXaa: 0.0 ± 0.0
Phe
3.901PheAla: 3.901 ± 0.882
0.709PheCys: 0.709 ± 0.454
2.305PheAsp: 2.305 ± 1.285
4.433PheGlu: 4.433 ± 0.417
1.064PhePhe: 1.064 ± 0.394
2.305PheGly: 2.305 ± 0.765
1.064PheHis: 1.064 ± 0.359
2.837PheIle: 2.837 ± 0.565
2.128PheLys: 2.128 ± 0.639
2.482PheLeu: 2.482 ± 0.84
0.709PheMet: 0.709 ± 0.468
3.546PheAsn: 3.546 ± 1.208
1.241PhePro: 1.241 ± 0.362
2.128PheGln: 2.128 ± 0.686
2.66PheArg: 2.66 ± 0.383
4.433PheSer: 4.433 ± 1.022
3.191PheThr: 3.191 ± 0.738
2.66PheVal: 2.66 ± 0.391
0.177PheTrp: 0.177 ± 0.165
1.596PheTyr: 1.596 ± 0.523
0.0PheXaa: 0.0 ± 0.0
Gly
2.66GlyAla: 2.66 ± 0.422
0.709GlyCys: 0.709 ± 0.423
1.95GlyAsp: 1.95 ± 0.623
1.596GlyGlu: 1.596 ± 0.346
2.305GlyPhe: 2.305 ± 0.855
2.128GlyGly: 2.128 ± 0.418
1.064GlyHis: 1.064 ± 0.377
3.901GlyIle: 3.901 ± 0.507
2.482GlyLys: 2.482 ± 1.116
3.014GlyLeu: 3.014 ± 0.382
1.241GlyMet: 1.241 ± 0.331
3.014GlyAsn: 3.014 ± 0.547
1.064GlyPro: 1.064 ± 0.312
1.241GlyGln: 1.241 ± 0.369
2.305GlyArg: 2.305 ± 0.505
2.482GlySer: 2.482 ± 0.622
2.128GlyThr: 2.128 ± 0.578
2.305GlyVal: 2.305 ± 0.611
0.355GlyTrp: 0.355 ± 0.18
1.596GlyTyr: 1.596 ± 0.361
0.0GlyXaa: 0.0 ± 0.0
His
0.709HisAla: 0.709 ± 0.447
0.177HisCys: 0.177 ± 0.186
0.532HisAsp: 0.532 ± 0.315
1.418HisGlu: 1.418 ± 0.686
0.532HisPhe: 0.532 ± 0.231
0.887HisGly: 0.887 ± 0.383
0.355HisHis: 0.355 ± 0.231
1.241HisIle: 1.241 ± 0.482
0.532HisLys: 0.532 ± 0.252
2.128HisLeu: 2.128 ± 0.768
0.887HisMet: 0.887 ± 0.429
1.773HisAsn: 1.773 ± 0.334
0.532HisPro: 0.532 ± 0.321
0.532HisGln: 0.532 ± 0.331
1.95HisArg: 1.95 ± 0.51
1.596HisSer: 1.596 ± 0.628
1.596HisThr: 1.596 ± 0.54
1.596HisVal: 1.596 ± 0.52
0.0HisTrp: 0.0 ± 0.0
0.532HisTyr: 0.532 ± 0.391
0.0HisXaa: 0.0 ± 0.0
Ile
4.433IleAla: 4.433 ± 0.512
0.709IleCys: 0.709 ± 0.324
6.206IleAsp: 6.206 ± 0.793
4.255IleGlu: 4.255 ± 0.869
2.66IlePhe: 2.66 ± 0.542
1.95IleGly: 1.95 ± 0.594
1.241IleHis: 1.241 ± 0.637
6.028IleIle: 6.028 ± 0.73
5.674IleLys: 5.674 ± 1.267
7.447IleLeu: 7.447 ± 0.851
2.66IleMet: 2.66 ± 0.451
4.61IleAsn: 4.61 ± 0.738
3.901IlePro: 3.901 ± 0.676
3.191IleGln: 3.191 ± 0.977
4.433IleArg: 4.433 ± 0.755
7.092IleSer: 7.092 ± 0.919
4.433IleThr: 4.433 ± 1.304
4.078IleVal: 4.078 ± 0.702
0.177IleTrp: 0.177 ± 0.165
2.66IleTyr: 2.66 ± 0.825
0.0IleXaa: 0.0 ± 0.0
Lys
3.369LysAla: 3.369 ± 0.713
0.532LysCys: 0.532 ± 0.303
4.787LysAsp: 4.787 ± 1.253
3.191LysGlu: 3.191 ± 0.654
2.482LysPhe: 2.482 ± 0.807
2.482LysGly: 2.482 ± 0.57
1.241LysHis: 1.241 ± 0.575
6.383LysIle: 6.383 ± 1.139
3.369LysLys: 3.369 ± 0.729
6.028LysLeu: 6.028 ± 1.203
1.95LysMet: 1.95 ± 0.68
3.723LysAsn: 3.723 ± 0.959
3.546LysPro: 3.546 ± 0.902
3.014LysGln: 3.014 ± 0.6
4.078LysArg: 4.078 ± 0.999
3.901LysSer: 3.901 ± 0.708
5.496LysThr: 5.496 ± 0.827
2.305LysVal: 2.305 ± 0.554
0.355LysTrp: 0.355 ± 0.306
3.369LysTyr: 3.369 ± 0.896
0.0LysXaa: 0.0 ± 0.0
Leu
5.851LeuAla: 5.851 ± 1.182
0.887LeuCys: 0.887 ± 0.393
5.851LeuAsp: 5.851 ± 1.101
4.965LeuGlu: 4.965 ± 1.094
2.837LeuPhe: 2.837 ± 0.639
3.901LeuGly: 3.901 ± 0.853
1.241LeuHis: 1.241 ± 0.532
7.092LeuIle: 7.092 ± 1.128
5.674LeuLys: 5.674 ± 1.137
9.22LeuLeu: 9.22 ± 1.674
1.418LeuMet: 1.418 ± 0.474
6.206LeuAsn: 6.206 ± 0.802
2.305LeuPro: 2.305 ± 0.47
3.723LeuGln: 3.723 ± 0.634
4.965LeuArg: 4.965 ± 0.734
8.511LeuSer: 8.511 ± 0.719
6.56LeuThr: 6.56 ± 0.978
3.901LeuVal: 3.901 ± 0.826
0.355LeuTrp: 0.355 ± 0.157
3.901LeuTyr: 3.901 ± 1.608
0.0LeuXaa: 0.0 ± 0.0
Met
2.305MetAla: 2.305 ± 0.569
0.709MetCys: 0.709 ± 0.302
1.95MetAsp: 1.95 ± 0.776
0.355MetGlu: 0.355 ± 0.292
1.418MetPhe: 1.418 ± 0.423
0.709MetGly: 0.709 ± 0.268
0.709MetHis: 0.709 ± 0.348
1.596MetIle: 1.596 ± 0.496
1.064MetLys: 1.064 ± 0.304
3.014MetLeu: 3.014 ± 0.47
0.355MetMet: 0.355 ± 0.232
1.596MetAsn: 1.596 ± 0.569
1.241MetPro: 1.241 ± 0.415
0.709MetGln: 0.709 ± 0.349
0.887MetArg: 0.887 ± 0.268
2.482MetSer: 2.482 ± 0.654
1.418MetThr: 1.418 ± 0.445
0.709MetVal: 0.709 ± 0.302
0.0MetTrp: 0.0 ± 0.0
0.355MetTyr: 0.355 ± 0.157
0.0MetXaa: 0.0 ± 0.0
Asn
4.078AsnAla: 4.078 ± 0.742
1.241AsnCys: 1.241 ± 0.689
3.191AsnAsp: 3.191 ± 0.553
2.482AsnGlu: 2.482 ± 0.571
3.369AsnPhe: 3.369 ± 0.536
2.128AsnGly: 2.128 ± 0.675
2.128AsnHis: 2.128 ± 0.624
4.965AsnIle: 4.965 ± 1.01
2.305AsnLys: 2.305 ± 0.76
4.787AsnLeu: 4.787 ± 1.0
1.596AsnMet: 1.596 ± 0.461
3.191AsnAsn: 3.191 ± 0.901
2.128AsnPro: 2.128 ± 0.593
1.596AsnGln: 1.596 ± 0.517
4.787AsnArg: 4.787 ± 0.799
5.674AsnSer: 5.674 ± 0.977
4.078AsnThr: 4.078 ± 0.775
6.028AsnVal: 6.028 ± 1.056
0.887AsnTrp: 0.887 ± 0.44
2.66AsnTyr: 2.66 ± 0.824
0.0AsnXaa: 0.0 ± 0.0
Pro
1.95ProAla: 1.95 ± 0.454
0.355ProCys: 0.355 ± 0.253
1.241ProAsp: 1.241 ± 0.549
2.482ProGlu: 2.482 ± 0.407
1.773ProPhe: 1.773 ± 0.697
1.241ProGly: 1.241 ± 0.339
0.709ProHis: 0.709 ± 0.302
2.305ProIle: 2.305 ± 0.608
3.191ProLys: 3.191 ± 0.687
3.369ProLeu: 3.369 ± 0.496
1.241ProMet: 1.241 ± 0.508
2.66ProAsn: 2.66 ± 0.543
0.887ProPro: 0.887 ± 0.384
1.418ProGln: 1.418 ± 0.344
0.355ProArg: 0.355 ± 0.318
2.837ProSer: 2.837 ± 0.687
3.546ProThr: 3.546 ± 0.711
1.95ProVal: 1.95 ± 0.521
0.887ProTrp: 0.887 ± 0.24
2.837ProTyr: 2.837 ± 0.633
0.0ProXaa: 0.0 ± 0.0
Gln
1.418GlnAla: 1.418 ± 0.389
0.532GlnCys: 0.532 ± 0.393
1.064GlnAsp: 1.064 ± 0.534
2.482GlnGlu: 2.482 ± 0.571
1.596GlnPhe: 1.596 ± 0.317
1.241GlnGly: 1.241 ± 0.382
1.596GlnHis: 1.596 ± 0.498
3.369GlnIle: 3.369 ± 0.831
3.546GlnLys: 3.546 ± 0.77
4.61GlnLeu: 4.61 ± 0.977
1.064GlnMet: 1.064 ± 0.377
1.95GlnAsn: 1.95 ± 0.669
2.305GlnPro: 2.305 ± 0.7
1.773GlnGln: 1.773 ± 0.709
1.95GlnArg: 1.95 ± 0.678
3.723GlnSer: 3.723 ± 0.968
2.66GlnThr: 2.66 ± 0.715
1.95GlnVal: 1.95 ± 0.53
0.532GlnTrp: 0.532 ± 0.305
1.773GlnTyr: 1.773 ± 0.436
0.0GlnXaa: 0.0 ± 0.0
Arg
2.482ArgAla: 2.482 ± 0.553
0.887ArgCys: 0.887 ± 0.458
3.014ArgAsp: 3.014 ± 0.464
4.078ArgGlu: 4.078 ± 0.661
2.305ArgPhe: 2.305 ± 0.728
1.95ArgGly: 1.95 ± 0.705
0.709ArgHis: 0.709 ± 0.424
5.496ArgIle: 5.496 ± 0.751
3.723ArgLys: 3.723 ± 0.783
4.255ArgLeu: 4.255 ± 1.197
0.709ArgMet: 0.709 ± 0.259
3.191ArgAsn: 3.191 ± 0.808
2.482ArgPro: 2.482 ± 0.56
1.95ArgGln: 1.95 ± 0.489
2.482ArgArg: 2.482 ± 0.803
3.723ArgSer: 3.723 ± 0.84
3.369ArgThr: 3.369 ± 0.907
2.305ArgVal: 2.305 ± 0.62
0.887ArgTrp: 0.887 ± 0.372
2.128ArgTyr: 2.128 ± 0.359
0.0ArgXaa: 0.0 ± 0.0
Ser
5.496SerAla: 5.496 ± 1.103
0.709SerCys: 0.709 ± 0.275
7.979SerAsp: 7.979 ± 0.808
2.837SerGlu: 2.837 ± 0.594
4.255SerPhe: 4.255 ± 0.884
2.837SerGly: 2.837 ± 0.937
1.241SerHis: 1.241 ± 0.317
5.496SerIle: 5.496 ± 0.796
7.092SerLys: 7.092 ± 1.017
6.206SerLeu: 6.206 ± 0.767
2.305SerMet: 2.305 ± 0.413
5.319SerAsn: 5.319 ± 0.893
3.546SerPro: 3.546 ± 0.615
4.078SerGln: 4.078 ± 1.136
3.014SerArg: 3.014 ± 0.541
5.851SerSer: 5.851 ± 1.267
3.901SerThr: 3.901 ± 0.634
4.61SerVal: 4.61 ± 0.785
0.532SerTrp: 0.532 ± 0.301
2.66SerTyr: 2.66 ± 0.951
0.0SerXaa: 0.0 ± 0.0
Thr
3.191ThrAla: 3.191 ± 0.991
0.532ThrCys: 0.532 ± 0.244
3.369ThrAsp: 3.369 ± 0.733
4.078ThrGlu: 4.078 ± 1.042
3.723ThrPhe: 3.723 ± 0.438
3.014ThrGly: 3.014 ± 0.866
1.241ThrHis: 1.241 ± 0.42
4.255ThrIle: 4.255 ± 0.973
4.255ThrLys: 4.255 ± 0.514
7.447ThrLeu: 7.447 ± 0.856
0.887ThrMet: 0.887 ± 0.385
3.369ThrAsn: 3.369 ± 0.632
2.482ThrPro: 2.482 ± 0.605
2.305ThrGln: 2.305 ± 0.493
2.305ThrArg: 2.305 ± 0.59
5.496ThrSer: 5.496 ± 0.708
4.078ThrThr: 4.078 ± 0.602
4.965ThrVal: 4.965 ± 0.63
0.177ThrTrp: 0.177 ± 0.165
3.901ThrTyr: 3.901 ± 0.58
0.0ThrXaa: 0.0 ± 0.0
Val
3.546ValAla: 3.546 ± 0.989
0.887ValCys: 0.887 ± 0.422
3.369ValAsp: 3.369 ± 0.821
2.837ValGlu: 2.837 ± 0.607
2.66ValPhe: 2.66 ± 0.803
2.66ValGly: 2.66 ± 0.723
1.064ValHis: 1.064 ± 0.282
3.369ValIle: 3.369 ± 0.727
4.078ValLys: 4.078 ± 0.808
5.319ValLeu: 5.319 ± 1.155
0.887ValMet: 0.887 ± 0.335
4.787ValAsn: 4.787 ± 0.814
2.128ValPro: 2.128 ± 0.411
2.482ValGln: 2.482 ± 0.503
2.66ValArg: 2.66 ± 0.798
5.319ValSer: 5.319 ± 0.844
3.901ValThr: 3.901 ± 0.629
3.369ValVal: 3.369 ± 0.719
0.532ValTrp: 0.532 ± 0.338
1.596ValTyr: 1.596 ± 0.375
0.0ValXaa: 0.0 ± 0.0
Trp
0.532TrpAla: 0.532 ± 0.247
0.177TrpCys: 0.177 ± 0.204
0.709TrpAsp: 0.709 ± 0.253
0.0TrpGlu: 0.0 ± 0.0
0.355TrpPhe: 0.355 ± 0.295
0.177TrpGly: 0.177 ± 0.199
0.0TrpHis: 0.0 ± 0.0
0.532TrpIle: 0.532 ± 0.246
1.241TrpLys: 1.241 ± 0.518
1.064TrpLeu: 1.064 ± 0.332
0.177TrpMet: 0.177 ± 0.143
0.709TrpAsn: 0.709 ± 0.319
0.177TrpPro: 0.177 ± 0.186
0.709TrpGln: 0.709 ± 0.384
0.709TrpArg: 0.709 ± 0.244
0.532TrpSer: 0.532 ± 0.26
0.532TrpThr: 0.532 ± 0.243
0.532TrpVal: 0.532 ± 0.28
0.0TrpTrp: 0.0 ± 0.0
0.355TrpTyr: 0.355 ± 0.203
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.773TyrAla: 1.773 ± 0.518
0.355TyrCys: 0.355 ± 0.329
2.305TyrAsp: 2.305 ± 0.545
2.305TyrGlu: 2.305 ± 1.015
1.95TyrPhe: 1.95 ± 0.591
1.241TyrGly: 1.241 ± 0.518
0.887TyrHis: 0.887 ± 0.484
2.128TyrIle: 2.128 ± 0.861
3.191TyrLys: 3.191 ± 0.525
3.369TyrLeu: 3.369 ± 0.677
1.064TyrMet: 1.064 ± 0.328
3.546TyrAsn: 3.546 ± 0.459
1.064TyrPro: 1.064 ± 0.252
1.596TyrGln: 1.596 ± 0.532
1.95TyrArg: 1.95 ± 0.448
4.255TyrSer: 4.255 ± 1.217
3.369TyrThr: 3.369 ± 0.924
2.66TyrVal: 2.66 ± 0.628
0.177TyrTrp: 0.177 ± 0.137
1.596TyrTyr: 1.596 ± 0.417
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 12 proteins (5641 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski