Amino acid dipeptide frequency for Human rotavirus G9P[8]

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
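As a rough illustration of how such frequencies can be computed, the following minimal sketch counts overlapping residue pairs in a list of protein sequences and reports them in per mille. The function name `dipeptide_permille` and the toy sequences are hypothetical, and the sketch uses one-letter codes while the table uses three-letter codes. It assumes that pairs are counted within each protein only; the page does not state how Proteome-pI handles protein boundaries.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; assumption: pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000 * n / total for dp, n in counts.items()}

# Toy sequences for illustration only, not the rotavirus proteome.
freqs = dipeptide_permille(["MKLLA", "ALLK"])
for dp, value in sorted(freqs.items()):
    print(f"{dp}: {value:.3f} per mille")
```

Because the pairs are ordered, LysLeu and LeuLys are counted separately, which is why the table is not symmetric.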

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each of the 400 dipeptides would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
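The arithmetic can be sketched as follows: a uniform expectation of 1/400 equals 0.25%, or 2.5‰, and each table value can be compared against it. The three sample values are taken from the table; the `classify` helper and the plain greater-than or less-than comparison are assumptions for illustration, since the page does not state the exact rule used for the red and blue marking.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PERMILLE = 1000 / 400  # 2.5 per mille, i.e. 0.25 %

def classify(observed_permille, expected=EXPECTED_PERMILLE):
    """Label a value relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"

# Sample values copied from the table (per mille).
sample = {"LysLeu": 8.586, "AlaAla": 2.415, "CysCys": 0.537}

for name, permille in sample.items():
    fraction = permille * 1e-3  # per mille to plain fraction
    print(f"{name}: {permille} per mille = {fraction:.6f} ({classify(permille)})")
```

Under this assumed rule LysLeu would count as overrepresented, while AlaAla and CysCys fall below the 2.5‰ expectation.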

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
2.415AlaAla: 2.415 ± 0.656
1.342AlaCys: 1.342 ± 0.664
3.756AlaAsp: 3.756 ± 1.041
1.878AlaGlu: 1.878 ± 0.88
1.878AlaPhe: 1.878 ± 0.692
1.61AlaGly: 1.61 ± 0.943
0.0AlaHis: 0.0 ± 0.0
4.025AlaIle: 4.025 ± 0.794
2.683AlaLys: 2.683 ± 0.953
4.293AlaLeu: 4.293 ± 1.131
0.537AlaMet: 0.537 ± 0.42
4.025AlaAsn: 4.025 ± 1.076
1.61AlaPro: 1.61 ± 0.573
1.61AlaGln: 1.61 ± 0.735
1.342AlaArg: 1.342 ± 1.132
4.293AlaSer: 4.293 ± 1.367
3.488AlaThr: 3.488 ± 0.99
3.488AlaVal: 3.488 ± 1.001
0.0AlaTrp: 0.0 ± 0.0
1.878AlaTyr: 1.878 ± 0.55
0.0AlaXaa: 0.0 ± 0.0
Cys
0.268CysAla: 0.268 ± 0.29
0.537CysCys: 0.537 ± 0.322
0.805CysAsp: 0.805 ± 0.446
0.805CysGlu: 0.805 ± 0.429
0.805CysPhe: 0.805 ± 0.384
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.805CysIle: 0.805 ± 0.522
1.61CysLys: 1.61 ± 0.863
1.073CysLeu: 1.073 ± 0.566
1.073CysMet: 1.073 ± 0.447
1.342CysAsn: 1.342 ± 0.687
0.268CysPro: 0.268 ± 0.26
0.805CysGln: 0.805 ± 0.517
1.073CysArg: 1.073 ± 0.747
1.61CysSer: 1.61 ± 0.936
1.342CysThr: 1.342 ± 0.811
1.073CysVal: 1.073 ± 0.365
0.0CysTrp: 0.0 ± 0.0
0.537CysTyr: 0.537 ± 0.325
0.0CysXaa: 0.0 ± 0.0
Asp
2.683AspAla: 2.683 ± 1.077
0.537AspCys: 0.537 ± 0.329
4.025AspAsp: 4.025 ± 1.195
3.756AspGlu: 3.756 ± 0.469
4.293AspPhe: 4.293 ± 1.121
2.146AspGly: 2.146 ± 0.376
0.805AspHis: 0.805 ± 0.493
5.635AspIle: 5.635 ± 1.253
4.561AspLys: 4.561 ± 1.001
3.488AspLeu: 3.488 ± 0.737
1.342AspMet: 1.342 ± 0.568
3.22AspAsn: 3.22 ± 0.949
1.61AspPro: 1.61 ± 0.681
3.22AspGln: 3.22 ± 1.069
1.878AspArg: 1.878 ± 0.684
6.708AspSer: 6.708 ± 2.083
3.22AspThr: 3.22 ± 0.825
5.366AspVal: 5.366 ± 0.945
1.342AspTrp: 1.342 ± 0.569
3.756AspTyr: 3.756 ± 0.896
0.0AspXaa: 0.0 ± 0.0
Glu
2.415GluAla: 2.415 ± 0.78
0.268GluCys: 0.268 ± 0.236
2.683GluAsp: 2.683 ± 0.697
1.342GluGlu: 1.342 ± 0.767
1.878GluPhe: 1.878 ± 0.497
0.268GluGly: 0.268 ± 0.279
0.268GluHis: 0.268 ± 0.29
3.756GluIle: 3.756 ± 0.915
3.488GluLys: 3.488 ± 0.956
7.781GluLeu: 7.781 ± 1.597
2.951GluMet: 2.951 ± 0.936
3.22GluAsn: 3.22 ± 0.663
1.878GluPro: 1.878 ± 0.655
2.415GluGln: 2.415 ± 0.925
2.683GluArg: 2.683 ± 1.142
3.22GluSer: 3.22 ± 1.127
2.146GluThr: 2.146 ± 0.666
3.22GluVal: 3.22 ± 0.83
1.61GluTrp: 1.61 ± 0.571
4.83GluTyr: 4.83 ± 1.198
0.0GluXaa: 0.0 ± 0.0
Phe
1.61PheAla: 1.61 ± 0.661
0.268PheCys: 0.268 ± 0.262
3.22PheAsp: 3.22 ± 1.063
1.878PheGlu: 1.878 ± 0.638
0.537PhePhe: 0.537 ± 0.328
2.146PheGly: 2.146 ± 0.692
1.878PheHis: 1.878 ± 0.768
3.22PheIle: 3.22 ± 0.832
3.22PheLys: 3.22 ± 0.664
4.561PheLeu: 4.561 ± 1.517
0.0PheMet: 0.0 ± 0.0
3.756PheAsn: 3.756 ± 1.018
2.146PhePro: 2.146 ± 1.073
1.878PheGln: 1.878 ± 0.933
1.342PheArg: 1.342 ± 0.407
3.488PheSer: 3.488 ± 0.776
4.293PheThr: 4.293 ± 0.605
1.878PheVal: 1.878 ± 0.906
0.537PheTrp: 0.537 ± 0.339
2.415PheTyr: 2.415 ± 0.672
0.0PheXaa: 0.0 ± 0.0
Gly
1.073GlyAla: 1.073 ± 0.444
1.073GlyCys: 1.073 ± 0.475
0.805GlyAsp: 0.805 ± 0.417
1.878GlyGlu: 1.878 ± 0.973
1.073GlyPhe: 1.073 ± 0.688
1.342GlyGly: 1.342 ± 0.585
1.073GlyHis: 1.073 ± 0.415
3.756GlyIle: 3.756 ± 0.922
3.488GlyLys: 3.488 ± 1.292
2.415GlyLeu: 2.415 ± 0.773
1.342GlyMet: 1.342 ± 0.459
1.878GlyAsn: 1.878 ± 0.732
1.61GlyPro: 1.61 ± 0.587
1.342GlyGln: 1.342 ± 0.45
0.805GlyArg: 0.805 ± 0.431
2.146GlySer: 2.146 ± 0.877
1.61GlyThr: 1.61 ± 1.134
2.415GlyVal: 2.415 ± 0.308
1.073GlyTrp: 1.073 ± 0.566
1.342GlyTyr: 1.342 ± 0.418
0.0GlyXaa: 0.0 ± 0.0
His
0.805HisAla: 0.805 ± 0.441
0.268HisCys: 0.268 ± 0.236
1.342HisAsp: 1.342 ± 0.45
0.537HisGlu: 0.537 ± 0.329
0.537HisPhe: 0.537 ± 0.335
0.805HisGly: 0.805 ± 0.594
0.805HisHis: 0.805 ± 0.485
0.537HisIle: 0.537 ± 0.33
2.683HisLys: 2.683 ± 0.643
1.61HisLeu: 1.61 ± 0.633
0.537HisMet: 0.537 ± 0.358
1.073HisAsn: 1.073 ± 0.527
0.268HisPro: 0.268 ± 0.228
0.268HisGln: 0.268 ± 0.262
0.268HisArg: 0.268 ± 0.236
1.878HisSer: 1.878 ± 0.499
1.073HisThr: 1.073 ± 0.735
1.073HisVal: 1.073 ± 0.453
0.268HisTrp: 0.268 ± 0.228
1.073HisTyr: 1.073 ± 0.498
0.0HisXaa: 0.0 ± 0.0
Ile
4.025IleAla: 4.025 ± 1.215
0.537IleCys: 0.537 ± 0.523
5.366IleAsp: 5.366 ± 1.644
5.366IleGlu: 5.366 ± 1.111
2.146IlePhe: 2.146 ± 0.688
3.488IleGly: 3.488 ± 0.745
0.805IleHis: 0.805 ± 0.418
6.976IleIle: 6.976 ± 1.237
4.025IleLys: 4.025 ± 0.893
5.098IleLeu: 5.098 ± 0.878
0.537IleMet: 0.537 ± 0.353
8.049IleAsn: 8.049 ± 1.306
2.683IlePro: 2.683 ± 0.704
3.22IleGln: 3.22 ± 0.875
4.025IleArg: 4.025 ± 0.89
3.756IleSer: 3.756 ± 1.336
8.049IleThr: 8.049 ± 1.031
5.635IleVal: 5.635 ± 1.147
0.537IleTrp: 0.537 ± 0.371
3.756IleTyr: 3.756 ± 1.405
0.0IleXaa: 0.0 ± 0.0
Lys
1.61LysAla: 1.61 ± 0.675
2.146LysCys: 2.146 ± 0.681
3.488LysAsp: 3.488 ± 0.958
3.756LysGlu: 3.756 ± 1.355
2.951LysPhe: 2.951 ± 1.102
2.683LysGly: 2.683 ± 0.718
0.805LysHis: 0.805 ± 0.39
3.756LysIle: 3.756 ± 1.002
3.488LysLys: 3.488 ± 1.522
8.586LysLeu: 8.586 ± 1.611
2.415LysMet: 2.415 ± 0.747
4.025LysAsn: 4.025 ± 1.223
2.415LysPro: 2.415 ± 0.709
3.756LysGln: 3.756 ± 1.233
3.488LysArg: 3.488 ± 0.43
3.488LysSer: 3.488 ± 1.354
3.756LysThr: 3.756 ± 0.844
4.561LysVal: 4.561 ± 1.034
1.878LysTrp: 1.878 ± 0.811
4.293LysTyr: 4.293 ± 1.186
0.0LysXaa: 0.0 ± 0.0
Leu
3.22LeuAla: 3.22 ± 0.998
1.073LeuCys: 1.073 ± 0.598
7.781LeuAsp: 7.781 ± 1.181
5.366LeuGlu: 5.366 ± 1.584
4.293LeuPhe: 4.293 ± 0.843
2.951LeuGly: 2.951 ± 0.902
2.146LeuHis: 2.146 ± 0.937
7.513LeuIle: 7.513 ± 0.632
6.708LeuLys: 6.708 ± 1.314
8.049LeuLeu: 8.049 ± 1.139
4.025LeuMet: 4.025 ± 1.072
6.976LeuAsn: 6.976 ± 1.235
3.488LeuPro: 3.488 ± 0.901
2.683LeuGln: 2.683 ± 0.887
4.561LeuArg: 4.561 ± 1.143
6.439LeuSer: 6.439 ± 1.396
5.903LeuThr: 5.903 ± 0.814
4.293LeuVal: 4.293 ± 1.236
0.268LeuTrp: 0.268 ± 0.27
2.951LeuTyr: 2.951 ± 1.164
0.0LeuXaa: 0.0 ± 0.0
Met
1.073MetAla: 1.073 ± 0.463
0.0MetCys: 0.0 ± 0.0
3.22MetAsp: 3.22 ± 0.92
1.073MetGlu: 1.073 ± 0.831
1.342MetPhe: 1.342 ± 0.467
1.073MetGly: 1.073 ± 0.429
0.537MetHis: 0.537 ± 0.421
1.342MetIle: 1.342 ± 0.825
2.415MetLys: 2.415 ± 0.675
3.488MetLeu: 3.488 ± 0.868
0.268MetMet: 0.268 ± 0.26
2.146MetAsn: 2.146 ± 0.854
0.537MetPro: 0.537 ± 0.325
0.537MetGln: 0.537 ± 0.355
2.146MetArg: 2.146 ± 0.666
2.415MetSer: 2.415 ± 1.286
1.61MetThr: 1.61 ± 0.583
0.537MetVal: 0.537 ± 0.328
0.537MetTrp: 0.537 ± 0.378
2.146MetTyr: 2.146 ± 0.606
0.0MetXaa: 0.0 ± 0.0
Asn
3.756AsnAla: 3.756 ± 0.992
1.342AsnCys: 1.342 ± 0.751
4.293AsnAsp: 4.293 ± 0.861
5.098AsnGlu: 5.098 ± 0.865
3.22AsnPhe: 3.22 ± 1.028
3.22AsnGly: 3.22 ± 1.093
2.683AsnHis: 2.683 ± 1.211
2.951AsnIle: 2.951 ± 1.26
3.22AsnLys: 3.22 ± 0.743
6.976AsnLeu: 6.976 ± 1.403
2.683AsnMet: 2.683 ± 0.931
4.293AsnAsn: 4.293 ± 1.157
2.146AsnPro: 2.146 ± 0.646
1.61AsnGln: 1.61 ± 0.482
2.415AsnArg: 2.415 ± 0.695
5.635AsnSer: 5.635 ± 1.07
4.561AsnThr: 4.561 ± 0.881
7.781AsnVal: 7.781 ± 1.577
2.146AsnTrp: 2.146 ± 0.819
3.22AsnTyr: 3.22 ± 0.7
0.0AsnXaa: 0.0 ± 0.0
Pro
0.805ProAla: 0.805 ± 0.621
0.0ProCys: 0.0 ± 0.0
1.61ProAsp: 1.61 ± 0.436
0.268ProGlu: 0.268 ± 0.314
2.146ProPhe: 2.146 ± 0.484
1.342ProGly: 1.342 ± 0.476
1.073ProHis: 1.073 ± 0.565
4.025ProIle: 4.025 ± 0.572
0.537ProLys: 0.537 ± 0.456
1.878ProLeu: 1.878 ± 0.57
1.61ProMet: 1.61 ± 0.548
1.073ProAsn: 1.073 ± 0.848
2.146ProPro: 2.146 ± 0.776
1.878ProGln: 1.878 ± 0.916
2.415ProArg: 2.415 ± 0.591
2.415ProSer: 2.415 ± 0.904
3.756ProThr: 3.756 ± 1.252
2.146ProVal: 2.146 ± 0.85
0.0ProTrp: 0.0 ± 0.0
1.61ProTyr: 1.61 ± 0.557
0.0ProXaa: 0.0 ± 0.0
Gln
0.805GlnAla: 0.805 ± 0.599
0.268GlnCys: 0.268 ± 0.285
1.342GlnAsp: 1.342 ± 0.649
2.415GlnGlu: 2.415 ± 0.557
1.61GlnPhe: 1.61 ± 0.635
0.805GlnGly: 0.805 ± 0.411
1.61GlnHis: 1.61 ± 0.71
3.488GlnIle: 3.488 ± 1.354
1.878GlnLys: 1.878 ± 0.66
3.22GlnLeu: 3.22 ± 0.879
1.073GlnMet: 1.073 ± 0.53
3.756GlnAsn: 3.756 ± 1.064
1.342GlnPro: 1.342 ± 0.637
3.488GlnGln: 3.488 ± 1.337
1.61GlnArg: 1.61 ± 0.832
2.951GlnSer: 2.951 ± 0.637
2.415GlnThr: 2.415 ± 0.831
2.683GlnVal: 2.683 ± 0.866
0.537GlnTrp: 0.537 ± 0.471
2.951GlnTyr: 2.951 ± 1.146
0.0GlnXaa: 0.0 ± 0.0
Arg
1.878ArgAla: 1.878 ± 0.776
1.073ArgCys: 1.073 ± 0.527
1.878ArgAsp: 1.878 ± 0.9
1.073ArgGlu: 1.073 ± 0.65
2.146ArgPhe: 2.146 ± 0.53
1.342ArgGly: 1.342 ± 0.482
1.073ArgHis: 1.073 ± 0.5
3.756ArgIle: 3.756 ± 0.715
3.488ArgLys: 3.488 ± 0.73
3.22ArgLeu: 3.22 ± 1.594
1.61ArgMet: 1.61 ± 0.54
3.756ArgAsn: 3.756 ± 1.058
1.342ArgPro: 1.342 ± 0.605
1.878ArgGln: 1.878 ± 0.669
2.146ArgArg: 2.146 ± 0.798
2.951ArgSer: 2.951 ± 0.848
2.683ArgThr: 2.683 ± 0.769
3.488ArgVal: 3.488 ± 0.885
0.268ArgTrp: 0.268 ± 0.262
1.61ArgTyr: 1.61 ± 0.602
0.0ArgXaa: 0.0 ± 0.0
Ser
3.756SerAla: 3.756 ± 0.973
0.805SerCys: 0.805 ± 0.462
4.293SerAsp: 4.293 ± 1.43
4.83SerGlu: 4.83 ± 1.174
3.488SerPhe: 3.488 ± 0.782
1.61SerGly: 1.61 ± 0.55
0.805SerHis: 0.805 ± 0.394
8.049SerIle: 8.049 ± 1.819
4.83SerLys: 4.83 ± 1.108
6.708SerLeu: 6.708 ± 1.331
2.683SerMet: 2.683 ± 0.632
5.635SerAsn: 5.635 ± 1.377
2.415SerPro: 2.415 ± 0.664
2.951SerGln: 2.951 ± 0.752
3.488SerArg: 3.488 ± 0.748
7.513SerSer: 7.513 ± 2.471
4.83SerThr: 4.83 ± 1.365
4.293SerVal: 4.293 ± 0.592
0.268SerTrp: 0.268 ± 0.245
3.22SerTyr: 3.22 ± 0.848
0.0SerXaa: 0.0 ± 0.0
Thr
4.561ThrAla: 4.561 ± 1.027
0.268ThrCys: 0.268 ± 0.26
5.098ThrAsp: 5.098 ± 1.383
3.488ThrGlu: 3.488 ± 0.977
4.025ThrPhe: 4.025 ± 0.745
1.61ThrGly: 1.61 ± 0.789
0.537ThrHis: 0.537 ± 0.291
4.83ThrIle: 4.83 ± 1.057
2.415ThrLys: 2.415 ± 0.83
8.854ThrLeu: 8.854 ± 2.006
1.61ThrMet: 1.61 ± 0.728
3.22ThrAsn: 3.22 ± 0.694
1.878ThrPro: 1.878 ± 0.672
2.683ThrGln: 2.683 ± 0.858
2.951ThrArg: 2.951 ± 1.059
6.171ThrSer: 6.171 ± 0.974
5.903ThrThr: 5.903 ± 1.506
4.561ThrVal: 4.561 ± 1.157
1.342ThrTrp: 1.342 ± 0.369
1.878ThrTyr: 1.878 ± 0.698
0.0ThrXaa: 0.0 ± 0.0
Val
5.635ValAla: 5.635 ± 1.384
2.146ValCys: 2.146 ± 0.765
3.488ValAsp: 3.488 ± 1.106
4.293ValGlu: 4.293 ± 0.978
3.756ValPhe: 3.756 ± 0.688
2.951ValGly: 2.951 ± 0.613
0.268ValHis: 0.268 ± 0.236
4.83ValIle: 4.83 ± 0.925
5.366ValLys: 5.366 ± 1.184
5.098ValLeu: 5.098 ± 1.341
1.073ValMet: 1.073 ± 0.628
5.903ValAsn: 5.903 ± 1.581
1.878ValPro: 1.878 ± 0.906
1.878ValGln: 1.878 ± 0.491
1.61ValArg: 1.61 ± 0.526
3.756ValSer: 3.756 ± 0.693
3.756ValThr: 3.756 ± 0.916
3.22ValVal: 3.22 ± 1.161
0.537ValTrp: 0.537 ± 0.322
2.415ValTyr: 2.415 ± 0.566
0.0ValXaa: 0.0 ± 0.0
Trp
0.268TrpAla: 0.268 ± 0.245
0.805TrpCys: 0.805 ± 0.707
0.805TrpAsp: 0.805 ± 0.507
0.268TrpGlu: 0.268 ± 0.283
0.537TrpPhe: 0.537 ± 0.33
0.268TrpGly: 0.268 ± 0.27
0.0TrpHis: 0.0 ± 0.0
1.342TrpIle: 1.342 ± 0.55
2.415TrpLys: 2.415 ± 0.747
1.342TrpLeu: 1.342 ± 0.531
0.268TrpMet: 0.268 ± 0.285
0.805TrpAsn: 0.805 ± 0.494
0.268TrpPro: 0.268 ± 0.26
0.805TrpGln: 0.805 ± 0.519
0.537TrpArg: 0.537 ± 0.322
0.537TrpSer: 0.537 ± 0.339
1.342TrpThr: 1.342 ± 0.709
0.268TrpVal: 0.268 ± 0.285
0.268TrpTrp: 0.268 ± 0.26
0.805TrpTyr: 0.805 ± 0.384
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.488TyrAla: 3.488 ± 0.942
1.073TyrCys: 1.073 ± 0.455
4.025TyrAsp: 4.025 ± 1.242
2.951TyrGlu: 2.951 ± 0.752
1.61TyrPhe: 1.61 ± 0.573
1.878TyrGly: 1.878 ± 0.679
0.537TyrHis: 0.537 ± 0.322
3.488TyrIle: 3.488 ± 0.928
4.83TyrLys: 4.83 ± 1.043
2.951TyrLeu: 2.951 ± 1.06
0.805TyrMet: 0.805 ± 0.391
5.098TyrAsn: 5.098 ± 0.937
0.805TyrPro: 0.805 ± 0.494
1.342TyrGln: 1.342 ± 0.658
1.878TyrArg: 1.878 ± 0.679
5.098TyrSer: 5.098 ± 1.208
2.146TyrThr: 2.146 ± 0.808
2.146TyrVal: 2.146 ± 0.548
0.537TyrTrp: 0.537 ± 0.334
4.293TyrTyr: 4.293 ± 1.591
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 11 proteins (3728 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
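A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported. The function names, the fixed seed, and the use of the standard deviation as the error measure are assumptions; the page does not specify which statistic follows the ± sign.

```python
import random
import statistics
from collections import Counter

def dipeptide_permille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter code) in per mille."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[dipeptide] / total if total else 0.0

def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(resample, dipeptide))
    return statistics.stdev(estimates)

# Toy sequences for illustration only, not the rotavirus proteome.
toy = ["MKLLA", "ALLK", "MSTLLV", "KKDE"]
print(f"LL: {dipeptide_permille(toy, 'LL'):.3f} ± {bootstrap_error(toy, 'LL'):.3f} per mille")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much the result depends on which proteins are included.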

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski