Amino acid dipeptide frequency for Rotavirus G3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
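As an illustration of how such a frequency can be computed, here is a minimal Python sketch. It is not the code used to build this page: the function name, the toy sequences, and the choice to normalise by the total number of overlapping pairs are all assumptions, and it uses one-letter residue codes rather than the three-letter codes shown in the table.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: frequencies are normalised by the total number of pairs
    found in all proteins; the page does not state its exact normalisation.
    """
    counts = Counter()
    for seq in sequences:
        # Pairs are counted inside each protein only, never across
        # the boundary between two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences, not rotavirus data.
toy = ["MKLLA", "MKT"]
freqs = dipeptide_per_mille(toy)
```

Here the toy input yields six overlapping pairs in total, two of which are MK, so `freqs["MK"]` is one third expressed in per mille.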

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
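The arithmetic behind the expected value can be sketched in a few lines of Python. The function names are hypothetical, and the plain comparison against the uniform expectation is an assumption: the page does not say exactly which threshold decides the colouring.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Label a value relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"    # would be marked red
    if observed_per_mille < expected:
        return "under"   # would be marked blue
    return "expected"


print(expected_per_mille(20))            # 2.5   (1/400, i.e. 0.25%)
print(round(expected_per_mille(21), 3))  # 2.268 (1/441, with Xaa included)
print(classify(10.3))                    # ThrLeu value from the table -> over
print(classify(0.303))                   # CysCys value from the table -> under
```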

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.029 ± 1.023
AlaCys: 1.515 ± 0.669
AlaAsp: 3.635 ± 0.934
AlaGlu: 3.332 ± 0.837
AlaPhe: 2.726 ± 0.603
AlaGly: 1.818 ± 0.727
AlaHis: 0.606 ± 0.343
AlaIle: 4.544 ± 1.315
AlaLys: 1.515 ± 0.743
AlaLeu: 3.332 ± 0.947
AlaMet: 1.818 ± 0.638
AlaAsn: 4.241 ± 0.804
AlaPro: 1.818 ± 0.744
AlaGln: 1.818 ± 0.705
AlaArg: 2.121 ± 1.088
AlaSer: 3.635 ± 0.822
AlaThr: 3.938 ± 0.849
AlaVal: 3.635 ± 1.066
AlaTrp: 0.0 ± 0.0
AlaTyr: 0.606 ± 0.401
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.303 ± 0.267
CysCys: 0.303 ± 0.295
CysAsp: 1.212 ± 0.545
CysGlu: 1.212 ± 0.368
CysPhe: 0.909 ± 0.384
CysGly: 0.303 ± 0.271
CysHis: 0.0 ± 0.0
CysIle: 0.909 ± 0.471
CysLys: 1.818 ± 0.792
CysLeu: 1.515 ± 0.704
CysMet: 0.303 ± 0.267
CysAsn: 1.212 ± 0.419
CysPro: 0.303 ± 0.245
CysGln: 0.909 ± 0.63
CysArg: 0.606 ± 0.591
CysSer: 1.515 ± 1.197
CysThr: 1.212 ± 0.641
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.606 ± 0.374
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.332 ± 1.288
AspCys: 0.303 ± 0.295
AspAsp: 3.029 ± 1.471
AspGlu: 3.332 ± 1.146
AspPhe: 2.726 ± 0.515
AspGly: 2.424 ± 0.577
AspHis: 0.606 ± 0.376
AspIle: 5.15 ± 1.199
AspLys: 3.635 ± 1.139
AspLeu: 5.453 ± 0.783
AspMet: 1.818 ± 0.561
AspAsn: 2.726 ± 0.953
AspPro: 2.424 ± 0.65
AspGln: 2.121 ± 0.786
AspArg: 2.424 ± 1.006
AspSer: 6.362 ± 2.267
AspThr: 3.635 ± 0.967
AspVal: 3.938 ± 0.963
AspTrp: 0.606 ± 0.334
AspTyr: 4.241 ± 1.3
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.029 ± 0.663
GluCys: 0.0 ± 0.0
GluAsp: 2.726 ± 0.775
GluGlu: 3.332 ± 1.615
GluPhe: 0.909 ± 0.622
GluGly: 1.212 ± 0.443
GluHis: 1.212 ± 0.612
GluIle: 5.756 ± 1.511
GluLys: 3.938 ± 0.813
GluLeu: 4.544 ± 1.19
GluMet: 2.726 ± 0.828
GluAsn: 5.15 ± 1.227
GluPro: 2.121 ± 1.165
GluGln: 2.424 ± 1.018
GluArg: 3.029 ± 0.789
GluSer: 4.544 ± 0.858
GluThr: 2.726 ± 0.998
GluVal: 3.635 ± 0.874
GluTrp: 1.818 ± 0.83
GluTyr: 5.453 ± 1.181
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.212 ± 0.479
PheCys: 0.606 ± 0.379
PheAsp: 3.029 ± 1.056
PheGlu: 2.424 ± 0.914
PhePhe: 0.303 ± 0.308
PheGly: 1.818 ± 0.66
PheHis: 0.909 ± 0.488
PheIle: 2.424 ± 0.703
PheLys: 3.635 ± 0.698
PheLeu: 3.635 ± 1.009
PheMet: 0.606 ± 0.386
PheAsn: 3.635 ± 0.733
PhePro: 2.424 ± 0.984
PheGln: 1.515 ± 0.712
PheArg: 1.212 ± 0.629
PheSer: 3.332 ± 0.889
PheThr: 4.544 ± 1.377
PheVal: 0.909 ± 0.58
PheTrp: 0.303 ± 0.309
PheTyr: 1.212 ± 0.623
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.515 ± 0.652
GlyCys: 1.212 ± 0.475
GlyAsp: 0.606 ± 0.391
GlyGlu: 1.212 ± 0.632
GlyPhe: 0.606 ± 0.534
GlyGly: 1.515 ± 0.697
GlyHis: 1.212 ± 0.628
GlyIle: 3.635 ± 0.842
GlyLys: 3.029 ± 0.901
GlyLeu: 2.121 ± 0.786
GlyMet: 1.212 ± 0.47
GlyAsn: 0.909 ± 0.564
GlyPro: 2.424 ± 1.034
GlyGln: 1.212 ± 0.457
GlyArg: 0.606 ± 0.471
GlySer: 2.726 ± 1.021
GlyThr: 1.818 ± 1.17
GlyVal: 2.424 ± 0.785
GlyTrp: 0.909 ± 0.395
GlyTyr: 2.121 ± 0.969
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.909 ± 0.41
HisCys: 0.303 ± 0.295
HisAsp: 0.909 ± 0.563
HisGlu: 0.909 ± 0.718
HisPhe: 0.606 ± 0.591
HisGly: 0.606 ± 0.379
HisHis: 0.303 ± 0.267
HisIle: 0.303 ± 0.267
HisLys: 1.818 ± 0.569
HisLeu: 0.909 ± 0.578
HisMet: 0.0 ± 0.0
HisAsn: 1.212 ± 0.677
HisPro: 0.303 ± 0.329
HisGln: 1.212 ± 0.63
HisArg: 0.303 ± 0.341
HisSer: 1.212 ± 0.519
HisThr: 0.909 ± 0.643
HisVal: 0.606 ± 0.374
HisTrp: 0.303 ± 0.276
HisTyr: 0.909 ± 0.441
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.635 ± 0.961
IleCys: 0.606 ± 0.37
IleAsp: 3.938 ± 1.484
IleGlu: 6.362 ± 1.017
IlePhe: 2.726 ± 1.189
IleGly: 3.029 ± 0.724
IleHis: 0.303 ± 0.286
IleIle: 4.847 ± 1.139
IleLys: 6.059 ± 1.199
IleLeu: 6.362 ± 1.466
IleMet: 0.909 ± 0.454
IleAsn: 5.15 ± 1.704
IlePro: 2.424 ± 0.742
IleGln: 2.726 ± 0.885
IleArg: 4.544 ± 0.913
IleSer: 5.15 ± 1.357
IleThr: 5.756 ± 1.141
IleVal: 4.241 ± 0.921
IleTrp: 0.303 ± 0.265
IleTyr: 5.453 ± 1.391
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.818 ± 0.802
LysCys: 2.121 ± 0.664
LysAsp: 4.241 ± 0.879
LysGlu: 5.15 ± 1.976
LysPhe: 2.424 ± 0.838
LysGly: 2.424 ± 0.655
LysHis: 0.909 ± 0.438
LysIle: 4.241 ± 0.911
LysLys: 3.635 ± 1.225
LysLeu: 8.482 ± 1.567
LysMet: 1.818 ± 0.92
LysAsn: 3.635 ± 1.135
LysPro: 2.121 ± 0.753
LysGln: 3.635 ± 1.33
LysArg: 3.635 ± 0.587
LysSer: 5.15 ± 1.289
LysThr: 3.635 ± 1.158
LysVal: 3.938 ± 1.153
LysTrp: 1.515 ± 0.772
LysTyr: 4.241 ± 1.425
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.635 ± 0.593
LeuCys: 0.606 ± 0.489
LeuAsp: 6.968 ± 1.364
LeuGlu: 5.15 ± 1.661
LeuPhe: 4.544 ± 0.943
LeuGly: 1.515 ± 0.591
LeuHis: 1.818 ± 0.74
LeuIle: 7.271 ± 1.779
LeuLys: 6.362 ± 1.433
LeuLeu: 8.482 ± 1.253
LeuMet: 3.938 ± 0.726
LeuAsn: 8.785 ± 1.211
LeuPro: 3.029 ± 0.73
LeuGln: 3.635 ± 0.919
LeuArg: 4.241 ± 1.007
LeuSer: 7.876 ± 1.518
LeuThr: 8.482 ± 1.114
LeuVal: 3.938 ± 0.896
LeuTrp: 0.606 ± 0.573
LeuTyr: 3.332 ± 0.79
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.121 ± 0.762
MetCys: 0.0 ± 0.0
MetAsp: 3.332 ± 0.931
MetGlu: 1.515 ± 0.8
MetPhe: 1.212 ± 0.516
MetGly: 1.212 ± 0.482
MetHis: 0.303 ± 0.295
MetIle: 0.909 ± 0.427
MetLys: 3.029 ± 0.993
MetLeu: 3.332 ± 0.67
MetMet: 0.909 ± 0.41
MetAsn: 1.818 ± 0.546
MetPro: 1.212 ± 0.396
MetGln: 0.606 ± 0.485
MetArg: 2.726 ± 0.56
MetSer: 1.818 ± 0.549
MetThr: 1.818 ± 0.437
MetVal: 0.606 ± 0.366
MetTrp: 0.606 ± 0.351
MetTyr: 0.909 ± 0.365
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.453 ± 1.258
AsnCys: 1.212 ± 0.634
AsnAsp: 3.938 ± 0.999
AsnGlu: 4.847 ± 0.806
AsnPhe: 3.332 ± 1.139
AsnGly: 3.029 ± 0.935
AsnHis: 1.212 ± 0.604
AsnIle: 2.121 ± 0.781
AsnLys: 5.15 ± 1.339
AsnLeu: 6.362 ± 1.041
AsnMet: 2.424 ± 0.797
AsnAsn: 5.453 ± 1.223
AsnPro: 3.029 ± 0.704
AsnGln: 1.818 ± 0.812
AsnArg: 2.121 ± 0.743
AsnSer: 5.756 ± 1.245
AsnThr: 4.241 ± 1.211
AsnVal: 6.968 ± 0.998
AsnTrp: 2.726 ± 0.821
AsnTyr: 4.544 ± 0.703
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.212 ± 0.863
ProCys: 0.0 ± 0.0
ProAsp: 1.515 ± 0.788
ProGlu: 0.909 ± 0.565
ProPhe: 1.515 ± 0.582
ProGly: 2.424 ± 0.978
ProHis: 0.606 ± 0.374
ProIle: 4.241 ± 0.796
ProLys: 1.212 ± 0.716
ProLeu: 2.726 ± 0.703
ProMet: 0.909 ± 0.397
ProAsn: 2.121 ± 1.019
ProPro: 2.121 ± 0.866
ProGln: 1.515 ± 0.706
ProArg: 2.424 ± 0.537
ProSer: 2.424 ± 1.094
ProThr: 2.726 ± 0.929
ProVal: 1.818 ± 1.126
ProTrp: 0.0 ± 0.0
ProTyr: 1.515 ± 0.526
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.121 ± 0.667
GlnCys: 0.0 ± 0.0
GlnAsp: 0.606 ± 0.37
GlnGlu: 2.424 ± 0.489
GlnPhe: 1.515 ± 0.709
GlnGly: 0.606 ± 0.374
GlnHis: 1.515 ± 0.594
GlnIle: 2.121 ± 0.65
GlnLys: 1.818 ± 0.696
GlnLeu: 5.453 ± 1.266
GlnMet: 0.909 ± 0.428
GlnAsn: 3.332 ± 1.097
GlnPro: 0.606 ± 0.534
GlnGln: 2.726 ± 1.088
GlnArg: 2.424 ± 0.931
GlnSer: 2.121 ± 0.609
GlnThr: 2.424 ± 0.886
GlnVal: 3.029 ± 0.499
GlnTrp: 0.303 ± 0.295
GlnTyr: 2.121 ± 0.869
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.212 ± 0.417
ArgCys: 1.515 ± 0.629
ArgAsp: 1.515 ± 0.5
ArgGlu: 2.424 ± 1.164
ArgPhe: 2.726 ± 0.56
ArgGly: 1.212 ± 0.491
ArgHis: 0.909 ± 0.499
ArgIle: 3.635 ± 0.595
ArgLys: 3.029 ± 0.865
ArgLeu: 1.818 ± 0.71
ArgMet: 2.424 ± 0.821
ArgAsn: 6.059 ± 1.256
ArgPro: 1.212 ± 0.589
ArgGln: 1.515 ± 0.644
ArgArg: 2.121 ± 0.837
ArgSer: 3.635 ± 1.277
ArgThr: 1.515 ± 0.569
ArgVal: 4.544 ± 1.003
ArgTrp: 1.212 ± 0.439
ArgTyr: 1.212 ± 0.457
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.332 ± 1.064
SerCys: 0.606 ± 0.361
SerAsp: 5.453 ± 1.644
SerGlu: 4.544 ± 1.122
SerPhe: 3.332 ± 0.8
SerGly: 2.121 ± 0.64
SerHis: 1.212 ± 0.675
SerIle: 8.482 ± 2.08
SerLys: 5.453 ± 0.767
SerLeu: 7.573 ± 1.41
SerMet: 2.121 ± 0.626
SerAsn: 4.544 ± 1.183
SerPro: 1.818 ± 0.736
SerGln: 2.726 ± 0.525
SerArg: 4.544 ± 0.971
SerSer: 6.968 ± 1.925
SerThr: 4.544 ± 1.124
SerVal: 4.544 ± 0.761
SerTrp: 0.303 ± 0.245
SerTyr: 3.332 ± 1.026
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.635 ± 0.864
ThrCys: 0.303 ± 0.245
ThrAsp: 5.15 ± 1.101
ThrGlu: 3.029 ± 1.194
ThrPhe: 2.424 ± 0.773
ThrGly: 1.818 ± 0.779
ThrHis: 0.606 ± 0.382
ThrIle: 6.665 ± 1.157
ThrLys: 3.635 ± 1.137
ThrLeu: 10.3 ± 1.78
ThrMet: 1.818 ± 0.829
ThrAsn: 4.544 ± 1.578
ThrPro: 1.515 ± 0.579
ThrGln: 2.424 ± 0.417
ThrArg: 1.818 ± 0.819
ThrSer: 5.453 ± 1.598
ThrThr: 5.453 ± 1.49
ThrVal: 3.332 ± 0.996
ThrTrp: 0.606 ± 0.412
ThrTyr: 2.424 ± 0.905
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.544 ± 1.052
ValCys: 2.121 ± 0.636
ValAsp: 3.332 ± 1.149
ValGlu: 4.847 ± 1.198
ValPhe: 2.424 ± 0.672
ValGly: 2.424 ± 0.617
ValHis: 0.0 ± 0.0
ValIle: 4.544 ± 1.015
ValLys: 4.544 ± 1.104
ValLeu: 6.968 ± 1.116
ValMet: 1.818 ± 0.661
ValAsn: 5.15 ± 1.236
ValPro: 0.909 ± 0.44
ValGln: 1.212 ± 0.554
ValArg: 1.818 ± 0.784
ValSer: 3.029 ± 1.155
ValThr: 3.938 ± 0.919
ValVal: 3.029 ± 0.848
ValTrp: 0.303 ± 0.295
ValTyr: 1.212 ± 0.457
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.303 ± 0.309
TrpCys: 0.909 ± 0.886
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.303 ± 0.308
TrpPhe: 0.303 ± 0.295
TrpGly: 0.303 ± 0.286
TrpHis: 0.0 ± 0.0
TrpIle: 0.606 ± 0.341
TrpLys: 2.424 ± 0.768
TrpLeu: 1.818 ± 0.527
TrpMet: 0.303 ± 0.286
TrpAsn: 1.515 ± 0.866
TrpPro: 0.303 ± 0.245
TrpGln: 0.909 ± 0.56
TrpArg: 0.303 ± 0.295
TrpSer: 0.303 ± 0.286
TrpThr: 0.606 ± 0.375
TrpVal: 0.303 ± 0.271
TrpTrp: 0.303 ± 0.245
TrpTyr: 1.515 ± 0.386
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.332 ± 0.682
TyrCys: 0.909 ± 0.423
TyrAsp: 5.15 ± 1.16
TyrGlu: 3.332 ± 1.023
TyrPhe: 2.424 ± 0.821
TyrGly: 1.212 ± 0.604
TyrHis: 0.303 ± 0.295
TyrIle: 2.121 ± 0.585
TyrLys: 3.029 ± 0.914
TyrLeu: 3.029 ± 1.223
TyrMet: 0.909 ± 0.596
TyrAsn: 4.544 ± 1.275
TyrPro: 1.818 ± 0.497
TyrGln: 1.515 ± 0.437
TyrArg: 2.424 ± 0.771
TyrSer: 4.544 ± 1.552
TyrThr: 3.029 ± 1.315
TyrVal: 2.726 ± 0.781
TyrTrp: 0.606 ± 0.377
TyrTyr: 3.635 ± 1.401
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 11 proteins (3302 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
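A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the procedure used for this page: the helper names and toy sequences are hypothetical, and reporting the standard deviation of the resampled estimates as the error is an assumption, since the page does not say which spread measure it uses.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs in one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Spread of the per mille frequency of `pair` over protein resamples."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the sample
        # size equal to the number of proteins in the proteome.
        sample = [rng.choice(per_protein) for _ in per_protein]
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the error is the standard deviation of the estimates.
    return statistics.stdev(estimates)


# Hypothetical toy sequences, not rotavirus data.
toy = ["MKLLA", "MKT", "AAAA"]
error_mk = bootstrap_error(toy, "MK")
error_ww = bootstrap_error(toy, "WW")
```

A dipeptide that occurs in no protein has a frequency of zero in every resample, so its error is also zero, which is consistent with the "0.0 ± 0.0" entries in the table.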

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski