Amino acid dipepetide frequency for Menghai rhabdovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.189AlaAla: 1.189 ± 0.503
0.297AlaCys: 0.297 ± 0.484
2.379AlaAsp: 2.379 ± 0.306
2.081AlaGlu: 2.081 ± 0.581
0.595AlaPhe: 0.595 ± 0.376
0.595AlaGly: 0.595 ± 0.755
1.189AlaHis: 1.189 ± 0.424
1.784AlaIle: 1.784 ± 0.528
2.676AlaLys: 2.676 ± 1.654
5.65AlaLeu: 5.65 ± 2.938
1.487AlaMet: 1.487 ± 0.739
3.271AlaAsn: 3.271 ± 0.885
1.487AlaPro: 1.487 ± 1.483
1.189AlaGln: 1.189 ± 0.706
2.081AlaArg: 2.081 ± 1.125
3.568AlaSer: 3.568 ± 1.924
1.487AlaThr: 1.487 ± 0.291
2.379AlaVal: 2.379 ± 0.955
1.487AlaTrp: 1.487 ± 0.75
1.189AlaTyr: 1.189 ± 1.126
0.0AlaXaa: 0.0 ± 0.0
Cys
2.379CysAla: 2.379 ± 1.567
0.0CysCys: 0.0 ± 0.0
0.892CysAsp: 0.892 ± 0.529
0.595CysGlu: 0.595 ± 0.689
0.595CysPhe: 0.595 ± 0.353
1.189CysGly: 1.189 ± 0.826
0.0CysHis: 0.0 ± 0.0
1.189CysIle: 1.189 ± 0.474
0.892CysLys: 0.892 ± 0.474
2.676CysLeu: 2.676 ± 0.743
0.0CysMet: 0.0 ± 0.0
0.892CysAsn: 0.892 ± 0.799
0.297CysPro: 0.297 ± 0.44
0.0CysGln: 0.0 ± 0.0
0.297CysArg: 0.297 ± 0.176
0.297CysSer: 0.297 ± 0.176
0.892CysThr: 0.892 ± 0.39
0.892CysVal: 0.892 ± 0.639
0.297CysTrp: 0.297 ± 0.176
0.595CysTyr: 0.595 ± 0.879
0.0CysXaa: 0.0 ± 0.0
Asp
1.784AspAla: 1.784 ± 0.826
1.487AspCys: 1.487 ± 0.981
3.271AspAsp: 3.271 ± 2.272
4.758AspGlu: 4.758 ± 1.752
3.568AspPhe: 3.568 ± 1.019
3.271AspGly: 3.271 ± 1.168
2.379AspHis: 2.379 ± 0.955
6.839AspIle: 6.839 ± 2.011
8.326AspLys: 8.326 ± 1.749
4.758AspLeu: 4.758 ± 1.203
1.487AspMet: 1.487 ± 0.485
4.163AspAsn: 4.163 ± 1.304
2.974AspPro: 2.974 ± 0.617
1.487AspGln: 1.487 ± 0.917
2.676AspArg: 2.676 ± 0.895
3.866AspSer: 3.866 ± 1.079
0.595AspThr: 0.595 ± 0.353
2.379AspVal: 2.379 ± 1.448
0.297AspTrp: 0.297 ± 0.176
3.271AspTyr: 3.271 ± 1.036
0.0AspXaa: 0.0 ± 0.0
Glu
1.487GluAla: 1.487 ± 1.092
0.892GluCys: 0.892 ± 0.508
4.163GluAsp: 4.163 ± 1.17
5.352GluGlu: 5.352 ± 1.633
3.271GluPhe: 3.271 ± 1.21
2.974GluGly: 2.974 ± 0.658
1.189GluHis: 1.189 ± 0.636
7.731GluIle: 7.731 ± 2.137
5.352GluLys: 5.352 ± 1.899
5.947GluLeu: 5.947 ± 1.356
0.892GluMet: 0.892 ± 1.361
4.163GluAsn: 4.163 ± 1.218
1.784GluPro: 1.784 ± 0.674
2.081GluGln: 2.081 ± 0.605
2.676GluArg: 2.676 ± 0.763
4.758GluSer: 4.758 ± 1.226
4.163GluThr: 4.163 ± 1.169
5.055GluVal: 5.055 ± 1.446
1.189GluTrp: 1.189 ± 0.351
3.568GluTyr: 3.568 ± 0.717
0.0GluXaa: 0.0 ± 0.0
Phe
1.189PheAla: 1.189 ± 0.351
0.892PheCys: 0.892 ± 0.39
3.866PheAsp: 3.866 ± 1.121
2.676PheGlu: 2.676 ± 0.743
1.487PhePhe: 1.487 ± 0.6
1.784PheGly: 1.784 ± 0.779
0.297PheHis: 0.297 ± 0.44
3.271PheIle: 3.271 ± 1.119
5.65PheLys: 5.65 ± 1.382
4.163PheLeu: 4.163 ± 1.656
0.595PheMet: 0.595 ± 0.339
2.081PheAsn: 2.081 ± 0.689
1.487PhePro: 1.487 ± 0.548
0.297PheGln: 0.297 ± 0.176
2.081PheArg: 2.081 ± 0.661
3.271PheSer: 3.271 ± 0.909
1.487PheThr: 1.487 ± 1.003
2.081PheVal: 2.081 ± 0.882
0.297PheTrp: 0.297 ± 0.176
2.081PheTyr: 2.081 ± 0.764
0.0PheXaa: 0.0 ± 0.0
Gly
1.189GlyAla: 1.189 ± 0.424
0.297GlyCys: 0.297 ± 0.44
2.676GlyAsp: 2.676 ± 0.948
2.676GlyGlu: 2.676 ± 2.839
2.676GlyPhe: 2.676 ± 0.917
2.379GlyGly: 2.379 ± 1.067
0.892GlyHis: 0.892 ± 0.529
4.758GlyIle: 4.758 ± 0.959
2.974GlyLys: 2.974 ± 0.761
4.758GlyLeu: 4.758 ± 0.728
1.189GlyMet: 1.189 ± 0.784
2.676GlyAsn: 2.676 ± 0.378
1.784GlyPro: 1.784 ± 0.807
0.892GlyGln: 0.892 ± 0.39
2.379GlyArg: 2.379 ± 0.584
2.974GlySer: 2.974 ± 0.464
1.189GlyThr: 1.189 ± 0.636
2.081GlyVal: 2.081 ± 1.115
0.595GlyTrp: 0.595 ± 0.353
1.487GlyTyr: 1.487 ± 0.6
0.0GlyXaa: 0.0 ± 0.0
His
1.189HisAla: 1.189 ± 0.784
0.297HisCys: 0.297 ± 0.44
1.487HisAsp: 1.487 ± 0.544
0.595HisGlu: 0.595 ± 0.376
0.297HisPhe: 0.297 ± 0.484
0.297HisGly: 0.297 ± 0.176
0.297HisHis: 0.297 ± 0.176
0.595HisIle: 0.595 ± 0.353
1.784HisLys: 1.784 ± 0.407
2.379HisLeu: 2.379 ± 0.691
0.892HisMet: 0.892 ± 0.786
1.189HisAsn: 1.189 ± 0.424
2.081HisPro: 2.081 ± 0.473
0.595HisGln: 0.595 ± 0.353
0.595HisArg: 0.595 ± 0.376
2.081HisSer: 2.081 ± 0.621
1.189HisThr: 1.189 ± 0.351
0.892HisVal: 0.892 ± 0.799
0.595HisTrp: 0.595 ± 0.51
2.081HisTyr: 2.081 ± 1.193
0.0HisXaa: 0.0 ± 0.0
Ile
2.379IleAla: 2.379 ± 0.911
2.081IleCys: 2.081 ± 0.569
6.542IleAsp: 6.542 ± 1.505
5.65IleGlu: 5.65 ± 1.0
3.568IlePhe: 3.568 ± 1.158
5.352IleGly: 5.352 ± 1.431
1.189IleHis: 1.189 ± 0.474
7.136IleIle: 7.136 ± 0.43
10.407IleLys: 10.407 ± 2.446
8.921IleLeu: 8.921 ± 1.713
2.676IleMet: 2.676 ± 0.796
4.46IleAsn: 4.46 ± 1.102
2.676IlePro: 2.676 ± 0.895
2.676IleGln: 2.676 ± 0.62
2.081IleArg: 2.081 ± 0.976
7.731IleSer: 7.731 ± 0.856
2.081IleThr: 2.081 ± 0.904
1.487IleVal: 1.487 ± 0.6
0.595IleTrp: 0.595 ± 0.392
4.163IleTyr: 4.163 ± 0.88
0.0IleXaa: 0.0 ± 0.0
Lys
5.352LysAla: 5.352 ± 1.174
0.892LysCys: 0.892 ± 0.799
5.947LysAsp: 5.947 ± 0.695
8.029LysGlu: 8.029 ± 2.47
4.46LysPhe: 4.46 ± 0.6
5.055LysGly: 5.055 ± 1.179
1.487LysHis: 1.487 ± 0.664
6.839LysIle: 6.839 ± 1.261
9.515LysLys: 9.515 ± 3.58
6.839LysLeu: 6.839 ± 0.971
2.676LysMet: 2.676 ± 1.3
7.731LysAsn: 7.731 ± 0.881
2.379LysPro: 2.379 ± 0.803
1.487LysGln: 1.487 ± 0.291
3.866LysArg: 3.866 ± 0.713
7.136LysSer: 7.136 ± 1.014
7.434LysThr: 7.434 ± 2.46
3.866LysVal: 3.866 ± 1.932
0.297LysTrp: 0.297 ± 0.176
5.947LysTyr: 5.947 ± 1.756
0.0LysXaa: 0.0 ± 0.0
Leu
3.866LeuAla: 3.866 ± 0.777
1.189LeuCys: 1.189 ± 0.351
3.866LeuAsp: 3.866 ± 1.323
6.244LeuGlu: 6.244 ± 0.996
2.974LeuPhe: 2.974 ± 0.611
2.081LeuGly: 2.081 ± 0.44
2.081LeuHis: 2.081 ± 0.314
7.731LeuIle: 7.731 ± 2.251
10.407LeuLys: 10.407 ± 2.479
8.921LeuLeu: 8.921 ± 0.722
3.866LeuMet: 3.866 ± 1.536
7.731LeuAsn: 7.731 ± 2.061
1.784LeuPro: 1.784 ± 0.528
3.271LeuGln: 3.271 ± 1.574
3.568LeuArg: 3.568 ± 0.758
10.407LeuSer: 10.407 ± 2.466
3.568LeuThr: 3.568 ± 1.841
3.866LeuVal: 3.866 ± 1.648
0.297LeuTrp: 0.297 ± 0.176
4.758LeuTyr: 4.758 ± 1.181
0.0LeuXaa: 0.0 ± 0.0
Met
2.081MetAla: 2.081 ± 0.815
0.0MetCys: 0.0 ± 0.0
1.784MetAsp: 1.784 ± 1.175
0.892MetGlu: 0.892 ± 0.456
1.189MetPhe: 1.189 ± 0.706
1.189MetGly: 1.189 ± 0.784
0.892MetHis: 0.892 ± 0.862
2.676MetIle: 2.676 ± 0.378
2.379MetLys: 2.379 ± 0.745
4.46MetLeu: 4.46 ± 1.244
0.892MetMet: 0.892 ± 0.954
3.271MetAsn: 3.271 ± 0.69
1.189MetPro: 1.189 ± 0.539
0.297MetGln: 0.297 ± 0.176
1.487MetArg: 1.487 ± 0.6
1.784MetSer: 1.784 ± 0.79
1.784MetThr: 1.784 ± 0.739
0.892MetVal: 0.892 ± 0.529
0.297MetTrp: 0.297 ± 0.57
0.892MetTyr: 0.892 ± 0.639
0.0MetXaa: 0.0 ± 0.0
Asn
2.379AsnAla: 2.379 ± 1.175
0.595AsnCys: 0.595 ± 0.879
3.866AsnAsp: 3.866 ± 0.634
3.271AsnGlu: 3.271 ± 0.552
1.189AsnPhe: 1.189 ± 0.826
2.081AsnGly: 2.081 ± 1.411
1.487AsnHis: 1.487 ± 0.739
8.623AsnIle: 8.623 ± 2.196
8.029AsnLys: 8.029 ± 1.623
6.839AsnLeu: 6.839 ± 1.857
2.081AsnMet: 2.081 ± 0.863
6.542AsnAsn: 6.542 ± 1.422
2.974AsnPro: 2.974 ± 0.569
2.379AsnGln: 2.379 ± 0.565
2.379AsnArg: 2.379 ± 0.989
5.352AsnSer: 5.352 ± 1.531
3.866AsnThr: 3.866 ± 1.135
2.974AsnVal: 2.974 ± 0.916
0.892AsnTrp: 0.892 ± 0.368
3.568AsnTyr: 3.568 ± 1.75
0.0AsnXaa: 0.0 ± 0.0
Pro
0.892ProAla: 0.892 ± 0.577
0.297ProCys: 0.297 ± 0.176
1.784ProAsp: 1.784 ± 0.329
3.271ProGlu: 3.271 ± 0.595
0.595ProPhe: 0.595 ± 0.834
1.487ProGly: 1.487 ± 0.535
0.595ProHis: 0.595 ± 0.453
2.676ProIle: 2.676 ± 0.834
2.974ProLys: 2.974 ± 0.674
1.784ProLeu: 1.784 ± 0.913
0.892ProMet: 0.892 ± 0.632
1.784ProAsn: 1.784 ± 0.674
0.892ProPro: 0.892 ± 0.456
0.297ProGln: 0.297 ± 0.176
1.487ProArg: 1.487 ± 0.745
2.379ProSer: 2.379 ± 0.686
2.081ProThr: 2.081 ± 0.473
2.676ProVal: 2.676 ± 0.912
0.595ProTrp: 0.595 ± 0.376
2.081ProTyr: 2.081 ± 0.314
0.0ProXaa: 0.0 ± 0.0
Gln
0.892GlnAla: 0.892 ± 0.368
0.297GlnCys: 0.297 ± 0.176
1.784GlnAsp: 1.784 ± 0.583
2.081GlnGlu: 2.081 ± 0.314
0.595GlnPhe: 0.595 ± 0.353
0.892GlnGly: 0.892 ± 0.508
1.189GlnHis: 1.189 ± 0.351
1.487GlnIle: 1.487 ± 1.01
2.081GlnLys: 2.081 ± 1.125
2.081GlnLeu: 2.081 ± 1.235
0.297GlnMet: 0.297 ± 0.176
0.595GlnAsn: 0.595 ± 0.353
1.189GlnPro: 1.189 ± 0.474
0.0GlnGln: 0.0 ± 0.0
1.487GlnArg: 1.487 ± 0.6
2.974GlnSer: 2.974 ± 0.892
1.784GlnThr: 1.784 ± 0.51
2.081GlnVal: 2.081 ± 0.85
0.0GlnTrp: 0.0 ± 0.0
1.189GlnTyr: 1.189 ± 0.523
0.0GlnXaa: 0.0 ± 0.0
Arg
2.081ArgAla: 2.081 ± 0.621
1.784ArgCys: 1.784 ± 0.746
3.271ArgAsp: 3.271 ± 1.297
2.974ArgGlu: 2.974 ± 0.955
2.081ArgPhe: 2.081 ± 1.235
0.892ArgGly: 0.892 ± 0.368
0.595ArgHis: 0.595 ± 0.689
2.081ArgIle: 2.081 ± 0.69
1.487ArgLys: 1.487 ± 0.664
3.271ArgLeu: 3.271 ± 0.771
2.081ArgMet: 2.081 ± 0.569
2.081ArgAsn: 2.081 ± 0.85
1.487ArgPro: 1.487 ± 0.739
1.189ArgGln: 1.189 ± 0.591
0.595ArgArg: 0.595 ± 0.376
2.676ArgSer: 2.676 ± 1.112
2.081ArgThr: 2.081 ± 0.69
1.189ArgVal: 1.189 ± 0.474
0.892ArgTrp: 0.892 ± 0.529
2.081ArgTyr: 2.081 ± 0.661
0.0ArgXaa: 0.0 ± 0.0
Ser
2.379SerAla: 2.379 ± 1.197
0.297SerCys: 0.297 ± 0.176
7.434SerAsp: 7.434 ± 1.146
7.434SerGlu: 7.434 ± 1.06
3.568SerPhe: 3.568 ± 1.505
4.163SerGly: 4.163 ± 1.18
1.487SerHis: 1.487 ± 0.291
5.352SerIle: 5.352 ± 1.0
6.839SerLys: 6.839 ± 1.853
8.623SerLeu: 8.623 ± 2.512
2.081SerMet: 2.081 ± 0.581
4.758SerAsn: 4.758 ± 0.613
2.081SerPro: 2.081 ± 0.774
1.487SerGln: 1.487 ± 0.494
2.379SerArg: 2.379 ± 0.779
8.623SerSer: 8.623 ± 2.141
2.676SerThr: 2.676 ± 1.129
4.163SerVal: 4.163 ± 1.284
1.784SerTrp: 1.784 ± 0.528
4.46SerTyr: 4.46 ± 1.122
0.0SerXaa: 0.0 ± 0.0
Thr
2.379ThrAla: 2.379 ± 1.197
1.784ThrCys: 1.784 ± 0.329
2.081ThrAsp: 2.081 ± 0.791
4.163ThrGlu: 4.163 ± 1.088
2.974ThrPhe: 2.974 ± 1.036
2.081ThrGly: 2.081 ± 0.791
0.892ThrHis: 0.892 ± 0.508
4.163ThrIle: 4.163 ± 1.089
4.163ThrLys: 4.163 ± 0.953
2.379ThrLeu: 2.379 ± 1.004
2.676ThrMet: 2.676 ± 1.588
2.081ThrAsn: 2.081 ± 0.953
0.595ThrPro: 0.595 ± 0.353
1.487ThrGln: 1.487 ± 0.656
0.892ThrArg: 0.892 ± 0.368
3.866ThrSer: 3.866 ± 1.241
1.487ThrThr: 1.487 ± 1.562
3.568ThrVal: 3.568 ± 1.084
1.189ThrTrp: 1.189 ± 0.784
1.487ThrTyr: 1.487 ± 0.544
0.0ThrXaa: 0.0 ± 0.0
Val
1.487ValAla: 1.487 ± 0.767
0.892ValCys: 0.892 ± 0.368
4.46ValAsp: 4.46 ± 2.108
1.784ValGlu: 1.784 ± 0.329
2.379ValPhe: 2.379 ± 0.995
2.081ValGly: 2.081 ± 0.904
0.892ValHis: 0.892 ± 0.577
3.568ValIle: 3.568 ± 0.378
5.352ValLys: 5.352 ± 1.382
3.271ValLeu: 3.271 ± 1.342
1.189ValMet: 1.189 ± 0.523
3.866ValAsn: 3.866 ± 0.602
0.892ValPro: 0.892 ± 0.368
1.784ValGln: 1.784 ± 0.736
0.892ValArg: 0.892 ± 0.39
3.866ValSer: 3.866 ± 1.234
2.379ValThr: 2.379 ± 1.05
1.784ValVal: 1.784 ± 1.059
0.297ValTrp: 0.297 ± 0.44
4.758ValTyr: 4.758 ± 1.684
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.487TrpGlu: 1.487 ± 0.739
0.595TrpPhe: 0.595 ± 0.392
0.297TrpGly: 0.297 ± 0.176
0.297TrpHis: 0.297 ± 0.176
0.892TrpIle: 0.892 ± 0.39
0.892TrpLys: 0.892 ± 0.39
0.595TrpLeu: 0.595 ± 0.392
1.487TrpMet: 1.487 ± 0.639
2.081TrpAsn: 2.081 ± 0.934
0.297TrpPro: 0.297 ± 0.176
0.0TrpGln: 0.0 ± 0.0
0.297TrpArg: 0.297 ± 0.176
0.595TrpSer: 0.595 ± 0.51
1.487TrpThr: 1.487 ± 0.6
1.189TrpVal: 1.189 ± 1.214
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.487TyrAla: 1.487 ± 0.739
0.595TyrCys: 0.595 ± 0.353
2.081TyrAsp: 2.081 ± 0.738
2.974TyrGlu: 2.974 ± 1.061
2.379TyrPhe: 2.379 ± 0.565
2.379TyrGly: 2.379 ± 0.778
2.081TyrHis: 2.081 ± 1.027
4.46TyrIle: 4.46 ± 1.037
5.055TyrLys: 5.055 ± 1.408
4.163TyrLeu: 4.163 ± 1.182
0.595TyrMet: 0.595 ± 0.879
5.947TyrAsn: 5.947 ± 1.2
1.189TyrPro: 1.189 ± 0.474
2.081TyrGln: 2.081 ± 1.059
2.676TyrArg: 2.676 ± 0.52
3.866TyrSer: 3.866 ± 0.648
2.676TyrThr: 2.676 ± 1.019
2.379TyrVal: 2.379 ± 0.744
0.595TyrTrp: 0.595 ± 0.376
2.676TyrTyr: 2.676 ± 0.834
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3364 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski