Amino acid dipeptide frequency for Beihai mantis shrimp virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
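As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping pairs of adjacent residues in a set of protein sequences and reports them in per mille. It is not the code used to build this table: the function name, the one-letter sequence input, and the choice not to count pairs that span two proteins are all assumptions made for the example.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for one-letter sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2. Assumption: a pair never spans
        # the boundary between two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not taken from this proteome).
freqs = dipeptide_frequencies(["MKVLA", "AKVL"])
```

In the toy input there are seven dipeptides in total, and "KV" occurs twice, so its frequency is 2/7 of 1000‰.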

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
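The expected value and the over/under-representation rule can be sketched as follows. This is a minimal example assuming a uniform expectation of 1/400 (or 1/441 when Xaa is included); the function names are hypothetical, and the exact threshold used for the colouring on this page is not stated, so treat the comparison as an assumption.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2

def classify(freq_per_mille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if freq_per_mille > expected:
        return "over"    # would be shown in red
    if freq_per_mille < expected:
        return "under"   # would be shown in blue
    return "expected"

# Values taken from the table: AlaAla = 6.384, AlaCys = 0.871.
labels = (classify(6.384), classify(0.871))
```

With 20 amino acids the expectation is 2.5‰, so AlaAla (6.384‰) is labelled over-represented and AlaCys (0.871‰) under-represented.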

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
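For readers who prefer fractions or percentages, the unit conversion is a single division. The helper names below are hypothetical and only illustrate the arithmetic.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000.0

def per_mille_to_percent(value):
    """Convert a per mille value to a percentage."""
    return value / 10.0

# AlaAla from the table: 6.384 per mille.
ala_ala_fraction = per_mille_to_fraction(6.384)
ala_ala_percent = per_mille_to_percent(6.384)
```

For example, 6.384‰ corresponds to a fraction of about 0.006384, or about 0.6384%.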

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
6.384AlaAla: 6.384 ± 2.079
0.871AlaCys: 0.871 ± 0.613
4.063AlaAsp: 4.063 ± 0.689
5.514AlaGlu: 5.514 ± 1.78
2.612AlaPhe: 2.612 ± 1.712
3.772AlaGly: 3.772 ± 2.542
0.871AlaHis: 0.871 ± 0.836
3.772AlaIle: 3.772 ± 1.184
7.255AlaLys: 7.255 ± 0.937
4.933AlaLeu: 4.933 ± 1.144
2.031AlaMet: 2.031 ± 1.104
3.192AlaAsn: 3.192 ± 1.298
3.192AlaPro: 3.192 ± 0.575
2.612AlaGln: 2.612 ± 0.666
3.192AlaArg: 3.192 ± 0.878
4.063AlaSer: 4.063 ± 0.849
6.094AlaThr: 6.094 ± 3.575
6.094AlaVal: 6.094 ± 0.856
0.0AlaTrp: 0.0 ± 0.0
2.031AlaTyr: 2.031 ± 0.687
0.0AlaXaa: 0.0 ± 0.0
Cys
1.161CysAla: 1.161 ± 0.733
0.58CysCys: 0.58 ± 0.315
1.741CysAsp: 1.741 ± 1.858
2.902CysGlu: 2.902 ± 1.093
0.871CysPhe: 0.871 ± 0.473
0.871CysGly: 0.871 ± 0.473
0.871CysHis: 0.871 ± 0.473
1.161CysIle: 1.161 ± 0.446
1.741CysLys: 1.741 ± 0.917
1.451CysLeu: 1.451 ± 0.67
0.58CysMet: 0.58 ± 0.315
0.58CysAsn: 0.58 ± 0.315
0.58CysPro: 0.58 ± 0.315
0.58CysGln: 0.58 ± 0.728
0.29CysArg: 0.29 ± 0.158
0.58CysSer: 0.58 ± 0.579
1.161CysThr: 1.161 ± 0.446
2.322CysVal: 2.322 ± 1.261
0.29CysTrp: 0.29 ± 0.647
1.451CysTyr: 1.451 ± 0.788
0.0CysXaa: 0.0 ± 0.0
Asp
4.933AspAla: 4.933 ± 2.579
1.741AspCys: 1.741 ± 0.623
3.772AspAsp: 3.772 ± 0.938
6.094AspGlu: 6.094 ± 2.142
2.612AspPhe: 2.612 ± 0.765
4.353AspGly: 4.353 ± 1.866
0.871AspHis: 0.871 ± 1.218
2.902AspIle: 2.902 ± 1.144
3.192AspLys: 3.192 ± 1.412
6.965AspLeu: 6.965 ± 1.286
1.741AspMet: 1.741 ± 0.706
2.322AspAsn: 2.322 ± 2.761
3.772AspPro: 3.772 ± 1.412
1.161AspGln: 1.161 ± 0.414
1.451AspArg: 1.451 ± 0.511
2.322AspSer: 2.322 ± 1.474
2.902AspThr: 2.902 ± 1.144
4.353AspVal: 4.353 ± 1.217
0.29AspTrp: 0.29 ± 0.158
0.871AspTyr: 0.871 ± 0.699
0.0AspXaa: 0.0 ± 0.0
Glu
4.353GluAla: 4.353 ± 1.036
0.29GluCys: 0.29 ± 0.158
4.353GluAsp: 4.353 ± 1.617
6.674GluGlu: 6.674 ± 1.52
3.772GluPhe: 3.772 ± 1.334
6.384GluGly: 6.384 ± 0.772
1.451GluHis: 1.451 ± 0.846
4.353GluIle: 4.353 ± 0.731
4.933GluLys: 4.933 ± 2.084
4.933GluLeu: 4.933 ± 1.472
1.741GluMet: 1.741 ± 1.086
3.192GluAsn: 3.192 ± 0.916
1.161GluPro: 1.161 ± 0.501
3.192GluGln: 3.192 ± 1.349
3.192GluArg: 3.192 ± 0.878
3.482GluSer: 3.482 ± 1.075
2.322GluThr: 2.322 ± 0.61
7.835GluVal: 7.835 ± 1.637
0.58GluTrp: 0.58 ± 0.315
2.322GluTyr: 2.322 ± 1.526
0.0GluXaa: 0.0 ± 0.0
Phe
2.612PheAla: 2.612 ± 0.784
0.58PheCys: 0.58 ± 0.315
2.612PheAsp: 2.612 ± 0.322
2.612PheGlu: 2.612 ± 0.997
1.451PhePhe: 1.451 ± 0.818
2.031PheGly: 2.031 ± 0.82
0.0PheHis: 0.0 ± 0.0
2.322PheIle: 2.322 ± 0.343
2.322PheLys: 2.322 ± 0.974
4.353PheLeu: 4.353 ± 0.568
0.871PheMet: 0.871 ± 0.822
2.322PheAsn: 2.322 ± 0.58
1.741PhePro: 1.741 ± 0.814
2.322PheGln: 2.322 ± 0.771
2.322PheArg: 2.322 ± 0.83
3.192PheSer: 3.192 ± 0.98
4.063PheThr: 4.063 ± 2.469
3.772PheVal: 3.772 ± 0.577
0.58PheTrp: 0.58 ± 0.315
1.451PheTyr: 1.451 ± 0.92
0.0PheXaa: 0.0 ± 0.0
Gly
2.612GlyAla: 2.612 ± 0.803
0.58GlyCys: 0.58 ± 0.315
3.482GlyAsp: 3.482 ± 1.015
3.772GlyGlu: 3.772 ± 0.883
2.322GlyPhe: 2.322 ± 0.563
4.063GlyGly: 4.063 ± 0.626
2.612GlyHis: 2.612 ± 0.997
3.772GlyIle: 3.772 ± 1.228
7.545GlyLys: 7.545 ± 3.174
2.612GlyLeu: 2.612 ± 1.508
1.741GlyMet: 1.741 ± 0.538
2.612GlyAsn: 2.612 ± 1.117
1.451GlyPro: 1.451 ± 0.945
2.612GlyGln: 2.612 ± 0.923
3.772GlyArg: 3.772 ± 1.381
2.031GlySer: 2.031 ± 0.885
3.772GlyThr: 3.772 ± 1.56
6.384GlyVal: 6.384 ± 1.165
0.29GlyTrp: 0.29 ± 0.5
1.741GlyTyr: 1.741 ± 0.79
0.0GlyXaa: 0.0 ± 0.0
His
2.322HisAla: 2.322 ± 1.662
1.161HisCys: 1.161 ± 0.675
1.161HisAsp: 1.161 ± 0.794
0.58HisGlu: 0.58 ± 0.315
0.58HisPhe: 0.58 ± 0.315
1.161HisGly: 1.161 ± 0.794
0.29HisHis: 0.29 ± 0.158
0.871HisIle: 0.871 ± 0.424
1.741HisLys: 1.741 ± 0.746
1.741HisLeu: 1.741 ± 0.706
1.451HisMet: 1.451 ± 0.715
0.29HisAsn: 0.29 ± 0.158
0.871HisPro: 0.871 ± 0.473
0.871HisGln: 0.871 ± 0.473
1.741HisArg: 1.741 ± 0.917
2.612HisSer: 2.612 ± 0.769
0.871HisThr: 0.871 ± 0.424
1.161HisVal: 1.161 ± 0.563
0.58HisTrp: 0.58 ± 0.448
0.58HisTyr: 0.58 ± 0.315
0.0HisXaa: 0.0 ± 0.0
Ile
4.353IleAla: 4.353 ± 1.128
2.031IleCys: 2.031 ± 1.104
3.192IleAsp: 3.192 ± 0.84
2.612IleGlu: 2.612 ± 0.746
1.161IlePhe: 1.161 ± 0.631
3.192IleGly: 3.192 ± 0.614
1.451IleHis: 1.451 ± 0.564
1.741IleIle: 1.741 ± 0.891
1.451IleLys: 1.451 ± 0.592
3.192IleLeu: 3.192 ± 0.728
0.58IleMet: 0.58 ± 0.414
2.902IleAsn: 2.902 ± 1.859
3.772IlePro: 3.772 ± 0.707
1.451IleGln: 1.451 ± 0.487
2.612IleArg: 2.612 ± 1.427
2.612IleSer: 2.612 ± 0.634
2.902IleThr: 2.902 ± 1.217
4.063IleVal: 4.063 ± 0.687
0.0IleTrp: 0.0 ± 0.0
0.58IleTyr: 0.58 ± 0.315
0.0IleXaa: 0.0 ± 0.0
Lys
5.804LysAla: 5.804 ± 1.04
2.031LysCys: 2.031 ± 0.814
4.933LysAsp: 4.933 ± 1.764
2.902LysGlu: 2.902 ± 1.263
2.031LysPhe: 2.031 ± 0.757
4.643LysGly: 4.643 ± 2.523
1.161LysHis: 1.161 ± 0.563
3.482LysIle: 3.482 ± 1.411
5.223LysLys: 5.223 ± 1.996
8.125LysLeu: 8.125 ± 1.977
2.031LysMet: 2.031 ± 0.728
3.482LysAsn: 3.482 ± 1.278
2.902LysPro: 2.902 ± 0.985
2.322LysGln: 2.322 ± 0.693
2.612LysArg: 2.612 ± 0.978
3.482LysSer: 3.482 ± 1.094
3.772LysThr: 3.772 ± 1.157
5.804LysVal: 5.804 ± 1.899
1.161LysTrp: 1.161 ± 0.414
1.451LysTyr: 1.451 ± 0.715
0.0LysXaa: 0.0 ± 0.0
Leu
5.514LeuAla: 5.514 ± 1.391
2.612LeuCys: 2.612 ± 1.117
5.223LeuAsp: 5.223 ± 1.994
2.902LeuGlu: 2.902 ± 0.817
5.223LeuPhe: 5.223 ± 1.206
3.772LeuGly: 3.772 ± 0.739
0.871LeuHis: 0.871 ± 0.822
1.741LeuIle: 1.741 ± 0.623
5.223LeuLys: 5.223 ± 1.101
7.545LeuLeu: 7.545 ± 3.429
1.741LeuMet: 1.741 ± 0.896
3.192LeuAsn: 3.192 ± 1.342
4.643LeuPro: 4.643 ± 1.243
5.223LeuGln: 5.223 ± 1.033
6.674LeuArg: 6.674 ± 1.892
6.384LeuSer: 6.384 ± 1.355
5.804LeuThr: 5.804 ± 3.032
4.643LeuVal: 4.643 ± 1.713
0.58LeuTrp: 0.58 ± 0.947
3.772LeuTyr: 3.772 ± 1.867
0.0LeuXaa: 0.0 ± 0.0
Met
1.161MetAla: 1.161 ± 0.522
0.29MetCys: 0.29 ± 0.647
1.451MetAsp: 1.451 ± 0.487
4.063MetGlu: 4.063 ± 1.615
0.29MetPhe: 0.29 ± 0.5
0.58MetGly: 0.58 ± 0.315
0.871MetHis: 0.871 ± 0.549
0.58MetIle: 0.58 ± 0.315
1.741MetLys: 1.741 ± 0.946
1.451MetLeu: 1.451 ± 0.788
0.871MetMet: 0.871 ± 0.473
1.161MetAsn: 1.161 ± 0.935
0.871MetPro: 0.871 ± 0.473
0.58MetGln: 0.58 ± 0.436
2.612MetArg: 2.612 ± 1.022
2.031MetSer: 2.031 ± 1.104
1.741MetThr: 1.741 ± 1.512
2.031MetVal: 2.031 ± 0.719
0.29MetTrp: 0.29 ± 0.158
0.871MetTyr: 0.871 ± 0.671
0.0MetXaa: 0.0 ± 0.0
Asn
5.514AsnAla: 5.514 ± 1.611
1.451AsnCys: 1.451 ± 0.735
2.612AsnAsp: 2.612 ± 0.723
2.612AsnGlu: 2.612 ± 0.784
2.612AsnPhe: 2.612 ± 1.117
0.871AsnGly: 0.871 ± 0.473
2.322AsnHis: 2.322 ± 1.002
1.161AsnIle: 1.161 ± 0.446
2.902AsnLys: 2.902 ± 0.75
3.482AsnLeu: 3.482 ± 1.333
0.58AsnMet: 0.58 ± 0.315
2.612AsnAsn: 2.612 ± 0.759
1.161AsnPro: 1.161 ± 0.563
1.741AsnGln: 1.741 ± 0.543
1.161AsnArg: 1.161 ± 0.873
2.031AsnSer: 2.031 ± 2.193
2.322AsnThr: 2.322 ± 1.208
4.353AsnVal: 4.353 ± 0.967
1.451AsnTrp: 1.451 ± 0.487
1.161AsnTyr: 1.161 ± 0.501
0.0AsnXaa: 0.0 ± 0.0
Pro
3.192ProAla: 3.192 ± 2.602
0.58ProCys: 0.58 ± 0.315
3.482ProAsp: 3.482 ± 1.35
3.192ProGlu: 3.192 ± 0.823
1.161ProPhe: 1.161 ± 0.522
2.031ProGly: 2.031 ± 0.85
0.871ProHis: 0.871 ± 0.885
2.612ProIle: 2.612 ± 1.588
2.902ProLys: 2.902 ± 0.817
2.322ProLeu: 2.322 ± 1.479
1.161ProMet: 1.161 ± 0.631
2.322ProAsn: 2.322 ± 0.855
1.161ProPro: 1.161 ± 0.873
1.451ProGln: 1.451 ± 0.818
2.902ProArg: 2.902 ± 0.91
1.451ProSer: 1.451 ± 0.518
2.031ProThr: 2.031 ± 0.879
3.192ProVal: 3.192 ± 0.575
0.58ProTrp: 0.58 ± 0.436
2.031ProTyr: 2.031 ± 0.814
0.0ProXaa: 0.0 ± 0.0
Gln
1.451GlnAla: 1.451 ± 0.909
0.58GlnCys: 0.58 ± 0.947
2.322GlnAsp: 2.322 ± 0.829
3.482GlnGlu: 3.482 ± 1.094
2.612GlnPhe: 2.612 ± 0.749
3.772GlnGly: 3.772 ± 1.27
0.871GlnHis: 0.871 ± 0.473
2.612GlnIle: 2.612 ± 0.749
1.741GlnLys: 1.741 ± 0.64
4.353GlnLeu: 4.353 ± 1.302
1.161GlnMet: 1.161 ± 0.442
1.451GlnAsn: 1.451 ± 0.818
2.031GlnPro: 2.031 ± 0.482
3.482GlnGln: 3.482 ± 2.503
3.482GlnArg: 3.482 ± 1.179
3.192GlnSer: 3.192 ± 0.59
2.902GlnThr: 2.902 ± 0.644
2.031GlnVal: 2.031 ± 1.402
0.29GlnTrp: 0.29 ± 0.158
0.58GlnTyr: 0.58 ± 0.315
0.0GlnXaa: 0.0 ± 0.0
Arg
3.772ArgAla: 3.772 ± 1.157
0.871ArgCys: 0.871 ± 0.985
2.612ArgAsp: 2.612 ± 1.309
3.772ArgGlu: 3.772 ± 0.577
2.322ArgPhe: 2.322 ± 1.052
3.192ArgGly: 3.192 ± 0.616
2.031ArgHis: 2.031 ± 1.104
1.741ArgIle: 1.741 ± 0.623
3.192ArgLys: 3.192 ± 1.734
4.643ArgLeu: 4.643 ± 1.97
1.161ArgMet: 1.161 ± 0.631
2.031ArgAsn: 2.031 ± 1.104
1.451ArgPro: 1.451 ± 0.867
2.031ArgGln: 2.031 ± 0.482
3.482ArgArg: 3.482 ± 1.015
3.772ArgSer: 3.772 ± 1.206
2.902ArgThr: 2.902 ± 2.436
5.514ArgVal: 5.514 ± 0.711
2.031ArgTrp: 2.031 ± 0.509
3.772ArgTyr: 3.772 ± 1.11
0.0ArgXaa: 0.0 ± 0.0
Ser
3.482SerAla: 3.482 ± 1.015
2.031SerCys: 2.031 ± 0.814
3.772SerAsp: 3.772 ± 0.658
4.353SerGlu: 4.353 ± 1.472
4.643SerPhe: 4.643 ± 2.038
3.482SerGly: 3.482 ± 0.924
1.451SerHis: 1.451 ± 0.518
2.902SerIle: 2.902 ± 0.791
5.223SerLys: 5.223 ± 1.649
3.482SerLeu: 3.482 ± 1.48
0.871SerMet: 0.871 ± 0.671
2.031SerAsn: 2.031 ± 0.719
1.741SerPro: 1.741 ± 1.299
1.741SerGln: 1.741 ± 0.623
4.353SerArg: 4.353 ± 1.032
3.772SerSer: 3.772 ± 1.882
2.322SerThr: 2.322 ± 1.288
4.643SerVal: 4.643 ± 0.735
1.161SerTrp: 1.161 ± 0.501
2.031SerTyr: 2.031 ± 1.052
0.0SerXaa: 0.0 ± 0.0
Thr
5.514ThrAla: 5.514 ± 2.544
1.451ThrCys: 1.451 ± 1.402
2.031ThrAsp: 2.031 ± 1.007
2.612ThrGlu: 2.612 ± 0.723
2.902ThrPhe: 2.902 ± 1.735
2.612ThrGly: 2.612 ± 1.44
1.451ThrHis: 1.451 ± 0.487
1.451ThrIle: 1.451 ± 0.487
3.192ThrLys: 3.192 ± 1.404
4.063ThrLeu: 4.063 ± 1.696
1.451ThrMet: 1.451 ± 1.841
3.192ThrAsn: 3.192 ± 1.728
2.612ThrPro: 2.612 ± 0.519
2.031ThrGln: 2.031 ± 0.821
2.322ThrArg: 2.322 ± 0.937
5.804ThrSer: 5.804 ± 2.384
2.902ThrThr: 2.902 ± 1.628
4.353ThrVal: 4.353 ± 1.243
0.58ThrTrp: 0.58 ± 0.947
2.031ThrTyr: 2.031 ± 0.846
0.0ThrXaa: 0.0 ± 0.0
Val
5.514ValAla: 5.514 ± 1.236
1.451ValCys: 1.451 ± 0.618
3.772ValAsp: 3.772 ± 2.683
7.255ValGlu: 7.255 ± 1.31
2.902ValPhe: 2.902 ± 0.817
5.514ValGly: 5.514 ± 1.493
1.741ValHis: 1.741 ± 0.814
4.643ValIle: 4.643 ± 1.697
4.933ValLys: 4.933 ± 0.687
8.706ValLeu: 8.706 ± 1.276
2.902ValMet: 2.902 ± 1.185
3.772ValAsn: 3.772 ± 0.921
3.482ValPro: 3.482 ± 0.827
5.223ValGln: 5.223 ± 1.365
5.223ValArg: 5.223 ± 2.153
4.063ValSer: 4.063 ± 1.141
1.741ValThr: 1.741 ± 0.543
6.965ValVal: 6.965 ± 1.528
1.161ValTrp: 1.161 ± 0.631
1.451ValTyr: 1.451 ± 0.945
0.0ValXaa: 0.0 ± 0.0
Trp
0.58TrpAla: 0.58 ± 0.436
0.0TrpCys: 0.0 ± 0.0
0.58TrpAsp: 0.58 ± 0.734
0.29TrpGlu: 0.29 ± 0.158
0.58TrpPhe: 0.58 ± 0.315
0.29TrpGly: 0.29 ± 0.158
0.0TrpHis: 0.0 ± 0.0
0.871TrpIle: 0.871 ± 0.473
1.451TrpLys: 1.451 ± 0.715
2.031TrpLeu: 2.031 ± 0.821
0.0TrpMet: 0.0 ± 0.0
0.871TrpAsn: 0.871 ± 0.424
0.58TrpPro: 0.58 ± 0.448
0.871TrpGln: 0.871 ± 0.473
0.0TrpArg: 0.0 ± 0.0
1.161TrpSer: 1.161 ± 0.446
0.29TrpThr: 0.29 ± 0.647
0.58TrpVal: 0.58 ± 0.457
0.29TrpTrp: 0.29 ± 0.537
1.161TrpTyr: 1.161 ± 0.675
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.322TyrAla: 2.322 ± 0.58
0.58TyrCys: 0.58 ± 0.822
1.451TyrAsp: 1.451 ± 0.564
2.612TyrGlu: 2.612 ± 1.131
0.871TyrPhe: 0.871 ± 0.395
3.192TyrGly: 3.192 ± 1.375
0.58TyrHis: 0.58 ± 0.703
1.161TyrIle: 1.161 ± 0.414
1.451TyrLys: 1.451 ± 1.637
2.902TyrLeu: 2.902 ± 1.144
0.58TyrMet: 0.58 ± 0.315
0.58TyrAsn: 0.58 ± 0.436
1.451TyrPro: 1.451 ± 1.365
2.902TyrGln: 2.902 ± 2.565
2.902TyrArg: 2.902 ± 0.894
1.451TyrSer: 1.451 ± 0.518
1.741TyrThr: 1.741 ± 0.814
2.322TyrVal: 2.322 ± 0.8
0.29TyrTrp: 0.29 ± 0.158
0.871TyrTyr: 0.871 ± 0.671
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
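Each cell in the listing above is written as the value, the dipeptide name, and then the value again with its error (for example "6.384AlaAla: 6.384 ± 2.079"). The sketch below shows one way such a line could be parsed into its parts; the regular expression and function name are assumptions for illustration and are not part of the database.

```python
import re

# Leading value, two three-letter residue codes, then "value ± error".
CELL = re.compile(
    r"^[\d.]+([A-Z][a-z]{2})([A-Z][a-z]{2}): ([\d.]+) ± ([\d.]+)$"
)

def parse_cell(line):
    """Return (first, second, value, error), or None for non-cell lines."""
    match = CELL.match(line.strip())
    if match is None:
        return None  # e.g. a row label such as "Ala"
    first, second, value, error = match.groups()
    return first, second, float(value), float(error)

cell = parse_cell("6.384AlaAla: 6.384 ± 2.079")
```

Row labels such as "Ala" do not match the pattern and return None, so they can be used to detect where a new row starts.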
Statistics based on 6 proteins (3447 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
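Bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is taken as the error. This is a minimal sketch under stated assumptions, not the database's implementation: the function names, the fixed seed, the overlapping-window counting, and the use of the sample standard deviation are all choices made for the example.

```python
import random
import statistics
from collections import Counter

def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide over overlapping windows."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the example repeatable
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)

# Toy input (hypothetical sequences, not taken from this proteome).
toy = ["MKVLAKV", "AKVLL", "MLLKA", "GGAKV"]
err = bootstrap_error(toy, "KV")
```

Because whole proteins are resampled, a dipeptide that is concentrated in one protein tends to receive a larger error than one spread evenly across all proteins.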

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski