Amino acid dipeptide frequency for Beihai tiger crab virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
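As an illustration, dipeptide frequencies can be computed along the following lines. This is only a minimal sketch: it assumes that overlapping adjacent pairs are counted within each protein and normalised by the total number of pairs, which may differ from the exact procedure used by Proteome-pI. The function name `dipeptide_per_mille` and the toy sequences are hypothetical, and one-letter codes are used instead of the three-letter codes shown in the table.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping adjacent pairs are counted inside each protein
    (no pair spans two proteins) and divided by the total pair count.
    """
    counts = Counter()
    for seq in proteins:
        # "ACDA" yields the overlapping pairs "AC", "CD", "DA"
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up toy sequences in one-letter code (not taken from the proteome)
toy = ["ACDA", "AAC"]
freqs = dipeptide_per_mille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The order of the two residues matters here, so "AC" and "CA" are counted separately, just as AlaCys and CysAla are separate cells in the table.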

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
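The expected value and the over/under-representation labels can be reproduced with a few lines of arithmetic. The sketch below assumes that the red/blue marking simply compares each value with the uniform expectation; the exact rule used for the colouring is not stated on this page, and the function names are hypothetical.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Label a value relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"   # would be marked in red
    if observed_per_mille < expected:
        return "under"  # would be marked in blue
    return "as expected"


print(expected_per_mille(20))            # 20 x 20 = 400 dipeptides
print(round(expected_per_mille(21), 3))  # 21 x 21 = 441 dipeptides with Xaa
print(classify(9.283))                   # AlaAla value from the table
print(classify(0.96))                    # AlaCys value from the table
```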

All values are given in per mille (‰), so they must be multiplied by 10^-3 to obtain fractions; for example, 9.283‰ corresponds to 0.009283, or 0.9283%.

For more information, see the sequence space article on Wikipedia.

Residue order (rows = first residue, columns = second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.283 ± 0.988
AlaCys: 0.96 ± 0.529
AlaAsp: 2.881 ± 0.876
AlaGlu: 5.762 ± 1.459
AlaPhe: 4.481 ± 1.507
AlaGly: 6.722 ± 1.943
AlaHis: 1.921 ± 1.231
AlaIle: 3.201 ± 1.374
AlaLys: 4.802 ± 0.754
AlaLeu: 6.402 ± 1.936
AlaMet: 1.601 ± 0.642
AlaAsn: 4.481 ± 0.788
AlaPro: 4.161 ± 2.207
AlaGln: 3.521 ± 1.12
AlaArg: 5.122 ± 1.239
AlaSer: 4.481 ± 0.746
AlaThr: 3.841 ± 1.047
AlaVal: 8.003 ± 1.928
AlaTrp: 0.0 ± 0.0
AlaTyr: 3.201 ± 1.126
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.96 ± 0.529
CysCys: 0.0 ± 0.0
CysAsp: 0.32 ± 0.176
CysGlu: 1.601 ± 0.97
CysPhe: 0.96 ± 0.567
CysGly: 1.921 ± 1.058
CysHis: 0.64 ± 0.353
CysIle: 0.32 ± 0.176
CysLys: 0.32 ± 0.176
CysLeu: 2.881 ± 0.771
CysMet: 0.32 ± 0.176
CysAsn: 0.64 ± 0.353
CysPro: 0.32 ± 0.483
CysGln: 0.32 ± 0.65
CysArg: 0.96 ± 0.5
CysSer: 0.32 ± 0.176
CysThr: 2.241 ± 0.902
CysVal: 0.32 ± 0.176
CysTrp: 0.0 ± 0.0
CysTyr: 0.64 ± 0.353
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.442 ± 1.661
AspCys: 0.96 ± 0.529
AspAsp: 3.521 ± 0.807
AspGlu: 2.561 ± 0.984
AspPhe: 2.881 ± 1.422
AspGly: 4.802 ± 2.646
AspHis: 0.96 ± 0.406
AspIle: 3.201 ± 1.177
AspLys: 2.241 ± 1.159
AspLeu: 5.122 ± 1.53
AspMet: 0.32 ± 0.176
AspAsn: 2.241 ± 1.69
AspPro: 2.241 ± 0.872
AspGln: 0.64 ± 0.41
AspArg: 3.201 ± 0.659
AspSer: 3.521 ± 1.589
AspThr: 4.161 ± 0.878
AspVal: 4.481 ± 1.559
AspTrp: 0.96 ± 0.406
AspTyr: 2.561 ± 1.038
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.201 ± 1.85
GluCys: 1.601 ± 0.686
GluAsp: 2.561 ± 1.039
GluGlu: 5.762 ± 1.906
GluPhe: 2.241 ± 0.879
GluGly: 3.201 ± 1.087
GluHis: 1.601 ± 0.686
GluIle: 4.481 ± 2.038
GluLys: 2.241 ± 0.879
GluLeu: 8.323 ± 1.095
GluMet: 0.64 ± 0.852
GluAsn: 2.561 ± 0.653
GluPro: 3.201 ± 0.917
GluGln: 1.601 ± 0.882
GluArg: 4.161 ± 1.545
GluSer: 3.521 ± 1.076
GluThr: 3.521 ± 1.426
GluVal: 4.161 ± 1.489
GluTrp: 1.28 ± 0.519
GluTyr: 1.601 ± 1.257
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.521 ± 1.015
PheCys: 0.96 ± 0.489
PheAsp: 2.241 ± 0.847
PheGlu: 2.241 ± 0.648
PhePhe: 2.881 ± 2.139
PheGly: 2.561 ± 1.348
PheHis: 2.561 ± 1.039
PheIle: 2.241 ± 0.673
PheLys: 2.561 ± 0.751
PheLeu: 4.161 ± 1.951
PheMet: 0.96 ± 1.118
PheAsn: 3.201 ± 1.916
PhePro: 2.881 ± 1.236
PheGln: 1.601 ± 0.587
PheArg: 1.921 ± 0.618
PheSer: 1.921 ± 0.618
PheThr: 2.561 ± 2.318
PheVal: 3.201 ± 1.352
PheTrp: 0.32 ± 0.176
PheTyr: 0.96 ± 0.879
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.161 ± 0.878
GlyCys: 0.32 ± 0.176
GlyAsp: 4.481 ± 0.595
GlyGlu: 3.521 ± 1.057
GlyPhe: 2.881 ± 1.338
GlyGly: 3.521 ± 0.77
GlyHis: 1.601 ± 0.587
GlyIle: 2.881 ± 1.438
GlyLys: 4.481 ± 1.635
GlyLeu: 3.841 ± 0.832
GlyMet: 1.28 ± 1.097
GlyAsn: 1.921 ± 0.726
GlyPro: 2.241 ± 0.847
GlyGln: 2.241 ± 0.691
GlyArg: 5.442 ± 2.566
GlySer: 3.841 ± 4.291
GlyThr: 4.161 ± 3.486
GlyVal: 5.122 ± 1.15
GlyTrp: 1.28 ± 0.654
GlyTyr: 3.201 ± 1.126
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.921 ± 1.37
HisCys: 0.64 ± 0.41
HisAsp: 1.921 ± 1.058
HisGlu: 1.601 ± 0.882
HisPhe: 1.921 ± 1.37
HisGly: 2.241 ± 0.863
HisHis: 0.96 ± 0.737
HisIle: 2.241 ± 0.837
HisLys: 0.96 ± 0.406
HisLeu: 1.921 ± 1.135
HisMet: 0.32 ± 0.176
HisAsn: 1.28 ± 0.472
HisPro: 0.0 ± 0.0
HisGln: 0.96 ± 0.529
HisArg: 1.601 ± 0.592
HisSer: 0.64 ± 0.41
HisThr: 0.96 ± 0.406
HisVal: 0.64 ± 0.353
HisTrp: 0.64 ± 0.353
HisTyr: 1.28 ± 0.705
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.841 ± 1.327
IleCys: 0.96 ± 0.489
IleAsp: 2.241 ± 0.644
IleGlu: 2.561 ± 1.544
IlePhe: 0.64 ± 0.496
IleGly: 1.28 ± 0.998
IleHis: 1.601 ± 0.796
IleIle: 1.601 ± 1.537
IleLys: 3.841 ± 0.567
IleLeu: 1.28 ± 0.543
IleMet: 1.601 ± 0.816
IleAsn: 5.442 ± 2.853
IlePro: 5.442 ± 2.615
IleGln: 1.921 ± 1.588
IleArg: 4.481 ± 2.153
IleSer: 2.561 ± 0.882
IleThr: 2.561 ± 0.908
IleVal: 6.082 ± 1.447
IleTrp: 0.0 ± 0.0
IleTyr: 0.96 ± 0.529
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.201 ± 1.219
LysCys: 1.601 ± 0.882
LysAsp: 3.841 ± 1.127
LysGlu: 3.521 ± 1.057
LysPhe: 3.521 ± 1.213
LysGly: 3.841 ± 1.712
LysHis: 1.28 ± 0.523
LysIle: 3.201 ± 0.92
LysLys: 2.881 ± 1.115
LysLeu: 6.082 ± 1.607
LysMet: 1.601 ± 0.686
LysAsn: 2.241 ± 0.913
LysPro: 2.241 ± 1.235
LysGln: 2.561 ± 1.076
LysArg: 1.28 ± 0.604
LysSer: 1.921 ± 1.135
LysThr: 3.521 ± 0.807
LysVal: 4.161 ± 1.879
LysTrp: 0.32 ± 0.176
LysTyr: 1.601 ± 0.587
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.323 ± 0.804
LeuCys: 1.28 ± 0.705
LeuAsp: 4.481 ± 2.464
LeuGlu: 4.802 ± 1.286
LeuPhe: 4.161 ± 1.329
LeuGly: 3.841 ± 0.828
LeuHis: 0.64 ± 0.353
LeuIle: 5.762 ± 3.029
LeuLys: 4.802 ± 1.819
LeuLeu: 7.682 ± 1.578
LeuMet: 1.921 ± 0.726
LeuAsn: 3.201 ± 1.163
LeuPro: 5.122 ± 1.284
LeuGln: 5.762 ± 0.832
LeuArg: 4.481 ± 1.538
LeuSer: 7.362 ± 1.856
LeuThr: 4.802 ± 1.115
LeuVal: 5.442 ± 1.651
LeuTrp: 0.32 ± 0.176
LeuTyr: 1.601 ± 0.642
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.201 ± 0.918
MetCys: 0.32 ± 0.176
MetAsp: 2.561 ± 1.268
MetGlu: 0.64 ± 0.584
MetPhe: 0.32 ± 0.176
MetGly: 1.28 ± 0.705
MetHis: 0.32 ± 0.176
MetIle: 0.64 ± 0.584
MetLys: 1.601 ± 0.642
MetLeu: 2.241 ± 1.037
MetMet: 0.32 ± 0.562
MetAsn: 1.28 ± 0.543
MetPro: 0.64 ± 0.353
MetGln: 0.64 ± 0.496
MetArg: 0.96 ± 0.406
MetSer: 1.601 ± 1.028
MetThr: 0.96 ± 0.406
MetVal: 1.601 ± 0.557
MetTrp: 0.0 ± 0.0
MetTyr: 0.96 ± 0.848
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.841 ± 0.753
AsnCys: 0.64 ± 0.496
AsnAsp: 1.921 ± 0.526
AsnGlu: 3.201 ± 1.038
AsnPhe: 3.201 ± 0.717
AsnGly: 2.561 ± 0.933
AsnHis: 1.601 ± 0.686
AsnIle: 2.241 ± 0.837
AsnLys: 0.96 ± 0.529
AsnLeu: 3.841 ± 0.731
AsnMet: 1.601 ± 1.13
AsnAsn: 0.96 ± 0.489
AsnPro: 1.601 ± 0.816
AsnGln: 2.561 ± 1.086
AsnArg: 3.201 ± 1.188
AsnSer: 2.881 ± 0.466
AsnThr: 2.241 ± 2.184
AsnVal: 3.841 ± 1.225
AsnTrp: 2.561 ± 0.966
AsnTyr: 3.521 ± 0.994
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.521 ± 2.101
ProCys: 0.64 ± 0.353
ProAsp: 3.521 ± 1.813
ProGlu: 2.881 ± 1.236
ProPhe: 1.921 ± 1.407
ProGly: 3.201 ± 1.005
ProHis: 1.28 ± 0.472
ProIle: 1.28 ± 1.064
ProLys: 3.201 ± 1.146
ProLeu: 5.762 ± 2.309
ProMet: 0.32 ± 0.483
ProAsn: 1.601 ± 0.557
ProPro: 2.881 ± 2.359
ProGln: 1.28 ± 0.519
ProArg: 2.881 ± 0.713
ProSer: 3.841 ± 1.241
ProThr: 3.841 ± 1.622
ProVal: 4.481 ± 1.262
ProTrp: 0.0 ± 0.0
ProTyr: 0.64 ± 0.353
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.881 ± 1.204
GlnCys: 0.96 ± 0.529
GlnAsp: 2.241 ± 1.159
GlnGlu: 1.28 ± 1.226
GlnPhe: 2.241 ± 0.879
GlnGly: 1.921 ± 0.811
GlnHis: 2.241 ± 0.931
GlnIle: 1.28 ± 0.998
GlnLys: 2.561 ± 1.411
GlnLeu: 4.161 ± 0.609
GlnMet: 3.201 ± 0.453
GlnAsn: 1.28 ± 0.705
GlnPro: 2.561 ± 0.654
GlnGln: 0.64 ± 1.123
GlnArg: 2.241 ± 0.709
GlnSer: 2.241 ± 1.019
GlnThr: 3.201 ± 1.087
GlnVal: 1.601 ± 0.686
GlnTrp: 0.32 ± 0.562
GlnTyr: 0.64 ± 0.41
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.442 ± 1.227
ArgCys: 0.64 ± 0.353
ArgAsp: 3.201 ± 1.249
ArgGlu: 3.521 ± 1.557
ArgPhe: 1.921 ± 0.922
ArgGly: 1.921 ± 1.474
ArgHis: 0.64 ± 0.353
ArgIle: 2.561 ± 0.354
ArgLys: 2.561 ± 1.067
ArgLeu: 3.521 ± 0.731
ArgMet: 1.921 ± 0.798
ArgAsn: 2.561 ± 0.58
ArgPro: 2.241 ± 0.837
ArgGln: 3.521 ± 1.103
ArgArg: 2.561 ± 0.58
ArgSer: 3.521 ± 1.057
ArgThr: 4.802 ± 2.083
ArgVal: 5.122 ± 1.1
ArgTrp: 0.96 ± 1.163
ArgTyr: 3.201 ± 1.193
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.442 ± 1.104
SerCys: 0.64 ± 1.263
SerAsp: 2.241 ± 1.037
SerGlu: 3.201 ± 1.699
SerPhe: 2.241 ± 1.986
SerGly: 4.802 ± 1.238
SerHis: 0.64 ± 0.353
SerIle: 1.921 ± 0.526
SerLys: 2.241 ± 1.125
SerLeu: 7.682 ± 0.799
SerMet: 0.96 ± 0.529
SerAsn: 1.921 ± 0.979
SerPro: 3.841 ± 1.444
SerGln: 2.881 ± 0.713
SerArg: 3.521 ± 1.055
SerSer: 3.201 ± 2.533
SerThr: 3.841 ± 2.002
SerVal: 3.841 ± 1.348
SerTrp: 0.32 ± 0.176
SerTyr: 2.881 ± 0.876
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.122 ± 1.632
ThrCys: 0.32 ± 0.631
ThrAsp: 3.521 ± 1.679
ThrGlu: 4.802 ± 1.03
ThrPhe: 3.201 ± 1.853
ThrGly: 3.841 ± 2.515
ThrHis: 1.601 ± 0.874
ThrIle: 3.521 ± 3.189
ThrLys: 3.841 ± 0.792
ThrLeu: 4.161 ± 0.817
ThrMet: 1.601 ± 1.028
ThrAsn: 4.161 ± 1.982
ThrPro: 3.201 ± 0.717
ThrGln: 1.921 ± 0.543
ThrArg: 3.841 ± 1.31
ThrSer: 4.802 ± 1.848
ThrThr: 3.521 ± 0.85
ThrVal: 4.161 ± 0.886
ThrTrp: 0.64 ± 0.41
ThrTyr: 1.921 ± 0.726
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.042 ± 1.521
ValCys: 1.28 ± 0.519
ValAsp: 4.161 ± 0.934
ValGlu: 5.122 ± 1.408
ValPhe: 3.201 ± 1.354
ValGly: 6.082 ± 1.961
ValHis: 1.601 ± 0.592
ValIle: 2.881 ± 1.789
ValLys: 5.762 ± 1.743
ValLeu: 3.521 ± 0.99
ValMet: 0.96 ± 0.507
ValAsn: 4.802 ± 0.915
ValPro: 3.521 ± 1.465
ValGln: 3.521 ± 1.123
ValArg: 3.201 ± 1.005
ValSer: 3.201 ± 1.39
ValThr: 6.082 ± 2.921
ValVal: 7.362 ± 2.349
ValTrp: 0.64 ± 0.496
ValTyr: 1.28 ± 0.705
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.32 ± 0.176
TrpCys: 0.32 ± 0.176
TrpAsp: 0.64 ± 0.788
TrpGlu: 0.32 ± 0.562
TrpPhe: 0.64 ± 0.353
TrpGly: 0.64 ± 0.542
TrpHis: 0.0 ± 0.0
TrpIle: 1.28 ± 0.593
TrpLys: 0.96 ± 0.567
TrpLeu: 0.64 ± 0.41
TrpMet: 0.32 ± 0.176
TrpAsn: 0.64 ± 0.542
TrpPro: 0.32 ± 0.562
TrpGln: 0.96 ± 0.406
TrpArg: 0.32 ± 0.176
TrpSer: 1.921 ± 0.703
TrpThr: 0.32 ± 0.483
TrpVal: 0.64 ± 0.353
TrpTrp: 0.32 ± 0.176
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.161 ± 1.03
TyrCys: 0.96 ± 0.737
TyrAsp: 3.521 ± 0.994
TyrGlu: 2.561 ± 1.069
TyrPhe: 0.0 ± 0.0
TyrGly: 2.241 ± 0.879
TyrHis: 0.96 ± 0.567
TyrIle: 4.161 ± 1.52
TyrLys: 1.601 ± 0.882
TyrLeu: 2.561 ± 0.984
TyrMet: 0.0 ± 0.0
TyrAsn: 2.561 ± 0.518
TyrPro: 0.32 ± 0.176
TyrGln: 0.64 ± 0.85
TyrArg: 0.96 ± 0.529
TyrSer: 0.96 ± 0.5
TyrThr: 2.561 ± 1.667
TyrVal: 1.28 ± 0.705
TyrTrp: 0.64 ± 0.41
TyrTyr: 0.64 ± 0.785
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3125 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
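A protein-level bootstrap of this kind can be sketched as follows. The page does not spell out the exact procedure, so the code rests on assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the reported error is taken to be the standard deviation of those replicates. The helper names and the toy sequences (one-letter code) are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs within each protein
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(replicates)


# Made-up toy sequences, not taken from the proteome
toy = ["ACDAAC", "AACC", "CDCD", "ACAC", "DDAA"]
print(round(pair_per_mille(toy, "AC"), 3), "±",
      round(bootstrap_error(toy, "AC"), 3))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so with only a handful of proteins the replicates can differ considerably and the estimated errors can be large relative to the values.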

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski