Amino acid dipeptide frequency for Beihai shrimp virus 5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.
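The counting described above can be sketched as follows. This is a minimal illustration, not the Proteome-pI implementation: the function name `dipeptide_permille`, the one-letter alphabet, the toy sequences, and the choice not to count pairs across protein boundaries are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs."""
    counts = Counter()
    for seq in proteins:
        # Map every residue outside the 20 standard ones to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; assumed not to span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }

# Uniform expectation over the 400 standard dipeptides: 0.25% = 2.5 per mille.
EXPECTED_UNIFORM = 1000.0 / 400

# Hypothetical toy sequences, not the proteins of Beihai shrimp virus 5.
toy = ["MKKLAAK", "AAKB"]
freqs = dipeptide_permille(toy)
overrepresented = sorted(dp for dp, f in freqs.items() if f > EXPECTED_UNIFORM)
underrepresented = sorted(dp for dp, f in freqs.items() if f < EXPECTED_UNIFORM)
```

With real input, `overrepresented` and `underrepresented` would correspond to the red and blue cells, under the assumption that the colouring uses the uniform 2.5‰ threshold.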

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions; for example, 4.358‰ corresponds to 0.004358.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.358 ± 0.611
AlaCys: 1.453 ± 0.596
AlaAsp: 3.874 ± 0.879
AlaGlu: 1.937 ± 1.294
AlaPhe: 1.453 ± 0.475
AlaGly: 2.906 ± 1.529
AlaHis: 1.937 ± 0.97
AlaIle: 4.843 ± 0.891
AlaLys: 4.843 ± 0.31
AlaLeu: 3.874 ± 0.486
AlaMet: 1.453 ± 0.635
AlaAsn: 3.39 ± 0.405
AlaPro: 3.874 ± 1.323
AlaGln: 5.327 ± 2.004
AlaArg: 1.937 ± 0.992
AlaSer: 4.843 ± 1.142
AlaThr: 1.937 ± 0.562
AlaVal: 2.906 ± 0.949
AlaTrp: 1.937 ± 0.97
AlaTyr: 0.969 ± 0.666
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.453 ± 1.179
CysCys: 0.484 ± 0.393
CysAsp: 0.969 ± 0.496
CysGlu: 3.39 ± 1.582
CysPhe: 0.0 ± 0.0
CysGly: 0.484 ± 0.393
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.484 ± 0.393
CysLeu: 2.421 ± 1.355
CysMet: 0.484 ± 0.333
CysAsn: 1.937 ± 0.911
CysPro: 0.484 ± 0.465
CysGln: 0.969 ± 0.281
CysArg: 0.484 ± 0.333
CysSer: 0.484 ± 0.393
CysThr: 0.484 ± 0.333
CysVal: 0.969 ± 0.281
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.906 ± 0.407
AspCys: 0.484 ± 0.393
AspAsp: 2.906 ± 1.08
AspGlu: 3.39 ± 0.963
AspPhe: 6.295 ± 0.332
AspGly: 4.843 ± 1.617
AspHis: 0.969 ± 0.281
AspIle: 0.969 ± 0.281
AspLys: 2.906 ± 0.278
AspLeu: 4.358 ± 1.865
AspMet: 0.484 ± 0.333
AspAsn: 1.453 ± 0.596
AspPro: 2.421 ± 0.705
AspGln: 0.969 ± 0.281
AspArg: 3.874 ± 0.228
AspSer: 1.937 ± 1.11
AspThr: 2.906 ± 0.843
AspVal: 4.358 ± 0.778
AspTrp: 0.969 ± 0.446
AspTyr: 1.937 ± 0.439
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.453 ± 1.0
GluCys: 0.969 ± 0.281
GluAsp: 3.874 ± 1.541
GluGlu: 4.843 ± 0.44
GluPhe: 4.358 ± 0.568
GluGly: 2.906 ± 0.661
GluHis: 0.484 ± 0.333
GluIle: 1.453 ± 1.0
GluLys: 4.358 ± 1.45
GluLeu: 4.843 ± 0.766
GluMet: 2.421 ± 0.338
GluAsn: 1.937 ± 0.243
GluPro: 4.358 ± 1.381
GluGln: 1.937 ± 0.562
GluArg: 6.295 ± 2.121
GluSer: 7.748 ± 0.353
GluThr: 9.201 ± 1.653
GluVal: 2.906 ± 0.278
GluTrp: 0.969 ± 0.496
GluTyr: 1.453 ± 0.764
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.843 ± 1.617
PheCys: 0.484 ± 0.333
PheAsp: 2.421 ± 0.155
PheGlu: 2.906 ± 0.546
PhePhe: 1.453 ± 0.475
PheGly: 3.874 ± 1.169
PheHis: 0.484 ± 0.393
PheIle: 1.453 ± 0.475
PheLys: 1.453 ± 0.764
PheLeu: 5.327 ± 0.965
PheMet: 0.0 ± 0.0
PheAsn: 0.969 ± 0.496
PhePro: 0.969 ± 0.281
PheGln: 0.484 ± 0.333
PheArg: 2.421 ± 0.808
PheSer: 3.39 ± 1.174
PheThr: 4.843 ± 1.652
PheVal: 2.906 ± 1.413
PheTrp: 0.484 ± 0.465
PheTyr: 0.969 ± 0.666
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.874 ± 1.151
GlyCys: 2.906 ± 1.744
GlyAsp: 3.874 ± 0.749
GlyGlu: 2.421 ± 0.155
GlyPhe: 1.937 ± 0.243
GlyGly: 4.358 ± 2.196
GlyHis: 2.421 ± 1.088
GlyIle: 2.906 ± 1.533
GlyLys: 2.421 ± 0.681
GlyLeu: 4.358 ± 0.108
GlyMet: 1.937 ± 0.661
GlyAsn: 2.906 ± 1.193
GlyPro: 1.937 ± 1.315
GlyGln: 1.453 ± 0.204
GlyArg: 2.421 ± 1.479
GlySer: 6.78 ± 2.513
GlyThr: 6.78 ± 1.945
GlyVal: 3.39 ± 0.594
GlyTrp: 0.484 ± 0.393
GlyTyr: 0.969 ± 0.446
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.937 ± 0.439
HisCys: 0.484 ± 0.333
HisAsp: 1.937 ± 1.333
HisGlu: 0.484 ± 0.393
HisPhe: 0.484 ± 0.465
HisGly: 0.484 ± 0.333
HisHis: 0.0 ± 0.0
HisIle: 0.969 ± 0.666
HisLys: 1.453 ± 0.204
HisLeu: 1.937 ± 0.97
HisMet: 0.484 ± 0.393
HisAsn: 0.0 ± 0.0
HisPro: 0.969 ± 0.446
HisGln: 1.453 ± 0.596
HisArg: 0.969 ± 0.281
HisSer: 1.453 ± 0.596
HisThr: 2.421 ± 0.547
HisVal: 1.937 ± 1.333
HisTrp: 0.0 ± 0.0
HisTyr: 0.484 ± 0.393
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.874 ± 0.879
IleCys: 0.969 ± 0.786
IleAsp: 0.969 ± 0.666
IleGlu: 3.874 ± 0.503
IlePhe: 1.453 ± 0.475
IleGly: 6.78 ± 0.654
IleHis: 1.453 ± 0.635
IleIle: 2.906 ± 0.661
IleLys: 1.937 ± 0.562
IleLeu: 1.453 ± 0.635
IleMet: 0.0 ± 0.0
IleAsn: 1.937 ± 0.911
IlePro: 5.327 ± 0.911
IleGln: 3.39 ± 0.561
IleArg: 2.421 ± 0.705
IleSer: 3.874 ± 0.922
IleThr: 2.906 ± 0.872
IleVal: 4.843 ± 0.44
IleTrp: 1.453 ± 1.179
IleTyr: 0.484 ± 0.333
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.906 ± 1.338
LysCys: 0.0 ± 0.0
LysAsp: 1.453 ± 1.0
LysGlu: 2.421 ± 1.088
LysPhe: 4.843 ± 0.72
LysGly: 2.421 ± 0.808
LysHis: 0.969 ± 0.666
LysIle: 4.358 ± 1.231
LysLys: 4.358 ± 2.123
LysLeu: 7.748 ± 1.31
LysMet: 0.969 ± 0.446
LysAsn: 2.421 ± 0.681
LysPro: 2.421 ± 0.547
LysGln: 2.906 ± 0.661
LysArg: 4.843 ± 1.652
LysSer: 4.358 ± 1.381
LysThr: 5.327 ± 1.777
LysVal: 5.327 ± 2.747
LysTrp: 0.969 ± 0.496
LysTyr: 0.969 ± 0.786
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.327 ± 3.168
LeuCys: 2.421 ± 0.571
LeuAsp: 5.811 ± 1.729
LeuGlu: 7.264 ± 2.373
LeuPhe: 1.937 ± 0.97
LeuGly: 2.421 ± 0.705
LeuHis: 2.421 ± 0.705
LeuIle: 7.748 ± 0.353
LeuLys: 4.358 ± 0.778
LeuLeu: 7.748 ± 1.044
LeuMet: 2.906 ± 0.661
LeuAsn: 3.39 ± 0.861
LeuPro: 3.874 ± 0.749
LeuGln: 1.453 ± 0.878
LeuArg: 5.327 ± 0.965
LeuSer: 7.264 ± 2.528
LeuThr: 2.906 ± 0.894
LeuVal: 3.39 ± 1.236
LeuTrp: 0.969 ± 0.496
LeuTyr: 2.421 ± 0.155
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.969 ± 0.931
MetCys: 0.969 ± 0.786
MetAsp: 0.969 ± 0.446
MetGlu: 2.906 ± 1.27
MetPhe: 0.969 ± 0.281
MetGly: 2.421 ± 0.808
MetHis: 0.969 ± 0.666
MetIle: 0.969 ± 0.281
MetLys: 2.421 ± 0.547
MetLeu: 1.937 ± 0.771
MetMet: 0.0 ± 0.0
MetAsn: 1.937 ± 0.892
MetPro: 0.484 ± 0.333
MetGln: 1.453 ± 0.204
MetArg: 0.0 ± 0.0
MetSer: 0.969 ± 0.666
MetThr: 0.969 ± 0.446
MetVal: 1.453 ± 0.635
MetTrp: 0.0 ± 0.0
MetTyr: 0.969 ± 0.281
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.937 ± 0.439
AsnCys: 0.484 ± 0.333
AsnAsp: 1.937 ± 0.97
AsnGlu: 2.906 ± 0.546
AsnPhe: 0.484 ± 0.333
AsnGly: 3.874 ± 1.102
AsnHis: 0.484 ± 0.333
AsnIle: 0.484 ± 0.333
AsnLys: 3.874 ± 1.102
AsnLeu: 4.843 ± 0.452
AsnMet: 0.0 ± 0.0
AsnAsn: 0.0 ± 0.0
AsnPro: 1.453 ± 0.878
AsnGln: 3.874 ± 1.151
AsnArg: 2.421 ± 0.547
AsnSer: 4.358 ± 0.568
AsnThr: 1.453 ± 0.635
AsnVal: 2.906 ± 0.407
AsnTrp: 0.484 ± 0.333
AsnTyr: 0.484 ± 0.393
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.358 ± 1.047
ProCys: 0.969 ± 0.281
ProAsp: 2.421 ± 1.088
ProGlu: 4.358 ± 2.402
ProPhe: 0.0 ± 0.0
ProGly: 4.358 ± 1.668
ProHis: 0.484 ± 0.393
ProIle: 2.906 ± 0.894
ProLys: 2.906 ± 1.27
ProLeu: 3.39 ± 0.3
ProMet: 2.421 ± 0.705
ProAsn: 4.358 ± 1.45
ProPro: 5.811 ± 2.062
ProGln: 1.937 ± 0.911
ProArg: 0.969 ± 0.931
ProSer: 1.453 ± 0.204
ProThr: 7.748 ± 1.093
ProVal: 3.39 ± 0.861
ProTrp: 1.453 ± 0.204
ProTyr: 0.969 ± 0.496
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.906 ± 1.013
GlnCys: 0.0 ± 0.0
GlnAsp: 0.969 ± 0.281
GlnGlu: 2.421 ± 0.705
GlnPhe: 1.937 ± 0.243
GlnGly: 0.484 ± 0.393
GlnHis: 0.969 ± 0.446
GlnIle: 1.937 ± 0.97
GlnLys: 3.39 ± 1.477
GlnLeu: 2.906 ± 0.278
GlnMet: 0.969 ± 0.583
GlnAsn: 1.937 ± 0.243
GlnPro: 3.39 ± 0.594
GlnGln: 3.874 ± 1.541
GlnArg: 3.39 ± 0.405
GlnSer: 2.906 ± 0.546
GlnThr: 3.39 ± 1.202
GlnVal: 3.874 ± 1.784
GlnTrp: 0.969 ± 0.281
GlnTyr: 2.421 ± 0.155
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.874 ± 1.941
ArgCys: 0.0 ± 0.0
ArgAsp: 2.421 ± 0.155
ArgGlu: 4.843 ± 1.51
ArgPhe: 4.843 ± 0.31
ArgGly: 3.39 ± 0.878
ArgHis: 0.969 ± 0.666
ArgIle: 1.937 ± 0.97
ArgLys: 1.937 ± 0.243
ArgLeu: 3.39 ± 1.562
ArgMet: 0.969 ± 0.666
ArgAsn: 1.937 ± 0.243
ArgPro: 1.453 ± 0.635
ArgGln: 2.906 ± 0.661
ArgArg: 5.811 ± 2.359
ArgSer: 3.39 ± 1.708
ArgThr: 4.358 ± 1.283
ArgVal: 3.39 ± 0.939
ArgTrp: 0.969 ± 0.786
ArgTyr: 2.906 ± 0.872
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.295 ± 2.121
SerCys: 1.937 ± 0.992
SerAsp: 4.843 ± 0.95
SerGlu: 5.811 ± 2.386
SerPhe: 1.937 ± 0.97
SerGly: 5.811 ± 1.729
SerHis: 0.969 ± 0.786
SerIle: 3.874 ± 0.749
SerLys: 6.295 ± 0.902
SerLeu: 4.843 ± 1.867
SerMet: 3.39 ± 0.363
SerAsn: 2.906 ± 0.407
SerPro: 5.327 ± 0.403
SerGln: 2.906 ± 0.872
SerArg: 4.358 ± 0.68
SerSer: 9.685 ± 2.197
SerThr: 4.843 ± 0.72
SerVal: 5.811 ± 2.027
SerTrp: 0.484 ± 0.465
SerTyr: 1.937 ± 0.661
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.39 ± 2.333
ThrCys: 0.484 ± 0.393
ThrAsp: 1.453 ± 0.596
ThrGlu: 7.264 ± 0.736
ThrPhe: 2.421 ± 0.571
ThrGly: 2.906 ± 1.487
ThrHis: 2.906 ± 1.08
ThrIle: 4.358 ± 1.283
ThrLys: 2.906 ± 0.278
ThrLeu: 5.811 ± 1.984
ThrMet: 3.39 ± 1.004
ThrAsn: 2.421 ± 0.547
ThrPro: 5.327 ± 0.773
ThrGln: 2.421 ± 0.155
ThrArg: 4.358 ± 0.751
ThrSer: 8.717 ± 1.543
ThrThr: 5.811 ± 2.412
ThrVal: 5.327 ± 2.116
ThrTrp: 0.484 ± 0.333
ThrTyr: 1.937 ± 0.562
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.453 ± 0.849
ValCys: 0.484 ± 0.333
ValAsp: 6.295 ± 1.264
ValGlu: 2.906 ± 1.27
ValPhe: 3.39 ± 0.3
ValGly: 3.39 ± 0.878
ValHis: 0.484 ± 0.393
ValIle: 5.327 ± 1.417
ValLys: 6.295 ± 3.154
ValLeu: 4.843 ± 2.177
ValMet: 0.484 ± 0.333
ValAsn: 1.453 ± 0.475
ValPro: 5.811 ± 1.323
ValGln: 3.874 ± 1.419
ValArg: 1.937 ± 0.771
ValSer: 6.295 ± 1.454
ValThr: 3.874 ± 1.931
ValVal: 4.843 ± 2.23
ValTrp: 0.484 ± 0.333
ValTyr: 1.937 ± 0.439
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.969 ± 0.281
TrpCys: 0.0 ± 0.0
TrpAsp: 0.484 ± 0.393
TrpGlu: 0.969 ± 0.446
TrpPhe: 0.969 ± 0.281
TrpGly: 0.0 ± 0.0
TrpHis: 0.484 ± 0.393
TrpIle: 0.484 ± 0.333
TrpLys: 0.969 ± 0.931
TrpLeu: 1.937 ± 1.572
TrpMet: 0.0 ± 0.0
TrpAsn: 0.484 ± 0.333
TrpPro: 0.484 ± 0.393
TrpGln: 0.0 ± 0.0
TrpArg: 0.0 ± 0.0
TrpSer: 2.421 ± 1.966
TrpThr: 1.937 ± 1.294
TrpVal: 0.0 ± 0.0
TrpTrp: 0.484 ± 0.465
TrpTyr: 0.969 ± 0.446
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.969 ± 0.281
TyrCys: 0.0 ± 0.0
TyrAsp: 1.453 ± 0.475
TyrGlu: 1.453 ± 1.179
TyrPhe: 0.969 ± 0.281
TyrGly: 2.421 ± 0.155
TyrHis: 0.484 ± 0.333
TyrIle: 2.421 ± 0.571
TyrLys: 2.421 ± 0.547
TyrLeu: 3.39 ± 0.861
TyrMet: 0.484 ± 0.333
TyrAsn: 0.969 ± 0.496
TyrPro: 0.484 ± 0.333
TyrGln: 1.937 ± 0.439
TyrArg: 1.453 ± 1.179
TyrSer: 1.937 ± 0.771
TyrThr: 0.0 ± 0.0
TyrVal: 1.937 ± 0.661
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.937 ± 0.992
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2066 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
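One plausible reading of protein-level bootstrapping is sketched below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported as the error. The exact procedure used by Proteome-pI is not described here, so the function names, the use of the population standard deviation, and the toy sequences are assumptions made for illustration.

```python
import random
import statistics
from collections import Counter

def permille(proteins, dipeptide):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0

def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Spread of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement (assumed resampling unit).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(permille(sample, dipeptide))
    # Assumption: the reported error is the standard deviation of replicates.
    return statistics.pstdev(estimates)

# Hypothetical toy sequences, not the proteins of Beihai shrimp virus 5.
toy = ["MKKLAAK", "AAKAA", "GGSGGS"]
err = bootstrap_error(toy, "AA")
```

With only 3 proteins, as in this proteome, each replicate draws from very few distinct units, which is consistent with the comparatively large errors in the table.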

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski