Amino acid dipeptide frequency for Beihai shrimp virus 4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
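As an illustration of how such a table can be produced, here is a minimal Python sketch. It is not the database's own code: the function name `dipeptide_frequencies`, the use of overlapping windows, the mapping of every non-standard letter to `X` (Xaa), and the choice not to count pairs that span two proteins are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

# One-letter codes of the 20 standard amino acids; "X" stands for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_frequencies(sequences, alphabet=STANDARD + "X"):
    """Return {dipeptide: frequency in per mille} for a list of proteins.

    Assumption: dipeptides are counted as overlapping windows of length 2
    within each protein, and never across protein boundaries.
    """
    counts = Counter()
    for seq in sequences:
        # Replace anything outside the 20 standard residues with X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    # Report all 21 x 21 = 441 combinations, including those never observed.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


if __name__ == "__main__":
    freqs = dipeptide_frequencies(["MKVLAAGLL", "MAALKE"])  # toy sequences
    print(len(freqs), round(freqs["AA"], 3))
```

The result always has 441 entries, so it maps directly onto a 21 × 21 table with the first residue as the row and the second as the column.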

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
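The unit conversion and the comparison with the uniform expectation can be sketched as follows. The uniform expectation of 1/400 (0.25%, or 2.5‰) comes from the text above; the helper names `per_mille_to_fraction` and `classify` are hypothetical, and the assumption that the table's red/blue marking uses exactly this threshold is not confirmed by the source.

```python
# Uniform expectation for the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_EXPECTED = 1 / 400


def per_mille_to_fraction(value):
    """Convert a per-mille value from the table into a plain fraction."""
    return value * 1e-3


def classify(per_mille, expected=UNIFORM_EXPECTED):
    """Label a table value relative to the uniform expectation (assumed threshold)."""
    fraction = per_mille_to_fraction(per_mille)
    if fraction > expected:
        return "over"
    if fraction < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    # Two values taken from the table: LeuLeu (11.783) and AlaIle (0.786).
    for name, value in [("LeuLeu", 11.783), ("AlaIle", 0.786)]:
        print(name, per_mille_to_fraction(value), classify(value))
```

For example, the LeuLeu value of 11.783‰ is a fraction of about 0.0118, well above 0.0025, while the AlaIle value of 0.786‰ falls below it.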

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.142AlaAla: 3.142 ± 0.716
1.571AlaCys: 1.571 ± 0.963
2.357AlaAsp: 2.357 ± 0.976
2.357AlaGlu: 2.357 ± 0.234
1.571AlaPhe: 1.571 ± 0.963
3.142AlaGly: 3.142 ± 0.716
1.571AlaHis: 1.571 ± 0.963
0.786AlaIle: 0.786 ± 0.482
5.499AlaLys: 5.499 ± 0.951
7.855AlaLeu: 7.855 ± 1.185
4.713AlaMet: 4.713 ± 0.469
3.928AlaAsn: 3.928 ± 2.408
1.571AlaPro: 1.571 ± 0.963
6.284AlaGln: 6.284 ± 0.989
5.499AlaArg: 5.499 ± 0.26
4.713AlaSer: 4.713 ± 0.469
5.499AlaThr: 5.499 ± 0.951
7.07AlaVal: 7.07 ± 1.914
2.357AlaTrp: 2.357 ± 0.234
1.571AlaTyr: 1.571 ± 0.247
0.0AlaXaa: 0.0 ± 0.0
Cys
1.571CysAla: 1.571 ± 0.963
0.0CysCys: 0.0 ± 0.0
2.357CysAsp: 2.357 ± 0.234
0.786CysGlu: 0.786 ± 0.729
0.0CysPhe: 0.0 ± 0.0
1.571CysGly: 1.571 ± 0.247
0.0CysHis: 0.0 ± 0.0
0.786CysIle: 0.786 ± 0.729
1.571CysLys: 1.571 ± 0.963
0.0CysLeu: 0.0 ± 0.0
0.786CysMet: 0.786 ± 0.729
0.786CysAsn: 0.786 ± 0.482
0.786CysPro: 0.786 ± 0.482
0.0CysGln: 0.0 ± 0.0
0.786CysArg: 0.786 ± 0.482
1.571CysSer: 1.571 ± 0.247
0.786CysThr: 0.786 ± 0.482
0.786CysVal: 0.786 ± 0.482
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.928AspAla: 3.928 ± 0.013
0.0AspCys: 0.0 ± 0.0
5.499AspAsp: 5.499 ± 0.951
5.499AspGlu: 5.499 ± 0.951
3.142AspPhe: 3.142 ± 0.716
3.928AspGly: 3.928 ± 0.013
0.786AspHis: 0.786 ± 0.482
2.357AspIle: 2.357 ± 2.187
2.357AspLys: 2.357 ± 0.234
3.928AspLeu: 3.928 ± 1.198
0.0AspMet: 0.0 ± 0.0
0.786AspAsn: 0.786 ± 0.729
3.142AspPro: 3.142 ± 1.705
2.357AspGln: 2.357 ± 0.976
3.928AspArg: 3.928 ± 1.198
0.786AspSer: 0.786 ± 0.482
2.357AspThr: 2.357 ± 0.976
7.07AspVal: 7.07 ± 1.718
2.357AspTrp: 2.357 ± 2.187
1.571AspTyr: 1.571 ± 0.247
0.0AspXaa: 0.0 ± 0.0
Glu
4.713GluAla: 4.713 ± 0.469
0.0GluCys: 0.0 ± 0.0
2.357GluAsp: 2.357 ± 0.234
8.641GluGlu: 8.641 ± 0.456
4.713GluPhe: 4.713 ± 0.469
3.928GluGly: 3.928 ± 2.434
1.571GluHis: 1.571 ± 0.247
0.786GluIle: 0.786 ± 0.482
1.571GluLys: 1.571 ± 0.247
10.998GluLeu: 10.998 ± 0.52
0.0GluMet: 0.0 ± 0.0
3.142GluAsn: 3.142 ± 0.716
4.713GluPro: 4.713 ± 0.742
3.928GluGln: 3.928 ± 0.013
3.142GluArg: 3.142 ± 0.495
2.357GluSer: 2.357 ± 1.445
2.357GluThr: 2.357 ± 0.234
2.357GluVal: 2.357 ± 0.976
0.0GluTrp: 0.0 ± 0.0
1.571GluTyr: 1.571 ± 1.458
0.0GluXaa: 0.0 ± 0.0
Phe
3.142PheAla: 3.142 ± 1.927
1.571PheCys: 1.571 ± 0.247
1.571PheAsp: 1.571 ± 1.458
3.928PheGlu: 3.928 ± 1.198
0.0PhePhe: 0.0 ± 0.0
3.142PheGly: 3.142 ± 1.705
0.786PheHis: 0.786 ± 0.482
2.357PheIle: 2.357 ± 2.187
0.786PheLys: 0.786 ± 0.482
3.142PheLeu: 3.142 ± 0.495
0.786PheMet: 0.786 ± 0.482
0.786PheAsn: 0.786 ± 0.482
0.0PhePro: 0.0 ± 0.0
1.571PheGln: 1.571 ± 0.963
1.571PheArg: 1.571 ± 1.458
1.571PheSer: 1.571 ± 0.963
0.0PheThr: 0.0 ± 0.0
3.142PheVal: 3.142 ± 0.716
0.786PheTrp: 0.786 ± 0.482
0.786PheTyr: 0.786 ± 0.729
0.0PheXaa: 0.0 ± 0.0
Gly
3.928GlyAla: 3.928 ± 1.198
0.786GlyCys: 0.786 ± 0.729
3.142GlyAsp: 3.142 ± 0.716
2.357GlyGlu: 2.357 ± 2.187
3.142GlyPhe: 3.142 ± 0.495
3.928GlyGly: 3.928 ± 1.223
1.571GlyHis: 1.571 ± 0.247
3.928GlyIle: 3.928 ± 1.223
3.142GlyLys: 3.142 ± 0.716
5.499GlyLeu: 5.499 ± 2.681
0.786GlyMet: 0.786 ± 0.729
3.142GlyAsn: 3.142 ± 0.495
2.357GlyPro: 2.357 ± 0.976
7.07GlyGln: 7.07 ± 0.507
2.357GlyArg: 2.357 ± 0.234
2.357GlySer: 2.357 ± 0.234
5.499GlyThr: 5.499 ± 0.26
7.07GlyVal: 7.07 ± 3.124
1.571GlyTrp: 1.571 ± 1.458
1.571GlyTyr: 1.571 ± 0.247
0.0GlyXaa: 0.0 ± 0.0
His
2.357HisAla: 2.357 ± 0.976
0.786HisCys: 0.786 ± 0.729
0.786HisAsp: 0.786 ± 0.482
0.0HisGlu: 0.0 ± 0.0
0.786HisPhe: 0.786 ± 0.729
1.571HisGly: 1.571 ± 0.963
0.786HisHis: 0.786 ± 0.729
0.786HisIle: 0.786 ± 0.482
0.786HisLys: 0.786 ± 0.729
3.142HisLeu: 3.142 ± 2.916
0.0HisMet: 0.0 ± 0.0
0.786HisAsn: 0.786 ± 0.482
2.357HisPro: 2.357 ± 0.234
1.571HisGln: 1.571 ± 0.247
3.142HisArg: 3.142 ± 0.716
2.357HisSer: 2.357 ± 0.234
1.571HisThr: 1.571 ± 0.963
2.357HisVal: 2.357 ± 0.976
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.142IleAla: 3.142 ± 1.927
1.571IleCys: 1.571 ± 0.963
3.142IleAsp: 3.142 ± 1.705
3.142IleGlu: 3.142 ± 1.705
0.786IlePhe: 0.786 ± 0.482
2.357IleGly: 2.357 ± 0.234
2.357IleHis: 2.357 ± 0.234
3.142IleIle: 3.142 ± 0.495
1.571IleLys: 1.571 ± 0.247
7.07IleLeu: 7.07 ± 1.718
1.571IleMet: 1.571 ± 0.247
1.571IleAsn: 1.571 ± 0.247
1.571IlePro: 1.571 ± 0.963
2.357IleGln: 2.357 ± 0.234
1.571IleArg: 1.571 ± 0.247
6.284IleSer: 6.284 ± 2.2
3.142IleThr: 3.142 ± 0.495
0.786IleVal: 0.786 ± 0.482
0.0IleTrp: 0.0 ± 0.0
1.571IleTyr: 1.571 ± 0.247
0.0IleXaa: 0.0 ± 0.0
Lys
3.928LysAla: 3.928 ± 1.198
0.0LysCys: 0.0 ± 0.0
1.571LysAsp: 1.571 ± 0.247
4.713LysGlu: 4.713 ± 0.469
0.0LysPhe: 0.0 ± 0.0
0.786LysGly: 0.786 ± 0.482
3.142LysHis: 3.142 ± 0.495
3.928LysIle: 3.928 ± 0.013
3.142LysLys: 3.142 ± 0.495
8.641LysLeu: 8.641 ± 0.755
0.786LysMet: 0.786 ± 0.482
2.357LysAsn: 2.357 ± 1.445
4.713LysPro: 4.713 ± 0.742
0.786LysGln: 0.786 ± 0.482
0.786LysArg: 0.786 ± 0.482
3.142LysSer: 3.142 ± 0.495
5.499LysThr: 5.499 ± 2.161
4.713LysVal: 4.713 ± 1.679
0.0LysTrp: 0.0 ± 0.0
0.0LysTyr: 0.0 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
10.998LeuAla: 10.998 ± 1.731
1.571LeuCys: 1.571 ± 0.247
6.284LeuAsp: 6.284 ± 0.989
7.855LeuGlu: 7.855 ± 1.185
3.142LeuPhe: 3.142 ± 2.916
7.855LeuGly: 7.855 ± 1.236
3.142LeuHis: 3.142 ± 1.705
7.855LeuIle: 7.855 ± 0.026
5.499LeuLys: 5.499 ± 2.161
11.783LeuLeu: 11.783 ± 0.039
2.357LeuMet: 2.357 ± 1.368
1.571LeuAsn: 1.571 ± 0.247
3.928LeuPro: 3.928 ± 1.198
3.928LeuGln: 3.928 ± 0.013
8.641LeuArg: 8.641 ± 3.176
4.713LeuSer: 4.713 ± 0.469
3.928LeuThr: 3.928 ± 0.013
3.928LeuVal: 3.928 ± 1.198
0.786LeuTrp: 0.786 ± 0.482
3.142LeuTyr: 3.142 ± 0.495
0.0LeuXaa: 0.0 ± 0.0
Met
2.357MetAla: 2.357 ± 0.234
0.0MetCys: 0.0 ± 0.0
0.786MetAsp: 0.786 ± 0.482
2.357MetGlu: 2.357 ± 1.445
0.786MetPhe: 0.786 ± 0.729
0.786MetGly: 0.786 ± 0.729
0.0MetHis: 0.0 ± 0.0
1.571MetIle: 1.571 ± 1.458
2.357MetLys: 2.357 ± 0.976
2.357MetLeu: 2.357 ± 0.234
0.786MetMet: 0.786 ± 0.362
0.786MetAsn: 0.786 ± 0.729
2.357MetPro: 2.357 ± 0.976
2.357MetGln: 2.357 ± 0.976
1.571MetArg: 1.571 ± 0.963
3.142MetSer: 3.142 ± 0.716
1.571MetThr: 1.571 ± 0.963
0.786MetVal: 0.786 ± 0.729
0.786MetTrp: 0.786 ± 0.729
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.142AsnAla: 3.142 ± 1.927
0.786AsnCys: 0.786 ± 0.482
0.0AsnAsp: 0.0 ± 0.0
2.357AsnGlu: 2.357 ± 0.976
0.786AsnPhe: 0.786 ± 0.482
3.142AsnGly: 3.142 ± 0.716
2.357AsnHis: 2.357 ± 0.976
2.357AsnIle: 2.357 ± 1.445
0.786AsnLys: 0.786 ± 0.482
5.499AsnLeu: 5.499 ± 1.471
0.786AsnMet: 0.786 ± 0.482
2.357AsnAsn: 2.357 ± 0.976
3.142AsnPro: 3.142 ± 0.495
0.0AsnGln: 0.0 ± 0.0
3.142AsnArg: 3.142 ± 0.716
3.928AsnSer: 3.928 ± 0.013
0.786AsnThr: 0.786 ± 0.482
2.357AsnVal: 2.357 ± 1.445
1.571AsnTrp: 1.571 ± 0.247
0.786AsnTyr: 0.786 ± 0.482
0.0AsnXaa: 0.0 ± 0.0
Pro
1.571ProAla: 1.571 ± 0.963
0.0ProCys: 0.0 ± 0.0
2.357ProAsp: 2.357 ± 0.234
1.571ProGlu: 1.571 ± 1.458
3.142ProPhe: 3.142 ± 0.716
3.928ProGly: 3.928 ± 1.223
0.786ProHis: 0.786 ± 0.729
1.571ProIle: 1.571 ± 0.247
4.713ProLys: 4.713 ± 0.469
2.357ProLeu: 2.357 ± 0.976
0.786ProMet: 0.786 ± 0.482
3.928ProAsn: 3.928 ± 1.198
2.357ProPro: 2.357 ± 0.976
3.928ProGln: 3.928 ± 1.198
0.786ProArg: 0.786 ± 0.729
3.142ProSer: 3.142 ± 0.716
6.284ProThr: 6.284 ± 0.222
7.07ProVal: 7.07 ± 1.914
1.571ProTrp: 1.571 ± 1.458
2.357ProTyr: 2.357 ± 0.234
0.0ProXaa: 0.0 ± 0.0
Gln
4.713GlnAla: 4.713 ± 1.679
2.357GlnCys: 2.357 ± 1.445
2.357GlnAsp: 2.357 ± 2.187
2.357GlnGlu: 2.357 ± 0.976
0.0GlnPhe: 0.0 ± 0.0
0.0GlnGly: 0.0 ± 0.0
2.357GlnHis: 2.357 ± 0.234
3.928GlnIle: 3.928 ± 2.408
2.357GlnLys: 2.357 ± 1.445
5.499GlnLeu: 5.499 ± 1.471
2.357GlnMet: 2.357 ± 2.187
1.571GlnAsn: 1.571 ± 1.458
1.571GlnPro: 1.571 ± 0.247
3.142GlnGln: 3.142 ± 0.716
1.571GlnArg: 1.571 ± 0.247
4.713GlnSer: 4.713 ± 0.469
3.928GlnThr: 3.928 ± 0.013
2.357GlnVal: 2.357 ± 0.234
0.786GlnTrp: 0.786 ± 0.729
1.571GlnTyr: 1.571 ± 0.247
0.0GlnXaa: 0.0 ± 0.0
Arg
6.284ArgAla: 6.284 ± 0.989
0.786ArgCys: 0.786 ± 0.482
1.571ArgAsp: 1.571 ± 0.247
1.571ArgGlu: 1.571 ± 0.247
4.713ArgPhe: 4.713 ± 0.469
3.142ArgGly: 3.142 ± 0.495
2.357ArgHis: 2.357 ± 0.976
1.571ArgIle: 1.571 ± 0.247
1.571ArgLys: 1.571 ± 0.963
3.928ArgLeu: 3.928 ± 1.223
1.571ArgMet: 1.571 ± 0.963
2.357ArgAsn: 2.357 ± 1.445
1.571ArgPro: 1.571 ± 0.247
0.0ArgGln: 0.0 ± 0.0
3.928ArgArg: 3.928 ± 0.013
4.713ArgSer: 4.713 ± 1.952
3.142ArgThr: 3.142 ± 0.495
3.928ArgVal: 3.928 ± 1.198
3.142ArgTrp: 3.142 ± 0.495
2.357ArgTyr: 2.357 ± 0.234
0.0ArgXaa: 0.0 ± 0.0
Ser
3.928SerAla: 3.928 ± 0.013
0.786SerCys: 0.786 ± 0.482
3.142SerAsp: 3.142 ± 0.716
0.786SerGlu: 0.786 ± 0.482
3.142SerPhe: 3.142 ± 0.716
5.499SerGly: 5.499 ± 2.681
2.357SerHis: 2.357 ± 0.234
1.571SerIle: 1.571 ± 0.963
3.928SerLys: 3.928 ± 1.198
5.499SerLeu: 5.499 ± 0.951
1.571SerMet: 1.571 ± 0.247
2.357SerAsn: 2.357 ± 0.234
7.855SerPro: 7.855 ± 1.185
5.499SerGln: 5.499 ± 0.951
3.142SerArg: 3.142 ± 0.495
6.284SerSer: 6.284 ± 0.222
6.284SerThr: 6.284 ± 1.432
2.357SerVal: 2.357 ± 0.234
2.357SerTrp: 2.357 ± 0.976
1.571SerTyr: 1.571 ± 1.458
0.0SerXaa: 0.0 ± 0.0
Thr
3.142ThrAla: 3.142 ± 0.495
1.571ThrCys: 1.571 ± 0.247
5.499ThrAsp: 5.499 ± 0.951
4.713ThrGlu: 4.713 ± 0.469
0.786ThrPhe: 0.786 ± 0.482
7.855ThrGly: 7.855 ± 0.026
0.0ThrHis: 0.0 ± 0.0
3.928ThrIle: 3.928 ± 2.434
3.142ThrLys: 3.142 ± 0.716
4.713ThrLeu: 4.713 ± 1.679
2.357ThrMet: 2.357 ± 0.976
2.357ThrAsn: 2.357 ± 0.976
3.928ThrPro: 3.928 ± 1.198
0.786ThrGln: 0.786 ± 0.482
2.357ThrArg: 2.357 ± 1.445
6.284ThrSer: 6.284 ± 1.432
3.142ThrThr: 3.142 ± 2.916
2.357ThrVal: 2.357 ± 0.234
0.786ThrTrp: 0.786 ± 0.482
0.0ThrTyr: 0.0 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
6.284ValAla: 6.284 ± 2.643
0.786ValCys: 0.786 ± 0.729
5.499ValAsp: 5.499 ± 0.26
4.713ValGlu: 4.713 ± 1.679
0.786ValPhe: 0.786 ± 0.729
6.284ValGly: 6.284 ± 1.432
0.0ValHis: 0.0 ± 0.0
3.142ValIle: 3.142 ± 0.495
3.928ValLys: 3.928 ± 0.013
6.284ValLeu: 6.284 ± 2.643
2.357ValMet: 2.357 ± 0.234
3.142ValAsn: 3.142 ± 0.716
4.713ValPro: 4.713 ± 1.679
1.571ValGln: 1.571 ± 1.458
3.928ValArg: 3.928 ± 0.013
5.499ValSer: 5.499 ± 2.161
0.786ValThr: 0.786 ± 0.482
6.284ValVal: 6.284 ± 3.853
0.0ValTrp: 0.0 ± 0.0
1.571ValTyr: 1.571 ± 0.963
0.0ValXaa: 0.0 ± 0.0
Trp
0.786TrpAla: 0.786 ± 0.482
0.0TrpCys: 0.0 ± 0.0
1.571TrpAsp: 1.571 ± 1.458
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.786TrpGly: 0.786 ± 0.729
0.0TrpHis: 0.0 ± 0.0
1.571TrpIle: 1.571 ± 0.963
2.357TrpLys: 2.357 ± 2.187
1.571TrpLeu: 1.571 ± 1.458
2.357TrpMet: 2.357 ± 0.234
1.571TrpAsn: 1.571 ± 0.247
0.786TrpPro: 0.786 ± 0.729
0.786TrpGln: 0.786 ± 0.729
0.786TrpArg: 0.786 ± 0.482
0.786TrpSer: 0.786 ± 0.729
3.142TrpThr: 3.142 ± 1.705
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.786TrpTyr: 0.786 ± 0.482
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.0TyrAla: 0.0 ± 0.0
0.0TyrCys: 0.0 ± 0.0
3.928TyrAsp: 3.928 ± 0.013
3.142TyrGlu: 3.142 ± 2.916
0.0TyrPhe: 0.0 ± 0.0
1.571TyrGly: 1.571 ± 0.963
0.0TyrHis: 0.0 ± 0.0
0.786TyrIle: 0.786 ± 0.482
1.571TyrLys: 1.571 ± 0.247
3.928TyrLeu: 3.928 ± 0.013
0.0TyrMet: 0.0 ± 0.0
0.786TyrAsn: 0.786 ± 0.482
0.786TyrPro: 0.786 ± 0.482
1.571TyrGln: 1.571 ± 0.247
1.571TyrArg: 1.571 ± 1.458
1.571TyrSer: 1.571 ± 0.247
0.0TyrThr: 0.0 ± 0.0
0.786TyrVal: 0.786 ± 0.482
0.786TyrTrp: 0.786 ± 0.729
1.571TyrTyr: 1.571 ± 0.963
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1274 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
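A protein-level bootstrap of this kind can be sketched as below. This is only an illustration under stated assumptions: whole proteins are resampled with replacement 100 times, the frequency is recomputed for each resample, and the spread is summarised as a standard deviation. The function names, the use of the population standard deviation, and the fixed random seed are choices made for this example and may differ from the procedure actually used by the database.

```python
import random
import statistics
from collections import Counter


def pair_frequency(sequences, dipeptide):
    """Per-mille frequency of one dipeptide over overlapping windows."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(sequences, dipeptide, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each replicate draws len(sequences) whole proteins with replacement,
    so the resampling unit is the protein, not the individual residue.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency(sample, dipeptide))
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    proteins = ["MKVLAAGLL", "MAALKELLE"]  # toy sequences, not the real proteome
    print(pair_frequency(proteins, "LL"), bootstrap_error(proteins, "LL"))
```

With only two proteins, as in this proteome, each resample can take only three distinct compositions (first protein twice, second protein twice, or one of each), so the error estimates are necessarily coarse.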

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski