Amino acid dipeptide frequency for Beihai sobemo-like virus 23

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility, dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
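The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the Proteome-pI implementation: the function names `dipeptide_permille` and `classify` are hypothetical, and the sketch assumes that overlapping residue pairs are counted within each protein (never across protein boundaries) and that every residue outside the 20 standard ones is mapped to Xaa (`X`).

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille
EXPECTED_UNIFORM = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Assumption: overlapping pairs are counted inside each sequence only.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (assumption).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq, expected=EXPECTED_UNIFORM):
    """Label a frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"   # would be marked red in the table
    if freq < expected:
        return "under"  # would be marked blue in the table
    return "expected"


if __name__ == "__main__":
    # Toy sequences, not the proteins of this proteome.
    freqs = dipeptide_permille(["AKEK", "KE"])
    for pair in ("KE", "AK", "AA"):
        print(pair, round(freqs[pair], 3), classify(freqs[pair]))
```

The dictionary always contains all 441 keys, so dipeptides that never occur show up with a frequency of 0.0, as they do in the table.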

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
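As a quick check of the unit conversion, the short Python sketch below converts a per mille value to a fraction and to a percentage. The helper name `permille_to_fraction` is hypothetical; the example value is the LysGlu entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


lys_glu_permille = 16.432  # LysGlu entry from the table, in per mille
lys_glu_fraction = permille_to_fraction(lys_glu_permille)
lys_glu_percent = lys_glu_fraction * 100

# Uniform expectation for 400 dipeptides, expressed in the table's unit.
expected_permille = 1000.0 / 400

print(lys_glu_fraction, lys_glu_percent, expected_permille)
```

In other words, 16.432‰ corresponds to a fraction of 0.016432, or about 1.64%, while the uniform expectation of 0.25% corresponds to 2.5‰.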

For more information see sequence space article on Wikipedia.

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 3.13 ± 0.974 | 0.782 ± 0.453 | 2.347 ± 2.82 | 4.695 ± 1.324 | 2.347 ± 0.034 | 3.13 ± 0.974 | 2.347 ± 0.034 | 0.0 ± 0.0 | 5.477 ± 1.777 | 3.912 ± 0.521 | 1.565 ± 0.487 | 0.0 ± 0.0 | 1.565 ± 0.906 | 2.347 ± 0.034 | 0.782 ± 0.94 | 3.13 ± 0.418 | 0.782 ± 0.94 | 2.347 ± 0.034 | 1.565 ± 0.487 | 1.565 ± 0.487 | 0.0 ± 0.0
Cys | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.782 ± 0.453 | 0.782 ± 0.94 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.565 ± 0.487 | 0.0 ± 0.0 | 3.912 ± 3.307 | 0.782 ± 0.453 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.782 ± 0.453 | 1.565 ± 0.487 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.782 ± 0.453 | 0.0 ± 0.0 | 2.347 ± 2.82 | 0.0 ± 0.0
Asp | 2.347 ± 0.034 | 0.0 ± 0.0 | 2.347 ± 1.427 | 4.695 ± 1.324 | 1.565 ± 0.906 | 3.912 ± 0.521 | 0.0 ± 0.0 | 3.13 ± 0.974 | 7.825 ± 0.35 | 4.695 ± 2.717 | 1.565 ± 1.88 | 3.13 ± 1.811 | 3.912 ± 1.914 | 2.347 ± 1.358 | 1.565 ± 0.487 | 3.912 ± 1.914 | 2.347 ± 0.034 | 4.695 ± 0.069 | 0.782 ± 0.94 | 1.565 ± 0.906 | 0.0 ± 0.0
Glu | 4.695 ± 2.854 | 0.0 ± 0.0 | 5.477 ± 0.384 | 6.26 ± 0.837 | 2.347 ± 1.427 | 4.695 ± 2.717 | 0.782 ± 0.453 | 8.607 ± 2.195 | 9.39 ± 1.255 | 4.695 ± 1.324 | 3.13 ± 1.6 | 7.825 ± 0.35 | 3.13 ± 0.418 | 3.912 ± 2.264 | 0.782 ± 0.453 | 3.912 ± 0.521 | 3.13 ± 1.811 | 9.39 ± 0.137 | 1.565 ± 0.906 | 0.782 ± 0.94 | 0.0 ± 0.0
Phe | 3.13 ± 3.76 | 1.565 ± 0.487 | 1.565 ± 0.487 | 1.565 ± 0.906 | 3.13 ± 1.811 | 2.347 ± 2.82 | 0.0 ± 0.0 | 2.347 ± 1.427 | 2.347 ± 0.034 | 3.13 ± 2.367 | 0.0 ± 0.0 | 2.347 ± 1.427 | 0.782 ± 0.453 | 0.782 ± 0.453 | 0.0 ± 0.0 | 4.695 ± 1.461 | 1.565 ± 0.487 | 3.13 ± 1.811 | 0.0 ± 0.0 | 3.13 ± 0.418 | 0.0 ± 0.0
Gly | 2.347 ± 1.427 | 0.782 ± 0.94 | 1.565 ± 0.487 | 3.13 ± 0.974 | 2.347 ± 0.034 | 2.347 ± 0.034 | 0.782 ± 0.453 | 2.347 ± 0.034 | 3.13 ± 0.418 | 3.13 ± 1.811 | 0.782 ± 0.453 | 1.565 ± 0.487 | 0.782 ± 0.94 | 2.347 ± 1.358 | 2.347 ± 0.034 | 3.13 ± 1.811 | 4.695 ± 0.069 | 0.782 ± 0.94 | 3.13 ± 2.367 | 5.477 ± 2.401 | 0.0 ± 0.0
His | 0.782 ± 0.453 | 0.0 ± 0.0 | 1.565 ± 0.906 | 0.0 ± 0.0 | 0.782 ± 0.453 | 2.347 ± 1.427 | 0.782 ± 0.453 | 1.565 ± 0.487 | 3.13 ± 0.974 | 1.565 ± 0.487 | 0.782 ± 0.453 | 0.782 ± 0.453 | 0.782 ± 0.94 | 1.565 ± 0.906 | 0.782 ± 0.453 | 0.782 ± 0.453 | 1.565 ± 0.487 | 0.782 ± 0.453 | 0.0 ± 0.0 | 0.782 ± 0.453 | 0.0 ± 0.0
Ile | 2.347 ± 1.427 | 2.347 ± 0.034 | 5.477 ± 1.777 | 2.347 ± 1.358 | 0.782 ± 0.453 | 0.0 ± 0.0 | 1.565 ± 0.487 | 3.912 ± 0.521 | 3.912 ± 1.914 | 7.042 ± 0.103 | 1.565 ± 1.88 | 2.347 ± 2.82 | 2.347 ± 0.034 | 3.912 ± 0.871 | 1.565 ± 0.906 | 3.912 ± 0.871 | 2.347 ± 1.358 | 3.13 ± 0.418 | 0.0 ± 0.0 | 3.13 ± 0.418 | 0.0 ± 0.0
Lys | 3.13 ± 0.418 | 1.565 ± 1.88 | 1.565 ± 0.487 | 16.432 ± 3.938 | 3.912 ± 3.307 | 0.782 ± 0.453 | 0.782 ± 0.94 | 2.347 ± 1.358 | 7.042 ± 1.29 | 10.955 ± 2.161 | 4.695 ± 1.324 | 1.565 ± 0.906 | 0.782 ± 0.453 | 7.042 ± 1.29 | 4.695 ± 1.324 | 5.477 ± 2.401 | 4.695 ± 2.717 | 8.607 ± 0.59 | 1.565 ± 0.487 | 2.347 ± 1.427 | 0.0 ± 0.0
Leu | 1.565 ± 0.487 | 1.565 ± 1.88 | 4.695 ± 1.324 | 11.737 ± 0.172 | 4.695 ± 2.854 | 5.477 ± 0.384 | 4.695 ± 1.461 | 4.695 ± 0.069 | 8.607 ± 3.588 | 5.477 ± 1.777 | 3.912 ± 0.871 | 5.477 ± 0.384 | 4.695 ± 2.854 | 1.565 ± 0.487 | 3.13 ± 0.418 | 5.477 ± 0.384 | 3.13 ± 0.974 | 5.477 ± 0.384 | 0.0 ± 0.0 | 3.912 ± 0.871 | 0.0 ± 0.0
Met | 3.912 ± 0.871 | 0.0 ± 0.0 | 1.565 ± 0.487 | 2.347 ± 0.034 | 0.0 ± 0.0 | 1.565 ± 0.487 | 0.782 ± 0.453 | 0.782 ± 0.94 | 3.13 ± 0.974 | 0.782 ± 0.453 | 2.347 ± 1.427 | 2.347 ± 1.427 | 2.347 ± 1.358 | 1.565 ± 0.487 | 0.0 ± 0.0 | 2.347 ± 1.358 | 1.565 ± 0.906 | 3.13 ± 0.418 | 0.0 ± 0.0 | 3.912 ± 3.307 | 0.0 ± 0.0
Asn | 1.565 ± 0.487 | 0.0 ± 0.0 | 1.565 ± 0.487 | 1.565 ± 1.88 | 0.782 ± 0.94 | 3.912 ± 0.521 | 0.0 ± 0.0 | 1.565 ± 0.906 | 6.26 ± 1.949 | 5.477 ± 1.009 | 1.565 ± 0.906 | 0.782 ± 0.453 | 0.782 ± 0.453 | 3.912 ± 2.264 | 2.347 ± 0.034 | 6.26 ± 0.556 | 1.565 ± 0.487 | 0.0 ± 0.0 | 1.565 ± 0.906 | 2.347 ± 1.427 | 0.0 ± 0.0
Pro | 1.565 ± 0.906 | 0.782 ± 0.94 | 2.347 ± 0.034 | 7.042 ± 1.29 | 0.782 ± 0.94 | 2.347 ± 1.427 | 1.565 ± 0.487 | 1.565 ± 0.487 | 3.912 ± 0.521 | 0.782 ± 0.453 | 1.565 ± 0.487 | 1.565 ± 1.88 | 0.782 ± 0.453 | 0.782 ± 0.453 | 0.0 ± 0.0 | 3.912 ± 0.871 | 2.347 ± 1.358 | 4.695 ± 0.069 | 0.0 ± 0.0 | 2.347 ± 0.034 | 0.0 ± 0.0
Gln | 2.347 ± 1.358 | 0.0 ± 0.0 | 0.782 ± 0.453 | 3.13 ± 1.811 | 0.0 ± 0.0 | 1.565 ± 0.487 | 0.0 ± 0.0 | 3.13 ± 0.418 | 7.042 ± 2.682 | 6.26 ± 0.837 | 0.782 ± 0.453 | 0.782 ± 0.94 | 0.782 ± 0.453 | 3.912 ± 2.264 | 3.912 ± 0.521 | 3.912 ± 0.871 | 1.565 ± 0.906 | 7.042 ± 0.103 | 0.0 ± 0.0 | 2.347 ± 1.358 | 0.0 ± 0.0
Arg | 0.782 ± 0.453 | 1.565 ± 0.906 | 2.347 ± 1.358 | 2.347 ± 0.034 | 3.13 ± 2.367 | 1.565 ± 0.906 | 1.565 ± 0.906 | 3.13 ± 0.418 | 1.565 ± 0.906 | 4.695 ± 1.461 | 0.0 ± 0.0 | 0.782 ± 0.94 | 0.782 ± 0.94 | 2.347 ± 0.034 | 2.347 ± 0.034 | 1.565 ± 0.906 | 1.565 ± 0.906 | 3.912 ± 2.264 | 1.565 ± 0.487 | 1.565 ± 0.487 | 0.0 ± 0.0
Ser | 1.565 ± 0.906 | 0.0 ± 0.0 | 5.477 ± 0.384 | 3.912 ± 2.264 | 3.13 ± 0.418 | 5.477 ± 1.009 | 1.565 ± 0.906 | 6.26 ± 0.556 | 4.695 ± 1.324 | 5.477 ± 2.401 | 2.347 ± 0.034 | 0.782 ± 0.453 | 6.26 ± 0.837 | 3.912 ± 0.871 | 3.912 ± 0.521 | 7.042 ± 0.103 | 3.13 ± 0.418 | 3.13 ± 0.418 | 3.13 ± 2.367 | 3.13 ± 0.974 | 0.0 ± 0.0
Thr | 3.13 ± 1.811 | 0.782 ± 0.453 | 4.695 ± 2.717 | 2.347 ± 1.427 | 2.347 ± 1.358 | 1.565 ± 0.487 | 1.565 ± 0.906 | 2.347 ± 1.427 | 2.347 ± 1.427 | 4.695 ± 1.324 | 0.782 ± 0.94 | 3.13 ± 0.974 | 0.782 ± 0.453 | 2.347 ± 0.034 | 3.13 ± 1.811 | 1.565 ± 0.906 | 2.347 ± 0.034 | 3.912 ± 0.871 | 0.0 ± 0.0 | 2.347 ± 1.358 | 0.0 ± 0.0
Val | 4.695 ± 0.069 | 0.782 ± 0.94 | 6.26 ± 1.949 | 7.042 ± 2.682 | 3.13 ± 0.974 | 2.347 ± 0.034 | 0.782 ± 0.453 | 1.565 ± 0.906 | 4.695 ± 1.324 | 5.477 ± 0.384 | 2.347 ± 0.034 | 5.477 ± 3.17 | 7.825 ± 0.35 | 1.565 ± 0.906 | 3.13 ± 0.418 | 6.26 ± 0.837 | 2.347 ± 1.358 | 5.477 ± 1.777 | 0.0 ± 0.0 | 2.347 ± 1.427 | 0.0 ± 0.0
Trp | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.347 ± 1.427 | 0.0 ± 0.0 | 1.565 ± 1.88 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.782 ± 0.453 | 0.0 ± 0.0 | 3.13 ± 0.418 | 1.565 ± 1.88 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.565 ± 0.487 | 1.565 ± 0.487 | 2.347 ± 0.034 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Tyr | 0.782 ± 0.453 | 0.782 ± 0.94 | 1.565 ± 0.487 | 3.13 ± 0.974 | 0.782 ± 0.453 | 1.565 ± 0.906 | 1.565 ± 0.487 | 2.347 ± 0.034 | 5.477 ± 1.009 | 4.695 ± 2.854 | 1.565 ± 0.985 | 3.13 ± 0.974 | 1.565 ± 0.487 | 2.347 ± 2.82 | 1.565 ± 0.906 | 5.477 ± 1.009 | 3.13 ± 0.974 | 3.13 ± 1.811 | 0.0 ± 0.0 | 1.565 ± 0.487 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 2 proteins (1279 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
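One way such an error estimate could be obtained is sketched below in Python. This is an illustration under stated assumptions, not the procedure used by Proteome-pI: whole proteins are resampled with replacement, the frequency is recomputed for each of the 100 rounds, and the standard deviation of those estimates is reported as the error. The function name `bootstrap_error`, the fixed seed, and the choice of the population standard deviation are all assumptions.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Return (mean, std) of a dipeptide frequency in per mille.

    Assumption: resampling is done over whole proteins, so each round
    draws len(proteins) proteins with replacement.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.pstdev(estimates)


if __name__ == "__main__":
    # Toy sequences, not the proteins of this proteome.
    mean, err = bootstrap_error(["AKEKAKE", "KKEELLKE"], "KE")
    print(f"KE: {mean:.3f} ± {err:.3f}")
```

With only two proteins, as here, each round can draw just three distinct combinations, so the estimate is coarse. A dipeptide that is absent from every protein always yields 0.0 ± 0.0 under this scheme, which is consistent with the zero entries in the table.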

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski