Amino acid dipeptide frequency for Beihai sobemo-like virus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
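The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build the database: the function names (`dipeptide_frequencies`, `classify`) are hypothetical, dipeptides are assumed to be counted as overlapping pairs within each protein, and the expected value assumes a uniform distribution over the 400 standard dipeptides.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes

# Uniform expectation over the 400 standard dipeptides: 1/400 = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / (len(AMINO_ACIDS) ** 2)


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all given proteins.

    Assumption: pairs are counted with overlap inside each protein and never
    span two proteins. Residues outside the 20 standard ones become 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(aa if aa in AMINO_ACIDS else "X" for aa in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"   # would be marked in red
    if freq_per_mille < expected:
        return "under"  # would be marked in blue
    return "expected"
```

For instance, `classify(11.779)` gives `"over"` and `classify(1.178)` gives `"under"` under this uniform assumption.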

All values are given in per mille (‰); to obtain fractions, multiply them by 10⁻³.
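As a small worked example of the unit conversion, the following sketch turns a per mille value into a fraction or a percentage. The helper names are hypothetical and exist only for illustration.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value_per_mille / 10.0


# Example: a table value of 11.779 per mille
fraction = per_mille_to_fraction(11.779)  # about 0.011779
percent = per_mille_to_percent(11.779)    # about 1.1779 %
```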

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.178 ± 0.742
AlaCys: 1.178 ± 0.828
AlaAsp: 4.711 ± 0.17
AlaGlu: 5.889 ± 0.572
AlaPhe: 3.534 ± 0.913
AlaGly: 5.889 ± 0.998
AlaHis: 1.178 ± 0.828
AlaIle: 4.711 ± 1.4
AlaLys: 1.178 ± 0.742
AlaLeu: 5.889 ± 3.712
AlaMet: 1.178 ± 0.742
AlaAsn: 2.356 ± 0.085
AlaPro: 2.356 ± 0.085
AlaGln: 0.0 ± 0.0
AlaArg: 3.534 ± 0.913
AlaSer: 2.356 ± 0.085
AlaThr: 2.356 ± 1.485
AlaVal: 3.534 ± 0.913
AlaTrp: 1.178 ± 0.742
AlaTyr: 4.711 ± 0.17
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 1.178 ± 0.828
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 1.178 ± 0.828
CysIle: 1.178 ± 0.828
CysLys: 1.178 ± 0.742
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 2.356 ± 1.655
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 2.356 ± 1.655
CysThr: 1.178 ± 0.828
CysVal: 2.356 ± 1.655
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.889 ± 0.572
AspCys: 1.178 ± 0.828
AspAsp: 11.779 ± 2.715
AspGlu: 8.245 ± 2.653
AspPhe: 4.711 ± 1.74
AspGly: 3.534 ± 0.657
AspHis: 0.0 ± 0.0
AspIle: 3.534 ± 2.227
AspLys: 11.779 ± 1.145
AspLeu: 3.534 ± 0.657
AspMet: 3.534 ± 0.913
AspAsn: 1.178 ± 0.742
AspPro: 3.534 ± 0.657
AspGln: 3.534 ± 0.913
AspArg: 1.178 ± 0.742
AspSer: 3.534 ± 0.657
AspThr: 4.711 ± 0.17
AspVal: 1.178 ± 0.828
AspTrp: 1.178 ± 0.828
AspTyr: 3.534 ± 0.657
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.889 ± 2.142
GluCys: 0.0 ± 0.0
GluAsp: 8.245 ± 2.057
GluGlu: 3.534 ± 0.913
GluPhe: 2.356 ± 0.085
GluGly: 2.356 ± 1.655
GluHis: 1.178 ± 0.828
GluIle: 5.889 ± 3.712
GluLys: 5.889 ± 2.142
GluLeu: 9.423 ± 1.23
GluMet: 3.534 ± 0.913
GluAsn: 2.356 ± 1.655
GluPro: 2.356 ± 0.085
GluGln: 1.178 ± 0.828
GluArg: 4.711 ± 0.17
GluSer: 5.889 ± 3.712
GluThr: 3.534 ± 0.657
GluVal: 3.534 ± 0.657
GluTrp: 2.356 ± 0.085
GluTyr: 1.178 ± 0.828
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 1.178 ± 0.828
PheAsp: 3.534 ± 0.913
PheGlu: 3.534 ± 2.227
PhePhe: 0.0 ± 0.0
PheGly: 5.889 ± 2.568
PheHis: 1.178 ± 0.828
PheIle: 0.0 ± 0.0
PheLys: 0.0 ± 0.0
PheLeu: 4.711 ± 0.17
PheMet: 1.178 ± 0.828
PheAsn: 2.356 ± 0.085
PhePro: 0.0 ± 0.0
PheGln: 0.0 ± 0.0
PheArg: 0.0 ± 0.0
PheSer: 2.356 ± 0.085
PheThr: 2.356 ± 1.655
PheVal: 1.178 ± 0.828
PheTrp: 0.0 ± 0.0
PheTyr: 2.356 ± 1.655
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.889 ± 0.572
GlyCys: 0.0 ± 0.0
GlyAsp: 8.245 ± 1.083
GlyGlu: 0.0 ± 0.0
GlyPhe: 2.356 ± 0.085
GlyGly: 1.178 ± 0.828
GlyHis: 1.178 ± 0.828
GlyIle: 2.356 ± 1.655
GlyLys: 0.0 ± 0.0
GlyLeu: 7.067 ± 1.825
GlyMet: 1.178 ± 0.742
GlyAsn: 2.356 ± 1.655
GlyPro: 1.178 ± 0.742
GlyGln: 1.178 ± 0.828
GlyArg: 4.711 ± 1.74
GlySer: 3.534 ± 2.227
GlyThr: 1.178 ± 0.828
GlyVal: 5.889 ± 0.998
GlyTrp: 1.178 ± 0.742
GlyTyr: 0.0 ± 0.0
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.178 ± 0.828
HisCys: 0.0 ± 0.0
HisAsp: 1.178 ± 0.828
HisGlu: 0.0 ± 0.0
HisPhe: 1.178 ± 0.828
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 1.178 ± 0.742
HisLys: 0.0 ± 0.0
HisLeu: 4.711 ± 3.31
HisMet: 1.178 ± 0.828
HisAsn: 1.178 ± 0.828
HisPro: 1.178 ± 0.828
HisGln: 3.534 ± 0.913
HisArg: 3.534 ± 2.483
HisSer: 3.534 ± 0.913
HisThr: 1.178 ± 0.828
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.889 ± 0.572
IleCys: 0.0 ± 0.0
IleAsp: 3.534 ± 0.657
IleGlu: 3.534 ± 0.657
IlePhe: 1.178 ± 0.742
IleGly: 0.0 ± 0.0
IleHis: 0.0 ± 0.0
IleIle: 7.067 ± 1.315
IleLys: 4.711 ± 0.17
IleLeu: 2.356 ± 1.655
IleMet: 0.0 ± 0.0
IleAsn: 2.356 ± 0.085
IlePro: 3.534 ± 0.913
IleGln: 3.534 ± 0.657
IleArg: 3.534 ± 0.913
IleSer: 2.356 ± 0.085
IleThr: 3.534 ± 2.227
IleVal: 5.889 ± 0.572
IleTrp: 1.178 ± 0.828
IleTyr: 1.178 ± 0.742
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.067 ± 0.255
LysCys: 0.0 ± 0.0
LysAsp: 7.067 ± 0.255
LysGlu: 4.711 ± 1.4
LysPhe: 0.0 ± 0.0
LysGly: 2.356 ± 1.485
LysHis: 0.0 ± 0.0
LysIle: 3.534 ± 2.227
LysLys: 14.134 ± 5.769
LysLeu: 5.889 ± 0.998
LysMet: 1.178 ± 0.742
LysAsn: 3.534 ± 0.657
LysPro: 5.889 ± 2.142
LysGln: 5.889 ± 0.572
LysArg: 5.889 ± 2.142
LysSer: 5.889 ± 2.142
LysThr: 1.178 ± 0.828
LysVal: 5.889 ± 2.142
LysTrp: 0.0 ± 0.0
LysTyr: 3.534 ± 0.657
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.067 ± 1.315
LeuCys: 3.534 ± 2.483
LeuAsp: 3.534 ± 0.657
LeuGlu: 10.601 ± 0.402
LeuPhe: 0.0 ± 0.0
LeuGly: 3.534 ± 0.913
LeuHis: 3.534 ± 0.913
LeuIle: 1.178 ± 0.828
LeuLys: 9.423 ± 1.91
LeuLeu: 4.711 ± 0.17
LeuMet: 3.534 ± 2.111
LeuAsn: 3.534 ± 0.913
LeuPro: 1.178 ± 0.742
LeuGln: 5.889 ± 0.572
LeuArg: 7.067 ± 1.315
LeuSer: 5.889 ± 0.998
LeuThr: 1.178 ± 0.742
LeuVal: 3.534 ± 0.913
LeuTrp: 1.178 ± 0.828
LeuTyr: 2.356 ± 1.655
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.356 ± 0.085
MetCys: 1.178 ± 0.828
MetAsp: 1.178 ± 0.742
MetGlu: 3.534 ± 2.483
MetPhe: 1.178 ± 0.828
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 3.534 ± 0.913
MetLys: 4.711 ± 0.17
MetLeu: 1.178 ± 0.742
MetMet: 2.356 ± 0.085
MetAsn: 4.711 ± 2.97
MetPro: 1.178 ± 0.742
MetGln: 0.0 ± 0.0
MetArg: 1.178 ± 0.828
MetSer: 3.534 ± 0.913
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 1.178 ± 0.828
MetTyr: 2.356 ± 1.655
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.356 ± 0.085
AsnCys: 0.0 ± 0.0
AsnAsp: 0.0 ± 0.0
AsnGlu: 8.245 ± 2.057
AsnPhe: 1.178 ± 0.828
AsnGly: 2.356 ± 0.085
AsnHis: 1.178 ± 0.828
AsnIle: 3.534 ± 0.913
AsnLys: 1.178 ± 0.742
AsnLeu: 2.356 ± 0.085
AsnMet: 2.356 ± 1.655
AsnAsn: 4.711 ± 1.74
AsnPro: 5.889 ± 0.998
AsnGln: 2.356 ± 1.485
AsnArg: 2.356 ± 0.085
AsnSer: 1.178 ± 0.828
AsnThr: 1.178 ± 0.828
AsnVal: 2.356 ± 0.085
AsnTrp: 1.178 ± 0.742
AsnTyr: 4.711 ± 1.4
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.356 ± 0.085
ProCys: 0.0 ± 0.0
ProAsp: 1.178 ± 0.742
ProGlu: 2.356 ± 1.485
ProPhe: 1.178 ± 0.742
ProGly: 4.711 ± 0.17
ProHis: 1.178 ± 0.828
ProIle: 1.178 ± 0.828
ProLys: 2.356 ± 0.085
ProLeu: 4.711 ± 0.17
ProMet: 1.178 ± 0.828
ProAsn: 2.356 ± 1.485
ProPro: 4.711 ± 2.97
ProGln: 0.0 ± 0.0
ProArg: 1.178 ± 0.828
ProSer: 4.711 ± 1.4
ProThr: 5.889 ± 3.712
ProVal: 2.356 ± 1.485
ProTrp: 0.0 ± 0.0
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.178 ± 0.828
GlnCys: 0.0 ± 0.0
GlnAsp: 1.178 ± 0.828
GlnGlu: 2.356 ± 1.485
GlnPhe: 2.356 ± 1.655
GlnGly: 2.356 ± 1.655
GlnHis: 2.356 ± 1.655
GlnIle: 0.0 ± 0.0
GlnLys: 2.356 ± 1.485
GlnLeu: 5.889 ± 0.998
GlnMet: 1.178 ± 1.193
GlnAsn: 1.178 ± 0.742
GlnPro: 0.0 ± 0.0
GlnGln: 2.356 ± 1.655
GlnArg: 0.0 ± 0.0
GlnSer: 4.711 ± 1.4
GlnThr: 1.178 ± 0.742
GlnVal: 2.356 ± 1.485
GlnTrp: 1.178 ± 0.828
GlnTyr: 2.356 ± 0.085
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.356 ± 0.085
ArgCys: 1.178 ± 0.828
ArgAsp: 3.534 ± 0.657
ArgGlu: 3.534 ± 0.913
ArgPhe: 1.178 ± 0.828
ArgGly: 1.178 ± 0.742
ArgHis: 3.534 ± 0.913
ArgIle: 4.711 ± 1.4
ArgLys: 3.534 ± 0.913
ArgLeu: 8.245 ± 4.223
ArgMet: 0.0 ± 0.0
ArgAsn: 4.711 ± 0.17
ArgPro: 1.178 ± 0.742
ArgGln: 0.0 ± 0.0
ArgArg: 2.356 ± 1.655
ArgSer: 4.711 ± 1.4
ArgThr: 2.356 ± 1.485
ArgVal: 1.178 ± 0.742
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.178 ± 0.828
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.356 ± 1.655
SerCys: 2.356 ± 0.085
SerAsp: 3.534 ± 0.657
SerGlu: 4.711 ± 1.4
SerPhe: 2.356 ± 0.085
SerGly: 4.711 ± 0.17
SerHis: 0.0 ± 0.0
SerIle: 1.178 ± 0.742
SerLys: 8.245 ± 5.197
SerLeu: 5.889 ± 0.572
SerMet: 3.534 ± 0.657
SerAsn: 5.889 ± 0.572
SerPro: 0.0 ± 0.0
SerGln: 0.0 ± 0.0
SerArg: 1.178 ± 0.828
SerSer: 9.423 ± 4.37
SerThr: 4.711 ± 0.17
SerVal: 7.067 ± 1.315
SerTrp: 0.0 ± 0.0
SerTyr: 4.711 ± 0.17
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.356 ± 1.485
ThrCys: 0.0 ± 0.0
ThrAsp: 2.356 ± 1.655
ThrGlu: 3.534 ± 2.227
ThrPhe: 1.178 ± 0.828
ThrGly: 3.534 ± 0.913
ThrHis: 1.178 ± 0.828
ThrIle: 4.711 ± 1.74
ThrLys: 4.711 ± 1.4
ThrLeu: 1.178 ± 0.742
ThrMet: 2.356 ± 0.085
ThrAsn: 1.178 ± 0.828
ThrPro: 4.711 ± 2.97
ThrGln: 1.178 ± 0.828
ThrArg: 1.178 ± 0.742
ThrSer: 2.356 ± 1.485
ThrThr: 1.178 ± 0.828
ThrVal: 2.356 ± 0.085
ThrTrp: 1.178 ± 0.742
ThrTyr: 0.0 ± 0.0
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.356 ± 1.485
ValCys: 1.178 ± 0.828
ValAsp: 7.067 ± 1.315
ValGlu: 5.889 ± 0.572
ValPhe: 2.356 ± 1.655
ValGly: 1.178 ± 0.742
ValHis: 2.356 ± 1.655
ValIle: 4.711 ± 3.31
ValLys: 4.711 ± 1.4
ValLeu: 2.356 ± 1.655
ValMet: 1.178 ± 0.742
ValAsn: 0.0 ± 0.0
ValPro: 4.711 ± 1.4
ValGln: 7.067 ± 1.315
ValArg: 2.356 ± 0.085
ValSer: 2.356 ± 0.085
ValThr: 3.534 ± 0.913
ValVal: 4.711 ± 0.17
ValTrp: 0.0 ± 0.0
ValTyr: 1.178 ± 0.742
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.178 ± 0.828
TrpCys: 0.0 ± 0.0
TrpAsp: 1.178 ± 0.828
TrpGlu: 1.178 ± 0.742
TrpPhe: 0.0 ± 0.0
TrpGly: 1.178 ± 0.742
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 2.356 ± 1.485
TrpLeu: 1.178 ± 0.828
TrpMet: 2.356 ± 1.655
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 2.356 ± 0.085
TrpSer: 0.0 ± 0.0
TrpThr: 0.0 ± 0.0
TrpVal: 1.178 ± 0.828
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.0 ± 0.0
TyrCys: 0.0 ± 0.0
TyrAsp: 7.067 ± 0.255
TyrGlu: 0.0 ± 0.0
TyrPhe: 3.534 ± 0.657
TyrGly: 4.711 ± 1.74
TyrHis: 3.534 ± 0.913
TyrIle: 1.178 ± 0.742
TyrLys: 1.178 ± 0.742
TyrLeu: 1.178 ± 0.828
TyrMet: 1.178 ± 0.742
TyrAsn: 2.356 ± 0.085
TyrPro: 0.0 ± 0.0
TyrGln: 0.0 ± 0.0
TyrArg: 2.356 ± 1.485
TyrSer: 1.178 ± 0.828
TyrThr: 0.0 ± 0.0
TyrVal: 4.711 ± 1.74
TyrTrp: 1.178 ± 0.828
TyrTyr: 1.178 ± 0.828
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (850 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
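One way such a protein-level bootstrap could work is sketched below. This is not the database's actual procedure: the function names are hypothetical, and it assumes that whole proteins are resampled with replacement 100 times and that the reported error is the standard deviation of each dipeptide frequency across the resamples.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins):
    """Overlapping dipeptide frequencies (per mille) pooled over proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {p: 1000.0 * n / total for p, n in counts.items()} if total else {}


def bootstrap_error(proteins, n_resamples=100, seed=0):
    """Estimate a per-dipeptide error by resampling whole proteins.

    Assumption: the error is the population standard deviation of the
    frequency across resamples; a dipeptide absent from a resample counts
    as 0 for that resample.
    """
    rng = random.Random(seed)
    results = []
    seen = set()
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        freqs = dipeptide_per_mille(resample)
        results.append(freqs)
        seen.update(freqs)
    return {
        pair: statistics.pstdev([r.get(pair, 0.0) for r in results])
        for pair in sorted(seen)
    }
```

With only two proteins, as here, each resample can take just a few distinct compositions, which may explain why the same error values recur throughout the table.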

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski