Amino acid dipeptide frequency for Beihai sobemo-like virus 11

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
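As an illustration, the calculation can be sketched in Python as below. This is a minimal sketch and not the code used to produce this page: the function name `dipeptide_frequencies`, the mapping of every non-standard letter to `X` (Xaa), and the normalization by the total number of adjacent pairs within proteins are all assumptions.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids (one-letter codes); "X" stands for Xaa.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are counted inside each protein only (no pair spans
    two proteins) and any non-standard letter is treated as "X".
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        # Overlapping windows of length 2: positions (0,1), (1,2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = AMINO_ACIDS + "X"
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(alphabet, repeat=2)
    }


freqs = dipeptide_frequencies(["AAAC", "GX"])
print(freqs["AA"], freqs["AC"], freqs["GX"])
```

The returned dictionary always has 441 keys (21 × 21), so dipeptides that never occur are reported with a frequency of 0.0, as in the table.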

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue, for better visibility.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
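The unit conversion and the comparison against the uniform expectation can be sketched as follows. The helper names are hypothetical, and the exact rule the page uses to colour cells is not stated, so comparing each value directly with 1000/400 = 2.5‰ is an assumption.

```python
# Uniform expectation for 400 equally likely dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation.

    Assumption: a simple greater/less comparison; the page's own
    colouring rule may differ.
    """
    if freq_per_mille > expected:
        return "over"    # would be marked in red
    if freq_per_mille < expected:
        return "under"   # would be marked in blue
    return "expected"


print(classify(11.502), classify(1.278))  # AlaAla and AlaCys from the table
```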

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.502 ± 1.742
AlaCys: 1.278 ± 0.633
AlaAsp: 5.112 ± 2.532
AlaGlu: 7.029 ± 1.002
AlaPhe: 3.195 ± 0.897
AlaGly: 11.502 ± 1.742
AlaHis: 1.917 ± 0.29
AlaIle: 1.278 ± 0.607
AlaLys: 4.473 ± 0.264
AlaLeu: 7.668 ± 2.401
AlaMet: 3.195 ± 0.493
AlaAsn: 2.556 ± 0.026
AlaPro: 1.917 ± 1.53
AlaGln: 3.195 ± 0.897
AlaArg: 5.751 ± 0.871
AlaSer: 10.224 ± 0.105
AlaThr: 2.556 ± 0.026
AlaVal: 5.751 ± 3.351
AlaTrp: 1.278 ± 0.607
AlaTyr: 2.556 ± 0.026
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.639 ± 0.317
CysCys: 0.0 ± 0.0
CysAsp: 1.278 ± 0.633
CysGlu: 0.639 ± 0.317
CysPhe: 0.0 ± 0.0
CysGly: 1.278 ± 0.633
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 1.278 ± 0.633
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.639 ± 0.923
CysGln: 0.0 ± 0.0
CysArg: 1.917 ± 0.95
CysSer: 0.639 ± 0.317
CysThr: 0.639 ± 0.923
CysVal: 1.917 ± 0.95
CysTrp: 0.0 ± 0.0
CysTyr: 1.917 ± 0.95
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.112 ± 0.052
AspCys: 1.278 ± 0.633
AspAsp: 1.917 ± 0.95
AspGlu: 3.834 ± 0.659
AspPhe: 2.556 ± 1.214
AspGly: 6.39 ± 1.925
AspHis: 1.278 ± 0.607
AspIle: 1.278 ± 0.633
AspLys: 3.834 ± 0.659
AspLeu: 9.585 ± 1.028
AspMet: 0.639 ± 0.528
AspAsn: 3.195 ± 0.897
AspPro: 5.112 ± 0.052
AspGln: 2.556 ± 1.266
AspArg: 2.556 ± 1.266
AspSer: 1.278 ± 0.633
AspThr: 1.278 ± 0.607
AspVal: 1.917 ± 0.95
AspTrp: 0.0 ± 0.0
AspTyr: 2.556 ± 1.266
AspXaa: 0.0 ± 0.0
Glu
GluAla: 9.585 ± 1.028
GluCys: 0.0 ± 0.0
GluAsp: 3.834 ± 0.659
GluGlu: 4.473 ± 2.216
GluPhe: 3.195 ± 1.583
GluGly: 3.834 ± 1.821
GluHis: 0.639 ± 0.317
GluIle: 3.195 ± 0.343
GluLys: 4.473 ± 2.216
GluLeu: 4.473 ± 0.976
GluMet: 0.0 ± 0.0
GluAsn: 1.278 ± 0.607
GluPro: 5.112 ± 2.532
GluGln: 5.751 ± 1.609
GluArg: 6.39 ± 3.165
GluSer: 0.0 ± 0.0
GluThr: 1.917 ± 0.29
GluVal: 3.834 ± 1.899
GluTrp: 1.278 ± 0.633
GluTyr: 1.917 ± 0.29
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.556 ± 1.266
PheCys: 0.639 ± 0.317
PheAsp: 2.556 ± 0.026
PheGlu: 1.278 ± 0.633
PhePhe: 0.0 ± 0.0
PheGly: 1.917 ± 0.95
PheHis: 0.0 ± 0.0
PheIle: 1.278 ± 0.633
PheLys: 2.556 ± 1.266
PheLeu: 5.112 ± 0.052
PheMet: 0.639 ± 0.317
PheAsn: 0.0 ± 0.0
PhePro: 1.917 ± 1.53
PheGln: 1.917 ± 0.95
PheArg: 1.917 ± 0.29
PheSer: 3.195 ± 0.343
PheThr: 5.751 ± 2.111
PheVal: 1.278 ± 0.633
PheTrp: 0.0 ± 0.0
PheTyr: 1.917 ± 0.29
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.751 ± 1.609
GlyCys: 1.278 ± 0.633
GlyAsp: 6.39 ± 0.685
GlyGlu: 9.585 ± 1.028
GlyPhe: 3.195 ± 0.343
GlyGly: 7.029 ± 5.198
GlyHis: 2.556 ± 0.026
GlyIle: 5.751 ± 0.369
GlyLys: 4.473 ± 2.216
GlyLeu: 8.307 ± 0.395
GlyMet: 0.639 ± 0.923
GlyAsn: 1.917 ± 0.95
GlyPro: 5.751 ± 1.609
GlyGln: 3.195 ± 0.897
GlyArg: 5.112 ± 1.188
GlySer: 4.473 ± 3.984
GlyThr: 5.751 ± 0.871
GlyVal: 5.112 ± 1.188
GlyTrp: 1.278 ± 0.633
GlyTyr: 2.556 ± 1.266
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.639 ± 0.923
HisCys: 0.639 ± 0.317
HisAsp: 1.917 ± 0.95
HisGlu: 1.278 ± 0.633
HisPhe: 0.639 ± 0.317
HisGly: 3.195 ± 0.343
HisHis: 0.639 ± 0.317
HisIle: 0.639 ± 0.317
HisLys: 0.0 ± 0.0
HisLeu: 2.556 ± 0.026
HisMet: 0.639 ± 0.317
HisAsn: 3.195 ± 0.897
HisPro: 0.639 ± 0.317
HisGln: 0.639 ± 0.317
HisArg: 0.639 ± 0.317
HisSer: 3.195 ± 0.897
HisThr: 1.278 ± 0.607
HisVal: 0.639 ± 0.317
HisTrp: 0.0 ± 0.0
HisTyr: 0.639 ± 0.317
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.278 ± 0.633
IleCys: 1.278 ± 0.633
IleAsp: 3.195 ± 1.583
IleGlu: 3.195 ± 0.343
IlePhe: 1.278 ± 0.607
IleGly: 3.195 ± 0.897
IleHis: 1.278 ± 0.633
IleIle: 0.639 ± 0.317
IleLys: 1.278 ± 0.607
IleLeu: 2.556 ± 1.266
IleMet: 0.0 ± 0.0
IleAsn: 1.278 ± 1.847
IlePro: 3.834 ± 0.581
IleGln: 1.278 ± 0.633
IleArg: 1.278 ± 0.633
IleSer: 3.195 ± 0.897
IleThr: 3.834 ± 0.581
IleVal: 1.917 ± 0.29
IleTrp: 0.639 ± 0.317
IleTyr: 2.556 ± 1.266
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.39 ± 0.555
LysCys: 0.639 ± 0.317
LysAsp: 5.751 ± 1.609
LysGlu: 2.556 ± 1.266
LysPhe: 1.917 ± 0.95
LysGly: 2.556 ± 1.266
LysHis: 1.917 ± 0.29
LysIle: 1.917 ± 0.95
LysLys: 1.917 ± 0.29
LysLeu: 5.751 ± 0.871
LysMet: 0.639 ± 0.317
LysAsn: 1.278 ± 0.633
LysPro: 1.917 ± 1.53
LysGln: 1.278 ± 0.607
LysArg: 1.917 ± 0.29
LysSer: 3.834 ± 0.659
LysThr: 3.834 ± 0.659
LysVal: 3.195 ± 0.343
LysTrp: 0.0 ± 0.0
LysTyr: 3.834 ± 1.899
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.502 ± 0.738
LeuCys: 0.0 ± 0.0
LeuAsp: 4.473 ± 1.504
LeuGlu: 1.278 ± 0.633
LeuPhe: 3.834 ± 1.899
LeuGly: 12.141 ± 4.774
LeuHis: 2.556 ± 1.266
LeuIle: 2.556 ± 0.026
LeuLys: 1.917 ± 0.29
LeuLeu: 5.751 ± 2.111
LeuMet: 2.556 ± 1.266
LeuAsn: 2.556 ± 0.026
LeuPro: 5.112 ± 1.188
LeuGln: 4.473 ± 1.504
LeuArg: 3.834 ± 1.899
LeuSer: 5.751 ± 3.351
LeuThr: 3.195 ± 0.897
LeuVal: 5.751 ± 2.111
LeuTrp: 1.917 ± 0.95
LeuTyr: 2.556 ± 0.026
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.278 ± 0.633
MetCys: 0.639 ± 0.923
MetAsp: 1.278 ± 0.633
MetGlu: 3.195 ± 0.897
MetPhe: 1.278 ± 0.633
MetGly: 2.556 ± 0.026
MetHis: 0.639 ± 0.923
MetIle: 1.278 ± 0.633
MetLys: 1.278 ± 0.607
MetLeu: 1.278 ± 0.607
MetMet: 0.639 ± 0.317
MetAsn: 0.0 ± 0.0
MetPro: 1.917 ± 0.29
MetGln: 1.278 ± 0.633
MetArg: 0.639 ± 0.317
MetSer: 3.834 ± 0.659
MetThr: 0.639 ± 0.317
MetVal: 1.917 ± 0.95
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.278 ± 0.607
AsnCys: 0.639 ± 0.923
AsnAsp: 1.278 ± 0.633
AsnGlu: 1.278 ± 0.633
AsnPhe: 0.639 ± 0.923
AsnGly: 3.834 ± 0.581
AsnHis: 0.0 ± 0.0
AsnIle: 0.0 ± 0.0
AsnLys: 1.917 ± 0.29
AsnLeu: 3.195 ± 0.343
AsnMet: 0.0 ± 0.0
AsnAsn: 0.639 ± 0.923
AsnPro: 1.278 ± 0.633
AsnGln: 1.917 ± 0.29
AsnArg: 0.0 ± 0.0
AsnSer: 2.556 ± 0.026
AsnThr: 1.917 ± 1.53
AsnVal: 3.834 ± 0.581
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.556 ± 0.026
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.751 ± 0.871
ProCys: 0.0 ± 0.0
ProAsp: 3.195 ± 0.343
ProGlu: 6.39 ± 3.165
ProPhe: 0.639 ± 0.923
ProGly: 3.195 ± 0.897
ProHis: 1.917 ± 0.29
ProIle: 3.195 ± 1.583
ProLys: 3.834 ± 0.581
ProLeu: 3.195 ± 0.897
ProMet: 2.556 ± 2.454
ProAsn: 0.639 ± 0.923
ProPro: 2.556 ± 0.026
ProGln: 3.834 ± 1.899
ProArg: 3.834 ± 3.061
ProSer: 1.917 ± 0.95
ProThr: 6.39 ± 1.795
ProVal: 1.917 ± 0.29
ProTrp: 0.639 ± 0.317
ProTyr: 2.556 ± 1.214
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.751 ± 4.591
GlnCys: 0.0 ± 0.0
GlnAsp: 3.834 ± 1.899
GlnGlu: 2.556 ± 1.266
GlnPhe: 1.917 ± 0.29
GlnGly: 2.556 ± 1.266
GlnHis: 0.0 ± 0.0
GlnIle: 1.278 ± 0.633
GlnLys: 1.917 ± 0.29
GlnLeu: 5.751 ± 1.609
GlnMet: 1.278 ± 0.633
GlnAsn: 1.278 ± 0.633
GlnPro: 1.917 ± 0.95
GlnGln: 1.917 ± 0.29
GlnArg: 2.556 ± 1.214
GlnSer: 3.195 ± 1.583
GlnThr: 1.917 ± 0.29
GlnVal: 1.278 ± 0.633
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.278 ± 0.633
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.751 ± 2.111
ArgCys: 2.556 ± 1.266
ArgAsp: 2.556 ± 1.266
ArgGlu: 5.112 ± 1.292
ArgPhe: 1.278 ± 0.633
ArgGly: 3.195 ± 0.897
ArgHis: 0.0 ± 0.0
ArgIle: 3.834 ± 0.581
ArgLys: 3.195 ± 0.343
ArgLeu: 3.195 ± 0.343
ArgMet: 3.834 ± 1.899
ArgAsn: 1.278 ± 0.607
ArgPro: 3.195 ± 2.137
ArgGln: 1.278 ± 0.633
ArgArg: 3.834 ± 0.581
ArgSer: 3.834 ± 1.821
ArgThr: 3.195 ± 1.583
ArgVal: 2.556 ± 0.026
ArgTrp: 1.278 ± 0.607
ArgTyr: 0.639 ± 0.317
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.195 ± 1.583
SerCys: 0.639 ± 0.317
SerAsp: 3.834 ± 1.821
SerGlu: 1.917 ± 0.29
SerPhe: 3.195 ± 0.897
SerGly: 8.307 ± 3.325
SerHis: 1.278 ± 0.607
SerIle: 5.112 ± 1.188
SerLys: 2.556 ± 1.266
SerLeu: 4.473 ± 0.976
SerMet: 1.917 ± 0.29
SerAsn: 1.917 ± 0.95
SerPro: 5.751 ± 0.871
SerGln: 2.556 ± 0.026
SerArg: 3.834 ± 0.581
SerSer: 3.834 ± 3.061
SerThr: 1.917 ± 1.53
SerVal: 7.029 ± 2.718
SerTrp: 0.639 ± 0.317
SerTyr: 3.195 ± 0.897
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.751 ± 2.111
ThrCys: 0.639 ± 0.317
ThrAsp: 0.639 ± 0.317
ThrGlu: 1.278 ± 0.607
ThrPhe: 3.834 ± 0.659
ThrGly: 5.112 ± 0.052
ThrHis: 1.278 ± 0.633
ThrIle: 1.278 ± 1.847
ThrLys: 4.473 ± 0.976
ThrLeu: 2.556 ± 2.454
ThrMet: 1.917 ± 0.29
ThrAsn: 3.195 ± 0.897
ThrPro: 4.473 ± 0.264
ThrGln: 1.278 ± 0.633
ThrArg: 2.556 ± 1.214
ThrSer: 4.473 ± 1.504
ThrThr: 2.556 ± 2.454
ThrVal: 3.195 ± 3.377
ThrTrp: 1.917 ± 0.95
ThrTyr: 2.556 ± 1.214
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.307 ± 5.805
ValCys: 0.639 ± 0.317
ValAsp: 1.917 ± 2.77
ValGlu: 3.834 ± 0.659
ValPhe: 2.556 ± 1.266
ValGly: 5.751 ± 1.609
ValHis: 3.195 ± 1.583
ValIle: 0.639 ± 0.317
ValLys: 5.112 ± 0.052
ValLeu: 4.473 ± 0.976
ValMet: 1.917 ± 0.95
ValAsn: 1.917 ± 0.29
ValPro: 1.917 ± 1.53
ValGln: 1.917 ± 0.29
ValArg: 1.917 ± 0.29
ValSer: 5.112 ± 2.428
ValThr: 3.834 ± 0.659
ValVal: 5.751 ± 0.871
ValTrp: 0.639 ± 0.317
ValTyr: 1.278 ± 0.607
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.639 ± 0.317
TrpCys: 0.0 ± 0.0
TrpAsp: 0.639 ± 0.317
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 1.278 ± 0.633
TrpHis: 0.0 ± 0.0
TrpIle: 1.278 ± 0.607
TrpLys: 0.639 ± 0.317
TrpLeu: 0.0 ± 0.0
TrpMet: 0.639 ± 0.317
TrpAsn: 0.0 ± 0.0
TrpPro: 1.917 ± 0.29
TrpGln: 0.639 ± 0.317
TrpArg: 0.639 ± 0.317
TrpSer: 1.278 ± 0.633
TrpThr: 0.639 ± 0.317
TrpVal: 1.278 ± 0.633
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.278 ± 0.607
TyrCys: 0.0 ± 0.0
TyrAsp: 3.195 ± 0.343
TyrGlu: 4.473 ± 2.216
TyrPhe: 1.278 ± 0.633
TyrGly: 1.278 ± 0.607
TyrHis: 2.556 ± 0.026
TyrIle: 2.556 ± 0.026
TyrLys: 1.917 ± 0.29
TyrLeu: 3.834 ± 1.899
TyrMet: 1.278 ± 0.607
TyrAsn: 0.639 ± 0.317
TyrPro: 1.278 ± 0.633
TyrGln: 1.278 ± 0.633
TyrArg: 4.473 ± 0.976
TyrSer: 1.278 ± 0.607
TyrThr: 1.917 ± 2.77
TyrVal: 2.556 ± 1.266
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.639 ± 0.317
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1566 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
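A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: the page does not say which statistic is reported as the error, so using the standard deviation of the resampled estimates is an assumption, and the function names `pair_frequency` and `bootstrap_error` as well as the fixed random seed are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the dipeptide frequency under protein-level resampling.

    Each resample draws len(proteins) whole proteins with replacement.
    Assumption: the error is the standard deviation of the estimates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.pstdev(estimates)


print(bootstrap_error(["AAAA", "CCCC"], "AA"))
```

With only two proteins, as for this proteome, a resample can take just three distinct compositions (the first protein twice, the second twice, or one of each), so the resulting error estimates are necessarily coarse.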

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski