Amino acid dipeptide frequency for Hubei sobemo-like virus 44

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and all underrepresented ones are marked in blue in the table.
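
The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's actual code: the function names are hypothetical, dipeptides are assumed to be counted as overlapping pairs within each protein (not across protein boundaries), and any letter outside the 20 standard amino acids is assumed to map to Xaa. The uniform expectation is 1000/400 = 2.5‰.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the column order of the table.
# "X" stands for Xaa (non-standard or unknown residues).
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr", "X": "Xaa",
}

# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000 / 400  # 2.5 per mille = 0.25 %


def dipeptide_per_mille(proteins):
    """Return the frequency (in per mille) of all 441 dipeptides.

    Assumption: overlapping pairs are counted inside each protein only.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything that is not a standard residue to X (Xaa).
        seq = "".join(c if c in ONE_TO_THREE else "X" for c in seq.upper())
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return {
        a + b: (1000 * counts[a + b] / total if total else 0.0)
        for a, b in product(ONE_TO_THREE, repeat=2)
    }


def classify(value_per_mille):
    """Label a value relative to the uniform 2.5 per mille expectation."""
    if value_per_mille > EXPECTED_PER_MILLE:
        return "over"   # would be marked red in the table
    if value_per_mille < EXPECTED_PER_MILLE:
        return "under"  # would be marked blue in the table
    return "expected"
```

For example, `classify(4.556)` labels the AlaAla value from the table as overrepresented, while `classify(1.139)` labels AlaHis as underrepresented.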

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.556AlaAla: 4.556 ± 0.061
3.417AlaCys: 3.417 ± 0.743
2.278AlaAsp: 2.278 ± 1.547
3.417AlaGlu: 3.417 ± 0.835
3.417AlaPhe: 3.417 ± 0.743
4.556AlaGly: 4.556 ± 1.639
1.139AlaHis: 1.139 ± 0.774
2.278AlaIle: 2.278 ± 0.031
6.834AlaLys: 6.834 ± 0.092
5.695AlaLeu: 5.695 ± 0.712
3.417AlaMet: 3.417 ± 0.596
1.139AlaAsn: 1.139 ± 0.804
1.139AlaPro: 1.139 ± 0.774
3.417AlaGln: 3.417 ± 0.835
3.417AlaArg: 3.417 ± 0.835
2.278AlaSer: 2.278 ± 0.031
4.556AlaThr: 4.556 ± 0.061
5.695AlaVal: 5.695 ± 0.865
2.278AlaTrp: 2.278 ± 1.547
0.0AlaTyr: 0.0 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
1.139CysCys: 1.139 ± 0.774
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
1.139CysPhe: 1.139 ± 0.804
1.139CysGly: 1.139 ± 0.774
1.139CysHis: 1.139 ± 0.804
2.278CysIle: 2.278 ± 1.547
1.139CysLys: 1.139 ± 0.774
1.139CysLeu: 1.139 ± 0.774
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
1.139CysPro: 1.139 ± 0.804
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
1.139CysThr: 1.139 ± 0.804
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
0.0AspAla: 0.0 ± 0.0
1.139AspCys: 1.139 ± 0.774
2.278AspAsp: 2.278 ± 1.547
3.417AspGlu: 3.417 ± 0.835
4.556AspPhe: 4.556 ± 1.639
6.834AspGly: 6.834 ± 0.092
1.139AspHis: 1.139 ± 0.774
2.278AspIle: 2.278 ± 1.608
5.695AspLys: 5.695 ± 0.865
6.834AspLeu: 6.834 ± 3.064
0.0AspMet: 0.0 ± 0.0
1.139AspAsn: 1.139 ± 0.774
3.417AspPro: 3.417 ± 2.321
1.139AspGln: 1.139 ± 0.804
2.278AspArg: 2.278 ± 1.608
3.417AspSer: 3.417 ± 0.835
3.417AspThr: 3.417 ± 0.743
4.556AspVal: 4.556 ± 1.517
2.278AspTrp: 2.278 ± 0.031
2.278AspTyr: 2.278 ± 1.547
0.0AspXaa: 0.0 ± 0.0
Glu
2.278GluAla: 2.278 ± 1.608
0.0GluCys: 0.0 ± 0.0
3.417GluAsp: 3.417 ± 2.412
3.417GluGlu: 3.417 ± 2.321
4.556GluPhe: 4.556 ± 3.094
3.417GluGly: 3.417 ± 2.321
0.0GluHis: 0.0 ± 0.0
3.417GluIle: 3.417 ± 0.743
1.139GluLys: 1.139 ± 0.774
5.695GluLeu: 5.695 ± 2.29
0.0GluMet: 0.0 ± 0.0
2.278GluAsn: 2.278 ± 1.608
1.139GluPro: 1.139 ± 0.774
2.278GluGln: 2.278 ± 0.031
2.278GluArg: 2.278 ± 0.031
5.695GluSer: 5.695 ± 2.443
3.417GluThr: 3.417 ± 2.321
4.556GluVal: 4.556 ± 1.639
5.695GluTrp: 5.695 ± 0.712
4.556GluTyr: 4.556 ± 0.061
0.0GluXaa: 0.0 ± 0.0
Phe
2.278PheAla: 2.278 ± 1.608
0.0PheCys: 0.0 ± 0.0
3.417PheAsp: 3.417 ± 2.321
2.278PheGlu: 2.278 ± 0.031
0.0PhePhe: 0.0 ± 0.0
1.139PheGly: 1.139 ± 0.774
0.0PheHis: 0.0 ± 0.0
3.417PheIle: 3.417 ± 2.321
2.278PheLys: 2.278 ± 1.547
4.556PheLeu: 4.556 ± 0.061
0.0PheMet: 0.0 ± 0.0
0.0PheAsn: 0.0 ± 0.0
2.278PhePro: 2.278 ± 0.031
4.556PheGln: 4.556 ± 1.517
2.278PheArg: 2.278 ± 1.547
2.278PheSer: 2.278 ± 0.031
0.0PheThr: 0.0 ± 0.0
0.0PheVal: 0.0 ± 0.0
1.139PheTrp: 1.139 ± 0.774
1.139PheTyr: 1.139 ± 0.804
0.0PheXaa: 0.0 ± 0.0
Gly
4.556GlyAla: 4.556 ± 1.517
1.139GlyCys: 1.139 ± 0.804
6.834GlyAsp: 6.834 ± 0.092
5.695GlyGlu: 5.695 ± 0.865
4.556GlyPhe: 4.556 ± 1.517
7.973GlyGly: 7.973 ± 2.473
1.139GlyHis: 1.139 ± 0.774
5.695GlyIle: 5.695 ± 0.865
2.278GlyLys: 2.278 ± 1.608
5.695GlyLeu: 5.695 ± 0.865
4.556GlyMet: 4.556 ± 0.061
3.417GlyAsn: 3.417 ± 2.412
5.695GlyPro: 5.695 ± 2.443
4.556GlyGln: 4.556 ± 1.639
3.417GlyArg: 3.417 ± 0.743
9.112GlySer: 9.112 ± 4.855
2.278GlyThr: 2.278 ± 1.608
6.834GlyVal: 6.834 ± 0.092
1.139GlyTrp: 1.139 ± 0.804
5.695GlyTyr: 5.695 ± 0.712
0.0GlyXaa: 0.0 ± 0.0
His
3.417HisAla: 3.417 ± 2.321
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
1.139HisGlu: 1.139 ± 0.774
0.0HisPhe: 0.0 ± 0.0
1.139HisGly: 1.139 ± 0.804
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
1.139HisLys: 1.139 ± 0.774
3.417HisLeu: 3.417 ± 0.743
2.278HisMet: 2.278 ± 1.547
1.139HisAsn: 1.139 ± 0.774
0.0HisPro: 0.0 ± 0.0
1.139HisGln: 1.139 ± 0.774
0.0HisArg: 0.0 ± 0.0
2.278HisSer: 2.278 ± 0.031
0.0HisThr: 0.0 ± 0.0
1.139HisVal: 1.139 ± 0.804
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.556IleAla: 4.556 ± 3.216
0.0IleCys: 0.0 ± 0.0
1.139IleAsp: 1.139 ± 0.804
2.278IleGlu: 2.278 ± 0.031
0.0IlePhe: 0.0 ± 0.0
7.973IleGly: 7.973 ± 2.473
1.139IleHis: 1.139 ± 0.774
1.139IleIle: 1.139 ± 0.804
3.417IleLys: 3.417 ± 0.743
1.139IleLeu: 1.139 ± 0.774
2.278IleMet: 2.278 ± 1.547
0.0IleAsn: 0.0 ± 0.0
3.417IlePro: 3.417 ± 2.321
0.0IleGln: 0.0 ± 0.0
1.139IleArg: 1.139 ± 0.804
4.556IleSer: 4.556 ± 1.639
9.112IleThr: 9.112 ± 3.277
4.556IleVal: 4.556 ± 1.639
0.0IleTrp: 0.0 ± 0.0
2.278IleTyr: 2.278 ± 1.608
0.0IleXaa: 0.0 ± 0.0
Lys
4.556LysAla: 4.556 ± 1.517
0.0LysCys: 0.0 ± 0.0
2.278LysAsp: 2.278 ± 0.031
1.139LysGlu: 1.139 ± 0.774
1.139LysPhe: 1.139 ± 0.774
3.417LysGly: 3.417 ± 2.412
1.139LysHis: 1.139 ± 0.774
5.695LysIle: 5.695 ± 4.02
3.417LysLys: 3.417 ± 2.412
6.834LysLeu: 6.834 ± 3.247
2.278LysMet: 2.278 ± 1.547
1.139LysAsn: 1.139 ± 0.804
2.278LysPro: 2.278 ± 1.608
6.834LysGln: 6.834 ± 1.486
3.417LysArg: 3.417 ± 0.835
6.834LysSer: 6.834 ± 3.064
4.556LysThr: 4.556 ± 1.517
2.278LysVal: 2.278 ± 1.608
0.0LysTrp: 0.0 ± 0.0
3.417LysTyr: 3.417 ± 0.743
0.0LysXaa: 0.0 ± 0.0
Leu
6.834LeuAla: 6.834 ± 3.064
0.0LeuCys: 0.0 ± 0.0
10.251LeuAsp: 10.251 ± 0.926
6.834LeuGlu: 6.834 ± 3.064
2.278LeuPhe: 2.278 ± 0.031
12.528LeuGly: 12.528 ± 0.957
2.278LeuHis: 2.278 ± 1.547
6.834LeuIle: 6.834 ± 0.092
3.417LeuLys: 3.417 ± 2.412
3.417LeuLeu: 3.417 ± 0.743
2.278LeuMet: 2.278 ± 1.547
2.278LeuAsn: 2.278 ± 1.547
3.417LeuPro: 3.417 ± 0.835
4.556LeuGln: 4.556 ± 1.517
5.695LeuArg: 5.695 ± 0.712
5.695LeuSer: 5.695 ± 2.443
3.417LeuThr: 3.417 ± 0.835
4.556LeuVal: 4.556 ± 0.061
2.278LeuTrp: 2.278 ± 1.547
4.556LeuTyr: 4.556 ± 3.094
0.0LeuXaa: 0.0 ± 0.0
Met
2.278MetAla: 2.278 ± 0.031
0.0MetCys: 0.0 ± 0.0
3.417MetAsp: 3.417 ± 0.743
3.417MetGlu: 3.417 ± 2.321
0.0MetPhe: 0.0 ± 0.0
0.0MetGly: 0.0 ± 0.0
1.139MetHis: 1.139 ± 0.804
1.139MetIle: 1.139 ± 0.804
4.556MetLys: 4.556 ± 0.061
2.278MetLeu: 2.278 ± 1.547
1.139MetMet: 1.139 ± 0.774
1.139MetAsn: 1.139 ± 0.774
2.278MetPro: 2.278 ± 1.608
1.139MetGln: 1.139 ± 0.774
2.278MetArg: 2.278 ± 1.547
2.278MetSer: 2.278 ± 1.547
1.139MetThr: 1.139 ± 0.804
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.278AsnAla: 2.278 ± 1.547
0.0AsnCys: 0.0 ± 0.0
1.139AsnAsp: 1.139 ± 0.774
1.139AsnGlu: 1.139 ± 0.804
0.0AsnPhe: 0.0 ± 0.0
1.139AsnGly: 1.139 ± 0.774
1.139AsnHis: 1.139 ± 0.804
3.417AsnIle: 3.417 ± 2.412
2.278AsnLys: 2.278 ± 0.031
2.278AsnLeu: 2.278 ± 1.608
0.0AsnMet: 0.0 ± 0.0
1.139AsnAsn: 1.139 ± 0.774
0.0AsnPro: 0.0 ± 0.0
0.0AsnGln: 0.0 ± 0.0
1.139AsnArg: 1.139 ± 0.774
1.139AsnSer: 1.139 ± 0.774
1.139AsnThr: 1.139 ± 0.774
0.0AsnVal: 0.0 ± 0.0
2.278AsnTrp: 2.278 ± 0.031
2.278AsnTyr: 2.278 ± 0.031
0.0AsnXaa: 0.0 ± 0.0
Pro
2.278ProAla: 2.278 ± 0.031
2.278ProCys: 2.278 ± 1.608
4.556ProAsp: 4.556 ± 1.517
2.278ProGlu: 2.278 ± 1.608
0.0ProPhe: 0.0 ± 0.0
5.695ProGly: 5.695 ± 0.865
2.278ProHis: 2.278 ± 1.547
0.0ProIle: 0.0 ± 0.0
2.278ProLys: 2.278 ± 1.608
3.417ProLeu: 3.417 ± 2.321
1.139ProMet: 1.139 ± 0.804
2.278ProAsn: 2.278 ± 1.608
2.278ProPro: 2.278 ± 1.608
1.139ProGln: 1.139 ± 0.774
0.0ProArg: 0.0 ± 0.0
5.695ProSer: 5.695 ± 0.865
1.139ProThr: 1.139 ± 0.804
3.417ProVal: 3.417 ± 2.321
0.0ProTrp: 0.0 ± 0.0
5.695ProTyr: 5.695 ± 0.712
0.0ProXaa: 0.0 ± 0.0
Gln
3.417GlnAla: 3.417 ± 2.321
0.0GlnCys: 0.0 ± 0.0
0.0GlnAsp: 0.0 ± 0.0
3.417GlnGlu: 3.417 ± 0.743
1.139GlnPhe: 1.139 ± 0.774
3.417GlnGly: 3.417 ± 0.743
0.0GlnHis: 0.0 ± 0.0
1.139GlnIle: 1.139 ± 0.804
2.278GlnLys: 2.278 ± 0.031
4.556GlnLeu: 4.556 ± 1.517
1.139GlnMet: 1.139 ± 0.774
0.0GlnAsn: 0.0 ± 0.0
1.139GlnPro: 1.139 ± 0.774
4.556GlnGln: 4.556 ± 0.061
2.278GlnArg: 2.278 ± 0.031
4.556GlnSer: 4.556 ± 3.216
3.417GlnThr: 3.417 ± 0.835
7.973GlnVal: 7.973 ± 2.473
1.139GlnTrp: 1.139 ± 0.774
1.139GlnTyr: 1.139 ± 0.774
0.0GlnXaa: 0.0 ± 0.0
Arg
3.417ArgAla: 3.417 ± 0.743
0.0ArgCys: 0.0 ± 0.0
0.0ArgAsp: 0.0 ± 0.0
3.417ArgGlu: 3.417 ± 0.835
3.417ArgPhe: 3.417 ± 0.835
7.973ArgGly: 7.973 ± 4.051
0.0ArgHis: 0.0 ± 0.0
2.278ArgIle: 2.278 ± 0.031
3.417ArgLys: 3.417 ± 0.743
5.695ArgLeu: 5.695 ± 2.29
2.278ArgMet: 2.278 ± 1.608
0.0ArgAsn: 0.0 ± 0.0
1.139ArgPro: 1.139 ± 0.804
1.139ArgGln: 1.139 ± 0.804
2.278ArgArg: 2.278 ± 1.547
0.0ArgSer: 0.0 ± 0.0
2.278ArgThr: 2.278 ± 0.031
2.278ArgVal: 2.278 ± 1.547
3.417ArgTrp: 3.417 ± 2.321
2.278ArgTyr: 2.278 ± 0.031
0.0ArgXaa: 0.0 ± 0.0
Ser
7.973SerAla: 7.973 ± 2.473
0.0SerCys: 0.0 ± 0.0
4.556SerAsp: 4.556 ± 1.639
5.695SerGlu: 5.695 ± 0.712
1.139SerPhe: 1.139 ± 0.774
7.973SerGly: 7.973 ± 2.473
2.278SerHis: 2.278 ± 1.547
1.139SerIle: 1.139 ± 0.804
5.695SerLys: 5.695 ± 0.865
6.834SerLeu: 6.834 ± 3.247
1.139SerMet: 1.139 ± 0.804
1.139SerAsn: 1.139 ± 0.774
3.417SerPro: 3.417 ± 0.835
4.556SerGln: 4.556 ± 0.061
5.695SerArg: 5.695 ± 2.443
5.695SerSer: 5.695 ± 0.712
3.417SerThr: 3.417 ± 2.412
1.139SerVal: 1.139 ± 0.774
1.139SerTrp: 1.139 ± 0.774
2.278SerTyr: 2.278 ± 0.031
0.0SerXaa: 0.0 ± 0.0
Thr
5.695ThrAla: 5.695 ± 4.02
0.0ThrCys: 0.0 ± 0.0
4.556ThrAsp: 4.556 ± 1.639
2.278ThrGlu: 2.278 ± 0.031
0.0ThrPhe: 0.0 ± 0.0
4.556ThrGly: 4.556 ± 0.061
0.0ThrHis: 0.0 ± 0.0
3.417ThrIle: 3.417 ± 0.743
2.278ThrLys: 2.278 ± 0.031
4.556ThrLeu: 4.556 ± 0.061
0.0ThrMet: 0.0 ± 0.0
2.278ThrAsn: 2.278 ± 0.031
4.556ThrPro: 4.556 ± 1.639
0.0ThrGln: 0.0 ± 0.0
1.139ThrArg: 1.139 ± 0.774
4.556ThrSer: 4.556 ± 0.061
4.556ThrThr: 4.556 ± 0.061
5.695ThrVal: 5.695 ± 0.865
1.139ThrTrp: 1.139 ± 0.804
3.417ThrTyr: 3.417 ± 0.743
0.0ThrXaa: 0.0 ± 0.0
Val
2.278ValAla: 2.278 ± 1.608
1.139ValCys: 1.139 ± 0.774
3.417ValAsp: 3.417 ± 0.743
3.417ValGlu: 3.417 ± 0.835
2.278ValPhe: 2.278 ± 1.547
7.973ValGly: 7.973 ± 4.051
0.0ValHis: 0.0 ± 0.0
2.278ValIle: 2.278 ± 1.608
3.417ValLys: 3.417 ± 0.835
7.973ValLeu: 7.973 ± 0.682
1.139ValMet: 1.139 ± 0.804
3.417ValAsn: 3.417 ± 2.321
5.695ValPro: 5.695 ± 0.712
4.556ValGln: 4.556 ± 0.061
1.139ValArg: 1.139 ± 0.774
2.278ValSer: 2.278 ± 0.031
2.278ValThr: 2.278 ± 1.608
5.695ValVal: 5.695 ± 0.865
0.0ValTrp: 0.0 ± 0.0
2.278ValTyr: 2.278 ± 0.031
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
1.139TrpAsp: 1.139 ± 0.774
0.0TrpGlu: 0.0 ± 0.0
1.139TrpPhe: 1.139 ± 0.774
1.139TrpGly: 1.139 ± 0.804
1.139TrpHis: 1.139 ± 0.774
0.0TrpIle: 0.0 ± 0.0
5.695TrpLys: 5.695 ± 0.712
5.695TrpLeu: 5.695 ± 0.712
1.139TrpMet: 1.139 ± 0.774
0.0TrpAsn: 0.0 ± 0.0
1.139TrpPro: 1.139 ± 0.774
0.0TrpGln: 0.0 ± 0.0
1.139TrpArg: 1.139 ± 0.774
1.139TrpSer: 1.139 ± 0.774
1.139TrpThr: 1.139 ± 0.774
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
3.417TrpTyr: 3.417 ± 0.743
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.278TyrAla: 2.278 ± 1.547
1.139TyrCys: 1.139 ± 0.774
3.417TyrAsp: 3.417 ± 2.321
4.556TyrGlu: 4.556 ± 1.517
2.278TyrPhe: 2.278 ± 1.547
2.278TyrGly: 2.278 ± 1.547
1.139TyrHis: 1.139 ± 0.804
2.278TyrIle: 2.278 ± 1.608
0.0TyrLys: 0.0 ± 0.0
5.695TyrLeu: 5.695 ± 0.712
2.278TyrMet: 2.278 ± 1.977
0.0TyrAsn: 0.0 ± 0.0
2.278TyrPro: 2.278 ± 1.547
1.139TyrGln: 1.139 ± 0.804
5.695TyrArg: 5.695 ± 2.443
3.417TyrSer: 3.417 ± 2.412
2.278TyrThr: 2.278 ± 0.031
2.278TyrVal: 2.278 ± 0.031
1.139TyrTrp: 1.139 ± 0.774
1.139TyrTyr: 1.139 ± 0.804
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (879 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
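
One way such a protein-level bootstrap could work is sketched below. This is an assumption-laden illustration rather than the database's implementation: the function name is hypothetical, proteins are resampled with replacement as whole sequences, and the spread of the resampled frequencies is summarised by the standard deviation (the page does not state which error measure is used).

```python
import random
import statistics
from collections import Counter


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the error of one dipeptide frequency (in per mille).

    Assumptions: whole proteins are resampled with replacement, and the
    error is the standard deviation of the resampled frequencies.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        counts = Counter()
        for seq in sample:
            counts.update(a + b for a, b in zip(seq, seq[1:]))
        total = sum(counts.values())
        estimates.append(1000 * counts[dipeptide] / total if total else 0.0)
    return statistics.pstdev(estimates)
```

Under these assumptions, a dipeptide that occurs in none of the proteins always yields a frequency of zero in every resample, which is consistent with the `0.0 ± 0.0` entries in the table. With only 2 proteins, as here, the resamples can take very few distinct compositions, so the error estimates are necessarily coarse.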

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski