Amino acid dipepetide frequency for Hubei tombus-like virus 42

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.073AlaAla: 2.073 ± 1.299
2.073AlaCys: 2.073 ± 0.25
4.838AlaAsp: 4.838 ± 0.115
2.764AlaGlu: 2.764 ± 1.733
0.0AlaPhe: 0.0 ± 0.0
0.691AlaGly: 0.691 ± 0.433
4.147AlaHis: 4.147 ± 0.548
3.455AlaIle: 3.455 ± 1.117
7.602AlaLys: 7.602 ± 1.529
6.911AlaLeu: 6.911 ± 0.136
1.382AlaMet: 1.382 ± 0.183
2.764AlaAsn: 2.764 ± 0.365
4.147AlaPro: 4.147 ± 0.501
1.382AlaGln: 1.382 ± 0.866
5.529AlaArg: 5.529 ± 0.318
3.455AlaSer: 3.455 ± 0.068
3.455AlaThr: 3.455 ± 1.117
4.147AlaVal: 4.147 ± 0.548
1.382AlaTrp: 1.382 ± 0.183
2.073AlaTyr: 2.073 ± 0.799
0.0AlaXaa: 0.0 ± 0.0
Cys
1.382CysAla: 1.382 ± 0.866
0.0CysCys: 0.0 ± 0.0
0.691CysAsp: 0.691 ± 0.616
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
1.382CysGly: 1.382 ± 1.232
0.691CysHis: 0.691 ± 0.616
2.073CysIle: 2.073 ± 1.299
0.691CysLys: 0.691 ± 0.616
2.764CysLeu: 2.764 ± 1.414
0.691CysMet: 0.691 ± 0.616
2.073CysAsn: 2.073 ± 0.25
0.691CysPro: 0.691 ± 0.433
2.764CysGln: 2.764 ± 0.365
1.382CysArg: 1.382 ± 0.866
1.382CysSer: 1.382 ± 1.232
1.382CysThr: 1.382 ± 0.866
2.764CysVal: 2.764 ± 0.684
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
6.22AspAla: 6.22 ± 0.751
1.382AspCys: 1.382 ± 0.183
4.147AspAsp: 4.147 ± 1.55
1.382AspGlu: 1.382 ± 0.183
1.382AspPhe: 1.382 ± 0.183
5.529AspGly: 5.529 ± 1.367
2.764AspHis: 2.764 ± 0.365
4.838AspIle: 4.838 ± 0.934
3.455AspLys: 3.455 ± 3.079
1.382AspLeu: 1.382 ± 0.183
0.0AspMet: 0.0 ± 0.0
3.455AspAsn: 3.455 ± 1.117
6.22AspPro: 6.22 ± 0.751
2.073AspGln: 2.073 ± 0.25
1.382AspArg: 1.382 ± 0.866
5.529AspSer: 5.529 ± 1.367
2.073AspThr: 2.073 ± 0.799
1.382AspVal: 1.382 ± 0.183
0.0AspTrp: 0.0 ± 0.0
0.691AspTyr: 0.691 ± 0.433
0.0AspXaa: 0.0 ± 0.0
Glu
2.073GluAla: 2.073 ± 1.299
0.691GluCys: 0.691 ± 0.433
1.382GluAsp: 1.382 ± 0.866
2.073GluGlu: 2.073 ± 0.799
3.455GluPhe: 3.455 ± 0.068
2.073GluGly: 2.073 ± 0.25
0.0GluHis: 0.0 ± 0.0
3.455GluIle: 3.455 ± 0.068
2.073GluLys: 2.073 ± 0.799
1.382GluLeu: 1.382 ± 0.866
0.691GluMet: 0.691 ± 0.616
2.073GluAsn: 2.073 ± 0.799
2.764GluPro: 2.764 ± 0.365
4.147GluGln: 4.147 ± 2.599
1.382GluArg: 1.382 ± 0.183
3.455GluSer: 3.455 ± 0.068
4.838GluThr: 4.838 ± 0.934
2.073GluVal: 2.073 ± 0.799
1.382GluTrp: 1.382 ± 0.183
0.691GluTyr: 0.691 ± 0.433
0.0GluXaa: 0.0 ± 0.0
Phe
3.455PheAla: 3.455 ± 1.117
0.691PheCys: 0.691 ± 0.616
2.764PheAsp: 2.764 ± 0.365
2.073PheGlu: 2.073 ± 0.25
2.073PhePhe: 2.073 ± 1.299
1.382PheGly: 1.382 ± 1.232
0.691PheHis: 0.691 ± 0.433
3.455PheIle: 3.455 ± 0.068
0.691PheLys: 0.691 ± 0.616
2.764PheLeu: 2.764 ± 2.463
0.0PheMet: 0.0 ± 0.0
2.073PheAsn: 2.073 ± 0.25
2.764PhePro: 2.764 ± 0.684
0.0PheGln: 0.0 ± 0.0
2.073PheArg: 2.073 ± 0.25
2.764PheSer: 2.764 ± 1.733
2.764PheThr: 2.764 ± 1.414
1.382PheVal: 1.382 ± 0.183
0.0PheTrp: 0.0 ± 0.0
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
2.764GlyAla: 2.764 ± 1.733
1.382GlyCys: 1.382 ± 0.866
2.073GlyAsp: 2.073 ± 0.799
0.691GlyGlu: 0.691 ± 0.433
1.382GlyPhe: 1.382 ± 1.232
2.073GlyGly: 2.073 ± 1.299
0.691GlyHis: 0.691 ± 0.433
3.455GlyIle: 3.455 ± 0.068
0.691GlyLys: 0.691 ± 0.433
6.22GlyLeu: 6.22 ± 1.347
1.382GlyMet: 1.382 ± 0.234
2.073GlyAsn: 2.073 ± 0.25
3.455GlyPro: 3.455 ± 0.981
4.147GlyGln: 4.147 ± 2.599
2.764GlyArg: 2.764 ± 1.414
3.455GlySer: 3.455 ± 1.117
5.529GlyThr: 5.529 ± 0.318
2.073GlyVal: 2.073 ± 0.799
0.691GlyTrp: 0.691 ± 0.616
2.073GlyTyr: 2.073 ± 1.299
0.0GlyXaa: 0.0 ± 0.0
His
2.764HisAla: 2.764 ± 1.414
2.073HisCys: 2.073 ± 0.799
1.382HisAsp: 1.382 ± 0.866
0.691HisGlu: 0.691 ± 0.616
0.691HisPhe: 0.691 ± 0.433
1.382HisGly: 1.382 ± 0.866
2.073HisHis: 2.073 ± 1.847
1.382HisIle: 1.382 ± 0.866
1.382HisLys: 1.382 ± 0.183
1.382HisLeu: 1.382 ± 0.183
0.691HisMet: 0.691 ± 0.433
0.691HisAsn: 0.691 ± 0.433
4.147HisPro: 4.147 ± 0.501
3.455HisGln: 3.455 ± 0.068
4.838HisArg: 4.838 ± 1.983
2.073HisSer: 2.073 ± 1.299
3.455HisThr: 3.455 ± 1.117
2.764HisVal: 2.764 ± 0.365
0.0HisTrp: 0.0 ± 0.0
1.382HisTyr: 1.382 ± 0.183
0.0HisXaa: 0.0 ± 0.0
Ile
2.764IleAla: 2.764 ± 0.365
2.764IleCys: 2.764 ± 0.684
0.691IleAsp: 0.691 ± 0.433
1.382IleGlu: 1.382 ± 1.232
2.073IlePhe: 2.073 ± 1.299
3.455IleGly: 3.455 ± 0.981
2.073IleHis: 2.073 ± 1.299
8.293IleIle: 8.293 ± 1.002
6.22IleLys: 6.22 ± 1.8
2.764IleLeu: 2.764 ± 0.684
1.382IleMet: 1.382 ± 0.183
2.764IleAsn: 2.764 ± 0.365
2.764IlePro: 2.764 ± 0.365
0.0IleGln: 0.0 ± 0.0
4.147IleArg: 4.147 ± 0.501
6.911IleSer: 6.911 ± 0.913
4.838IleThr: 4.838 ± 0.934
4.147IleVal: 4.147 ± 0.548
2.073IleTrp: 2.073 ± 0.25
2.764IleTyr: 2.764 ± 1.733
0.0IleXaa: 0.0 ± 0.0
Lys
2.073LysAla: 2.073 ± 0.25
0.691LysCys: 0.691 ± 0.433
3.455LysAsp: 3.455 ± 1.117
2.073LysGlu: 2.073 ± 0.25
2.764LysPhe: 2.764 ± 1.414
2.073LysGly: 2.073 ± 0.799
3.455LysHis: 3.455 ± 0.068
2.073LysIle: 2.073 ± 0.25
2.073LysLys: 2.073 ± 0.799
6.22LysLeu: 6.22 ± 2.396
2.764LysMet: 2.764 ± 1.414
0.691LysAsn: 0.691 ± 0.433
2.764LysPro: 2.764 ± 2.463
2.764LysGln: 2.764 ± 1.414
5.529LysArg: 5.529 ± 0.318
2.073LysSer: 2.073 ± 0.25
4.147LysThr: 4.147 ± 1.597
4.147LysVal: 4.147 ± 1.597
0.691LysTrp: 0.691 ± 0.616
2.764LysTyr: 2.764 ± 1.414
0.0LysXaa: 0.0 ± 0.0
Leu
6.911LeuAla: 6.911 ± 0.913
2.764LeuCys: 2.764 ± 0.365
2.764LeuAsp: 2.764 ± 0.684
4.147LeuGlu: 4.147 ± 3.695
1.382LeuPhe: 1.382 ± 0.866
4.838LeuGly: 4.838 ± 1.164
4.838LeuHis: 4.838 ± 1.983
4.147LeuIle: 4.147 ± 1.597
5.529LeuLys: 5.529 ± 0.731
8.984LeuLeu: 8.984 ± 3.81
2.073LeuMet: 2.073 ± 0.799
2.764LeuAsn: 2.764 ± 1.414
4.838LeuPro: 4.838 ± 0.115
2.764LeuGln: 2.764 ± 0.684
7.602LeuArg: 7.602 ± 1.618
4.147LeuSer: 4.147 ± 1.597
6.911LeuThr: 6.911 ± 0.913
4.147LeuVal: 4.147 ± 1.597
0.691LeuTrp: 0.691 ± 0.433
2.073LeuTyr: 2.073 ± 0.799
0.0LeuXaa: 0.0 ± 0.0
Met
2.073MetAla: 2.073 ± 1.299
0.0MetCys: 0.0 ± 0.0
2.073MetAsp: 2.073 ± 0.25
1.382MetGlu: 1.382 ± 0.183
1.382MetPhe: 1.382 ± 0.183
2.073MetGly: 2.073 ± 1.847
0.691MetHis: 0.691 ± 0.616
0.691MetIle: 0.691 ± 0.433
2.764MetLys: 2.764 ± 1.414
0.691MetLeu: 0.691 ± 0.616
0.691MetMet: 0.691 ± 0.616
0.691MetAsn: 0.691 ± 0.433
2.073MetPro: 2.073 ± 0.799
0.691MetGln: 0.691 ± 0.616
1.382MetArg: 1.382 ± 1.232
0.0MetSer: 0.0 ± 0.0
1.382MetThr: 1.382 ± 0.866
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
0.691MetTyr: 0.691 ± 0.433
0.0MetXaa: 0.0 ± 0.0
Asn
2.764AsnAla: 2.764 ± 0.365
0.0AsnCys: 0.0 ± 0.0
1.382AsnAsp: 1.382 ± 0.183
4.838AsnGlu: 4.838 ± 3.032
1.382AsnPhe: 1.382 ± 0.183
2.073AsnGly: 2.073 ± 0.799
2.073AsnHis: 2.073 ± 1.299
4.147AsnIle: 4.147 ± 0.548
0.691AsnLys: 0.691 ± 0.616
4.838AsnLeu: 4.838 ± 1.164
0.691AsnMet: 0.691 ± 0.433
0.0AsnAsn: 0.0 ± 0.0
1.382AsnPro: 1.382 ± 0.183
2.073AsnGln: 2.073 ± 0.799
2.764AsnArg: 2.764 ± 1.414
2.073AsnSer: 2.073 ± 0.25
2.073AsnThr: 2.073 ± 0.25
0.691AsnVal: 0.691 ± 0.433
0.0AsnTrp: 0.0 ± 0.0
1.382AsnTyr: 1.382 ± 0.183
0.0AsnXaa: 0.0 ± 0.0
Pro
2.764ProAla: 2.764 ± 0.365
0.0ProCys: 0.0 ± 0.0
3.455ProAsp: 3.455 ± 0.068
1.382ProGlu: 1.382 ± 0.183
2.764ProPhe: 2.764 ± 1.414
4.147ProGly: 4.147 ± 1.55
1.382ProHis: 1.382 ± 1.232
4.147ProIle: 4.147 ± 0.501
2.073ProLys: 2.073 ± 0.25
5.529ProLeu: 5.529 ± 1.78
1.382ProMet: 1.382 ± 0.183
2.073ProAsn: 2.073 ± 0.799
4.147ProPro: 4.147 ± 0.548
4.838ProGln: 4.838 ± 1.164
2.764ProArg: 2.764 ± 1.414
6.911ProSer: 6.911 ± 1.185
5.529ProThr: 5.529 ± 1.367
2.764ProVal: 2.764 ± 0.684
0.691ProTrp: 0.691 ± 0.433
3.455ProTyr: 3.455 ± 0.981
0.0ProXaa: 0.0 ± 0.0
Gln
2.764GlnAla: 2.764 ± 0.684
0.691GlnCys: 0.691 ± 0.616
2.073GlnAsp: 2.073 ± 0.799
2.073GlnGlu: 2.073 ± 0.25
4.147GlnPhe: 4.147 ± 0.501
2.073GlnGly: 2.073 ± 1.299
2.764GlnHis: 2.764 ± 0.684
3.455GlnIle: 3.455 ± 1.117
0.691GlnLys: 0.691 ± 0.616
2.764GlnLeu: 2.764 ± 1.414
2.073GlnMet: 2.073 ± 0.612
1.382GlnAsn: 1.382 ± 0.183
2.764GlnPro: 2.764 ± 0.365
1.382GlnGln: 1.382 ± 0.183
3.455GlnArg: 3.455 ± 0.068
4.147GlnSer: 4.147 ± 2.599
1.382GlnThr: 1.382 ± 0.866
0.691GlnVal: 0.691 ± 0.433
2.073GlnTrp: 2.073 ± 0.25
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
4.838ArgAla: 4.838 ± 0.934
2.764ArgCys: 2.764 ± 0.365
4.147ArgAsp: 4.147 ± 0.501
3.455ArgGlu: 3.455 ± 1.117
2.073ArgPhe: 2.073 ± 0.799
1.382ArgGly: 1.382 ± 0.866
3.455ArgHis: 3.455 ± 1.117
3.455ArgIle: 3.455 ± 0.068
5.529ArgLys: 5.529 ± 1.78
6.911ArgLeu: 6.911 ± 2.234
2.073ArgMet: 2.073 ± 0.25
4.147ArgAsn: 4.147 ± 0.548
5.529ArgPro: 5.529 ± 2.416
2.073ArgGln: 2.073 ± 1.299
5.529ArgArg: 5.529 ± 1.367
5.529ArgSer: 5.529 ± 2.416
5.529ArgThr: 5.529 ± 0.731
3.455ArgVal: 3.455 ± 0.981
1.382ArgTrp: 1.382 ± 0.183
2.073ArgTyr: 2.073 ± 1.847
0.0ArgXaa: 0.0 ± 0.0
Ser
4.147SerAla: 4.147 ± 1.597
0.0SerCys: 0.0 ± 0.0
3.455SerAsp: 3.455 ± 2.166
4.147SerGlu: 4.147 ± 2.599
0.0SerPhe: 0.0 ± 0.0
7.602SerGly: 7.602 ± 2.667
2.073SerHis: 2.073 ± 1.299
2.764SerIle: 2.764 ± 0.365
4.147SerLys: 4.147 ± 1.597
8.293SerLeu: 8.293 ± 1.096
0.691SerMet: 0.691 ± 0.616
2.073SerAsn: 2.073 ± 1.299
3.455SerPro: 3.455 ± 2.03
4.147SerGln: 4.147 ± 0.501
8.293SerArg: 8.293 ± 3.1
10.366SerSer: 10.366 ± 1.252
5.529SerThr: 5.529 ± 3.465
3.455SerVal: 3.455 ± 0.068
0.0SerTrp: 0.0 ± 0.0
5.529SerTyr: 5.529 ± 3.878
0.0SerXaa: 0.0 ± 0.0
Thr
7.602ThrAla: 7.602 ± 1.529
2.073ThrCys: 2.073 ± 0.25
6.22ThrAsp: 6.22 ± 0.751
4.147ThrGlu: 4.147 ± 1.55
2.764ThrPhe: 2.764 ± 0.684
2.073ThrGly: 2.073 ± 1.299
2.073ThrHis: 2.073 ± 0.25
4.838ThrIle: 4.838 ± 0.115
4.147ThrLys: 4.147 ± 0.501
4.838ThrLeu: 4.838 ± 0.934
1.382ThrMet: 1.382 ± 0.183
0.0ThrAsn: 0.0 ± 0.0
5.529ThrPro: 5.529 ± 0.731
0.0ThrGln: 0.0 ± 0.0
6.22ThrArg: 6.22 ± 0.298
8.293ThrSer: 8.293 ± 0.047
10.366ThrThr: 10.366 ± 1.895
3.455ThrVal: 3.455 ± 1.117
0.0ThrTrp: 0.0 ± 0.0
0.691ThrTyr: 0.691 ± 0.433
0.0ThrXaa: 0.0 ± 0.0
Val
1.382ValAla: 1.382 ± 0.183
2.073ValCys: 2.073 ± 0.799
4.838ValAsp: 4.838 ± 2.213
2.073ValGlu: 2.073 ± 1.847
1.382ValPhe: 1.382 ± 0.183
3.455ValGly: 3.455 ± 0.068
0.691ValHis: 0.691 ± 0.433
2.073ValIle: 2.073 ± 0.25
3.455ValLys: 3.455 ± 0.981
4.147ValLeu: 4.147 ± 0.501
0.691ValMet: 0.691 ± 0.616
2.073ValAsn: 2.073 ± 0.799
1.382ValPro: 1.382 ± 0.183
3.455ValGln: 3.455 ± 0.068
6.911ValArg: 6.911 ± 2.234
4.147ValSer: 4.147 ± 0.548
2.073ValThr: 2.073 ± 0.25
4.838ValVal: 4.838 ± 1.983
0.0ValTrp: 0.0 ± 0.0
0.691ValTyr: 0.691 ± 0.616
0.0ValXaa: 0.0 ± 0.0
Trp
0.691TrpAla: 0.691 ± 0.433
0.0TrpCys: 0.0 ± 0.0
1.382TrpAsp: 1.382 ± 0.183
1.382TrpGlu: 1.382 ± 0.866
2.073TrpPhe: 2.073 ± 0.25
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.691TrpIle: 0.691 ± 0.433
0.0TrpLys: 0.0 ± 0.0
0.691TrpLeu: 0.691 ± 0.616
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.691TrpArg: 0.691 ± 0.616
0.691TrpSer: 0.691 ± 0.433
1.382TrpThr: 1.382 ± 1.232
2.073TrpVal: 2.073 ± 0.799
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.764TyrAla: 2.764 ± 0.365
0.691TyrCys: 0.691 ± 0.616
2.073TyrAsp: 2.073 ± 0.25
0.0TyrGlu: 0.0 ± 0.0
0.691TyrPhe: 0.691 ± 0.616
0.0TyrGly: 0.0 ± 0.0
2.073TyrHis: 2.073 ± 0.799
1.382TyrIle: 1.382 ± 0.183
1.382TyrLys: 1.382 ± 0.183
4.838TyrLeu: 4.838 ± 0.115
0.0TyrMet: 0.0 ± 0.0
3.455TyrAsn: 3.455 ± 0.068
1.382TyrPro: 1.382 ± 0.183
0.691TyrGln: 0.691 ± 0.433
0.691TyrArg: 0.691 ± 0.433
2.764TyrSer: 2.764 ± 2.463
2.073TyrThr: 2.073 ± 0.799
1.382TyrVal: 1.382 ± 0.183
0.691TyrTrp: 0.691 ± 0.616
1.382TyrTyr: 1.382 ± 0.866
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1448 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski