Amino acid dipeptide frequency for Hubei picorna-like virus 14

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
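The counting described above can be sketched in Python as follows. This is a minimal illustration, not the code used by the database: the function names, the one-letter alphabet, and the toy sequences are assumptions, and dipeptides are assumed to be counted as overlapping pairs within each protein (not across protein boundaries). That assumption is at least consistent with the table, since 2 proteins with 2851 residues give 2849 pairs and 1/2849 is about 0.351‰, the smallest non-zero value listed.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids (one-letter codes); "X" stands for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"

# Expected frequency if all 400 standard dipeptides were equally likely.
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: any non-standard or unknown residue is mapped to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2 within a single protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"   # would be shown in red
    if freq_permille < EXPECTED_PERMILLE:
        return "under"  # would be shown in blue
    return "expected"


# Toy input (hypothetical sequences, not the viral proteome).
freqs = dipeptide_permille(["MKLLSA", "AAKL"])
print(freqs["KL"], classify(freqs["KL"]))  # KL occurs 2 times in 8 pairs
```

With real data, the input would be the list of protein sequences of the proteome; the three-letter names used in the table (Ala, Cys, ...) correspond to the one-letter codes used here.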

All values are given in per mille (‰); multiply by 10^-3 to obtain a fraction.
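As a worked conversion, taking the AlaAla entry of the table and the uniform expectation for 400 dipeptides:

```latex
4.561\,\text{\textperthousand} = 4.561 \times 10^{-3} = 0.004561 = 0.4561\,\%
\qquad
\frac{1}{400} = 2.5 \times 10^{-3} = 2.5\,\text{\textperthousand} = 0.25\,\%
```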

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.561 ± 0.141
AlaCys: 0.702 ± 0.364
AlaAsp: 2.807 ± 0.829
AlaGlu: 0.702 ± 0.364
AlaPhe: 4.561 ± 0.141
AlaGly: 2.456 ± 0.606
AlaHis: 1.404 ± 0.525
AlaIle: 4.912 ± 1.92
AlaLys: 3.509 ± 0.687
AlaLeu: 4.912 ± 0.586
AlaMet: 0.702 ± 0.364
AlaAsn: 3.509 ± 1.313
AlaPro: 3.158 ± 0.242
AlaGln: 3.158 ± 1.495
AlaArg: 2.456 ± 0.647
AlaSer: 4.211 ± 0.323
AlaThr: 2.456 ± 0.02
AlaVal: 4.211 ± 0.323
AlaTrp: 0.351 ± 0.445
AlaTyr: 1.754 ± 0.283
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.754 ± 0.283
CysCys: 0.0 ± 0.0
CysAsp: 1.053 ± 0.707
CysGlu: 1.754 ± 0.283
CysPhe: 0.351 ± 0.182
CysGly: 1.053 ± 0.081
CysHis: 0.0 ± 0.0
CysIle: 1.754 ± 0.909
CysLys: 1.404 ± 0.728
CysLeu: 1.404 ± 0.525
CysMet: 0.702 ± 0.364
CysAsn: 1.053 ± 0.081
CysPro: 1.754 ± 0.343
CysGln: 0.0 ± 0.0
CysArg: 1.053 ± 0.081
CysSer: 1.404 ± 0.728
CysThr: 0.702 ± 0.364
CysVal: 1.404 ± 0.728
CysTrp: 0.0 ± 0.0
CysTyr: 1.404 ± 0.101
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.807 ± 0.202
AspCys: 2.105 ± 0.465
AspAsp: 4.211 ± 0.303
AspGlu: 3.158 ± 0.869
AspPhe: 3.86 ± 0.121
AspGly: 3.158 ± 0.869
AspHis: 1.754 ± 0.343
AspIle: 3.86 ± 0.121
AspLys: 4.561 ± 0.768
AspLeu: 3.509 ± 1.313
AspMet: 2.105 ± 1.006
AspAsn: 0.702 ± 0.364
AspPro: 1.053 ± 0.081
AspGln: 2.456 ± 0.606
AspArg: 1.754 ± 0.343
AspSer: 3.509 ± 0.566
AspThr: 5.263 ± 0.404
AspVal: 5.263 ± 2.283
AspTrp: 0.702 ± 0.364
AspTyr: 5.263 ± 0.404
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.158 ± 0.242
GluCys: 0.702 ± 0.364
GluAsp: 2.807 ± 0.829
GluGlu: 3.509 ± 1.192
GluPhe: 3.509 ± 0.566
GluGly: 3.158 ± 0.242
GluHis: 0.0 ± 0.0
GluIle: 4.211 ± 0.93
GluLys: 5.263 ± 2.102
GluLeu: 4.912 ± 1.294
GluMet: 2.105 ± 0.465
GluAsn: 1.053 ± 0.081
GluPro: 1.053 ± 0.081
GluGln: 3.86 ± 2.001
GluArg: 2.456 ± 1.273
GluSer: 4.561 ± 0.141
GluThr: 3.158 ± 0.242
GluVal: 2.807 ± 0.424
GluTrp: 1.053 ± 0.081
GluTyr: 3.509 ± 0.566
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.404 ± 0.728
PheCys: 0.351 ± 0.182
PheAsp: 3.86 ± 0.121
PheGlu: 4.912 ± 1.294
PhePhe: 1.053 ± 0.081
PheGly: 2.105 ± 0.162
PheHis: 1.404 ± 1.152
PheIle: 5.263 ± 0.849
PheLys: 3.509 ± 0.687
PheLeu: 3.158 ± 0.242
PheMet: 0.351 ± 0.445
PheAsn: 2.807 ± 0.829
PhePro: 0.702 ± 0.263
PheGln: 0.702 ± 0.263
PheArg: 2.807 ± 0.424
PheSer: 2.456 ± 0.02
PheThr: 3.86 ± 1.758
PheVal: 4.211 ± 0.303
PheTrp: 0.0 ± 0.0
PheTyr: 1.053 ± 0.707
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.456 ± 0.606
GlyCys: 2.105 ± 0.465
GlyAsp: 3.509 ± 0.566
GlyGlu: 2.456 ± 0.606
GlyPhe: 2.105 ± 0.465
GlyGly: 2.105 ± 1.414
GlyHis: 0.351 ± 0.445
GlyIle: 3.509 ± 1.192
GlyLys: 4.211 ± 0.93
GlyLeu: 4.211 ± 0.303
GlyMet: 1.754 ± 0.97
GlyAsn: 3.509 ± 0.566
GlyPro: 0.351 ± 0.182
GlyGln: 2.456 ± 0.606
GlyArg: 2.807 ± 1.051
GlySer: 3.86 ± 2.384
GlyThr: 1.754 ± 0.97
GlyVal: 2.807 ± 1.051
GlyTrp: 0.351 ± 0.445
GlyTyr: 1.404 ± 0.525
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.351 ± 0.445
HisCys: 0.351 ± 0.182
HisAsp: 0.351 ± 0.182
HisGlu: 0.702 ± 0.263
HisPhe: 1.053 ± 0.707
HisGly: 1.053 ± 0.081
HisHis: 0.351 ± 0.182
HisIle: 1.404 ± 0.101
HisLys: 1.754 ± 0.909
HisLeu: 0.702 ± 0.263
HisMet: 0.702 ± 0.364
HisAsn: 0.702 ± 0.364
HisPro: 0.351 ± 0.445
HisGln: 1.053 ± 0.546
HisArg: 2.105 ± 0.465
HisSer: 2.105 ± 0.788
HisThr: 1.754 ± 0.97
HisVal: 1.404 ± 1.152
HisTrp: 0.351 ± 0.182
HisTyr: 1.053 ± 0.546
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.965 ± 0.04
IleCys: 1.053 ± 0.081
IleAsp: 3.86 ± 1.131
IleGlu: 4.211 ± 1.556
IlePhe: 1.053 ± 0.546
IleGly: 3.158 ± 1.011
IleHis: 1.053 ± 0.081
IleIle: 6.316 ± 2.021
IleLys: 5.614 ± 1.657
IleLeu: 5.614 ± 0.848
IleMet: 2.105 ± 1.091
IleAsn: 6.667 ± 0.95
IlePro: 4.211 ± 0.95
IleGln: 1.754 ± 0.283
IleArg: 2.807 ± 0.202
IleSer: 3.158 ± 0.384
IleThr: 4.211 ± 0.303
IleVal: 5.614 ± 0.404
IleTrp: 1.404 ± 0.728
IleTyr: 3.86 ± 2.001
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.456 ± 1.233
LysCys: 2.456 ± 1.273
LysAsp: 5.263 ± 0.223
LysGlu: 3.509 ± 0.566
LysPhe: 2.456 ± 0.02
LysGly: 3.86 ± 0.748
LysHis: 1.053 ± 0.081
LysIle: 6.316 ± 0.768
LysLys: 3.509 ± 0.566
LysLeu: 8.07 ± 1.678
LysMet: 2.105 ± 1.091
LysAsn: 3.86 ± 1.374
LysPro: 1.754 ± 0.343
LysGln: 1.053 ± 0.081
LysArg: 2.456 ± 0.02
LysSer: 4.211 ± 1.556
LysThr: 3.509 ± 0.06
LysVal: 3.158 ± 1.011
LysTrp: 1.053 ± 0.546
LysTyr: 4.211 ± 2.183
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.316 ± 1.395
LeuCys: 0.702 ± 0.364
LeuAsp: 7.018 ± 0.506
LeuGlu: 4.561 ± 2.365
LeuPhe: 3.509 ± 0.06
LeuGly: 3.86 ± 0.505
LeuHis: 2.105 ± 1.091
LeuIle: 4.211 ± 0.303
LeuLys: 4.912 ± 1.92
LeuLeu: 4.912 ± 0.041
LeuMet: 3.158 ± 0.203
LeuAsn: 4.561 ± 0.485
LeuPro: 4.561 ± 0.768
LeuGln: 2.105 ± 0.465
LeuArg: 5.965 ± 0.04
LeuSer: 8.421 ± 0.646
LeuThr: 3.158 ± 0.869
LeuVal: 7.018 ± 1.374
LeuTrp: 1.053 ± 0.546
LeuTyr: 3.86 ± 0.121
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.754 ± 0.283
MetCys: 0.702 ± 0.889
MetAsp: 1.053 ± 0.546
MetGlu: 1.404 ± 0.728
MetPhe: 0.702 ± 0.364
MetGly: 1.053 ± 0.707
MetHis: 0.0 ± 0.0
MetIle: 1.053 ± 0.546
MetLys: 2.807 ± 1.455
MetLeu: 2.456 ± 0.647
MetMet: 0.702 ± 0.364
MetAsn: 2.105 ± 0.788
MetPro: 1.404 ± 0.101
MetGln: 0.351 ± 0.182
MetArg: 2.105 ± 0.465
MetSer: 1.754 ± 0.283
MetThr: 1.053 ± 0.707
MetVal: 1.754 ± 0.283
MetTrp: 0.0 ± 0.0
MetTyr: 2.456 ± 1.233
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.263 ± 0.223
AsnCys: 0.702 ± 0.364
AsnAsp: 1.404 ± 0.525
AsnGlu: 3.158 ± 0.384
AsnPhe: 2.456 ± 0.02
AsnGly: 2.105 ± 0.788
AsnHis: 1.754 ± 0.909
AsnIle: 3.158 ± 0.384
AsnLys: 1.754 ± 0.909
AsnLeu: 4.211 ± 2.183
AsnMet: 1.404 ± 0.101
AsnAsn: 3.158 ± 1.637
AsnPro: 4.561 ± 0.485
AsnGln: 1.404 ± 0.101
AsnArg: 3.158 ± 0.869
AsnSer: 3.86 ± 0.121
AsnThr: 2.807 ± 0.202
AsnVal: 4.561 ± 0.141
AsnTrp: 0.351 ± 0.182
AsnTyr: 3.158 ± 0.384
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.158 ± 0.242
ProCys: 0.351 ± 0.445
ProAsp: 2.807 ± 1.051
ProGlu: 2.456 ± 1.273
ProPhe: 4.211 ± 2.829
ProGly: 2.105 ± 1.414
ProHis: 0.702 ± 0.889
ProIle: 2.807 ± 0.202
ProLys: 1.053 ± 0.546
ProLeu: 2.807 ± 0.202
ProMet: 1.404 ± 0.525
ProAsn: 1.754 ± 0.343
ProPro: 1.404 ± 1.152
ProGln: 1.053 ± 0.081
ProArg: 1.404 ± 0.525
ProSer: 1.754 ± 0.283
ProThr: 3.158 ± 0.869
ProVal: 3.158 ± 0.384
ProTrp: 0.702 ± 0.263
ProTyr: 1.754 ± 0.343
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.404 ± 1.152
GlnCys: 0.351 ± 0.182
GlnAsp: 1.754 ± 0.283
GlnGlu: 2.456 ± 0.647
GlnPhe: 1.754 ± 0.343
GlnGly: 2.105 ± 0.162
GlnHis: 1.053 ± 0.707
GlnIle: 2.105 ± 0.162
GlnLys: 1.404 ± 0.728
GlnLeu: 2.456 ± 1.273
GlnMet: 0.351 ± 0.182
GlnAsn: 1.404 ± 0.525
GlnPro: 0.351 ± 0.445
GlnGln: 1.754 ± 0.343
GlnArg: 3.509 ± 0.566
GlnSer: 1.404 ± 0.101
GlnThr: 1.053 ± 0.081
GlnVal: 3.158 ± 0.869
GlnTrp: 0.351 ± 0.445
GlnTyr: 3.509 ± 1.192
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.404 ± 0.101
ArgCys: 1.053 ± 0.081
ArgAsp: 1.404 ± 0.101
ArgGlu: 2.807 ± 1.455
ArgPhe: 2.105 ± 0.162
ArgGly: 2.807 ± 0.202
ArgHis: 0.702 ± 0.263
ArgIle: 3.509 ± 1.192
ArgLys: 4.561 ± 0.485
ArgLeu: 7.719 ± 0.384
ArgMet: 1.053 ± 0.546
ArgAsn: 2.105 ± 1.091
ArgPro: 2.807 ± 1.051
ArgGln: 3.158 ± 1.011
ArgArg: 3.158 ± 0.384
ArgSer: 2.807 ± 0.829
ArgThr: 3.509 ± 3.193
ArgVal: 3.509 ± 1.313
ArgTrp: 0.0 ± 0.0
ArgTyr: 0.702 ± 0.364
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.754 ± 0.343
SerCys: 2.807 ± 1.051
SerAsp: 7.018 ± 3.253
SerGlu: 5.965 ± 0.667
SerPhe: 3.158 ± 0.242
SerGly: 4.211 ± 0.303
SerHis: 1.754 ± 0.283
SerIle: 6.316 ± 0.142
SerLys: 3.86 ± 1.374
SerLeu: 7.018 ± 0.121
SerMet: 1.754 ± 0.343
SerAsn: 4.211 ± 0.93
SerPro: 2.105 ± 0.788
SerGln: 2.105 ± 0.465
SerArg: 2.456 ± 0.647
SerSer: 5.263 ± 0.849
SerThr: 4.211 ± 2.829
SerVal: 3.86 ± 0.121
SerTrp: 0.351 ± 0.182
SerTyr: 2.105 ± 0.788
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.158 ± 0.869
ThrCys: 0.702 ± 0.263
ThrAsp: 3.86 ± 0.505
ThrGlu: 2.456 ± 0.606
ThrPhe: 2.105 ± 0.788
ThrGly: 2.456 ± 1.233
ThrHis: 1.053 ± 0.707
ThrIle: 3.86 ± 0.505
ThrLys: 3.509 ± 0.687
ThrLeu: 6.667 ± 1.577
ThrMet: 2.456 ± 0.606
ThrAsn: 3.509 ± 0.06
ThrPro: 3.509 ± 0.687
ThrGln: 1.404 ± 0.525
ThrArg: 1.754 ± 0.343
ThrSer: 4.211 ± 1.576
ThrThr: 0.702 ± 0.263
ThrVal: 3.509 ± 3.193
ThrTrp: 0.351 ± 0.182
ThrTyr: 3.509 ± 1.313
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.561 ± 0.768
ValCys: 1.754 ± 0.909
ValAsp: 4.561 ± 1.394
ValGlu: 4.561 ± 0.141
ValPhe: 1.404 ± 0.728
ValGly: 2.807 ± 0.424
ValHis: 1.404 ± 0.728
ValIle: 4.912 ± 1.839
ValLys: 5.614 ± 0.404
ValLeu: 7.719 ± 0.869
ValMet: 0.351 ± 0.182
ValAsn: 3.86 ± 0.121
ValPro: 1.754 ± 1.596
ValGln: 2.105 ± 1.414
ValArg: 3.86 ± 0.121
ValSer: 7.018 ± 3.88
ValThr: 3.158 ± 0.869
ValVal: 6.316 ± 1.111
ValTrp: 1.754 ± 0.97
ValTyr: 2.105 ± 0.162
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.351 ± 0.182
TrpCys: 0.0 ± 0.0
TrpAsp: 0.702 ± 0.364
TrpGlu: 0.351 ± 0.182
TrpPhe: 0.702 ± 0.364
TrpGly: 0.351 ± 0.182
TrpHis: 0.702 ± 0.364
TrpIle: 1.053 ± 0.707
TrpLys: 0.702 ± 0.889
TrpLeu: 0.351 ± 0.445
TrpMet: 0.351 ± 0.445
TrpAsn: 0.351 ± 0.182
TrpPro: 0.702 ± 0.364
TrpGln: 0.351 ± 0.182
TrpArg: 0.351 ± 0.445
TrpSer: 1.404 ± 0.101
TrpThr: 0.702 ± 0.364
TrpVal: 0.702 ± 0.263
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.702 ± 0.364
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.456 ± 0.647
TyrCys: 1.053 ± 0.546
TyrAsp: 1.754 ± 0.97
TyrGlu: 1.754 ± 0.909
TyrPhe: 3.86 ± 0.121
TyrGly: 2.105 ± 0.162
TyrHis: 1.053 ± 0.546
TyrIle: 3.158 ± 1.637
TyrLys: 2.807 ± 0.829
TyrLeu: 3.509 ± 0.06
TyrMet: 0.702 ± 0.889
TyrAsn: 3.509 ± 0.566
TyrPro: 2.456 ± 0.02
TyrGln: 1.404 ± 0.101
TyrArg: 2.456 ± 0.647
TyrSer: 4.912 ± 1.212
TyrThr: 4.561 ± 0.141
TyrVal: 3.158 ± 0.242
TyrTrp: 0.702 ± 0.263
TyrTyr: 3.509 ± 0.566
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2851 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
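A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: the source does not say how the error is summarized, so the standard deviation of the bootstrap estimates is assumed here, and the function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(proteins):
    """Count overlapping dipeptides within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    return counts


def permille(counts, dipeptide):
    """Frequency of one dipeptide in per mille of all counted pairs."""
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Resample whole proteins with replacement n_rounds times and
    return the spread of the resulting frequency estimates.

    Assumption: the spread is reported as a population standard deviation.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(permille(dipeptide_counts(sample), dipeptide))
    return statistics.pstdev(estimates)


# Toy input (hypothetical sequences, not the viral proteome).
toy = ["MKLLSA", "AAKL"]
point = permille(dipeptide_counts(toy), "KL")
error = bootstrap_error(toy, "KL")
print(f"KL: {point:.3f} ± {error:.3f}")
```

Because whole proteins are resampled, a proteome with only 2 proteins yields very few distinct resamples, which may explain why many error values in the table repeat.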

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski