Amino acid dipeptide frequency for Beihai picorna-like virus 42

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, the frequency of each ordered pair of adjacent amino acids.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
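The counting described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build the table: the function name `dipeptide_permille`, the use of overlapping two-residue windows, the normalization by the total number of dipeptides, and the toy sequences are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"           # X (Xaa) stands for any non-standard or unknown residue

# Expected frequency if all 400 standard dipeptides were equally likely.
EXPECTED_PERMILLE = 1000.0 / 400    # 2.5 per mille, i.e. 0.25%


def dipeptide_permille(sequences):
    """Return the frequency (in per mille) of all 441 dipeptides.

    Assumption: dipeptides are counted with overlapping windows inside each
    protein and normalized by the total number of dipeptides counted.
    """
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard residues to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Hypothetical toy input, not the real proteome.
toy = ["ACAC", "AAX"]
freqs = dipeptide_permille(toy)
print(freqs["AC"], EXPECTED_PERMILLE)
```

With real data, the two protein sequences of the proteome would be passed in place of `toy`.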

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
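As a small worked example, the snippet below converts a few per mille values taken from the table into fractions and compares them with the uniform expectation of 2.5‰. The helper names `to_fraction` and `classify` are hypothetical, and using the uniform expectation as the exact threshold for the red/blue marking is an assumption based on the description above.

```python
EXPECTED_PERMILLE = 1000.0 / 400  # uniform expectation over 400 dipeptides (2.5 per mille)


def to_fraction(permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return permille * 1e-3


def classify(permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide relative to the expected frequency (assumed threshold)."""
    if permille > expected:
        return "over"    # would be marked in red
    if permille < expected:
        return "under"   # would be marked in blue
    return "expected"


# Values copied from the table (per mille).
table_values = {"AlaAla": 8.617, "CysCys": 0.392, "LeuAla": 9.009}

for name, value in table_values.items():
    print(name, to_fraction(value), classify(value))
```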

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
8.617AlaAla: 8.617 ± 2.411
0.392AlaCys: 0.392 ± 0.418
4.309AlaAsp: 4.309 ± 1.514
4.309AlaGlu: 4.309 ± 0.897
2.35AlaPhe: 2.35 ± 0.575
7.834AlaGly: 7.834 ± 2.192
1.567AlaHis: 1.567 ± 0.438
4.7AlaIle: 4.7 ± 1.15
2.35AlaLys: 2.35 ± 0.575
7.442AlaLeu: 7.442 ± 1.307
1.175AlaMet: 1.175 ± 0.021
3.134AlaAsn: 3.134 ± 0.356
4.309AlaPro: 4.309 ± 2.13
1.175AlaGln: 1.175 ± 0.021
3.134AlaArg: 3.134 ± 0.972
6.659AlaSer: 6.659 ± 0.939
4.309AlaThr: 4.309 ± 0.897
3.917AlaVal: 3.917 ± 1.712
1.958AlaTrp: 1.958 ± 0.376
1.567AlaTyr: 1.567 ± 0.178
0.0AlaXaa: 0.0 ± 0.0
Cys
0.783CysAla: 0.783 ± 0.397
0.392CysCys: 0.392 ± 0.418
0.392CysAsp: 0.392 ± 0.199
0.392CysGlu: 0.392 ± 0.199
0.392CysPhe: 0.392 ± 0.199
2.35CysGly: 2.35 ± 0.041
0.0CysHis: 0.0 ± 0.0
0.783CysIle: 0.783 ± 0.397
1.175CysLys: 1.175 ± 0.021
0.392CysLeu: 0.392 ± 0.418
0.392CysMet: 0.392 ± 0.418
0.392CysAsn: 0.392 ± 0.199
0.783CysPro: 0.783 ± 0.397
0.0CysGln: 0.0 ± 0.0
1.175CysArg: 1.175 ± 0.596
1.958CysSer: 1.958 ± 0.993
0.783CysThr: 0.783 ± 0.397
1.175CysVal: 1.175 ± 0.021
0.0CysTrp: 0.0 ± 0.0
0.783CysTyr: 0.783 ± 0.397
0.0CysXaa: 0.0 ± 0.0
Asp
3.917AspAla: 3.917 ± 1.096
0.0AspCys: 0.0 ± 0.0
2.742AspAsp: 2.742 ± 0.459
4.309AspGlu: 4.309 ± 0.952
4.309AspPhe: 4.309 ± 0.897
5.092AspGly: 5.092 ± 0.5
1.175AspHis: 1.175 ± 0.596
2.742AspIle: 2.742 ± 0.459
2.35AspLys: 2.35 ± 1.191
6.267AspLeu: 6.267 ± 1.944
3.134AspMet: 3.134 ± 0.972
3.134AspAsn: 3.134 ± 0.972
4.7AspPro: 4.7 ± 0.699
1.958AspGln: 1.958 ± 0.993
2.742AspArg: 2.742 ± 0.774
4.309AspSer: 4.309 ± 0.952
3.134AspThr: 3.134 ± 0.877
4.7AspVal: 4.7 ± 0.083
0.392AspTrp: 0.392 ± 0.199
3.525AspTyr: 3.525 ± 1.171
0.0AspXaa: 0.0 ± 0.0
Glu
5.092GluAla: 5.092 ± 1.349
1.175GluCys: 1.175 ± 0.596
4.309GluAsp: 4.309 ± 0.897
5.092GluGlu: 5.092 ± 1.349
1.958GluPhe: 1.958 ± 0.376
0.392GluGly: 0.392 ± 0.199
1.175GluHis: 1.175 ± 0.596
5.484GluIle: 5.484 ± 0.315
3.917GluLys: 3.917 ± 0.753
2.35GluLeu: 2.35 ± 0.575
1.175GluMet: 1.175 ± 0.187
3.525GluAsn: 3.525 ± 0.062
1.958GluPro: 1.958 ± 0.376
1.175GluGln: 1.175 ± 0.021
3.134GluArg: 3.134 ± 0.877
4.7GluSer: 4.7 ± 0.699
1.567GluThr: 1.567 ± 0.438
4.7GluVal: 4.7 ± 0.534
0.783GluTrp: 0.783 ± 0.397
2.742GluTyr: 2.742 ± 0.157
0.0GluXaa: 0.0 ± 0.0
Phe
3.134PheAla: 3.134 ± 1.493
1.175PheCys: 1.175 ± 0.021
3.525PheAsp: 3.525 ± 1.171
3.134PheGlu: 3.134 ± 0.356
3.525PhePhe: 3.525 ± 0.062
5.484PheGly: 5.484 ± 0.315
1.175PheHis: 1.175 ± 0.021
2.35PheIle: 2.35 ± 0.041
2.742PheLys: 2.742 ± 0.157
7.834PheLeu: 7.834 ± 0.89
0.783PheMet: 0.783 ± 0.836
1.567PheAsn: 1.567 ± 1.055
3.134PhePro: 3.134 ± 0.356
1.567PheGln: 1.567 ± 0.178
2.742PheArg: 2.742 ± 0.157
4.7PheSer: 4.7 ± 0.083
3.917PheThr: 3.917 ± 1.096
1.175PheVal: 1.175 ± 0.596
2.742PheTrp: 2.742 ± 0.459
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.917GlyAla: 3.917 ± 2.329
0.0GlyCys: 0.0 ± 0.0
4.309GlyAsp: 4.309 ± 0.335
4.309GlyGlu: 4.309 ± 0.897
3.917GlyPhe: 3.917 ± 0.753
3.917GlyGly: 3.917 ± 1.096
1.567GlyHis: 1.567 ± 0.794
6.267GlyIle: 6.267 ± 1.328
4.309GlyLys: 4.309 ± 0.281
4.309GlyLeu: 4.309 ± 0.281
1.567GlyMet: 1.567 ± 0.438
2.742GlyAsn: 2.742 ± 0.157
2.35GlyPro: 2.35 ± 0.658
1.958GlyGln: 1.958 ± 0.376
3.134GlyArg: 3.134 ± 0.972
5.484GlySer: 5.484 ± 1.534
5.092GlyThr: 5.092 ± 0.5
3.525GlyVal: 3.525 ± 1.911
0.783GlyTrp: 0.783 ± 0.219
1.958GlyTyr: 1.958 ± 0.24
0.0GlyXaa: 0.0 ± 0.0
His
1.567HisAla: 1.567 ± 0.438
0.0HisCys: 0.0 ± 0.0
0.392HisAsp: 0.392 ± 0.199
1.175HisGlu: 1.175 ± 0.596
0.783HisPhe: 0.783 ± 0.397
1.175HisGly: 1.175 ± 0.021
0.0HisHis: 0.0 ± 0.0
1.175HisIle: 1.175 ± 0.596
1.958HisLys: 1.958 ± 0.993
2.35HisLeu: 2.35 ± 0.575
0.783HisMet: 0.783 ± 0.219
0.783HisAsn: 0.783 ± 0.219
2.742HisPro: 2.742 ± 0.157
0.783HisGln: 0.783 ± 0.397
0.392HisArg: 0.392 ± 0.418
0.392HisSer: 0.392 ± 0.199
1.567HisThr: 1.567 ± 0.178
3.917HisVal: 3.917 ± 0.753
0.783HisTrp: 0.783 ± 0.219
0.392HisTyr: 0.392 ± 0.418
0.0HisXaa: 0.0 ± 0.0
Ile
5.092IleAla: 5.092 ± 0.5
1.567IleCys: 1.567 ± 0.178
5.092IleAsp: 5.092 ± 0.5
4.7IleGlu: 4.7 ± 0.534
3.134IlePhe: 3.134 ± 0.972
4.7IleGly: 4.7 ± 1.315
1.175IleHis: 1.175 ± 0.637
1.958IleIle: 1.958 ± 0.376
2.35IleLys: 2.35 ± 0.575
3.134IleLeu: 3.134 ± 0.972
1.567IleMet: 1.567 ± 0.438
1.567IleAsn: 1.567 ± 0.438
3.525IlePro: 3.525 ± 0.062
1.958IleGln: 1.958 ± 0.993
1.958IleArg: 1.958 ± 0.376
1.958IleSer: 1.958 ± 0.993
2.742IleThr: 2.742 ± 0.459
5.092IleVal: 5.092 ± 0.732
1.958IleTrp: 1.958 ± 0.376
0.783IleTyr: 0.783 ± 0.397
0.0IleXaa: 0.0 ± 0.0
Lys
1.567LysAla: 1.567 ± 0.794
0.392LysCys: 0.392 ± 0.199
2.742LysAsp: 2.742 ± 1.39
1.175LysGlu: 1.175 ± 0.596
3.134LysPhe: 3.134 ± 0.26
1.958LysGly: 1.958 ± 0.376
1.958LysHis: 1.958 ± 0.993
3.525LysIle: 3.525 ± 0.678
2.35LysLys: 2.35 ± 0.575
4.309LysLeu: 4.309 ± 0.897
1.958LysMet: 1.958 ± 0.376
0.783LysAsn: 0.783 ± 0.397
2.742LysPro: 2.742 ± 0.459
1.567LysGln: 1.567 ± 0.178
3.525LysArg: 3.525 ± 1.171
3.917LysSer: 3.917 ± 0.137
2.742LysThr: 2.742 ± 0.157
3.525LysVal: 3.525 ± 0.062
0.0LysTrp: 0.0 ± 0.0
2.742LysTyr: 2.742 ± 0.774
0.0LysXaa: 0.0 ± 0.0
Leu
9.009LeuAla: 9.009 ± 1.596
0.392LeuCys: 0.392 ± 0.199
6.267LeuAsp: 6.267 ± 1.944
5.875LeuGlu: 5.875 ± 0.513
8.226LeuPhe: 8.226 ± 1.088
4.7LeuGly: 4.7 ± 0.534
0.783LeuHis: 0.783 ± 0.836
3.525LeuIle: 3.525 ± 1.171
3.525LeuLys: 3.525 ± 1.787
6.659LeuLeu: 6.659 ± 2.143
1.175LeuMet: 1.175 ± 0.021
3.134LeuAsn: 3.134 ± 0.26
2.35LeuPro: 2.35 ± 1.274
1.567LeuGln: 1.567 ± 0.794
5.875LeuArg: 5.875 ± 1.129
5.484LeuSer: 5.484 ± 0.918
4.7LeuThr: 4.7 ± 0.699
5.875LeuVal: 5.875 ± 1.129
0.0LeuTrp: 0.0 ± 0.0
2.742LeuTyr: 2.742 ± 0.157
0.0LeuXaa: 0.0 ± 0.0
Met
1.958MetAla: 1.958 ± 0.856
0.392MetCys: 0.392 ± 0.199
1.958MetAsp: 1.958 ± 0.24
1.175MetGlu: 1.175 ± 0.021
0.783MetPhe: 0.783 ± 0.397
2.742MetGly: 2.742 ± 0.774
0.392MetHis: 0.392 ± 0.199
0.783MetIle: 0.783 ± 0.219
2.35MetLys: 2.35 ± 1.274
3.525MetLeu: 3.525 ± 0.554
1.175MetMet: 1.175 ± 0.021
1.567MetAsn: 1.567 ± 0.438
0.392MetPro: 0.392 ± 0.199
0.392MetGln: 0.392 ± 0.199
2.35MetArg: 2.35 ± 0.041
2.35MetSer: 2.35 ± 0.575
0.783MetThr: 0.783 ± 0.219
2.35MetVal: 2.35 ± 0.658
0.783MetTrp: 0.783 ± 0.397
0.783MetTyr: 0.783 ± 0.397
0.0MetXaa: 0.0 ± 0.0
Asn
2.742AsnAla: 2.742 ± 1.075
1.567AsnCys: 1.567 ± 0.178
0.783AsnAsp: 0.783 ± 0.219
2.35AsnGlu: 2.35 ± 0.041
1.958AsnPhe: 1.958 ± 0.24
4.309AsnGly: 4.309 ± 1.514
1.175AsnHis: 1.175 ± 0.021
3.134AsnIle: 3.134 ± 1.493
1.567AsnLys: 1.567 ± 0.178
3.917AsnLeu: 3.917 ± 0.753
2.35AsnMet: 2.35 ± 0.041
2.742AsnAsn: 2.742 ± 1.075
2.742AsnPro: 2.742 ± 0.157
0.783AsnGln: 0.783 ± 0.836
2.742AsnArg: 2.742 ± 0.774
2.742AsnSer: 2.742 ± 0.157
1.958AsnThr: 1.958 ± 0.856
4.309AsnVal: 4.309 ± 0.335
0.392AsnTrp: 0.392 ± 0.199
0.392AsnTyr: 0.392 ± 0.418
0.0AsnXaa: 0.0 ± 0.0
Pro
5.092ProAla: 5.092 ± 0.116
1.175ProCys: 1.175 ± 0.596
3.917ProAsp: 3.917 ± 0.753
1.567ProGlu: 1.567 ± 0.438
3.134ProPhe: 3.134 ± 0.877
2.742ProGly: 2.742 ± 0.157
0.392ProHis: 0.392 ± 0.199
1.958ProIle: 1.958 ± 0.376
1.958ProLys: 1.958 ± 0.376
3.917ProLeu: 3.917 ± 0.753
1.567ProMet: 1.567 ± 1.277
1.567ProAsn: 1.567 ± 0.438
0.783ProPro: 0.783 ± 0.219
1.567ProGln: 1.567 ± 1.055
2.35ProArg: 2.35 ± 0.041
1.567ProSer: 1.567 ± 0.438
3.134ProThr: 3.134 ± 2.726
4.7ProVal: 4.7 ± 1.315
1.175ProTrp: 1.175 ± 0.637
2.35ProTyr: 2.35 ± 0.658
0.0ProXaa: 0.0 ± 0.0
Gln
1.175GlnAla: 1.175 ± 0.596
0.0GlnCys: 0.0 ± 0.0
1.567GlnAsp: 1.567 ± 0.178
1.958GlnGlu: 1.958 ± 0.24
2.35GlnPhe: 2.35 ± 0.041
0.783GlnGly: 0.783 ± 0.836
0.783GlnHis: 0.783 ± 0.219
1.567GlnIle: 1.567 ± 0.794
1.567GlnLys: 1.567 ± 0.438
2.742GlnLeu: 2.742 ± 0.459
0.783GlnMet: 0.783 ± 0.397
1.175GlnAsn: 1.175 ± 0.637
1.175GlnPro: 1.175 ± 1.253
1.175GlnGln: 1.175 ± 0.021
1.175GlnArg: 1.175 ± 0.596
1.175GlnSer: 1.175 ± 0.596
0.783GlnThr: 0.783 ± 0.397
1.958GlnVal: 1.958 ± 0.376
1.567GlnTrp: 1.567 ± 0.178
0.783GlnTyr: 0.783 ± 0.397
0.0GlnXaa: 0.0 ± 0.0
Arg
1.958ArgAla: 1.958 ± 0.24
1.175ArgCys: 1.175 ± 0.596
3.134ArgAsp: 3.134 ± 0.972
4.309ArgGlu: 4.309 ± 1.568
2.35ArgPhe: 2.35 ± 0.658
3.917ArgGly: 3.917 ± 0.137
2.742ArgHis: 2.742 ± 0.774
2.35ArgIle: 2.35 ± 0.575
2.742ArgLys: 2.742 ± 0.157
3.525ArgLeu: 3.525 ± 0.062
0.392ArgMet: 0.392 ± 0.199
3.525ArgAsn: 3.525 ± 0.554
1.958ArgPro: 1.958 ± 0.856
0.392ArgGln: 0.392 ± 0.418
5.092ArgArg: 5.092 ± 2.581
5.092ArgSer: 5.092 ± 1.965
2.35ArgThr: 2.35 ± 0.575
6.267ArgVal: 6.267 ± 0.095
0.0ArgTrp: 0.0 ± 0.0
2.35ArgTyr: 2.35 ± 0.658
0.0ArgXaa: 0.0 ± 0.0
Ser
4.7SerAla: 4.7 ± 0.534
1.567SerCys: 1.567 ± 0.794
6.267SerAsp: 6.267 ± 1.137
3.525SerGlu: 3.525 ± 0.554
3.134SerPhe: 3.134 ± 0.356
3.134SerGly: 3.134 ± 0.356
2.742SerHis: 2.742 ± 0.774
3.525SerIle: 3.525 ± 0.678
2.742SerLys: 2.742 ± 0.157
7.442SerLeu: 7.442 ± 0.542
1.958SerMet: 1.958 ± 0.376
3.525SerAsn: 3.525 ± 2.527
2.742SerPro: 2.742 ± 0.774
2.35SerGln: 2.35 ± 0.658
3.917SerArg: 3.917 ± 0.753
3.525SerSer: 3.525 ± 0.062
4.309SerThr: 4.309 ± 1.514
4.309SerVal: 4.309 ± 0.281
1.958SerTrp: 1.958 ± 0.993
3.525SerTyr: 3.525 ± 0.062
0.0SerXaa: 0.0 ± 0.0
Thr
4.309ThrAla: 4.309 ± 2.13
0.392ThrCys: 0.392 ± 0.199
3.134ThrAsp: 3.134 ± 0.972
1.175ThrGlu: 1.175 ± 0.021
4.7ThrPhe: 4.7 ± 1.932
5.092ThrGly: 5.092 ± 0.5
1.567ThrHis: 1.567 ± 0.178
3.525ThrIle: 3.525 ± 0.678
2.35ThrLys: 2.35 ± 0.041
3.134ThrLeu: 3.134 ± 2.109
1.567ThrMet: 1.567 ± 0.178
3.525ThrAsn: 3.525 ± 1.295
3.525ThrPro: 3.525 ± 1.295
2.35ThrGln: 2.35 ± 1.274
2.35ThrArg: 2.35 ± 0.658
4.309ThrSer: 4.309 ± 0.281
4.309ThrThr: 4.309 ± 1.514
3.134ThrVal: 3.134 ± 1.493
0.392ThrTrp: 0.392 ± 0.199
3.134ThrTyr: 3.134 ± 0.356
0.0ThrXaa: 0.0 ± 0.0
Val
5.875ValAla: 5.875 ± 0.513
0.392ValCys: 0.392 ± 0.199
5.875ValAsp: 5.875 ± 1.129
3.134ValGlu: 3.134 ± 0.26
2.742ValPhe: 2.742 ± 1.075
2.742ValGly: 2.742 ± 0.459
1.958ValHis: 1.958 ± 0.376
3.917ValIle: 3.917 ± 0.48
1.958ValLys: 1.958 ± 0.993
5.092ValLeu: 5.092 ± 0.732
3.134ValMet: 3.134 ± 0.972
4.309ValAsn: 4.309 ± 0.335
3.134ValPro: 3.134 ± 0.877
2.35ValGln: 2.35 ± 0.041
4.7ValArg: 4.7 ± 0.534
6.267ValSer: 6.267 ± 0.095
5.875ValThr: 5.875 ± 3.801
4.309ValVal: 4.309 ± 0.335
1.958ValTrp: 1.958 ± 0.376
3.525ValTyr: 3.525 ± 1.295
0.0ValXaa: 0.0 ± 0.0
Trp
2.35TrpAla: 2.35 ± 1.191
0.392TrpCys: 0.392 ± 0.418
2.742TrpAsp: 2.742 ± 0.774
0.392TrpGlu: 0.392 ± 0.199
1.175TrpPhe: 1.175 ± 0.596
0.392TrpGly: 0.392 ± 0.418
0.783TrpHis: 0.783 ± 0.219
1.175TrpIle: 1.175 ± 0.596
1.175TrpLys: 1.175 ± 0.637
0.0TrpLeu: 0.0 ± 0.0
0.783TrpMet: 0.783 ± 0.397
0.392TrpAsn: 0.392 ± 0.199
0.783TrpPro: 0.783 ± 0.219
0.0TrpGln: 0.0 ± 0.0
0.783TrpArg: 0.783 ± 0.219
0.783TrpSer: 0.783 ± 0.219
1.567TrpThr: 1.567 ± 0.794
1.175TrpVal: 1.175 ± 0.021
0.0TrpTrp: 0.0 ± 0.0
1.175TrpTyr: 1.175 ± 0.021
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.35TyrAla: 2.35 ± 0.575
1.958TyrCys: 1.958 ± 0.376
1.175TyrAsp: 1.175 ± 0.596
2.35TyrGlu: 2.35 ± 0.041
2.35TyrPhe: 2.35 ± 0.658
1.958TyrGly: 1.958 ± 0.376
0.0TyrHis: 0.0 ± 0.0
1.567TyrIle: 1.567 ± 0.178
1.175TyrLys: 1.175 ± 0.021
3.525TyrLeu: 3.525 ± 0.554
1.175TyrMet: 1.175 ± 0.021
1.567TyrAsn: 1.567 ± 0.178
1.175TyrPro: 1.175 ± 0.596
1.175TyrGln: 1.175 ± 0.596
2.35TyrArg: 2.35 ± 1.274
3.525TyrSer: 3.525 ± 2.527
2.35TyrThr: 2.35 ± 0.041
3.134TyrVal: 3.134 ± 0.972
0.392TyrTrp: 0.392 ± 0.199
2.35TyrTyr: 2.35 ± 0.658
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2554 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
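A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the error is taken to be the standard deviation across replicates (the source does not say which spread measure is reported). The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def permille(sequences, dipeptide):
    """Frequency of one dipeptide in per mille, using overlapping windows (assumed)."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(sequences, dipeptide, replicates=100, seed=0):
    """Estimate the error by resampling whole proteins with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = [rng.choice(sequences) for _ in sequences]
        estimates.append(permille(sample, dipeptide))
    # Assumption: the reported error is the standard deviation of the replicates.
    return statistics.pstdev(estimates)


# Hypothetical toy proteome with two proteins.
toy = ["AAAA", "ACAC"]
err = bootstrap_error(toy, "AA")
print(permille(toy, "AA"), err)
```

With only two proteins in the proteome, as here, each replicate can take very few distinct values, so the resulting error estimate is coarse.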

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski