Amino acid dipeptide frequency for Beihai picorna-like virus 101

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
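As an illustration of what such a calculation can look like, the following minimal sketch counts overlapping residue pairs in a set of protein sequences and reports them in per mille. It assumes one-letter amino acid codes and pools all proteins before normalising; the function name `dipeptide_permille` and the toy sequences are hypothetical, and the exact procedure used to produce the table on this page may differ.

```python
from collections import Counter


def dipeptide_permille(sequences):
    """Return dipeptide frequencies in per mille, pooled over all sequences.

    Assumption: dipeptides are counted as overlapping windows, so a protein
    of length n contributes n - 1 dipeptides.
    """
    counts = Counter()
    for seq in sequences:
        # Residues i and i + 1 form one ordered dipeptide.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from this proteome).
freqs = dipeptide_permille(["MKVLA", "AAVL"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

Because the pairs are ordered, `VL` and `LV` are counted separately, which matches the table layout where, for example, AlaCys and CysAla are distinct cells.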

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 should be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
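The unit conversion and the comparison with the random expectation can be sketched as follows. The uniform expectation of 0.25% over 400 dipeptides corresponds to 2.5‰. The function name `classify` and the simple greater-than/less-than rule are assumptions made for illustration; the page does not state whether any tolerance is applied when colouring cells.

```python
# Uniform expectation over 20 x 20 standard dipeptides, in per mille.
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def classify(permille, expected=EXPECTED_PERMILLE):
    """Convert a per mille value to a fraction and compare it to the expectation.

    Assumption: a plain comparison with the uniform expectation, no tolerance.
    """
    fraction = permille * 1e-3
    if permille > expected:
        label = "over"
    elif permille < expected:
        label = "under"
    else:
        label = "expected"
    return fraction, label


# Two values taken from the table: ValLeu (9.099) and CysHis (0.433).
for name, value in [("ValLeu", 9.099), ("CysHis", 0.433)]:
    fraction, label = classify(value)
    print(f"{name}: {value} per mille = {fraction:.6f} ({label}-represented)")
```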

For more information, see the sequence space article on Wikipedia.

Columns (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.3 ± 0.695
AlaCys: 1.733 ± 0.927
AlaAsp: 2.6 ± 0.069
AlaGlu: 2.6 ± 0.73
AlaPhe: 3.899 ± 1.218
AlaGly: 5.633 ± 0.291
AlaHis: 0.867 ± 0.463
AlaIle: 5.199 ± 1.183
AlaLys: 5.633 ± 1.691
AlaLeu: 6.932 ± 0.404
AlaMet: 0.433 ± 0.232
AlaAsn: 3.899 ± 1.218
AlaPro: 3.033 ± 0.301
AlaGln: 2.6 ± 0.069
AlaArg: 5.199 ± 1.459
AlaSer: 3.466 ± 0.789
AlaThr: 3.466 ± 0.128
AlaVal: 3.899 ± 1.878
AlaTrp: 1.3 ± 0.626
AlaTyr: 3.033 ± 0.961
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.733 ± 0.927
CysCys: 0.0 ± 0.0
CysAsp: 2.166 ± 0.163
CysGlu: 1.3 ± 0.695
CysPhe: 0.867 ± 0.463
CysGly: 2.6 ± 1.39
CysHis: 0.433 ± 0.232
CysIle: 0.867 ± 0.463
CysLys: 1.3 ± 0.695
CysLeu: 0.867 ± 0.197
CysMet: 0.867 ± 0.463
CysAsn: 0.867 ± 0.463
CysPro: 0.433 ± 0.232
CysGln: 0.433 ± 0.232
CysArg: 0.867 ± 0.197
CysSer: 0.433 ± 0.232
CysThr: 0.867 ± 0.197
CysVal: 2.166 ± 0.163
CysTrp: 0.0 ± 0.0
CysTyr: 0.433 ± 0.232
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.6 ± 0.069
AspCys: 0.433 ± 0.232
AspAsp: 5.633 ± 0.291
AspGlu: 5.199 ± 0.799
AspPhe: 4.766 ± 2.075
AspGly: 2.6 ± 1.39
AspHis: 1.3 ± 0.626
AspIle: 2.166 ± 1.158
AspLys: 2.6 ± 0.069
AspLeu: 4.766 ± 0.567
AspMet: 1.733 ± 0.266
AspAsn: 3.466 ± 0.789
AspPro: 2.166 ± 0.498
AspGln: 2.6 ± 0.069
AspArg: 1.3 ± 0.034
AspSer: 3.466 ± 0.532
AspThr: 4.333 ± 0.335
AspVal: 5.199 ± 0.138
AspTrp: 0.0 ± 0.0
AspTyr: 2.166 ± 0.163
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.466 ± 0.532
GluCys: 0.867 ± 0.463
GluAsp: 3.033 ± 0.301
GluGlu: 2.6 ± 0.069
GluPhe: 3.033 ± 0.961
GluGly: 2.166 ± 0.498
GluHis: 0.867 ± 0.463
GluIle: 3.466 ± 1.193
GluLys: 4.333 ± 0.325
GluLeu: 4.766 ± 0.094
GluMet: 1.3 ± 0.034
GluAsn: 5.633 ± 1.03
GluPro: 1.3 ± 0.626
GluGln: 1.733 ± 0.266
GluArg: 4.333 ± 1.656
GluSer: 3.033 ± 0.36
GluThr: 3.899 ± 0.764
GluVal: 3.466 ± 0.532
GluTrp: 1.733 ± 0.927
GluTyr: 0.867 ± 0.463
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.6 ± 0.592
PheCys: 1.3 ± 0.034
PheAsp: 3.033 ± 1.622
PheGlu: 4.333 ± 0.986
PhePhe: 2.6 ± 0.73
PheGly: 3.466 ± 0.789
PheHis: 1.733 ± 0.394
PheIle: 4.333 ± 0.335
PheLys: 2.6 ± 0.73
PheLeu: 1.733 ± 1.716
PheMet: 1.3 ± 0.034
PheAsn: 4.766 ± 1.888
PhePro: 2.166 ± 1.484
PheGln: 3.899 ± 1.218
PheArg: 2.166 ± 0.498
PheSer: 5.633 ± 0.951
PheThr: 5.633 ± 0.291
PheVal: 5.199 ± 1.844
PheTrp: 0.867 ± 0.197
PheTyr: 0.867 ± 0.463
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.899 ± 1.218
GlyCys: 1.3 ± 0.626
GlyAsp: 6.066 ± 0.059
GlyGlu: 2.6 ± 0.069
GlyPhe: 3.033 ± 0.36
GlyGly: 2.166 ± 0.823
GlyHis: 1.3 ± 0.695
GlyIle: 4.766 ± 1.888
GlyLys: 3.466 ± 0.532
GlyLeu: 3.033 ± 0.36
GlyMet: 1.733 ± 0.266
GlyAsn: 1.733 ± 0.394
GlyPro: 3.466 ± 1.449
GlyGln: 0.433 ± 0.232
GlyArg: 3.899 ± 0.764
GlySer: 3.899 ± 0.764
GlyThr: 3.899 ± 0.764
GlyVal: 5.199 ± 1.844
GlyTrp: 0.433 ± 0.232
GlyTyr: 4.333 ± 0.996
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.3 ± 0.626
HisCys: 1.3 ± 0.695
HisAsp: 0.867 ± 0.197
HisGlu: 0.867 ± 0.197
HisPhe: 0.433 ± 0.232
HisGly: 3.033 ± 0.36
HisHis: 0.433 ± 0.232
HisIle: 2.6 ± 1.39
HisLys: 0.867 ± 0.197
HisLeu: 1.733 ± 0.266
HisMet: 0.433 ± 0.232
HisAsn: 1.3 ± 0.034
HisPro: 0.0 ± 0.0
HisGln: 0.0 ± 0.0
HisArg: 2.6 ± 0.73
HisSer: 1.3 ± 0.034
HisThr: 3.033 ± 0.301
HisVal: 1.3 ± 0.695
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.932 ± 2.386
IleCys: 0.433 ± 0.232
IleAsp: 3.466 ± 0.532
IleGlu: 2.166 ± 0.498
IlePhe: 4.333 ± 0.325
IleGly: 3.466 ± 0.128
IleHis: 2.166 ± 0.498
IleIle: 5.199 ± 0.138
IleLys: 1.3 ± 0.695
IleLeu: 3.466 ± 1.193
IleMet: 0.433 ± 0.232
IleAsn: 4.333 ± 1.647
IlePro: 3.033 ± 0.301
IleGln: 1.3 ± 0.695
IleArg: 3.033 ± 0.36
IleSer: 4.333 ± 0.325
IleThr: 4.766 ± 0.754
IleVal: 6.932 ± 0.404
IleTrp: 0.0 ± 0.0
IleTyr: 2.166 ± 0.498
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.6 ± 0.069
LysCys: 1.733 ± 0.927
LysAsp: 3.466 ± 0.532
LysGlu: 1.3 ± 0.695
LysPhe: 1.733 ± 0.266
LysGly: 2.166 ± 1.158
LysHis: 1.3 ± 0.695
LysIle: 4.766 ± 1.227
LysLys: 2.166 ± 0.498
LysLeu: 3.466 ± 0.128
LysMet: 0.867 ± 0.463
LysAsn: 3.033 ± 1.02
LysPro: 2.166 ± 0.498
LysGln: 1.733 ± 1.055
LysArg: 3.899 ± 0.764
LysSer: 4.766 ± 1.227
LysThr: 3.033 ± 0.301
LysVal: 3.899 ± 1.425
LysTrp: 1.3 ± 0.695
LysTyr: 2.166 ± 1.158
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.033 ± 1.02
LeuCys: 2.166 ± 0.498
LeuAsp: 6.932 ± 1.578
LeuGlu: 3.033 ± 0.961
LeuPhe: 4.766 ± 0.094
LeuGly: 3.899 ± 0.557
LeuHis: 0.433 ± 0.232
LeuIle: 2.6 ± 0.069
LeuLys: 7.799 ± 2.189
LeuLeu: 4.333 ± 1.647
LeuMet: 2.166 ± 0.631
LeuAsn: 5.199 ± 0.138
LeuPro: 4.766 ± 0.754
LeuGln: 3.033 ± 0.36
LeuArg: 3.033 ± 0.301
LeuSer: 5.633 ± 1.612
LeuThr: 3.466 ± 1.449
LeuVal: 4.333 ± 2.968
LeuTrp: 0.433 ± 0.232
LeuTyr: 1.733 ± 0.266
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.033 ± 0.301
MetCys: 0.867 ± 0.463
MetAsp: 1.3 ± 0.695
MetGlu: 3.033 ± 1.622
MetPhe: 1.3 ± 0.034
MetGly: 0.867 ± 0.197
MetHis: 1.733 ± 0.266
MetIle: 2.166 ± 0.498
MetLys: 0.0 ± 0.0
MetLeu: 3.033 ± 1.622
MetMet: 0.433 ± 0.232
MetAsn: 0.867 ± 0.858
MetPro: 1.733 ± 0.266
MetGln: 0.867 ± 0.858
MetArg: 0.867 ± 0.463
MetSer: 3.033 ± 1.02
MetThr: 0.867 ± 0.197
MetVal: 2.166 ± 0.498
MetTrp: 0.433 ± 0.232
MetTyr: 0.433 ± 0.429
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.766 ± 0.754
AsnCys: 0.433 ± 0.232
AsnAsp: 2.166 ± 0.823
AsnGlu: 2.166 ± 0.498
AsnPhe: 3.899 ± 0.557
AsnGly: 3.899 ± 0.764
AsnHis: 0.433 ± 0.232
AsnIle: 3.899 ± 1.878
AsnLys: 1.3 ± 0.034
AsnLeu: 2.6 ± 0.592
AsnMet: 2.166 ± 0.627
AsnAsn: 5.199 ± 1.844
AsnPro: 2.6 ± 1.252
AsnGln: 1.733 ± 0.394
AsnArg: 1.3 ± 0.034
AsnSer: 3.899 ± 0.557
AsnThr: 3.899 ± 0.557
AsnVal: 4.333 ± 0.325
AsnTrp: 1.3 ± 1.287
AsnTyr: 3.466 ± 0.128
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.6 ± 0.592
ProCys: 0.433 ± 0.232
ProAsp: 1.733 ± 0.394
ProGlu: 1.733 ± 0.266
ProPhe: 2.6 ± 0.069
ProGly: 0.867 ± 0.463
ProHis: 0.867 ± 0.463
ProIle: 3.033 ± 0.301
ProLys: 1.3 ± 0.626
ProLeu: 4.766 ± 2.075
ProMet: 2.166 ± 0.163
ProAsn: 3.899 ± 2.539
ProPro: 1.733 ± 1.716
ProGln: 0.867 ± 0.197
ProArg: 1.733 ± 0.927
ProSer: 3.033 ± 0.36
ProThr: 3.033 ± 1.681
ProVal: 2.6 ± 0.592
ProTrp: 0.867 ± 0.197
ProTyr: 3.466 ± 2.11
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.6 ± 0.069
GlnCys: 0.0 ± 0.0
GlnAsp: 1.733 ± 0.266
GlnGlu: 1.733 ± 0.266
GlnPhe: 2.166 ± 0.823
GlnGly: 2.166 ± 0.163
GlnHis: 1.3 ± 0.695
GlnIle: 1.733 ± 0.394
GlnLys: 0.867 ± 0.463
GlnLeu: 3.466 ± 0.789
GlnMet: 1.3 ± 0.695
GlnAsn: 0.433 ± 0.232
GlnPro: 1.3 ± 1.287
GlnGln: 1.733 ± 0.266
GlnArg: 0.867 ± 0.463
GlnSer: 3.033 ± 0.301
GlnThr: 1.733 ± 0.266
GlnVal: 2.166 ± 0.823
GlnTrp: 0.433 ± 0.232
GlnTyr: 1.3 ± 0.626
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.166 ± 0.498
ArgCys: 0.433 ± 0.232
ArgAsp: 1.733 ± 0.927
ArgGlu: 1.3 ± 0.034
ArgPhe: 5.199 ± 0.138
ArgGly: 2.6 ± 0.069
ArgHis: 2.166 ± 0.498
ArgIle: 3.899 ± 0.103
ArgLys: 3.033 ± 1.622
ArgLeu: 5.199 ± 0.138
ArgMet: 3.033 ± 0.36
ArgAsn: 2.6 ± 0.069
ArgPro: 0.867 ± 0.197
ArgGln: 0.433 ± 0.232
ArgArg: 3.466 ± 1.853
ArgSer: 2.166 ± 0.498
ArgThr: 3.466 ± 0.789
ArgVal: 5.199 ± 2.12
ArgTrp: 1.3 ± 0.626
ArgTyr: 1.733 ± 0.394
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 8.232 ± 0.222
SerCys: 0.433 ± 0.232
SerAsp: 3.033 ± 0.36
SerGlu: 3.466 ± 0.789
SerPhe: 6.499 ± 1.149
SerGly: 3.899 ± 0.557
SerHis: 1.3 ± 0.034
SerIle: 5.199 ± 1.459
SerLys: 4.766 ± 0.094
SerLeu: 2.166 ± 0.163
SerMet: 3.033 ± 1.622
SerAsn: 2.166 ± 2.144
SerPro: 2.166 ± 0.823
SerGln: 1.3 ± 0.034
SerArg: 3.033 ± 1.02
SerSer: 1.733 ± 0.266
SerThr: 5.199 ± 1.183
SerVal: 4.333 ± 0.335
SerTrp: 1.3 ± 0.695
SerTyr: 1.3 ± 0.626
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.932 ± 0.404
ThrCys: 1.3 ± 0.034
ThrAsp: 2.6 ± 0.73
ThrGlu: 4.766 ± 0.567
ThrPhe: 3.033 ± 0.301
ThrGly: 4.766 ± 2.736
ThrHis: 1.733 ± 1.055
ThrIle: 2.6 ± 1.252
ThrLys: 1.733 ± 0.266
ThrLeu: 3.899 ± 0.103
ThrMet: 1.733 ± 0.927
ThrAsn: 1.733 ± 0.394
ThrPro: 3.899 ± 1.218
ThrGln: 5.199 ± 0.799
ThrArg: 2.6 ± 1.252
ThrSer: 4.766 ± 0.754
ThrThr: 4.766 ± 1.415
ThrVal: 4.766 ± 0.094
ThrTrp: 0.433 ± 0.429
ThrTyr: 2.6 ± 0.73
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.033 ± 1.02
ValCys: 2.166 ± 0.498
ValAsp: 3.466 ± 0.128
ValGlu: 8.666 ± 2.652
ValPhe: 3.033 ± 0.36
ValGly: 7.366 ± 0.025
ValHis: 2.6 ± 0.069
ValIle: 2.6 ± 0.592
ValLys: 3.033 ± 0.961
ValLeu: 9.099 ± 1.74
ValMet: 3.033 ± 1.681
ValAsn: 2.166 ± 1.484
ValPro: 3.899 ± 1.878
ValGln: 1.733 ± 0.266
ValArg: 3.466 ± 0.532
ValSer: 4.766 ± 1.415
ValThr: 4.766 ± 0.567
ValVal: 5.633 ± 0.291
ValTrp: 0.867 ± 0.197
ValTyr: 0.867 ± 0.463
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.867 ± 0.858
TrpCys: 0.867 ± 0.463
TrpAsp: 0.867 ± 0.197
TrpGlu: 1.3 ± 0.695
TrpPhe: 2.166 ± 0.823
TrpGly: 0.433 ± 0.232
TrpHis: 0.0 ± 0.0
TrpIle: 0.867 ± 0.197
TrpLys: 0.867 ± 0.463
TrpLeu: 1.3 ± 0.034
TrpMet: 0.0 ± 0.0
TrpAsn: 0.433 ± 0.232
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 1.733 ± 0.394
TrpSer: 0.867 ± 0.197
TrpThr: 0.0 ± 0.0
TrpVal: 1.3 ± 0.695
TrpTrp: 0.433 ± 0.429
TrpTyr: 0.433 ± 0.429
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.033 ± 1.622
TyrCys: 1.3 ± 0.034
TyrAsp: 2.6 ± 0.73
TyrGlu: 1.733 ± 0.394
TyrPhe: 0.867 ± 0.463
TyrGly: 2.6 ± 0.73
TyrHis: 0.433 ± 0.429
TyrIle: 1.3 ± 0.034
TyrLys: 2.6 ± 0.73
TyrLeu: 2.6 ± 1.252
TyrMet: 0.433 ± 0.232
TyrAsn: 1.733 ± 0.266
TyrPro: 2.6 ± 0.069
TyrGln: 0.433 ± 0.232
TyrArg: 2.6 ± 0.592
TyrSer: 1.3 ± 0.034
TyrThr: 1.733 ± 0.394
TyrVal: 2.166 ± 0.823
TyrTrp: 1.3 ± 0.034
TyrTyr: 0.867 ± 0.197
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2309 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
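To make the idea of a protein-level bootstrap concrete, here is a minimal sketch: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values is taken as the error. The function names, the use of the population standard deviation as the error measure, the fixed seed, and the toy sequences are all assumptions for illustration; the page does not specify these details.

```python
import random
import statistics
from collections import Counter


def pair_permille(sequences, pair):
    """Frequency of one dipeptide in per mille, pooled over the sequences."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: the error is the standard deviation across replicates.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)


# Toy input (hypothetical sequences, not taken from this proteome).
proteins = ["MKVLA", "AAAA"]
print(bootstrap_error(proteins, "AA"))
```

With only two proteins, as in this proteome, each resample is one of just a few possible combinations, so error estimates obtained this way are necessarily coarse.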

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski