Amino acid dipepetide frequency for Beihai picorna-like virus 43

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.093AlaAla: 4.093 ± 0.395
1.637AlaCys: 1.637 ± 0.809
2.865AlaAsp: 2.865 ± 0.811
3.275AlaGlu: 3.275 ± 0.409
2.047AlaPhe: 2.047 ± 0.198
3.684AlaGly: 3.684 ± 0.612
2.047AlaHis: 2.047 ± 0.407
2.865AlaIle: 2.865 ± 0.207
2.047AlaLys: 2.047 ± 1.011
3.275AlaLeu: 3.275 ± 0.195
0.409AlaMet: 0.409 ± 0.402
4.912AlaAsn: 4.912 ± 2.409
4.503AlaPro: 4.503 ± 0.193
2.865AlaGln: 2.865 ± 1.606
2.047AlaArg: 2.047 ± 0.198
6.549AlaSer: 6.549 ± 2.808
3.684AlaThr: 3.684 ± 1.202
2.865AlaVal: 2.865 ± 0.398
1.637AlaTrp: 1.637 ± 0.4
2.865AlaTyr: 2.865 ± 0.207
0.0AlaXaa: 0.0 ± 0.0
Cys
1.637CysAla: 1.637 ± 1.004
0.409CysCys: 0.409 ± 0.202
2.047CysAsp: 2.047 ± 0.407
1.637CysGlu: 1.637 ± 0.809
0.0CysPhe: 0.0 ± 0.0
1.637CysGly: 1.637 ± 0.205
0.0CysHis: 0.0 ± 0.0
1.637CysIle: 1.637 ± 0.809
2.456CysLys: 2.456 ± 0.609
2.047CysLeu: 2.047 ± 0.407
0.819CysMet: 0.819 ± 0.2
0.409CysAsn: 0.409 ± 0.202
0.0CysPro: 0.0 ± 0.0
0.819CysGln: 0.819 ± 0.2
0.819CysArg: 0.819 ± 0.405
1.228CysSer: 1.228 ± 0.002
1.228CysThr: 1.228 ± 0.002
0.409CysVal: 0.409 ± 0.402
0.409CysTrp: 0.409 ± 0.402
1.637CysTyr: 1.637 ± 0.809
0.0CysXaa: 0.0 ± 0.0
Asp
2.047AspAla: 2.047 ± 0.802
0.819AspCys: 0.819 ± 0.405
1.637AspAsp: 1.637 ± 0.809
3.275AspGlu: 3.275 ± 0.195
4.093AspPhe: 4.093 ± 0.814
2.865AspGly: 2.865 ± 0.398
1.228AspHis: 1.228 ± 0.602
3.275AspIle: 3.275 ± 0.8
4.093AspLys: 4.093 ± 1.418
6.959AspLeu: 6.959 ± 1.021
1.637AspMet: 1.637 ± 0.809
2.865AspAsn: 2.865 ± 1.002
2.865AspPro: 2.865 ± 1.606
0.409AspGln: 0.409 ± 0.202
0.819AspArg: 0.819 ± 0.405
3.275AspSer: 3.275 ± 0.195
2.456AspThr: 2.456 ± 0.005
4.503AspVal: 4.503 ± 0.797
0.409AspTrp: 0.409 ± 0.402
2.456AspTyr: 2.456 ± 0.6
0.0AspXaa: 0.0 ± 0.0
Glu
4.093GluAla: 4.093 ± 0.814
0.819GluCys: 0.819 ± 0.405
3.275GluAsp: 3.275 ± 1.404
2.865GluGlu: 2.865 ± 0.207
2.456GluPhe: 2.456 ± 0.609
1.637GluGly: 1.637 ± 0.205
1.228GluHis: 1.228 ± 0.607
4.912GluIle: 4.912 ± 1.823
3.275GluLys: 3.275 ± 1.014
5.321GluLeu: 5.321 ± 0.212
0.819GluMet: 0.819 ± 0.405
2.456GluAsn: 2.456 ± 0.6
2.456GluPro: 2.456 ± 1.214
2.865GluGln: 2.865 ± 0.811
2.047GluArg: 2.047 ± 0.407
4.093GluSer: 4.093 ± 0.814
2.456GluThr: 2.456 ± 0.609
4.503GluVal: 4.503 ± 0.412
0.819GluTrp: 0.819 ± 0.2
2.865GluTyr: 2.865 ± 0.207
0.0GluXaa: 0.0 ± 0.0
Phe
4.093PheAla: 4.093 ± 1.604
1.228PheCys: 1.228 ± 0.607
0.409PheAsp: 0.409 ± 0.202
2.865PheGlu: 2.865 ± 1.416
2.047PhePhe: 2.047 ± 0.407
2.456PheGly: 2.456 ± 0.609
1.228PheHis: 1.228 ± 0.602
4.503PheIle: 4.503 ± 1.621
5.321PheLys: 5.321 ± 0.212
4.912PheLeu: 4.912 ± 0.614
1.228PheMet: 1.228 ± 0.002
4.503PheAsn: 4.503 ± 0.412
2.047PhePro: 2.047 ± 0.407
1.228PheGln: 1.228 ± 0.602
2.456PheArg: 2.456 ± 1.214
1.637PheSer: 1.637 ± 0.205
4.912PheThr: 4.912 ± 0.614
2.865PheVal: 2.865 ± 0.398
0.819PheTrp: 0.819 ± 0.405
0.819PheTyr: 0.819 ± 0.2
0.0PheXaa: 0.0 ± 0.0
Gly
5.321GlyAla: 5.321 ± 0.393
0.819GlyCys: 0.819 ± 0.2
3.684GlyAsp: 3.684 ± 1.216
2.865GlyGlu: 2.865 ± 0.207
2.047GlyPhe: 2.047 ± 0.407
2.865GlyGly: 2.865 ± 2.211
0.819GlyHis: 0.819 ± 0.405
4.912GlyIle: 4.912 ± 1.218
2.047GlyLys: 2.047 ± 0.407
2.865GlyLeu: 2.865 ± 0.398
1.228GlyMet: 1.228 ± 0.602
2.456GlyAsn: 2.456 ± 0.609
1.228GlyPro: 1.228 ± 0.602
0.819GlyGln: 0.819 ± 0.2
2.047GlyArg: 2.047 ± 0.407
6.549GlySer: 6.549 ± 0.995
2.047GlyThr: 2.047 ± 1.407
3.684GlyVal: 3.684 ± 1.202
0.409GlyTrp: 0.409 ± 0.202
2.456GlyTyr: 2.456 ± 0.005
0.0GlyXaa: 0.0 ± 0.0
His
1.228HisAla: 1.228 ± 0.607
1.228HisCys: 1.228 ± 0.607
0.819HisAsp: 0.819 ± 0.2
1.637HisGlu: 1.637 ± 0.809
1.637HisPhe: 1.637 ± 0.205
0.409HisGly: 0.409 ± 0.402
0.0HisHis: 0.0 ± 0.0
3.684HisIle: 3.684 ± 0.612
0.409HisLys: 0.409 ± 0.202
1.637HisLeu: 1.637 ± 0.4
1.228HisMet: 1.228 ± 0.002
0.819HisAsn: 0.819 ± 0.2
1.228HisPro: 1.228 ± 0.607
0.819HisGln: 0.819 ± 0.405
1.637HisArg: 1.637 ± 0.4
2.865HisSer: 2.865 ± 0.207
1.228HisThr: 1.228 ± 0.002
0.819HisVal: 0.819 ± 0.405
0.0HisTrp: 0.0 ± 0.0
0.409HisTyr: 0.409 ± 0.402
0.0HisXaa: 0.0 ± 0.0
Ile
5.321IleAla: 5.321 ± 0.212
2.047IleCys: 2.047 ± 0.198
4.093IleAsp: 4.093 ± 0.395
5.321IleGlu: 5.321 ± 0.816
3.275IlePhe: 3.275 ± 1.014
2.047IleGly: 2.047 ± 0.198
4.503IleHis: 4.503 ± 1.016
6.14IleIle: 6.14 ± 0.616
3.684IleLys: 3.684 ± 0.612
8.187IleLeu: 8.187 ± 2.232
1.637IleMet: 1.637 ± 0.205
4.912IleAsn: 4.912 ± 0.009
2.865IlePro: 2.865 ± 0.398
3.275IleGln: 3.275 ± 0.409
3.275IleArg: 3.275 ± 0.195
5.731IleSer: 5.731 ± 0.795
6.959IleThr: 6.959 ± 0.793
2.456IleVal: 2.456 ± 0.005
0.819IleTrp: 0.819 ± 0.804
5.321IleTyr: 5.321 ± 0.816
0.0IleXaa: 0.0 ± 0.0
Lys
2.047LysAla: 2.047 ± 1.011
0.819LysCys: 0.819 ± 0.405
3.275LysAsp: 3.275 ± 1.618
4.093LysGlu: 4.093 ± 1.418
4.503LysPhe: 4.503 ± 1.016
4.093LysGly: 4.093 ± 2.023
1.637LysHis: 1.637 ± 0.4
4.093LysIle: 4.093 ± 0.814
5.731LysLys: 5.731 ± 2.832
8.187LysLeu: 8.187 ± 1.023
2.047LysMet: 2.047 ± 0.344
4.503LysAsn: 4.503 ± 0.193
1.637LysPro: 1.637 ± 0.4
2.865LysGln: 2.865 ± 0.811
2.865LysArg: 2.865 ± 0.811
4.503LysSer: 4.503 ± 1.016
1.228LysThr: 1.228 ± 0.607
4.503LysVal: 4.503 ± 1.016
0.409LysTrp: 0.409 ± 0.202
2.865LysTyr: 2.865 ± 1.416
0.0LysXaa: 0.0 ± 0.0
Leu
3.684LeuAla: 3.684 ± 1.202
4.093LeuCys: 4.093 ± 0.814
6.14LeuAsp: 6.14 ± 0.616
2.865LeuGlu: 2.865 ± 0.398
3.275LeuPhe: 3.275 ± 0.195
5.731LeuGly: 5.731 ± 0.414
2.047LeuHis: 2.047 ± 1.011
6.14LeuIle: 6.14 ± 0.012
6.549LeuLys: 6.549 ± 2.028
4.912LeuLeu: 4.912 ± 0.614
1.637LeuMet: 1.637 ± 0.809
3.684LeuAsn: 3.684 ± 0.612
4.093LeuPro: 4.093 ± 0.209
2.456LeuGln: 2.456 ± 0.609
2.865LeuArg: 2.865 ± 0.207
7.368LeuSer: 7.368 ± 3.613
6.549LeuThr: 6.549 ± 0.214
3.684LeuVal: 3.684 ± 0.007
0.409LeuTrp: 0.409 ± 0.202
3.684LeuTyr: 3.684 ± 0.007
0.0LeuXaa: 0.0 ± 0.0
Met
1.228MetAla: 1.228 ± 0.002
0.409MetCys: 0.409 ± 0.202
1.637MetAsp: 1.637 ± 0.205
2.456MetGlu: 2.456 ± 1.214
1.228MetPhe: 1.228 ± 0.607
1.228MetGly: 1.228 ± 0.002
0.819MetHis: 0.819 ± 0.405
2.865MetIle: 2.865 ± 0.398
1.228MetLys: 1.228 ± 0.607
2.456MetLeu: 2.456 ± 0.005
0.409MetMet: 0.409 ± 0.202
2.047MetAsn: 2.047 ± 1.407
1.637MetPro: 1.637 ± 0.4
0.819MetGln: 0.819 ± 0.2
1.637MetArg: 1.637 ± 0.809
0.819MetSer: 0.819 ± 0.2
0.819MetThr: 0.819 ± 0.405
0.409MetVal: 0.409 ± 0.202
0.0MetTrp: 0.0 ± 0.0
0.819MetTyr: 0.819 ± 0.2
0.0MetXaa: 0.0 ± 0.0
Asn
2.456AsnAla: 2.456 ± 1.204
1.228AsnCys: 1.228 ± 0.002
1.637AsnAsp: 1.637 ± 1.004
3.275AsnGlu: 3.275 ± 0.195
3.275AsnPhe: 3.275 ± 0.195
2.865AsnGly: 2.865 ± 0.398
1.228AsnHis: 1.228 ± 0.602
2.865AsnIle: 2.865 ± 0.207
4.093AsnLys: 4.093 ± 1.418
3.275AsnLeu: 3.275 ± 0.8
1.637AsnMet: 1.637 ± 0.205
2.456AsnAsn: 2.456 ± 0.005
3.684AsnPro: 3.684 ± 0.612
2.865AsnGln: 2.865 ± 0.398
3.275AsnArg: 3.275 ± 0.409
4.093AsnSer: 4.093 ± 0.814
2.865AsnThr: 2.865 ± 1.606
5.321AsnVal: 5.321 ± 1.602
1.228AsnTrp: 1.228 ± 1.207
2.865AsnTyr: 2.865 ± 1.002
0.0AsnXaa: 0.0 ± 0.0
Pro
2.047ProAla: 2.047 ± 0.802
0.0ProCys: 0.0 ± 0.0
2.865ProAsp: 2.865 ± 1.416
1.637ProGlu: 1.637 ± 0.809
4.093ProPhe: 4.093 ± 0.395
2.047ProGly: 2.047 ± 0.802
0.409ProHis: 0.409 ± 0.402
3.684ProIle: 3.684 ± 1.202
2.865ProLys: 2.865 ± 0.811
3.275ProLeu: 3.275 ± 0.195
1.637ProMet: 1.637 ± 0.205
2.047ProAsn: 2.047 ± 0.198
1.228ProPro: 1.228 ± 0.607
0.819ProGln: 0.819 ± 0.2
1.228ProArg: 1.228 ± 0.002
2.865ProSer: 2.865 ± 1.606
3.684ProThr: 3.684 ± 1.202
3.684ProVal: 3.684 ± 0.597
1.228ProTrp: 1.228 ± 0.002
2.047ProTyr: 2.047 ± 0.198
0.0ProXaa: 0.0 ± 0.0
Gln
0.819GlnAla: 0.819 ± 0.2
1.228GlnCys: 1.228 ± 0.002
0.819GlnAsp: 0.819 ± 0.405
1.228GlnGlu: 1.228 ± 0.002
0.819GlnPhe: 0.819 ± 0.2
1.637GlnGly: 1.637 ± 0.205
0.819GlnHis: 0.819 ± 0.405
1.637GlnIle: 1.637 ± 0.809
1.637GlnLys: 1.637 ± 0.809
3.684GlnLeu: 3.684 ± 0.612
0.409GlnMet: 0.409 ± 0.202
1.637GlnAsn: 1.637 ± 0.4
1.228GlnPro: 1.228 ± 0.002
0.409GlnGln: 0.409 ± 0.202
3.275GlnArg: 3.275 ± 0.195
3.684GlnSer: 3.684 ± 0.597
2.047GlnThr: 2.047 ± 0.802
2.865GlnVal: 2.865 ± 0.398
0.819GlnTrp: 0.819 ± 0.2
1.637GlnTyr: 1.637 ± 0.4
0.0GlnXaa: 0.0 ± 0.0
Arg
2.047ArgAla: 2.047 ± 0.407
0.819ArgCys: 0.819 ± 0.804
0.409ArgAsp: 0.409 ± 0.202
2.456ArgGlu: 2.456 ± 0.609
3.275ArgPhe: 3.275 ± 0.409
3.684ArgGly: 3.684 ± 1.202
0.819ArgHis: 0.819 ± 0.405
4.912ArgIle: 4.912 ± 0.614
3.684ArgLys: 3.684 ± 0.007
2.047ArgLeu: 2.047 ± 0.407
2.047ArgMet: 2.047 ± 0.198
2.047ArgAsn: 2.047 ± 1.011
1.228ArgPro: 1.228 ± 0.602
1.228ArgGln: 1.228 ± 0.607
2.047ArgArg: 2.047 ± 0.198
2.456ArgSer: 2.456 ± 0.609
3.275ArgThr: 3.275 ± 0.8
4.093ArgVal: 4.093 ± 1.0
0.409ArgTrp: 0.409 ± 0.202
0.819ArgTyr: 0.819 ± 0.405
0.0ArgXaa: 0.0 ± 0.0
Ser
6.14SerAla: 6.14 ± 1.197
1.228SerCys: 1.228 ± 0.602
2.047SerAsp: 2.047 ± 0.802
4.912SerGlu: 4.912 ± 0.595
3.275SerPhe: 3.275 ± 0.409
3.684SerGly: 3.684 ± 0.597
1.637SerHis: 1.637 ± 0.809
9.005SerIle: 9.005 ± 1.595
6.14SerLys: 6.14 ± 2.43
4.503SerLeu: 4.503 ± 0.797
0.819SerMet: 0.819 ± 1.018
4.503SerAsn: 4.503 ± 0.193
2.865SerPro: 2.865 ± 1.606
1.637SerGln: 1.637 ± 1.004
2.865SerArg: 2.865 ± 2.211
6.14SerSer: 6.14 ± 0.012
5.731SerThr: 5.731 ± 0.795
5.731SerVal: 5.731 ± 1.4
1.228SerTrp: 1.228 ± 0.002
1.637SerTyr: 1.637 ± 0.205
0.0SerXaa: 0.0 ± 0.0
Thr
4.503ThrAla: 4.503 ± 1.016
0.409ThrCys: 0.409 ± 0.202
4.503ThrAsp: 4.503 ± 1.402
2.865ThrGlu: 2.865 ± 0.207
5.731ThrPhe: 5.731 ± 1.018
4.093ThrGly: 4.093 ± 1.604
1.228ThrHis: 1.228 ± 0.607
5.731ThrIle: 5.731 ± 0.795
2.456ThrLys: 2.456 ± 0.609
4.093ThrLeu: 4.093 ± 1.604
1.637ThrMet: 1.637 ± 0.205
2.456ThrAsn: 2.456 ± 1.809
3.275ThrPro: 3.275 ± 0.8
2.047ThrGln: 2.047 ± 0.802
2.456ThrArg: 2.456 ± 0.609
6.14ThrSer: 6.14 ± 1.802
4.503ThrThr: 4.503 ± 2.611
2.865ThrVal: 2.865 ± 0.207
0.409ThrTrp: 0.409 ± 0.202
1.637ThrTyr: 1.637 ± 0.205
0.0ThrXaa: 0.0 ± 0.0
Val
4.093ValAla: 4.093 ± 1.0
1.228ValCys: 1.228 ± 0.002
4.093ValAsp: 4.093 ± 2.209
3.275ValGlu: 3.275 ± 0.409
2.047ValPhe: 2.047 ± 0.198
1.637ValGly: 1.637 ± 0.205
0.0ValHis: 0.0 ± 0.0
3.275ValIle: 3.275 ± 0.195
5.321ValLys: 5.321 ± 0.393
7.368ValLeu: 7.368 ± 0.59
1.637ValMet: 1.637 ± 0.205
3.684ValAsn: 3.684 ± 0.612
3.684ValPro: 3.684 ± 0.007
2.456ValGln: 2.456 ± 1.214
3.275ValArg: 3.275 ± 0.8
3.684ValSer: 3.684 ± 1.202
2.865ValThr: 2.865 ± 0.207
2.047ValVal: 2.047 ± 0.407
1.228ValTrp: 1.228 ± 0.607
2.456ValTyr: 2.456 ± 2.413
0.0ValXaa: 0.0 ± 0.0
Trp
0.409TrpAla: 0.409 ± 0.402
0.0TrpCys: 0.0 ± 0.0
1.637TrpAsp: 1.637 ± 0.4
0.409TrpGlu: 0.409 ± 0.202
0.409TrpPhe: 0.409 ± 0.202
0.409TrpGly: 0.409 ± 0.202
0.409TrpHis: 0.409 ± 0.402
1.637TrpIle: 1.637 ± 1.004
0.0TrpLys: 0.0 ± 0.0
0.819TrpLeu: 0.819 ± 0.2
0.409TrpMet: 0.409 ± 0.202
0.819TrpAsn: 0.819 ± 0.405
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.228TrpArg: 1.228 ± 0.002
1.228TrpSer: 1.228 ± 0.602
1.637TrpThr: 1.637 ± 0.4
0.409TrpVal: 0.409 ± 0.202
0.0TrpTrp: 0.0 ± 0.0
1.228TrpTyr: 1.228 ± 0.002
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.865TyrAla: 2.865 ± 0.207
0.819TyrCys: 0.819 ± 0.2
4.093TyrAsp: 4.093 ± 1.604
2.047TyrGlu: 2.047 ± 0.407
2.047TyrPhe: 2.047 ± 0.198
2.456TyrGly: 2.456 ± 0.005
1.637TyrHis: 1.637 ± 0.4
4.093TyrIle: 4.093 ± 0.814
2.865TyrLys: 2.865 ± 0.811
2.047TyrLeu: 2.047 ± 1.011
1.228TyrMet: 1.228 ± 0.607
3.275TyrAsn: 3.275 ± 2.009
1.637TyrPro: 1.637 ± 1.004
1.637TyrGln: 1.637 ± 0.205
1.637TyrArg: 1.637 ± 0.4
1.228TyrSer: 1.228 ± 0.002
2.865TyrThr: 2.865 ± 0.811
1.637TyrVal: 1.637 ± 0.205
0.409TyrTrp: 0.409 ± 0.402
2.456TyrTyr: 2.456 ± 0.609
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2444 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski