Amino acid dipepetide frequency for Beihai picorna-like virus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.585AlaAla: 5.585 ± 1.119
1.489AlaCys: 1.489 ± 0.763
2.978AlaAsp: 2.978 ± 1.128
6.701AlaGlu: 6.701 ± 1.445
3.351AlaPhe: 3.351 ± 0.39
3.723AlaGly: 3.723 ± 0.581
1.117AlaHis: 1.117 ± 0.573
4.84AlaIle: 4.84 ± 0.49
2.978AlaLys: 2.978 ± 0.464
6.329AlaLeu: 6.329 ± 0.59
1.862AlaMet: 1.862 ± 0.233
4.095AlaAsn: 4.095 ± 0.772
2.234AlaPro: 2.234 ± 0.846
2.606AlaGln: 2.606 ± 1.318
1.862AlaArg: 1.862 ± 0.291
4.84AlaSer: 4.84 ± 1.501
3.351AlaThr: 3.351 ± 0.273
4.095AlaVal: 4.095 ± 0.555
1.117AlaTrp: 1.117 ± 0.755
3.723AlaTyr: 3.723 ± 1.909
0.0AlaXaa: 0.0 ± 0.0
Cys
1.489CysAla: 1.489 ± 0.763
0.0CysCys: 0.0 ± 0.0
0.745CysAsp: 0.745 ± 0.382
0.372CysGlu: 0.372 ± 0.473
1.489CysPhe: 1.489 ± 0.763
0.0CysGly: 0.0 ± 0.0
0.372CysHis: 0.372 ± 0.191
1.862CysIle: 1.862 ± 0.954
1.117CysLys: 1.117 ± 0.573
1.489CysLeu: 1.489 ± 0.763
0.372CysMet: 0.372 ± 0.191
1.862CysAsn: 1.862 ± 0.291
1.117CysPro: 1.117 ± 0.755
0.372CysGln: 0.372 ± 0.473
0.0CysArg: 0.0 ± 0.0
0.745CysSer: 0.745 ± 0.282
0.0CysThr: 0.0 ± 0.0
1.862CysVal: 1.862 ± 0.291
0.372CysTrp: 0.372 ± 0.191
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.117AspAla: 1.117 ± 0.091
0.745AspCys: 0.745 ± 0.382
5.585AspAsp: 5.585 ± 0.872
4.095AspGlu: 4.095 ± 0.772
4.095AspPhe: 4.095 ± 0.109
3.351AspGly: 3.351 ± 0.39
1.117AspHis: 1.117 ± 0.091
4.468AspIle: 4.468 ± 1.028
4.468AspLys: 4.468 ± 1.627
7.818AspLeu: 7.818 ± 0.637
1.117AspMet: 1.117 ± 0.091
4.095AspAsn: 4.095 ± 0.109
2.606AspPro: 2.606 ± 1.318
1.117AspGln: 1.117 ± 0.091
1.117AspArg: 1.117 ± 0.091
2.606AspSer: 2.606 ± 0.672
4.468AspThr: 4.468 ± 0.364
2.606AspVal: 2.606 ± 0.009
1.489AspTrp: 1.489 ± 0.1
1.862AspTyr: 1.862 ± 0.373
0.0AspXaa: 0.0 ± 0.0
Glu
3.723GluAla: 3.723 ± 1.245
0.745GluCys: 0.745 ± 0.382
4.468GluAsp: 4.468 ± 0.299
7.074GluGlu: 7.074 ± 3.626
2.978GluPhe: 2.978 ± 0.2
3.723GluGly: 3.723 ± 0.581
1.117GluHis: 1.117 ± 0.573
5.585GluIle: 5.585 ± 0.455
4.468GluLys: 4.468 ± 1.627
5.212GluLeu: 5.212 ± 0.018
2.234GluMet: 2.234 ± 0.846
2.606GluAsn: 2.606 ± 0.009
2.234GluPro: 2.234 ± 1.145
2.606GluGln: 2.606 ± 0.009
2.606GluArg: 2.606 ± 0.672
3.723GluSer: 3.723 ± 0.746
2.978GluThr: 2.978 ± 0.863
3.351GluVal: 3.351 ± 0.273
1.489GluTrp: 1.489 ± 0.763
4.095GluTyr: 4.095 ± 0.772
0.0GluXaa: 0.0 ± 0.0
Phe
5.585PheAla: 5.585 ± 1.119
0.372PheCys: 0.372 ± 0.191
4.84PheAsp: 4.84 ± 0.837
2.606PheGlu: 2.606 ± 0.672
2.978PhePhe: 2.978 ± 0.863
2.606PheGly: 2.606 ± 1.318
0.745PheHis: 0.745 ± 0.382
2.978PheIle: 2.978 ± 0.2
5.585PheLys: 5.585 ± 0.872
3.351PheLeu: 3.351 ± 1.054
0.372PheMet: 0.372 ± 0.191
2.978PheAsn: 2.978 ± 0.863
1.862PhePro: 1.862 ± 0.954
1.489PheGln: 1.489 ± 0.763
2.234PheArg: 2.234 ± 0.182
4.095PheSer: 4.095 ± 0.772
2.234PheThr: 2.234 ± 0.482
2.978PheVal: 2.978 ± 1.128
0.745PheTrp: 0.745 ± 0.282
1.489PheTyr: 1.489 ± 0.564
0.0PheXaa: 0.0 ± 0.0
Gly
4.095GlyAla: 4.095 ± 0.555
0.372GlyCys: 0.372 ± 0.191
3.351GlyAsp: 3.351 ± 0.39
5.212GlyGlu: 5.212 ± 1.973
1.862GlyPhe: 1.862 ± 0.291
3.723GlyGly: 3.723 ± 2.073
0.745GlyHis: 0.745 ± 0.382
2.606GlyIle: 2.606 ± 1.336
3.723GlyLys: 3.723 ± 1.245
5.957GlyLeu: 5.957 ± 0.928
0.372GlyMet: 0.372 ± 0.473
2.978GlyAsn: 2.978 ± 0.464
1.862GlyPro: 1.862 ± 0.373
0.372GlyGln: 0.372 ± 0.473
3.351GlyArg: 3.351 ± 0.273
2.978GlySer: 2.978 ± 0.2
4.468GlyThr: 4.468 ± 1.691
2.978GlyVal: 2.978 ± 1.527
0.372GlyTrp: 0.372 ± 0.473
2.234GlyTyr: 2.234 ± 1.509
0.0GlyXaa: 0.0 ± 0.0
His
1.489HisAla: 1.489 ± 0.1
0.745HisCys: 0.745 ± 0.282
2.234HisAsp: 2.234 ± 0.846
1.489HisGlu: 1.489 ± 0.763
1.862HisPhe: 1.862 ± 0.373
1.489HisGly: 1.489 ± 0.1
0.0HisHis: 0.0 ± 0.0
0.745HisIle: 0.745 ± 0.282
0.745HisLys: 0.745 ± 0.282
1.862HisLeu: 1.862 ± 0.291
0.745HisMet: 0.745 ± 0.382
0.745HisAsn: 0.745 ± 0.282
1.489HisPro: 1.489 ± 0.763
0.372HisGln: 0.372 ± 0.473
0.745HisArg: 0.745 ± 0.282
0.745HisSer: 0.745 ± 0.382
0.745HisThr: 0.745 ± 0.382
1.489HisVal: 1.489 ± 0.763
0.745HisTrp: 0.745 ± 0.382
1.117HisTyr: 1.117 ± 0.091
0.0HisXaa: 0.0 ± 0.0
Ile
4.095IleAla: 4.095 ± 0.772
1.489IleCys: 1.489 ± 0.1
4.84IleAsp: 4.84 ± 0.837
4.84IleGlu: 4.84 ± 2.481
2.978IlePhe: 2.978 ± 0.863
3.723IleGly: 3.723 ± 0.581
3.351IleHis: 3.351 ± 0.273
2.978IleIle: 2.978 ± 0.464
2.234IleLys: 2.234 ± 0.182
4.84IleLeu: 4.84 ± 1.501
2.978IleMet: 2.978 ± 1.527
2.978IleAsn: 2.978 ± 0.464
4.468IlePro: 4.468 ± 0.299
1.489IleGln: 1.489 ± 1.227
3.351IleArg: 3.351 ± 1.718
5.585IleSer: 5.585 ± 0.872
5.212IleThr: 5.212 ± 1.31
3.723IleVal: 3.723 ± 0.082
1.489IleTrp: 1.489 ± 0.1
1.489IleTyr: 1.489 ± 0.1
0.0IleXaa: 0.0 ± 0.0
Lys
8.191LysAla: 8.191 ± 1.544
0.0LysCys: 0.0 ± 0.0
2.978LysAsp: 2.978 ± 0.863
3.723LysGlu: 3.723 ± 1.909
4.095LysPhe: 4.095 ± 0.555
3.723LysGly: 3.723 ± 0.082
2.606LysHis: 2.606 ± 1.336
4.095LysIle: 4.095 ± 0.109
4.095LysLys: 4.095 ± 1.436
2.978LysLeu: 2.978 ± 0.863
1.489LysMet: 1.489 ± 0.763
4.095LysAsn: 4.095 ± 0.555
4.095LysPro: 4.095 ± 0.555
1.117LysGln: 1.117 ± 0.091
2.978LysArg: 2.978 ± 0.863
4.468LysSer: 4.468 ± 0.963
2.978LysThr: 2.978 ± 0.863
6.329LysVal: 6.329 ± 1.917
1.489LysTrp: 1.489 ± 0.1
1.489LysTyr: 1.489 ± 0.1
0.0LysXaa: 0.0 ± 0.0
Leu
5.585LeuAla: 5.585 ± 0.872
1.489LeuCys: 1.489 ± 0.1
4.095LeuAsp: 4.095 ± 0.555
3.723LeuGlu: 3.723 ± 0.581
2.978LeuPhe: 2.978 ± 0.863
3.723LeuGly: 3.723 ± 1.409
1.862LeuHis: 1.862 ± 0.291
7.074LeuIle: 7.074 ± 1.635
3.723LeuLys: 3.723 ± 0.581
6.329LeuLeu: 6.329 ± 0.074
1.489LeuMet: 1.489 ± 0.763
7.074LeuAsn: 7.074 ± 0.355
4.095LeuPro: 4.095 ± 0.109
1.117LeuGln: 1.117 ± 0.091
4.84LeuArg: 4.84 ± 0.837
5.212LeuSer: 5.212 ± 0.646
5.212LeuThr: 5.212 ± 0.018
4.095LeuVal: 4.095 ± 0.772
0.0LeuTrp: 0.0 ± 0.0
2.606LeuTyr: 2.606 ± 1.336
0.0LeuXaa: 0.0 ± 0.0
Met
1.489MetAla: 1.489 ± 0.763
0.745MetCys: 0.745 ± 0.382
1.117MetAsp: 1.117 ± 0.091
1.117MetGlu: 1.117 ± 0.091
1.489MetPhe: 1.489 ± 0.763
1.117MetGly: 1.117 ± 0.573
0.0MetHis: 0.0 ± 0.0
1.117MetIle: 1.117 ± 0.573
0.745MetLys: 0.745 ± 0.382
2.234MetLeu: 2.234 ± 0.482
0.745MetMet: 0.745 ± 0.382
2.234MetAsn: 2.234 ± 0.482
1.862MetPro: 1.862 ± 0.373
0.372MetGln: 0.372 ± 0.191
3.351MetArg: 3.351 ± 0.39
1.489MetSer: 1.489 ± 0.1
1.862MetThr: 1.862 ± 1.7
2.978MetVal: 2.978 ± 2.455
0.745MetTrp: 0.745 ± 0.382
1.117MetTyr: 1.117 ± 0.573
0.0MetXaa: 0.0 ± 0.0
Asn
3.351AsnAla: 3.351 ± 1.054
1.117AsnCys: 1.117 ± 0.573
1.862AsnAsp: 1.862 ± 0.373
2.606AsnGlu: 2.606 ± 0.655
2.234AsnPhe: 2.234 ± 0.482
4.095AsnGly: 4.095 ± 0.109
0.745AsnHis: 0.745 ± 0.382
2.978AsnIle: 2.978 ± 0.863
5.585AsnLys: 5.585 ± 0.872
4.468AsnLeu: 4.468 ± 0.963
2.606AsnMet: 2.606 ± 0.655
3.723AsnAsn: 3.723 ± 0.581
4.468AsnPro: 4.468 ± 1.028
2.234AsnGln: 2.234 ± 0.482
1.489AsnArg: 1.489 ± 1.227
4.84AsnSer: 4.84 ± 0.173
2.234AsnThr: 2.234 ± 1.509
3.723AsnVal: 3.723 ± 0.581
0.745AsnTrp: 0.745 ± 0.282
1.862AsnTyr: 1.862 ± 1.037
0.0AsnXaa: 0.0 ± 0.0
Pro
1.489ProAla: 1.489 ± 1.227
1.117ProCys: 1.117 ± 0.091
3.723ProAsp: 3.723 ± 0.082
3.351ProGlu: 3.351 ± 1.054
3.723ProPhe: 3.723 ± 0.746
2.234ProGly: 2.234 ± 0.182
1.117ProHis: 1.117 ± 0.755
4.095ProIle: 4.095 ± 0.772
4.095ProLys: 4.095 ± 1.436
1.862ProLeu: 1.862 ± 0.954
2.234ProMet: 2.234 ± 1.509
2.234ProAsn: 2.234 ± 0.482
2.234ProPro: 2.234 ± 1.509
1.862ProGln: 1.862 ± 0.291
2.606ProArg: 2.606 ± 0.655
2.978ProSer: 2.978 ± 2.455
2.978ProThr: 2.978 ± 1.128
1.862ProVal: 1.862 ± 1.037
0.372ProTrp: 0.372 ± 0.191
1.862ProTyr: 1.862 ± 1.037
0.0ProXaa: 0.0 ± 0.0
Gln
1.117GlnAla: 1.117 ± 0.573
0.0GlnCys: 0.0 ± 0.0
1.117GlnAsp: 1.117 ± 0.091
1.862GlnGlu: 1.862 ± 0.291
1.117GlnPhe: 1.117 ± 0.755
1.862GlnGly: 1.862 ± 1.7
0.745GlnHis: 0.745 ± 0.382
1.489GlnIle: 1.489 ± 0.1
2.606GlnLys: 2.606 ± 1.336
2.606GlnLeu: 2.606 ± 0.672
0.372GlnMet: 0.372 ± 0.191
0.745GlnAsn: 0.745 ± 0.945
1.117GlnPro: 1.117 ± 0.755
0.0GlnGln: 0.0 ± 0.0
2.978GlnArg: 2.978 ± 0.2
2.234GlnSer: 2.234 ± 0.846
1.489GlnThr: 1.489 ± 1.227
1.489GlnVal: 1.489 ± 0.564
0.372GlnTrp: 0.372 ± 0.473
1.862GlnTyr: 1.862 ± 1.7
0.0GlnXaa: 0.0 ± 0.0
Arg
3.351ArgAla: 3.351 ± 0.39
0.745ArgCys: 0.745 ± 0.282
2.978ArgAsp: 2.978 ± 1.527
3.723ArgGlu: 3.723 ± 1.245
2.606ArgPhe: 2.606 ± 0.009
2.978ArgGly: 2.978 ± 1.791
1.489ArgHis: 1.489 ± 1.227
3.723ArgIle: 3.723 ± 0.746
3.351ArgLys: 3.351 ± 0.39
2.606ArgLeu: 2.606 ± 0.672
0.745ArgMet: 0.745 ± 0.282
2.234ArgAsn: 2.234 ± 0.482
1.862ArgPro: 1.862 ± 0.373
2.606ArgGln: 2.606 ± 1.318
3.723ArgArg: 3.723 ± 0.746
2.978ArgSer: 2.978 ± 0.863
1.489ArgThr: 1.489 ± 0.564
3.723ArgVal: 3.723 ± 0.581
0.745ArgTrp: 0.745 ± 0.382
1.862ArgTyr: 1.862 ± 0.373
0.0ArgXaa: 0.0 ± 0.0
Ser
4.468SerAla: 4.468 ± 1.028
0.372SerCys: 0.372 ± 0.191
3.723SerAsp: 3.723 ± 0.581
5.957SerGlu: 5.957 ± 0.928
2.606SerPhe: 2.606 ± 1.318
1.117SerGly: 1.117 ± 0.573
1.117SerHis: 1.117 ± 0.091
4.095SerIle: 4.095 ± 0.109
7.074SerLys: 7.074 ± 1.019
2.606SerLeu: 2.606 ± 0.009
2.234SerMet: 2.234 ± 0.846
4.468SerAsn: 4.468 ± 0.364
2.978SerPro: 2.978 ± 0.2
0.372SerGln: 0.372 ± 0.191
3.351SerArg: 3.351 ± 0.937
4.468SerSer: 4.468 ± 0.299
3.723SerThr: 3.723 ± 0.581
7.446SerVal: 7.446 ± 0.499
0.745SerTrp: 0.745 ± 0.282
4.095SerTyr: 4.095 ± 1.882
0.0SerXaa: 0.0 ± 0.0
Thr
2.234ThrAla: 2.234 ± 0.846
0.372ThrCys: 0.372 ± 0.191
2.606ThrAsp: 2.606 ± 0.655
3.351ThrGlu: 3.351 ± 1.6
4.095ThrPhe: 4.095 ± 0.555
2.234ThrGly: 2.234 ± 0.182
0.372ThrHis: 0.372 ± 0.191
5.585ThrIle: 5.585 ± 1.782
4.095ThrLys: 4.095 ± 0.772
5.585ThrLeu: 5.585 ± 0.455
2.606ThrMet: 2.606 ± 0.672
2.978ThrAsn: 2.978 ± 0.2
3.351ThrPro: 3.351 ± 2.928
1.489ThrGln: 1.489 ± 0.1
3.351ThrArg: 3.351 ± 0.937
5.957ThrSer: 5.957 ± 2.255
4.468ThrThr: 4.468 ± 0.364
1.489ThrVal: 1.489 ± 0.1
0.0ThrTrp: 0.0 ± 0.0
1.489ThrTyr: 1.489 ± 0.1
0.0ThrXaa: 0.0 ± 0.0
Val
6.329ValAla: 6.329 ± 0.737
1.117ValCys: 1.117 ± 0.091
2.234ValAsp: 2.234 ± 0.182
3.723ValGlu: 3.723 ± 1.245
3.351ValPhe: 3.351 ± 1.054
4.468ValGly: 4.468 ± 0.364
1.489ValHis: 1.489 ± 0.1
4.468ValIle: 4.468 ± 0.364
2.606ValLys: 2.606 ± 0.009
4.095ValLeu: 4.095 ± 0.772
2.234ValMet: 2.234 ± 0.923
1.862ValAsn: 1.862 ± 0.373
2.606ValPro: 2.606 ± 0.009
1.862ValGln: 1.862 ± 0.373
3.723ValArg: 3.723 ± 1.245
2.606ValSer: 2.606 ± 0.655
3.723ValThr: 3.723 ± 0.746
1.862ValVal: 1.862 ± 0.291
1.862ValTrp: 1.862 ± 0.373
3.351ValTyr: 3.351 ± 0.273
0.0ValXaa: 0.0 ± 0.0
Trp
1.489TrpAla: 1.489 ± 0.564
1.117TrpCys: 1.117 ± 0.091
1.117TrpAsp: 1.117 ± 0.091
0.372TrpGlu: 0.372 ± 0.191
1.862TrpPhe: 1.862 ± 0.291
0.745TrpGly: 0.745 ± 0.382
0.745TrpHis: 0.745 ± 0.282
1.489TrpIle: 1.489 ± 0.763
1.117TrpLys: 1.117 ± 0.755
0.745TrpLeu: 0.745 ± 0.282
0.0TrpMet: 0.0 ± 0.0
0.745TrpAsn: 0.745 ± 0.382
0.0TrpPro: 0.0 ± 0.0
1.489TrpGln: 1.489 ± 1.227
0.372TrpArg: 0.372 ± 0.191
0.745TrpSer: 0.745 ± 0.382
1.117TrpThr: 1.117 ± 0.755
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.745TrpTyr: 0.745 ± 0.382
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.606TyrAla: 2.606 ± 0.655
1.489TyrCys: 1.489 ± 0.763
3.351TyrAsp: 3.351 ± 1.718
1.117TyrGlu: 1.117 ± 0.091
0.372TyrPhe: 0.372 ± 0.191
2.978TyrGly: 2.978 ± 0.464
0.745TyrHis: 0.745 ± 0.945
1.862TyrIle: 1.862 ± 0.291
2.606TyrLys: 2.606 ± 1.336
3.351TyrLeu: 3.351 ± 0.273
0.745TyrMet: 0.745 ± 0.382
2.234TyrAsn: 2.234 ± 0.846
1.489TyrPro: 1.489 ± 0.1
2.234TyrGln: 2.234 ± 0.182
1.489TyrArg: 1.489 ± 0.564
3.723TyrSer: 3.723 ± 1.409
2.978TyrThr: 2.978 ± 1.791
1.862TyrVal: 1.862 ± 0.373
1.117TyrTrp: 1.117 ± 0.755
1.489TyrTyr: 1.489 ± 0.564
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2687 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski