Amino acid dipeptide frequency for Beihai picorna-like virus 82

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random with equal probability, each of the 400 would therefore be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
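As a minimal sketch of how such frequencies could be computed (this is not the Proteome-pI implementation), the snippet below counts overlapping dipeptides within each protein and reports them in per mille. The function name `dipeptide_per_mille` and the toy sequences are hypothetical; one-letter codes are used for brevity, and mapping every non-standard letter to X (Xaa) is an assumption.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins.

    Dipeptides are counted as overlapping pairs within each protein,
    never across protein boundaries. Non-standard letters become 'X'.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy sequences, not the viral proteome
toy = ["MSSAVLSS", "AASSVL"]
freqs = dipeptide_per_mille(toy)
```

In the toy input there are 12 dipeptide positions in total (7 in the first sequence, 5 in the second), and SS occurs three times, so `freqs["SS"]` is 250‰.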

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
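The unit conversion and the comparison against the uniform expectation can be sketched as follows. The helper names are hypothetical, and using 1/400 (rather than 1/441) as the baseline for "over" and "under" is an assumption based on the 0.25% figure quoted above; the page's actual colouring rule is not specified here.

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # uniform expectation over 400 dipeptides = 2.5 per mille

def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0

def representation(value, expected=EXPECTED_PER_MILLE):
    """Classify a per mille value relative to the assumed uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"

ser_ser = 11.871  # SerSer value from the table, in per mille
cys_cys = 0.36    # CysCys value from the table, in per mille
```

For example, the SerSer value of 11.871‰ corresponds to a fraction of about 0.011871 (about 1.19%), well above the 2.5‰ baseline, while CysCys at 0.36‰ falls below it.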

For more information, see the sequence space article on Wikipedia.

Amino acids (first residue = row, second residue = column): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.475 ± 0.898
AlaCys: 0.36 ± 0.314
AlaAsp: 2.518 ± 1.19
AlaGlu: 3.597 ± 0.891
AlaPhe: 2.878 ± 0.007
AlaGly: 3.957 ± 0.577
AlaHis: 1.799 ± 0.058
AlaIle: 6.835 ± 0.584
AlaLys: 2.878 ± 0.511
AlaLeu: 5.396 ± 0.328
AlaMet: 0.36 ± 0.19
AlaAsn: 2.158 ± 0.131
AlaPro: 3.957 ± 0.073
AlaGln: 5.396 ± 0.175
AlaArg: 4.317 ± 1.248
AlaSer: 7.554 ± 1.555
AlaThr: 2.878 ± 1.0
AlaVal: 6.475 ± 0.613
AlaTrp: 0.719 ± 0.38
AlaTyr: 2.518 ± 0.321
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.799 ± 0.949
CysCys: 0.36 ± 0.19
CysAsp: 0.36 ± 0.19
CysGlu: 1.439 ± 0.255
CysPhe: 1.079 ± 0.569
CysGly: 1.079 ± 0.569
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 1.439 ± 0.759
CysLeu: 1.439 ± 0.255
CysMet: 1.439 ± 0.255
CysAsn: 1.079 ± 0.569
CysPro: 1.439 ± 0.255
CysGln: 1.799 ± 0.445
CysArg: 0.36 ± 0.19
CysSer: 2.158 ± 0.635
CysThr: 1.079 ± 0.066
CysVal: 2.878 ± 0.496
CysTrp: 0.36 ± 0.314
CysTyr: 0.719 ± 0.124
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.597 ± 0.117
AspCys: 1.079 ± 0.569
AspAsp: 5.036 ± 0.869
AspGlu: 4.676 ± 0.956
AspPhe: 3.597 ± 0.117
AspGly: 2.518 ± 0.321
AspHis: 1.439 ± 0.759
AspIle: 2.878 ± 0.007
AspLys: 2.518 ± 0.321
AspLeu: 5.396 ± 0.832
AspMet: 1.439 ± 0.752
AspAsn: 1.799 ± 0.562
AspPro: 2.518 ± 0.825
AspGln: 1.799 ± 0.445
AspArg: 0.719 ± 0.124
AspSer: 2.878 ± 0.496
AspThr: 2.158 ± 1.884
AspVal: 5.036 ± 1.146
AspTrp: 0.36 ± 0.314
AspTyr: 1.439 ± 0.248
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.237 ± 0.307
GluCys: 1.079 ± 0.569
GluAsp: 2.518 ± 0.321
GluGlu: 2.158 ± 0.635
GluPhe: 1.799 ± 1.066
GluGly: 3.237 ± 0.197
GluHis: 1.439 ± 0.752
GluIle: 1.439 ± 0.248
GluLys: 3.237 ± 0.701
GluLeu: 1.799 ± 0.445
GluMet: 2.518 ± 0.321
GluAsn: 2.878 ± 1.015
GluPro: 2.878 ± 1.015
GluGln: 2.518 ± 0.321
GluArg: 3.597 ± 0.891
GluSer: 2.518 ± 0.686
GluThr: 2.518 ± 0.321
GluVal: 5.396 ± 0.679
GluTrp: 2.518 ± 0.825
GluTyr: 1.799 ± 0.058
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.676 ± 0.555
PheCys: 1.079 ± 0.066
PheAsp: 2.158 ± 0.372
PheGlu: 2.158 ± 0.131
PhePhe: 1.439 ± 0.752
PheGly: 2.878 ± 0.007
PheHis: 1.079 ± 0.066
PheIle: 0.719 ± 0.124
PheLys: 0.36 ± 0.314
PheLeu: 3.957 ± 0.431
PheMet: 1.079 ± 0.066
PheAsn: 2.518 ± 0.183
PhePro: 1.799 ± 0.562
PheGln: 1.799 ± 0.949
PheArg: 2.518 ± 0.183
PheSer: 6.115 ± 1.212
PheThr: 2.518 ± 1.19
PheVal: 3.957 ± 0.577
PheTrp: 0.36 ± 0.19
PheTyr: 1.799 ± 0.445
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.597 ± 0.621
GlyCys: 0.719 ± 0.38
GlyAsp: 6.475 ± 0.394
GlyGlu: 4.317 ± 0.263
GlyPhe: 2.158 ± 0.372
GlyGly: 2.878 ± 1.0
GlyHis: 0.36 ± 0.19
GlyIle: 2.518 ± 0.686
GlyLys: 4.317 ± 2.278
GlyLeu: 5.396 ± 0.328
GlyMet: 0.719 ± 0.124
GlyAsn: 2.158 ± 0.131
GlyPro: 3.237 ± 0.307
GlyGln: 2.518 ± 0.686
GlyArg: 2.518 ± 0.183
GlySer: 3.957 ± 0.431
GlyThr: 2.878 ± 0.496
GlyVal: 3.597 ± 0.387
GlyTrp: 0.36 ± 0.314
GlyTyr: 1.799 ± 0.562
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.719 ± 0.628
HisCys: 0.719 ± 0.38
HisAsp: 0.36 ± 0.19
HisGlu: 1.079 ± 0.438
HisPhe: 1.439 ± 0.759
HisGly: 2.518 ± 0.686
HisHis: 0.0 ± 0.0
HisIle: 0.36 ± 0.19
HisLys: 0.719 ± 0.38
HisLeu: 2.878 ± 1.015
HisMet: 0.36 ± 0.19
HisAsn: 1.079 ± 0.438
HisPro: 1.079 ± 0.438
HisGln: 1.439 ± 0.248
HisArg: 0.0 ± 0.0
HisSer: 2.158 ± 0.635
HisThr: 2.518 ± 0.183
HisVal: 0.719 ± 0.38
HisTrp: 0.719 ± 0.124
HisTyr: 0.36 ± 0.314
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.036 ± 1.373
IleCys: 1.079 ± 0.066
IleAsp: 2.878 ± 0.007
IleGlu: 3.597 ± 0.117
IlePhe: 2.158 ± 0.372
IleGly: 2.878 ± 0.007
IleHis: 1.079 ± 0.438
IleIle: 1.079 ± 0.438
IleLys: 1.439 ± 0.752
IleLeu: 5.396 ± 1.336
IleMet: 0.719 ± 0.124
IleAsn: 3.597 ± 0.621
IlePro: 3.237 ± 0.307
IleGln: 4.676 ± 0.453
IleArg: 1.799 ± 0.562
IleSer: 4.317 ± 0.241
IleThr: 1.079 ± 0.438
IleVal: 2.518 ± 0.825
IleTrp: 0.719 ± 0.124
IleTyr: 2.158 ± 0.635
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.878 ± 0.007
LysCys: 1.079 ± 0.569
LysAsp: 1.079 ± 0.569
LysGlu: 1.079 ± 0.569
LysPhe: 1.079 ± 0.066
LysGly: 1.799 ± 0.949
LysHis: 0.36 ± 0.19
LysIle: 3.597 ± 0.117
LysLys: 2.878 ± 1.518
LysLeu: 4.317 ± 0.241
LysMet: 1.079 ± 0.569
LysAsn: 1.799 ± 0.949
LysPro: 1.439 ± 0.248
LysGln: 0.719 ± 0.38
LysArg: 2.518 ± 0.321
LysSer: 2.518 ± 0.686
LysThr: 3.597 ± 0.387
LysVal: 5.755 ± 1.022
LysTrp: 0.719 ± 0.38
LysTyr: 1.439 ± 0.255
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.115 ± 1.715
LeuCys: 2.158 ± 1.139
LeuAsp: 2.878 ± 1.015
LeuGlu: 6.115 ± 0.299
LeuPhe: 2.878 ± 1.0
LeuGly: 6.475 ± 1.621
LeuHis: 2.518 ± 0.321
LeuIle: 4.317 ± 0.241
LeuLys: 3.597 ± 0.387
LeuLeu: 6.115 ± 0.708
LeuMet: 1.799 ± 0.562
LeuAsn: 3.957 ± 0.073
LeuPro: 6.475 ± 0.613
LeuGln: 2.878 ± 1.015
LeuArg: 4.676 ± 1.562
LeuSer: 7.554 ± 0.548
LeuThr: 5.755 ± 1.022
LeuVal: 5.036 ± 1.146
LeuTrp: 1.439 ± 0.248
LeuTyr: 4.317 ± 1.774
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.079 ± 0.438
MetCys: 1.799 ± 0.445
MetAsp: 0.36 ± 0.314
MetGlu: 0.36 ± 0.19
MetPhe: 0.719 ± 0.124
MetGly: 1.799 ± 0.058
MetHis: 1.079 ± 0.066
MetIle: 1.439 ± 0.248
MetLys: 1.079 ± 0.066
MetLeu: 0.719 ± 0.38
MetMet: 0.719 ± 0.38
MetAsn: 1.079 ± 0.066
MetPro: 1.439 ± 0.752
MetGln: 1.439 ± 0.759
MetArg: 2.158 ± 1.139
MetSer: 1.799 ± 1.066
MetThr: 1.439 ± 0.248
MetVal: 1.079 ± 0.438
MetTrp: 0.0 ± 0.0
MetTyr: 1.079 ± 0.569
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.518 ± 1.19
AsnCys: 1.079 ± 0.066
AsnAsp: 1.439 ± 0.255
AsnGlu: 2.518 ± 0.686
AsnPhe: 2.158 ± 0.131
AsnGly: 1.799 ± 1.57
AsnHis: 1.079 ± 0.569
AsnIle: 3.237 ± 0.307
AsnLys: 1.439 ± 0.248
AsnLeu: 3.237 ± 0.307
AsnMet: 2.158 ± 1.139
AsnAsn: 1.439 ± 0.248
AsnPro: 3.237 ± 0.307
AsnGln: 1.799 ± 0.058
AsnArg: 1.799 ± 0.949
AsnSer: 3.237 ± 0.81
AsnThr: 2.878 ± 1.518
AsnVal: 2.518 ± 0.183
AsnTrp: 0.719 ± 0.38
AsnTyr: 1.439 ± 0.752
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.317 ± 1.248
ProCys: 1.079 ± 0.066
ProAsp: 3.957 ± 0.431
ProGlu: 2.518 ± 1.19
ProPhe: 2.878 ± 0.511
ProGly: 2.518 ± 0.686
ProHis: 0.719 ± 0.38
ProIle: 3.597 ± 0.621
ProLys: 1.439 ± 0.255
ProLeu: 5.755 ± 0.489
ProMet: 0.36 ± 0.19
ProAsn: 1.799 ± 0.445
ProPro: 1.079 ± 0.438
ProGln: 1.799 ± 0.058
ProArg: 3.237 ± 0.307
ProSer: 5.396 ± 0.328
ProThr: 2.878 ± 2.008
ProVal: 5.396 ± 1.183
ProTrp: 0.719 ± 0.628
ProTyr: 3.597 ± 1.628
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.957 ± 1.08
GlnCys: 1.439 ± 0.759
GlnAsp: 1.079 ± 0.066
GlnGlu: 3.597 ± 0.891
GlnPhe: 2.158 ± 0.131
GlnGly: 2.158 ± 0.372
GlnHis: 1.799 ± 0.445
GlnIle: 1.079 ± 0.066
GlnLys: 2.158 ± 0.635
GlnLeu: 3.957 ± 0.073
GlnMet: 2.158 ± 0.131
GlnAsn: 1.799 ± 0.058
GlnPro: 2.878 ± 0.007
GlnGln: 2.158 ± 0.131
GlnArg: 3.597 ± 0.891
GlnSer: 3.237 ± 0.197
GlnThr: 2.518 ± 0.321
GlnVal: 1.439 ± 0.752
GlnTrp: 0.719 ± 0.124
GlnTyr: 0.719 ± 0.124
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.237 ± 0.701
ArgCys: 1.799 ± 0.445
ArgAsp: 3.237 ± 0.701
ArgGlu: 1.439 ± 0.255
ArgPhe: 2.518 ± 0.183
ArgGly: 3.237 ± 0.197
ArgHis: 0.719 ± 0.124
ArgIle: 5.036 ± 0.139
ArgLys: 2.158 ± 0.131
ArgLeu: 4.317 ± 0.745
ArgMet: 1.079 ± 0.066
ArgAsn: 0.719 ± 0.124
ArgPro: 3.957 ± 0.935
ArgGln: 1.799 ± 0.058
ArgArg: 2.878 ± 1.518
ArgSer: 4.676 ± 0.956
ArgThr: 1.079 ± 0.438
ArgVal: 6.115 ± 0.204
ArgTrp: 1.079 ± 0.569
ArgTyr: 3.237 ± 0.307
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.835 ± 0.424
SerCys: 1.439 ± 0.248
SerAsp: 3.957 ± 1.08
SerGlu: 3.237 ± 0.197
SerPhe: 4.317 ± 1.774
SerGly: 5.396 ± 0.175
SerHis: 1.439 ± 0.752
SerIle: 3.957 ± 0.431
SerLys: 3.237 ± 0.701
SerLeu: 6.115 ± 0.204
SerMet: 1.079 ± 0.135
SerAsn: 2.878 ± 0.496
SerPro: 5.036 ± 1.876
SerGln: 3.957 ± 0.577
SerArg: 4.317 ± 0.241
SerSer: 11.871 ± 0.723
SerThr: 6.115 ± 2.314
SerVal: 8.993 ± 1.803
SerTrp: 1.079 ± 0.438
SerTyr: 2.158 ± 0.876
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.676 ± 0.051
ThrCys: 1.079 ± 0.438
ThrAsp: 4.317 ± 1.248
ThrGlu: 1.439 ± 0.248
ThrPhe: 2.878 ± 1.504
ThrGly: 2.158 ± 0.131
ThrHis: 0.36 ± 0.19
ThrIle: 2.158 ± 0.131
ThrLys: 1.079 ± 0.569
ThrLeu: 5.036 ± 0.139
ThrMet: 1.079 ± 0.438
ThrAsn: 2.518 ± 1.19
ThrPro: 3.597 ± 1.124
ThrGln: 1.799 ± 0.445
ThrArg: 3.597 ± 0.387
ThrSer: 6.835 ± 1.935
ThrThr: 1.799 ± 0.562
ThrVal: 5.755 ± 0.489
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.518 ± 0.825
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.396 ± 0.175
ValCys: 2.158 ± 0.131
ValAsp: 3.597 ± 0.117
ValGlu: 3.597 ± 0.891
ValPhe: 3.957 ± 0.577
ValGly: 3.597 ± 0.387
ValHis: 2.518 ± 0.183
ValIle: 3.237 ± 0.197
ValLys: 3.237 ± 0.81
ValLeu: 10.072 ± 0.781
ValMet: 1.079 ± 0.609
ValAsn: 3.237 ± 1.314
ValPro: 4.676 ± 2.57
ValGln: 2.518 ± 0.183
ValArg: 6.475 ± 0.898
ValSer: 6.835 ± 0.424
ValThr: 5.755 ± 0.014
ValVal: 6.475 ± 1.621
ValTrp: 1.079 ± 0.569
ValTyr: 3.597 ± 0.387
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.439 ± 0.248
TrpCys: 0.36 ± 0.19
TrpAsp: 1.079 ± 0.066
TrpGlu: 0.719 ± 0.38
TrpPhe: 0.0 ± 0.0
TrpGly: 0.36 ± 0.19
TrpHis: 0.0 ± 0.0
TrpIle: 1.799 ± 0.058
TrpLys: 0.36 ± 0.19
TrpLeu: 2.158 ± 0.372
TrpMet: 0.36 ± 0.314
TrpAsn: 0.719 ± 0.38
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 1.439 ± 0.248
TrpSer: 0.36 ± 0.314
TrpThr: 1.079 ± 0.569
TrpVal: 1.079 ± 0.438
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.079 ± 0.569
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.439 ± 0.759
TyrCys: 0.36 ± 0.19
TyrAsp: 3.597 ± 0.387
TyrGlu: 1.439 ± 0.255
TyrPhe: 2.878 ± 0.007
TyrGly: 3.237 ± 1.205
TyrHis: 1.439 ± 0.248
TyrIle: 2.158 ± 0.372
TyrLys: 2.158 ± 0.131
TyrLeu: 3.597 ± 0.621
TyrMet: 0.719 ± 0.124
TyrAsn: 2.518 ± 0.183
TyrPro: 1.079 ± 0.438
TyrGln: 1.799 ± 0.058
TyrArg: 1.799 ± 0.445
TyrSer: 1.439 ± 0.248
TyrThr: 2.158 ± 0.131
TyrVal: 3.237 ± 0.307
TyrTrp: 0.719 ± 0.124
TyrTyr: 2.158 ± 0.635
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2781 amino acids)
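
A quick consistency check is possible under one assumption: that dipeptides are counted as overlapping pairs within each protein, so a protein of length L contributes L - 1 pairs. Two proteins with 2781 residues in total would then give 2781 - 2 = 2779 dipeptide positions. The sketch below (with hypothetical names) converts a few table values back to implied counts.

```python
N_RESIDUES = 2781   # total amino acids, as stated on the page
N_PROTEINS = 2      # number of proteins, as stated on the page

# Assumption: each protein of length L contributes L - 1 overlapping pairs
N_DIPEPTIDES = N_RESIDUES - N_PROTEINS

def implied_count(per_mille):
    """Number of occurrences implied by a per mille frequency."""
    return per_mille * 1e-3 * N_DIPEPTIDES

# A few values copied from the table (per mille)
examples = {"AlaCys": 0.36, "AlaAla": 6.475, "ValLeu": 10.072, "SerSer": 11.871}
implied = {name: implied_count(value) for name, value in examples.items()}
```

Under this assumption the implied counts come out close to whole numbers (about 1, 18, 28 and 33 respectively), which suggests that the smallest non-zero value in the table, 0.36‰, corresponds to a single occurrence.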

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
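
A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions, not the Proteome-pI code: the function names are hypothetical, whole proteins are resampled with replacement, and the error is taken as the standard deviation of the per mille frequency across replicates (the page does not say which spread measure is used).

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the per mille frequency of `pair`
    over `n_rounds` resamplings of whole proteins with replacement."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)

# Hypothetical toy proteome in one-letter codes
err = bootstrap_error(["SSSS", "AAAA"], "SS")
```

With only two proteins, resampling with replacement can produce just three distinct compositions (the first protein twice, the second twice, or one of each), so error estimates for such a small proteome are necessarily coarse.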

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski