Amino acid dipepetide frequency for Beihai picorna-like virus 47

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.927AlaAla: 5.927 ± 0.786
0.79AlaCys: 0.79 ± 0.802
4.741AlaAsp: 4.741 ± 0.134
5.136AlaGlu: 5.136 ± 0.97
2.766AlaPhe: 2.766 ± 0.334
4.346AlaGly: 4.346 ± 1.938
0.79AlaHis: 0.79 ± 0.184
3.161AlaIle: 3.161 ± 0.502
3.556AlaLys: 3.556 ± 0.101
5.136AlaLeu: 5.136 ± 0.352
1.58AlaMet: 1.58 ± 0.869
3.556AlaAsn: 3.556 ± 1.755
4.741AlaPro: 4.741 ± 1.721
1.976AlaGln: 1.976 ± 0.468
2.371AlaArg: 2.371 ± 0.551
5.531AlaSer: 5.531 ± 1.286
7.112AlaThr: 7.112 ± 1.654
3.161AlaVal: 3.161 ± 0.117
1.185AlaTrp: 1.185 ± 0.034
2.371AlaTyr: 2.371 ± 0.067
0.0AlaXaa: 0.0 ± 0.0
Cys
0.79CysAla: 0.79 ± 0.435
0.0CysCys: 0.0 ± 0.0
1.185CysAsp: 1.185 ± 0.652
0.79CysGlu: 0.79 ± 0.435
1.185CysPhe: 1.185 ± 0.034
1.58CysGly: 1.58 ± 0.251
0.0CysHis: 0.0 ± 0.0
1.185CysIle: 1.185 ± 0.652
0.0CysLys: 0.0 ± 0.0
0.395CysLeu: 0.395 ± 0.217
0.395CysMet: 0.395 ± 0.175
1.185CysAsn: 1.185 ± 0.652
0.395CysPro: 0.395 ± 0.217
0.395CysGln: 0.395 ± 0.401
0.395CysArg: 0.395 ± 0.401
1.58CysSer: 1.58 ± 0.367
1.185CysThr: 1.185 ± 0.034
1.185CysVal: 1.185 ± 0.652
0.0CysTrp: 0.0 ± 0.0
0.79CysTyr: 0.79 ± 0.184
0.0CysXaa: 0.0 ± 0.0
Asp
1.976AspAla: 1.976 ± 0.468
0.79AspCys: 0.79 ± 0.435
5.136AspAsp: 5.136 ± 1.589
6.322AspGlu: 6.322 ± 0.233
3.161AspPhe: 3.161 ± 0.117
1.976AspGly: 1.976 ± 1.087
1.185AspHis: 1.185 ± 0.034
3.951AspIle: 3.951 ± 0.919
3.556AspLys: 3.556 ± 0.719
4.346AspLeu: 4.346 ± 0.535
0.79AspMet: 0.79 ± 0.184
2.766AspAsn: 2.766 ± 0.334
3.951AspPro: 3.951 ± 1.537
2.371AspGln: 2.371 ± 0.551
2.371AspArg: 2.371 ± 0.686
2.766AspSer: 2.766 ± 0.952
3.951AspThr: 3.951 ± 0.318
4.741AspVal: 4.741 ± 1.721
0.79AspTrp: 0.79 ± 0.802
3.161AspTyr: 3.161 ± 1.12
0.0AspXaa: 0.0 ± 0.0
Glu
3.556GluAla: 3.556 ± 0.719
0.79GluCys: 0.79 ± 0.435
2.766GluAsp: 2.766 ± 2.189
5.136GluGlu: 5.136 ± 0.267
3.556GluPhe: 3.556 ± 1.338
1.185GluGly: 1.185 ± 0.652
0.395GluHis: 0.395 ± 0.217
2.371GluIle: 2.371 ± 0.067
1.185GluLys: 1.185 ± 0.652
1.976GluLeu: 1.976 ± 0.468
1.58GluMet: 1.58 ± 0.869
2.371GluAsn: 2.371 ± 0.067
2.766GluPro: 2.766 ± 0.903
2.371GluGln: 2.371 ± 1.304
2.766GluArg: 2.766 ± 1.521
3.161GluSer: 3.161 ± 0.502
4.346GluThr: 4.346 ± 0.083
4.741GluVal: 4.741 ± 0.134
0.79GluTrp: 0.79 ± 0.435
2.766GluTyr: 2.766 ± 0.903
0.0GluXaa: 0.0 ± 0.0
Phe
1.976PheAla: 1.976 ± 1.387
0.0PheCys: 0.0 ± 0.0
2.766PheAsp: 2.766 ± 0.903
1.58PheGlu: 1.58 ± 0.869
1.976PhePhe: 1.976 ± 0.468
3.556PheGly: 3.556 ± 0.518
1.185PheHis: 1.185 ± 0.034
2.766PheIle: 2.766 ± 0.903
1.58PheLys: 1.58 ± 0.367
3.556PheLeu: 3.556 ± 1.136
0.395PheMet: 0.395 ± 0.401
1.58PheAsn: 1.58 ± 0.367
2.766PhePro: 2.766 ± 0.903
1.185PheGln: 1.185 ± 0.034
4.741PheArg: 4.741 ± 0.753
3.951PheSer: 3.951 ± 1.537
5.531PheThr: 5.531 ± 0.569
2.766PheVal: 2.766 ± 0.285
1.185PheTrp: 1.185 ± 0.652
2.766PheTyr: 2.766 ± 0.334
0.0PheXaa: 0.0 ± 0.0
Gly
2.766GlyAla: 2.766 ± 0.334
0.0GlyCys: 0.0 ± 0.0
4.741GlyAsp: 4.741 ± 1.102
3.161GlyGlu: 3.161 ± 0.502
3.161GlyPhe: 3.161 ± 1.12
3.161GlyGly: 3.161 ± 0.117
1.58GlyHis: 1.58 ± 0.869
5.136GlyIle: 5.136 ± 0.97
2.766GlyLys: 2.766 ± 0.285
1.976GlyLeu: 1.976 ± 0.15
3.161GlyMet: 3.161 ± 1.12
2.766GlyAsn: 2.766 ± 0.903
1.976GlyPro: 1.976 ± 0.15
1.58GlyGln: 1.58 ± 0.986
3.951GlyArg: 3.951 ± 0.318
4.741GlySer: 4.741 ± 0.484
3.951GlyThr: 3.951 ± 0.318
5.531GlyVal: 5.531 ± 2.523
1.185GlyTrp: 1.185 ± 0.652
4.741GlyTyr: 4.741 ± 2.958
0.0GlyXaa: 0.0 ± 0.0
His
1.976HisAla: 1.976 ± 0.769
0.395HisCys: 0.395 ± 0.217
1.58HisAsp: 1.58 ± 0.869
1.58HisGlu: 1.58 ± 0.367
1.185HisPhe: 1.185 ± 0.034
0.79HisGly: 0.79 ± 0.435
1.185HisHis: 1.185 ± 0.652
0.0HisIle: 0.0 ± 0.0
1.185HisLys: 1.185 ± 0.034
1.976HisLeu: 1.976 ± 0.15
0.0HisMet: 0.0 ± 0.0
1.185HisAsn: 1.185 ± 0.034
1.976HisPro: 1.976 ± 0.15
1.185HisGln: 1.185 ± 0.652
1.58HisArg: 1.58 ± 0.251
1.976HisSer: 1.976 ± 0.15
1.58HisThr: 1.58 ± 0.367
1.58HisVal: 1.58 ± 0.367
0.0HisTrp: 0.0 ± 0.0
1.58HisTyr: 1.58 ± 0.869
0.0HisXaa: 0.0 ± 0.0
Ile
4.346IleAla: 4.346 ± 1.154
0.395IleCys: 0.395 ± 0.401
3.161IleAsp: 3.161 ± 1.353
2.371IleGlu: 2.371 ± 1.304
2.371IlePhe: 2.371 ± 0.067
3.161IleGly: 3.161 ± 0.735
1.58IleHis: 1.58 ± 0.367
3.556IleIle: 3.556 ± 0.719
3.556IleLys: 3.556 ± 0.719
6.322IleLeu: 6.322 ± 2.859
1.58IleMet: 1.58 ± 0.251
2.766IleAsn: 2.766 ± 0.285
1.58IlePro: 1.58 ± 0.367
2.371IleGln: 2.371 ± 0.067
1.58IleArg: 1.58 ± 0.367
5.531IleSer: 5.531 ± 2.523
5.531IleThr: 5.531 ± 1.905
3.951IleVal: 3.951 ± 0.3
0.395IleTrp: 0.395 ± 0.217
1.58IleTyr: 1.58 ± 0.251
0.0IleXaa: 0.0 ± 0.0
Lys
4.346LysAla: 4.346 ± 0.535
0.395LysCys: 0.395 ± 0.217
4.346LysAsp: 4.346 ± 1.154
1.976LysGlu: 1.976 ± 1.087
2.371LysPhe: 2.371 ± 0.067
1.58LysGly: 1.58 ± 0.251
0.79LysHis: 0.79 ± 0.435
1.58LysIle: 1.58 ± 0.251
3.556LysLys: 3.556 ± 0.101
4.346LysLeu: 4.346 ± 0.083
1.185LysMet: 1.185 ± 0.652
1.58LysAsn: 1.58 ± 1.604
1.58LysPro: 1.58 ± 0.367
0.79LysGln: 0.79 ± 0.435
2.371LysArg: 2.371 ± 0.686
3.951LysSer: 3.951 ± 0.937
3.556LysThr: 3.556 ± 0.101
2.766LysVal: 2.766 ± 1.521
0.79LysTrp: 0.79 ± 0.435
1.976LysTyr: 1.976 ± 0.468
0.0LysXaa: 0.0 ± 0.0
Leu
5.531LeuAla: 5.531 ± 0.569
1.58LeuCys: 1.58 ± 0.251
3.556LeuAsp: 3.556 ± 0.719
3.161LeuGlu: 3.161 ± 0.502
1.58LeuPhe: 1.58 ± 0.367
6.717LeuGly: 6.717 ± 0.634
2.371LeuHis: 2.371 ± 0.067
3.556LeuIle: 3.556 ± 0.719
3.556LeuLys: 3.556 ± 1.338
5.531LeuLeu: 5.531 ± 0.049
1.185LeuMet: 1.185 ± 0.652
3.161LeuAsn: 3.161 ± 0.117
3.556LeuPro: 3.556 ± 0.101
5.136LeuGln: 5.136 ± 0.352
4.741LeuArg: 4.741 ± 0.484
5.136LeuSer: 5.136 ± 0.352
4.741LeuThr: 4.741 ± 0.753
5.927LeuVal: 5.927 ± 0.786
1.185LeuTrp: 1.185 ± 1.203
3.161LeuTyr: 3.161 ± 0.502
0.0LeuXaa: 0.0 ± 0.0
Met
1.58MetAla: 1.58 ± 0.367
0.79MetCys: 0.79 ± 0.435
1.58MetAsp: 1.58 ± 0.251
1.976MetGlu: 1.976 ± 1.087
1.185MetPhe: 1.185 ± 0.034
1.976MetGly: 1.976 ± 0.15
0.79MetHis: 0.79 ± 0.435
1.185MetIle: 1.185 ± 0.034
0.79MetLys: 0.79 ± 0.435
0.395MetLeu: 0.395 ± 0.217
0.79MetMet: 0.79 ± 0.435
1.185MetAsn: 1.185 ± 0.585
1.58MetPro: 1.58 ± 0.251
1.976MetGln: 1.976 ± 1.087
2.766MetArg: 2.766 ± 0.285
2.766MetSer: 2.766 ± 0.334
0.79MetThr: 0.79 ± 0.435
1.185MetVal: 1.185 ± 0.034
0.395MetTrp: 0.395 ± 0.217
0.395MetTyr: 0.395 ± 0.217
0.0MetXaa: 0.0 ± 0.0
Asn
3.556AsnAla: 3.556 ± 0.518
1.58AsnCys: 1.58 ± 0.251
1.976AsnAsp: 1.976 ± 0.15
1.185AsnGlu: 1.185 ± 1.203
2.766AsnPhe: 2.766 ± 0.285
4.346AsnGly: 4.346 ± 0.083
0.79AsnHis: 0.79 ± 0.184
2.766AsnIle: 2.766 ± 0.334
1.185AsnLys: 1.185 ± 0.652
3.556AsnLeu: 3.556 ± 1.136
0.0AsnMet: 0.0 ± 0.0
1.976AsnAsn: 1.976 ± 0.468
4.346AsnPro: 4.346 ± 0.701
1.976AsnGln: 1.976 ± 0.468
1.976AsnArg: 1.976 ± 0.15
3.556AsnSer: 3.556 ± 0.719
2.371AsnThr: 2.371 ± 2.407
4.346AsnVal: 4.346 ± 0.701
0.79AsnTrp: 0.79 ± 0.184
0.395AsnTyr: 0.395 ± 0.401
0.0AsnXaa: 0.0 ± 0.0
Pro
3.951ProAla: 3.951 ± 2.774
0.395ProCys: 0.395 ± 0.401
1.976ProAsp: 1.976 ± 0.468
3.161ProGlu: 3.161 ± 1.739
3.161ProPhe: 3.161 ± 0.735
3.161ProGly: 3.161 ± 0.117
1.58ProHis: 1.58 ± 0.986
3.951ProIle: 3.951 ± 1.537
1.976ProLys: 1.976 ± 0.769
3.556ProLeu: 3.556 ± 0.719
2.371ProMet: 2.371 ± 0.191
1.976ProAsn: 1.976 ± 0.468
3.556ProPro: 3.556 ± 0.101
1.58ProGln: 1.58 ± 0.367
1.185ProArg: 1.185 ± 1.203
2.766ProSer: 2.766 ± 0.903
4.741ProThr: 4.741 ± 0.484
2.766ProVal: 2.766 ± 0.952
1.58ProTrp: 1.58 ± 0.367
2.371ProTyr: 2.371 ± 0.686
0.0ProXaa: 0.0 ± 0.0
Gln
1.58GlnAla: 1.58 ± 0.869
0.395GlnCys: 0.395 ± 0.217
0.79GlnAsp: 0.79 ± 0.184
0.0GlnGlu: 0.0 ± 0.0
1.58GlnPhe: 1.58 ± 0.986
2.766GlnGly: 2.766 ± 0.285
1.976GlnHis: 1.976 ± 1.087
1.976GlnIle: 1.976 ± 0.468
0.79GlnLys: 0.79 ± 0.184
3.556GlnLeu: 3.556 ± 0.719
0.0GlnMet: 0.0 ± 0.0
1.185GlnAsn: 1.185 ± 0.034
1.58GlnPro: 1.58 ± 0.367
0.395GlnGln: 0.395 ± 0.217
1.185GlnArg: 1.185 ± 0.034
5.531GlnSer: 5.531 ± 1.286
2.766GlnThr: 2.766 ± 0.285
1.976GlnVal: 1.976 ± 0.15
0.395GlnTrp: 0.395 ± 0.217
3.161GlnTyr: 3.161 ± 0.117
0.0GlnXaa: 0.0 ± 0.0
Arg
4.741ArgAla: 4.741 ± 2.339
0.79ArgCys: 0.79 ± 0.435
3.161ArgAsp: 3.161 ± 0.502
2.766ArgGlu: 2.766 ± 0.903
3.951ArgPhe: 3.951 ± 0.3
3.951ArgGly: 3.951 ± 0.3
1.185ArgHis: 1.185 ± 0.034
3.556ArgIle: 3.556 ± 0.518
2.766ArgLys: 2.766 ± 0.903
5.531ArgLeu: 5.531 ± 0.049
1.976ArgMet: 1.976 ± 0.15
1.185ArgAsn: 1.185 ± 0.034
2.766ArgPro: 2.766 ± 0.952
1.185ArgGln: 1.185 ± 0.652
2.371ArgArg: 2.371 ± 0.686
4.741ArgSer: 4.741 ± 0.484
1.58ArgThr: 1.58 ± 0.869
4.741ArgVal: 4.741 ± 0.753
0.0ArgTrp: 0.0 ± 0.0
3.556ArgTyr: 3.556 ± 0.518
0.0ArgXaa: 0.0 ± 0.0
Ser
5.927SerAla: 5.927 ± 2.306
0.79SerCys: 0.79 ± 0.184
4.346SerAsp: 4.346 ± 0.083
2.371SerGlu: 2.371 ± 0.551
3.556SerPhe: 3.556 ± 0.719
6.322SerGly: 6.322 ± 0.385
0.79SerHis: 0.79 ± 0.184
6.322SerIle: 6.322 ± 1.47
5.531SerLys: 5.531 ± 0.569
6.717SerLeu: 6.717 ± 0.634
3.556SerMet: 3.556 ± 0.101
3.951SerAsn: 3.951 ± 0.919
1.58SerPro: 1.58 ± 0.367
1.976SerGln: 1.976 ± 0.769
4.741SerArg: 4.741 ± 1.102
5.927SerSer: 5.927 ± 0.45
3.161SerThr: 3.161 ± 0.735
3.951SerVal: 3.951 ± 0.3
1.976SerTrp: 1.976 ± 0.15
3.951SerTyr: 3.951 ± 1.555
0.0SerXaa: 0.0 ± 0.0
Thr
5.136ThrAla: 5.136 ± 0.352
1.185ThrCys: 1.185 ± 0.034
3.556ThrAsp: 3.556 ± 1.136
1.58ThrGlu: 1.58 ± 0.367
3.556ThrPhe: 3.556 ± 0.518
4.346ThrGly: 4.346 ± 1.938
0.395ThrHis: 0.395 ± 0.217
2.371ThrIle: 2.371 ± 1.17
3.556ThrLys: 3.556 ± 1.338
5.927ThrLeu: 5.927 ± 1.405
1.976ThrMet: 1.976 ± 0.769
5.927ThrAsn: 5.927 ± 1.069
4.346ThrPro: 4.346 ± 0.535
2.371ThrGln: 2.371 ± 1.788
4.346ThrArg: 4.346 ± 0.083
3.951ThrSer: 3.951 ± 0.919
5.136ThrThr: 5.136 ± 2.74
5.531ThrVal: 5.531 ± 0.569
0.79ThrTrp: 0.79 ± 0.184
3.556ThrTyr: 3.556 ± 0.719
0.0ThrXaa: 0.0 ± 0.0
Val
4.741ValAla: 4.741 ± 1.102
1.976ValCys: 1.976 ± 1.087
5.136ValAsp: 5.136 ± 1.504
4.346ValGlu: 4.346 ± 1.154
1.58ValPhe: 1.58 ± 0.869
3.161ValGly: 3.161 ± 0.117
1.976ValHis: 1.976 ± 0.468
5.136ValIle: 5.136 ± 0.267
2.766ValLys: 2.766 ± 0.285
5.531ValLeu: 5.531 ± 0.049
1.58ValMet: 1.58 ± 0.251
3.161ValAsn: 3.161 ± 0.735
4.346ValPro: 4.346 ± 1.32
1.58ValGln: 1.58 ± 0.251
4.741ValArg: 4.741 ± 0.484
4.741ValSer: 4.741 ± 1.371
3.556ValThr: 3.556 ± 1.136
2.766ValVal: 2.766 ± 0.903
0.79ValTrp: 0.79 ± 0.435
3.951ValTyr: 3.951 ± 0.919
0.0ValXaa: 0.0 ± 0.0
Trp
1.185TrpAla: 1.185 ± 0.034
0.79TrpCys: 0.79 ± 0.435
1.185TrpAsp: 1.185 ± 0.034
0.0TrpGlu: 0.0 ± 0.0
0.395TrpPhe: 0.395 ± 0.217
0.79TrpGly: 0.79 ± 0.435
0.395TrpHis: 0.395 ± 0.401
0.395TrpIle: 0.395 ± 0.217
1.185TrpLys: 1.185 ± 0.652
1.58TrpLeu: 1.58 ± 0.367
0.0TrpMet: 0.0 ± 0.0
0.395TrpAsn: 0.395 ± 0.217
0.79TrpPro: 0.79 ± 0.802
0.79TrpGln: 0.79 ± 0.184
1.185TrpArg: 1.185 ± 0.034
1.976TrpSer: 1.976 ± 0.15
0.79TrpThr: 0.79 ± 0.184
0.395TrpVal: 0.395 ± 0.217
0.0TrpTrp: 0.0 ± 0.0
0.79TrpTyr: 0.79 ± 0.435
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.346TyrAla: 4.346 ± 0.083
0.79TyrCys: 0.79 ± 0.435
2.371TyrAsp: 2.371 ± 0.067
1.976TyrGlu: 1.976 ± 1.087
2.766TyrPhe: 2.766 ± 1.571
3.161TyrGly: 3.161 ± 1.12
3.556TyrHis: 3.556 ± 0.518
2.766TyrIle: 2.766 ± 0.285
0.79TyrLys: 0.79 ± 0.184
3.161TyrLeu: 3.161 ± 1.12
1.58TyrMet: 1.58 ± 0.869
1.58TyrAsn: 1.58 ± 0.986
1.976TyrPro: 1.976 ± 0.15
0.395TyrGln: 0.395 ± 0.217
5.136TyrArg: 5.136 ± 0.267
3.161TyrSer: 3.161 ± 0.735
3.161TyrThr: 3.161 ± 0.502
3.556TyrVal: 3.556 ± 0.719
0.79TyrTrp: 0.79 ± 0.435
1.976TyrTyr: 1.976 ± 0.15
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2532 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski