Amino acid dipeptide frequency for Shahe picorna-like virus 9

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random with equal probability, each of the 400 would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.
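The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the page does not say how pairs are counted, so the sketch assumes overlapping pairs counted within each protein, uses one-letter codes (with X standing in for Xaa) rather than the three-letter codes of the table, and runs on made-up toy sequences. The function name `dipeptide_permille` is hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Assumption: overlapping pairs, never spanning two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy sequences, not taken from the Shahe picorna-like virus 9 proteome.
freqs = dipeptide_permille(["MKALAAL", "ALXK"])

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille (0.25%).
expected_uniform = 1000.0 / 400
overrepresented = sorted(d for d, f in freqs.items() if f > expected_uniform)
```

Comparing each value with `expected_uniform` mirrors the red/blue marking described above, although the threshold actually used on the page is not stated.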

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions; for example, 4.459‰ corresponds to 0.004459.

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error), grouped by first residue. Second residues follow the order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.459 ± 0.339
AlaCys: 0.811 ± 0.265
AlaAsp: 4.864 ± 0.566
AlaGlu: 4.864 ± 1.284
AlaPhe: 6.486 ± 0.035
AlaGly: 3.243 ± 0.342
AlaHis: 0.405 ± 0.227
AlaIle: 4.459 ± 0.339
AlaLys: 4.864 ± 1.284
AlaLeu: 6.486 ± 1.473
AlaMet: 2.432 ± 0.077
AlaAsn: 3.648 ± 2.272
AlaPro: 2.837 ± 1.288
AlaGln: 1.216 ± 0.681
AlaArg: 5.27 ± 0.792
AlaSer: 8.512 ± 3.144
AlaThr: 4.864 ± 1.591
AlaVal: 6.08 ± 0.527
AlaTrp: 1.621 ± 0.907
AlaTyr: 2.027 ± 1.022
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.811 ± 0.454
CysCys: 0.0 ± 0.0
CysAsp: 0.811 ± 0.265
CysGlu: 0.405 ± 0.227
CysPhe: 1.216 ± 0.038
CysGly: 1.216 ± 0.038
CysHis: 0.0 ± 0.0
CysIle: 1.216 ± 0.757
CysLys: 0.405 ± 0.227
CysLeu: 1.621 ± 0.907
CysMet: 0.0 ± 0.0
CysAsn: 0.405 ± 0.227
CysPro: 1.621 ± 0.907
CysGln: 0.811 ± 0.984
CysArg: 0.0 ± 0.0
CysSer: 0.405 ± 0.492
CysThr: 1.621 ± 0.189
CysVal: 0.405 ± 0.492
CysTrp: 0.0 ± 0.0
CysTyr: 0.405 ± 0.227
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.243 ± 1.096
AspCys: 0.811 ± 0.454
AspAsp: 3.243 ± 1.096
AspGlu: 3.648 ± 1.323
AspPhe: 3.243 ± 0.342
AspGly: 2.837 ± 0.869
AspHis: 2.432 ± 0.077
AspIle: 3.243 ± 1.096
AspLys: 3.243 ± 1.096
AspLeu: 3.243 ± 0.342
AspMet: 2.432 ± 1.361
AspAsn: 2.837 ± 0.15
AspPro: 1.216 ± 0.038
AspGln: 2.432 ± 0.077
AspArg: 1.216 ± 0.038
AspSer: 2.432 ± 0.077
AspThr: 2.027 ± 0.415
AspVal: 3.648 ± 0.115
AspTrp: 0.405 ± 0.227
AspTyr: 1.621 ± 0.907
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.08 ± 1.246
GluCys: 1.216 ± 0.038
GluAsp: 4.054 ± 0.112
GluGlu: 6.08 ± 1.965
GluPhe: 4.864 ± 0.153
GluGly: 4.459 ± 1.777
GluHis: 1.216 ± 0.681
GluIle: 2.432 ± 0.077
GluLys: 4.054 ± 1.55
GluLeu: 4.864 ± 1.284
GluMet: 2.837 ± 1.588
GluAsn: 1.621 ± 0.189
GluPro: 2.027 ± 0.415
GluGln: 2.432 ± 1.361
GluArg: 0.811 ± 0.454
GluSer: 4.054 ± 1.55
GluThr: 2.027 ± 0.303
GluVal: 1.216 ± 0.038
GluTrp: 0.405 ± 0.227
GluTyr: 1.216 ± 0.038
GluXaa: 0.0 ± 0.0
Phe
PheAla: 5.27 ± 2.083
PheCys: 1.621 ± 0.189
PheAsp: 2.432 ± 0.077
PheGlu: 1.621 ± 0.53
PhePhe: 1.216 ± 0.681
PheGly: 4.864 ± 0.153
PheHis: 0.811 ± 0.984
PheIle: 3.648 ± 1.553
PheLys: 2.432 ± 1.361
PheLeu: 3.648 ± 0.115
PheMet: 2.027 ± 1.134
PheAsn: 2.837 ± 1.288
PhePro: 0.811 ± 0.454
PheGln: 3.243 ± 0.342
PheArg: 1.621 ± 0.53
PheSer: 6.486 ± 0.684
PheThr: 5.27 ± 0.074
PheVal: 2.837 ± 0.869
PheTrp: 1.621 ± 0.189
PheTyr: 1.216 ± 0.038
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.243 ± 1.096
GlyCys: 0.811 ± 0.454
GlyAsp: 3.243 ± 1.096
GlyGlu: 4.054 ± 0.607
GlyPhe: 3.648 ± 0.115
GlyGly: 3.648 ± 2.272
GlyHis: 0.405 ± 0.227
GlyIle: 6.486 ± 0.035
GlyLys: 4.459 ± 1.058
GlyLeu: 4.054 ± 0.831
GlyMet: 1.621 ± 0.455
GlyAsn: 4.864 ± 0.153
GlyPro: 1.216 ± 0.038
GlyGln: 1.216 ± 0.038
GlyArg: 0.811 ± 0.265
GlySer: 5.27 ± 2.083
GlyThr: 6.891 ± 4.051
GlyVal: 2.837 ± 0.15
GlyTrp: 0.811 ± 0.454
GlyTyr: 2.837 ± 0.569
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.811 ± 0.984
HisCys: 0.405 ± 0.492
HisAsp: 0.811 ± 0.454
HisGlu: 1.216 ± 0.681
HisPhe: 1.621 ± 0.189
HisGly: 0.405 ± 0.492
HisHis: 0.405 ± 0.227
HisIle: 0.811 ± 0.265
HisLys: 0.811 ± 0.454
HisLeu: 1.216 ± 0.038
HisMet: 0.0 ± 0.0
HisAsn: 0.811 ± 0.454
HisPro: 1.216 ± 0.681
HisGln: 2.027 ± 0.415
HisArg: 0.405 ± 0.227
HisSer: 0.811 ± 0.454
HisThr: 0.405 ± 0.227
HisVal: 1.621 ± 0.189
HisTrp: 0.405 ± 0.492
HisTyr: 0.405 ± 0.227
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.702 ± 0.716
IleCys: 0.811 ± 0.265
IleAsp: 1.621 ± 0.53
IleGlu: 3.648 ± 0.604
IlePhe: 1.621 ± 0.189
IleGly: 2.837 ± 2.006
IleHis: 1.216 ± 0.038
IleIle: 3.243 ± 1.096
IleLys: 2.027 ± 1.134
IleLeu: 4.054 ± 0.112
IleMet: 2.432 ± 0.642
IleAsn: 2.837 ± 0.569
IlePro: 4.459 ± 2.537
IleGln: 0.405 ± 0.227
IleArg: 1.621 ± 0.53
IleSer: 5.27 ± 1.364
IleThr: 3.648 ± 0.115
IleVal: 3.648 ± 0.115
IleTrp: 0.811 ± 0.454
IleTyr: 2.027 ± 0.303
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.675 ± 2.457
LysCys: 0.0 ± 0.0
LysAsp: 3.648 ± 1.323
LysGlu: 2.837 ± 0.869
LysPhe: 1.621 ± 0.189
LysGly: 3.243 ± 1.096
LysHis: 0.405 ± 0.227
LysIle: 4.054 ± 0.112
LysLys: 2.837 ± 1.588
LysLeu: 3.648 ± 0.115
LysMet: 3.243 ± 1.096
LysAsn: 2.432 ± 1.361
LysPro: 3.243 ± 0.377
LysGln: 1.216 ± 0.038
LysArg: 2.432 ± 0.642
LysSer: 5.27 ± 2.23
LysThr: 3.243 ± 0.377
LysVal: 5.27 ± 0.792
LysTrp: 1.216 ± 0.681
LysTyr: 2.432 ± 1.361
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.675 ± 1.856
LeuCys: 2.837 ± 0.15
LeuAsp: 4.459 ± 0.38
LeuGlu: 4.864 ± 0.153
LeuPhe: 3.648 ± 0.604
LeuGly: 4.459 ± 1.777
LeuHis: 1.216 ± 0.681
LeuIle: 3.243 ± 1.096
LeuLys: 7.296 ± 1.927
LeuLeu: 6.08 ± 1.246
LeuMet: 0.811 ± 0.454
LeuAsn: 2.837 ± 0.15
LeuPro: 3.648 ± 0.834
LeuGln: 2.837 ± 0.869
LeuArg: 3.648 ± 0.115
LeuSer: 8.107 ± 0.224
LeuThr: 4.054 ± 0.831
LeuVal: 4.864 ± 2.31
LeuTrp: 1.621 ± 0.907
LeuTyr: 1.621 ± 1.249
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.621 ± 0.189
MetCys: 0.0 ± 0.0
MetAsp: 2.432 ± 0.077
MetGlu: 2.027 ± 1.134
MetPhe: 1.621 ± 0.189
MetGly: 2.837 ± 0.15
MetHis: 0.405 ± 0.227
MetIle: 2.432 ± 0.077
MetLys: 1.216 ± 0.681
MetLeu: 2.837 ± 0.15
MetMet: 0.0 ± 0.0
MetAsn: 2.027 ± 0.303
MetPro: 2.027 ± 0.415
MetGln: 0.405 ± 0.227
MetArg: 2.432 ± 1.361
MetSer: 3.648 ± 0.115
MetThr: 3.243 ± 1.815
MetVal: 1.621 ± 0.907
MetTrp: 0.0 ± 0.0
MetTyr: 0.811 ± 0.454
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 6.486 ± 0.035
AsnCys: 0.811 ± 0.265
AsnAsp: 1.216 ± 0.038
AsnGlu: 2.027 ± 0.415
AsnPhe: 4.459 ± 0.38
AsnGly: 2.837 ± 1.288
AsnHis: 0.811 ± 0.454
AsnIle: 3.648 ± 2.991
AsnLys: 2.837 ± 0.869
AsnLeu: 3.243 ± 0.342
AsnMet: 1.216 ± 0.681
AsnAsn: 4.054 ± 0.112
AsnPro: 2.432 ± 0.796
AsnGln: 1.216 ± 0.757
AsnArg: 1.621 ± 0.189
AsnSer: 3.243 ± 0.342
AsnThr: 3.648 ± 1.553
AsnVal: 2.837 ± 0.569
AsnTrp: 0.405 ± 0.227
AsnTyr: 2.027 ± 1.022
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.837 ± 0.569
ProCys: 0.811 ± 0.454
ProAsp: 0.0 ± 0.0
ProGlu: 1.621 ± 0.189
ProPhe: 3.243 ± 1.78
ProGly: 2.837 ± 1.588
ProHis: 0.811 ± 0.265
ProIle: 1.216 ± 0.757
ProLys: 2.432 ± 1.361
ProLeu: 5.27 ± 0.074
ProMet: 2.432 ± 1.514
ProAsn: 2.837 ± 2.006
ProPro: 0.405 ± 0.492
ProGln: 1.621 ± 0.189
ProArg: 1.216 ± 0.681
ProSer: 5.27 ± 2.802
ProThr: 3.648 ± 1.553
ProVal: 2.837 ± 0.569
ProTrp: 0.405 ± 0.227
ProTyr: 2.027 ± 0.303
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.216 ± 0.681
GlnCys: 0.811 ± 0.265
GlnAsp: 1.621 ± 0.907
GlnGlu: 1.621 ± 0.907
GlnPhe: 2.837 ± 0.569
GlnGly: 4.459 ± 1.058
GlnHis: 0.405 ± 0.227
GlnIle: 0.811 ± 0.265
GlnLys: 0.811 ± 0.454
GlnLeu: 3.648 ± 0.115
GlnMet: 1.216 ± 0.038
GlnAsn: 0.405 ± 0.227
GlnPro: 2.027 ± 0.303
GlnGln: 1.216 ± 0.038
GlnArg: 2.027 ± 0.415
GlnSer: 2.432 ± 0.796
GlnThr: 0.811 ± 0.265
GlnVal: 3.648 ± 0.604
GlnTrp: 0.811 ± 0.454
GlnTyr: 1.216 ± 0.038
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.837 ± 1.588
ArgCys: 0.405 ± 0.227
ArgAsp: 3.243 ± 0.377
ArgGlu: 2.432 ± 1.361
ArgPhe: 1.216 ± 0.757
ArgGly: 4.054 ± 0.607
ArgHis: 1.216 ± 0.681
ArgIle: 2.432 ± 0.642
ArgLys: 3.243 ± 0.377
ArgLeu: 2.432 ± 0.796
ArgMet: 2.432 ± 1.361
ArgAsn: 2.837 ± 0.15
ArgPro: 0.811 ± 0.454
ArgGln: 1.216 ± 0.038
ArgArg: 1.216 ± 0.681
ArgSer: 2.432 ± 0.077
ArgThr: 2.027 ± 1.022
ArgVal: 3.648 ± 0.834
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.027 ± 0.415
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.08 ± 1.629
SerCys: 0.405 ± 0.227
SerAsp: 1.621 ± 0.189
SerGlu: 5.675 ± 1.738
SerPhe: 5.27 ± 2.083
SerGly: 4.864 ± 1.591
SerHis: 2.027 ± 0.415
SerIle: 2.837 ± 1.288
SerLys: 3.648 ± 1.553
SerLeu: 5.675 ± 0.3
SerMet: 3.648 ± 0.604
SerAsn: 5.675 ± 3.294
SerPro: 3.243 ± 0.377
SerGln: 3.243 ± 1.096
SerArg: 4.054 ± 1.55
SerSer: 6.08 ± 0.91
SerThr: 9.323 ± 4.128
SerVal: 7.702 ± 2.879
SerTrp: 0.405 ± 0.492
SerTyr: 2.432 ± 0.077
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.486 ± 2.121
ThrCys: 0.0 ± 0.0
ThrAsp: 2.432 ± 1.361
ThrGlu: 3.648 ± 0.604
ThrPhe: 3.243 ± 0.377
ThrGly: 3.243 ± 1.78
ThrHis: 0.811 ± 0.265
ThrIle: 2.027 ± 0.415
ThrLys: 4.459 ± 0.339
ThrLeu: 5.675 ± 3.294
ThrMet: 2.837 ± 0.569
ThrAsn: 2.432 ± 0.077
ThrPro: 3.243 ± 1.061
ThrGln: 2.027 ± 1.022
ThrArg: 3.243 ± 1.061
ThrSer: 5.27 ± 2.802
ThrThr: 7.296 ± 3.105
ThrVal: 6.08 ± 0.527
ThrTrp: 2.027 ± 1.022
ThrTyr: 4.054 ± 0.112
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.459 ± 0.38
ValCys: 0.405 ± 0.492
ValAsp: 4.054 ± 1.55
ValGlu: 4.054 ± 1.55
ValPhe: 1.216 ± 0.681
ValGly: 4.054 ± 0.607
ValHis: 0.811 ± 0.265
ValIle: 5.675 ± 0.418
ValLys: 4.864 ± 2.722
ValLeu: 4.864 ± 1.284
ValMet: 0.405 ± 0.367
ValAsn: 4.054 ± 0.607
ValPro: 5.27 ± 4.24
ValGln: 4.459 ± 1.058
ValArg: 4.459 ± 1.818
ValSer: 6.891 ± 1.895
ValThr: 2.432 ± 0.796
ValVal: 2.027 ± 1.741
ValTrp: 0.405 ± 0.227
ValTyr: 1.216 ± 1.476
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.216 ± 0.681
TrpCys: 0.0 ± 0.0
TrpAsp: 1.216 ± 0.681
TrpGlu: 0.405 ± 0.227
TrpPhe: 1.621 ± 0.53
TrpGly: 0.405 ± 0.227
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.405 ± 0.227
TrpLeu: 1.621 ± 0.189
TrpMet: 0.405 ± 0.227
TrpAsn: 0.811 ± 0.454
TrpPro: 0.405 ± 0.227
TrpGln: 0.405 ± 0.227
TrpArg: 2.027 ± 0.415
TrpSer: 0.811 ± 0.454
TrpThr: 1.216 ± 0.757
TrpVal: 0.811 ± 0.454
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.216 ± 0.038
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.243 ± 2.498
TyrCys: 0.405 ± 0.227
TyrAsp: 2.432 ± 0.642
TyrGlu: 1.216 ± 0.038
TyrPhe: 1.621 ± 0.189
TyrGly: 1.621 ± 1.968
TyrHis: 0.811 ± 0.265
TyrIle: 2.027 ± 0.415
TyrLys: 1.621 ± 0.53
TyrLeu: 3.648 ± 1.323
TyrMet: 0.811 ± 0.265
TyrAsn: 0.811 ± 0.265
TyrPro: 1.621 ± 0.53
TyrGln: 0.405 ± 0.227
TyrArg: 2.027 ± 1.134
TyrSer: 1.216 ± 0.757
TyrThr: 2.837 ± 0.869
TyrVal: 2.837 ± 0.569
TyrTrp: 1.621 ± 0.189
TyrTyr: 0.405 ± 0.492
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2468 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
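A protein-level bootstrap of this kind can be sketched as follows. This is an assumption-laden illustration, not the Proteome-pI implementation: the page does not specify the resampling scheme or the error statistic, so the sketch resamples whole proteins with replacement and reports the standard deviation across replicates. The names `pair_permille` and `bootstrap_error` are hypothetical, and the sequences are toy data.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, dipeptide))
    # Assumption: the reported error is the spread of the replicates.
    return statistics.pstdev(estimates)


# Toy sequences in one-letter code, not the real proteome.
toy = ["MKALAAL", "ALXKAL", "GGSAL"]
err = bootstrap_error(toy, "AL")
```

With only 2 proteins, as in this proteome, such resampling can produce very few distinct samples, so error estimates obtained this way should be read with caution.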

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski