Amino acid dipeptide frequency for Shahe picorna-like virus 5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and all underrepresented ones in blue, in the table.
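The comparison against the uniform expectation can be sketched in Python. This is a minimal illustration, not the database's own code: the function names `dipeptide_permille` and `label`, the use of one-letter codes, the overlapping-window counting, and the example sequences are all assumptions made for the example.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes

def dipeptide_permille(proteins):
    """Overlapping dipeptide frequencies in per mille over all proteins (assumed scheme)."""
    counts = Counter()
    for seq in proteins:
        # Any non-standard or unknown letter is treated as Xaa ('X').
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Windows of length 2 within one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

# Uniform expectation: 1 of 400 dipeptides, i.e. 0.25% or 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400

def label(freq_permille):
    """Classify a frequency relative to the uniform expectation."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"
    if freq_permille < EXPECTED_PERMILLE:
        return "under"
    return "expected"

# Hypothetical toy sequences, not taken from the proteome.
freqs = dipeptide_permille(["MKVLAAGSS", "AAXSS"])
```

With the table's values, `label(3.653)` gives `"over"` for AlaAla and `label(0.406)` gives `"under"` for AlaCys.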

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
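As a small worked example of the unit conversion, the following Python sketch turns a per mille value into a fraction and into a percentage. The helper names are hypothetical; the AlaAla value is the one listed in the table.

```python
def permille_to_fraction(value_permille):
    """Per mille to plain fraction: multiply by 10^-3."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Per mille to percent: divide by 10."""
    return value_permille / 10.0

ala_ala = 3.653  # AlaAla frequency in per mille, from the table
fraction = permille_to_fraction(ala_ala)  # about 0.003653
percent = permille_to_percent(ala_ala)    # about 0.3653 percent
```

On the same scale, the uniform expectation of 0.25% corresponds to 2.5‰.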

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Each entry: dipeptide: frequency ± error (‰)
Ala
AlaAla: 3.653 ± 1.105
AlaCys: 0.406 ± 0.208
AlaAsp: 2.841 ± 0.265
AlaGlu: 2.841 ± 0.265
AlaPhe: 4.464 ± 0.501
AlaGly: 5.276 ± 1.464
AlaHis: 1.623 ± 0.831
AlaIle: 2.435 ± 0.538
AlaLys: 3.247 ± 0.718
AlaLeu: 4.058 ± 0.293
AlaMet: 1.623 ± 0.236
AlaAsn: 3.247 ± 1.313
AlaPro: 5.276 ± 1.464
AlaGln: 2.029 ± 0.444
AlaArg: 2.841 ± 1.521
AlaSer: 7.305 ± 0.425
AlaThr: 5.682 ± 2.447
AlaVal: 3.653 ± 0.085
AlaTrp: 1.218 ± 0.567
AlaTyr: 4.058 ± 1.493
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.623 ± 0.236
CysCys: 0.406 ± 0.208
CysAsp: 1.623 ± 0.236
CysGlu: 2.435 ± 1.247
CysPhe: 1.218 ± 0.028
CysGly: 1.623 ± 0.831
CysHis: 0.812 ± 0.416
CysIle: 1.623 ± 0.359
CysLys: 0.0 ± 0.0
CysLeu: 1.623 ± 0.236
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 1.218 ± 0.028
CysGln: 0.0 ± 0.0
CysArg: 1.623 ± 0.359
CysSer: 0.812 ± 0.416
CysThr: 0.812 ± 0.416
CysVal: 1.623 ± 0.831
CysTrp: 0.0 ± 0.0
CysTyr: 0.406 ± 0.208
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.623 ± 0.236
AspCys: 0.812 ± 0.416
AspAsp: 5.276 ± 1.512
AspGlu: 4.87 ± 0.114
AspPhe: 3.247 ± 0.472
AspGly: 2.029 ± 0.444
AspHis: 0.406 ± 0.208
AspIle: 4.464 ± 1.691
AspLys: 3.247 ± 1.663
AspLeu: 5.276 ± 0.321
AspMet: 2.029 ± 0.948
AspAsn: 2.435 ± 0.538
AspPro: 3.653 ± 1.7
AspGln: 0.812 ± 0.179
AspArg: 2.435 ± 0.057
AspSer: 3.653 ± 1.105
AspThr: 4.058 ± 1.493
AspVal: 3.653 ± 1.7
AspTrp: 1.218 ± 0.028
AspTyr: 1.623 ± 0.831
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.841 ± 0.331
GluCys: 1.623 ± 0.236
GluAsp: 3.247 ± 0.123
GluGlu: 4.87 ± 1.304
GluPhe: 3.653 ± 0.085
GluGly: 2.435 ± 0.652
GluHis: 2.029 ± 0.444
GluIle: 2.029 ± 0.444
GluLys: 1.623 ± 0.831
GluLeu: 7.711 ± 1.569
GluMet: 0.406 ± 0.387
GluAsn: 2.029 ± 0.151
GluPro: 2.435 ± 0.057
GluGln: 0.812 ± 0.416
GluArg: 2.029 ± 1.039
GluSer: 2.841 ± 1.455
GluThr: 1.218 ± 0.567
GluVal: 4.058 ± 0.293
GluTrp: 0.406 ± 0.208
GluTyr: 1.623 ± 0.359
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.058 ± 0.897
PheCys: 0.406 ± 0.208
PheAsp: 4.87 ± 0.482
PheGlu: 2.841 ± 0.926
PhePhe: 3.247 ± 0.472
PheGly: 3.247 ± 0.718
PheHis: 2.435 ± 0.057
PheIle: 1.623 ± 0.236
PheLys: 2.435 ± 1.247
PheLeu: 5.682 ± 1.72
PheMet: 1.218 ± 0.028
PheAsn: 2.435 ± 0.652
PhePro: 2.841 ± 0.86
PheGln: 2.029 ± 0.151
PheArg: 2.435 ± 0.652
PheSer: 3.653 ± 1.276
PheThr: 3.653 ± 1.105
PheVal: 5.276 ± 0.917
PheTrp: 0.812 ± 0.416
PheTyr: 2.029 ± 0.151
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.653 ± 2.296
GlyCys: 0.0 ± 0.0
GlyAsp: 4.058 ± 0.888
GlyGlu: 4.058 ± 0.888
GlyPhe: 3.247 ± 0.123
GlyGly: 4.464 ± 0.69
GlyHis: 2.029 ± 1.039
GlyIle: 2.029 ± 0.746
GlyLys: 4.464 ± 1.691
GlyLeu: 3.247 ± 0.472
GlyMet: 1.218 ± 0.624
GlyAsn: 3.653 ± 1.105
GlyPro: 2.841 ± 0.926
GlyGln: 1.218 ± 0.567
GlyArg: 3.653 ± 0.68
GlySer: 5.276 ± 2.655
GlyThr: 5.682 ± 0.066
GlyVal: 4.87 ± 0.114
GlyTrp: 1.218 ± 0.624
GlyTyr: 0.812 ± 0.179
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.435 ± 0.057
HisCys: 0.812 ± 0.416
HisAsp: 0.406 ± 0.208
HisGlu: 0.406 ± 0.208
HisPhe: 0.812 ± 0.179
HisGly: 4.058 ± 1.483
HisHis: 0.406 ± 0.208
HisIle: 2.029 ± 0.444
HisLys: 0.812 ± 0.416
HisLeu: 2.435 ± 0.057
HisMet: 0.812 ± 0.416
HisAsn: 0.406 ± 0.208
HisPro: 1.623 ± 0.236
HisGln: 0.406 ± 0.208
HisArg: 0.812 ± 0.416
HisSer: 2.029 ± 0.444
HisThr: 2.029 ± 0.444
HisVal: 0.0 ± 0.0
HisTrp: 0.406 ± 0.387
HisTyr: 1.218 ± 0.028
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.247 ± 0.472
IleCys: 0.812 ± 0.416
IleAsp: 2.841 ± 0.265
IleGlu: 2.435 ± 0.652
IlePhe: 1.623 ± 0.236
IleGly: 3.247 ± 0.718
IleHis: 0.812 ± 0.179
IleIle: 2.841 ± 0.265
IleLys: 2.435 ± 1.134
IleLeu: 4.87 ± 0.709
IleMet: 0.812 ± 0.416
IleAsn: 3.247 ± 0.472
IlePro: 1.218 ± 0.567
IleGln: 4.058 ± 1.493
IleArg: 2.841 ± 0.86
IleSer: 4.464 ± 1.88
IleThr: 2.029 ± 0.444
IleVal: 3.653 ± 1.276
IleTrp: 1.218 ± 0.567
IleTyr: 2.435 ± 0.538
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.653 ± 0.68
LysCys: 2.029 ± 0.444
LysAsp: 3.247 ± 0.472
LysGlu: 2.029 ± 1.039
LysPhe: 3.247 ± 0.718
LysGly: 2.029 ± 1.039
LysHis: 0.812 ± 0.179
LysIle: 2.435 ± 0.057
LysLys: 2.435 ± 0.652
LysLeu: 4.058 ± 0.293
LysMet: 0.812 ± 0.416
LysAsn: 0.812 ± 0.179
LysPro: 1.623 ± 0.831
LysGln: 0.406 ± 0.387
LysArg: 3.247 ± 0.472
LysSer: 4.058 ± 0.293
LysThr: 2.435 ± 1.247
LysVal: 3.653 ± 0.68
LysTrp: 1.218 ± 0.624
LysTyr: 3.653 ± 1.871
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.682 ± 1.256
LeuCys: 2.435 ± 0.652
LeuAsp: 4.464 ± 1.096
LeuGlu: 4.058 ± 0.888
LeuPhe: 3.653 ± 0.085
LeuGly: 5.276 ± 0.321
LeuHis: 3.653 ± 1.871
LeuIle: 4.87 ± 0.482
LeuLys: 6.088 ± 1.928
LeuLeu: 4.464 ± 1.691
LeuMet: 1.623 ± 0.236
LeuAsn: 5.276 ± 1.464
LeuPro: 2.435 ± 0.057
LeuGln: 3.247 ± 1.908
LeuArg: 6.088 ± 0.453
LeuSer: 8.523 ± 0.992
LeuThr: 6.494 ± 1.436
LeuVal: 5.276 ± 0.321
LeuTrp: 1.623 ± 0.831
LeuTyr: 1.218 ± 0.028
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.812 ± 0.179
MetCys: 0.0 ± 0.0
MetAsp: 2.029 ± 0.151
MetGlu: 1.218 ± 0.624
MetPhe: 0.0 ± 0.0
MetGly: 2.029 ± 0.746
MetHis: 0.406 ± 0.208
MetIle: 1.218 ± 0.624
MetLys: 1.218 ± 0.567
MetLeu: 3.653 ± 1.276
MetMet: 1.218 ± 0.624
MetAsn: 2.029 ± 1.039
MetPro: 0.406 ± 0.387
MetGln: 0.0 ± 0.0
MetArg: 1.623 ± 0.831
MetSer: 0.406 ± 0.387
MetThr: 1.218 ± 0.624
MetVal: 1.623 ± 0.831
MetTrp: 0.406 ± 0.208
MetTyr: 0.812 ± 0.416
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.841 ± 0.265
AsnCys: 1.218 ± 0.567
AsnAsp: 0.812 ± 0.179
AsnGlu: 1.218 ± 0.624
AsnPhe: 2.435 ± 0.057
AsnGly: 4.464 ± 0.69
AsnHis: 0.812 ± 0.416
AsnIle: 4.058 ± 1.493
AsnLys: 0.812 ± 0.416
AsnLeu: 2.435 ± 0.652
AsnMet: 1.623 ± 0.359
AsnAsn: 2.841 ± 0.926
AsnPro: 4.058 ± 0.293
AsnGln: 0.812 ± 0.416
AsnArg: 2.435 ± 0.057
AsnSer: 3.247 ± 1.663
AsnThr: 4.87 ± 2.863
AsnVal: 4.464 ± 0.094
AsnTrp: 0.406 ± 0.387
AsnTyr: 3.653 ± 1.105
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.87 ± 2.267
ProCys: 0.406 ± 0.387
ProAsp: 2.435 ± 0.652
ProGlu: 0.812 ± 0.416
ProPhe: 2.435 ± 0.652
ProGly: 2.029 ± 0.151
ProHis: 1.218 ± 0.028
ProIle: 2.841 ± 0.265
ProLys: 1.218 ± 0.028
ProLeu: 5.276 ± 1.464
ProMet: 1.623 ± 0.954
ProAsn: 1.623 ± 0.954
ProPro: 2.029 ± 0.746
ProGln: 0.406 ± 0.208
ProArg: 1.218 ± 0.624
ProSer: 4.87 ± 1.077
ProThr: 2.435 ± 0.538
ProVal: 5.276 ± 2.059
ProTrp: 1.623 ± 0.236
ProTyr: 2.841 ± 0.86
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.623 ± 0.236
GlnCys: 0.406 ± 0.387
GlnAsp: 2.435 ± 1.729
GlnGlu: 2.029 ± 0.444
GlnPhe: 4.058 ± 0.293
GlnGly: 2.029 ± 1.342
GlnHis: 2.435 ± 0.057
GlnIle: 1.623 ± 0.359
GlnLys: 0.812 ± 0.416
GlnLeu: 2.841 ± 0.926
GlnMet: 1.218 ± 0.624
GlnAsn: 1.218 ± 0.028
GlnPro: 0.406 ± 0.387
GlnGln: 2.029 ± 0.151
GlnArg: 0.406 ± 0.208
GlnSer: 2.841 ± 0.265
GlnThr: 1.218 ± 0.567
GlnVal: 1.218 ± 1.162
GlnTrp: 0.406 ± 0.387
GlnTyr: 0.406 ± 0.208
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.653 ± 0.68
ArgCys: 0.812 ± 0.416
ArgAsp: 2.435 ± 0.057
ArgGlu: 2.435 ± 1.134
ArgPhe: 4.058 ± 0.293
ArgGly: 3.653 ± 0.085
ArgHis: 0.812 ± 0.416
ArgIle: 2.435 ± 1.134
ArgLys: 3.653 ± 0.68
ArgLeu: 3.653 ± 0.085
ArgMet: 0.812 ± 0.416
ArgAsn: 2.841 ± 1.455
ArgPro: 1.623 ± 0.236
ArgGln: 0.812 ± 0.416
ArgArg: 6.088 ± 1.928
ArgSer: 5.276 ± 1.512
ArgThr: 1.623 ± 0.831
ArgVal: 6.088 ± 1.049
ArgTrp: 0.812 ± 0.179
ArgTyr: 1.218 ± 0.028
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.276 ± 1.464
SerCys: 2.435 ± 0.652
SerAsp: 3.247 ± 0.718
SerGlu: 4.87 ± 0.709
SerPhe: 8.523 ± 1.984
SerGly: 4.87 ± 0.709
SerHis: 0.0 ± 0.0
SerIle: 4.058 ± 0.293
SerLys: 5.682 ± 0.661
SerLeu: 7.305 ± 1.615
SerMet: 0.812 ± 0.416
SerAsn: 4.87 ± 0.482
SerPro: 2.029 ± 0.151
SerGln: 2.841 ± 1.521
SerArg: 3.653 ± 0.085
SerSer: 8.523 ± 0.397
SerThr: 4.87 ± 3.458
SerVal: 7.305 ± 0.766
SerTrp: 1.218 ± 0.028
SerTyr: 2.029 ± 0.746
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.058 ± 1.493
ThrCys: 0.0 ± 0.0
ThrAsp: 3.653 ± 0.085
ThrGlu: 2.841 ± 0.265
ThrPhe: 0.812 ± 0.179
ThrGly: 3.653 ± 1.7
ThrHis: 0.406 ± 0.387
ThrIle: 3.247 ± 0.718
ThrLys: 2.029 ± 0.444
ThrLeu: 4.464 ± 3.07
ThrMet: 1.218 ± 0.028
ThrAsn: 3.247 ± 0.472
ThrPro: 6.088 ± 1.049
ThrGln: 2.841 ± 0.265
ThrArg: 3.653 ± 0.085
ThrSer: 5.682 ± 3.042
ThrThr: 3.247 ± 0.718
ThrVal: 6.494 ± 0.841
ThrTrp: 0.406 ± 0.387
ThrTyr: 2.435 ± 0.057
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.899 ± 0.558
ValCys: 2.029 ± 0.444
ValAsp: 4.87 ± 0.709
ValGlu: 0.812 ± 0.179
ValPhe: 3.653 ± 0.51
ValGly: 2.435 ± 0.538
ValHis: 2.029 ± 0.151
ValIle: 2.029 ± 0.151
ValLys: 4.058 ± 1.483
ValLeu: 7.711 ± 0.812
ValMet: 1.218 ± 0.454
ValAsn: 4.87 ± 0.114
ValPro: 4.464 ± 1.285
ValGln: 4.87 ± 0.482
ValArg: 5.276 ± 0.321
ValSer: 5.682 ± 0.661
ValThr: 3.653 ± 0.51
ValVal: 3.653 ± 0.085
ValTrp: 1.218 ± 0.028
ValTyr: 2.435 ± 0.538
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.406 ± 0.387
TrpCys: 2.029 ± 1.039
TrpAsp: 0.812 ± 0.179
TrpGlu: 0.406 ± 0.208
TrpPhe: 1.218 ± 0.028
TrpGly: 0.812 ± 0.416
TrpHis: 0.812 ± 0.179
TrpIle: 0.406 ± 0.208
TrpLys: 0.812 ± 0.416
TrpLeu: 2.029 ± 0.151
TrpMet: 0.406 ± 0.208
TrpAsn: 0.812 ± 0.179
TrpPro: 0.0 ± 0.0
TrpGln: 0.812 ± 0.416
TrpArg: 1.218 ± 0.028
TrpSer: 1.623 ± 0.359
TrpThr: 0.406 ± 0.387
TrpVal: 0.812 ± 0.775
TrpTrp: 0.406 ± 0.208
TrpTyr: 0.406 ± 0.208
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.87 ± 1.672
TyrCys: 0.406 ± 0.208
TyrAsp: 1.218 ± 0.567
TyrGlu: 2.435 ± 0.538
TyrPhe: 1.623 ± 0.831
TyrGly: 2.029 ± 0.444
TyrHis: 0.406 ± 0.208
TyrIle: 2.841 ± 0.86
TyrLys: 0.812 ± 0.416
TyrLeu: 3.653 ± 0.68
TyrMet: 1.218 ± 0.624
TyrAsn: 1.623 ± 0.954
TyrPro: 1.218 ± 0.567
TyrGln: 1.218 ± 0.567
TyrArg: 1.218 ± 0.028
TyrSer: 3.653 ± 0.085
TyrThr: 2.841 ± 0.265
TyrVal: 1.623 ± 0.359
TyrTrp: 0.406 ± 0.208
TyrTyr: 0.812 ± 0.416
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2465 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
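A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the error is taken as the standard deviation over replicates. The function names, the fixed seed, the use of the population standard deviation, and the toy sequences are hypothetical and are not taken from the database's implementation.

```python
import random
import statistics

def permille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille over all proteins."""
    total = sum(max(len(seq) - 1, 0) for seq in proteins)
    hits = sum(
        1
        for seq in proteins
        for i in range(len(seq) - 1)
        if seq[i:i + 2] == dipeptide
    )
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Standard deviation of the frequency over n_boot protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(permille(sample, dipeptide))
    return statistics.pstdev(estimates)

# Hypothetical toy proteome of two short sequences.
toy = ["MKVLAAGSS", "AAXSS"]
err = bootstrap_error(toy, "AA")
```

A dipeptide that never occurs has frequency 0 in every replicate, so its error is also 0, which is consistent with the `0.0 ± 0.0` entries in the table.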

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski