Amino acid dipeptide frequency for Shahe picorna-like virus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred randomly and with equal probability in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
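The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used by Proteome-pI: it works on one-letter codes, counts overlapping pairs within each protein only, maps anything outside the 20 standard letters to X (Xaa), and the function name and toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

# One-letter codes of the 20 standard amino acids; "X" stands for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: overlapping pairs are counted inside each protein only,
    so no pair spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown residue to "X".
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("need at least one protein with two or more residues")
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


# Hypothetical toy sequences, not the viral proteins.
toy = ["MAAL", "ALAA"]
freqs = dipeptide_permille(toy)

# Uniform expectation over the 400 standard dipeptides: 0.25% = 2.5 per mille.
uniform = 1000.0 / 400
over = sorted(d for d, f in freqs.items() if f > uniform)
print(over)
```

With real data, `proteins` would hold the full protein sequences of the proteome, and the resulting values could be compared with the 2.5‰ expectation in the same way.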

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
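As a small worked example of the unit conversion, the sketch below turns two values from the table (AlaAla and CysCys) into fractions and compares them with the uniform expectation of 2.5‰. The helper names are hypothetical, and treating 2.5‰ as the threshold for over- and underrepresentation is an assumption based on the text above, not a documented rule of the database.

```python
# Uniform expectation for 400 dipeptides, in per mille (0.25%).
UNIFORM_PERMILLE = 1000.0 / 400


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=UNIFORM_PERMILLE):
    """Label a per mille value relative to the assumed uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Values taken from the table; the error estimates are ignored here.
for name, value in [("AlaAla", 6.275), ("CysCys", 0.369)]:
    print(name, permille_to_fraction(value), classify(value))
```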

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency (‰) ± error; rows give the first amino acid, columns the second.

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 6.275 ± 0.601 | 2.215 ± 0.505 | 2.584 ± 0.312 | 4.061 ± 1.203 | 3.691 ± 1.949 | 5.537 ± 0.986 | 0.738 ± 0.385 | 7.752 ± 0.384 | 3.691 ± 0.288 | 5.906 ± 0.314 | 2.215 ± 0.505 | 1.846 ± 0.698 | 1.846 ± 0.409 | 1.107 ± 0.529 | 2.215 ± 0.602 | 7.383 ± 0.023 | 4.43 ± 1.01 | 3.691 ± 0.819 | 1.107 ± 0.529 | 2.584 ± 0.312 | 0.0 ± 0.0
Cys | 1.107 ± 0.578 | 0.369 ± 0.361 | 0.738 ± 0.168 | 1.846 ± 0.409 | 1.107 ± 0.578 | 2.215 ± 0.602 | 0.0 ± 0.0 | 1.107 ± 0.578 | 2.584 ± 1.348 | 1.107 ± 0.529 | 0.0 ± 0.0 | 1.477 ± 0.77 | 1.477 ± 0.217 | 1.477 ± 0.337 | 0.738 ± 0.168 | 0.369 ± 0.193 | 1.846 ± 0.409 | 2.584 ± 0.312 | 0.0 ± 0.0 | 1.107 ± 0.578 | 0.0 ± 0.0
Asp | 3.322 ± 0.073 | 0.0 ± 0.0 | 2.953 ± 0.12 | 3.322 ± 0.073 | 3.691 ± 0.842 | 2.584 ± 0.312 | 0.738 ± 0.385 | 4.799 ± 0.843 | 5.168 ± 0.482 | 7.383 ± 0.53 | 0.738 ± 0.168 | 1.107 ± 0.024 | 3.322 ± 0.481 | 1.846 ± 0.409 | 1.477 ± 0.217 | 3.691 ± 0.819 | 3.322 ± 2.695 | 1.846 ± 0.409 | 0.738 ± 0.168 | 2.215 ± 0.048 | 0.0 ± 0.0
Glu | 3.322 ± 0.481 | 1.107 ± 0.578 | 2.953 ± 0.12 | 2.215 ± 0.048 | 4.061 ± 0.458 | 4.061 ± 0.096 | 0.369 ± 0.361 | 2.584 ± 0.312 | 2.953 ± 0.434 | 7.752 ± 2.383 | 1.107 ± 0.024 | 2.215 ± 0.048 | 1.477 ± 0.217 | 1.846 ± 0.409 | 2.953 ± 0.434 | 5.537 ± 0.986 | 2.215 ± 0.602 | 3.322 ± 0.481 | 1.477 ± 0.77 | 2.953 ± 1.54 | 0.0 ± 0.0
Phe | 2.584 ± 0.312 | 1.107 ± 0.578 | 2.953 ± 0.12 | 2.584 ± 0.241 | 1.477 ± 0.89 | 5.537 ± 0.674 | 1.107 ± 0.024 | 2.953 ± 0.987 | 1.846 ± 0.409 | 5.537 ± 0.432 | 1.477 ± 0.343 | 4.061 ± 1.203 | 0.738 ± 0.168 | 2.215 ± 1.058 | 2.953 ± 1.227 | 2.584 ± 0.241 | 3.322 ± 0.073 | 2.953 ± 0.673 | 1.107 ± 1.083 | 1.477 ± 0.77 | 0.0 ± 0.0
Gly | 3.691 ± 0.842 | 1.846 ± 0.963 | 4.061 ± 0.649 | 3.691 ± 0.265 | 2.584 ± 0.312 | 5.537 ± 1.539 | 1.477 ± 0.337 | 3.691 ± 0.819 | 4.061 ± 1.011 | 4.799 ± 0.264 | 1.107 ± 0.024 | 5.168 ± 0.072 | 2.584 ± 0.866 | 1.846 ± 0.144 | 2.215 ± 1.058 | 5.537 ± 1.539 | 5.537 ± 0.121 | 3.322 ± 0.626 | 0.369 ± 0.193 | 2.953 ± 0.673 | 0.0 ± 0.0
His | 0.369 ± 0.193 | 0.738 ± 0.385 | 1.477 ± 0.77 | 1.107 ± 0.529 | 1.107 ± 0.024 | 1.107 ± 0.024 | 0.369 ± 0.361 | 2.215 ± 0.048 | 2.584 ± 0.794 | 1.107 ± 0.578 | 2.215 ± 1.155 | 0.369 ± 0.361 | 0.0 ± 0.0 | 0.369 ± 0.193 | 0.738 ± 0.168 | 0.369 ± 0.361 | 1.477 ± 0.337 | 1.107 ± 0.529 | 0.369 ± 0.361 | 1.477 ± 0.217 | 0.0 ± 0.0
Ile | 4.061 ± 2.31 | 0.738 ± 0.385 | 6.645 ± 0.145 | 2.953 ± 1.54 | 2.215 ± 0.602 | 4.43 ± 0.097 | 1.477 ± 0.217 | 1.846 ± 0.144 | 1.477 ± 0.217 | 4.799 ± 2.503 | 1.477 ± 0.337 | 2.953 ± 0.673 | 2.215 ± 0.602 | 1.107 ± 0.578 | 2.215 ± 0.505 | 4.799 ± 0.264 | 3.691 ± 0.842 | 2.953 ± 0.987 | 0.369 ± 0.193 | 1.477 ± 0.217 | 0.0 ± 0.0
Lys | 4.799 ± 0.289 | 1.477 ± 0.77 | 1.477 ± 0.77 | 5.537 ± 0.432 | 3.322 ± 0.073 | 3.322 ± 1.18 | 1.846 ± 0.409 | 1.846 ± 0.963 | 3.691 ± 1.372 | 4.43 ± 0.097 | 0.738 ± 0.385 | 1.846 ± 0.409 | 2.215 ± 0.505 | 1.846 ± 0.409 | 3.691 ± 1.372 | 7.383 ± 2.191 | 2.584 ± 0.794 | 2.215 ± 1.058 | 0.738 ± 0.385 | 2.584 ± 0.794 | 0.0 ± 0.0
Leu | 6.275 ± 0.601 | 2.953 ± 1.54 | 5.537 ± 0.674 | 2.953 ± 1.54 | 5.168 ± 0.625 | 2.953 ± 0.12 | 2.215 ± 0.048 | 2.953 ± 0.987 | 5.168 ± 1.035 | 7.383 ± 2.191 | 1.477 ± 0.77 | 4.43 ± 0.097 | 6.645 ± 1.252 | 2.584 ± 0.241 | 6.275 ± 0.047 | 5.537 ± 0.674 | 8.49 ± 1.106 | 6.645 ± 2.359 | 0.738 ± 0.168 | 3.322 ± 0.626 | 0.0 ± 0.0
Met | 3.322 ± 0.626 | 0.738 ± 0.385 | 1.107 ± 0.024 | 0.738 ± 0.385 | 0.738 ± 0.385 | 0.738 ± 0.385 | 2.215 ± 0.048 | 0.369 ± 0.361 | 0.369 ± 0.361 | 1.107 ± 0.024 | 1.107 ± 0.578 | 1.846 ± 0.698 | 1.107 ± 0.529 | 2.584 ± 0.866 | 1.846 ± 0.409 | 0.369 ± 0.193 | 2.215 ± 0.048 | 1.477 ± 0.77 | 0.369 ± 0.193 | 0.369 ± 0.193 | 0.0 ± 0.0
Asn | 2.215 ± 0.505 | 1.107 ± 0.024 | 1.846 ± 0.144 | 2.215 ± 1.058 | 1.477 ± 0.217 | 4.799 ± 0.264 | 0.369 ± 0.193 | 2.584 ± 0.794 | 2.215 ± 0.505 | 3.691 ± 1.926 | 2.584 ± 1.419 | 1.477 ± 0.217 | 2.215 ± 1.058 | 1.477 ± 0.217 | 1.846 ± 0.409 | 2.215 ± 0.505 | 4.43 ± 0.457 | 4.061 ± 0.096 | 0.738 ± 0.385 | 1.846 ± 0.409 | 0.0 ± 0.0
Pro | 3.322 ± 0.073 | 2.584 ± 0.312 | 2.953 ± 0.673 | 3.691 ± 1.926 | 4.061 ± 0.649 | 1.107 ± 0.024 | 1.477 ± 0.217 | 2.953 ± 0.12 | 1.477 ± 0.77 | 6.275 ± 0.047 | 0.738 ± 0.168 | 0.369 ± 0.193 | 2.215 ± 1.058 | 2.215 ± 1.612 | 1.477 ± 0.217 | 1.477 ± 0.337 | 2.584 ± 0.312 | 2.215 ± 0.505 | 0.369 ± 0.361 | 2.584 ± 0.312 | 0.0 ± 0.0
Gln | 1.846 ± 0.698 | 1.107 ± 0.024 | 2.215 ± 0.505 | 2.215 ± 0.505 | 1.107 ± 0.578 | 1.107 ± 0.024 | 1.107 ± 0.024 | 0.369 ± 0.361 | 1.107 ± 0.024 | 4.43 ± 0.65 | 1.477 ± 0.217 | 0.369 ± 0.193 | 1.846 ± 0.144 | 0.0 ± 0.0 | 2.584 ± 0.241 | 4.061 ± 1.756 | 2.584 ± 0.241 | 2.215 ± 0.505 | 0.0 ± 0.0 | 0.369 ± 0.361 | 0.0 ± 0.0
Arg | 4.061 ± 0.096 | 2.215 ± 0.048 | 2.584 ± 0.312 | 3.691 ± 1.372 | 3.322 ± 0.073 | 2.215 ± 1.058 | 1.107 ± 0.024 | 3.322 ± 0.626 | 2.215 ± 1.155 | 1.477 ± 0.217 | 1.477 ± 0.337 | 1.846 ± 0.409 | 3.691 ± 0.842 | 1.107 ± 0.024 | 2.953 ± 0.434 | 2.215 ± 0.602 | 1.477 ± 0.337 | 3.322 ± 0.481 | 0.369 ± 0.361 | 3.322 ± 0.481 | 0.0 ± 0.0
Ser | 5.168 ± 0.625 | 0.738 ± 0.385 | 5.168 ± 0.482 | 5.537 ± 0.121 | 1.846 ± 0.698 | 7.383 ± 3.344 | 1.846 ± 0.144 | 4.43 ± 0.097 | 4.061 ± 1.011 | 5.537 ± 1.228 | 1.477 ± 0.419 | 5.168 ± 0.072 | 3.322 ± 0.626 | 2.584 ± 0.241 | 3.691 ± 0.842 | 5.168 ± 1.732 | 4.799 ± 1.371 | 5.537 ± 0.986 | 1.107 ± 0.578 | 1.846 ± 0.698 | 0.0 ± 0.0
Thr | 3.691 ± 1.395 | 0.738 ± 0.722 | 1.477 ± 0.217 | 1.477 ± 0.337 | 3.322 ± 0.481 | 4.799 ± 0.843 | 0.738 ± 0.385 | 4.43 ± 2.67 | 5.537 ± 1.228 | 5.906 ± 0.314 | 1.477 ± 0.217 | 4.799 ± 0.289 | 4.799 ± 0.264 | 1.846 ± 0.144 | 1.846 ± 0.144 | 7.383 ± 1.683 | 7.383 ± 4.451 | 5.168 ± 0.482 | 0.738 ± 0.168 | 2.215 ± 1.058 | 0.0 ± 0.0
Val | 7.383 ± 0.577 | 1.107 ± 0.578 | 3.322 ± 1.588 | 3.691 ± 1.372 | 3.322 ± 0.073 | 3.691 ± 0.842 | 1.107 ± 0.024 | 1.477 ± 0.217 | 4.061 ± 0.458 | 7.014 ± 0.769 | 1.107 ± 0.578 | 1.477 ± 0.217 | 2.584 ± 0.312 | 2.215 ± 0.048 | 2.584 ± 0.794 | 5.168 ± 0.625 | 4.061 ± 0.096 | 2.584 ± 0.866 | 1.107 ± 0.024 | 2.953 ± 0.673 | 0.0 ± 0.0
Trp | 0.369 ± 0.193 | 0.369 ± 0.361 | 1.107 ± 0.578 | 0.369 ± 0.361 | 0.738 ± 0.385 | 0.369 ± 0.193 | 0.369 ± 0.193 | 0.369 ± 0.193 | 0.369 ± 0.193 | 1.107 ± 0.024 | 0.0 ± 0.0 | 0.738 ± 0.168 | 0.0 ± 0.0 | 1.107 ± 0.529 | 0.738 ± 0.168 | 1.477 ± 0.337 | 0.369 ± 0.193 | 1.477 ± 0.89 | 0.369 ± 0.193 | 0.738 ± 0.168 | 0.0 ± 0.0
Tyr | 3.691 ± 0.288 | 0.0 ± 0.0 | 1.107 ± 0.578 | 2.584 ± 0.241 | 2.584 ± 0.241 | 2.584 ± 0.241 | 0.0 ± 0.0 | 1.846 ± 0.409 | 2.953 ± 0.12 | 2.953 ± 0.434 | 0.369 ± 0.193 | 1.846 ± 0.144 | 1.846 ± 0.144 | 0.738 ± 0.168 | 2.953 ± 0.12 | 3.691 ± 0.288 | 2.953 ± 0.987 | 3.322 ± 1.034 | 0.369 ± 0.361 | 1.107 ± 0.024 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 2 proteins (2710 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
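A protein-level bootstrap can be sketched as follows. This is a minimal illustration under assumptions, not the Proteome-pI implementation: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the error is taken to be the standard deviation across resamples. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled.

    Assumption: the reported error is the standard deviation of the
    estimates obtained from the resampled protein sets.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.pstdev(estimates)


# Hypothetical toy sequences, not the viral proteins.
toy = ["MAAL", "GGGG"]
print(dipeptide_permille(toy, "AA"), bootstrap_error(toy, "AA"))
```

With only two proteins, as in this proteome, each resample contains one of just three distinct protein combinations, so the resulting error estimates are coarse.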

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski