Amino acid dipepetide frequency for Beihai picorna-like virus 83

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.916AlaAla: 3.916 ± 0.204
2.136AlaCys: 2.136 ± 1.124
1.78AlaAsp: 1.78 ± 0.66
3.916AlaGlu: 3.916 ± 3.739
4.628AlaPhe: 4.628 ± 0.76
4.272AlaGly: 4.272 ± 0.389
1.424AlaHis: 1.424 ± 0.318
5.34AlaIle: 5.34 ± 0.742
3.56AlaLys: 3.56 ± 0.876
4.984AlaLeu: 4.984 ± 1.476
1.068AlaMet: 1.068 ± 0.562
2.492AlaAsn: 2.492 ± 1.037
3.204AlaPro: 3.204 ± 0.968
3.916AlaGln: 3.916 ± 1.662
2.492AlaArg: 2.492 ± 0.551
4.272AlaSer: 4.272 ± 0.7
6.764AlaThr: 6.764 ± 3.108
3.916AlaVal: 3.916 ± 2.061
0.712AlaTrp: 0.712 ± 0.383
2.136AlaTyr: 2.136 ± 0.58
0.0AlaXaa: 0.0 ± 0.0
Cys
2.492CysAla: 2.492 ± 0.934
1.78CysCys: 1.78 ± 0.937
2.848CysAsp: 2.848 ± 0.923
1.068CysGlu: 1.068 ± 0.562
2.136CysPhe: 2.136 ± 1.124
1.78CysGly: 1.78 ± 0.937
0.0CysHis: 0.0 ± 0.0
0.356CysIle: 0.356 ± 0.187
3.56CysLys: 3.56 ± 1.873
1.424CysLeu: 1.424 ± 0.749
1.068CysMet: 1.068 ± 0.562
0.712CysAsn: 0.712 ± 0.375
1.424CysPro: 1.424 ± 0.637
0.356CysGln: 0.356 ± 0.187
1.78CysArg: 1.78 ± 0.937
2.136CysSer: 2.136 ± 1.124
0.0CysThr: 0.0 ± 0.0
2.492CysVal: 2.492 ± 1.311
0.0CysTrp: 0.0 ± 0.0
0.712CysTyr: 0.712 ± 0.375
0.0CysXaa: 0.0 ± 0.0
Asp
3.204AspAla: 3.204 ± 0.731
2.136AspCys: 2.136 ± 1.124
3.56AspAsp: 3.56 ± 0.716
7.12AspGlu: 7.12 ± 0.837
3.916AspPhe: 3.916 ± 0.887
3.56AspGly: 3.56 ± 1.284
1.068AspHis: 1.068 ± 0.625
1.424AspIle: 1.424 ± 1.112
3.204AspLys: 3.204 ± 1.686
2.848AspLeu: 2.848 ± 0.95
1.424AspMet: 1.424 ± 0.766
2.848AspAsn: 2.848 ± 1.499
3.56AspPro: 3.56 ± 1.685
2.492AspGln: 2.492 ± 0.587
1.424AspArg: 1.424 ± 0.318
2.848AspSer: 2.848 ± 1.016
1.78AspThr: 1.78 ± 0.937
4.984AspVal: 4.984 ± 0.728
1.068AspTrp: 1.068 ± 0.562
3.204AspTyr: 3.204 ± 1.239
0.0AspXaa: 0.0 ± 0.0
Glu
4.272GluAla: 4.272 ± 0.955
1.068GluCys: 1.068 ± 0.625
4.628GluAsp: 4.628 ± 1.609
6.052GluGlu: 6.052 ± 1.836
2.136GluPhe: 2.136 ± 0.596
2.492GluGly: 2.492 ± 0.587
0.712GluHis: 0.712 ± 0.668
4.272GluIle: 4.272 ± 1.16
5.34GluLys: 5.34 ± 2.207
6.408GluLeu: 6.408 ± 1.494
1.424GluMet: 1.424 ± 0.749
2.492GluAsn: 2.492 ± 0.587
2.492GluPro: 2.492 ± 0.587
2.848GluGln: 2.848 ± 0.421
3.916GluArg: 3.916 ± 1.467
3.56GluSer: 3.56 ± 3.217
3.204GluThr: 3.204 ± 1.521
4.272GluVal: 4.272 ± 1.032
1.068GluTrp: 1.068 ± 0.899
3.204GluTyr: 3.204 ± 0.894
0.0GluXaa: 0.0 ± 0.0
Phe
1.424PheAla: 1.424 ± 0.59
1.424PheCys: 1.424 ± 0.749
2.848PheAsp: 2.848 ± 1.081
6.764PheGlu: 6.764 ± 1.583
2.492PhePhe: 2.492 ± 0.748
5.34PheGly: 5.34 ± 0.915
0.712PheHis: 0.712 ± 0.375
1.424PheIle: 1.424 ± 0.59
3.56PheLys: 3.56 ± 0.716
2.848PheLeu: 2.848 ± 0.923
0.356PheMet: 0.356 ± 0.524
2.136PheAsn: 2.136 ± 0.596
2.492PhePro: 2.492 ± 0.748
2.136PheGln: 2.136 ± 0.596
2.136PheArg: 2.136 ± 0.738
5.34PheSer: 5.34 ± 0.469
2.848PheThr: 2.848 ± 0.421
5.34PheVal: 5.34 ± 0.413
0.0PheTrp: 0.0 ± 0.0
2.136PheTyr: 2.136 ± 0.35
0.0PheXaa: 0.0 ± 0.0
Gly
3.204GlyAla: 3.204 ± 2.606
1.068GlyCys: 1.068 ± 0.562
4.272GlyAsp: 4.272 ± 2.298
2.848GlyGlu: 2.848 ± 0.95
3.56GlyPhe: 3.56 ± 0.857
4.628GlyGly: 4.628 ± 0.76
1.068GlyHis: 1.068 ± 0.298
3.916GlyIle: 3.916 ± 1.467
7.832GlyLys: 7.832 ± 1.073
6.764GlyLeu: 6.764 ± 2.004
2.492GlyMet: 2.492 ± 0.587
2.848GlyAsn: 2.848 ± 0.636
2.848GlyPro: 2.848 ± 0.636
2.492GlyGln: 2.492 ± 1.199
0.712GlyArg: 0.712 ± 0.668
5.696GlySer: 5.696 ± 3.212
5.34GlyThr: 5.34 ± 2.218
3.56GlyVal: 3.56 ± 0.895
0.356GlyTrp: 0.356 ± 0.187
1.78GlyTyr: 1.78 ± 0.429
0.0GlyXaa: 0.0 ± 0.0
His
0.356HisAla: 0.356 ± 0.187
0.0HisCys: 0.0 ± 0.0
2.136HisAsp: 2.136 ± 0.596
0.712HisGlu: 0.712 ± 0.668
2.136HisPhe: 2.136 ± 0.804
1.78HisGly: 1.78 ± 0.701
1.068HisHis: 1.068 ± 0.625
1.068HisIle: 1.068 ± 0.753
1.068HisLys: 1.068 ± 0.625
2.492HisLeu: 2.492 ± 1.311
0.712HisMet: 0.712 ± 0.504
0.356HisAsn: 0.356 ± 0.187
0.712HisPro: 0.712 ± 0.383
1.78HisGln: 1.78 ± 0.429
0.0HisArg: 0.0 ± 0.0
0.712HisSer: 0.712 ± 0.383
0.712HisThr: 0.712 ± 0.668
2.136HisVal: 2.136 ± 1.124
0.712HisTrp: 0.712 ± 0.375
0.356HisTyr: 0.356 ± 0.187
0.0HisXaa: 0.0 ± 0.0
Ile
4.628IleAla: 4.628 ± 1.242
0.712IleCys: 0.712 ± 0.375
4.628IleAsp: 4.628 ± 1.924
3.916IleGlu: 3.916 ± 0.492
1.78IlePhe: 1.78 ± 0.66
2.136IleGly: 2.136 ± 1.149
0.712IleHis: 0.712 ± 0.375
3.204IleIle: 3.204 ± 1.239
2.492IleLys: 2.492 ± 1.311
3.204IleLeu: 3.204 ± 1.031
2.492IleMet: 2.492 ± 0.917
2.848IleAsn: 2.848 ± 0.364
2.492IlePro: 2.492 ± 0.339
1.068IleGln: 1.068 ± 0.298
5.34IleArg: 5.34 ± 2.207
3.204IleSer: 3.204 ± 0.731
3.916IleThr: 3.916 ± 0.887
2.492IleVal: 2.492 ± 0.551
0.356IleTrp: 0.356 ± 0.187
2.136IleTyr: 2.136 ± 0.596
0.0IleXaa: 0.0 ± 0.0
Lys
5.34LysAla: 5.34 ± 1.67
3.916LysCys: 3.916 ± 1.467
5.34LysAsp: 5.34 ± 2.81
3.916LysGlu: 3.916 ± 1.467
3.204LysPhe: 3.204 ± 0.836
3.916LysGly: 3.916 ± 1.179
1.424LysHis: 1.424 ± 1.336
6.764LysIle: 6.764 ± 2.387
7.12LysLys: 7.12 ± 1.887
4.628LysLeu: 4.628 ± 1.043
3.204LysMet: 3.204 ± 1.239
4.628LysAsn: 4.628 ± 0.576
2.492LysPro: 2.492 ± 0.339
3.204LysGln: 3.204 ± 0.731
2.492LysArg: 2.492 ± 0.587
1.424LysSer: 1.424 ± 0.637
5.34LysThr: 5.34 ± 1.286
3.56LysVal: 3.56 ± 0.66
0.356LysTrp: 0.356 ± 0.187
2.848LysTyr: 2.848 ± 1.499
0.0LysXaa: 0.0 ± 0.0
Leu
7.476LeuAla: 7.476 ± 2.277
0.712LeuCys: 0.712 ± 0.375
3.56LeuAsp: 3.56 ± 0.954
2.848LeuGlu: 2.848 ± 0.95
3.56LeuPhe: 3.56 ± 1.85
5.34LeuGly: 5.34 ± 1.981
2.136LeuHis: 2.136 ± 1.25
4.272LeuIle: 4.272 ± 1.651
4.272LeuLys: 4.272 ± 1.747
4.628LeuLeu: 4.628 ± 0.663
2.492LeuMet: 2.492 ± 1.875
1.068LeuAsn: 1.068 ± 0.562
3.204LeuPro: 3.204 ± 0.836
4.272LeuGln: 4.272 ± 2.298
1.78LeuArg: 1.78 ± 0.429
3.56LeuSer: 3.56 ± 0.857
5.34LeuThr: 5.34 ± 0.742
3.204LeuVal: 3.204 ± 1.239
0.712LeuTrp: 0.712 ± 0.383
2.492LeuTyr: 2.492 ± 0.551
0.0LeuXaa: 0.0 ± 0.0
Met
3.56MetAla: 3.56 ± 0.857
2.136MetCys: 2.136 ± 1.124
0.356MetAsp: 0.356 ± 0.187
2.492MetGlu: 2.492 ± 0.748
0.712MetPhe: 0.712 ± 0.383
1.78MetGly: 1.78 ± 1.28
1.424MetHis: 1.424 ± 0.318
1.424MetIle: 1.424 ± 0.637
1.068MetLys: 1.068 ± 0.562
1.78MetLeu: 1.78 ± 0.429
0.712MetMet: 0.712 ± 0.925
2.848MetAsn: 2.848 ± 1.18
1.068MetPro: 1.068 ± 0.562
2.136MetGln: 2.136 ± 2.072
1.78MetArg: 1.78 ± 0.937
1.068MetSer: 1.068 ± 0.298
1.78MetThr: 1.78 ± 0.66
0.712MetVal: 0.712 ± 0.375
0.356MetTrp: 0.356 ± 0.187
1.78MetTyr: 1.78 ± 1.323
0.0MetXaa: 0.0 ± 0.0
Asn
4.984AsnAla: 4.984 ± 1.584
1.424AsnCys: 1.424 ± 0.749
1.78AsnAsp: 1.78 ± 0.66
3.56AsnGlu: 3.56 ± 0.876
2.492AsnPhe: 2.492 ± 1.311
3.56AsnGly: 3.56 ± 0.041
2.136AsnHis: 2.136 ± 1.124
1.78AsnIle: 1.78 ± 0.937
1.78AsnLys: 1.78 ± 0.429
1.068AsnLeu: 1.068 ± 0.753
0.712AsnMet: 0.712 ± 0.668
2.136AsnAsn: 2.136 ± 0.738
3.204AsnPro: 3.204 ± 0.179
2.492AsnGln: 2.492 ± 1.199
2.848AsnArg: 2.848 ± 0.636
3.56AsnSer: 3.56 ± 1.28
1.068AsnThr: 1.068 ± 1.414
2.848AsnVal: 2.848 ± 1.851
0.712AsnTrp: 0.712 ± 0.383
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
3.916ProAla: 3.916 ± 1.866
1.424ProCys: 1.424 ± 0.749
2.492ProAsp: 2.492 ± 0.339
2.848ProGlu: 2.848 ± 0.364
3.204ProPhe: 3.204 ± 1.239
4.628ProGly: 4.628 ± 1.288
0.712ProHis: 0.712 ± 0.375
2.492ProIle: 2.492 ± 0.587
4.628ProLys: 4.628 ± 1.043
2.848ProLeu: 2.848 ± 1.18
1.78ProMet: 1.78 ± 1.323
1.424ProAsn: 1.424 ± 0.318
2.136ProPro: 2.136 ± 1.383
1.068ProGln: 1.068 ± 0.899
2.848ProArg: 2.848 ± 0.95
3.56ProSer: 3.56 ± 0.716
2.492ProThr: 2.492 ± 0.339
3.56ProVal: 3.56 ± 0.041
1.068ProTrp: 1.068 ± 0.625
1.424ProTyr: 1.424 ± 0.318
0.0ProXaa: 0.0 ± 0.0
Gln
3.916GlnAla: 3.916 ± 2.553
1.068GlnCys: 1.068 ± 0.562
0.356GlnAsp: 0.356 ± 0.187
1.78GlnGlu: 1.78 ± 0.66
1.068GlnPhe: 1.068 ± 0.899
3.204GlnGly: 3.204 ± 1.103
1.78GlnHis: 1.78 ± 0.448
1.068GlnIle: 1.068 ± 0.298
3.916GlnLys: 3.916 ± 1.244
3.204GlnLeu: 3.204 ± 1.418
2.136GlnMet: 2.136 ± 0.35
2.136GlnAsn: 2.136 ± 0.804
3.204GlnPro: 3.204 ± 1.521
2.136GlnGln: 2.136 ± 0.58
1.424GlnArg: 1.424 ± 1.112
1.424GlnSer: 1.424 ± 0.318
2.848GlnThr: 2.848 ± 2.046
1.78GlnVal: 1.78 ± 1.323
1.068GlnTrp: 1.068 ± 0.298
2.136GlnTyr: 2.136 ± 1.149
0.0GlnXaa: 0.0 ± 0.0
Arg
1.424ArgAla: 1.424 ± 0.318
1.068ArgCys: 1.068 ± 0.562
0.0ArgAsp: 0.0 ± 0.0
2.492ArgGlu: 2.492 ± 0.587
1.068ArgPhe: 1.068 ± 0.298
1.424ArgGly: 1.424 ± 0.766
0.356ArgHis: 0.356 ± 0.187
3.56ArgIle: 3.56 ± 0.716
6.764ArgLys: 6.764 ± 2.345
2.136ArgLeu: 2.136 ± 0.58
1.068ArgMet: 1.068 ± 0.562
0.712ArgAsn: 0.712 ± 0.375
4.272ArgPro: 4.272 ± 0.389
2.492ArgGln: 2.492 ± 1.248
2.136ArgArg: 2.136 ± 0.58
4.272ArgSer: 4.272 ± 0.345
3.56ArgThr: 3.56 ± 0.66
1.78ArgVal: 1.78 ± 0.429
0.0ArgTrp: 0.0 ± 0.0
0.356ArgTyr: 0.356 ± 0.524
0.0ArgXaa: 0.0 ± 0.0
Ser
4.628SerAla: 4.628 ± 1.92
1.424SerCys: 1.424 ± 0.749
4.272SerAsp: 4.272 ± 0.7
2.848SerGlu: 2.848 ± 0.364
3.916SerPhe: 3.916 ± 1.467
5.34SerGly: 5.34 ± 1.68
1.068SerHis: 1.068 ± 0.298
3.56SerIle: 3.56 ± 2.035
5.34SerLys: 5.34 ± 1.91
4.984SerLeu: 4.984 ± 0.678
0.712SerMet: 0.712 ± 0.668
2.848SerAsn: 2.848 ± 1.016
2.136SerPro: 2.136 ± 0.58
2.136SerGln: 2.136 ± 0.738
1.424SerArg: 1.424 ± 1.112
3.916SerSer: 3.916 ± 2.949
3.916SerThr: 3.916 ± 0.782
3.916SerVal: 3.916 ± 0.976
0.712SerTrp: 0.712 ± 0.383
1.424SerTyr: 1.424 ± 0.318
0.0SerXaa: 0.0 ± 0.0
Thr
3.204ThrAla: 3.204 ± 1.031
1.424ThrCys: 1.424 ± 0.637
3.204ThrAsp: 3.204 ± 0.968
3.204ThrGlu: 3.204 ± 0.731
3.204ThrPhe: 3.204 ± 1.031
4.272ThrGly: 4.272 ± 2.742
1.424ThrHis: 1.424 ± 0.318
3.204ThrIle: 3.204 ± 0.557
2.136ThrLys: 2.136 ± 1.124
2.848ThrLeu: 2.848 ± 0.421
2.136ThrMet: 2.136 ± 0.35
3.56ThrAsn: 3.56 ± 3.339
4.628ThrPro: 4.628 ± 2.223
2.136ThrGln: 2.136 ± 1.797
3.56ThrArg: 3.56 ± 1.402
2.136ThrSer: 2.136 ± 0.804
4.272ThrThr: 4.272 ± 2.353
5.696ThrVal: 5.696 ± 1.225
0.712ThrTrp: 0.712 ± 0.668
4.628ThrTyr: 4.628 ± 1.288
0.0ThrXaa: 0.0 ± 0.0
Val
3.56ValAla: 3.56 ± 0.857
2.136ValCys: 2.136 ± 1.124
6.764ValAsp: 6.764 ± 1.699
2.492ValGlu: 2.492 ± 0.587
4.272ValPhe: 4.272 ± 1.032
4.272ValGly: 4.272 ± 0.955
1.068ValHis: 1.068 ± 0.562
2.848ValIle: 2.848 ± 1.499
3.56ValLys: 3.56 ± 0.041
5.696ValLeu: 5.696 ± 1.225
2.136ValMet: 2.136 ± 0.804
3.916ValAsn: 3.916 ± 2.52
2.848ValPro: 2.848 ± 0.421
1.068ValGln: 1.068 ± 0.298
2.136ValArg: 2.136 ± 1.124
4.984ValSer: 4.984 ± 2.679
2.848ValThr: 2.848 ± 1.901
5.696ValVal: 5.696 ± 3.689
0.356ValTrp: 0.356 ± 0.524
2.848ValTyr: 2.848 ± 0.364
0.0ValXaa: 0.0 ± 0.0
Trp
1.068TrpAla: 1.068 ± 0.753
0.356TrpCys: 0.356 ± 0.187
0.712TrpAsp: 0.712 ± 0.383
1.424TrpGlu: 1.424 ± 0.749
1.424TrpPhe: 1.424 ± 0.318
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.712TrpIle: 0.712 ± 0.375
1.068TrpLys: 1.068 ± 0.562
0.712TrpLeu: 0.712 ± 1.048
0.356TrpMet: 0.356 ± 0.187
0.0TrpAsn: 0.0 ± 0.0
0.712TrpPro: 0.712 ± 0.383
0.356TrpGln: 0.356 ± 0.524
0.356TrpArg: 0.356 ± 0.187
0.712TrpSer: 0.712 ± 0.668
0.356TrpThr: 0.356 ± 0.187
0.356TrpVal: 0.356 ± 0.524
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.356TyrAla: 0.356 ± 0.524
0.712TyrCys: 0.712 ± 0.375
2.136TyrAsp: 2.136 ± 0.58
3.56TyrGlu: 3.56 ± 0.954
2.848TyrPhe: 2.848 ± 0.636
3.56TyrGly: 3.56 ± 2.091
0.712TyrHis: 0.712 ± 0.375
1.068TyrIle: 1.068 ± 0.899
2.848TyrLys: 2.848 ± 1.532
1.424TyrLeu: 1.424 ± 0.318
2.136TyrMet: 2.136 ± 1.251
2.136TyrAsn: 2.136 ± 0.58
1.424TyrPro: 1.424 ± 0.318
1.068TyrGln: 1.068 ± 0.562
0.356TyrArg: 0.356 ± 0.187
2.136TyrSer: 2.136 ± 0.35
3.204TyrThr: 3.204 ± 1.521
3.204TyrVal: 3.204 ± 0.557
0.356TyrTrp: 0.356 ± 0.187
1.78TyrTyr: 1.78 ± 0.66
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2810 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski