Amino acid dipepetide frequency for Beihai picorna-like virus 26

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.401AlaAla: 6.401 ± 0.202
1.13AlaCys: 1.13 ± 0.109
3.389AlaAsp: 3.389 ± 1.562
4.142AlaGlu: 4.142 ± 0.421
4.142AlaPhe: 4.142 ± 0.196
5.648AlaGly: 5.648 ± 0.069
0.753AlaHis: 0.753 ± 0.133
2.636AlaIle: 2.636 ± 1.078
6.401AlaLys: 6.401 ± 1.648
5.271AlaLeu: 5.271 ± 0.311
3.389AlaMet: 3.389 ± 0.945
4.895AlaAsn: 4.895 ± 0.063
4.518AlaPro: 4.518 ± 0.795
4.142AlaGln: 4.142 ± 1.038
3.765AlaArg: 3.765 ± 1.896
4.518AlaSer: 4.518 ± 2.029
5.648AlaThr: 5.648 ± 1.303
4.518AlaVal: 4.518 ± 0.438
1.506AlaTrp: 1.506 ± 0.882
4.895AlaTyr: 4.895 ± 0.68
0.0AlaXaa: 0.0 ± 0.0
Cys
1.506CysAla: 1.506 ± 0.968
0.0CysCys: 0.0 ± 0.0
0.753CysAsp: 0.753 ± 0.133
1.13CysGlu: 1.13 ± 0.726
0.753CysPhe: 0.753 ± 0.133
0.377CysGly: 0.377 ± 0.242
0.753CysHis: 0.753 ± 0.484
1.13CysIle: 1.13 ± 0.109
1.13CysLys: 1.13 ± 0.507
0.753CysLeu: 0.753 ± 0.484
0.753CysMet: 0.753 ± 0.484
0.753CysAsn: 0.753 ± 0.133
0.753CysPro: 0.753 ± 0.133
0.377CysGln: 0.377 ± 0.242
1.13CysArg: 1.13 ± 0.109
1.506CysSer: 1.506 ± 0.265
0.377CysThr: 0.377 ± 0.242
0.377CysVal: 0.377 ± 0.242
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.895AspAla: 4.895 ± 1.297
0.0AspCys: 0.0 ± 0.0
5.271AspAsp: 5.271 ± 1.539
2.636AspGlu: 2.636 ± 0.156
3.765AspPhe: 3.765 ± 1.28
1.506AspGly: 1.506 ± 0.968
2.259AspHis: 2.259 ± 0.836
3.765AspIle: 3.765 ± 1.28
1.506AspLys: 1.506 ± 0.968
4.895AspLeu: 4.895 ± 0.553
1.506AspMet: 1.506 ± 0.352
3.012AspAsn: 3.012 ± 1.32
1.883AspPro: 1.883 ± 1.256
3.389AspGln: 3.389 ± 0.328
1.506AspArg: 1.506 ± 0.968
6.401AspSer: 6.401 ± 0.819
4.142AspThr: 4.142 ± 1.038
3.389AspVal: 3.389 ± 0.288
0.753AspTrp: 0.753 ± 0.484
2.259AspTyr: 2.259 ± 0.836
0.0AspXaa: 0.0 ± 0.0
Glu
4.142GluAla: 4.142 ± 0.813
2.259GluCys: 2.259 ± 0.398
3.765GluAsp: 3.765 ± 1.896
4.142GluGlu: 4.142 ± 0.813
1.883GluPhe: 1.883 ± 0.594
2.636GluGly: 2.636 ± 0.156
1.13GluHis: 1.13 ± 0.726
1.506GluIle: 1.506 ± 0.265
3.389GluLys: 3.389 ± 2.178
6.401GluLeu: 6.401 ± 3.285
0.377GluMet: 0.377 ± 0.242
3.012GluAsn: 3.012 ± 0.086
3.012GluPro: 3.012 ± 0.086
1.13GluGln: 1.13 ± 0.109
2.259GluArg: 2.259 ± 1.452
5.271GluSer: 5.271 ± 0.311
1.506GluThr: 1.506 ± 0.352
7.154GluVal: 7.154 ± 1.568
1.506GluTrp: 1.506 ± 0.352
3.012GluTyr: 3.012 ± 1.32
0.0GluXaa: 0.0 ± 0.0
Phe
2.259PheAla: 2.259 ± 0.398
0.377PheCys: 0.377 ± 0.242
4.142PheAsp: 4.142 ± 0.196
2.636PheGlu: 2.636 ± 0.156
1.506PhePhe: 1.506 ± 0.968
4.142PheGly: 4.142 ± 0.421
1.883PheHis: 1.883 ± 0.023
2.259PheIle: 2.259 ± 0.219
1.506PheLys: 1.506 ± 0.968
3.389PheLeu: 3.389 ± 0.288
0.753PheMet: 0.753 ± 0.133
1.13PheAsn: 1.13 ± 0.507
0.753PhePro: 0.753 ± 0.484
1.883PheGln: 1.883 ± 1.21
1.506PheArg: 1.506 ± 0.882
3.389PheSer: 3.389 ± 0.945
3.389PheThr: 3.389 ± 1.522
3.765PheVal: 3.765 ± 0.57
0.0PheTrp: 0.0 ± 0.0
1.883PheTyr: 1.883 ± 0.023
0.0PheXaa: 0.0 ± 0.0
Gly
3.765GlyAla: 3.765 ± 0.046
0.753GlyCys: 0.753 ± 0.133
4.895GlyAsp: 4.895 ± 1.297
4.895GlyGlu: 4.895 ± 1.787
2.636GlyPhe: 2.636 ± 0.772
3.012GlyGly: 3.012 ± 1.147
0.377GlyHis: 0.377 ± 0.242
3.012GlyIle: 3.012 ± 0.703
3.765GlyLys: 3.765 ± 0.57
5.271GlyLeu: 5.271 ± 0.311
1.883GlyMet: 1.883 ± 0.64
1.883GlyAsn: 1.883 ± 0.023
4.142GlyPro: 4.142 ± 1.038
2.259GlyGln: 2.259 ± 0.219
2.259GlyArg: 2.259 ± 0.398
4.518GlySer: 4.518 ± 1.412
5.271GlyThr: 5.271 ± 0.311
3.012GlyVal: 3.012 ± 1.147
0.377GlyTrp: 0.377 ± 0.242
0.753GlyTyr: 0.753 ± 0.133
0.0GlyXaa: 0.0 ± 0.0
His
1.506HisAla: 1.506 ± 0.968
0.0HisCys: 0.0 ± 0.0
0.377HisAsp: 0.377 ± 0.242
1.506HisGlu: 1.506 ± 0.968
0.753HisPhe: 0.753 ± 0.749
2.636HisGly: 2.636 ± 1.078
0.753HisHis: 0.753 ± 0.484
2.259HisIle: 2.259 ± 1.452
1.883HisLys: 1.883 ± 0.023
2.636HisLeu: 2.636 ± 1.078
0.377HisMet: 0.377 ± 0.242
1.506HisAsn: 1.506 ± 0.882
2.259HisPro: 2.259 ± 1.452
1.13HisGln: 1.13 ± 0.109
0.377HisArg: 0.377 ± 0.375
1.506HisSer: 1.506 ± 0.352
2.636HisThr: 2.636 ± 0.772
0.377HisVal: 0.377 ± 0.375
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
6.024IleAla: 6.024 ± 0.789
1.506IleCys: 1.506 ± 0.352
3.389IleAsp: 3.389 ± 0.328
4.518IleGlu: 4.518 ± 1.412
1.883IlePhe: 1.883 ± 0.594
2.636IleGly: 2.636 ± 0.772
1.13IleHis: 1.13 ± 0.109
3.012IleIle: 3.012 ± 0.53
5.271IleLys: 5.271 ± 2.772
3.765IleLeu: 3.765 ± 1.187
1.13IleMet: 1.13 ± 0.726
3.012IleAsn: 3.012 ± 1.147
1.13IlePro: 1.13 ± 0.109
0.377IleGln: 0.377 ± 0.242
3.765IleArg: 3.765 ± 0.57
5.271IleSer: 5.271 ± 0.922
4.518IleThr: 4.518 ± 1.412
3.765IleVal: 3.765 ± 0.57
0.0IleTrp: 0.0 ± 0.0
2.259IleTyr: 2.259 ± 0.836
0.0IleXaa: 0.0 ± 0.0
Lys
3.389LysAla: 3.389 ± 0.945
0.753LysCys: 0.753 ± 0.484
4.895LysAsp: 4.895 ± 1.913
3.389LysGlu: 3.389 ± 1.562
1.13LysPhe: 1.13 ± 0.726
2.259LysGly: 2.259 ± 0.219
1.883LysHis: 1.883 ± 0.594
1.883LysIle: 1.883 ± 0.64
3.765LysLys: 3.765 ± 0.57
6.024LysLeu: 6.024 ± 1.406
1.506LysMet: 1.506 ± 0.265
1.13LysAsn: 1.13 ± 0.726
1.883LysPro: 1.883 ± 0.594
2.259LysGln: 2.259 ± 0.219
3.389LysArg: 3.389 ± 0.328
5.271LysSer: 5.271 ± 3.389
1.883LysThr: 1.883 ± 1.21
4.518LysVal: 4.518 ± 1.671
0.377LysTrp: 0.377 ± 0.242
2.259LysTyr: 2.259 ± 0.836
0.0LysXaa: 0.0 ± 0.0
Leu
7.154LeuAla: 7.154 ± 4.035
0.753LeuCys: 0.753 ± 0.484
4.518LeuAsp: 4.518 ± 0.438
4.895LeuGlu: 4.895 ± 0.063
2.636LeuPhe: 2.636 ± 0.461
3.765LeuGly: 3.765 ± 1.28
3.389LeuHis: 3.389 ± 0.328
6.401LeuIle: 6.401 ± 2.882
3.765LeuLys: 3.765 ± 1.804
5.648LeuLeu: 5.648 ± 0.069
2.259LeuMet: 2.259 ± 0.153
4.895LeuAsn: 4.895 ± 1.17
3.765LeuPro: 3.765 ± 0.663
3.012LeuGln: 3.012 ± 0.086
4.895LeuArg: 4.895 ± 1.787
5.648LeuSer: 5.648 ± 1.303
6.401LeuThr: 6.401 ± 1.435
4.895LeuVal: 4.895 ± 0.68
0.753LeuTrp: 0.753 ± 0.484
1.506LeuTyr: 1.506 ± 0.968
0.0LeuXaa: 0.0 ± 0.0
Met
3.012MetAla: 3.012 ± 1.32
0.753MetCys: 0.753 ± 0.484
0.753MetAsp: 0.753 ± 0.133
1.13MetGlu: 1.13 ± 0.726
1.883MetPhe: 1.883 ± 1.21
1.883MetGly: 1.883 ± 0.64
0.377MetHis: 0.377 ± 0.375
1.13MetIle: 1.13 ± 0.109
1.883MetLys: 1.883 ± 0.594
2.259MetLeu: 2.259 ± 0.398
1.13MetMet: 1.13 ± 0.507
0.753MetAsn: 0.753 ± 0.484
0.753MetPro: 0.753 ± 0.133
0.753MetGln: 0.753 ± 0.749
1.13MetArg: 1.13 ± 0.726
3.765MetSer: 3.765 ± 1.28
0.377MetThr: 0.377 ± 0.375
3.012MetVal: 3.012 ± 0.086
0.377MetTrp: 0.377 ± 0.242
0.753MetTyr: 0.753 ± 0.749
0.0MetXaa: 0.0 ± 0.0
Asn
5.271AsnAla: 5.271 ± 0.928
0.0AsnCys: 0.0 ± 0.0
1.506AsnAsp: 1.506 ± 0.352
1.506AsnGlu: 1.506 ± 0.265
1.13AsnPhe: 1.13 ± 0.507
4.895AsnGly: 4.895 ± 0.063
1.506AsnHis: 1.506 ± 0.968
3.012AsnIle: 3.012 ± 0.53
3.012AsnLys: 3.012 ± 0.086
4.895AsnLeu: 4.895 ± 0.553
1.13AsnMet: 1.13 ± 0.109
2.636AsnAsn: 2.636 ± 2.006
3.765AsnPro: 3.765 ± 0.663
1.13AsnGln: 1.13 ± 0.726
1.506AsnArg: 1.506 ± 0.265
5.271AsnSer: 5.271 ± 1.545
3.012AsnThr: 3.012 ± 0.086
3.389AsnVal: 3.389 ± 0.905
1.13AsnTrp: 1.13 ± 0.109
2.636AsnTyr: 2.636 ± 1.389
0.0AsnXaa: 0.0 ± 0.0
Pro
3.012ProAla: 3.012 ± 1.764
0.377ProCys: 0.377 ± 0.242
1.883ProAsp: 1.883 ± 0.023
4.895ProGlu: 4.895 ± 1.913
3.012ProPhe: 3.012 ± 0.53
1.883ProGly: 1.883 ± 0.64
1.506ProHis: 1.506 ± 0.352
1.883ProIle: 1.883 ± 0.023
2.259ProLys: 2.259 ± 0.836
4.142ProLeu: 4.142 ± 0.196
1.506ProMet: 1.506 ± 0.265
3.765ProAsn: 3.765 ± 0.663
0.753ProPro: 0.753 ± 0.133
1.506ProGln: 1.506 ± 0.968
1.506ProArg: 1.506 ± 0.352
3.012ProSer: 3.012 ± 0.703
6.024ProThr: 6.024 ± 3.527
2.636ProVal: 2.636 ± 0.156
0.377ProTrp: 0.377 ± 0.375
2.636ProTyr: 2.636 ± 2.006
0.0ProXaa: 0.0 ± 0.0
Gln
2.259GlnAla: 2.259 ± 0.219
1.506GlnCys: 1.506 ± 0.352
1.883GlnAsp: 1.883 ± 0.023
1.13GlnGlu: 1.13 ± 0.109
1.883GlnPhe: 1.883 ± 1.21
2.636GlnGly: 2.636 ± 0.461
0.753GlnHis: 0.753 ± 0.484
3.012GlnIle: 3.012 ± 0.086
1.13GlnLys: 1.13 ± 0.109
3.765GlnLeu: 3.765 ± 0.046
0.377GlnMet: 0.377 ± 0.242
1.883GlnAsn: 1.883 ± 0.64
1.506GlnPro: 1.506 ± 0.265
0.753GlnGln: 0.753 ± 0.133
1.13GlnArg: 1.13 ± 0.109
1.883GlnSer: 1.883 ± 0.64
1.883GlnThr: 1.883 ± 0.64
2.636GlnVal: 2.636 ± 0.461
0.377GlnTrp: 0.377 ± 0.242
1.13GlnTyr: 1.13 ± 0.507
0.0GlnXaa: 0.0 ± 0.0
Arg
0.377ArgAla: 0.377 ± 0.375
0.377ArgCys: 0.377 ± 0.242
1.506ArgAsp: 1.506 ± 0.265
3.389ArgGlu: 3.389 ± 0.288
3.389ArgPhe: 3.389 ± 0.905
2.259ArgGly: 2.259 ± 1.631
2.259ArgHis: 2.259 ± 1.452
4.142ArgIle: 4.142 ± 0.196
2.259ArgLys: 2.259 ± 0.219
4.895ArgLeu: 4.895 ± 0.063
2.636ArgMet: 2.636 ± 0.461
3.389ArgAsn: 3.389 ± 0.288
3.765ArgPro: 3.765 ± 1.896
1.506ArgGln: 1.506 ± 0.882
1.883ArgArg: 1.883 ± 0.594
1.506ArgSer: 1.506 ± 0.352
1.13ArgThr: 1.13 ± 0.109
1.506ArgVal: 1.506 ± 0.352
1.13ArgTrp: 1.13 ± 0.109
1.506ArgTyr: 1.506 ± 0.265
0.0ArgXaa: 0.0 ± 0.0
Ser
9.789SerAla: 9.789 ± 1.723
1.13SerCys: 1.13 ± 0.726
4.518SerAsp: 4.518 ± 0.795
3.389SerGlu: 3.389 ± 0.328
2.636SerPhe: 2.636 ± 0.156
6.777SerGly: 6.777 ± 1.193
1.13SerHis: 1.13 ± 0.507
7.907SerIle: 7.907 ± 0.15
2.636SerLys: 2.636 ± 1.694
4.518SerLeu: 4.518 ± 0.795
1.13SerMet: 1.13 ± 0.705
3.389SerAsn: 3.389 ± 0.905
3.389SerPro: 3.389 ± 0.288
2.259SerGln: 2.259 ± 0.398
3.389SerArg: 3.389 ± 0.328
7.53SerSer: 7.53 ± 3.176
3.389SerThr: 3.389 ± 0.905
4.518SerVal: 4.518 ± 0.179
1.883SerTrp: 1.883 ± 0.023
3.389SerTyr: 3.389 ± 0.945
0.0SerXaa: 0.0 ± 0.0
Thr
4.518ThrAla: 4.518 ± 1.412
1.13ThrCys: 1.13 ± 0.507
3.765ThrAsp: 3.765 ± 1.896
3.765ThrGlu: 3.765 ± 1.896
2.259ThrPhe: 2.259 ± 1.014
4.142ThrGly: 4.142 ± 1.038
0.753ThrHis: 0.753 ± 0.484
4.518ThrIle: 4.518 ± 0.179
1.506ThrLys: 1.506 ± 0.352
3.765ThrLeu: 3.765 ± 0.663
1.13ThrMet: 1.13 ± 0.109
3.389ThrAsn: 3.389 ± 0.288
3.389ThrPro: 3.389 ± 2.138
3.389ThrGln: 3.389 ± 0.328
2.636ThrArg: 2.636 ± 0.156
4.895ThrSer: 4.895 ± 2.403
4.142ThrThr: 4.142 ± 1.429
5.271ThrVal: 5.271 ± 0.311
0.377ThrTrp: 0.377 ± 0.242
3.389ThrTyr: 3.389 ± 0.905
0.0ThrXaa: 0.0 ± 0.0
Val
7.907ValAla: 7.907 ± 0.467
0.753ValCys: 0.753 ± 0.484
4.142ValAsp: 4.142 ± 0.421
4.142ValGlu: 4.142 ± 0.196
2.636ValPhe: 2.636 ± 1.078
2.636ValGly: 2.636 ± 0.461
1.506ValHis: 1.506 ± 0.882
1.883ValIle: 1.883 ± 0.023
4.518ValLys: 4.518 ± 1.671
4.518ValLeu: 4.518 ± 1.412
1.13ValMet: 1.13 ± 0.109
4.895ValAsn: 4.895 ± 0.063
4.518ValPro: 4.518 ± 1.671
1.13ValGln: 1.13 ± 0.507
3.389ValArg: 3.389 ± 0.905
3.765ValSer: 3.765 ± 0.57
3.765ValThr: 3.765 ± 1.28
6.401ValVal: 6.401 ± 4.115
1.13ValTrp: 1.13 ± 0.726
3.012ValTyr: 3.012 ± 0.086
0.0ValXaa: 0.0 ± 0.0
Trp
0.753TrpAla: 0.753 ± 0.484
0.0TrpCys: 0.0 ± 0.0
1.506TrpAsp: 1.506 ± 0.968
0.377TrpGlu: 0.377 ± 0.242
1.13TrpPhe: 1.13 ± 0.507
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.377TrpIle: 0.377 ± 0.242
1.13TrpLys: 1.13 ± 0.507
1.883TrpLeu: 1.883 ± 1.21
1.13TrpMet: 1.13 ± 0.507
0.377TrpAsn: 0.377 ± 0.375
0.377TrpPro: 0.377 ± 0.242
0.0TrpGln: 0.0 ± 0.0
0.377TrpArg: 0.377 ± 0.375
1.883TrpSer: 1.883 ± 0.023
0.377TrpThr: 0.377 ± 0.242
0.753TrpVal: 0.753 ± 0.484
0.0TrpTrp: 0.0 ± 0.0
0.377TrpTyr: 0.377 ± 0.242
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.765TyrAla: 3.765 ± 1.187
0.753TyrCys: 0.753 ± 0.133
1.883TyrAsp: 1.883 ± 0.023
1.883TyrGlu: 1.883 ± 0.023
1.506TyrPhe: 1.506 ± 0.968
3.012TyrGly: 3.012 ± 0.53
0.377TyrHis: 0.377 ± 0.375
2.636TyrIle: 2.636 ± 1.078
1.13TyrLys: 1.13 ± 0.726
2.259TyrLeu: 2.259 ± 0.219
2.259TyrMet: 2.259 ± 0.398
2.636TyrAsn: 2.636 ± 0.772
2.259TyrPro: 2.259 ± 0.219
1.13TyrGln: 1.13 ± 0.109
3.012TyrArg: 3.012 ± 1.147
2.259TyrSer: 2.259 ± 0.219
2.259TyrThr: 2.259 ± 1.014
1.883TyrVal: 1.883 ± 0.594
0.753TyrTrp: 0.753 ± 0.133
0.753TyrTyr: 0.753 ± 0.749
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2657 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski