Amino acid dipeptide frequency for Microviridae Fen7895_21

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility, dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
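As a rough illustration of how such frequencies could be computed, the Python sketch below counts overlapping adjacent residue pairs in a set of protein sequences, reports each dipeptide in per mille, and compares it with the uniform expectation of 1/400. The function names, the toy sequences, the use of one-letter codes, and the normalisation by the total number of counted pairs are assumptions made for this example; the database's exact procedure is not described here and may differ.

```python
from collections import Counter

# Uniform expectation for 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping adjacent pairs are counted within each protein
    (never across protein boundaries) and normalised by the total pair count.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the expected value."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Hypothetical toy sequences (one-letter codes), not taken from the proteome.
toy = ["MKLLA", "LLK"]
freqs = dipeptide_per_mille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3), classify(freqs[pair]))
```

Dipeptides that never occur are simply absent from the returned dictionary; they correspond to the 0.0 entries of the table and would be classified as underrepresented.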

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions; for example, 8.753‰ corresponds to 0.008753, or 0.8753%.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.753 ± 4.788
AlaCys: 1.459 ± 1.492
AlaAsp: 8.023 ± 3.215
AlaGlu: 2.918 ± 1.934
AlaPhe: 2.918 ± 1.822
AlaGly: 4.376 ± 2.006
AlaHis: 2.918 ± 1.372
AlaIle: 6.565 ± 2.694
AlaLys: 5.106 ± 1.45
AlaLeu: 8.023 ± 1.486
AlaMet: 0.0 ± 0.0
AlaAsn: 5.835 ± 4.312
AlaPro: 4.376 ± 2.006
AlaGln: 8.023 ± 4.091
AlaArg: 6.565 ± 1.251
AlaSer: 2.918 ± 1.066
AlaThr: 3.647 ± 1.945
AlaVal: 3.647 ± 1.321
AlaTrp: 1.459 ± 0.686
AlaTyr: 4.376 ± 2.561
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.729 ± 0.493
CysAsp: 0.0 ± 0.0
CysGlu: 0.729 ± 0.493
CysPhe: 0.0 ± 0.0
CysGly: 1.459 ± 1.492
CysHis: 0.0 ± 0.0
CysIle: 1.459 ± 0.686
CysLys: 0.0 ± 0.0
CysLeu: 2.188 ± 2.387
CysMet: 0.0 ± 0.0
CysAsn: 1.459 ± 0.686
CysPro: 1.459 ± 1.492
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 0.729 ± 0.493
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.835 ± 1.512
AspCys: 0.0 ± 0.0
AspAsp: 2.918 ± 2.984
AspGlu: 4.376 ± 0.902
AspPhe: 2.918 ± 1.326
AspGly: 5.106 ± 1.277
AspHis: 0.729 ± 0.746
AspIle: 0.0 ± 0.0
AspLys: 2.918 ± 1.942
AspLeu: 2.918 ± 1.942
AspMet: 1.459 ± 0.963
AspAsn: 2.918 ± 0.656
AspPro: 3.647 ± 1.036
AspGln: 2.918 ± 0.798
AspArg: 5.835 ± 1.794
AspSer: 0.729 ± 0.493
AspThr: 4.376 ± 2.399
AspVal: 4.376 ± 1.866
AspTrp: 1.459 ± 0.986
AspTyr: 4.376 ± 2.233
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.647 ± 1.793
GluCys: 0.0 ± 0.0
GluAsp: 5.106 ± 1.928
GluGlu: 1.459 ± 0.963
GluPhe: 2.188 ± 0.933
GluGly: 2.188 ± 1.174
GluHis: 0.729 ± 0.493
GluIle: 2.918 ± 2.156
GluLys: 0.0 ± 0.0
GluLeu: 3.647 ± 1.825
GluMet: 1.459 ± 0.669
GluAsn: 0.0 ± 0.0
GluPro: 1.459 ± 0.686
GluGln: 1.459 ± 0.986
GluArg: 0.729 ± 0.493
GluSer: 0.0 ± 0.0
GluThr: 2.918 ± 1.338
GluVal: 2.188 ± 0.933
GluTrp: 0.729 ± 0.493
GluTyr: 3.647 ± 1.562
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.647 ± 1.321
PheCys: 0.0 ± 0.0
PheAsp: 2.188 ± 0.933
PheGlu: 1.459 ± 0.963
PhePhe: 2.188 ± 1.346
PheGly: 0.729 ± 1.173
PheHis: 1.459 ± 0.686
PheIle: 2.188 ± 1.346
PheLys: 2.188 ± 1.131
PheLeu: 4.376 ± 1.866
PheMet: 0.729 ± 0.493
PheAsn: 2.918 ± 0.798
PhePro: 1.459 ± 1.323
PheGln: 2.918 ± 0.656
PheArg: 2.918 ± 1.484
PheSer: 2.188 ± 1.174
PheThr: 0.729 ± 0.493
PheVal: 2.918 ± 1.489
PheTrp: 0.0 ± 0.0
PheTyr: 1.459 ± 0.986
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.647 ± 1.908
GlyCys: 0.0 ± 0.0
GlyAsp: 3.647 ± 2.465
GlyGlu: 2.918 ± 1.489
GlyPhe: 1.459 ± 0.986
GlyGly: 3.647 ± 1.036
GlyHis: 2.188 ± 0.553
GlyIle: 3.647 ± 1.691
GlyLys: 0.729 ± 0.746
GlyLeu: 5.835 ± 1.512
GlyMet: 0.0 ± 0.0
GlyAsn: 4.376 ± 1.461
GlyPro: 0.0 ± 0.0
GlyGln: 1.459 ± 0.986
GlyArg: 2.918 ± 2.684
GlySer: 4.376 ± 1.342
GlyThr: 5.106 ± 2.288
GlyVal: 3.647 ± 2.465
GlyTrp: 0.729 ± 0.746
GlyTyr: 3.647 ± 1.562
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.459 ± 0.986
HisCys: 0.0 ± 0.0
HisAsp: 1.459 ± 0.669
HisGlu: 0.0 ± 0.0
HisPhe: 2.188 ± 0.933
HisGly: 1.459 ± 0.986
HisHis: 0.0 ± 0.0
HisIle: 1.459 ± 1.492
HisLys: 0.0 ± 0.0
HisLeu: 2.188 ± 0.949
HisMet: 0.0 ± 0.636
HisAsn: 0.729 ± 0.693
HisPro: 0.729 ± 0.746
HisGln: 1.459 ± 0.963
HisArg: 3.647 ± 2.798
HisSer: 0.729 ± 0.693
HisThr: 0.0 ± 0.0
HisVal: 2.918 ± 2.065
HisTrp: 0.729 ± 0.493
HisTyr: 1.459 ± 1.492
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.106 ± 2.207
IleCys: 0.729 ± 1.173
IleAsp: 2.188 ± 0.553
IleGlu: 1.459 ± 0.986
IlePhe: 1.459 ± 0.986
IleGly: 3.647 ± 0.791
IleHis: 0.729 ± 0.746
IleIle: 0.729 ± 0.982
IleLys: 2.918 ± 1.462
IleLeu: 5.106 ± 1.599
IleMet: 5.106 ± 1.881
IleAsn: 2.188 ± 0.959
IlePro: 4.376 ± 1.476
IleGln: 3.647 ± 1.945
IleArg: 6.565 ± 3.442
IleSer: 6.565 ± 4.342
IleThr: 2.188 ± 0.933
IleVal: 1.459 ± 1.323
IleTrp: 0.0 ± 0.0
IleTyr: 0.729 ± 0.693
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.647 ± 2.613
LysCys: 0.0 ± 0.0
LysAsp: 0.729 ± 0.982
LysGlu: 2.918 ± 1.356
LysPhe: 4.376 ± 1.684
LysGly: 2.188 ± 0.553
LysHis: 0.0 ± 0.0
LysIle: 1.459 ± 1.175
LysLys: 2.188 ± 1.708
LysLeu: 4.376 ± 1.512
LysMet: 0.729 ± 0.493
LysAsn: 0.0 ± 0.0
LysPro: 2.188 ± 0.959
LysGln: 4.376 ± 2.394
LysArg: 2.918 ± 2.646
LysSer: 2.188 ± 0.553
LysThr: 1.459 ± 1.386
LysVal: 2.918 ± 1.634
LysTrp: 0.729 ± 0.493
LysTyr: 0.729 ± 0.746
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.482 ± 1.752
LeuCys: 1.459 ± 1.323
LeuAsp: 8.023 ± 2.73
LeuGlu: 3.647 ± 1.793
LeuPhe: 2.188 ± 1.285
LeuGly: 4.376 ± 1.461
LeuHis: 1.459 ± 1.492
LeuIle: 7.294 ± 5.54
LeuLys: 6.565 ± 2.409
LeuLeu: 12.4 ± 14.729
LeuMet: 4.376 ± 1.926
LeuAsn: 5.835 ± 1.768
LeuPro: 7.294 ± 2.525
LeuGln: 3.647 ± 1.566
LeuArg: 2.918 ± 1.573
LeuSer: 6.565 ± 2.934
LeuThr: 5.106 ± 3.003
LeuVal: 6.565 ± 2.982
LeuTrp: 1.459 ± 0.986
LeuTyr: 1.459 ± 1.128
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.188 ± 1.319
MetCys: 0.729 ± 0.493
MetAsp: 0.729 ± 0.493
MetGlu: 1.459 ± 0.986
MetPhe: 0.0 ± 0.0
MetGly: 0.0 ± 0.0
MetHis: 0.729 ± 0.493
MetIle: 1.459 ± 1.128
MetLys: 0.729 ± 0.746
MetLeu: 2.918 ± 2.157
MetMet: 2.188 ± 1.641
MetAsn: 1.459 ± 0.817
MetPro: 0.729 ± 0.693
MetGln: 1.459 ± 1.386
MetArg: 2.188 ± 1.27
MetSer: 5.835 ± 1.316
MetThr: 2.918 ± 1.954
MetVal: 0.0 ± 0.0
MetTrp: 1.459 ± 0.669
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 10.212 ± 5.392
AsnCys: 0.729 ± 0.746
AsnAsp: 3.647 ± 1.833
AsnGlu: 1.459 ± 0.669
AsnPhe: 2.188 ± 2.238
AsnGly: 3.647 ± 1.036
AsnHis: 0.729 ± 0.493
AsnIle: 1.459 ± 0.986
AsnLys: 2.188 ± 1.174
AsnLeu: 4.376 ± 2.166
AsnMet: 0.729 ± 0.693
AsnAsn: 7.294 ± 3.89
AsnPro: 1.459 ± 1.11
AsnGln: 1.459 ± 1.492
AsnArg: 2.918 ± 0.656
AsnSer: 4.376 ± 3.298
AsnThr: 7.294 ± 3.026
AsnVal: 4.376 ± 0.888
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.729 ± 0.693
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.918 ± 1.13
ProCys: 1.459 ± 0.686
ProAsp: 0.729 ± 0.982
ProGlu: 2.188 ± 1.174
ProPhe: 2.188 ± 1.147
ProGly: 2.188 ± 0.933
ProHis: 2.188 ± 1.319
ProIle: 5.106 ± 1.586
ProLys: 3.647 ± 1.02
ProLeu: 5.835 ± 2.568
ProMet: 2.188 ± 1.27
ProAsn: 4.376 ± 1.164
ProPro: 3.647 ± 2.09
ProGln: 1.459 ± 0.963
ProArg: 2.188 ± 1.346
ProSer: 2.918 ± 1.066
ProThr: 3.647 ± 2.014
ProVal: 5.106 ± 1.667
ProTrp: 0.729 ± 0.493
ProTyr: 0.729 ± 0.493
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.647 ± 1.908
GlnCys: 0.0 ± 0.0
GlnAsp: 1.459 ± 1.11
GlnGlu: 1.459 ± 0.817
GlnPhe: 2.188 ± 1.978
GlnGly: 5.106 ± 1.577
GlnHis: 2.188 ± 1.174
GlnIle: 1.459 ± 0.669
GlnLys: 2.918 ± 1.045
GlnLeu: 5.835 ± 2.537
GlnMet: 1.459 ± 1.386
GlnAsn: 4.376 ± 1.758
GlnPro: 4.376 ± 2.006
GlnGln: 3.647 ± 1.945
GlnArg: 4.376 ± 1.512
GlnSer: 1.459 ± 1.11
GlnThr: 2.188 ± 2.079
GlnVal: 0.729 ± 0.982
GlnTrp: 0.729 ± 0.493
GlnTyr: 1.459 ± 0.686
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.376 ± 1.881
ArgCys: 1.459 ± 0.686
ArgAsp: 2.918 ± 0.656
ArgGlu: 0.0 ± 0.0
ArgPhe: 2.188 ± 1.008
ArgGly: 0.729 ± 0.493
ArgHis: 0.729 ± 0.493
ArgIle: 5.106 ± 2.123
ArgLys: 3.647 ± 1.417
ArgLeu: 10.941 ± 5.274
ArgMet: 0.729 ± 0.746
ArgAsn: 2.918 ± 2.065
ArgPro: 2.918 ± 1.573
ArgGln: 1.459 ± 0.669
ArgArg: 0.729 ± 0.746
ArgSer: 3.647 ± 2.798
ArgThr: 2.188 ± 1.346
ArgVal: 5.106 ± 1.157
ArgTrp: 0.0 ± 0.0
ArgTyr: 5.106 ± 2.265
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.565 ± 1.449
SerCys: 0.729 ± 0.493
SerAsp: 3.647 ± 1.833
SerGlu: 2.918 ± 1.326
SerPhe: 0.0 ± 0.0
SerGly: 2.918 ± 1.356
SerHis: 0.729 ± 0.493
SerIle: 5.835 ± 4.211
SerLys: 2.188 ± 0.553
SerLeu: 8.753 ± 5.273
SerMet: 1.459 ± 1.386
SerAsn: 1.459 ± 0.669
SerPro: 2.918 ± 1.066
SerGln: 2.918 ± 0.656
SerArg: 1.459 ± 0.686
SerSer: 2.188 ± 1.627
SerThr: 2.918 ± 0.798
SerVal: 5.835 ± 2.415
SerTrp: 0.0 ± 0.0
SerTyr: 2.188 ± 1.403
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.565 ± 2.544
ThrCys: 0.0 ± 0.0
ThrAsp: 5.106 ± 1.577
ThrGlu: 0.729 ± 0.493
ThrPhe: 0.0 ± 0.0
ThrGly: 3.647 ± 1.933
ThrHis: 0.729 ± 0.693
ThrIle: 3.647 ± 1.02
ThrLys: 0.729 ± 0.982
ThrLeu: 5.835 ± 1.876
ThrMet: 2.918 ± 0.656
ThrAsn: 5.835 ± 4.797
ThrPro: 3.647 ± 2.824
ThrGln: 2.918 ± 1.368
ThrArg: 2.188 ± 0.553
ThrSer: 4.376 ± 2.54
ThrThr: 4.376 ± 1.917
ThrVal: 4.376 ± 0.918
ThrTrp: 0.0 ± 0.0
ThrTyr: 0.0 ± 0.0
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.565 ± 2.789
ValCys: 0.729 ± 0.746
ValAsp: 3.647 ± 2.809
ValGlu: 2.188 ± 1.479
ValPhe: 2.918 ± 1.561
ValGly: 3.647 ± 1.28
ValHis: 0.729 ± 0.693
ValIle: 3.647 ± 1.367
ValLys: 1.459 ± 1.128
ValLeu: 3.647 ± 2.485
ValMet: 1.459 ± 1.017
ValAsn: 5.106 ± 1.667
ValPro: 6.565 ± 2.8
ValGln: 2.918 ± 0.656
ValArg: 3.647 ± 2.427
ValSer: 4.376 ± 1.164
ValThr: 4.376 ± 1.164
ValVal: 6.565 ± 1.449
ValTrp: 2.188 ± 1.346
ValTyr: 1.459 ± 0.669
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.729 ± 0.493
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.729 ± 0.493
TrpPhe: 2.188 ± 0.933
TrpGly: 0.729 ± 0.493
TrpHis: 1.459 ± 0.686
TrpIle: 0.729 ± 0.746
TrpLys: 0.0 ± 0.0
TrpLeu: 0.0 ± 0.0
TrpMet: 0.729 ± 0.693
TrpAsn: 0.0 ± 0.0
TrpPro: 1.459 ± 0.986
TrpGln: 1.459 ± 0.986
TrpArg: 0.0 ± 0.0
TrpSer: 1.459 ± 0.986
TrpThr: 0.0 ± 0.0
TrpVal: 0.729 ± 0.746
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.729 ± 0.493
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.918 ± 1.972
TyrCys: 0.0 ± 0.0
TyrAsp: 3.647 ± 1.562
TyrGlu: 1.459 ± 1.492
TyrPhe: 2.918 ± 1.573
TyrGly: 2.188 ± 1.403
TyrHis: 2.918 ± 2.065
TyrIle: 0.729 ± 0.493
TyrLys: 0.0 ± 0.0
TyrLeu: 2.918 ± 1.573
TyrMet: 0.729 ± 0.493
TyrAsn: 2.188 ± 2.079
TyrPro: 0.729 ± 0.493
TyrGln: 1.459 ± 0.986
TyrArg: 2.188 ± 1.479
TyrSer: 0.729 ± 0.493
TyrThr: 1.459 ± 1.492
TyrVal: 4.376 ± 1.042
TyrTrp: 0.729 ± 0.493
TyrTyr: 2.918 ± 1.326
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1372 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
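One way to read "bootstrapping at the protein level" is sketched below in Python: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. This is only an assumed interpretation; the function names, the fixed seed, the toy sequences, and the use of the population standard deviation as the reported error are choices made for this example and are not confirmed by the note above.

```python
import random
import statistics


def pair_frequency(proteins, pair):
    """Per-mille frequency of one dipeptide over all overlapping pairs."""
    total = sum(max(len(seq) - 1, 0) for seq in proteins)
    hits = sum(
        1
        for seq in proteins
        for i in range(len(seq) - 1)
        if seq[i:i + 2] == pair
    )
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Spread of the frequency across protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the same count.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.pstdev(estimates)


# Hypothetical toy sequences (one-letter codes), not taken from the proteome.
toy = ["MKLLA", "LLK", "AKLMA"]
point = pair_frequency(toy, "LL")
error = bootstrap_error(toy, "LL")
print(f"LL: {point:.3f} ± {error:.3f}")
```

With only a handful of proteins, as in the 5-protein proteome summarised here, each replicate can differ strongly from the full set, which would be consistent with errors that are large relative to the values themselves.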

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski