Amino acid dipepetide frequency for Rhizophagus sp. RF1 medium virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.684AlaAla: 0.684 ± 0.382
0.0AlaCys: 0.0 ± 0.0
0.684AlaAsp: 0.684 ± 0.66
1.367AlaGlu: 1.367 ± 0.765
1.367AlaPhe: 1.367 ± 0.278
1.367AlaGly: 1.367 ± 0.765
0.0AlaHis: 0.0 ± 0.0
3.418AlaIle: 3.418 ± 1.912
2.051AlaLys: 2.051 ± 0.104
2.051AlaLeu: 2.051 ± 1.147
0.0AlaMet: 0.0 ± 0.0
1.367AlaAsn: 1.367 ± 0.765
1.367AlaPro: 1.367 ± 0.765
1.367AlaGln: 1.367 ± 0.278
1.367AlaArg: 1.367 ± 0.765
0.684AlaSer: 0.684 ± 0.382
1.367AlaThr: 1.367 ± 0.765
1.367AlaVal: 1.367 ± 0.278
0.684AlaTrp: 0.684 ± 0.66
1.367AlaTyr: 1.367 ± 0.278
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
1.367CysGlu: 1.367 ± 0.765
1.367CysPhe: 1.367 ± 0.765
1.367CysGly: 1.367 ± 0.278
0.684CysHis: 0.684 ± 0.66
1.367CysIle: 1.367 ± 0.278
1.367CysLys: 1.367 ± 1.321
1.367CysLeu: 1.367 ± 0.278
0.0CysMet: 0.0 ± 0.0
2.051CysAsn: 2.051 ± 0.104
0.0CysPro: 0.0 ± 0.0
0.684CysGln: 0.684 ± 0.382
0.684CysArg: 0.684 ± 0.382
0.684CysSer: 0.684 ± 0.382
0.684CysThr: 0.684 ± 0.382
1.367CysVal: 1.367 ± 0.765
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
0.684AspAla: 0.684 ± 0.382
0.684AspCys: 0.684 ± 0.382
6.152AspAsp: 6.152 ± 2.399
6.152AspGlu: 6.152 ± 0.73
4.101AspPhe: 4.101 ± 0.209
3.418AspGly: 3.418 ± 0.869
0.684AspHis: 0.684 ± 0.382
7.519AspIle: 7.519 ± 1.078
4.101AspLys: 4.101 ± 0.209
4.785AspLeu: 4.785 ± 0.591
1.367AspMet: 1.367 ± 0.278
5.468AspAsn: 5.468 ± 2.155
0.684AspPro: 0.684 ± 0.382
4.785AspGln: 4.785 ± 0.591
4.101AspArg: 4.101 ± 1.252
3.418AspSer: 3.418 ± 1.912
2.734AspThr: 2.734 ± 0.556
4.785AspVal: 4.785 ± 2.677
2.051AspTrp: 2.051 ± 0.104
3.418AspTyr: 3.418 ± 1.216
0.0AspXaa: 0.0 ± 0.0
Glu
0.684GluAla: 0.684 ± 0.382
0.0GluCys: 0.0 ± 0.0
0.684GluAsp: 0.684 ± 0.382
1.367GluGlu: 1.367 ± 1.321
1.367GluPhe: 1.367 ± 0.765
1.367GluGly: 1.367 ± 0.278
1.367GluHis: 1.367 ± 0.765
3.418GluIle: 3.418 ± 1.216
5.468GluLys: 5.468 ± 0.069
5.468GluLeu: 5.468 ± 1.112
0.684GluMet: 0.684 ± 0.382
2.734GluAsn: 2.734 ± 1.599
1.367GluPro: 1.367 ± 0.278
0.684GluGln: 0.684 ± 0.382
6.835GluArg: 6.835 ± 0.347
5.468GluSer: 5.468 ± 0.069
4.785GluThr: 4.785 ± 1.634
4.785GluVal: 4.785 ± 1.634
2.051GluTrp: 2.051 ± 0.104
4.101GluTyr: 4.101 ± 0.834
0.0GluXaa: 0.0 ± 0.0
Phe
2.051PheAla: 2.051 ± 0.104
0.684PheCys: 0.684 ± 0.66
1.367PheAsp: 1.367 ± 0.765
2.734PheGlu: 2.734 ± 1.599
2.734PhePhe: 2.734 ± 1.53
2.051PheGly: 2.051 ± 0.104
0.684PheHis: 0.684 ± 0.66
2.051PheIle: 2.051 ± 0.104
4.101PheLys: 4.101 ± 0.209
2.051PheLeu: 2.051 ± 0.104
2.051PheMet: 2.051 ± 1.147
4.101PheAsn: 4.101 ± 1.252
1.367PhePro: 1.367 ± 0.278
1.367PheGln: 1.367 ± 0.765
0.684PheArg: 0.684 ± 0.66
4.785PheSer: 4.785 ± 0.591
0.0PheThr: 0.0 ± 0.0
4.785PheVal: 4.785 ± 1.494
0.684PheTrp: 0.684 ± 0.66
1.367PheTyr: 1.367 ± 0.765
0.0PheXaa: 0.0 ± 0.0
Gly
1.367GlyAla: 1.367 ± 0.765
1.367GlyCys: 1.367 ± 0.765
4.101GlyAsp: 4.101 ± 2.295
1.367GlyGlu: 1.367 ± 0.765
0.684GlyPhe: 0.684 ± 0.382
1.367GlyGly: 1.367 ± 0.278
0.0GlyHis: 0.0 ± 0.0
4.101GlyIle: 4.101 ± 1.877
5.468GlyLys: 5.468 ± 0.069
4.785GlyLeu: 4.785 ± 0.452
2.051GlyMet: 2.051 ± 0.938
1.367GlyAsn: 1.367 ± 0.765
0.0GlyPro: 0.0 ± 0.0
1.367GlyGln: 1.367 ± 0.765
3.418GlyArg: 3.418 ± 0.869
4.785GlySer: 4.785 ± 0.591
0.0GlyThr: 0.0 ± 0.0
5.468GlyVal: 5.468 ± 1.112
0.684GlyTrp: 0.684 ± 0.382
1.367GlyTyr: 1.367 ± 0.765
0.0GlyXaa: 0.0 ± 0.0
His
1.367HisAla: 1.367 ± 0.278
0.0HisCys: 0.0 ± 0.0
1.367HisAsp: 1.367 ± 0.765
1.367HisGlu: 1.367 ± 0.765
0.0HisPhe: 0.0 ± 0.0
0.684HisGly: 0.684 ± 0.382
0.0HisHis: 0.0 ± 0.0
1.367HisIle: 1.367 ± 0.765
1.367HisLys: 1.367 ± 1.321
0.684HisLeu: 0.684 ± 0.382
0.684HisMet: 0.684 ± 0.382
0.0HisAsn: 0.0 ± 0.0
0.684HisPro: 0.684 ± 0.382
0.684HisGln: 0.684 ± 0.382
0.684HisArg: 0.684 ± 0.66
2.051HisSer: 2.051 ± 1.981
0.0HisThr: 0.0 ± 0.0
0.684HisVal: 0.684 ± 0.382
0.0HisTrp: 0.0 ± 0.0
0.684HisTyr: 0.684 ± 0.382
0.0HisXaa: 0.0 ± 0.0
Ile
0.684IleAla: 0.684 ± 0.382
0.0IleCys: 0.0 ± 0.0
8.202IleAsp: 8.202 ± 0.418
5.468IleGlu: 5.468 ± 0.974
3.418IlePhe: 3.418 ± 1.216
4.101IleGly: 4.101 ± 0.209
0.0IleHis: 0.0 ± 0.0
9.569IleIle: 9.569 ± 1.946
9.569IleLys: 9.569 ± 0.14
8.886IleLeu: 8.886 ± 2.328
1.367IleMet: 1.367 ± 0.765
10.253IleAsn: 10.253 ± 3.649
2.051IlePro: 2.051 ± 0.104
2.051IleGln: 2.051 ± 0.938
7.519IleArg: 7.519 ± 0.035
2.734IleSer: 2.734 ± 1.599
3.418IleThr: 3.418 ± 3.302
3.418IleVal: 3.418 ± 2.259
0.684IleTrp: 0.684 ± 0.66
3.418IleTyr: 3.418 ± 1.912
0.0IleXaa: 0.0 ± 0.0
Lys
2.051LysAla: 2.051 ± 0.938
0.0LysCys: 0.0 ± 0.0
6.835LysAsp: 6.835 ± 0.347
4.101LysGlu: 4.101 ± 1.877
5.468LysPhe: 5.468 ± 2.155
2.734LysGly: 2.734 ± 0.487
2.051LysHis: 2.051 ± 0.938
10.936LysIle: 10.936 ± 2.224
2.734LysLys: 2.734 ± 0.556
6.152LysLeu: 6.152 ± 2.399
0.684LysMet: 0.684 ± 0.66
6.152LysAsn: 6.152 ± 1.356
3.418LysPro: 3.418 ± 1.216
1.367LysGln: 1.367 ± 0.765
2.734LysArg: 2.734 ± 0.556
4.785LysSer: 4.785 ± 1.494
4.101LysThr: 4.101 ± 0.209
4.785LysVal: 4.785 ± 1.494
3.418LysTrp: 3.418 ± 0.174
6.152LysTyr: 6.152 ± 0.313
0.0LysXaa: 0.0 ± 0.0
Leu
2.734LeuAla: 2.734 ± 0.487
2.051LeuCys: 2.051 ± 1.981
8.202LeuAsp: 8.202 ± 1.461
3.418LeuGlu: 3.418 ± 0.869
2.734LeuPhe: 2.734 ± 1.599
4.785LeuGly: 4.785 ± 1.634
1.367LeuHis: 1.367 ± 0.765
4.785LeuIle: 4.785 ± 0.452
4.785LeuLys: 4.785 ± 0.452
7.519LeuLeu: 7.519 ± 1.078
4.101LeuMet: 4.101 ± 1.051
6.152LeuAsn: 6.152 ± 1.772
6.152LeuPro: 6.152 ± 0.313
2.051LeuGln: 2.051 ± 0.104
8.886LeuArg: 8.886 ± 2.328
10.936LeuSer: 10.936 ± 1.947
7.519LeuThr: 7.519 ± 1.078
6.835LeuVal: 6.835 ± 1.739
2.051LeuTrp: 2.051 ± 0.938
2.734LeuTyr: 2.734 ± 0.487
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
1.367MetCys: 1.367 ± 0.278
2.734MetAsp: 2.734 ± 1.53
1.367MetGlu: 1.367 ± 0.765
0.684MetPhe: 0.684 ± 0.382
2.051MetGly: 2.051 ± 0.938
0.0MetHis: 0.0 ± 0.0
4.101MetIle: 4.101 ± 3.962
1.367MetLys: 1.367 ± 0.765
2.734MetLeu: 2.734 ± 0.556
0.684MetMet: 0.684 ± 1.019
0.684MetAsn: 0.684 ± 0.382
0.0MetPro: 0.0 ± 0.0
0.0MetGln: 0.0 ± 0.0
2.051MetArg: 2.051 ± 0.104
3.418MetSer: 3.418 ± 1.216
1.367MetThr: 1.367 ± 0.765
0.684MetVal: 0.684 ± 0.382
0.684MetTrp: 0.684 ± 0.66
1.367MetTyr: 1.367 ± 0.278
0.0MetXaa: 0.0 ± 0.0
Asn
0.684AsnAla: 0.684 ± 0.382
1.367AsnCys: 1.367 ± 1.321
4.101AsnAsp: 4.101 ± 1.877
3.418AsnGlu: 3.418 ± 0.869
2.734AsnPhe: 2.734 ± 0.556
2.734AsnGly: 2.734 ± 0.487
0.684AsnHis: 0.684 ± 0.382
7.519AsnIle: 7.519 ± 2.05
4.785AsnLys: 4.785 ± 2.537
8.886AsnLeu: 8.886 ± 1.843
1.367AsnMet: 1.367 ± 0.278
3.418AsnAsn: 3.418 ± 0.869
4.101AsnPro: 4.101 ± 2.295
1.367AsnGln: 1.367 ± 0.278
3.418AsnArg: 3.418 ± 0.174
4.101AsnSer: 4.101 ± 1.877
3.418AsnThr: 3.418 ± 0.174
6.152AsnVal: 6.152 ± 0.73
2.051AsnTrp: 2.051 ± 0.104
3.418AsnTyr: 3.418 ± 0.174
0.0AsnXaa: 0.0 ± 0.0
Pro
2.734ProAla: 2.734 ± 0.487
0.0ProCys: 0.0 ± 0.0
2.734ProAsp: 2.734 ± 0.487
1.367ProGlu: 1.367 ± 0.765
2.734ProPhe: 2.734 ± 0.487
0.0ProGly: 0.0 ± 0.0
0.684ProHis: 0.684 ± 0.382
5.468ProIle: 5.468 ± 1.112
1.367ProLys: 1.367 ± 0.278
2.734ProLeu: 2.734 ± 0.487
2.051ProMet: 2.051 ± 0.104
1.367ProAsn: 1.367 ± 0.278
0.684ProPro: 0.684 ± 0.382
1.367ProGln: 1.367 ± 0.765
0.684ProArg: 0.684 ± 0.382
4.785ProSer: 4.785 ± 1.494
0.684ProThr: 0.684 ± 0.382
1.367ProVal: 1.367 ± 0.765
0.0ProTrp: 0.0 ± 0.0
1.367ProTyr: 1.367 ± 0.765
0.0ProXaa: 0.0 ± 0.0
Gln
0.0GlnAla: 0.0 ± 0.0
0.0GlnCys: 0.0 ± 0.0
2.051GlnAsp: 2.051 ± 1.147
0.684GlnGlu: 0.684 ± 0.382
1.367GlnPhe: 1.367 ± 0.765
0.684GlnGly: 0.684 ± 0.382
0.684GlnHis: 0.684 ± 0.66
2.051GlnIle: 2.051 ± 0.104
1.367GlnLys: 1.367 ± 0.278
2.734GlnLeu: 2.734 ± 0.556
1.367GlnMet: 1.367 ± 0.765
1.367GlnAsn: 1.367 ± 0.278
1.367GlnPro: 1.367 ± 0.765
0.684GlnGln: 0.684 ± 0.382
1.367GlnArg: 1.367 ± 0.765
3.418GlnSer: 3.418 ± 0.174
1.367GlnThr: 1.367 ± 0.278
0.684GlnVal: 0.684 ± 0.66
0.684GlnTrp: 0.684 ± 0.382
0.684GlnTyr: 0.684 ± 0.382
0.0GlnXaa: 0.0 ± 0.0
Arg
2.051ArgAla: 2.051 ± 0.104
2.051ArgCys: 2.051 ± 1.147
5.468ArgAsp: 5.468 ± 0.069
2.051ArgGlu: 2.051 ± 0.938
2.051ArgPhe: 2.051 ± 0.104
2.734ArgGly: 2.734 ± 0.487
2.051ArgHis: 2.051 ± 0.104
3.418ArgIle: 3.418 ± 1.216
7.519ArgLys: 7.519 ± 4.136
8.886ArgLeu: 8.886 ± 1.843
1.367ArgMet: 1.367 ± 0.278
2.051ArgAsn: 2.051 ± 0.104
2.734ArgPro: 2.734 ± 0.487
0.0ArgGln: 0.0 ± 0.0
6.152ArgArg: 6.152 ± 1.356
2.051ArgSer: 2.051 ± 0.104
0.684ArgThr: 0.684 ± 0.382
3.418ArgVal: 3.418 ± 0.174
2.734ArgTrp: 2.734 ± 0.487
4.101ArgTyr: 4.101 ± 0.834
0.0ArgXaa: 0.0 ± 0.0
Ser
1.367SerAla: 1.367 ± 0.765
0.684SerCys: 0.684 ± 0.382
2.734SerAsp: 2.734 ± 0.487
6.835SerGlu: 6.835 ± 2.433
1.367SerPhe: 1.367 ± 0.765
4.785SerGly: 4.785 ± 0.452
2.051SerHis: 2.051 ± 1.147
2.734SerIle: 2.734 ± 0.487
3.418SerLys: 3.418 ± 1.216
11.62SerLeu: 11.62 ± 0.799
2.734SerMet: 2.734 ± 0.556
5.468SerAsn: 5.468 ± 1.112
0.684SerPro: 0.684 ± 0.66
1.367SerGln: 1.367 ± 0.278
3.418SerArg: 3.418 ± 0.869
3.418SerSer: 3.418 ± 0.869
2.734SerThr: 2.734 ± 0.556
5.468SerVal: 5.468 ± 1.112
1.367SerTrp: 1.367 ± 0.765
4.785SerTyr: 4.785 ± 0.591
0.0SerXaa: 0.0 ± 0.0
Thr
2.734ThrAla: 2.734 ± 1.53
0.684ThrCys: 0.684 ± 0.382
2.734ThrAsp: 2.734 ± 0.556
1.367ThrGlu: 1.367 ± 0.278
0.684ThrPhe: 0.684 ± 0.382
0.684ThrGly: 0.684 ± 0.382
0.0ThrHis: 0.0 ± 0.0
2.051ThrIle: 2.051 ± 0.938
6.835ThrLys: 6.835 ± 0.347
6.835ThrLeu: 6.835 ± 0.347
0.0ThrMet: 0.0 ± 0.0
4.785ThrAsn: 4.785 ± 1.494
2.734ThrPro: 2.734 ± 0.556
0.0ThrGln: 0.0 ± 0.0
0.684ThrArg: 0.684 ± 0.382
0.684ThrSer: 0.684 ± 0.66
1.367ThrThr: 1.367 ± 0.278
1.367ThrVal: 1.367 ± 0.765
1.367ThrTrp: 1.367 ± 0.765
2.051ThrTyr: 2.051 ± 0.104
0.0ThrXaa: 0.0 ± 0.0
Val
2.051ValAla: 2.051 ± 1.147
1.367ValCys: 1.367 ± 0.765
6.152ValAsp: 6.152 ± 0.73
4.101ValGlu: 4.101 ± 0.209
2.734ValPhe: 2.734 ± 1.53
2.734ValGly: 2.734 ± 0.556
1.367ValHis: 1.367 ± 0.278
6.152ValIle: 6.152 ± 1.356
7.519ValLys: 7.519 ± 2.121
2.734ValLeu: 2.734 ± 1.599
2.051ValMet: 2.051 ± 0.938
8.886ValAsn: 8.886 ± 1.843
2.734ValPro: 2.734 ± 0.487
0.684ValGln: 0.684 ± 0.66
4.101ValArg: 4.101 ± 1.877
4.101ValSer: 4.101 ± 0.209
1.367ValThr: 1.367 ± 1.321
2.734ValVal: 2.734 ± 0.556
0.684ValTrp: 0.684 ± 0.66
4.101ValTyr: 4.101 ± 1.252
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.684TrpCys: 0.684 ± 0.382
2.734TrpAsp: 2.734 ± 0.487
1.367TrpGlu: 1.367 ± 0.278
1.367TrpPhe: 1.367 ± 0.278
0.684TrpGly: 0.684 ± 0.382
0.0TrpHis: 0.0 ± 0.0
2.051TrpIle: 2.051 ± 0.938
2.051TrpLys: 2.051 ± 0.938
2.734TrpLeu: 2.734 ± 0.556
2.051TrpMet: 2.051 ± 0.938
1.367TrpAsn: 1.367 ± 0.278
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.367TrpArg: 1.367 ± 1.321
0.684TrpSer: 0.684 ± 0.382
0.684TrpThr: 0.684 ± 0.382
2.051TrpVal: 2.051 ± 1.147
0.0TrpTrp: 0.0 ± 0.0
0.684TrpTyr: 0.684 ± 0.382
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.0TyrAla: 0.0 ± 0.0
2.051TyrCys: 2.051 ± 1.147
2.051TyrAsp: 2.051 ± 0.104
3.418TyrGlu: 3.418 ± 0.869
2.734TyrPhe: 2.734 ± 0.556
4.785TyrGly: 4.785 ± 0.591
0.0TyrHis: 0.0 ± 0.0
2.734TyrIle: 2.734 ± 0.556
3.418TyrLys: 3.418 ± 0.869
6.152TyrLeu: 6.152 ± 0.313
0.684TyrMet: 0.684 ± 0.66
0.684TyrAsn: 0.684 ± 0.382
2.051TyrPro: 2.051 ± 0.104
2.734TyrGln: 2.734 ± 0.487
3.418TyrArg: 3.418 ± 0.174
2.051TyrSer: 2.051 ± 1.147
1.367TyrThr: 1.367 ± 0.278
6.152TyrVal: 6.152 ± 1.356
0.684TyrTrp: 0.684 ± 0.66
0.684TyrTyr: 0.684 ± 0.382
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1464 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski