Amino acid dipepetide frequency for Potexvirus sp.

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.447AlaAla: 3.447 ± 0.896
0.0AlaCys: 0.0 ± 0.0
2.954AlaAsp: 2.954 ± 1.426
3.939AlaGlu: 3.939 ± 1.902
3.939AlaPhe: 3.939 ± 1.548
3.939AlaGly: 3.939 ± 1.543
3.939AlaHis: 3.939 ± 2.729
0.985AlaIle: 0.985 ± 1.203
1.969AlaLys: 1.969 ± 0.951
10.832AlaLeu: 10.832 ± 5.859
1.477AlaMet: 1.477 ± 0.713
2.954AlaAsn: 2.954 ± 1.426
2.954AlaPro: 2.954 ± 1.269
1.969AlaGln: 1.969 ± 1.342
2.954AlaArg: 2.954 ± 1.426
3.939AlaSer: 3.939 ± 1.083
6.401AlaThr: 6.401 ± 1.109
3.939AlaVal: 3.939 ± 1.657
1.477AlaTrp: 1.477 ± 0.713
2.462AlaTyr: 2.462 ± 0.639
0.0AlaXaa: 0.0 ± 0.0
Cys
1.969CysAla: 1.969 ± 1.806
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.492CysGlu: 0.492 ± 0.238
0.492CysPhe: 0.492 ± 0.238
0.985CysGly: 0.985 ± 0.814
0.492CysHis: 0.492 ± 0.238
0.0CysIle: 0.0 ± 0.0
0.985CysLys: 0.985 ± 1.477
0.985CysLeu: 0.985 ± 0.475
0.0CysMet: 0.0 ± 0.0
0.985CysAsn: 0.985 ± 0.475
0.985CysPro: 0.985 ± 0.814
1.477CysGln: 1.477 ± 1.391
1.969CysArg: 1.969 ± 1.113
0.0CysSer: 0.0 ± 0.0
2.462CysThr: 2.462 ± 1.414
1.477CysVal: 1.477 ± 0.682
0.492CysTrp: 0.492 ± 1.594
0.492CysTyr: 0.492 ± 1.311
0.0CysXaa: 0.0 ± 0.0
Asp
2.462AspAla: 2.462 ± 1.483
0.0AspCys: 0.0 ± 0.0
2.954AspAsp: 2.954 ± 1.426
3.447AspGlu: 3.447 ± 2.294
2.462AspPhe: 2.462 ± 1.189
0.985AspGly: 0.985 ± 1.203
1.969AspHis: 1.969 ± 1.113
1.477AspIle: 1.477 ± 0.713
1.477AspLys: 1.477 ± 0.682
5.416AspLeu: 5.416 ± 1.179
0.985AspMet: 0.985 ± 0.475
0.985AspAsn: 0.985 ± 0.475
3.447AspPro: 3.447 ± 1.278
4.431AspGln: 4.431 ± 2.14
1.477AspArg: 1.477 ± 0.682
3.447AspSer: 3.447 ± 0.896
3.447AspThr: 3.447 ± 1.442
1.969AspVal: 1.969 ± 0.951
0.985AspTrp: 0.985 ± 0.475
1.477AspTyr: 1.477 ± 1.134
0.0AspXaa: 0.0 ± 0.0
Glu
5.416GluAla: 5.416 ± 2.615
0.985GluCys: 0.985 ± 0.475
2.462GluAsp: 2.462 ± 1.189
4.431GluGlu: 4.431 ± 1.285
2.462GluPhe: 2.462 ± 1.483
2.954GluGly: 2.954 ± 0.741
0.985GluHis: 0.985 ± 0.475
4.431GluIle: 4.431 ± 1.061
3.447GluLys: 3.447 ± 1.664
6.401GluLeu: 6.401 ± 1.109
0.985GluMet: 0.985 ± 0.475
3.447GluAsn: 3.447 ± 1.664
2.462GluPro: 2.462 ± 1.189
0.492GluGln: 0.492 ± 0.238
3.939GluArg: 3.939 ± 1.548
3.447GluSer: 3.447 ± 0.896
2.954GluThr: 2.954 ± 1.426
8.863GluVal: 8.863 ± 2.223
0.492GluTrp: 0.492 ± 0.238
0.985GluTyr: 0.985 ± 0.814
0.0GluXaa: 0.0 ± 0.0
Phe
2.462PheAla: 2.462 ± 1.142
0.492PheCys: 0.492 ± 0.238
3.939PheAsp: 3.939 ± 1.902
4.924PheGlu: 4.924 ± 1.498
1.477PhePhe: 1.477 ± 0.713
2.462PheGly: 2.462 ± 1.189
1.477PheHis: 1.477 ± 0.713
3.447PheIle: 3.447 ± 0.979
0.985PheLys: 0.985 ± 0.475
3.447PheLeu: 3.447 ± 0.896
2.462PheMet: 2.462 ± 0.676
2.462PheAsn: 2.462 ± 2.095
1.477PhePro: 1.477 ± 0.682
0.985PheGln: 0.985 ± 0.475
3.447PheArg: 3.447 ± 0.979
3.939PheSer: 3.939 ± 3.028
4.924PheThr: 4.924 ± 1.477
0.985PheVal: 0.985 ± 0.814
1.477PheTrp: 1.477 ± 1.391
1.477PheTyr: 1.477 ± 0.713
0.0PheXaa: 0.0 ± 0.0
Gly
2.462GlyAla: 2.462 ± 1.483
1.969GlyCys: 1.969 ± 1.806
3.939GlyAsp: 3.939 ± 1.477
1.969GlyGlu: 1.969 ± 0.951
3.447GlyPhe: 3.447 ± 1.713
2.954GlyGly: 2.954 ± 1.071
0.985GlyHis: 0.985 ± 0.475
2.954GlyIle: 2.954 ± 1.899
4.431GlyLys: 4.431 ± 1.285
2.462GlyLeu: 2.462 ± 1.483
0.985GlyMet: 0.985 ± 0.451
0.985GlyAsn: 0.985 ± 0.475
2.954GlyPro: 2.954 ± 1.269
1.969GlyGln: 1.969 ± 0.617
3.939GlyArg: 3.939 ± 1.082
3.447GlySer: 3.447 ± 3.21
3.939GlyThr: 3.939 ± 2.192
2.462GlyVal: 2.462 ± 0.639
0.492GlyTrp: 0.492 ± 0.238
0.492GlyTyr: 0.492 ± 0.238
0.0GlyXaa: 0.0 ± 0.0
His
2.954HisAla: 2.954 ± 1.364
0.985HisCys: 0.985 ± 1.203
0.492HisAsp: 0.492 ± 0.238
1.969HisGlu: 1.969 ± 0.617
3.939HisPhe: 3.939 ± 1.543
2.462HisGly: 2.462 ± 1.142
1.477HisHis: 1.477 ± 0.713
0.0HisIle: 0.0 ± 0.0
0.492HisLys: 0.492 ± 0.238
1.969HisLeu: 1.969 ± 0.951
0.985HisMet: 0.985 ± 1.203
0.0HisAsn: 0.0 ± 0.0
1.477HisPro: 1.477 ± 0.682
2.462HisGln: 2.462 ± 1.142
2.462HisArg: 2.462 ± 0.639
2.954HisSer: 2.954 ± 1.826
1.477HisThr: 1.477 ± 0.713
0.492HisVal: 0.492 ± 0.238
0.492HisTrp: 0.492 ± 0.238
0.492HisTyr: 0.492 ± 0.238
0.0HisXaa: 0.0 ± 0.0
Ile
2.954IleAla: 2.954 ± 1.364
0.492IleCys: 0.492 ± 0.987
2.462IleAsp: 2.462 ± 1.189
1.477IleGlu: 1.477 ± 0.713
2.954IlePhe: 2.954 ± 1.426
2.462IleGly: 2.462 ± 1.142
1.969IleHis: 1.969 ± 1.113
1.969IleIle: 1.969 ± 1.113
5.908IleLys: 5.908 ± 1.482
5.908IleLeu: 5.908 ± 1.134
1.969IleMet: 1.969 ± 0.951
3.447IleAsn: 3.447 ± 1.664
4.431IlePro: 4.431 ± 1.233
3.447IleGln: 3.447 ± 1.664
1.477IleArg: 1.477 ± 1.794
3.447IleSer: 3.447 ± 2.412
4.431IleThr: 4.431 ± 2.14
0.492IleVal: 0.492 ± 0.238
0.0IleTrp: 0.0 ± 0.0
1.969IleTyr: 1.969 ± 1.113
0.0IleXaa: 0.0 ± 0.0
Lys
6.401LysAla: 6.401 ± 1.435
1.477LysCys: 1.477 ± 1.959
2.954LysAsp: 2.954 ± 1.899
4.431LysGlu: 4.431 ± 2.14
0.985LysPhe: 0.985 ± 1.477
2.462LysGly: 2.462 ± 2.239
1.477LysHis: 1.477 ± 0.713
3.447LysIle: 3.447 ± 1.664
1.477LysLys: 1.477 ± 0.713
8.863LysLeu: 8.863 ± 3.435
2.462LysMet: 2.462 ± 1.334
1.969LysAsn: 1.969 ± 0.951
3.939LysPro: 3.939 ± 1.082
0.985LysGln: 0.985 ± 0.475
0.492LysArg: 0.492 ± 0.238
3.939LysSer: 3.939 ± 1.543
2.954LysThr: 2.954 ± 1.369
2.954LysVal: 2.954 ± 1.364
0.492LysTrp: 0.492 ± 0.238
1.477LysTyr: 1.477 ± 0.713
0.0LysXaa: 0.0 ± 0.0
Leu
6.893LeuAla: 6.893 ± 3.448
1.969LeuCys: 1.969 ± 0.617
6.401LeuAsp: 6.401 ± 1.627
4.924LeuGlu: 4.924 ± 1.498
3.939LeuPhe: 3.939 ± 1.902
6.893LeuGly: 6.893 ± 2.652
2.462LeuHis: 2.462 ± 1.334
5.908LeuIle: 5.908 ± 1.308
5.908LeuLys: 5.908 ± 1.143
7.386LeuLeu: 7.386 ± 2.721
0.0LeuMet: 0.0 ± 0.0
2.462LeuAsn: 2.462 ± 0.639
10.34LeuPro: 10.34 ± 3.998
3.939LeuGln: 3.939 ± 1.233
6.401LeuArg: 6.401 ± 3.004
8.37LeuSer: 8.37 ± 1.116
10.832LeuThr: 10.832 ± 6.16
4.924LeuVal: 4.924 ± 1.034
0.985LeuTrp: 0.985 ± 0.475
3.939LeuTyr: 3.939 ± 1.082
0.0LeuXaa: 0.0 ± 0.0
Met
0.985MetAla: 0.985 ± 0.475
0.0MetCys: 0.0 ± 0.0
0.985MetAsp: 0.985 ± 0.814
0.985MetGlu: 0.985 ± 0.475
1.477MetPhe: 1.477 ± 1.134
1.477MetGly: 1.477 ± 0.713
0.0MetHis: 0.0 ± 0.0
0.492MetIle: 0.492 ± 0.238
1.969MetLys: 1.969 ± 0.617
3.447MetLeu: 3.447 ± 1.664
0.0MetMet: 0.0 ± 0.0
0.492MetAsn: 0.492 ± 0.238
1.477MetPro: 1.477 ± 1.134
1.969MetGln: 1.969 ± 1.342
0.985MetArg: 0.985 ± 0.475
1.477MetSer: 1.477 ± 1.134
0.985MetThr: 0.985 ± 0.475
0.492MetVal: 0.492 ± 0.238
0.492MetTrp: 0.492 ± 1.594
1.477MetTyr: 1.477 ± 0.713
0.0MetXaa: 0.0 ± 0.0
Asn
2.462AsnAla: 2.462 ± 1.189
1.969AsnCys: 1.969 ± 0.951
1.477AsnAsp: 1.477 ± 0.713
2.954AsnGlu: 2.954 ± 1.426
1.969AsnPhe: 1.969 ± 0.951
1.477AsnGly: 1.477 ± 0.713
0.492AsnHis: 0.492 ± 0.987
3.447AsnIle: 3.447 ± 1.278
0.985AsnLys: 0.985 ± 0.475
2.462AsnLeu: 2.462 ± 1.483
1.969AsnMet: 1.969 ± 1.225
0.492AsnAsn: 0.492 ± 0.238
3.447AsnPro: 3.447 ± 1.278
0.492AsnGln: 0.492 ± 0.238
2.462AsnArg: 2.462 ± 2.041
4.431AsnSer: 4.431 ± 2.097
1.969AsnThr: 1.969 ± 0.951
1.969AsnVal: 1.969 ± 0.951
0.0AsnTrp: 0.0 ± 0.0
1.969AsnTyr: 1.969 ± 0.951
0.0AsnXaa: 0.0 ± 0.0
Pro
5.908ProAla: 5.908 ± 1.308
0.492ProCys: 0.492 ± 0.238
2.462ProAsp: 2.462 ± 0.639
4.924ProGlu: 4.924 ± 1.09
1.477ProPhe: 1.477 ± 1.134
2.954ProGly: 2.954 ± 0.741
2.462ProHis: 2.462 ± 2.095
4.431ProIle: 4.431 ± 1.285
5.908ProLys: 5.908 ± 2.437
7.878ProLeu: 7.878 ± 4.321
0.492ProMet: 0.492 ± 0.238
3.447ProAsn: 3.447 ± 3.21
4.431ProPro: 4.431 ± 0.959
1.477ProGln: 1.477 ± 1.134
1.477ProArg: 1.477 ± 0.713
5.416ProSer: 5.416 ± 1.548
5.416ProThr: 5.416 ± 4.247
4.924ProVal: 4.924 ± 1.954
0.985ProTrp: 0.985 ± 0.475
1.477ProTyr: 1.477 ± 0.713
0.0ProXaa: 0.0 ± 0.0
Gln
1.477GlnAla: 1.477 ± 1.391
0.0GlnCys: 0.0 ± 0.0
2.462GlnAsp: 2.462 ± 1.189
2.462GlnGlu: 2.462 ± 0.639
0.985GlnPhe: 0.985 ± 0.475
2.462GlnGly: 2.462 ± 1.673
0.492GlnHis: 0.492 ± 0.238
4.431GlnIle: 4.431 ± 2.14
1.969GlnLys: 1.969 ± 0.951
4.924GlnLeu: 4.924 ± 1.837
0.985GlnMet: 0.985 ± 0.475
0.985GlnAsn: 0.985 ± 0.814
5.416GlnPro: 5.416 ± 1.179
1.477GlnGln: 1.477 ± 0.713
0.985GlnArg: 0.985 ± 1.477
3.939GlnSer: 3.939 ± 2.684
3.447GlnThr: 3.447 ± 1.278
0.985GlnVal: 0.985 ± 0.475
1.477GlnTrp: 1.477 ± 0.713
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
4.431ArgAla: 4.431 ± 0.959
0.0ArgCys: 0.0 ± 0.0
2.954ArgAsp: 2.954 ± 1.899
4.924ArgGlu: 4.924 ± 1.278
2.462ArgPhe: 2.462 ± 1.673
2.462ArgGly: 2.462 ± 0.639
0.985ArgHis: 0.985 ± 0.814
3.447ArgIle: 3.447 ± 0.896
0.985ArgLys: 0.985 ± 1.477
7.386ArgLeu: 7.386 ± 0.55
0.0ArgMet: 0.0 ± 0.0
1.969ArgAsn: 1.969 ± 0.951
2.954ArgPro: 2.954 ± 2.649
2.954ArgGln: 2.954 ± 1.426
3.939ArgArg: 3.939 ± 1.477
3.939ArgSer: 3.939 ± 3.621
4.431ArgThr: 4.431 ± 1.061
0.985ArgVal: 0.985 ± 0.475
0.492ArgTrp: 0.492 ± 1.594
1.969ArgTyr: 1.969 ± 0.951
0.0ArgXaa: 0.0 ± 0.0
Ser
2.462SerAla: 2.462 ± 1.142
0.492SerCys: 0.492 ± 0.238
1.477SerAsp: 1.477 ± 0.682
3.939SerGlu: 3.939 ± 1.082
2.954SerPhe: 2.954 ± 1.426
4.431SerGly: 4.431 ± 2.966
0.492SerHis: 0.492 ± 0.987
3.447SerIle: 3.447 ± 0.896
5.908SerLys: 5.908 ± 1.143
9.355SerLeu: 9.355 ± 2.909
0.985SerMet: 0.985 ± 1.203
3.447SerAsn: 3.447 ± 1.278
5.416SerPro: 5.416 ± 2.539
2.462SerGln: 2.462 ± 1.483
5.908SerArg: 5.908 ± 4.102
7.878SerSer: 7.878 ± 7.46
4.924SerThr: 4.924 ± 5.58
2.462SerVal: 2.462 ± 2.041
2.462SerTrp: 2.462 ± 2.041
2.462SerTyr: 2.462 ± 0.639
0.0SerXaa: 0.0 ± 0.0
Thr
4.431ThrAla: 4.431 ± 1.234
1.969ThrCys: 1.969 ± 2.953
1.477ThrAsp: 1.477 ± 1.546
5.908ThrGlu: 5.908 ± 2.192
5.416ThrPhe: 5.416 ± 1.548
3.447ThrGly: 3.447 ± 1.442
4.431ThrHis: 4.431 ± 1.643
4.431ThrIle: 4.431 ± 2.14
4.431ThrLys: 4.431 ± 2.735
7.878ThrLeu: 7.878 ± 5.932
1.477ThrMet: 1.477 ± 0.682
4.924ThrAsn: 4.924 ± 1.267
6.401ThrPro: 6.401 ± 1.779
2.462ThrGln: 2.462 ± 4.538
2.462ThrArg: 2.462 ± 1.203
1.969ThrSer: 1.969 ± 1.629
2.462ThrThr: 2.462 ± 1.189
4.924ThrVal: 4.924 ± 4.072
0.0ThrTrp: 0.0 ± 0.0
2.462ThrTyr: 2.462 ± 1.142
0.0ThrXaa: 0.0 ± 0.0
Val
1.969ValAla: 1.969 ± 0.617
2.462ValCys: 2.462 ± 2.239
0.492ValAsp: 0.492 ± 0.987
1.969ValGlu: 1.969 ± 0.951
3.447ValPhe: 3.447 ± 3.419
0.492ValGly: 0.492 ± 0.238
3.447ValHis: 3.447 ± 0.896
2.462ValIle: 2.462 ± 1.203
3.939ValLys: 3.939 ± 1.083
4.431ValLeu: 4.431 ± 1.233
2.462ValMet: 2.462 ± 1.189
0.492ValAsn: 0.492 ± 0.238
1.477ValPro: 1.477 ± 0.682
5.416ValGln: 5.416 ± 1.718
4.431ValArg: 4.431 ± 2.046
4.924ValSer: 4.924 ± 1.498
2.462ValThr: 2.462 ± 1.203
3.447ValVal: 3.447 ± 1.278
0.492ValTrp: 0.492 ± 0.238
1.477ValTyr: 1.477 ± 0.713
0.0ValXaa: 0.0 ± 0.0
Trp
2.954TrpAla: 2.954 ± 1.369
0.0TrpCys: 0.0 ± 0.0
2.462TrpAsp: 2.462 ± 1.414
0.985TrpGlu: 0.985 ± 1.477
0.985TrpPhe: 0.985 ± 0.475
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.492TrpIle: 0.492 ± 0.238
0.0TrpLys: 0.0 ± 0.0
0.492TrpLeu: 0.492 ± 0.238
0.0TrpMet: 0.0 ± 0.0
0.492TrpAsn: 0.492 ± 0.238
0.985TrpPro: 0.985 ± 2.128
0.0TrpGln: 0.0 ± 0.0
0.985TrpArg: 0.985 ± 0.475
0.492TrpSer: 0.492 ± 1.594
0.985TrpThr: 0.985 ± 0.475
1.969TrpVal: 1.969 ± 0.951
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.969TyrAla: 1.969 ± 0.951
0.985TyrCys: 0.985 ± 0.475
0.492TyrAsp: 0.492 ± 0.238
0.985TyrGlu: 0.985 ± 0.475
1.477TyrPhe: 1.477 ± 0.713
1.477TyrGly: 1.477 ± 0.713
0.0TyrHis: 0.0 ± 0.0
1.969TyrIle: 1.969 ± 1.629
2.462TyrLys: 2.462 ± 1.189
2.954TyrLeu: 2.954 ± 1.218
0.492TyrMet: 0.492 ± 0.238
2.954TyrAsn: 2.954 ± 1.218
1.969TyrPro: 1.969 ± 0.951
0.492TyrGln: 0.492 ± 0.238
1.477TyrArg: 1.477 ± 1.134
1.969TyrSer: 1.969 ± 0.617
2.462TyrThr: 2.462 ± 1.203
1.477TyrVal: 1.477 ± 0.713
0.492TyrTrp: 0.492 ± 0.238
1.477TyrTyr: 1.477 ± 0.713
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (2032 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski