Amino acid dipepetide frequency for Wenzhou picorna-like virus 20

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.678AlaAla: 3.678 ± 1.8
2.861AlaCys: 2.861 ± 0.974
2.043AlaAsp: 2.043 ± 1.431
2.861AlaGlu: 2.861 ± 0.968
2.861AlaPhe: 2.861 ± 0.327
6.539AlaGly: 6.539 ± 2.121
0.817AlaHis: 0.817 ± 0.184
2.861AlaIle: 2.861 ± 0.327
2.043AlaLys: 2.043 ± 1.431
4.087AlaLeu: 4.087 ± 0.374
0.817AlaMet: 0.817 ± 0.463
3.269AlaAsn: 3.269 ± 0.737
4.495AlaPro: 4.495 ± 3.279
1.226AlaGln: 1.226 ± 0.6
2.861AlaArg: 2.861 ± 0.974
4.495AlaSer: 4.495 ± 0.042
4.904AlaThr: 4.904 ± 3.047
2.452AlaVal: 2.452 ± 0.552
0.409AlaTrp: 0.409 ± 0.232
2.861AlaTyr: 2.861 ± 0.321
0.0AlaXaa: 0.0 ± 0.0
Cys
2.452CysAla: 2.452 ± 0.742
0.817CysCys: 0.817 ± 0.463
0.817CysAsp: 0.817 ± 0.184
0.409CysGlu: 0.409 ± 0.232
1.635CysPhe: 1.635 ± 0.368
0.409CysGly: 0.409 ± 0.232
0.409CysHis: 0.409 ± 0.232
0.409CysIle: 0.409 ± 0.232
1.226CysLys: 1.226 ± 0.695
0.0CysLeu: 0.0 ± 0.0
0.409CysMet: 0.409 ± 0.232
1.226CysAsn: 1.226 ± 0.047
1.635CysPro: 1.635 ± 0.927
0.0CysGln: 0.0 ± 0.0
0.817CysArg: 0.817 ± 0.184
1.226CysSer: 1.226 ± 0.6
1.635CysThr: 1.635 ± 0.927
1.226CysVal: 1.226 ± 0.047
0.409CysTrp: 0.409 ± 0.416
1.226CysTyr: 1.226 ± 0.695
0.0CysXaa: 0.0 ± 0.0
Asp
1.635AspAla: 1.635 ± 0.279
0.817AspCys: 0.817 ± 0.184
4.904AspAsp: 4.904 ± 1.105
3.269AspGlu: 3.269 ± 1.206
2.452AspPhe: 2.452 ± 0.742
1.635AspGly: 1.635 ± 1.016
1.635AspHis: 1.635 ± 0.927
5.313AspIle: 5.313 ± 2.168
4.495AspLys: 4.495 ± 2.548
4.495AspLeu: 4.495 ± 0.689
2.043AspMet: 2.043 ± 0.137
2.452AspAsn: 2.452 ± 0.095
3.678AspPro: 3.678 ± 1.8
2.452AspGln: 2.452 ± 0.742
1.226AspArg: 1.226 ± 0.695
4.904AspSer: 4.904 ± 0.837
4.087AspThr: 4.087 ± 1.568
3.678AspVal: 3.678 ± 1.152
1.226AspTrp: 1.226 ± 0.695
3.269AspTyr: 3.269 ± 0.558
0.0AspXaa: 0.0 ± 0.0
Glu
3.269GluAla: 3.269 ± 0.089
0.0GluCys: 0.0 ± 0.0
3.678GluAsp: 3.678 ± 0.142
2.043GluGlu: 2.043 ± 1.158
2.861GluPhe: 2.861 ± 0.974
2.452GluGly: 2.452 ± 0.095
1.635GluHis: 1.635 ± 0.927
3.678GluIle: 3.678 ± 2.447
3.678GluLys: 3.678 ± 0.79
4.087GluLeu: 4.087 ± 0.374
1.635GluMet: 1.635 ± 0.927
1.226GluAsn: 1.226 ± 0.047
2.043GluPro: 2.043 ± 1.158
2.043GluGln: 2.043 ± 0.784
1.635GluArg: 1.635 ± 0.279
2.043GluSer: 2.043 ± 0.784
2.043GluThr: 2.043 ± 0.137
4.087GluVal: 4.087 ± 0.921
1.226GluTrp: 1.226 ± 0.695
2.452GluTyr: 2.452 ± 1.39
0.0GluXaa: 0.0 ± 0.0
Phe
2.861PheAla: 2.861 ± 1.616
1.226PheCys: 1.226 ± 0.047
4.904PheAsp: 4.904 ± 2.132
1.226PheGlu: 1.226 ± 0.695
2.043PhePhe: 2.043 ± 0.511
3.678PheGly: 3.678 ± 0.142
1.226PheHis: 1.226 ± 0.047
4.087PheIle: 4.087 ± 0.374
2.452PheLys: 2.452 ± 0.742
5.721PheLeu: 5.721 ± 1.301
0.817PheMet: 0.817 ± 0.463
3.269PheAsn: 3.269 ± 0.089
1.226PhePro: 1.226 ± 0.6
1.226PheGln: 1.226 ± 0.047
1.635PheArg: 1.635 ± 0.927
6.13PheSer: 6.13 ± 0.885
4.087PheThr: 4.087 ± 1.021
3.269PheVal: 3.269 ± 0.558
0.409PheTrp: 0.409 ± 0.232
2.043PheTyr: 2.043 ± 0.137
0.0PheXaa: 0.0 ± 0.0
Gly
6.539GlyAla: 6.539 ± 2.768
0.0GlyCys: 0.0 ± 0.0
4.087GlyAsp: 4.087 ± 0.273
2.043GlyGlu: 2.043 ± 0.137
2.861GlyPhe: 2.861 ± 0.321
3.269GlyGly: 3.269 ± 1.384
0.409GlyHis: 0.409 ± 0.416
2.861GlyIle: 2.861 ± 0.974
6.539GlyLys: 6.539 ± 1.116
2.861GlyLeu: 2.861 ± 0.321
2.043GlyMet: 2.043 ± 0.137
3.678GlyAsn: 3.678 ± 0.142
0.817GlyPro: 0.817 ± 0.463
1.226GlyGln: 1.226 ± 1.247
1.635GlyArg: 1.635 ± 0.279
6.539GlySer: 6.539 ± 0.826
1.226GlyThr: 1.226 ± 0.047
3.678GlyVal: 3.678 ± 1.8
1.226GlyTrp: 1.226 ± 0.6
2.452GlyTyr: 2.452 ± 0.742
0.0GlyXaa: 0.0 ± 0.0
His
1.226HisAla: 1.226 ± 0.047
0.0HisCys: 0.0 ± 0.0
1.635HisAsp: 1.635 ± 0.279
0.0HisGlu: 0.0 ± 0.0
1.635HisPhe: 1.635 ± 0.279
0.409HisGly: 0.409 ± 0.232
0.817HisHis: 0.817 ± 0.463
0.817HisIle: 0.817 ± 0.463
1.226HisLys: 1.226 ± 0.047
4.087HisLeu: 4.087 ± 0.374
0.409HisMet: 0.409 ± 0.416
1.635HisAsn: 1.635 ± 0.279
0.817HisPro: 0.817 ± 0.463
0.409HisGln: 0.409 ± 0.232
0.409HisArg: 0.409 ± 0.232
0.409HisSer: 0.409 ± 0.416
0.817HisThr: 0.817 ± 0.463
1.226HisVal: 1.226 ± 0.695
0.817HisTrp: 0.817 ± 0.184
0.817HisTyr: 0.817 ± 0.184
0.0HisXaa: 0.0 ± 0.0
Ile
2.861IleAla: 2.861 ± 0.321
1.635IleCys: 1.635 ± 0.279
5.313IleAsp: 5.313 ± 0.226
4.087IleGlu: 4.087 ± 0.921
4.087IlePhe: 4.087 ± 1.021
3.269IleGly: 3.269 ± 0.737
0.817IleHis: 0.817 ± 0.184
3.678IleIle: 3.678 ± 0.79
4.495IleLys: 4.495 ± 1.253
4.904IleLeu: 4.904 ± 0.837
0.817IleMet: 0.817 ± 0.463
4.087IleAsn: 4.087 ± 1.669
3.678IlePro: 3.678 ± 0.142
2.861IleGln: 2.861 ± 0.327
1.635IleArg: 1.635 ± 0.927
6.13IleSer: 6.13 ± 2.352
3.269IleThr: 3.269 ± 1.384
4.904IleVal: 4.904 ± 0.19
0.817IleTrp: 0.817 ± 0.184
2.043IleTyr: 2.043 ± 0.137
0.0IleXaa: 0.0 ± 0.0
Lys
2.861LysAla: 2.861 ± 0.327
1.635LysCys: 1.635 ± 0.368
2.043LysAsp: 2.043 ± 0.511
4.495LysGlu: 4.495 ± 0.042
3.269LysPhe: 3.269 ± 1.206
2.043LysGly: 2.043 ± 0.511
1.226LysHis: 1.226 ± 0.695
6.947LysIle: 6.947 ± 0.701
4.087LysLys: 4.087 ± 1.669
5.721LysLeu: 5.721 ± 0.653
2.452LysMet: 2.452 ± 0.095
3.678LysAsn: 3.678 ± 0.79
2.043LysPro: 2.043 ± 1.158
1.226LysGln: 1.226 ± 0.695
4.087LysArg: 4.087 ± 0.374
5.313LysSer: 5.313 ± 1.069
2.452LysThr: 2.452 ± 0.742
4.087LysVal: 4.087 ± 1.021
0.409LysTrp: 0.409 ± 0.232
4.087LysTyr: 4.087 ± 0.374
0.0LysXaa: 0.0 ± 0.0
Leu
4.904LeuAla: 4.904 ± 1.752
0.0LeuCys: 0.0 ± 0.0
6.947LeuAsp: 6.947 ± 0.701
6.539LeuGlu: 6.539 ± 1.764
4.904LeuPhe: 4.904 ± 1.485
4.087LeuGly: 4.087 ± 0.374
0.817LeuHis: 0.817 ± 0.463
2.043LeuIle: 2.043 ± 0.511
6.13LeuLys: 6.13 ± 1.532
5.721LeuLeu: 5.721 ± 1.948
2.452LeuMet: 2.452 ± 0.145
4.495LeuAsn: 4.495 ± 0.606
2.452LeuPro: 2.452 ± 1.39
3.269LeuGln: 3.269 ± 0.089
2.452LeuArg: 2.452 ± 1.2
4.904LeuSer: 4.904 ± 1.105
8.991LeuThr: 8.991 ± 2.026
4.904LeuVal: 4.904 ± 0.837
1.635LeuTrp: 1.635 ± 0.368
2.043LeuTyr: 2.043 ± 0.137
0.0LeuXaa: 0.0 ± 0.0
Met
1.226MetAla: 1.226 ± 0.047
1.635MetCys: 1.635 ± 0.279
1.226MetAsp: 1.226 ± 0.047
1.226MetGlu: 1.226 ± 0.695
2.043MetPhe: 2.043 ± 0.511
0.409MetGly: 0.409 ± 0.232
0.0MetHis: 0.0 ± 0.0
0.409MetIle: 0.409 ± 0.232
2.043MetLys: 2.043 ± 0.511
0.817MetLeu: 0.817 ± 0.463
0.817MetMet: 0.817 ± 0.463
1.635MetAsn: 1.635 ± 0.368
0.409MetPro: 0.409 ± 0.416
0.817MetGln: 0.817 ± 0.184
0.817MetArg: 0.817 ± 0.832
2.043MetSer: 2.043 ± 0.511
2.861MetThr: 2.861 ± 0.974
1.635MetVal: 1.635 ± 0.279
0.0MetTrp: 0.0 ± 0.0
2.043MetTyr: 2.043 ± 0.784
0.0MetXaa: 0.0 ± 0.0
Asn
3.678AsnAla: 3.678 ± 0.142
2.043AsnCys: 2.043 ± 1.158
2.861AsnAsp: 2.861 ± 0.321
2.043AsnGlu: 2.043 ± 0.784
3.269AsnPhe: 3.269 ± 0.089
3.269AsnGly: 3.269 ± 1.853
0.409AsnHis: 0.409 ± 0.232
7.356AsnIle: 7.356 ± 0.932
3.678AsnLys: 3.678 ± 0.142
4.495AsnLeu: 4.495 ± 1.336
0.409AsnMet: 0.409 ± 0.232
4.495AsnAsn: 4.495 ± 0.606
4.904AsnPro: 4.904 ± 0.457
2.043AsnGln: 2.043 ± 0.137
1.635AsnArg: 1.635 ± 0.927
6.13AsnSer: 6.13 ± 0.885
4.087AsnThr: 4.087 ± 1.568
5.721AsnVal: 5.721 ± 1.289
0.0AsnTrp: 0.0 ± 0.0
2.043AsnTyr: 2.043 ± 0.511
0.0AsnXaa: 0.0 ± 0.0
Pro
0.817ProAla: 0.817 ± 0.184
0.817ProCys: 0.817 ± 0.184
2.043ProAsp: 2.043 ± 0.511
1.226ProGlu: 1.226 ± 0.695
3.678ProPhe: 3.678 ± 1.152
2.043ProGly: 2.043 ± 0.784
1.635ProHis: 1.635 ± 0.279
3.678ProIle: 3.678 ± 0.142
2.452ProLys: 2.452 ± 1.39
4.087ProLeu: 4.087 ± 1.669
2.043ProMet: 2.043 ± 0.137
4.087ProAsn: 4.087 ± 0.374
2.043ProPro: 2.043 ± 0.137
0.817ProGln: 0.817 ± 0.463
2.452ProArg: 2.452 ± 0.095
4.495ProSer: 4.495 ± 0.689
3.678ProThr: 3.678 ± 2.447
4.904ProVal: 4.904 ± 1.105
0.817ProTrp: 0.817 ± 0.184
1.226ProTyr: 1.226 ± 0.6
0.0ProXaa: 0.0 ± 0.0
Gln
1.635GlnAla: 1.635 ± 0.279
0.409GlnCys: 0.409 ± 0.232
1.635GlnAsp: 1.635 ± 0.368
0.409GlnGlu: 0.409 ± 0.416
0.817GlnPhe: 0.817 ± 0.832
1.226GlnGly: 1.226 ± 0.047
0.817GlnHis: 0.817 ± 0.463
2.861GlnIle: 2.861 ± 0.974
2.452GlnLys: 2.452 ± 0.742
2.452GlnLeu: 2.452 ± 0.742
0.409GlnMet: 0.409 ± 0.416
0.817GlnAsn: 0.817 ± 0.463
2.452GlnPro: 2.452 ± 0.095
1.226GlnGln: 1.226 ± 0.047
1.226GlnArg: 1.226 ± 0.695
2.452GlnSer: 2.452 ± 0.095
2.043GlnThr: 2.043 ± 0.784
0.409GlnVal: 0.409 ± 0.416
0.817GlnTrp: 0.817 ± 0.832
1.635GlnTyr: 1.635 ± 0.368
0.0GlnXaa: 0.0 ± 0.0
Arg
1.635ArgAla: 1.635 ± 0.368
0.817ArgCys: 0.817 ± 0.184
2.043ArgAsp: 2.043 ± 0.137
2.043ArgGlu: 2.043 ± 1.158
1.635ArgPhe: 1.635 ± 0.279
2.043ArgGly: 2.043 ± 1.431
0.409ArgHis: 0.409 ± 0.232
2.452ArgIle: 2.452 ± 0.095
2.452ArgLys: 2.452 ± 0.742
2.043ArgLeu: 2.043 ± 0.511
1.635ArgMet: 1.635 ± 0.368
2.452ArgAsn: 2.452 ± 1.2
2.861ArgPro: 2.861 ± 0.327
0.409ArgGln: 0.409 ± 0.416
1.226ArgArg: 1.226 ± 0.047
2.861ArgSer: 2.861 ± 0.327
1.635ArgThr: 1.635 ± 0.279
2.861ArgVal: 2.861 ± 0.321
0.409ArgTrp: 0.409 ± 0.232
2.043ArgTyr: 2.043 ± 0.511
0.0ArgXaa: 0.0 ± 0.0
Ser
4.495SerAla: 4.495 ± 0.689
0.817SerCys: 0.817 ± 0.463
3.678SerAsp: 3.678 ± 1.8
3.269SerGlu: 3.269 ± 0.089
3.678SerPhe: 3.678 ± 2.085
7.765SerGly: 7.765 ± 0.778
0.817SerHis: 0.817 ± 0.184
6.13SerIle: 6.13 ± 0.885
4.495SerLys: 4.495 ± 0.689
6.539SerLeu: 6.539 ± 1.473
0.817SerMet: 0.817 ± 0.297
4.087SerAsn: 4.087 ± 0.921
5.313SerPro: 5.313 ± 0.873
3.269SerGln: 3.269 ± 1.206
2.452SerArg: 2.452 ± 1.2
6.539SerSer: 6.539 ± 0.826
6.539SerThr: 6.539 ± 1.764
6.947SerVal: 6.947 ± 1.241
0.817SerTrp: 0.817 ± 0.184
4.087SerTyr: 4.087 ± 0.374
0.0SerXaa: 0.0 ± 0.0
Thr
7.356ThrAla: 7.356 ± 2.305
1.226ThrCys: 1.226 ± 0.695
2.861ThrAsp: 2.861 ± 0.327
4.495ThrGlu: 4.495 ± 1.336
2.861ThrPhe: 2.861 ± 0.321
2.861ThrGly: 2.861 ± 0.321
2.861ThrHis: 2.861 ± 0.321
4.495ThrIle: 4.495 ± 0.042
4.495ThrLys: 4.495 ± 0.042
4.904ThrLeu: 4.904 ± 0.19
1.635ThrMet: 1.635 ± 0.279
4.495ThrAsn: 4.495 ± 1.336
2.452ThrPro: 2.452 ± 0.095
0.409ThrGln: 0.409 ± 0.232
2.043ThrArg: 2.043 ± 0.784
4.904ThrSer: 4.904 ± 1.105
4.904ThrThr: 4.904 ± 2.4
4.495ThrVal: 4.495 ± 1.336
1.226ThrTrp: 1.226 ± 0.6
2.861ThrTyr: 2.861 ± 0.968
0.0ThrXaa: 0.0 ± 0.0
Val
4.495ValAla: 4.495 ± 1.984
1.635ValCys: 1.635 ± 0.279
3.269ValAsp: 3.269 ± 0.737
3.678ValGlu: 3.678 ± 0.79
4.087ValPhe: 4.087 ± 1.021
3.269ValGly: 3.269 ± 0.737
0.817ValHis: 0.817 ± 0.184
3.678ValIle: 3.678 ± 0.505
2.452ValLys: 2.452 ± 0.095
6.539ValLeu: 6.539 ± 0.826
0.817ValMet: 0.817 ± 0.463
6.539ValAsn: 6.539 ± 0.469
4.087ValPro: 4.087 ± 0.921
1.635ValGln: 1.635 ± 0.279
2.452ValArg: 2.452 ± 1.847
6.947ValSer: 6.947 ± 0.594
4.087ValThr: 4.087 ± 1.568
4.087ValVal: 4.087 ± 0.273
0.409ValTrp: 0.409 ± 0.416
2.452ValTyr: 2.452 ± 0.742
0.0ValXaa: 0.0 ± 0.0
Trp
0.409TrpAla: 0.409 ± 0.416
0.0TrpCys: 0.0 ± 0.0
0.817TrpAsp: 0.817 ± 0.463
0.409TrpGlu: 0.409 ± 0.416
0.817TrpPhe: 0.817 ± 0.463
0.817TrpGly: 0.817 ± 0.184
0.409TrpHis: 0.409 ± 0.416
0.409TrpIle: 0.409 ± 0.416
0.409TrpLys: 0.409 ± 0.416
2.043TrpLeu: 2.043 ± 0.511
0.0TrpMet: 0.0 ± 0.0
1.226TrpAsn: 1.226 ± 0.047
0.817TrpPro: 0.817 ± 0.463
0.817TrpGln: 0.817 ± 0.832
1.226TrpArg: 1.226 ± 0.047
0.817TrpSer: 0.817 ± 0.184
1.635TrpThr: 1.635 ± 0.368
0.409TrpVal: 0.409 ± 0.232
0.0TrpTrp: 0.0 ± 0.0
0.817TrpTyr: 0.817 ± 0.184
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.409TyrAla: 0.409 ± 0.416
0.0TyrCys: 0.0 ± 0.0
2.861TyrAsp: 2.861 ± 0.974
2.043TyrGlu: 2.043 ± 0.511
2.043TyrPhe: 2.043 ± 0.137
4.495TyrGly: 4.495 ± 0.689
1.635TyrHis: 1.635 ± 0.279
1.226TyrIle: 1.226 ± 0.6
2.861TyrLys: 2.861 ± 0.974
4.087TyrLeu: 4.087 ± 0.273
1.226TyrMet: 1.226 ± 0.047
5.721TyrAsn: 5.721 ± 0.653
0.817TyrPro: 0.817 ± 0.184
0.817TyrGln: 0.817 ± 0.463
2.043TyrArg: 2.043 ± 0.137
3.678TyrSer: 3.678 ± 0.79
2.861TyrThr: 2.861 ± 0.321
2.452TyrVal: 2.452 ± 0.095
1.226TyrTrp: 1.226 ± 0.047
0.817TyrTyr: 0.817 ± 0.463
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2448 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski