Amino acid dipeptide frequency for Zirqa virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred with equal probability in proteins, each would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.
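The counting and the red/blue rule described above can be sketched in Python. This is a minimal illustration, not the Proteome-pI implementation: the function names and toy sequences are hypothetical, one-letter codes are used instead of the three-letter codes of the table, and the source does not state whether dipeptide windows may span protein boundaries (this sketch keeps them within each protein). The expected value is the uniform 1/400 baseline, i.e. 2.5‰.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # 20 standard residues, one-letter codes

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters becomes 'X' (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2, kept within a single protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {dp: 1000.0 * n / total for dp, n in counts.items()} if total else {}

def classify(freq_per_mille, alphabet_size=20):
    """Compare a frequency with the uniform expectation (2.5 per mille for 400 pairs)."""
    expected = 1000.0 / alphabet_size ** 2
    if freq_per_mille > expected:
        return "over"    # would be marked red
    if freq_per_mille < expected:
        return "under"   # would be marked blue
    return "expected"

freqs = dipeptide_per_mille(["AAAC", "LSLS"])  # toy sequences, not real data
print(classify(11.272), classify(0.342))       # LeuSer and AlaCys values from the table
```

With `alphabet_size=21` the same function uses the 1/441 baseline instead, which may matter if Xaa is included in the comparison.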

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
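As a small worked example of the unit conversion, the following Python sketch turns a per mille value into a fraction and into a percentage. The helper names are hypothetical; the input is the LeuSer value from the table.

```python
def per_mille_to_fraction(value):
    """Per mille -> plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Per mille -> percent (divide by 10)."""
    return value / 10.0

leu_ser = 11.272  # LeuSer frequency from the table, in per mille
print(round(per_mille_to_fraction(leu_ser), 6))  # fraction of all dipeptides
print(round(per_mille_to_percent(leu_ser), 4))   # same value as a percentage
```

The same conversion shows that the uniform expectation of 0.25% corresponds to 2.5‰ on the scale used in the table.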

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.879 ± 1.069
AlaCys: 0.342 ± 0.172
AlaAsp: 2.05 ± 1.207
AlaGlu: 3.074 ± 0.827
AlaPhe: 2.562 ± 0.545
AlaGly: 2.391 ± 0.534
AlaHis: 0.854 ± 0.453
AlaIle: 3.587 ± 1.146
AlaLys: 3.416 ± 1.158
AlaLeu: 4.953 ± 2.423
AlaMet: 1.537 ± 0.536
AlaAsn: 0.683 ± 0.387
AlaPro: 0.854 ± 0.453
AlaGln: 1.879 ± 0.553
AlaArg: 2.391 ± 0.833
AlaSer: 3.587 ± 0.658
AlaThr: 2.391 ± 0.521
AlaVal: 3.757 ± 1.048
AlaTrp: 0.512 ± 0.405
AlaTyr: 2.05 ± 1.022
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.537 ± 0.667
CysCys: 1.025 ± 0.288
CysAsp: 1.025 ± 0.515
CysGlu: 0.683 ± 0.343
CysPhe: 0.854 ± 0.298
CysGly: 1.366 ± 1.361
CysHis: 0.854 ± 0.484
CysIle: 1.025 ± 0.263
CysLys: 1.537 ± 0.914
CysLeu: 2.733 ± 1.372
CysMet: 0.171 ± 0.239
CysAsn: 1.708 ± 1.147
CysPro: 2.05 ± 1.317
CysGln: 0.683 ± 0.387
CysArg: 0.854 ± 0.301
CysSer: 1.879 ± 0.581
CysThr: 1.879 ± 1.381
CysVal: 1.025 ± 0.288
CysTrp: 0.854 ± 0.573
CysTyr: 0.683 ± 0.343
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.391 ± 1.046
AspCys: 1.366 ± 0.436
AspAsp: 3.245 ± 1.298
AspGlu: 3.757 ± 0.082
AspPhe: 2.391 ± 0.765
AspGly: 3.245 ± 0.899
AspHis: 0.683 ± 0.175
AspIle: 4.441 ± 0.984
AspLys: 4.953 ± 0.315
AspLeu: 5.465 ± 0.957
AspMet: 1.537 ± 1.154
AspAsn: 2.22 ± 0.586
AspPro: 1.025 ± 0.263
AspGln: 2.22 ± 0.995
AspArg: 2.562 ± 0.385
AspSer: 2.904 ± 0.23
AspThr: 2.05 ± 0.057
AspVal: 4.441 ± 0.952
AspTrp: 0.683 ± 0.357
AspTyr: 1.708 ± 0.243
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.904 ± 0.934
GluCys: 1.879 ± 0.477
GluAsp: 6.149 ± 1.181
GluGlu: 3.587 ± 1.488
GluPhe: 2.562 ± 0.542
GluGly: 3.245 ± 0.825
GluHis: 2.22 ± 0.935
GluIle: 4.27 ± 0.061
GluLys: 3.416 ± 0.353
GluLeu: 7.003 ± 1.722
GluMet: 2.562 ± 0.857
GluAsn: 3.928 ± 0.999
GluPro: 3.245 ± 0.825
GluGln: 2.22 ± 0.713
GluArg: 3.416 ± 1.021
GluSer: 3.587 ± 1.117
GluThr: 4.27 ± 1.503
GluVal: 4.953 ± 0.342
GluTrp: 0.683 ± 0.343
GluTyr: 1.879 ± 0.466
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.366 ± 1.232
PheCys: 1.025 ± 0.288
PheAsp: 2.05 ± 0.378
PheGlu: 2.22 ± 0.614
PhePhe: 3.074 ± 0.302
PheGly: 2.562 ± 0.685
PheHis: 0.683 ± 0.343
PheIle: 3.416 ± 0.889
PheLys: 3.587 ± 1.192
PheLeu: 4.953 ± 1.134
PheMet: 0.854 ± 0.484
PheAsn: 1.879 ± 0.501
PhePro: 2.22 ± 0.614
PheGln: 1.537 ± 1.154
PheArg: 1.196 ± 0.725
PheSer: 4.953 ± 0.754
PheThr: 2.562 ± 0.18
PheVal: 1.708 ± 0.696
PheTrp: 0.342 ± 0.172
PheTyr: 1.879 ± 0.182
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.562 ± 0.904
GlyCys: 1.537 ± 1.847
GlyAsp: 2.391 ± 0.539
GlyGlu: 2.22 ± 0.514
GlyPhe: 2.05 ± 1.207
GlyGly: 1.879 ± 0.787
GlyHis: 1.366 ± 0.508
GlyIle: 3.074 ± 0.829
GlyLys: 3.928 ± 0.638
GlyLeu: 5.124 ± 0.49
GlyMet: 0.854 ± 0.94
GlyAsn: 2.733 ± 1.555
GlyPro: 0.683 ± 0.357
GlyGln: 2.05 ± 0.057
GlyArg: 2.733 ± 0.589
GlySer: 5.124 ± 0.906
GlyThr: 3.928 ± 2.113
GlyVal: 2.562 ± 0.685
GlyTrp: 0.683 ± 0.823
GlyTyr: 1.537 ± 0.124
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.708 ± 0.696
HisCys: 1.025 ± 0.515
HisAsp: 0.512 ± 0.405
HisGlu: 1.025 ± 0.327
HisPhe: 1.366 ± 0.44
HisGly: 1.537 ± 0.914
HisHis: 0.683 ± 0.343
HisIle: 1.025 ± 0.288
HisLys: 1.366 ± 0.686
HisLeu: 2.05 ± 0.72
HisMet: 1.366 ± 0.805
HisAsn: 0.512 ± 0.404
HisPro: 1.196 ± 0.725
HisGln: 0.512 ± 0.405
HisArg: 1.366 ± 0.35
HisSer: 1.708 ± 0.122
HisThr: 1.537 ± 0.639
HisVal: 0.683 ± 0.343
HisTrp: 0.171 ± 0.239
HisTyr: 0.512 ± 0.291
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.391 ± 0.149
IleCys: 1.366 ± 0.44
IleAsp: 3.928 ± 1.069
IleGlu: 5.636 ± 0.779
IlePhe: 2.562 ± 0.904
IleGly: 2.22 ± 0.614
IleHis: 1.196 ± 0.26
IleIle: 5.465 ± 2.053
IleLys: 7.003 ± 0.847
IleLeu: 6.149 ± 0.17
IleMet: 1.537 ± 0.431
IleAsn: 2.904 ± 0.915
IlePro: 2.22 ± 0.74
IleGln: 3.074 ± 0.248
IleArg: 2.562 ± 0.808
IleSer: 7.344 ± 1.472
IleThr: 4.611 ± 1.147
IleVal: 4.782 ± 0.981
IleTrp: 0.683 ± 0.357
IleTyr: 1.537 ± 0.602
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.978 ± 1.631
LysCys: 1.879 ± 0.809
LysAsp: 4.611 ± 0.489
LysGlu: 7.003 ± 1.522
LysPhe: 4.441 ± 1.728
LysGly: 3.245 ± 0.891
LysHis: 1.879 ± 0.182
LysIle: 4.782 ± 0.753
LysLys: 6.661 ± 1.065
LysLeu: 7.857 ± 0.455
LysMet: 1.708 ± 0.603
LysAsn: 4.27 ± 0.664
LysPro: 2.22 ± 0.743
LysGln: 1.025 ± 0.581
LysArg: 4.611 ± 1.461
LysSer: 5.465 ± 1.353
LysThr: 5.124 ± 0.62
LysVal: 5.465 ± 1.169
LysTrp: 1.708 ± 0.243
LysTyr: 1.537 ± 0.34
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.416 ± 0.53
LeuCys: 2.22 ± 0.586
LeuAsp: 3.928 ± 0.444
LeuGlu: 6.832 ± 0.963
LeuPhe: 4.27 ± 0.609
LeuGly: 4.782 ± 1.367
LeuHis: 1.879 ± 1.291
LeuIle: 6.832 ± 1.297
LeuLys: 8.198 ± 1.691
LeuLeu: 9.394 ± 2.456
LeuMet: 1.879 ± 0.88
LeuAsn: 6.149 ± 2.173
LeuPro: 3.757 ± 0.657
LeuGln: 4.441 ± 0.394
LeuArg: 4.099 ± 1.309
LeuSer: 11.272 ± 0.927
LeuThr: 6.832 ± 0.706
LeuVal: 5.295 ± 0.203
LeuTrp: 0.342 ± 0.477
LeuTyr: 3.587 ± 1.055
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.854 ± 0.423
MetCys: 0.342 ± 0.172
MetAsp: 1.537 ± 0.349
MetGlu: 1.879 ± 0.882
MetPhe: 0.854 ± 0.484
MetGly: 1.196 ± 0.306
MetHis: 0.512 ± 0.642
MetIle: 1.708 ± 0.421
MetLys: 1.537 ± 0.349
MetLeu: 3.587 ± 0.129
MetMet: 0.512 ± 0.144
MetAsn: 1.537 ± 0.651
MetPro: 0.512 ± 0.405
MetGln: 2.05 ± 0.907
MetArg: 0.683 ± 0.547
MetSer: 2.05 ± 0.575
MetThr: 2.05 ± 0.699
MetVal: 1.025 ± 0.263
MetTrp: 0.0 ± 0.0
MetTyr: 0.854 ± 0.484
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.391 ± 1.494
AsnCys: 1.879 ± 1.085
AsnAsp: 2.733 ± 0.585
AsnGlu: 3.074 ± 0.788
AsnPhe: 2.05 ± 0.057
AsnGly: 2.562 ± 1.136
AsnHis: 1.537 ± 0.659
AsnIle: 4.099 ± 1.918
AsnLys: 3.928 ± 1.172
AsnLeu: 3.928 ± 1.152
AsnMet: 1.025 ± 0.32
AsnAsn: 2.22 ± 1.006
AsnPro: 1.366 ± 0.508
AsnGln: 1.025 ± 0.288
AsnArg: 2.562 ± 0.545
AsnSer: 4.441 ± 0.421
AsnThr: 3.074 ± 0.788
AsnVal: 2.904 ± 0.76
AsnTrp: 1.025 ± 0.263
AsnTyr: 1.537 ± 0.34
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.733 ± 2.012
ProCys: 0.0 ± 0.0
ProAsp: 2.05 ± 0.72
ProGlu: 2.562 ± 1.175
ProPhe: 1.537 ± 1.213
ProGly: 1.196 ± 0.27
ProHis: 0.512 ± 0.144
ProIle: 1.708 ± 0.488
ProLys: 3.416 ± 1.206
ProLeu: 1.708 ± 0.444
ProMet: 0.512 ± 0.898
ProAsn: 1.025 ± 0.327
ProPro: 0.342 ± 0.172
ProGln: 0.854 ± 0.573
ProArg: 1.366 ± 0.44
ProSer: 2.562 ± 0.542
ProThr: 2.391 ± 0.611
ProVal: 2.733 ± 1.487
ProTrp: 0.854 ± 0.298
ProTyr: 0.512 ± 0.291
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.879 ± 0.148
GlnCys: 0.854 ± 0.244
GlnAsp: 1.537 ± 0.872
GlnGlu: 2.22 ± 0.355
GlnPhe: 0.854 ± 0.244
GlnGly: 1.879 ± 1.583
GlnHis: 0.854 ± 0.244
GlnIle: 1.025 ± 0.327
GlnLys: 3.757 ± 0.082
GlnLeu: 4.953 ± 1.622
GlnMet: 1.366 ± 0.35
GlnAsn: 2.05 ± 1.602
GlnPro: 0.683 ± 0.343
GlnGln: 2.391 ± 0.448
GlnArg: 1.879 ± 0.466
GlnSer: 1.879 ± 0.553
GlnThr: 2.22 ± 0.995
GlnVal: 2.904 ± 0.776
GlnTrp: 0.342 ± 0.194
GlnTyr: 0.854 ± 0.484
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.879 ± 0.477
ArgCys: 0.683 ± 0.175
ArgAsp: 2.562 ± 1.269
ArgGlu: 2.562 ± 0.732
ArgPhe: 2.05 ± 0.057
ArgGly: 1.366 ± 0.686
ArgHis: 1.196 ± 0.26
ArgIle: 3.928 ± 1.074
ArgLys: 2.733 ± 0.697
ArgLeu: 4.953 ± 1.994
ArgMet: 1.366 ± 0.35
ArgAsn: 2.904 ± 0.832
ArgPro: 1.025 ± 0.515
ArgGln: 1.366 ± 0.585
ArgArg: 1.537 ± 0.124
ArgSer: 3.928 ± 0.859
ArgThr: 2.904 ± 0.733
ArgVal: 2.904 ± 0.438
ArgTrp: 0.171 ± 0.097
ArgTyr: 1.879 ± 0.663
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.757 ± 0.364
SerCys: 1.708 ± 0.557
SerAsp: 4.782 ± 1.665
SerGlu: 7.003 ± 1.369
SerPhe: 3.928 ± 1.069
SerGly: 5.295 ± 2.007
SerHis: 1.537 ± 0.124
SerIle: 7.515 ± 1.601
SerLys: 7.515 ± 1.081
SerLeu: 7.515 ± 2.131
SerMet: 2.733 ± 0.182
SerAsn: 3.587 ± 0.809
SerPro: 1.879 ± 0.477
SerGln: 2.391 ± 0.833
SerArg: 3.928 ± 0.835
SerSer: 9.564 ± 1.627
SerThr: 3.928 ± 1.31
SerVal: 6.661 ± 1.07
SerTrp: 1.366 ± 0.743
SerTyr: 2.391 ± 0.149
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.05 ± 0.378
ThrCys: 2.733 ± 1.647
ThrAsp: 3.416 ± 0.353
ThrGlu: 4.953 ± 0.229
ThrPhe: 2.562 ± 1.41
ThrGly: 4.099 ± 0.434
ThrHis: 0.854 ± 0.573
ThrIle: 2.05 ± 0.654
ThrLys: 4.099 ± 1.054
ThrLeu: 6.149 ± 1.563
ThrMet: 1.196 ± 0.678
ThrAsn: 2.733 ± 0.917
ThrPro: 2.05 ± 0.525
ThrGln: 1.879 ± 0.787
ThrArg: 1.708 ± 0.596
ThrSer: 7.173 ± 1.093
ThrThr: 3.245 ± 0.583
ThrVal: 4.953 ± 1.89
ThrTrp: 1.025 ± 0.771
ThrTyr: 2.391 ± 0.723
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.537 ± 0.34
ValCys: 0.854 ± 0.573
ValAsp: 2.391 ± 0.225
ValGlu: 5.636 ± 0.863
ValPhe: 2.562 ± 1.136
ValGly: 2.391 ± 0.781
ValHis: 2.05 ± 1.617
ValIle: 5.465 ± 0.443
ValLys: 6.319 ± 1.525
ValLeu: 6.149 ± 1.437
ValMet: 1.196 ± 0.306
ValAsn: 3.587 ± 1.986
ValPro: 2.22 ± 1.006
ValGln: 2.904 ± 0.38
ValArg: 2.733 ± 0.637
ValSer: 5.807 ± 0.625
ValThr: 3.416 ± 0.738
ValVal: 5.295 ± 0.359
ValTrp: 0.512 ± 0.291
ValTyr: 1.879 ± 0.569
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.342 ± 0.429
TrpCys: 0.512 ± 0.144
TrpAsp: 0.342 ± 0.429
TrpGlu: 1.025 ± 0.36
TrpPhe: 0.171 ± 0.473
TrpGly: 1.196 ± 0.785
TrpHis: 0.171 ± 0.239
TrpIle: 1.025 ± 0.515
TrpLys: 1.366 ± 0.714
TrpLeu: 1.366 ± 0.292
TrpMet: 0.512 ± 0.716
TrpAsn: 1.025 ± 0.263
TrpPro: 0.512 ± 0.404
TrpGln: 0.342 ± 0.172
TrpArg: 0.683 ± 0.175
TrpSer: 1.025 ± 0.263
TrpThr: 0.683 ± 0.343
TrpVal: 0.0 ± 0.0
TrpTrp: 0.342 ± 0.429
TrpTyr: 0.342 ± 0.172
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.196 ± 0.785
TyrCys: 1.196 ± 0.533
TyrAsp: 2.05 ± 0.927
TyrGlu: 1.025 ± 0.288
TyrPhe: 1.366 ± 0.436
TyrGly: 1.366 ± 0.775
TyrHis: 0.342 ± 0.172
TyrIle: 2.733 ± 0.372
TyrLys: 2.733 ± 1.017
TyrLeu: 3.587 ± 0.935
TyrMet: 0.683 ± 0.403
TyrAsn: 1.708 ± 0.488
TyrPro: 0.512 ± 0.405
TyrGln: 1.537 ± 0.65
TyrArg: 1.025 ± 0.581
TyrSer: 2.733 ± 0.701
TyrThr: 2.22 ± 0.586
TyrVal: 0.854 ± 0.301
TyrTrp: 0.683 ± 0.403
TyrTyr: 1.366 ± 0.44
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (5856 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
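A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions, not the database's own code: the function names, the fixed seed, and the toy sequences are hypothetical, and the spread is reported here as the sample standard deviation of the replicates, which the source does not specify. Whole proteins are resampled with replacement 100 times and the dipeptide frequency is recomputed for each replicate.

```python
import random
import statistics
from collections import Counter

def dipeptide_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_sd(proteins, dipeptide, n_boot=100, seed=0):
    """Spread of a dipeptide frequency (per mille) across bootstrap replicates.

    Resampling is done at the protein level: each replicate draws
    len(proteins) whole proteins with replacement.
    """
    rng = random.Random(seed)                    # fixed seed for reproducibility
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)  # a missing key counts as 0
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)

toy = ["LSLSLS", "AAAAAA", "LSAALS"]              # toy one-letter sequences
print(bootstrap_sd(toy, "LS"))
```

With only a few proteins, as here, each replicate can take only a small number of distinct values, so the resulting error estimates are necessarily coarse.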

The above dipeptide statistics (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski