Amino acid dipepetide frequency for Ixcanal virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.088AlaAla: 4.088 ± 2.172
1.277AlaCys: 1.277 ± 0.743
2.299AlaAsp: 2.299 ± 0.985
2.81AlaGlu: 2.81 ± 0.88
1.022AlaPhe: 1.022 ± 0.287
3.066AlaGly: 3.066 ± 1.225
1.533AlaHis: 1.533 ± 1.1
5.11AlaIle: 5.11 ± 0.941
2.044AlaLys: 2.044 ± 0.526
5.11AlaLeu: 5.11 ± 2.468
2.555AlaMet: 2.555 ± 0.336
0.766AlaAsn: 0.766 ± 0.438
2.044AlaPro: 2.044 ± 0.789
1.022AlaGln: 1.022 ± 0.325
3.321AlaArg: 3.321 ± 0.9
3.066AlaSer: 3.066 ± 0.334
2.555AlaThr: 2.555 ± 0.502
3.321AlaVal: 3.321 ± 0.952
0.0AlaTrp: 0.0 ± 0.0
1.788AlaTyr: 1.788 ± 0.47
0.0AlaXaa: 0.0 ± 0.0
Cys
0.255CysAla: 0.255 ± 0.149
1.022CysCys: 1.022 ± 0.596
1.533CysAsp: 1.533 ± 0.402
2.555CysGlu: 2.555 ± 0.634
1.788CysPhe: 1.788 ± 1.164
0.766CysGly: 0.766 ± 0.635
0.511CysHis: 0.511 ± 0.423
1.022CysIle: 1.022 ± 0.535
2.299CysLys: 2.299 ± 0.586
1.788CysLeu: 1.788 ± 0.973
0.766CysMet: 0.766 ± 0.431
0.766CysAsn: 0.766 ± 0.202
0.511CysPro: 0.511 ± 0.423
1.277CysGln: 1.277 ± 0.724
1.277CysArg: 1.277 ± 0.462
4.854CysSer: 4.854 ± 2.466
1.533CysThr: 1.533 ± 0.43
1.788CysVal: 1.788 ± 0.577
0.255CysTrp: 0.255 ± 0.212
1.533CysTyr: 1.533 ± 0.954
0.0CysXaa: 0.0 ± 0.0
Asp
2.81AspAla: 2.81 ± 2.1
1.533AspCys: 1.533 ± 0.659
5.365AspAsp: 5.365 ± 0.674
4.343AspGlu: 4.343 ± 0.81
3.321AspPhe: 3.321 ± 1.745
3.832AspGly: 3.832 ± 1.164
1.788AspHis: 1.788 ± 0.708
3.577AspIle: 3.577 ± 0.735
2.81AspLys: 2.81 ± 0.538
5.876AspLeu: 5.876 ± 1.276
2.299AspMet: 2.299 ± 0.78
2.299AspAsn: 2.299 ± 1.047
2.044AspPro: 2.044 ± 0.643
2.044AspGln: 2.044 ± 0.593
2.044AspArg: 2.044 ± 0.344
5.365AspSer: 5.365 ± 1.422
2.299AspThr: 2.299 ± 0.985
2.299AspVal: 2.299 ± 0.791
0.511AspTrp: 0.511 ± 0.143
2.299AspTyr: 2.299 ± 0.739
0.0AspXaa: 0.0 ± 0.0
Glu
4.088GluAla: 4.088 ± 0.977
1.277GluCys: 1.277 ± 0.462
5.876GluAsp: 5.876 ± 0.734
5.365GluGlu: 5.365 ± 1.545
4.088GluPhe: 4.088 ± 0.794
4.854GluGly: 4.854 ± 0.815
0.766GluHis: 0.766 ± 0.202
4.088GluIle: 4.088 ± 0.689
4.088GluLys: 4.088 ± 1.1
8.687GluLeu: 8.687 ± 1.488
1.022GluMet: 1.022 ± 0.325
2.81GluAsn: 2.81 ± 0.407
1.788GluPro: 1.788 ± 0.599
1.277GluGln: 1.277 ± 0.462
2.299GluArg: 2.299 ± 0.716
5.11GluSer: 5.11 ± 0.747
2.81GluThr: 2.81 ± 0.726
5.365GluVal: 5.365 ± 0.198
0.511GluTrp: 0.511 ± 0.298
1.277GluTyr: 1.277 ± 0.463
0.0GluXaa: 0.0 ± 0.0
Phe
2.81PheAla: 2.81 ± 0.901
1.788PheCys: 1.788 ± 0.862
2.299PheAsp: 2.299 ± 0.95
2.555PheGlu: 2.555 ± 0.323
2.044PhePhe: 2.044 ± 0.344
2.555PheGly: 2.555 ± 0.502
0.766PheHis: 0.766 ± 0.329
4.088PheIle: 4.088 ± 0.803
3.066PheLys: 3.066 ± 0.534
3.066PheLeu: 3.066 ± 0.961
1.788PheMet: 1.788 ± 0.248
1.533PheAsn: 1.533 ± 0.607
2.299PhePro: 2.299 ± 0.95
0.0PheGln: 0.0 ± 0.0
3.321PheArg: 3.321 ± 0.67
5.365PheSer: 5.365 ± 0.7
2.299PheThr: 2.299 ± 0.606
3.321PheVal: 3.321 ± 0.741
0.766PheTrp: 0.766 ± 0.202
0.511PheTyr: 0.511 ± 0.438
0.0PheXaa: 0.0 ± 0.0
Gly
3.577GlyAla: 3.577 ± 0.898
1.533GlyCys: 1.533 ± 0.659
2.81GlyAsp: 2.81 ± 0.319
3.066GlyGlu: 3.066 ± 0.563
4.854GlyPhe: 4.854 ± 1.415
3.832GlyGly: 3.832 ± 0.541
2.299GlyHis: 2.299 ± 0.469
1.788GlyIle: 1.788 ± 0.52
4.343GlyLys: 4.343 ± 0.762
3.321GlyLeu: 3.321 ± 0.151
2.044GlyMet: 2.044 ± 0.504
2.555GlyAsn: 2.555 ± 1.325
2.299GlyPro: 2.299 ± 0.415
1.533GlyGln: 1.533 ± 1.034
3.321GlyArg: 3.321 ± 0.963
5.876GlySer: 5.876 ± 1.124
4.599GlyThr: 4.599 ± 1.291
5.621GlyVal: 5.621 ± 0.762
1.022GlyTrp: 1.022 ± 0.479
1.277GlyTyr: 1.277 ± 0.772
0.0GlyXaa: 0.0 ± 0.0
His
1.277HisAla: 1.277 ± 0.462
0.511HisCys: 0.511 ± 0.143
1.533HisAsp: 1.533 ± 0.22
1.022HisGlu: 1.022 ± 0.297
1.277HisPhe: 1.277 ± 0.321
1.277HisGly: 1.277 ± 0.463
0.511HisHis: 0.511 ± 0.298
1.533HisIle: 1.533 ± 0.43
1.788HisLys: 1.788 ± 1.207
1.533HisLeu: 1.533 ± 0.404
0.766HisMet: 0.766 ± 0.635
1.022HisAsn: 1.022 ± 0.821
1.788HisPro: 1.788 ± 1.512
0.766HisGln: 0.766 ± 0.202
1.788HisArg: 1.788 ± 0.515
1.277HisSer: 1.277 ± 0.36
1.788HisThr: 1.788 ± 0.449
1.533HisVal: 1.533 ± 0.376
0.0HisTrp: 0.0 ± 0.0
1.533HisTyr: 1.533 ± 0.43
0.0HisXaa: 0.0 ± 0.0
Ile
1.788IleAla: 1.788 ± 0.361
1.788IleCys: 1.788 ± 0.599
3.066IleAsp: 3.066 ± 0.394
4.088IleGlu: 4.088 ± 1.174
2.044IlePhe: 2.044 ± 0.643
4.854IleGly: 4.854 ± 1.251
1.533IleHis: 1.533 ± 0.43
3.577IleIle: 3.577 ± 0.898
3.577IleLys: 3.577 ± 0.564
5.876IleLeu: 5.876 ± 0.329
1.022IleMet: 1.022 ± 0.297
2.81IleAsn: 2.81 ± 0.599
2.044IlePro: 2.044 ± 0.612
1.788IleGln: 1.788 ± 0.449
4.343IleArg: 4.343 ± 1.043
8.942IleSer: 8.942 ± 1.664
2.299IleThr: 2.299 ± 0.586
3.066IleVal: 3.066 ± 0.864
0.766IleTrp: 0.766 ± 0.447
1.533IleTyr: 1.533 ± 0.607
0.0IleXaa: 0.0 ± 0.0
Lys
3.321LysAla: 3.321 ± 0.429
2.81LysCys: 2.81 ± 1.118
3.066LysAsp: 3.066 ± 0.6
3.577LysGlu: 3.577 ± 0.55
3.321LysPhe: 3.321 ± 0.447
2.299LysGly: 2.299 ± 0.415
0.766LysHis: 0.766 ± 0.343
3.577LysIle: 3.577 ± 0.81
4.854LysLys: 4.854 ± 1.209
4.343LysLeu: 4.343 ± 1.238
3.066LysMet: 3.066 ± 1.304
1.788LysAsn: 1.788 ± 0.623
1.788LysPro: 1.788 ± 0.25
2.299LysGln: 2.299 ± 0.415
3.066LysArg: 3.066 ± 0.846
5.365LysSer: 5.365 ± 1.279
4.088LysThr: 4.088 ± 0.625
5.876LysVal: 5.876 ± 0.938
2.044LysTrp: 2.044 ± 0.224
1.788LysTyr: 1.788 ± 0.518
0.0LysXaa: 0.0 ± 0.0
Leu
3.577LeuAla: 3.577 ± 2.543
1.277LeuCys: 1.277 ± 0.317
6.387LeuAsp: 6.387 ± 2.013
5.876LeuGlu: 5.876 ± 0.897
5.11LeuPhe: 5.11 ± 0.744
5.11LeuGly: 5.11 ± 1.583
2.044LeuHis: 2.044 ± 0.612
5.11LeuIle: 5.11 ± 1.39
6.132LeuLys: 6.132 ± 0.926
9.453LeuLeu: 9.453 ± 1.369
1.533LeuMet: 1.533 ± 0.635
2.81LeuAsn: 2.81 ± 0.484
3.321LeuPro: 3.321 ± 1.622
5.365LeuGln: 5.365 ± 0.671
5.876LeuArg: 5.876 ± 1.006
10.731LeuSer: 10.731 ± 0.361
3.832LeuThr: 3.832 ± 0.585
5.621LeuVal: 5.621 ± 0.642
0.255LeuTrp: 0.255 ± 0.149
1.533LeuTyr: 1.533 ± 0.402
0.0LeuXaa: 0.0 ± 0.0
Met
1.788MetAla: 1.788 ± 0.518
0.511MetCys: 0.511 ± 0.143
1.533MetAsp: 1.533 ± 0.66
2.555MetGlu: 2.555 ± 0.502
1.533MetPhe: 1.533 ± 0.404
2.044MetGly: 2.044 ± 0.745
1.277MetHis: 1.277 ± 0.64
3.321MetIle: 3.321 ± 0.426
2.299MetLys: 2.299 ± 0.358
1.533MetLeu: 1.533 ± 1.111
1.788MetMet: 1.788 ± 0.752
0.766MetAsn: 0.766 ± 0.202
1.022MetPro: 1.022 ± 0.287
1.788MetGln: 1.788 ± 0.248
1.533MetArg: 1.533 ± 0.22
2.044MetSer: 2.044 ± 0.309
2.044MetThr: 2.044 ± 0.612
1.533MetVal: 1.533 ± 0.272
0.0MetTrp: 0.0 ± 0.0
0.255MetTyr: 0.255 ± 0.149
0.0MetXaa: 0.0 ± 0.0
Asn
2.555AsnAla: 2.555 ± 0.962
0.766AsnCys: 0.766 ± 0.526
2.555AsnAsp: 2.555 ± 0.784
2.299AsnGlu: 2.299 ± 0.416
2.299AsnPhe: 2.299 ± 0.663
3.577AsnGly: 3.577 ± 1.19
1.022AsnHis: 1.022 ± 0.596
1.277AsnIle: 1.277 ± 0.326
2.81AsnLys: 2.81 ± 1.094
2.044AsnLeu: 2.044 ± 0.587
1.022AsnMet: 1.022 ± 0.297
1.277AsnAsn: 1.277 ± 0.463
3.577AsnPro: 3.577 ± 0.522
1.533AsnGln: 1.533 ± 0.709
1.022AsnArg: 1.022 ± 0.535
3.321AsnSer: 3.321 ± 0.67
0.766AsnThr: 0.766 ± 0.329
1.788AsnVal: 1.788 ± 0.577
0.766AsnTrp: 0.766 ± 0.438
0.766AsnTyr: 0.766 ± 0.329
0.0AsnXaa: 0.0 ± 0.0
Pro
1.788ProAla: 1.788 ± 0.623
0.0ProCys: 0.0 ± 0.0
2.044ProAsp: 2.044 ± 1.408
4.343ProGlu: 4.343 ± 1.945
2.044ProPhe: 2.044 ± 0.79
2.81ProGly: 2.81 ± 1.094
1.022ProHis: 1.022 ± 0.717
1.277ProIle: 1.277 ± 0.317
2.555ProLys: 2.555 ± 0.3
4.343ProLeu: 4.343 ± 0.81
0.766ProMet: 0.766 ± 0.412
2.555ProAsn: 2.555 ± 0.928
0.766ProPro: 0.766 ± 0.412
1.788ProGln: 1.788 ± 0.248
1.788ProArg: 1.788 ± 0.973
3.577ProSer: 3.577 ± 0.522
0.766ProThr: 0.766 ± 0.343
3.321ProVal: 3.321 ± 1.12
1.533ProTrp: 1.533 ± 0.404
0.511ProTyr: 0.511 ± 0.298
0.0ProXaa: 0.0 ± 0.0
Gln
1.277GlnAla: 1.277 ± 0.724
1.788GlnCys: 1.788 ± 0.577
1.277GlnAsp: 1.277 ± 0.616
1.277GlnGlu: 1.277 ± 0.36
1.022GlnPhe: 1.022 ± 0.341
2.555GlnGly: 2.555 ± 0.466
1.022GlnHis: 1.022 ± 0.497
1.788GlnIle: 1.788 ± 1.043
3.066GlnLys: 3.066 ± 0.394
1.533GlnLeu: 1.533 ± 0.376
1.533GlnMet: 1.533 ± 0.507
1.022GlnAsn: 1.022 ± 0.287
2.044GlnPro: 2.044 ± 0.866
1.533GlnGln: 1.533 ± 0.66
1.788GlnArg: 1.788 ± 0.652
3.321GlnSer: 3.321 ± 0.447
1.277GlnThr: 1.277 ± 0.463
1.533GlnVal: 1.533 ± 0.402
0.0GlnTrp: 0.0 ± 0.0
1.277GlnTyr: 1.277 ± 0.743
0.0GlnXaa: 0.0 ± 0.0
Arg
3.832ArgAla: 3.832 ± 1.386
1.533ArgCys: 1.533 ± 0.786
4.088ArgAsp: 4.088 ± 0.551
4.343ArgGlu: 4.343 ± 0.561
1.533ArgPhe: 1.533 ± 0.43
4.088ArgGly: 4.088 ± 1.068
0.766ArgHis: 0.766 ± 0.329
4.599ArgIle: 4.599 ± 0.941
1.788ArgLys: 1.788 ± 0.25
5.365ArgLeu: 5.365 ± 0.604
2.044ArgMet: 2.044 ± 0.866
3.321ArgAsn: 3.321 ± 1.211
2.044ArgPro: 2.044 ± 0.224
0.766ArgGln: 0.766 ± 0.202
2.299ArgArg: 2.299 ± 0.668
4.599ArgSer: 4.599 ± 1.241
2.555ArgThr: 2.555 ± 0.926
3.577ArgVal: 3.577 ± 0.627
1.022ArgTrp: 1.022 ± 0.497
1.788ArgTyr: 1.788 ± 0.752
0.0ArgXaa: 0.0 ± 0.0
Ser
3.832SerAla: 3.832 ± 0.951
3.321SerCys: 3.321 ± 2.117
5.11SerAsp: 5.11 ± 1.089
7.92SerGlu: 7.92 ± 1.056
4.088SerPhe: 4.088 ± 1.05
6.132SerGly: 6.132 ± 1.099
3.832SerHis: 3.832 ± 1.008
5.365SerIle: 5.365 ± 1.339
6.387SerLys: 6.387 ± 1.231
11.242SerLeu: 11.242 ± 0.708
1.788SerMet: 1.788 ± 0.248
2.044SerAsn: 2.044 ± 0.344
4.343SerPro: 4.343 ± 0.968
2.299SerGln: 2.299 ± 0.716
6.643SerArg: 6.643 ± 1.638
9.198SerSer: 9.198 ± 1.41
4.088SerThr: 4.088 ± 0.684
6.132SerVal: 6.132 ± 0.958
2.044SerTrp: 2.044 ± 0.925
2.555SerTyr: 2.555 ± 0.341
0.0SerXaa: 0.0 ± 0.0
Thr
1.022ThrAla: 1.022 ± 0.535
1.277ThrCys: 1.277 ± 0.36
2.81ThrAsp: 2.81 ± 0.711
4.088ThrGlu: 4.088 ± 1.12
1.533ThrPhe: 1.533 ± 0.404
3.577ThrGly: 3.577 ± 0.81
0.766ThrHis: 0.766 ± 0.202
3.321ThrIle: 3.321 ± 0.827
2.81ThrLys: 2.81 ± 0.88
5.876ThrLeu: 5.876 ± 0.961
1.022ThrMet: 1.022 ± 0.325
2.044ThrAsn: 2.044 ± 0.457
2.044ThrPro: 2.044 ± 0.593
1.533ThrGln: 1.533 ± 0.272
3.577ThrArg: 3.577 ± 1.086
5.11ThrSer: 5.11 ± 0.682
3.066ThrThr: 3.066 ± 1.005
2.555ThrVal: 2.555 ± 0.467
0.0ThrTrp: 0.0 ± 0.0
1.022ThrTyr: 1.022 ± 0.395
0.0ThrXaa: 0.0 ± 0.0
Val
2.81ValAla: 2.81 ± 1.866
2.555ValCys: 2.555 ± 0.634
3.832ValAsp: 3.832 ± 0.474
4.088ValGlu: 4.088 ± 0.568
1.533ValPhe: 1.533 ± 0.659
1.788ValGly: 1.788 ± 0.449
1.533ValHis: 1.533 ± 0.22
3.832ValIle: 3.832 ± 0.338
4.854ValLys: 4.854 ± 1.379
5.365ValLeu: 5.365 ± 1.199
3.577ValMet: 3.577 ± 0.735
3.577ValAsn: 3.577 ± 2.367
2.044ValPro: 2.044 ± 0.525
2.044ValGln: 2.044 ± 0.27
4.343ValArg: 4.343 ± 0.807
7.154ValSer: 7.154 ± 1.071
3.577ValThr: 3.577 ± 1.566
5.365ValVal: 5.365 ± 0.862
0.511ValTrp: 0.511 ± 0.298
2.555ValTyr: 2.555 ± 0.605
0.0ValXaa: 0.0 ± 0.0
Trp
0.766TrpAla: 0.766 ± 0.202
0.255TrpCys: 0.255 ± 0.149
0.255TrpAsp: 0.255 ± 0.212
0.255TrpGlu: 0.255 ± 0.149
0.255TrpPhe: 0.255 ± 0.149
1.022TrpGly: 1.022 ± 0.297
0.0TrpHis: 0.0 ± 0.0
0.255TrpIle: 0.255 ± 0.212
0.255TrpLys: 0.255 ± 0.149
1.788TrpLeu: 1.788 ± 0.248
0.255TrpMet: 0.255 ± 0.212
0.255TrpAsn: 0.255 ± 0.149
0.511TrpPro: 0.511 ± 0.411
0.255TrpGln: 0.255 ± 0.501
0.511TrpArg: 0.511 ± 0.143
1.533TrpSer: 1.533 ± 0.607
1.788TrpThr: 1.788 ± 0.518
2.299TrpVal: 2.299 ± 0.243
0.255TrpTrp: 0.255 ± 0.149
0.255TrpTyr: 0.255 ± 0.149
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.533TyrAla: 1.533 ± 0.404
1.277TyrCys: 1.277 ± 0.944
1.022TyrAsp: 1.022 ± 0.596
1.022TyrGlu: 1.022 ± 0.535
1.533TyrPhe: 1.533 ± 0.402
1.533TyrGly: 1.533 ± 0.607
0.766TyrHis: 0.766 ± 0.447
2.299TyrIle: 2.299 ± 0.358
1.277TyrLys: 1.277 ± 0.317
3.066TyrLeu: 3.066 ± 0.89
0.255TyrMet: 0.255 ± 0.443
1.022TyrAsn: 1.022 ± 0.287
1.277TyrPro: 1.277 ± 0.533
1.022TyrGln: 1.022 ± 0.906
1.788TyrArg: 1.788 ± 0.52
2.555TyrSer: 2.555 ± 1.191
1.022TyrThr: 1.022 ± 0.325
1.022TyrVal: 1.022 ± 0.497
0.766TyrTrp: 0.766 ± 0.329
0.511TyrTyr: 0.511 ± 0.298
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3915 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski