Amino acid dipeptide frequency for Tomato chlorosis virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins and all amino acids were equally frequent, each dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
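The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used by the database: the function name `dipeptide_frequencies`, the one-letter alphabet with `X` standing in for Xaa, and the choice to count overlapping dipeptides within each protein separately are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"          # X plays the role of Xaa (assumption)

# Expected frequency of each dipeptide, in per mille, if all 400 were equally likely.
UNIFORM_PER_MILLE = 1000.0 / 400   # 2.5


def dipeptide_frequencies(proteins):
    """Return the per-mille frequency of all 441 dipeptides in a list of sequences.

    Assumption: overlapping dipeptides are counted inside each protein,
    never across the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown residue to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Toy input (hypothetical sequence, not taken from the proteome):
freqs = dipeptide_frequencies(["AAC"])
print(freqs["AA"], freqs["AC"])  # 500.0 500.0
```

The returned dictionary always has 441 entries, so dipeptides that never occur appear with a frequency of 0.0, as in the table.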

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
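As a small worked example of the unit conversion and of the comparison against the expected value, consider the sketch below. The helper names `per_mille_to_fraction` and `classify` are hypothetical, and using exactly 2.5‰ (1/400) as the threshold is an assumption based on the description above; the page may use a different cut-off for its colouring.

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per-mille frequency relative to the expected uniform value."""
    if value > expected:
        return "over"      # would be marked in red
    if value < expected:
        return "under"     # would be marked in blue
    return "expected"


# Two values taken from the table:
print(classify(9.101))  # LeuVal -> over
print(classify(0.775))  # AlaCys -> under
```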

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.292 ± 0.536
AlaCys: 0.775 ± 0.529
AlaAsp: 2.711 ± 0.816
AlaGlu: 2.13 ± 0.684
AlaPhe: 2.711 ± 0.811
AlaGly: 3.486 ± 0.725
AlaHis: 0.387 ± 0.187
AlaIle: 1.936 ± 0.541
AlaLys: 4.454 ± 0.646
AlaLeu: 4.841 ± 0.563
AlaMet: 1.162 ± 0.61
AlaAsn: 2.711 ± 0.56
AlaPro: 0.775 ± 0.344
AlaGln: 1.549 ± 0.267
AlaArg: 2.905 ± 0.863
AlaSer: 2.324 ± 0.668
AlaThr: 1.162 ± 0.443
AlaVal: 3.486 ± 0.44
AlaTrp: 0.387 ± 0.276
AlaTyr: 0.775 ± 0.242
AlaXaa: 0.194 ± 0.13
Cys
CysAla: 0.968 ± 0.495
CysCys: 0.387 ± 0.187
CysAsp: 2.711 ± 1.027
CysGlu: 0.968 ± 0.431
CysPhe: 0.775 ± 0.4
CysGly: 1.162 ± 0.617
CysHis: 0.194 ± 0.213
CysIle: 0.194 ± 0.13
CysLys: 0.775 ± 0.399
CysLeu: 3.098 ± 0.936
CysMet: 0.194 ± 0.239
CysAsn: 0.775 ± 0.242
CysPro: 0.387 ± 0.258
CysGln: 1.162 ± 0.366
CysArg: 0.194 ± 0.235
CysSer: 1.549 ± 0.491
CysThr: 0.775 ± 0.357
CysVal: 1.549 ± 0.726
CysTrp: 0.387 ± 0.261
CysTyr: 1.936 ± 0.481
CysXaa: 0.194 ± 0.13
Asp
AspAla: 2.13 ± 0.403
AspCys: 1.162 ± 0.413
AspAsp: 5.229 ± 1.071
AspGlu: 2.517 ± 0.273
AspPhe: 6.971 ± 1.14
AspGly: 3.292 ± 0.794
AspHis: 1.162 ± 0.282
AspIle: 3.486 ± 0.696
AspLys: 4.841 ± 0.744
AspLeu: 6.584 ± 1.123
AspMet: 2.13 ± 0.719
AspAsn: 2.905 ± 0.782
AspPro: 0.968 ± 0.402
AspGln: 0.581 ± 0.458
AspArg: 4.26 ± 0.955
AspSer: 3.873 ± 0.866
AspThr: 4.26 ± 0.648
AspVal: 6.39 ± 0.973
AspTrp: 0.581 ± 0.244
AspTyr: 2.711 ± 0.645
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.936 ± 0.452
GluCys: 1.549 ± 0.569
GluAsp: 2.517 ± 0.988
GluGlu: 1.936 ± 0.821
GluPhe: 1.936 ± 0.36
GluGly: 2.324 ± 0.538
GluHis: 0.387 ± 0.261
GluIle: 4.067 ± 0.853
GluLys: 5.229 ± 1.103
GluLeu: 3.679 ± 0.756
GluMet: 1.162 ± 0.497
GluAsn: 2.711 ± 0.549
GluPro: 1.743 ± 0.862
GluGln: 1.162 ± 0.469
GluArg: 3.679 ± 0.721
GluSer: 4.454 ± 1.041
GluThr: 2.324 ± 0.561
GluVal: 4.454 ± 0.982
GluTrp: 0.194 ± 0.202
GluTyr: 2.711 ± 0.412
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.711 ± 0.604
PheCys: 1.162 ± 0.425
PheAsp: 4.26 ± 0.831
PheGlu: 2.711 ± 0.773
PhePhe: 4.26 ± 0.953
PheGly: 4.648 ± 0.685
PheHis: 1.162 ± 0.422
PheIle: 3.679 ± 0.822
PheLys: 4.648 ± 0.799
PheLeu: 5.229 ± 0.71
PheMet: 1.356 ± 0.572
PheAsn: 2.711 ± 0.74
PhePro: 1.549 ± 0.678
PheGln: 0.581 ± 0.369
PheArg: 2.13 ± 0.71
PheSer: 8.133 ± 1.442
PheThr: 3.292 ± 0.686
PheVal: 6.003 ± 1.131
PheTrp: 0.0 ± 0.0
PheTyr: 2.324 ± 0.617
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.936 ± 0.428
GlyCys: 1.162 ± 0.413
GlyAsp: 4.454 ± 0.696
GlyGlu: 3.292 ± 0.661
GlyPhe: 3.098 ± 0.429
GlyGly: 4.067 ± 0.844
GlyHis: 1.936 ± 0.643
GlyIle: 2.517 ± 0.71
GlyLys: 5.229 ± 0.703
GlyLeu: 5.035 ± 1.029
GlyMet: 1.162 ± 0.545
GlyAsn: 3.292 ± 0.636
GlyPro: 1.162 ± 0.345
GlyGln: 0.968 ± 0.476
GlyArg: 2.324 ± 0.573
GlySer: 3.098 ± 0.877
GlyThr: 2.324 ± 0.687
GlyVal: 4.067 ± 0.98
GlyTrp: 0.775 ± 0.841
GlyTyr: 1.549 ± 0.49
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.775 ± 0.361
HisCys: 0.581 ± 0.272
HisAsp: 1.356 ± 0.706
HisGlu: 0.387 ± 0.327
HisPhe: 1.356 ± 0.555
HisGly: 0.775 ± 0.327
HisHis: 0.0 ± 0.0
HisIle: 0.968 ± 0.508
HisLys: 1.549 ± 0.267
HisLeu: 1.549 ± 0.462
HisMet: 0.194 ± 0.222
HisAsn: 0.581 ± 0.479
HisPro: 1.162 ± 0.397
HisGln: 0.0 ± 0.0
HisArg: 1.356 ± 0.468
HisSer: 1.356 ± 0.471
HisThr: 0.581 ± 0.272
HisVal: 1.356 ± 0.467
HisTrp: 0.0 ± 0.0
HisTyr: 1.356 ± 0.372
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.13 ± 0.699
IleCys: 0.194 ± 0.13
IleAsp: 3.679 ± 0.882
IleGlu: 3.679 ± 0.812
IlePhe: 3.292 ± 0.704
IleGly: 2.13 ± 0.666
IleHis: 1.743 ± 0.642
IleIle: 4.648 ± 1.038
IleLys: 5.422 ± 1.003
IleLeu: 6.584 ± 1.297
IleMet: 1.549 ± 0.367
IleAsn: 3.873 ± 1.054
IlePro: 3.486 ± 0.628
IleGln: 1.743 ± 0.46
IleArg: 3.486 ± 0.647
IleSer: 6.971 ± 1.181
IleThr: 2.517 ± 0.834
IleVal: 4.841 ± 0.911
IleTrp: 0.194 ± 0.202
IleTyr: 2.517 ± 0.768
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.905 ± 0.512
LysCys: 1.356 ± 0.446
LysAsp: 3.486 ± 0.749
LysGlu: 3.098 ± 0.947
LysPhe: 4.26 ± 0.797
LysGly: 3.098 ± 0.344
LysHis: 0.775 ± 0.316
LysIle: 7.552 ± 0.794
LysLys: 4.067 ± 1.151
LysLeu: 6.39 ± 0.769
LysMet: 2.905 ± 0.725
LysAsn: 3.292 ± 0.585
LysPro: 3.679 ± 0.675
LysGln: 2.13 ± 0.83
LysArg: 2.905 ± 0.739
LysSer: 3.679 ± 0.733
LysThr: 4.454 ± 0.705
LysVal: 5.035 ± 1.274
LysTrp: 0.194 ± 0.235
LysTyr: 3.486 ± 0.509
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.454 ± 1.012
LeuCys: 2.711 ± 0.746
LeuAsp: 4.454 ± 0.696
LeuGlu: 4.648 ± 0.944
LeuPhe: 4.841 ± 1.343
LeuGly: 5.035 ± 0.627
LeuHis: 1.549 ± 0.398
LeuIle: 6.39 ± 1.575
LeuLys: 5.229 ± 0.502
LeuLeu: 7.552 ± 1.307
LeuMet: 2.13 ± 0.879
LeuAsn: 4.26 ± 0.844
LeuPro: 3.098 ± 0.712
LeuGln: 2.13 ± 0.539
LeuArg: 6.971 ± 1.469
LeuSer: 8.133 ± 0.886
LeuThr: 4.454 ± 0.57
LeuVal: 9.101 ± 1.338
LeuTrp: 0.968 ± 0.364
LeuTyr: 5.229 ± 0.872
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.968 ± 0.21
MetCys: 0.387 ± 0.29
MetAsp: 2.13 ± 0.641
MetGlu: 0.775 ± 0.349
MetPhe: 1.743 ± 0.52
MetGly: 1.549 ± 0.401
MetHis: 0.194 ± 0.13
MetIle: 2.517 ± 0.731
MetLys: 2.324 ± 0.798
MetLeu: 2.324 ± 0.687
MetMet: 0.0 ± 0.0
MetAsn: 1.356 ± 0.366
MetPro: 0.387 ± 0.426
MetGln: 0.968 ± 0.21
MetArg: 1.356 ± 0.49
MetSer: 2.13 ± 0.941
MetThr: 1.743 ± 0.481
MetVal: 0.968 ± 0.37
MetTrp: 0.387 ± 0.187
MetTyr: 0.581 ± 0.376
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.679 ± 0.805
AsnCys: 0.775 ± 0.335
AsnAsp: 2.324 ± 0.493
AsnGlu: 2.905 ± 0.664
AsnPhe: 3.679 ± 0.966
AsnGly: 2.324 ± 0.754
AsnHis: 0.775 ± 0.471
AsnIle: 4.067 ± 0.648
AsnLys: 4.067 ± 0.737
AsnLeu: 5.035 ± 0.496
AsnMet: 0.968 ± 0.419
AsnAsn: 2.711 ± 0.81
AsnPro: 2.711 ± 0.83
AsnGln: 1.356 ± 0.659
AsnArg: 2.13 ± 0.637
AsnSer: 6.197 ± 1.34
AsnThr: 2.517 ± 0.667
AsnVal: 4.26 ± 0.964
AsnTrp: 0.581 ± 0.262
AsnTyr: 2.517 ± 1.252
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.162 ± 0.636
ProCys: 0.581 ± 0.244
ProAsp: 4.454 ± 0.871
ProGlu: 1.743 ± 0.614
ProPhe: 1.162 ± 0.437
ProGly: 2.13 ± 0.72
ProHis: 0.387 ± 0.214
ProIle: 2.13 ± 0.593
ProLys: 1.549 ± 0.471
ProLeu: 3.098 ± 1.215
ProMet: 0.775 ± 0.323
ProAsn: 2.905 ± 0.864
ProPro: 2.324 ± 0.765
ProGln: 0.581 ± 0.386
ProArg: 0.775 ± 0.242
ProSer: 2.711 ± 0.971
ProThr: 2.13 ± 0.616
ProVal: 3.098 ± 0.723
ProTrp: 0.387 ± 0.293
ProTyr: 1.936 ± 0.55
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.968 ± 0.276
GlnCys: 0.194 ± 0.222
GlnAsp: 1.162 ± 0.668
GlnGlu: 1.356 ± 0.408
GlnPhe: 1.743 ± 0.286
GlnGly: 2.13 ± 0.451
GlnHis: 0.387 ± 0.243
GlnIle: 0.775 ± 0.374
GlnLys: 1.162 ± 0.617
GlnLeu: 2.517 ± 0.67
GlnMet: 0.968 ± 0.33
GlnAsn: 1.356 ± 0.43
GlnPro: 0.581 ± 0.553
GlnGln: 0.968 ± 0.405
GlnArg: 1.356 ± 0.344
GlnSer: 0.775 ± 0.357
GlnThr: 1.162 ± 0.566
GlnVal: 1.743 ± 0.472
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.549 ± 0.63
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.517 ± 0.599
ArgCys: 1.743 ± 0.636
ArgAsp: 3.098 ± 0.643
ArgGlu: 3.486 ± 0.882
ArgPhe: 3.873 ± 0.841
ArgGly: 1.162 ± 0.635
ArgHis: 0.968 ± 0.652
ArgIle: 3.486 ± 0.517
ArgLys: 2.711 ± 0.48
ArgLeu: 6.584 ± 0.947
ArgMet: 0.968 ± 0.533
ArgAsn: 2.905 ± 0.629
ArgPro: 2.13 ± 0.455
ArgGln: 0.775 ± 0.284
ArgArg: 4.26 ± 0.742
ArgSer: 4.648 ± 0.902
ArgThr: 2.711 ± 0.757
ArgVal: 3.679 ± 0.856
ArgTrp: 0.581 ± 0.28
ArgTyr: 2.13 ± 0.598
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.679 ± 0.53
SerCys: 1.356 ± 0.495
SerAsp: 6.39 ± 0.719
SerGlu: 4.26 ± 0.698
SerPhe: 4.648 ± 0.667
SerGly: 4.841 ± 0.769
SerHis: 1.936 ± 0.518
SerIle: 5.809 ± 1.214
SerLys: 6.971 ± 1.363
SerLeu: 8.521 ± 1.455
SerMet: 2.324 ± 0.823
SerAsn: 4.26 ± 1.183
SerPro: 2.324 ± 0.646
SerGln: 2.13 ± 0.57
SerArg: 3.098 ± 0.583
SerSer: 6.971 ± 0.63
SerThr: 4.26 ± 0.966
SerVal: 5.422 ± 0.667
SerTrp: 0.775 ± 0.432
SerTyr: 2.711 ± 0.881
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.679 ± 0.749
ThrCys: 0.775 ± 0.267
ThrAsp: 3.098 ± 0.376
ThrGlu: 3.098 ± 0.548
ThrPhe: 3.873 ± 0.811
ThrGly: 3.679 ± 0.772
ThrHis: 0.775 ± 0.291
ThrIle: 3.679 ± 0.949
ThrLys: 0.968 ± 0.364
ThrLeu: 5.035 ± 1.007
ThrMet: 1.162 ± 0.364
ThrAsn: 3.098 ± 0.638
ThrPro: 1.936 ± 0.778
ThrGln: 0.968 ± 0.458
ThrArg: 1.936 ± 0.687
ThrSer: 4.648 ± 0.556
ThrThr: 3.292 ± 0.65
ThrVal: 3.486 ± 0.531
ThrTrp: 0.775 ± 0.385
ThrTyr: 1.936 ± 0.332
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.098 ± 0.453
ValCys: 2.13 ± 0.744
ValAsp: 5.035 ± 0.731
ValGlu: 4.841 ± 0.992
ValPhe: 3.486 ± 0.784
ValGly: 3.679 ± 0.722
ValHis: 1.356 ± 0.589
ValIle: 3.873 ± 0.811
ValLys: 4.454 ± 0.87
ValLeu: 5.229 ± 0.653
ValMet: 1.743 ± 0.729
ValAsn: 5.809 ± 1.013
ValPro: 3.873 ± 0.639
ValGln: 1.936 ± 0.387
ValArg: 5.422 ± 0.575
ValSer: 6.003 ± 0.838
ValThr: 5.035 ± 0.931
ValVal: 6.971 ± 1.746
ValTrp: 0.775 ± 0.344
ValTyr: 3.679 ± 0.902
ValXaa: 0.194 ± 0.202
Trp
TrpAla: 0.194 ± 0.235
TrpCys: 0.194 ± 0.13
TrpAsp: 0.581 ± 0.376
TrpGlu: 0.387 ± 0.421
TrpPhe: 1.162 ± 0.333
TrpGly: 0.387 ± 0.261
TrpHis: 0.0 ± 0.0
TrpIle: 0.194 ± 0.202
TrpLys: 0.581 ± 0.458
TrpLeu: 0.968 ± 0.423
TrpMet: 0.581 ± 0.272
TrpAsn: 0.387 ± 0.317
TrpPro: 0.0 ± 0.0
TrpGln: 0.194 ± 0.202
TrpArg: 0.581 ± 0.468
TrpSer: 0.581 ± 0.262
TrpThr: 0.0 ± 0.0
TrpVal: 0.581 ± 0.391
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.387 ± 0.187
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.968 ± 0.481
TyrCys: 1.162 ± 0.426
TyrAsp: 2.905 ± 0.586
TyrGlu: 1.743 ± 0.631
TyrPhe: 3.679 ± 0.98
TyrGly: 1.549 ± 0.468
TyrHis: 1.162 ± 0.288
TyrIle: 2.905 ± 1.025
TyrLys: 2.517 ± 0.592
TyrLeu: 3.292 ± 1.231
TyrMet: 1.162 ± 0.289
TyrAsn: 3.486 ± 0.761
TyrPro: 1.743 ± 0.744
TyrGln: 0.968 ± 0.452
TyrArg: 3.292 ± 0.918
TyrSer: 4.454 ± 1.286
TyrThr: 2.905 ± 0.45
TyrVal: 2.13 ± 0.521
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.936 ± 0.669
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.194 ± 0.13
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.194 ± 0.202
XaaMet: 0.0 ± 0.0
XaaAsn: 0.194 ± 0.13
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 13 proteins (5165 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
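One plausible reading of this note is sketched below: the set of proteins is resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those values is reported as the error. This is an illustration only. The function names, the use of the sample standard deviation as the error, and the fixed random seed are assumptions; the exact procedure used by the database is not described here.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Per-mille frequency of one dipeptide (one-letter codes) in a protein list."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling whole proteins.

    Assumption: the error is the standard deviation over the replicates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy proteome (not real sequences from this virus):
toy = ["MLVSLK", "MSLVAA", "MKKLVS"]
print(bootstrap_error(toy, "LV"))
```

Resampling whole proteins rather than individual residues keeps each dipeptide's sequence context intact, which is presumably why the note specifies the protein level.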

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski