Amino acid dipepetide frequency for Liberibacter phage HHCA1-2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.049AlaAla: 1.049 ± 0.312
1.398AlaCys: 1.398 ± 0.46
2.971AlaAsp: 2.971 ± 0.618
3.495AlaGlu: 3.495 ± 0.483
2.447AlaPhe: 2.447 ± 0.468
2.796AlaGly: 2.796 ± 0.515
1.922AlaHis: 1.922 ± 0.384
3.845AlaIle: 3.845 ± 0.704
5.592AlaLys: 5.592 ± 0.736
7.078AlaLeu: 7.078 ± 0.698
1.049AlaMet: 1.049 ± 0.253
2.796AlaAsn: 2.796 ± 0.408
1.66AlaPro: 1.66 ± 0.398
3.583AlaGln: 3.583 ± 0.603
3.67AlaArg: 3.67 ± 0.54
6.117AlaSer: 6.117 ± 0.913
3.495AlaThr: 3.495 ± 0.722
4.282AlaVal: 4.282 ± 0.574
1.311AlaTrp: 1.311 ± 0.364
1.748AlaTyr: 1.748 ± 0.413
0.0AlaXaa: 0.0 ± 0.0
Cys
0.524CysAla: 0.524 ± 0.238
0.175CysCys: 0.175 ± 0.11
0.874CysAsp: 0.874 ± 0.252
0.874CysGlu: 0.874 ± 0.293
0.35CysPhe: 0.35 ± 0.153
1.049CysGly: 1.049 ± 0.32
0.35CysHis: 0.35 ± 0.28
0.699CysIle: 0.699 ± 0.281
0.612CysLys: 0.612 ± 0.262
1.485CysLeu: 1.485 ± 0.291
0.175CysMet: 0.175 ± 0.127
0.35CysAsn: 0.35 ± 0.18
0.612CysPro: 0.612 ± 0.272
0.35CysGln: 0.35 ± 0.218
0.699CysArg: 0.699 ± 0.333
0.524CysSer: 0.524 ± 0.213
0.961CysThr: 0.961 ± 0.329
0.699CysVal: 0.699 ± 0.35
0.175CysTrp: 0.175 ± 0.146
0.262CysTyr: 0.262 ± 0.154
0.0CysXaa: 0.0 ± 0.0
Asp
3.146AspAla: 3.146 ± 0.667
1.311AspCys: 1.311 ± 0.419
3.058AspAsp: 3.058 ± 0.599
4.631AspGlu: 4.631 ± 0.655
2.359AspPhe: 2.359 ± 0.382
3.146AspGly: 3.146 ± 0.533
0.612AspHis: 0.612 ± 0.226
5.33AspIle: 5.33 ± 0.766
6.728AspLys: 6.728 ± 1.506
5.243AspLeu: 5.243 ± 0.85
1.049AspMet: 1.049 ± 0.27
2.709AspAsn: 2.709 ± 0.581
2.621AspPro: 2.621 ± 0.451
1.311AspGln: 1.311 ± 0.306
2.884AspArg: 2.884 ± 0.569
3.408AspSer: 3.408 ± 0.648
3.757AspThr: 3.757 ± 0.658
4.631AspVal: 4.631 ± 0.612
1.136AspTrp: 1.136 ± 0.386
1.835AspTyr: 1.835 ± 0.325
0.0AspXaa: 0.0 ± 0.0
Glu
5.33GluAla: 5.33 ± 0.718
0.175GluCys: 0.175 ± 0.124
4.282GluAsp: 4.282 ± 0.754
6.379GluGlu: 6.379 ± 0.849
3.845GluPhe: 3.845 ± 0.594
4.981GluGly: 4.981 ± 0.609
2.272GluHis: 2.272 ± 0.544
4.719GluIle: 4.719 ± 0.652
5.33GluLys: 5.33 ± 0.782
6.728GluLeu: 6.728 ± 0.832
1.485GluMet: 1.485 ± 0.302
3.233GluAsn: 3.233 ± 0.512
1.398GluPro: 1.398 ± 0.333
4.544GluGln: 4.544 ± 0.622
5.33GluArg: 5.33 ± 0.777
3.146GluSer: 3.146 ± 0.566
3.321GluThr: 3.321 ± 0.49
5.068GluVal: 5.068 ± 0.776
0.612GluTrp: 0.612 ± 0.203
2.796GluTyr: 2.796 ± 0.509
0.0GluXaa: 0.0 ± 0.0
Phe
2.971PheAla: 2.971 ± 0.433
0.786PheCys: 0.786 ± 0.305
2.884PheAsp: 2.884 ± 0.563
3.408PheGlu: 3.408 ± 0.544
1.485PhePhe: 1.485 ± 0.362
2.796PheGly: 2.796 ± 0.535
0.874PheHis: 0.874 ± 0.294
3.146PheIle: 3.146 ± 0.503
4.194PheLys: 4.194 ± 0.745
3.757PheLeu: 3.757 ± 0.596
0.35PheMet: 0.35 ± 0.14
2.097PheAsn: 2.097 ± 0.476
1.049PhePro: 1.049 ± 0.334
1.049PheGln: 1.049 ± 0.293
1.573PheArg: 1.573 ± 0.346
3.146PheSer: 3.146 ± 0.411
1.573PheThr: 1.573 ± 0.315
2.621PheVal: 2.621 ± 0.524
0.612PheTrp: 0.612 ± 0.323
1.398PheTyr: 1.398 ± 0.28
0.0PheXaa: 0.0 ± 0.0
Gly
3.583GlyAla: 3.583 ± 0.49
1.136GlyCys: 1.136 ± 0.306
3.583GlyAsp: 3.583 ± 1.397
4.631GlyGlu: 4.631 ± 0.566
3.146GlyPhe: 3.146 ± 0.567
4.194GlyGly: 4.194 ± 0.683
1.049GlyHis: 1.049 ± 0.372
3.845GlyIle: 3.845 ± 0.722
4.719GlyLys: 4.719 ± 0.901
6.292GlyLeu: 6.292 ± 0.806
1.485GlyMet: 1.485 ± 0.368
2.359GlyAsn: 2.359 ± 0.721
4.02GlyPro: 4.02 ± 4.651
3.67GlyGln: 3.67 ± 1.976
2.709GlyArg: 2.709 ± 0.48
4.544GlySer: 4.544 ± 0.81
3.408GlyThr: 3.408 ± 0.494
3.67GlyVal: 3.67 ± 0.562
0.786GlyTrp: 0.786 ± 0.272
2.359GlyTyr: 2.359 ± 0.528
0.0GlyXaa: 0.0 ± 0.0
His
0.874HisAla: 0.874 ± 0.299
0.699HisCys: 0.699 ± 0.249
0.786HisAsp: 0.786 ± 0.269
1.049HisGlu: 1.049 ± 0.364
1.136HisPhe: 1.136 ± 0.242
0.874HisGly: 0.874 ± 0.296
0.961HisHis: 0.961 ± 0.333
1.398HisIle: 1.398 ± 0.251
1.398HisLys: 1.398 ± 0.439
1.922HisLeu: 1.922 ± 0.44
0.262HisMet: 0.262 ± 0.149
1.311HisAsn: 1.311 ± 0.28
1.311HisPro: 1.311 ± 0.314
0.874HisGln: 0.874 ± 0.317
1.136HisArg: 1.136 ± 0.322
1.049HisSer: 1.049 ± 0.255
0.786HisThr: 0.786 ± 0.314
0.874HisVal: 0.874 ± 0.263
0.437HisTrp: 0.437 ± 0.202
0.961HisTyr: 0.961 ± 0.303
0.0HisXaa: 0.0 ± 0.0
Ile
5.855IleAla: 5.855 ± 0.751
0.437IleCys: 0.437 ± 0.186
4.719IleAsp: 4.719 ± 0.629
5.592IleGlu: 5.592 ± 0.798
1.485IlePhe: 1.485 ± 0.352
3.321IleGly: 3.321 ± 0.618
0.961IleHis: 0.961 ± 0.343
3.583IleIle: 3.583 ± 0.568
5.243IleLys: 5.243 ± 0.695
5.418IleLeu: 5.418 ± 0.922
0.786IleMet: 0.786 ± 0.296
3.058IleAsn: 3.058 ± 0.501
3.233IlePro: 3.233 ± 0.548
1.835IleGln: 1.835 ± 0.413
3.058IleArg: 3.058 ± 0.645
4.02IleSer: 4.02 ± 0.651
4.806IleThr: 4.806 ± 0.66
3.757IleVal: 3.757 ± 0.548
0.961IleTrp: 0.961 ± 0.286
1.835IleTyr: 1.835 ± 0.452
0.0IleXaa: 0.0 ± 0.0
Lys
6.117LysAla: 6.117 ± 0.672
0.699LysCys: 0.699 ± 0.231
5.243LysAsp: 5.243 ± 0.664
6.728LysGlu: 6.728 ± 0.867
2.01LysPhe: 2.01 ± 0.484
7.253LysGly: 7.253 ± 2.354
2.272LysHis: 2.272 ± 0.556
4.369LysIle: 4.369 ± 0.551
5.418LysLys: 5.418 ± 0.763
7.165LysLeu: 7.165 ± 0.778
1.748LysMet: 1.748 ± 0.502
3.233LysAsn: 3.233 ± 0.85
2.971LysPro: 2.971 ± 0.513
3.495LysGln: 3.495 ± 0.597
5.942LysArg: 5.942 ± 0.72
4.631LysSer: 4.631 ± 0.61
5.33LysThr: 5.33 ± 0.65
4.369LysVal: 4.369 ± 0.638
1.398LysTrp: 1.398 ± 0.426
2.097LysTyr: 2.097 ± 0.43
0.0LysXaa: 0.0 ± 0.0
Leu
6.379LeuAla: 6.379 ± 0.828
1.049LeuCys: 1.049 ± 0.354
5.68LeuAsp: 5.68 ± 0.733
6.903LeuGlu: 6.903 ± 0.972
3.757LeuPhe: 3.757 ± 0.61
4.719LeuGly: 4.719 ± 0.637
1.223LeuHis: 1.223 ± 0.344
6.816LeuIle: 6.816 ± 0.791
9.787LeuLys: 9.787 ± 1.064
6.991LeuLeu: 6.991 ± 0.825
1.66LeuMet: 1.66 ± 0.272
3.321LeuAsn: 3.321 ± 0.431
2.621LeuPro: 2.621 ± 0.448
4.02LeuGln: 4.02 ± 0.724
4.631LeuArg: 4.631 ± 0.488
7.165LeuSer: 7.165 ± 0.812
5.068LeuThr: 5.068 ± 0.708
5.156LeuVal: 5.156 ± 0.728
1.573LeuTrp: 1.573 ± 0.477
2.272LeuTyr: 2.272 ± 0.528
0.0LeuXaa: 0.0 ± 0.0
Met
1.573MetAla: 1.573 ± 0.4
0.35MetCys: 0.35 ± 0.155
1.66MetAsp: 1.66 ± 0.585
1.748MetGlu: 1.748 ± 0.415
0.699MetPhe: 0.699 ± 0.222
1.922MetGly: 1.922 ± 0.425
0.524MetHis: 0.524 ± 0.169
0.612MetIle: 0.612 ± 0.232
1.311MetLys: 1.311 ± 0.344
0.786MetLeu: 0.786 ± 0.24
0.699MetMet: 0.699 ± 0.258
0.612MetAsn: 0.612 ± 0.262
0.786MetPro: 0.786 ± 0.236
1.049MetGln: 1.049 ± 0.259
0.786MetArg: 0.786 ± 0.289
1.049MetSer: 1.049 ± 0.357
1.485MetThr: 1.485 ± 0.309
1.136MetVal: 1.136 ± 0.289
0.087MetTrp: 0.087 ± 0.095
0.699MetTyr: 0.699 ± 0.245
0.0MetXaa: 0.0 ± 0.0
Asn
4.194AsnAla: 4.194 ± 0.931
0.262AsnCys: 0.262 ± 0.121
1.748AsnAsp: 1.748 ± 0.313
2.097AsnGlu: 2.097 ± 0.406
1.748AsnPhe: 1.748 ± 0.403
2.01AsnGly: 2.01 ± 0.432
0.699AsnHis: 0.699 ± 0.237
3.146AsnIle: 3.146 ± 0.551
3.058AsnLys: 3.058 ± 0.477
2.884AsnLeu: 2.884 ± 0.488
0.699AsnMet: 0.699 ± 0.253
1.922AsnAsn: 1.922 ± 0.68
2.709AsnPro: 2.709 ± 0.669
1.049AsnGln: 1.049 ± 0.281
2.359AsnArg: 2.359 ± 0.477
2.447AsnSer: 2.447 ± 0.5
2.796AsnThr: 2.796 ± 0.515
1.311AsnVal: 1.311 ± 0.429
0.35AsnTrp: 0.35 ± 0.161
1.223AsnTyr: 1.223 ± 0.317
0.0AsnXaa: 0.0 ± 0.0
Pro
1.398ProAla: 1.398 ± 0.327
0.262ProCys: 0.262 ± 0.154
3.058ProAsp: 3.058 ± 0.715
1.922ProGlu: 1.922 ± 0.368
2.097ProPhe: 2.097 ± 0.513
0.786ProGly: 0.786 ± 0.296
0.437ProHis: 0.437 ± 0.189
1.835ProIle: 1.835 ± 0.345
2.621ProLys: 2.621 ± 0.601
4.806ProLeu: 4.806 ± 0.581
0.786ProMet: 0.786 ± 0.267
1.136ProAsn: 1.136 ± 0.311
1.311ProPro: 1.311 ± 0.363
5.592ProGln: 5.592 ± 4.137
1.835ProArg: 1.835 ± 0.417
2.796ProSer: 2.796 ± 0.409
1.485ProThr: 1.485 ± 0.385
1.748ProVal: 1.748 ± 0.303
0.786ProTrp: 0.786 ± 0.275
1.398ProTyr: 1.398 ± 0.351
0.0ProXaa: 0.0 ± 0.0
Gln
2.971GlnAla: 2.971 ± 0.531
0.262GlnCys: 0.262 ± 0.131
2.359GlnAsp: 2.359 ± 0.362
4.282GlnGlu: 4.282 ± 0.742
1.398GlnPhe: 1.398 ± 0.341
6.117GlnGly: 6.117 ± 4.604
0.524GlnHis: 0.524 ± 0.233
2.097GlnIle: 2.097 ± 0.464
3.67GlnLys: 3.67 ± 0.61
3.321GlnLeu: 3.321 ± 0.504
1.398GlnMet: 1.398 ± 0.38
1.573GlnAsn: 1.573 ± 0.354
1.136GlnPro: 1.136 ± 0.302
1.66GlnGln: 1.66 ± 0.379
2.709GlnArg: 2.709 ± 0.504
2.534GlnSer: 2.534 ± 0.448
3.757GlnThr: 3.757 ± 2.1
1.922GlnVal: 1.922 ± 0.444
0.786GlnTrp: 0.786 ± 0.26
1.573GlnTyr: 1.573 ± 0.378
0.0GlnXaa: 0.0 ± 0.0
Arg
2.884ArgAla: 2.884 ± 0.464
0.437ArgCys: 0.437 ± 0.182
3.495ArgAsp: 3.495 ± 0.577
4.544ArgGlu: 4.544 ± 0.8
3.146ArgPhe: 3.146 ± 0.477
3.67ArgGly: 3.67 ± 0.58
0.612ArgHis: 0.612 ± 0.25
3.932ArgIle: 3.932 ± 0.674
3.408ArgLys: 3.408 ± 0.565
5.33ArgLeu: 5.33 ± 0.715
1.311ArgMet: 1.311 ± 0.284
1.485ArgAsn: 1.485 ± 0.475
1.485ArgPro: 1.485 ± 0.385
2.884ArgGln: 2.884 ± 0.453
3.67ArgArg: 3.67 ± 0.871
3.321ArgSer: 3.321 ± 0.587
2.621ArgThr: 2.621 ± 0.53
3.146ArgVal: 3.146 ± 0.532
1.66ArgTrp: 1.66 ± 0.325
2.097ArgTyr: 2.097 ± 0.445
0.0ArgXaa: 0.0 ± 0.0
Ser
4.194SerAla: 4.194 ± 0.626
0.961SerCys: 0.961 ± 0.388
5.243SerAsp: 5.243 ± 0.636
4.719SerGlu: 4.719 ± 0.6
3.67SerPhe: 3.67 ± 0.54
4.107SerGly: 4.107 ± 0.531
1.922SerHis: 1.922 ± 0.375
4.456SerIle: 4.456 ± 0.692
5.767SerLys: 5.767 ± 0.814
5.855SerLeu: 5.855 ± 0.566
0.961SerMet: 0.961 ± 0.312
1.835SerAsn: 1.835 ± 0.409
2.884SerPro: 2.884 ± 0.43
3.495SerGln: 3.495 ± 0.547
2.447SerArg: 2.447 ± 0.443
6.117SerSer: 6.117 ± 1.002
3.583SerThr: 3.583 ± 0.587
4.719SerVal: 4.719 ± 0.548
0.437SerTrp: 0.437 ± 0.187
1.398SerTyr: 1.398 ± 0.358
0.0SerXaa: 0.0 ± 0.0
Thr
3.495ThrAla: 3.495 ± 0.6
0.087ThrCys: 0.087 ± 0.094
2.097ThrAsp: 2.097 ± 0.446
4.107ThrGlu: 4.107 ± 0.642
2.796ThrPhe: 2.796 ± 0.476
5.068ThrGly: 5.068 ± 2.067
0.874ThrHis: 0.874 ± 0.239
3.146ThrIle: 3.146 ± 0.656
5.418ThrLys: 5.418 ± 0.704
5.156ThrLeu: 5.156 ± 0.62
1.049ThrMet: 1.049 ± 0.427
1.835ThrAsn: 1.835 ± 0.417
3.058ThrPro: 3.058 ± 0.532
2.534ThrGln: 2.534 ± 0.598
3.233ThrArg: 3.233 ± 0.336
3.845ThrSer: 3.845 ± 0.67
2.621ThrThr: 2.621 ± 0.566
3.146ThrVal: 3.146 ± 0.55
0.437ThrTrp: 0.437 ± 0.181
1.573ThrTyr: 1.573 ± 0.351
0.0ThrXaa: 0.0 ± 0.0
Val
2.884ValAla: 2.884 ± 0.606
0.874ValCys: 0.874 ± 0.279
4.456ValAsp: 4.456 ± 0.598
4.544ValGlu: 4.544 ± 0.516
2.534ValPhe: 2.534 ± 0.422
3.67ValGly: 3.67 ± 0.683
0.961ValHis: 0.961 ± 0.376
2.971ValIle: 2.971 ± 0.481
5.068ValLys: 5.068 ± 0.828
5.68ValLeu: 5.68 ± 0.72
1.66ValMet: 1.66 ± 0.419
1.922ValAsn: 1.922 ± 0.423
1.922ValPro: 1.922 ± 0.378
1.748ValGln: 1.748 ± 0.39
3.233ValArg: 3.233 ± 0.706
5.855ValSer: 5.855 ± 0.791
2.097ValThr: 2.097 ± 0.381
2.971ValVal: 2.971 ± 0.614
0.874ValTrp: 0.874 ± 0.258
1.485ValTyr: 1.485 ± 0.34
0.0ValXaa: 0.0 ± 0.0
Trp
0.961TrpAla: 0.961 ± 0.256
0.0TrpCys: 0.0 ± 0.0
1.049TrpAsp: 1.049 ± 0.302
1.223TrpGlu: 1.223 ± 0.299
0.961TrpPhe: 0.961 ± 0.279
0.612TrpGly: 0.612 ± 0.225
0.262TrpHis: 0.262 ± 0.136
1.66TrpIle: 1.66 ± 0.339
1.223TrpLys: 1.223 ± 0.276
1.573TrpLeu: 1.573 ± 0.413
0.437TrpMet: 0.437 ± 0.18
0.524TrpAsn: 0.524 ± 0.221
0.0TrpPro: 0.0 ± 0.0
0.262TrpGln: 0.262 ± 0.151
0.437TrpArg: 0.437 ± 0.184
0.961TrpSer: 0.961 ± 0.322
0.612TrpThr: 0.612 ± 0.23
0.961TrpVal: 0.961 ± 0.299
0.437TrpTrp: 0.437 ± 0.209
0.961TrpTyr: 0.961 ± 0.28
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.922TyrAla: 1.922 ± 0.361
0.437TyrCys: 0.437 ± 0.172
1.66TyrAsp: 1.66 ± 0.307
2.097TyrGlu: 2.097 ± 0.483
0.699TyrPhe: 0.699 ± 0.259
2.097TyrGly: 2.097 ± 0.458
0.786TyrHis: 0.786 ± 0.239
2.359TyrIle: 2.359 ± 0.519
2.01TyrLys: 2.01 ± 0.403
3.321TyrLeu: 3.321 ± 0.554
0.437TyrMet: 0.437 ± 0.228
1.398TyrAsn: 1.398 ± 0.409
1.311TyrPro: 1.311 ± 0.428
1.049TyrGln: 1.049 ± 0.263
2.796TyrArg: 2.796 ± 0.574
2.097TyrSer: 2.097 ± 0.403
2.01TyrThr: 2.01 ± 0.379
1.223TyrVal: 1.223 ± 0.284
0.262TyrTrp: 0.262 ± 0.226
1.311TyrTyr: 1.311 ± 0.485
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 44 proteins (11445 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski