Amino acid dipepetide frequency for Xylella phage Cota

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.195AlaAla: 12.195 ± 2.053
1.284AlaCys: 1.284 ± 0.427
6.499AlaAsp: 6.499 ± 0.775
8.424AlaGlu: 8.424 ± 0.924
2.648AlaPhe: 2.648 ± 0.56
7.702AlaGly: 7.702 ± 1.201
2.006AlaHis: 2.006 ± 0.37
4.092AlaIle: 4.092 ± 0.678
4.092AlaLys: 4.092 ± 0.651
8.585AlaLeu: 8.585 ± 1.02
3.049AlaMet: 3.049 ± 0.645
4.092AlaAsn: 4.092 ± 0.681
3.61AlaPro: 3.61 ± 0.408
4.814AlaGln: 4.814 ± 0.812
6.499AlaArg: 6.499 ± 0.777
5.135AlaSer: 5.135 ± 0.703
6.017AlaThr: 6.017 ± 0.533
6.739AlaVal: 6.739 ± 0.512
1.845AlaTrp: 1.845 ± 0.453
4.814AlaTyr: 4.814 ± 0.554
0.0AlaXaa: 0.0 ± 0.0
Cys
0.562CysAla: 0.562 ± 0.268
0.0CysCys: 0.0 ± 0.0
0.802CysAsp: 0.802 ± 0.362
0.401CysGlu: 0.401 ± 0.213
0.241CysPhe: 0.241 ± 0.144
0.883CysGly: 0.883 ± 0.372
0.0CysHis: 0.0 ± 0.0
0.321CysIle: 0.321 ± 0.191
0.481CysLys: 0.481 ± 0.287
0.722CysLeu: 0.722 ± 0.242
0.562CysMet: 0.562 ± 0.206
0.722CysAsn: 0.722 ± 0.282
0.16CysPro: 0.16 ± 0.132
0.321CysGln: 0.321 ± 0.169
0.481CysArg: 0.481 ± 0.258
0.481CysSer: 0.481 ± 0.208
0.642CysThr: 0.642 ± 0.334
0.562CysVal: 0.562 ± 0.244
0.08CysTrp: 0.08 ± 0.068
0.08CysTyr: 0.08 ± 0.081
0.0CysXaa: 0.0 ± 0.0
Asp
6.418AspAla: 6.418 ± 0.716
0.562AspCys: 0.562 ± 0.241
4.493AspAsp: 4.493 ± 0.541
3.45AspGlu: 3.45 ± 0.585
2.487AspPhe: 2.487 ± 0.539
6.258AspGly: 6.258 ± 0.79
0.722AspHis: 0.722 ± 0.216
3.53AspIle: 3.53 ± 0.435
3.289AspLys: 3.289 ± 0.524
4.974AspLeu: 4.974 ± 0.609
1.926AspMet: 1.926 ± 0.443
2.166AspAsn: 2.166 ± 0.33
3.209AspPro: 3.209 ± 0.402
1.845AspGln: 1.845 ± 0.414
4.172AspArg: 4.172 ± 0.579
3.129AspSer: 3.129 ± 0.588
3.129AspThr: 3.129 ± 0.636
4.814AspVal: 4.814 ± 0.542
0.802AspTrp: 0.802 ± 0.227
2.407AspTyr: 2.407 ± 0.478
0.0AspXaa: 0.0 ± 0.0
Glu
7.782GluAla: 7.782 ± 0.916
0.642GluCys: 0.642 ± 0.218
3.771GluAsp: 3.771 ± 0.515
3.931GluGlu: 3.931 ± 0.54
2.888GluPhe: 2.888 ± 0.56
3.931GluGly: 3.931 ± 0.439
1.364GluHis: 1.364 ± 0.335
2.648GluIle: 2.648 ± 0.518
3.129GluLys: 3.129 ± 0.543
4.734GluLeu: 4.734 ± 0.749
1.765GluMet: 1.765 ± 0.346
2.246GluAsn: 2.246 ± 0.412
2.006GluPro: 2.006 ± 0.392
4.252GluGln: 4.252 ± 0.475
4.413GluArg: 4.413 ± 0.77
3.049GluSer: 3.049 ± 0.567
3.45GluThr: 3.45 ± 0.606
4.172GluVal: 4.172 ± 0.769
1.203GluTrp: 1.203 ± 0.467
3.129GluTyr: 3.129 ± 0.458
0.0GluXaa: 0.0 ± 0.0
Phe
3.45PheAla: 3.45 ± 0.558
0.08PheCys: 0.08 ± 0.079
2.808PheAsp: 2.808 ± 0.534
2.567PheGlu: 2.567 ± 0.523
1.043PhePhe: 1.043 ± 0.285
3.53PheGly: 3.53 ± 0.688
0.481PheHis: 0.481 ± 0.22
1.685PheIle: 1.685 ± 0.355
1.364PheLys: 1.364 ± 0.366
2.246PheLeu: 2.246 ± 0.304
0.562PheMet: 0.562 ± 0.241
1.685PheAsn: 1.685 ± 0.29
1.284PhePro: 1.284 ± 0.287
1.524PheGln: 1.524 ± 0.361
2.086PheArg: 2.086 ± 0.473
1.926PheSer: 1.926 ± 0.411
2.808PheThr: 2.808 ± 0.551
2.888PheVal: 2.888 ± 0.537
0.241PheTrp: 0.241 ± 0.148
0.802PheTyr: 0.802 ± 0.26
0.0PheXaa: 0.0 ± 0.0
Gly
7.702GlyAla: 7.702 ± 1.008
0.642GlyCys: 0.642 ± 0.269
5.696GlyAsp: 5.696 ± 0.717
4.894GlyGlu: 4.894 ± 0.624
3.129GlyPhe: 3.129 ± 0.59
6.98GlyGly: 6.98 ± 0.777
1.444GlyHis: 1.444 ± 0.381
3.289GlyIle: 3.289 ± 0.453
5.375GlyLys: 5.375 ± 0.528
5.375GlyLeu: 5.375 ± 0.705
2.728GlyMet: 2.728 ± 0.682
3.209GlyAsn: 3.209 ± 0.495
2.086GlyPro: 2.086 ± 0.482
2.888GlyGln: 2.888 ± 0.514
5.215GlyArg: 5.215 ± 0.778
4.814GlySer: 4.814 ± 0.78
5.375GlyThr: 5.375 ± 0.905
4.894GlyVal: 4.894 ± 0.823
1.284GlyTrp: 1.284 ± 0.346
2.407GlyTyr: 2.407 ± 0.381
0.0GlyXaa: 0.0 ± 0.0
His
1.685HisAla: 1.685 ± 0.302
0.321HisCys: 0.321 ± 0.195
1.284HisAsp: 1.284 ± 0.269
0.883HisGlu: 0.883 ± 0.265
0.802HisPhe: 0.802 ± 0.336
1.123HisGly: 1.123 ± 0.316
0.642HisHis: 0.642 ± 0.245
0.963HisIle: 0.963 ± 0.256
1.043HisLys: 1.043 ± 0.367
0.802HisLeu: 0.802 ± 0.281
0.722HisMet: 0.722 ± 0.281
0.642HisAsn: 0.642 ± 0.308
1.845HisPro: 1.845 ± 0.432
0.642HisGln: 0.642 ± 0.186
1.043HisArg: 1.043 ± 0.354
0.963HisSer: 0.963 ± 0.235
0.963HisThr: 0.963 ± 0.337
1.123HisVal: 1.123 ± 0.341
0.321HisTrp: 0.321 ± 0.159
0.401HisTyr: 0.401 ± 0.198
0.0HisXaa: 0.0 ± 0.0
Ile
5.696IleAla: 5.696 ± 0.584
0.321IleCys: 0.321 ± 0.156
3.45IleAsp: 3.45 ± 0.424
3.851IleGlu: 3.851 ± 0.551
1.043IlePhe: 1.043 ± 0.35
3.45IleGly: 3.45 ± 0.428
1.284IleHis: 1.284 ± 0.288
2.728IleIle: 2.728 ± 0.323
2.888IleLys: 2.888 ± 0.455
2.006IleLeu: 2.006 ± 0.508
1.203IleMet: 1.203 ± 0.315
1.845IleAsn: 1.845 ± 0.366
2.246IlePro: 2.246 ± 0.443
2.166IleGln: 2.166 ± 0.399
2.808IleArg: 2.808 ± 0.471
2.487IleSer: 2.487 ± 0.487
4.012IleThr: 4.012 ± 0.527
3.129IleVal: 3.129 ± 0.494
0.802IleTrp: 0.802 ± 0.248
0.802IleTyr: 0.802 ± 0.264
0.0IleXaa: 0.0 ± 0.0
Lys
4.252LysAla: 4.252 ± 0.787
0.241LysCys: 0.241 ± 0.153
2.728LysAsp: 2.728 ± 0.389
3.289LysGlu: 3.289 ± 0.576
1.605LysPhe: 1.605 ± 0.25
2.969LysGly: 2.969 ± 0.427
1.123LysHis: 1.123 ± 0.312
2.567LysIle: 2.567 ± 0.587
1.845LysLys: 1.845 ± 0.702
5.536LysLeu: 5.536 ± 0.619
1.043LysMet: 1.043 ± 0.249
2.327LysAsn: 2.327 ± 0.43
2.086LysPro: 2.086 ± 0.485
1.685LysGln: 1.685 ± 0.391
3.771LysArg: 3.771 ± 0.65
2.567LysSer: 2.567 ± 0.47
2.086LysThr: 2.086 ± 0.453
3.049LysVal: 3.049 ± 0.468
1.123LysTrp: 1.123 ± 0.353
1.765LysTyr: 1.765 ± 0.393
0.0LysXaa: 0.0 ± 0.0
Leu
8.665LeuAla: 8.665 ± 1.012
0.802LeuCys: 0.802 ± 0.31
5.375LeuAsp: 5.375 ± 0.653
3.931LeuGlu: 3.931 ± 0.566
2.327LeuPhe: 2.327 ± 0.375
5.536LeuGly: 5.536 ± 0.708
1.605LeuHis: 1.605 ± 0.437
3.851LeuIle: 3.851 ± 0.673
2.166LeuLys: 2.166 ± 0.361
8.184LeuLeu: 8.184 ± 0.857
1.685LeuMet: 1.685 ± 0.286
3.289LeuAsn: 3.289 ± 0.474
3.691LeuPro: 3.691 ± 0.473
3.61LeuGln: 3.61 ± 0.575
6.82LeuArg: 6.82 ± 0.827
5.215LeuSer: 5.215 ± 0.645
5.456LeuThr: 5.456 ± 0.606
4.734LeuVal: 4.734 ± 0.805
0.802LeuTrp: 0.802 ± 0.242
1.765LeuTyr: 1.765 ± 0.335
0.0LeuXaa: 0.0 ± 0.0
Met
2.648MetAla: 2.648 ± 0.455
0.0MetCys: 0.0 ± 0.0
1.765MetAsp: 1.765 ± 0.294
1.685MetGlu: 1.685 ± 0.307
0.883MetPhe: 0.883 ± 0.256
1.845MetGly: 1.845 ± 0.544
0.241MetHis: 0.241 ± 0.119
1.123MetIle: 1.123 ± 0.285
1.123MetLys: 1.123 ± 0.319
2.327MetLeu: 2.327 ± 0.429
0.722MetMet: 0.722 ± 0.257
1.043MetAsn: 1.043 ± 0.281
1.043MetPro: 1.043 ± 0.277
0.963MetGln: 0.963 ± 0.296
2.327MetArg: 2.327 ± 0.422
2.246MetSer: 2.246 ± 0.477
2.086MetThr: 2.086 ± 0.344
1.444MetVal: 1.444 ± 0.378
0.321MetTrp: 0.321 ± 0.175
1.043MetTyr: 1.043 ± 0.328
0.0MetXaa: 0.0 ± 0.0
Asn
3.37AsnAla: 3.37 ± 0.527
0.481AsnCys: 0.481 ± 0.238
2.648AsnAsp: 2.648 ± 0.396
1.605AsnGlu: 1.605 ± 0.331
1.364AsnPhe: 1.364 ± 0.355
3.45AsnGly: 3.45 ± 0.436
0.642AsnHis: 0.642 ± 0.278
2.166AsnIle: 2.166 ± 0.548
2.246AsnLys: 2.246 ± 0.386
4.012AsnLeu: 4.012 ± 0.381
1.123AsnMet: 1.123 ± 0.315
2.166AsnAsn: 2.166 ± 0.48
2.006AsnPro: 2.006 ± 0.412
1.685AsnGln: 1.685 ± 0.372
2.888AsnArg: 2.888 ± 0.474
2.246AsnSer: 2.246 ± 0.402
2.166AsnThr: 2.166 ± 0.421
1.926AsnVal: 1.926 ± 0.352
0.963AsnTrp: 0.963 ± 0.329
1.845AsnTyr: 1.845 ± 0.295
0.0AsnXaa: 0.0 ± 0.0
Pro
3.289ProAla: 3.289 ± 0.614
0.321ProCys: 0.321 ± 0.152
3.61ProAsp: 3.61 ± 0.429
4.092ProGlu: 4.092 ± 0.72
1.685ProPhe: 1.685 ± 0.317
2.808ProGly: 2.808 ± 0.5
0.642ProHis: 0.642 ± 0.206
2.006ProIle: 2.006 ± 0.473
2.166ProLys: 2.166 ± 0.54
2.808ProLeu: 2.808 ± 0.753
0.562ProMet: 0.562 ± 0.201
1.926ProAsn: 1.926 ± 0.503
1.685ProPro: 1.685 ± 0.482
1.524ProGln: 1.524 ± 0.377
2.166ProArg: 2.166 ± 0.302
2.327ProSer: 2.327 ± 0.46
2.648ProThr: 2.648 ± 0.599
2.648ProVal: 2.648 ± 0.693
0.562ProTrp: 0.562 ± 0.237
1.605ProTyr: 1.605 ± 0.333
0.0ProXaa: 0.0 ± 0.0
Gln
4.974GlnAla: 4.974 ± 0.843
0.321GlnCys: 0.321 ± 0.225
1.444GlnAsp: 1.444 ± 0.283
2.487GlnGlu: 2.487 ± 0.472
1.284GlnPhe: 1.284 ± 0.393
3.45GlnGly: 3.45 ± 0.388
0.883GlnHis: 0.883 ± 0.282
2.728GlnIle: 2.728 ± 0.611
1.605GlnLys: 1.605 ± 0.395
4.012GlnLeu: 4.012 ± 0.734
1.364GlnMet: 1.364 ± 0.422
2.246GlnAsn: 2.246 ± 0.436
1.845GlnPro: 1.845 ± 0.306
2.246GlnGln: 2.246 ± 0.563
3.53GlnArg: 3.53 ± 0.502
2.166GlnSer: 2.166 ± 0.489
2.086GlnThr: 2.086 ± 0.428
2.246GlnVal: 2.246 ± 0.418
0.722GlnTrp: 0.722 ± 0.246
1.364GlnTyr: 1.364 ± 0.375
0.0GlnXaa: 0.0 ± 0.0
Arg
7.461ArgAla: 7.461 ± 0.917
0.481ArgCys: 0.481 ± 0.274
3.61ArgAsp: 3.61 ± 0.588
4.413ArgGlu: 4.413 ± 0.612
2.327ArgPhe: 2.327 ± 0.482
4.894ArgGly: 4.894 ± 0.739
0.802ArgHis: 0.802 ± 0.298
4.012ArgIle: 4.012 ± 0.669
3.931ArgLys: 3.931 ± 0.689
4.734ArgLeu: 4.734 ± 0.687
2.728ArgMet: 2.728 ± 0.461
3.049ArgAsn: 3.049 ± 0.551
2.166ArgPro: 2.166 ± 0.484
2.327ArgGln: 2.327 ± 0.562
4.172ArgArg: 4.172 ± 0.661
3.37ArgSer: 3.37 ± 0.579
2.888ArgThr: 2.888 ± 0.625
4.573ArgVal: 4.573 ± 0.606
1.364ArgTrp: 1.364 ± 0.314
2.728ArgTyr: 2.728 ± 0.471
0.0ArgXaa: 0.0 ± 0.0
Ser
6.579SerAla: 6.579 ± 0.828
0.481SerCys: 0.481 ± 0.203
3.53SerAsp: 3.53 ± 0.514
2.808SerGlu: 2.808 ± 0.623
1.926SerPhe: 1.926 ± 0.445
5.616SerGly: 5.616 ± 0.757
1.123SerHis: 1.123 ± 0.326
3.129SerIle: 3.129 ± 0.606
2.648SerLys: 2.648 ± 0.407
4.573SerLeu: 4.573 ± 0.643
1.203SerMet: 1.203 ± 0.33
1.765SerAsn: 1.765 ± 0.467
2.808SerPro: 2.808 ± 0.573
1.765SerGln: 1.765 ± 0.283
2.969SerArg: 2.969 ± 0.571
2.969SerSer: 2.969 ± 0.508
3.37SerThr: 3.37 ± 0.518
3.691SerVal: 3.691 ± 0.573
0.883SerTrp: 0.883 ± 0.316
1.685SerTyr: 1.685 ± 0.601
0.0SerXaa: 0.0 ± 0.0
Thr
5.295ThrAla: 5.295 ± 0.613
0.722ThrCys: 0.722 ± 0.224
3.049ThrAsp: 3.049 ± 0.477
3.771ThrGlu: 3.771 ± 0.606
2.246ThrPhe: 2.246 ± 0.267
5.777ThrGly: 5.777 ± 0.759
0.802ThrHis: 0.802 ± 0.255
2.086ThrIle: 2.086 ± 0.422
3.45ThrLys: 3.45 ± 0.569
4.252ThrLeu: 4.252 ± 0.844
1.364ThrMet: 1.364 ± 0.332
2.327ThrAsn: 2.327 ± 0.517
3.049ThrPro: 3.049 ± 0.673
3.049ThrGln: 3.049 ± 0.473
3.61ThrArg: 3.61 ± 0.697
3.61ThrSer: 3.61 ± 0.644
4.332ThrThr: 4.332 ± 0.797
4.974ThrVal: 4.974 ± 0.899
1.203ThrTrp: 1.203 ± 0.276
1.845ThrTyr: 1.845 ± 0.42
0.0ThrXaa: 0.0 ± 0.0
Val
7.06ValAla: 7.06 ± 0.678
0.401ValCys: 0.401 ± 0.196
3.61ValAsp: 3.61 ± 0.549
4.734ValGlu: 4.734 ± 0.775
2.327ValPhe: 2.327 ± 0.41
5.616ValGly: 5.616 ± 0.89
1.364ValHis: 1.364 ± 0.411
2.728ValIle: 2.728 ± 0.629
2.808ValLys: 2.808 ± 0.498
4.413ValLeu: 4.413 ± 0.563
1.444ValMet: 1.444 ± 0.295
2.327ValAsn: 2.327 ± 0.502
2.567ValPro: 2.567 ± 0.532
2.567ValGln: 2.567 ± 0.52
4.332ValArg: 4.332 ± 0.652
4.172ValSer: 4.172 ± 0.969
4.734ValThr: 4.734 ± 0.55
5.215ValVal: 5.215 ± 0.624
1.123ValTrp: 1.123 ± 0.267
2.567ValTyr: 2.567 ± 0.575
0.0ValXaa: 0.0 ± 0.0
Trp
1.284TrpAla: 1.284 ± 0.294
0.241TrpCys: 0.241 ± 0.134
0.883TrpAsp: 0.883 ± 0.332
1.043TrpGlu: 1.043 ± 0.327
1.284TrpPhe: 1.284 ± 0.34
0.802TrpGly: 0.802 ± 0.29
0.562TrpHis: 0.562 ± 0.24
0.642TrpIle: 0.642 ± 0.207
0.722TrpLys: 0.722 ± 0.266
2.006TrpLeu: 2.006 ± 0.356
0.241TrpMet: 0.241 ± 0.124
0.642TrpAsn: 0.642 ± 0.274
0.241TrpPro: 0.241 ± 0.145
1.123TrpGln: 1.123 ± 0.383
1.364TrpArg: 1.364 ± 0.341
0.883TrpSer: 0.883 ± 0.277
0.802TrpThr: 0.802 ± 0.305
0.722TrpVal: 0.722 ± 0.227
0.321TrpTrp: 0.321 ± 0.176
0.401TrpTyr: 0.401 ± 0.2
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.771TyrAla: 3.771 ± 0.553
0.16TyrCys: 0.16 ± 0.116
2.728TyrAsp: 2.728 ± 0.463
2.246TyrGlu: 2.246 ± 0.356
1.524TyrPhe: 1.524 ± 0.341
2.808TyrGly: 2.808 ± 0.408
0.481TyrHis: 0.481 ± 0.208
1.926TyrIle: 1.926 ± 0.375
1.284TyrLys: 1.284 ± 0.337
3.209TyrLeu: 3.209 ± 0.544
0.802TyrMet: 0.802 ± 0.264
1.284TyrAsn: 1.284 ± 0.259
1.444TyrPro: 1.444 ± 0.415
2.006TyrGln: 2.006 ± 0.329
1.444TyrArg: 1.444 ± 0.308
1.765TyrSer: 1.765 ± 0.397
1.765TyrThr: 1.765 ± 0.397
2.567TyrVal: 2.567 ± 0.537
0.241TyrTrp: 0.241 ± 0.133
0.883TyrTyr: 0.883 ± 0.307
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (12465 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski