Amino acid dipeptide frequency for Xylella phage Sano

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
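As an illustration of how such frequencies can be obtained, here is a minimal Python sketch. It is not the code used by the database: the function name `dipeptide_per_mille`, the use of one-letter codes (with `X` standing for Xaa), and the choice to normalize by the total number of dipeptides counted are all assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Dipeptides are overlapping adjacent pairs counted inside each protein,
    never across protein boundaries. Any non-standard letter is mapped to
    'X' (Xaa). Normalizing by the total pair count is an assumption.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the phage proteome).
freqs = dipeptide_per_mille(["MAAK", "MAB"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

In the toy input the letter `B` is not a standard amino acid, so the pair `AB` is counted as `AX`.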

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions. On this scale, the uniform expectation of 0.25% corresponds to 2.5‰.
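The unit conversion and the comparison with the uniform expectation can be sketched in a few lines of Python. The helper names are hypothetical, and comparing against a flat 1/400 expectation (ignoring Xaa and the reported errors) is an assumption of this sketch, not necessarily the rule used to color the table.

```python
# Uniform expectation over the 20 standard amino acids (Xaa ignored).
N_STANDARD = 20
EXPECTED_FRACTION = 1 / N_STANDARD ** 2        # 0.0025, i.e. 0.25%
EXPECTED_PER_MILLE = EXPECTED_FRACTION * 1000  # 2.5 per mille


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def fold_over_expected(value_per_mille):
    """Ratio of an observed per mille value to the uniform expectation."""
    return value_per_mille / EXPECTED_PER_MILLE


# Two values copied from the table (per mille).
ala_ala = 14.967
tyr_cys = 0.056

print(per_mille_to_fraction(ala_ala))  # AlaAla as a fraction
print(fold_over_expected(ala_ala))     # above 1: overrepresented
print(fold_over_expected(tyr_cys))     # below 1: underrepresented
```

Under these assumptions AlaAla is roughly 6 times more frequent than the uniform expectation, while TyrCys reaches only about 0.02 times that expectation.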

For more information see sequence space article on Wikipedia.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 14.967 ± 2.331 | 0.957 ± 0.252 | 6.47 ± 0.672 | 6.47 ± 0.78 | 4.164 ± 0.477 | 6.864 ± 0.775 | 1.913 ± 0.409 | 4.501 ± 0.569 | 5.514 ± 0.761 | 8.44 ± 0.714 | 3.207 ± 0.528 | 3.151 ± 0.606 | 3.713 ± 0.462 | 4.614 ± 0.969 | 7.033 ± 0.684 | 6.302 ± 0.693 | 4.895 ± 0.577 | 6.414 ± 0.766 | 1.632 ± 0.258 | 2.532 ± 0.36 | 0.0 ± 0.0 |
| Cys | 0.9 ± 0.223 | 0.0 ± 0.0 | 0.675 ± 0.212 | 0.506 ± 0.171 | 0.338 ± 0.134 | 0.731 ± 0.197 | 0.281 ± 0.114 | 0.113 ± 0.067 | 0.45 ± 0.154 | 0.506 ± 0.16 | 0.281 ± 0.122 | 0.225 ± 0.105 | 0.45 ± 0.17 | 0.113 ± 0.108 | 0.394 ± 0.16 | 0.338 ± 0.124 | 0.338 ± 0.15 | 0.675 ± 0.193 | 0.169 ± 0.101 | 0.225 ± 0.142 | 0.0 ± 0.0 |
| Asp | 6.864 ± 0.605 | 0.45 ± 0.155 | 5.233 ± 0.723 | 5.683 ± 0.789 | 2.419 ± 0.429 | 6.527 ± 0.627 | 1.238 ± 0.294 | 4.276 ± 0.434 | 2.982 ± 0.398 | 4.445 ± 0.46 | 1.8 ± 0.327 | 2.082 ± 0.323 | 2.588 ± 0.402 | 1.519 ± 0.3 | 3.826 ± 0.509 | 3.488 ± 0.437 | 3.095 ± 0.454 | 3.77 ± 0.429 | 1.125 ± 0.269 | 2.363 ± 0.349 | 0.0 ± 0.0 |
| Glu | 6.47 ± 0.915 | 0.338 ± 0.197 | 3.545 ± 0.484 | 5.514 ± 0.763 | 2.588 ± 0.42 | 4.951 ± 0.512 | 1.35 ± 0.326 | 3.095 ± 0.423 | 4.501 ± 0.588 | 6.696 ± 0.695 | 2.588 ± 0.341 | 2.419 ± 0.367 | 2.138 ± 0.33 | 3.713 ± 0.519 | 4.22 ± 0.601 | 3.151 ± 0.505 | 3.432 ± 0.429 | 3.77 ± 0.483 | 1.125 ± 0.308 | 1.463 ± 0.256 | 0.0 ± 0.0 |
| Phe | 3.882 ± 0.484 | 0.281 ± 0.124 | 2.926 ± 0.468 | 2.926 ± 0.357 | 1.125 ± 0.291 | 3.488 ± 0.428 | 0.506 ± 0.211 | 2.082 ± 0.39 | 1.913 ± 0.301 | 2.532 ± 0.288 | 0.788 ± 0.253 | 2.194 ± 0.38 | 1.744 ± 0.342 | 1.125 ± 0.247 | 2.138 ± 0.299 | 2.307 ± 0.386 | 1.913 ± 0.359 | 2.701 ± 0.434 | 0.563 ± 0.162 | 1.069 ± 0.219 | 0.0 ± 0.0 |
| Gly | 7.427 ± 0.661 | 0.844 ± 0.244 | 5.739 ± 0.598 | 5.008 ± 0.497 | 3.207 ± 0.432 | 5.852 ± 0.628 | 0.957 ± 0.217 | 3.32 ± 0.365 | 3.939 ± 0.486 | 5.008 ± 0.427 | 2.307 ± 0.336 | 2.982 ± 0.507 | 3.376 ± 0.393 | 2.532 ± 0.364 | 4.67 ± 0.594 | 4.389 ± 0.581 | 4.051 ± 0.766 | 5.57 ± 0.561 | 1.8 ± 0.385 | 2.813 ± 0.408 | 0.0 ± 0.0 |
| His | 2.082 ± 0.431 | 0.281 ± 0.108 | 1.182 ± 0.283 | 1.069 ± 0.269 | 0.844 ± 0.199 | 1.519 ± 0.37 | 0.338 ± 0.139 | 0.957 ± 0.244 | 0.957 ± 0.218 | 1.125 ± 0.319 | 0.563 ± 0.204 | 1.013 ± 0.275 | 0.9 ± 0.286 | 0.225 ± 0.114 | 1.407 ± 0.229 | 1.069 ± 0.278 | 1.013 ± 0.214 | 1.632 ± 0.308 | 0.394 ± 0.135 | 0.788 ± 0.241 | 0.0 ± 0.0 |
| Ile | 4.614 ± 0.503 | 0.45 ± 0.144 | 3.263 ± 0.434 | 3.77 ± 0.434 | 1.575 ± 0.303 | 3.601 ± 0.472 | 0.675 ± 0.203 | 2.138 ± 0.345 | 2.082 ± 0.335 | 3.095 ± 0.405 | 1.069 ± 0.24 | 2.307 ± 0.307 | 1.8 ± 0.306 | 1.519 ± 0.313 | 3.545 ± 0.436 | 2.082 ± 0.375 | 3.207 ± 0.515 | 3.095 ± 0.392 | 0.225 ± 0.121 | 1.182 ± 0.24 | 0.0 ± 0.0 |
| Lys | 5.514 ± 0.971 | 0.506 ± 0.208 | 2.363 ± 0.427 | 3.77 ± 0.554 | 1.744 ± 0.244 | 3.376 ± 0.364 | 1.069 ± 0.248 | 1.744 ± 0.395 | 5.289 ± 0.806 | 4.164 ± 0.531 | 1.688 ± 0.378 | 1.8 ± 0.32 | 2.926 ± 0.387 | 2.419 ± 0.426 | 4.389 ± 0.499 | 2.87 ± 0.411 | 2.701 ± 0.432 | 3.77 ± 0.387 | 1.125 ± 0.317 | 1.463 ± 0.299 | 0.0 ± 0.0 |
| Leu | 9.171 ± 1.117 | 0.675 ± 0.209 | 6.133 ± 0.679 | 4.839 ± 0.69 | 2.926 ± 0.499 | 5.345 ± 0.578 | 2.082 ± 0.429 | 3.263 ± 0.389 | 4.276 ± 0.617 | 6.808 ± 0.615 | 1.632 ± 0.301 | 3.038 ± 0.641 | 3.207 ± 0.364 | 2.757 ± 0.377 | 6.133 ± 0.648 | 5.345 ± 0.401 | 4.67 ± 0.417 | 5.514 ± 0.604 | 1.238 ± 0.318 | 1.857 ± 0.354 | 0.0 ± 0.0 |
| Met | 2.532 ± 0.402 | 0.113 ± 0.078 | 1.069 ± 0.256 | 2.082 ± 0.316 | 1.125 ± 0.242 | 1.688 ± 0.3 | 0.675 ± 0.166 | 1.238 ± 0.281 | 1.294 ± 0.244 | 2.644 ± 0.42 | 0.9 ± 0.193 | 0.844 ± 0.216 | 1.069 ± 0.273 | 1.182 ± 0.256 | 2.082 ± 0.426 | 2.982 ± 0.466 | 2.419 ± 0.362 | 1.125 ± 0.355 | 0.338 ± 0.127 | 0.506 ± 0.202 | 0.0 ± 0.0 |
| Asn | 3.77 ± 0.461 | 0.169 ± 0.104 | 2.701 ± 0.386 | 2.476 ± 0.313 | 1.125 ± 0.287 | 3.545 ± 0.428 | 0.506 ± 0.183 | 1.519 ± 0.374 | 1.463 ± 0.311 | 2.757 ± 0.588 | 0.9 ± 0.176 | 1.8 ± 0.278 | 2.194 ± 0.413 | 0.563 ± 0.194 | 2.194 ± 0.308 | 2.082 ± 0.315 | 2.251 ± 0.625 | 3.207 ± 0.5 | 0.844 ± 0.25 | 1.407 ± 0.299 | 0.0 ± 0.0 |
| Pro | 3.488 ± 0.435 | 0.281 ± 0.123 | 2.588 ± 0.336 | 2.926 ± 0.531 | 2.082 ± 0.429 | 4.107 ± 0.625 | 1.069 ± 0.265 | 1.463 ± 0.334 | 2.588 ± 0.424 | 3.038 ± 0.436 | 1.519 ± 0.32 | 1.407 ± 0.348 | 3.095 ± 0.542 | 1.913 ± 0.26 | 2.363 ± 0.321 | 2.644 ± 0.4 | 2.532 ± 0.378 | 2.644 ± 0.441 | 1.069 ± 0.237 | 1.575 ± 0.303 | 0.0 ± 0.0 |
| Gln | 4.895 ± 0.987 | 0.281 ± 0.123 | 1.125 ± 0.293 | 1.575 ± 0.443 | 1.294 ± 0.238 | 2.476 ± 0.433 | 0.844 ± 0.224 | 1.969 ± 0.382 | 2.532 ± 0.458 | 3.488 ± 0.573 | 1.013 ± 0.208 | 1.463 ± 0.364 | 0.844 ± 0.252 | 2.532 ± 0.755 | 3.545 ± 0.532 | 1.688 ± 0.427 | 1.125 ± 0.289 | 2.588 ± 0.343 | 0.619 ± 0.197 | 1.294 ± 0.264 | 0.0 ± 0.0 |
| Arg | 5.964 ± 0.756 | 0.394 ± 0.135 | 4.726 ± 0.632 | 4.839 ± 0.445 | 3.095 ± 0.458 | 4.164 ± 0.402 | 1.125 ± 0.246 | 3.601 ± 0.464 | 3.826 ± 0.373 | 5.683 ± 0.716 | 2.419 ± 0.321 | 2.644 ± 0.355 | 3.545 ± 0.54 | 1.575 ± 0.446 | 4.951 ± 0.719 | 3.263 ± 0.356 | 3.151 ± 0.398 | 5.458 ± 0.579 | 1.238 ± 0.314 | 1.913 ± 0.274 | 0.0 ± 0.0 |
| Ser | 5.064 ± 0.621 | 0.394 ± 0.118 | 3.488 ± 0.435 | 3.263 ± 0.444 | 2.082 ± 0.357 | 4.839 ± 0.526 | 0.844 ± 0.222 | 2.363 ± 0.301 | 3.207 ± 0.399 | 5.064 ± 0.507 | 1.182 ± 0.28 | 2.138 ± 0.36 | 2.87 ± 0.473 | 1.463 ± 0.308 | 4.107 ± 0.512 | 3.32 ± 0.533 | 3.77 ± 0.541 | 2.813 ± 0.441 | 1.125 ± 0.244 | 2.251 ± 0.339 | 0.0 ± 0.0 |
| Thr | 5.176 ± 0.614 | 0.506 ± 0.162 | 3.657 ± 0.492 | 3.601 ± 0.483 | 2.251 ± 0.368 | 4.951 ± 0.708 | 1.182 ± 0.295 | 2.926 ± 0.459 | 2.926 ± 0.429 | 4.726 ± 0.401 | 1.294 ± 0.254 | 1.688 ± 0.369 | 2.757 ± 0.409 | 1.857 ± 0.259 | 2.757 ± 0.441 | 2.476 ± 0.382 | 3.488 ± 0.453 | 4.783 ± 0.5 | 0.957 ± 0.252 | 1.35 ± 0.301 | 0.0 ± 0.0 |
| Val | 6.639 ± 0.756 | 0.506 ± 0.182 | 5.176 ± 0.482 | 5.008 ± 0.526 | 2.644 ± 0.354 | 4.557 ± 0.51 | 1.238 ± 0.272 | 2.363 ± 0.31 | 2.644 ± 0.419 | 5.964 ± 0.479 | 1.688 ± 0.32 | 2.194 ± 0.351 | 3.545 ± 0.482 | 3.151 ± 0.362 | 4.557 ± 0.457 | 3.432 ± 0.445 | 4.164 ± 0.631 | 5.008 ± 0.629 | 0.957 ± 0.261 | 2.87 ± 0.427 | 0.0 ± 0.0 |
| Trp | 1.519 ± 0.329 | 0.225 ± 0.125 | 1.125 ± 0.256 | 0.394 ± 0.119 | 0.563 ± 0.201 | 1.238 ± 0.265 | 0.731 ± 0.206 | 0.675 ± 0.196 | 1.069 ± 0.243 | 2.026 ± 0.387 | 0.45 ± 0.159 | 0.844 ± 0.245 | 0.506 ± 0.172 | 1.013 ± 0.226 | 1.125 ± 0.277 | 0.9 ± 0.236 | 0.675 ± 0.196 | 1.407 ± 0.304 | 0.394 ± 0.135 | 0.506 ± 0.176 | 0.0 ± 0.0 |
| Tyr | 2.813 ± 0.308 | 0.056 ± 0.047 | 2.588 ± 0.396 | 1.407 ± 0.228 | 1.013 ± 0.206 | 1.969 ± 0.347 | 0.731 ± 0.161 | 1.688 ± 0.265 | 1.238 ± 0.273 | 2.701 ± 0.397 | 0.563 ± 0.205 | 1.238 ± 0.256 | 1.125 ± 0.274 | 1.407 ± 0.307 | 2.138 ± 0.314 | 1.519 ± 0.312 | 2.419 ± 0.435 | 2.363 ± 0.302 | 0.45 ± 0.169 | 0.844 ± 0.192 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 77 proteins (17774 amino acids)

Note: The error has been estimated by bootstrapping (100 rounds) at the protein level.
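A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread is summarized as a sample standard deviation. The function names, the seed parameter, and the choice of standard deviation as the error measure are hypothetical and not taken from the database.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the dipeptide frequency over protein-level resamples.

    Each round draws len(proteins) whole proteins with replacement, so
    residues from the same protein always stay together.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences in one-letter code).
toy = ["MAAK", "MKAA", "MGGS", "AAAA"]
print(pair_per_mille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual residues keeps the dependence between dipeptides of the same protein, which is why a proteome with few proteins tends to show relatively large errors.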

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski