Amino acid dipeptide frequency for Pectobacterium phage Peat1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
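The calculation described above can be sketched in Python as follows. This is a minimal illustration, not the code used by the database: the function name `dipeptide_per_mille`, the toy sequences, and the choice to normalize by the number of overlapping residue pairs are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# One-letter codes in the same residue order as the table;
# "X" stands in for Xaa (non-standard or unknown residues).
ALPHABET = "ACDEFGHIKLMNPQRSTVWYX"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Dipeptides are counted as overlapping pairs within each protein, so a
    pair never spans two proteins. Letters outside the 20 standard codes
    are mapped to "X". Normalizing by the total number of pairs is an
    assumption of this sketch.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in ALPHABET else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    all_pairs = (a + b for a, b in product(ALPHABET, repeat=2))
    if total == 0:
        return {pair: 0.0 for pair in all_pairs}
    return {pair: 1000.0 * counts[pair] / total for pair in all_pairs}


# Uniform expectation over the 400 standard dipeptides: 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400

toy = ["MAAGL", "MKAA"]  # hypothetical sequences, not from the proteome
freqs = dipeptide_per_mille(toy)

# Dipeptides above the uniform expectation (the ones the table marks in red).
over = sorted(pair for pair, f in freqs.items() if f > EXPECTED_PER_MILLE)
```

With real proteome data, comparing each value against 2.5‰ in this way separates overrepresented dipeptides from underrepresented ones.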

For more information see sequence space article on Wikipedia.

Dipeptide frequency (‰) ± error, grouped by first residue. Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 13.266 ± 1.105
AlaCys: 0.847 ± 0.269
AlaAsp: 5.222 ± 0.564
AlaGlu: 5.786 ± 0.787
AlaPhe: 2.893 ± 0.558
AlaGly: 7.127 ± 0.727
AlaHis: 1.976 ± 0.469
AlaIle: 3.175 ± 0.437
AlaLys: 4.516 ± 0.569
AlaLeu: 9.455 ± 0.831
AlaMet: 2.258 ± 0.401
AlaAsn: 3.74 ± 0.633
AlaPro: 3.246 ± 0.435
AlaGln: 6.209 ± 0.849
AlaArg: 4.375 ± 0.545
AlaSer: 6.492 ± 0.916
AlaThr: 5.433 ± 0.88
AlaVal: 7.409 ± 0.689
AlaTrp: 1.129 ± 0.257
AlaTyr: 3.599 ± 0.536
AlaXaa: 0.141 ± 0.114

Cys
CysAla: 0.635 ± 0.239
CysCys: 0.071 ± 0.065
CysAsp: 0.847 ± 0.292
CysGlu: 0.353 ± 0.123
CysPhe: 0.282 ± 0.14
CysGly: 0.706 ± 0.227
CysHis: 0.282 ± 0.126
CysIle: 0.776 ± 0.197
CysLys: 0.282 ± 0.15
CysLeu: 0.706 ± 0.21
CysMet: 0.423 ± 0.172
CysAsn: 0.494 ± 0.221
CysPro: 0.635 ± 0.232
CysGln: 0.423 ± 0.156
CysArg: 0.706 ± 0.227
CysSer: 1.058 ± 0.275
CysThr: 1.2 ± 0.31
CysVal: 0.847 ± 0.234
CysTrp: 0.353 ± 0.153
CysTyr: 0.847 ± 0.264
CysXaa: 0.0 ± 0.0

Asp
AspAla: 6.703 ± 0.665
AspCys: 0.212 ± 0.113
AspAsp: 3.81 ± 0.522
AspGlu: 3.105 ± 0.424
AspPhe: 1.764 ± 0.268
AspGly: 4.093 ± 0.516
AspHis: 0.635 ± 0.2
AspIle: 4.093 ± 0.431
AspLys: 2.54 ± 0.595
AspLeu: 4.869 ± 0.514
AspMet: 2.329 ± 0.336
AspAsn: 2.822 ± 0.394
AspPro: 2.187 ± 0.342
AspGln: 1.2 ± 0.405
AspArg: 2.611 ± 0.456
AspSer: 4.234 ± 0.549
AspThr: 4.445 ± 0.417
AspVal: 4.516 ± 0.536
AspTrp: 1.482 ± 0.28
AspTyr: 2.046 ± 0.388
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.363 ± 0.627
GluCys: 0.564 ± 0.253
GluAsp: 3.669 ± 0.564
GluGlu: 3.458 ± 0.716
GluPhe: 2.822 ± 0.489
GluGly: 2.681 ± 0.351
GluHis: 1.058 ± 0.29
GluIle: 1.976 ± 0.491
GluLys: 2.399 ± 0.387
GluLeu: 5.433 ± 0.482
GluMet: 1.411 ± 0.316
GluAsn: 1.835 ± 0.336
GluPro: 1.058 ± 0.297
GluGln: 3.599 ± 0.539
GluArg: 2.117 ± 0.413
GluSer: 2.822 ± 0.404
GluThr: 2.681 ± 0.404
GluVal: 3.81 ± 0.523
GluTrp: 0.564 ± 0.225
GluTyr: 2.399 ± 0.433
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.964 ± 0.392
PheCys: 0.282 ± 0.15
PheAsp: 2.329 ± 0.412
PheGlu: 1.27 ± 0.268
PhePhe: 0.917 ± 0.256
PheGly: 2.54 ± 0.479
PheHis: 0.423 ± 0.196
PheIle: 1.693 ± 0.331
PheLys: 1.835 ± 0.363
PheLeu: 2.187 ± 0.342
PheMet: 0.423 ± 0.143
PheAsn: 1.482 ± 0.455
PhePro: 1.129 ± 0.278
PheGln: 1.482 ± 0.294
PheArg: 1.482 ± 0.335
PheSer: 1.976 ± 0.365
PheThr: 1.482 ± 0.387
PheVal: 2.258 ± 0.376
PheTrp: 0.212 ± 0.109
PheTyr: 0.847 ± 0.272
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 7.056 ± 0.7
GlyCys: 1.482 ± 0.525
GlyAsp: 4.022 ± 0.59
GlyGlu: 2.964 ± 0.408
GlyPhe: 2.399 ± 0.305
GlyGly: 5.363 ± 0.705
GlyHis: 0.988 ± 0.289
GlyIle: 5.08 ± 0.666
GlyLys: 3.81 ± 0.599
GlyLeu: 6.492 ± 0.556
GlyMet: 1.835 ± 0.281
GlyAsn: 3.316 ± 0.586
GlyPro: 1.129 ± 0.346
GlyGln: 2.399 ± 0.438
GlyArg: 3.951 ± 0.612
GlySer: 5.222 ± 0.588
GlyThr: 6.986 ± 0.813
GlyVal: 6.633 ± 0.605
GlyTrp: 0.847 ± 0.277
GlyTyr: 3.74 ± 0.65
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.552 ± 0.368
HisCys: 0.564 ± 0.203
HisAsp: 0.706 ± 0.263
HisGlu: 1.129 ± 0.319
HisPhe: 0.494 ± 0.178
HisGly: 1.341 ± 0.382
HisHis: 0.494 ± 0.221
HisIle: 1.411 ± 0.345
HisLys: 0.776 ± 0.32
HisLeu: 1.623 ± 0.343
HisMet: 0.564 ± 0.223
HisAsn: 0.635 ± 0.24
HisPro: 1.2 ± 0.275
HisGln: 0.776 ± 0.276
HisArg: 1.058 ± 0.291
HisSer: 0.988 ± 0.274
HisThr: 0.776 ± 0.222
HisVal: 1.341 ± 0.295
HisTrp: 0.494 ± 0.175
HisTyr: 0.564 ± 0.228
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.163 ± 0.412
IleCys: 0.564 ± 0.193
IleAsp: 3.599 ± 0.594
IleGlu: 2.329 ± 0.434
IlePhe: 0.847 ± 0.214
IleGly: 2.964 ± 0.464
IleHis: 0.776 ± 0.285
IleIle: 2.258 ± 0.392
IleLys: 2.822 ± 0.457
IleLeu: 4.093 ± 0.633
IleMet: 0.917 ± 0.234
IleAsn: 2.399 ± 0.542
IlePro: 2.117 ± 0.309
IleGln: 2.258 ± 0.364
IleArg: 1.693 ± 0.332
IleSer: 2.964 ± 0.411
IleThr: 4.587 ± 0.633
IleVal: 2.611 ± 0.42
IleTrp: 0.564 ± 0.289
IleTyr: 1.2 ± 0.282
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.587 ± 0.782
LysCys: 0.423 ± 0.142
LysAsp: 3.316 ± 0.431
LysGlu: 3.387 ± 0.551
LysPhe: 0.494 ± 0.176
LysGly: 2.752 ± 0.4
LysHis: 0.988 ± 0.281
LysIle: 1.411 ± 0.34
LysLys: 2.187 ± 0.514
LysLeu: 4.869 ± 0.551
LysMet: 0.776 ± 0.286
LysAsn: 1.2 ± 0.329
LysPro: 2.046 ± 0.319
LysGln: 2.47 ± 0.459
LysArg: 3.034 ± 0.506
LysSer: 2.54 ± 0.374
LysThr: 1.905 ± 0.377
LysVal: 3.316 ± 0.514
LysTrp: 0.776 ± 0.333
LysTyr: 2.47 ± 0.487
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 7.691 ± 0.873
LeuCys: 0.917 ± 0.255
LeuAsp: 4.798 ± 0.581
LeuGlu: 5.292 ± 0.663
LeuPhe: 2.187 ± 0.364
LeuGly: 6.492 ± 0.837
LeuHis: 1.905 ± 0.433
LeuIle: 3.458 ± 0.595
LeuLys: 3.387 ± 0.505
LeuLeu: 7.621 ± 0.765
LeuMet: 2.258 ± 0.314
LeuAsn: 4.728 ± 0.633
LeuPro: 4.939 ± 0.565
LeuGln: 3.528 ± 0.6
LeuArg: 5.574 ± 0.847
LeuSer: 6.492 ± 0.742
LeuThr: 5.433 ± 0.661
LeuVal: 6.844 ± 0.561
LeuTrp: 0.564 ± 0.226
LeuTyr: 2.964 ± 0.414
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.399 ± 0.429
MetCys: 0.282 ± 0.16
MetAsp: 1.058 ± 0.255
MetGlu: 0.988 ± 0.243
MetPhe: 0.988 ± 0.232
MetGly: 2.046 ± 0.348
MetHis: 0.635 ± 0.221
MetIle: 0.917 ± 0.251
MetLys: 0.635 ± 0.189
MetLeu: 2.329 ± 0.438
MetMet: 0.494 ± 0.151
MetAsn: 1.2 ± 0.262
MetPro: 1.27 ± 0.456
MetGln: 1.835 ± 0.43
MetArg: 2.117 ± 0.385
MetSer: 1.764 ± 0.422
MetThr: 1.552 ± 0.334
MetVal: 1.835 ± 0.395
MetTrp: 0.212 ± 0.102
MetTyr: 1.129 ± 0.263
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.893 ± 0.47
AsnCys: 0.564 ± 0.242
AsnAsp: 1.764 ± 0.288
AsnGlu: 1.905 ± 0.393
AsnPhe: 1.341 ± 0.311
AsnGly: 4.093 ± 0.603
AsnHis: 0.564 ± 0.204
AsnIle: 1.905 ± 0.436
AsnLys: 2.611 ± 0.362
AsnLeu: 4.445 ± 0.771
AsnMet: 1.27 ± 0.312
AsnAsn: 2.399 ± 0.482
AsnPro: 2.187 ± 0.447
AsnGln: 2.329 ± 0.486
AsnArg: 2.47 ± 0.388
AsnSer: 2.964 ± 0.753
AsnThr: 3.951 ± 0.551
AsnVal: 3.034 ± 0.457
AsnTrp: 0.635 ± 0.2
AsnTyr: 0.564 ± 0.205
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 4.304 ± 0.511
ProCys: 0.212 ± 0.114
ProAsp: 3.74 ± 0.579
ProGlu: 3.387 ± 0.555
ProPhe: 0.635 ± 0.226
ProGly: 2.681 ± 0.352
ProHis: 0.423 ± 0.19
ProIle: 1.552 ± 0.342
ProLys: 1.693 ± 0.39
ProLeu: 2.822 ± 0.501
ProMet: 1.058 ± 0.255
ProAsn: 0.988 ± 0.205
ProPro: 1.552 ± 0.421
ProGln: 1.2 ± 0.303
ProArg: 1.411 ± 0.328
ProSer: 2.752 ± 0.432
ProThr: 2.964 ± 0.539
ProVal: 3.81 ± 0.447
ProTrp: 0.847 ± 0.288
ProTyr: 1.623 ± 0.267
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 6.28 ± 0.763
GlnCys: 0.353 ± 0.172
GlnAsp: 2.822 ± 0.421
GlnGlu: 2.187 ± 0.449
GlnPhe: 1.764 ± 0.368
GlnGly: 3.881 ± 0.639
GlnHis: 1.341 ± 0.364
GlnIle: 1.835 ± 0.368
GlnLys: 1.835 ± 0.395
GlnLeu: 4.022 ± 0.561
GlnMet: 1.058 ± 0.312
GlnAsn: 2.54 ± 0.538
GlnPro: 1.341 ± 0.277
GlnGln: 3.246 ± 0.772
GlnArg: 2.964 ± 0.563
GlnSer: 3.034 ± 0.529
GlnThr: 2.046 ± 0.414
GlnVal: 3.105 ± 0.403
GlnTrp: 0.423 ± 0.156
GlnTyr: 2.822 ± 0.451
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.163 ± 0.401
ArgCys: 0.847 ± 0.259
ArgAsp: 3.175 ± 0.468
ArgGlu: 3.175 ± 0.427
ArgPhe: 1.411 ± 0.351
ArgGly: 3.599 ± 0.493
ArgHis: 1.27 ± 0.288
ArgIle: 2.964 ± 0.523
ArgLys: 2.611 ± 0.39
ArgLeu: 3.387 ± 0.456
ArgMet: 2.117 ± 0.429
ArgAsn: 2.681 ± 0.49
ArgPro: 1.764 ± 0.414
ArgGln: 2.329 ± 0.377
ArgArg: 4.445 ± 0.466
ArgSer: 3.246 ± 0.648
ArgThr: 3.881 ± 0.591
ArgVal: 4.304 ± 0.535
ArgTrp: 0.917 ± 0.222
ArgTyr: 2.329 ± 0.405
ArgXaa: 0.141 ± 0.117

Ser
SerAla: 7.621 ± 0.839
SerCys: 0.917 ± 0.252
SerAsp: 3.387 ± 0.415
SerGlu: 2.046 ± 0.312
SerPhe: 1.835 ± 0.395
SerGly: 6.351 ± 0.855
SerHis: 0.706 ± 0.269
SerIle: 3.387 ± 0.542
SerLys: 3.387 ± 0.637
SerLeu: 5.574 ± 0.525
SerMet: 2.329 ± 0.384
SerAsn: 2.681 ± 0.582
SerPro: 2.187 ± 0.354
SerGln: 2.258 ± 0.438
SerArg: 3.316 ± 0.46
SerSer: 4.375 ± 0.597
SerThr: 5.08 ± 0.655
SerVal: 6.351 ± 0.797
SerTrp: 0.988 ± 0.301
SerTyr: 1.552 ± 0.311
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 7.621 ± 0.932
ThrCys: 0.706 ± 0.248
ThrAsp: 3.81 ± 0.506
ThrGlu: 3.458 ± 0.425
ThrPhe: 1.411 ± 0.279
ThrGly: 7.268 ± 0.976
ThrHis: 1.482 ± 0.289
ThrIle: 2.399 ± 0.434
ThrLys: 3.246 ± 0.448
ThrLeu: 5.574 ± 0.629
ThrMet: 0.847 ± 0.202
ThrAsn: 2.752 ± 0.561
ThrPro: 3.81 ± 0.58
ThrGln: 2.258 ± 0.452
ThrArg: 2.893 ± 0.512
ThrSer: 4.728 ± 0.603
ThrThr: 4.093 ± 1.035
ThrVal: 5.786 ± 0.894
ThrTrp: 0.706 ± 0.188
ThrTyr: 2.54 ± 0.43
ThrXaa: 0.0 ± 0.0

Val
ValAla: 6.209 ± 0.6
ValCys: 0.988 ± 0.252
ValAsp: 4.375 ± 0.513
ValGlu: 2.964 ± 0.456
ValPhe: 2.893 ± 0.379
ValGly: 5.857 ± 0.586
ValHis: 1.693 ± 0.302
ValIle: 2.752 ± 0.417
ValLys: 2.964 ± 0.542
ValLeu: 6.844 ± 0.761
ValMet: 1.835 ± 0.367
ValAsn: 3.458 ± 0.555
ValPro: 3.528 ± 0.478
ValGln: 6.421 ± 0.734
ValArg: 4.445 ± 0.534
ValSer: 4.798 ± 0.672
ValThr: 5.222 ± 0.649
ValVal: 5.574 ± 0.791
ValTrp: 0.917 ± 0.232
ValTyr: 3.316 ± 0.556
ValXaa: 0.071 ± 0.075

Trp
TrpAla: 0.847 ± 0.227
TrpCys: 0.212 ± 0.104
TrpAsp: 0.635 ± 0.245
TrpGlu: 0.847 ± 0.224
TrpPhe: 0.635 ± 0.255
TrpGly: 1.341 ± 0.386
TrpHis: 0.141 ± 0.086
TrpIle: 0.423 ± 0.172
TrpLys: 0.282 ± 0.164
TrpLeu: 1.552 ± 0.361
TrpMet: 0.282 ± 0.123
TrpAsn: 0.706 ± 0.26
TrpPro: 0.494 ± 0.207
TrpGln: 0.706 ± 0.262
TrpArg: 0.776 ± 0.198
TrpSer: 0.706 ± 0.206
TrpThr: 0.635 ± 0.229
TrpVal: 1.058 ± 0.277
TrpTrp: 0.141 ± 0.107
TrpTyr: 1.411 ± 0.364
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.976 ± 0.393
TyrCys: 0.917 ± 0.29
TyrAsp: 2.611 ± 0.339
TyrGlu: 1.693 ± 0.316
TyrPhe: 1.2 ± 0.29
TyrGly: 2.54 ± 0.525
TyrHis: 0.706 ± 0.217
TyrIle: 2.258 ± 0.401
TyrLys: 1.2 ± 0.346
TyrLeu: 3.175 ± 0.555
TyrMet: 1.058 ± 0.288
TyrAsn: 1.835 ± 0.535
TyrPro: 1.835 ± 0.366
TyrGln: 1.976 ± 0.374
TyrArg: 3.316 ± 0.522
TyrSer: 3.105 ± 0.504
TyrThr: 2.964 ± 0.522
TyrVal: 2.54 ± 0.433
TyrTrp: 1.058 ± 0.349
TyrTyr: 1.341 ± 0.419
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.071 ± 0.074
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.071 ± 0.076
XaaThr: 0.071 ± 0.071
XaaVal: 0.141 ± 0.115
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 61 proteins (14173 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
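Bootstrapping at the protein level means resampling whole proteins with replacement, recomputing the statistic on each resampled set, and summarizing the spread of the results. The sketch below illustrates the idea for a single dipeptide. The function names, the toy sequences, and the use of the sample standard deviation as the error measure are assumptions of this example; the note above does not say which summary the database uses.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of a dipeptide frequency under protein-level resampling.

    Each round draws len(proteins) whole proteins with replacement and
    recomputes the frequency. The standard deviation across rounds is
    returned (an assumed choice of error measure).
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


toy = ["MAAGL", "MKAA", "AAK"]  # hypothetical sequences
err = bootstrap_error(toy, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.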

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski