Amino acid dipepetide frequency for Yersinia phage YeP1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.709AlaAla: 9.709 ± 1.211
0.83AlaCys: 0.83 ± 0.326
5.726AlaAsp: 5.726 ± 0.612
5.892AlaGlu: 5.892 ± 0.745
3.319AlaPhe: 3.319 ± 0.697
7.717AlaGly: 7.717 ± 0.976
1.162AlaHis: 1.162 ± 0.28
5.477AlaIle: 5.477 ± 0.735
4.896AlaLys: 4.896 ± 0.637
8.796AlaLeu: 8.796 ± 0.795
3.734AlaMet: 3.734 ± 0.544
3.07AlaAsn: 3.07 ± 0.671
2.904AlaPro: 2.904 ± 0.498
3.153AlaGln: 3.153 ± 0.493
4.564AlaArg: 4.564 ± 0.62
4.813AlaSer: 4.813 ± 0.697
6.141AlaThr: 6.141 ± 0.949
5.477AlaVal: 5.477 ± 0.601
1.245AlaTrp: 1.245 ± 0.294
2.406AlaTyr: 2.406 ± 0.47
0.0AlaXaa: 0.0 ± 0.0
Cys
1.079CysAla: 1.079 ± 0.367
0.166CysCys: 0.166 ± 0.102
0.747CysAsp: 0.747 ± 0.253
0.747CysGlu: 0.747 ± 0.295
0.166CysPhe: 0.166 ± 0.112
0.996CysGly: 0.996 ± 0.344
0.332CysHis: 0.332 ± 0.154
0.664CysIle: 0.664 ± 0.205
0.83CysLys: 0.83 ± 0.313
0.996CysLeu: 0.996 ± 0.384
0.249CysMet: 0.249 ± 0.139
0.498CysAsn: 0.498 ± 0.193
0.83CysPro: 0.83 ± 0.263
0.747CysGln: 0.747 ± 0.189
1.162CysArg: 1.162 ± 0.297
0.332CysSer: 0.332 ± 0.197
0.664CysThr: 0.664 ± 0.259
0.747CysVal: 0.747 ± 0.239
0.415CysTrp: 0.415 ± 0.206
0.415CysTyr: 0.415 ± 0.228
0.0CysXaa: 0.0 ± 0.0
Asp
4.315AspAla: 4.315 ± 0.61
0.664AspCys: 0.664 ± 0.229
4.315AspAsp: 4.315 ± 0.672
4.564AspGlu: 4.564 ± 0.594
2.323AspPhe: 2.323 ± 0.429
5.477AspGly: 5.477 ± 0.785
0.747AspHis: 0.747 ± 0.252
3.402AspIle: 3.402 ± 0.582
2.904AspLys: 2.904 ± 0.477
5.975AspLeu: 5.975 ± 0.645
1.411AspMet: 1.411 ± 0.345
2.157AspAsn: 2.157 ± 0.363
3.485AspPro: 3.485 ± 0.554
1.577AspGln: 1.577 ± 0.41
2.489AspArg: 2.489 ± 0.458
3.651AspSer: 3.651 ± 0.578
3.402AspThr: 3.402 ± 0.562
3.651AspVal: 3.651 ± 0.451
0.747AspTrp: 0.747 ± 0.232
1.743AspTyr: 1.743 ± 0.321
0.0AspXaa: 0.0 ± 0.0
Glu
5.394GluAla: 5.394 ± 0.76
1.079GluCys: 1.079 ± 0.339
2.821GluAsp: 2.821 ± 0.396
3.485GluGlu: 3.485 ± 0.654
2.489GluPhe: 2.489 ± 0.437
2.821GluGly: 2.821 ± 0.424
0.996GluHis: 0.996 ± 0.248
3.485GluIle: 3.485 ± 0.524
2.987GluLys: 2.987 ± 0.512
7.302GluLeu: 7.302 ± 0.696
1.909GluMet: 1.909 ± 0.345
2.572GluAsn: 2.572 ± 0.372
2.323GluPro: 2.323 ± 0.511
3.153GluGln: 3.153 ± 0.676
3.402GluArg: 3.402 ± 0.591
4.232GluSer: 4.232 ± 0.815
2.157GluThr: 2.157 ± 0.42
4.979GluVal: 4.979 ± 0.591
0.83GluTrp: 0.83 ± 0.294
2.157GluTyr: 2.157 ± 0.378
0.0GluXaa: 0.0 ± 0.0
Phe
1.826PheAla: 1.826 ± 0.444
0.415PheCys: 0.415 ± 0.184
2.987PheAsp: 2.987 ± 0.472
1.743PheGlu: 1.743 ± 0.319
1.245PhePhe: 1.245 ± 0.375
3.07PheGly: 3.07 ± 0.424
0.664PheHis: 0.664 ± 0.202
2.323PheIle: 2.323 ± 0.418
2.904PheLys: 2.904 ± 0.663
2.406PheLeu: 2.406 ± 0.423
1.66PheMet: 1.66 ± 0.479
2.075PheAsn: 2.075 ± 0.352
1.411PhePro: 1.411 ± 0.312
0.913PheGln: 0.913 ± 0.215
1.245PheArg: 1.245 ± 0.293
2.406PheSer: 2.406 ± 0.38
2.572PheThr: 2.572 ± 0.421
2.323PheVal: 2.323 ± 0.621
0.498PheTrp: 0.498 ± 0.213
1.577PheTyr: 1.577 ± 0.392
0.0PheXaa: 0.0 ± 0.0
Gly
5.975GlyAla: 5.975 ± 0.874
0.664GlyCys: 0.664 ± 0.216
4.73GlyAsp: 4.73 ± 0.503
5.477GlyGlu: 5.477 ± 0.497
3.9GlyPhe: 3.9 ± 0.416
6.39GlyGly: 6.39 ± 0.867
1.328GlyHis: 1.328 ± 0.373
4.73GlyIle: 4.73 ± 0.745
2.987GlyLys: 2.987 ± 0.518
6.058GlyLeu: 6.058 ± 0.864
2.323GlyMet: 2.323 ± 0.449
3.153GlyAsn: 3.153 ± 0.47
1.909GlyPro: 1.909 ± 0.389
2.075GlyGln: 2.075 ± 0.423
4.149GlyArg: 4.149 ± 0.714
3.568GlySer: 3.568 ± 0.553
4.066GlyThr: 4.066 ± 0.768
5.809GlyVal: 5.809 ± 0.681
1.826GlyTrp: 1.826 ± 0.39
2.821GlyTyr: 2.821 ± 0.545
0.0GlyXaa: 0.0 ± 0.0
His
1.328HisAla: 1.328 ± 0.302
0.166HisCys: 0.166 ± 0.118
1.162HisAsp: 1.162 ± 0.563
0.913HisGlu: 0.913 ± 0.271
0.498HisPhe: 0.498 ± 0.229
1.826HisGly: 1.826 ± 0.365
0.415HisHis: 0.415 ± 0.208
1.079HisIle: 1.079 ± 0.305
0.83HisLys: 0.83 ± 0.234
1.66HisLeu: 1.66 ± 0.31
0.498HisMet: 0.498 ± 0.242
0.747HisAsn: 0.747 ± 0.291
0.498HisPro: 0.498 ± 0.262
0.415HisGln: 0.415 ± 0.171
0.996HisArg: 0.996 ± 0.28
0.664HisSer: 0.664 ± 0.269
0.913HisThr: 0.913 ± 0.3
1.245HisVal: 1.245 ± 0.262
0.332HisTrp: 0.332 ± 0.147
0.581HisTyr: 0.581 ± 0.263
0.0HisXaa: 0.0 ± 0.0
Ile
5.892IleAla: 5.892 ± 0.844
0.913IleCys: 0.913 ± 0.277
3.319IleAsp: 3.319 ± 0.569
3.983IleGlu: 3.983 ± 0.663
1.411IlePhe: 1.411 ± 0.438
4.979IleGly: 4.979 ± 0.685
0.498IleHis: 0.498 ± 0.192
3.07IleIle: 3.07 ± 0.527
3.983IleLys: 3.983 ± 0.636
4.066IleLeu: 4.066 ± 0.643
1.411IleMet: 1.411 ± 0.291
3.153IleAsn: 3.153 ± 0.557
3.07IlePro: 3.07 ± 0.5
1.66IleGln: 1.66 ± 0.371
2.904IleArg: 2.904 ± 0.419
4.896IleSer: 4.896 ± 0.576
5.062IleThr: 5.062 ± 0.682
3.651IleVal: 3.651 ± 0.585
0.581IleTrp: 0.581 ± 0.193
2.406IleTyr: 2.406 ± 0.419
0.0IleXaa: 0.0 ± 0.0
Lys
5.477LysAla: 5.477 ± 0.729
0.581LysCys: 0.581 ± 0.232
3.402LysAsp: 3.402 ± 0.561
3.9LysGlu: 3.9 ± 0.681
1.743LysPhe: 1.743 ± 0.296
3.568LysGly: 3.568 ± 0.66
1.079LysHis: 1.079 ± 0.273
2.987LysIle: 2.987 ± 0.494
3.07LysLys: 3.07 ± 0.652
4.979LysLeu: 4.979 ± 0.684
1.577LysMet: 1.577 ± 0.337
2.904LysAsn: 2.904 ± 0.47
2.987LysPro: 2.987 ± 0.496
2.572LysGln: 2.572 ± 0.416
3.236LysArg: 3.236 ± 0.629
3.734LysSer: 3.734 ± 0.701
2.904LysThr: 2.904 ± 0.628
2.904LysVal: 2.904 ± 0.568
0.996LysTrp: 0.996 ± 0.317
0.913LysTyr: 0.913 ± 0.227
0.0LysXaa: 0.0 ± 0.0
Leu
8.63LeuAla: 8.63 ± 0.656
1.411LeuCys: 1.411 ± 0.475
4.647LeuAsp: 4.647 ± 0.552
5.394LeuGlu: 5.394 ± 0.7
2.323LeuPhe: 2.323 ± 0.523
4.647LeuGly: 4.647 ± 0.677
1.162LeuHis: 1.162 ± 0.311
5.394LeuIle: 5.394 ± 0.602
6.472LeuLys: 6.472 ± 0.91
7.551LeuLeu: 7.551 ± 0.877
2.157LeuMet: 2.157 ± 0.456
4.398LeuAsn: 4.398 ± 0.432
5.394LeuPro: 5.394 ± 0.857
2.406LeuGln: 2.406 ± 0.356
5.726LeuArg: 5.726 ± 0.592
6.887LeuSer: 6.887 ± 0.882
5.892LeuThr: 5.892 ± 0.771
5.56LeuVal: 5.56 ± 0.714
1.245LeuTrp: 1.245 ± 0.337
2.738LeuTyr: 2.738 ± 0.523
0.0LeuXaa: 0.0 ± 0.0
Met
3.983MetAla: 3.983 ± 0.482
0.332MetCys: 0.332 ± 0.166
1.162MetAsp: 1.162 ± 0.31
0.913MetGlu: 0.913 ± 0.259
1.328MetPhe: 1.328 ± 0.334
1.826MetGly: 1.826 ± 0.401
0.664MetHis: 0.664 ± 0.219
1.411MetIle: 1.411 ± 0.323
1.328MetLys: 1.328 ± 0.306
2.821MetLeu: 2.821 ± 0.497
0.747MetMet: 0.747 ± 0.258
1.411MetAsn: 1.411 ± 0.365
1.162MetPro: 1.162 ± 0.281
1.245MetGln: 1.245 ± 0.338
1.411MetArg: 1.411 ± 0.356
1.743MetSer: 1.743 ± 0.308
1.826MetThr: 1.826 ± 0.37
1.577MetVal: 1.577 ± 0.261
0.249MetTrp: 0.249 ± 0.14
0.332MetTyr: 0.332 ± 0.174
0.0MetXaa: 0.0 ± 0.0
Asn
4.066AsnAla: 4.066 ± 0.675
0.498AsnCys: 0.498 ± 0.198
3.236AsnAsp: 3.236 ± 0.653
2.572AsnGlu: 2.572 ± 0.473
1.328AsnPhe: 1.328 ± 0.313
3.651AsnGly: 3.651 ± 0.513
0.913AsnHis: 0.913 ± 0.349
2.406AsnIle: 2.406 ± 0.445
2.987AsnLys: 2.987 ± 0.526
2.904AsnLeu: 2.904 ± 0.464
1.162AsnMet: 1.162 ± 0.359
1.494AsnAsn: 1.494 ± 0.311
2.489AsnPro: 2.489 ± 0.399
1.826AsnGln: 1.826 ± 0.42
2.323AsnArg: 2.323 ± 0.395
3.236AsnSer: 3.236 ± 0.563
2.24AsnThr: 2.24 ± 0.478
1.743AsnVal: 1.743 ± 0.324
0.498AsnTrp: 0.498 ± 0.171
0.83AsnTyr: 0.83 ± 0.248
0.0AsnXaa: 0.0 ± 0.0
Pro
3.9ProAla: 3.9 ± 0.455
0.664ProCys: 0.664 ± 0.216
3.568ProAsp: 3.568 ± 0.569
3.319ProGlu: 3.319 ± 0.594
1.743ProPhe: 1.743 ± 0.391
2.24ProGly: 2.24 ± 0.393
0.83ProHis: 0.83 ± 0.295
2.655ProIle: 2.655 ± 0.484
2.24ProLys: 2.24 ± 0.368
3.153ProLeu: 3.153 ± 0.484
1.909ProMet: 1.909 ± 0.424
2.406ProAsn: 2.406 ± 0.419
1.909ProPro: 1.909 ± 0.425
1.992ProGln: 1.992 ± 0.44
1.411ProArg: 1.411 ± 0.338
3.817ProSer: 3.817 ± 0.655
2.738ProThr: 2.738 ± 0.396
3.485ProVal: 3.485 ± 0.581
0.249ProTrp: 0.249 ± 0.196
1.66ProTyr: 1.66 ± 0.31
0.0ProXaa: 0.0 ± 0.0
Gln
2.821GlnAla: 2.821 ± 0.604
0.498GlnCys: 0.498 ± 0.194
0.664GlnAsp: 0.664 ± 0.243
1.494GlnGlu: 1.494 ± 0.417
1.66GlnPhe: 1.66 ± 0.307
2.157GlnGly: 2.157 ± 0.479
0.498GlnHis: 0.498 ± 0.194
2.323GlnIle: 2.323 ± 0.455
1.992GlnLys: 1.992 ± 0.357
5.477GlnLeu: 5.477 ± 0.574
1.162GlnMet: 1.162 ± 0.341
1.245GlnAsn: 1.245 ± 0.323
1.826GlnPro: 1.826 ± 0.404
2.323GlnGln: 2.323 ± 0.458
2.987GlnArg: 2.987 ± 0.525
1.743GlnSer: 1.743 ± 0.396
2.655GlnThr: 2.655 ± 0.41
2.323GlnVal: 2.323 ± 0.455
0.83GlnTrp: 0.83 ± 0.235
1.079GlnTyr: 1.079 ± 0.29
0.0GlnXaa: 0.0 ± 0.0
Arg
5.228ArgAla: 5.228 ± 0.559
0.747ArgCys: 0.747 ± 0.293
2.24ArgAsp: 2.24 ± 0.529
2.655ArgGlu: 2.655 ± 0.524
2.406ArgPhe: 2.406 ± 0.441
3.983ArgGly: 3.983 ± 0.559
1.162ArgHis: 1.162 ± 0.284
3.983ArgIle: 3.983 ± 0.552
2.655ArgLys: 2.655 ± 0.46
5.477ArgLeu: 5.477 ± 0.702
1.577ArgMet: 1.577 ± 0.411
2.489ArgAsn: 2.489 ± 0.449
2.157ArgPro: 2.157 ± 0.388
1.577ArgGln: 1.577 ± 0.291
2.904ArgArg: 2.904 ± 0.483
2.821ArgSer: 2.821 ± 0.563
2.821ArgThr: 2.821 ± 0.353
4.315ArgVal: 4.315 ± 0.486
0.83ArgTrp: 0.83 ± 0.264
2.489ArgTyr: 2.489 ± 0.545
0.0ArgXaa: 0.0 ± 0.0
Ser
5.643SerAla: 5.643 ± 0.841
0.913SerCys: 0.913 ± 0.258
4.149SerAsp: 4.149 ± 0.603
3.402SerGlu: 3.402 ± 0.49
2.24SerPhe: 2.24 ± 0.43
5.975SerGly: 5.975 ± 0.651
1.245SerHis: 1.245 ± 0.347
4.481SerIle: 4.481 ± 0.739
4.232SerLys: 4.232 ± 0.762
5.975SerLeu: 5.975 ± 0.657
1.245SerMet: 1.245 ± 0.335
1.826SerAsn: 1.826 ± 0.388
2.24SerPro: 2.24 ± 0.539
2.489SerGln: 2.489 ± 0.428
3.734SerArg: 3.734 ± 0.5
3.402SerSer: 3.402 ± 0.559
2.323SerThr: 2.323 ± 0.367
4.647SerVal: 4.647 ± 0.617
0.664SerTrp: 0.664 ± 0.213
1.743SerTyr: 1.743 ± 0.39
0.0SerXaa: 0.0 ± 0.0
Thr
6.307ThrAla: 6.307 ± 0.934
0.747ThrCys: 0.747 ± 0.288
4.481ThrAsp: 4.481 ± 0.631
3.651ThrGlu: 3.651 ± 0.546
2.157ThrPhe: 2.157 ± 0.419
5.311ThrGly: 5.311 ± 0.677
1.245ThrHis: 1.245 ± 0.296
3.651ThrIle: 3.651 ± 0.49
2.157ThrLys: 2.157 ± 0.447
4.979ThrLeu: 4.979 ± 0.707
0.913ThrMet: 0.913 ± 0.224
2.323ThrAsn: 2.323 ± 0.528
3.153ThrPro: 3.153 ± 0.463
2.987ThrGln: 2.987 ± 0.494
2.821ThrArg: 2.821 ± 0.387
3.153ThrSer: 3.153 ± 0.405
4.149ThrThr: 4.149 ± 0.534
4.066ThrVal: 4.066 ± 0.678
1.245ThrTrp: 1.245 ± 0.336
1.66ThrTyr: 1.66 ± 0.329
0.0ThrXaa: 0.0 ± 0.0
Val
6.39ValAla: 6.39 ± 0.813
0.83ValCys: 0.83 ± 0.228
3.485ValAsp: 3.485 ± 0.49
3.817ValGlu: 3.817 ± 0.477
1.909ValPhe: 1.909 ± 0.426
4.647ValGly: 4.647 ± 0.54
0.83ValHis: 0.83 ± 0.274
5.228ValIle: 5.228 ± 0.689
3.485ValLys: 3.485 ± 0.616
4.979ValLeu: 4.979 ± 0.785
1.079ValMet: 1.079 ± 0.292
3.07ValAsn: 3.07 ± 0.416
3.817ValPro: 3.817 ± 0.625
2.489ValGln: 2.489 ± 0.389
3.402ValArg: 3.402 ± 0.491
4.398ValSer: 4.398 ± 0.609
5.643ValThr: 5.643 ± 0.925
4.398ValVal: 4.398 ± 0.654
0.913ValTrp: 0.913 ± 0.238
1.245ValTyr: 1.245 ± 0.319
0.0ValXaa: 0.0 ± 0.0
Trp
0.996TrpAla: 0.996 ± 0.302
0.166TrpCys: 0.166 ± 0.11
0.498TrpAsp: 0.498 ± 0.18
0.996TrpGlu: 0.996 ± 0.312
0.664TrpPhe: 0.664 ± 0.225
0.747TrpGly: 0.747 ± 0.254
0.498TrpHis: 0.498 ± 0.226
0.664TrpIle: 0.664 ± 0.253
1.328TrpLys: 1.328 ± 0.303
1.245TrpLeu: 1.245 ± 0.26
0.166TrpMet: 0.166 ± 0.114
0.415TrpAsn: 0.415 ± 0.172
0.747TrpPro: 0.747 ± 0.262
0.747TrpGln: 0.747 ± 0.271
1.577TrpArg: 1.577 ± 0.325
0.913TrpSer: 0.913 ± 0.289
0.581TrpThr: 0.581 ± 0.215
1.328TrpVal: 1.328 ± 0.357
0.498TrpTrp: 0.498 ± 0.191
0.415TrpTyr: 0.415 ± 0.173
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.075TyrAla: 2.075 ± 0.505
0.498TyrCys: 0.498 ± 0.206
1.66TyrAsp: 1.66 ± 0.432
1.66TyrGlu: 1.66 ± 0.342
1.411TyrPhe: 1.411 ± 0.313
1.909TyrGly: 1.909 ± 0.434
0.498TyrHis: 0.498 ± 0.221
1.494TyrIle: 1.494 ± 0.361
1.245TyrLys: 1.245 ± 0.324
3.236TyrLeu: 3.236 ± 0.473
0.498TyrMet: 0.498 ± 0.169
0.996TyrAsn: 0.996 ± 0.263
1.577TyrPro: 1.577 ± 0.373
1.494TyrGln: 1.494 ± 0.307
2.075TyrArg: 2.075 ± 0.488
2.157TyrSer: 2.157 ± 0.426
2.406TyrThr: 2.406 ± 0.45
1.826TyrVal: 1.826 ± 0.371
0.498TyrTrp: 0.498 ± 0.199
0.996TyrTyr: 0.996 ± 0.316
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 61 proteins (12052 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski