Amino acid dipepetide frequency for Bacillus phage 000TH010

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.574AlaAla: 8.574 ± 1.707
0.399AlaCys: 0.399 ± 0.18
4.121AlaAsp: 4.121 ± 0.55
5.982AlaGlu: 5.982 ± 0.796
3.257AlaPhe: 3.257 ± 0.597
5.649AlaGly: 5.649 ± 0.591
1.396AlaHis: 1.396 ± 0.257
5.118AlaIle: 5.118 ± 0.602
5.184AlaLys: 5.184 ± 0.581
5.649AlaLeu: 5.649 ± 0.662
2.193AlaMet: 2.193 ± 0.396
3.19AlaAsn: 3.19 ± 0.364
1.662AlaPro: 1.662 ± 0.406
2.592AlaGln: 2.592 ± 0.423
3.456AlaArg: 3.456 ± 0.563
4.054AlaSer: 4.054 ± 0.662
3.589AlaThr: 3.589 ± 0.641
5.184AlaVal: 5.184 ± 0.717
1.462AlaTrp: 1.462 ± 0.286
2.26AlaTyr: 2.26 ± 0.34
0.0AlaXaa: 0.0 ± 0.0
Cys
0.399CysAla: 0.399 ± 0.166
0.266CysCys: 0.266 ± 0.163
0.731CysAsp: 0.731 ± 0.25
0.665CysGlu: 0.665 ± 0.233
0.133CysPhe: 0.133 ± 0.078
0.864CysGly: 0.864 ± 0.335
0.266CysHis: 0.266 ± 0.132
0.532CysIle: 0.532 ± 0.207
0.93CysLys: 0.93 ± 0.28
0.598CysLeu: 0.598 ± 0.253
0.066CysMet: 0.066 ± 0.066
0.598CysAsn: 0.598 ± 0.216
0.266CysPro: 0.266 ± 0.137
0.266CysGln: 0.266 ± 0.126
0.465CysArg: 0.465 ± 0.19
0.266CysSer: 0.266 ± 0.12
0.266CysThr: 0.266 ± 0.149
0.731CysVal: 0.731 ± 0.188
0.0CysTrp: 0.0 ± 0.0
0.199CysTyr: 0.199 ± 0.121
0.0CysXaa: 0.0 ± 0.0
Asp
3.921AspAla: 3.921 ± 0.53
0.93AspCys: 0.93 ± 0.244
3.921AspAsp: 3.921 ± 0.648
4.918AspGlu: 4.918 ± 0.657
2.924AspPhe: 2.924 ± 0.392
5.118AspGly: 5.118 ± 0.636
1.063AspHis: 1.063 ± 0.27
4.918AspIle: 4.918 ± 0.697
4.586AspLys: 4.586 ± 0.532
4.586AspLeu: 4.586 ± 0.521
1.462AspMet: 1.462 ± 0.324
3.057AspAsn: 3.057 ± 0.617
2.393AspPro: 2.393 ± 0.472
1.263AspGln: 1.263 ± 0.319
2.659AspArg: 2.659 ± 0.385
2.659AspSer: 2.659 ± 0.455
2.924AspThr: 2.924 ± 0.412
3.589AspVal: 3.589 ± 0.561
0.731AspTrp: 0.731 ± 0.238
2.193AspTyr: 2.193 ± 0.394
0.0AspXaa: 0.0 ± 0.0
Glu
4.519GluAla: 4.519 ± 0.583
0.864GluCys: 0.864 ± 0.258
3.124GluAsp: 3.124 ± 0.461
9.504GluGlu: 9.504 ± 1.143
3.057GluPhe: 3.057 ± 0.522
5.118GluGly: 5.118 ± 0.47
1.462GluHis: 1.462 ± 0.285
4.852GluIle: 4.852 ± 0.631
6.846GluLys: 6.846 ± 0.865
8.507GluLeu: 8.507 ± 0.881
3.988GluMet: 3.988 ± 0.536
5.383GluAsn: 5.383 ± 0.724
2.991GluPro: 2.991 ± 0.456
3.323GluGln: 3.323 ± 0.468
3.921GluArg: 3.921 ± 0.539
4.121GluSer: 4.121 ± 0.594
3.921GluThr: 3.921 ± 0.534
4.719GluVal: 4.719 ± 0.631
1.196GluTrp: 1.196 ± 0.348
2.725GluTyr: 2.725 ± 0.426
0.0GluXaa: 0.0 ± 0.0
Phe
3.257PheAla: 3.257 ± 0.418
0.399PheCys: 0.399 ± 0.149
2.459PheAsp: 2.459 ± 0.434
2.924PheGlu: 2.924 ± 0.469
1.794PhePhe: 1.794 ± 0.31
2.791PheGly: 2.791 ± 0.428
0.598PheHis: 0.598 ± 0.183
2.659PheIle: 2.659 ± 0.406
4.054PheLys: 4.054 ± 0.422
2.592PheLeu: 2.592 ± 0.496
0.997PheMet: 0.997 ± 0.253
1.794PheAsn: 1.794 ± 0.322
0.93PhePro: 0.93 ± 0.282
1.595PheGln: 1.595 ± 0.332
1.196PheArg: 1.196 ± 0.241
1.794PheSer: 1.794 ± 0.365
2.791PheThr: 2.791 ± 0.367
2.06PheVal: 2.06 ± 0.326
0.532PheTrp: 0.532 ± 0.184
1.728PheTyr: 1.728 ± 0.295
0.0PheXaa: 0.0 ± 0.0
Gly
5.383GlyAla: 5.383 ± 0.749
0.93GlyCys: 0.93 ± 0.273
3.788GlyAsp: 3.788 ± 0.546
5.849GlyGlu: 5.849 ± 0.493
2.459GlyPhe: 2.459 ± 0.395
6.912GlyGly: 6.912 ± 0.701
1.462GlyHis: 1.462 ± 0.345
4.586GlyIle: 4.586 ± 0.622
6.713GlyLys: 6.713 ± 0.792
3.589GlyLeu: 3.589 ± 0.535
1.662GlyMet: 1.662 ± 0.321
3.39GlyAsn: 3.39 ± 0.444
1.861GlyPro: 1.861 ± 0.456
2.725GlyGln: 2.725 ± 0.449
3.523GlyArg: 3.523 ± 0.502
4.187GlySer: 4.187 ± 0.631
4.187GlyThr: 4.187 ± 0.541
4.187GlyVal: 4.187 ± 0.539
0.532GlyTrp: 0.532 ± 0.183
3.921GlyTyr: 3.921 ± 0.588
0.0GlyXaa: 0.0 ± 0.0
His
1.396HisAla: 1.396 ± 0.352
0.332HisCys: 0.332 ± 0.187
1.13HisAsp: 1.13 ± 0.303
1.329HisGlu: 1.329 ± 0.291
0.997HisPhe: 0.997 ± 0.259
1.329HisGly: 1.329 ± 0.373
0.399HisHis: 0.399 ± 0.203
1.13HisIle: 1.13 ± 0.234
1.595HisLys: 1.595 ± 0.299
1.063HisLeu: 1.063 ± 0.246
0.399HisMet: 0.399 ± 0.175
1.263HisAsn: 1.263 ± 0.324
0.731HisPro: 0.731 ± 0.198
0.598HisGln: 0.598 ± 0.209
1.13HisArg: 1.13 ± 0.322
1.196HisSer: 1.196 ± 0.318
0.665HisThr: 0.665 ± 0.17
1.063HisVal: 1.063 ± 0.256
0.465HisTrp: 0.465 ± 0.232
0.665HisTyr: 0.665 ± 0.209
0.0HisXaa: 0.0 ± 0.0
Ile
4.652IleAla: 4.652 ± 0.652
0.332IleCys: 0.332 ± 0.135
4.32IleAsp: 4.32 ± 0.608
5.583IleGlu: 5.583 ± 0.646
2.193IlePhe: 2.193 ± 0.34
3.39IleGly: 3.39 ± 0.439
1.263IleHis: 1.263 ± 0.269
4.387IleIle: 4.387 ± 0.545
6.38IleLys: 6.38 ± 0.692
4.652IleLeu: 4.652 ± 0.579
2.659IleMet: 2.659 ± 0.432
4.121IleAsn: 4.121 ± 0.539
2.193IlePro: 2.193 ± 0.463
2.193IleGln: 2.193 ± 0.313
2.858IleArg: 2.858 ± 0.387
3.456IleSer: 3.456 ± 0.478
4.32IleThr: 4.32 ± 0.568
4.187IleVal: 4.187 ± 0.456
0.93IleTrp: 0.93 ± 0.291
2.725IleTyr: 2.725 ± 0.447
0.0IleXaa: 0.0 ± 0.0
Lys
7.178LysAla: 7.178 ± 0.759
0.266LysCys: 0.266 ± 0.122
4.32LysAsp: 4.32 ± 0.588
7.51LysGlu: 7.51 ± 0.918
2.06LysPhe: 2.06 ± 0.386
5.184LysGly: 5.184 ± 0.544
1.794LysHis: 1.794 ± 0.408
5.982LysIle: 5.982 ± 0.607
8.308LysLys: 8.308 ± 0.963
5.782LysLeu: 5.782 ± 0.658
3.323LysMet: 3.323 ± 0.614
5.849LysAsn: 5.849 ± 0.645
2.858LysPro: 2.858 ± 0.48
2.526LysGln: 2.526 ± 0.399
4.985LysArg: 4.985 ± 0.794
3.523LysSer: 3.523 ± 0.626
4.719LysThr: 4.719 ± 0.556
5.383LysVal: 5.383 ± 0.509
0.864LysTrp: 0.864 ± 0.256
2.326LysTyr: 2.326 ± 0.456
0.0LysXaa: 0.0 ± 0.0
Leu
5.118LeuAla: 5.118 ± 0.569
0.731LeuCys: 0.731 ± 0.279
5.051LeuAsp: 5.051 ± 0.495
7.178LeuGlu: 7.178 ± 0.9
2.193LeuPhe: 2.193 ± 0.359
5.184LeuGly: 5.184 ± 0.505
1.196LeuHis: 1.196 ± 0.216
4.121LeuIle: 4.121 ± 0.597
6.646LeuLys: 6.646 ± 0.824
5.383LeuLeu: 5.383 ± 0.662
1.927LeuMet: 1.927 ± 0.301
3.855LeuAsn: 3.855 ± 0.476
2.127LeuPro: 2.127 ± 0.449
3.257LeuGln: 3.257 ± 0.618
3.855LeuArg: 3.855 ± 0.548
4.387LeuSer: 4.387 ± 0.562
4.785LeuThr: 4.785 ± 0.641
3.39LeuVal: 3.39 ± 0.55
0.997LeuTrp: 0.997 ± 0.268
2.858LeuTyr: 2.858 ± 0.378
0.0LeuXaa: 0.0 ± 0.0
Met
2.791MetAla: 2.791 ± 0.535
0.332MetCys: 0.332 ± 0.151
1.196MetAsp: 1.196 ± 0.262
1.861MetGlu: 1.861 ± 0.454
0.864MetPhe: 0.864 ± 0.247
1.595MetGly: 1.595 ± 0.324
0.465MetHis: 0.465 ± 0.182
1.861MetIle: 1.861 ± 0.355
2.725MetLys: 2.725 ± 0.494
1.529MetLeu: 1.529 ± 0.335
0.997MetMet: 0.997 ± 0.294
2.526MetAsn: 2.526 ± 0.444
1.529MetPro: 1.529 ± 0.317
0.997MetGln: 0.997 ± 0.273
1.861MetArg: 1.861 ± 0.314
2.06MetSer: 2.06 ± 0.365
1.861MetThr: 1.861 ± 0.321
1.13MetVal: 1.13 ± 0.319
0.532MetTrp: 0.532 ± 0.216
0.997MetTyr: 0.997 ± 0.246
0.0MetXaa: 0.0 ± 0.0
Asn
4.32AsnAla: 4.32 ± 0.741
0.465AsnCys: 0.465 ± 0.159
3.589AsnAsp: 3.589 ± 0.637
5.051AsnGlu: 5.051 ± 0.777
2.26AsnPhe: 2.26 ± 0.356
4.652AsnGly: 4.652 ± 0.535
1.263AsnHis: 1.263 ± 0.299
3.589AsnIle: 3.589 ± 0.588
4.586AsnLys: 4.586 ± 0.571
3.921AsnLeu: 3.921 ± 0.566
1.462AsnMet: 1.462 ± 0.221
2.393AsnAsn: 2.393 ± 0.508
2.326AsnPro: 2.326 ± 0.415
0.997AsnGln: 0.997 ± 0.237
1.794AsnArg: 1.794 ± 0.265
3.323AsnSer: 3.323 ± 0.418
2.393AsnThr: 2.393 ± 0.423
2.991AsnVal: 2.991 ± 0.532
0.598AsnTrp: 0.598 ± 0.234
1.994AsnTyr: 1.994 ± 0.355
0.0AsnXaa: 0.0 ± 0.0
Pro
2.393ProAla: 2.393 ± 0.387
0.266ProCys: 0.266 ± 0.151
2.459ProAsp: 2.459 ± 0.39
2.991ProGlu: 2.991 ± 0.427
1.329ProPhe: 1.329 ± 0.286
2.791ProGly: 2.791 ± 0.498
0.798ProHis: 0.798 ± 0.221
1.861ProIle: 1.861 ± 0.308
2.193ProLys: 2.193 ± 0.386
2.858ProLeu: 2.858 ± 0.364
0.598ProMet: 0.598 ± 0.188
1.595ProAsn: 1.595 ± 0.309
1.529ProPro: 1.529 ± 0.32
1.063ProGln: 1.063 ± 0.249
0.798ProArg: 0.798 ± 0.262
1.728ProSer: 1.728 ± 0.297
1.263ProThr: 1.263 ± 0.33
3.19ProVal: 3.19 ± 0.565
0.399ProTrp: 0.399 ± 0.181
0.997ProTyr: 0.997 ± 0.271
0.0ProXaa: 0.0 ± 0.0
Gln
2.991GlnAla: 2.991 ± 0.402
0.0GlnCys: 0.0 ± 0.0
2.06GlnAsp: 2.06 ± 0.436
2.526GlnGlu: 2.526 ± 0.441
0.93GlnPhe: 0.93 ± 0.33
1.994GlnGly: 1.994 ± 0.352
0.665GlnHis: 0.665 ± 0.193
2.06GlnIle: 2.06 ± 0.404
3.523GlnLys: 3.523 ± 0.523
3.057GlnLeu: 3.057 ± 0.48
0.93GlnMet: 0.93 ± 0.268
1.462GlnAsn: 1.462 ± 0.305
1.063GlnPro: 1.063 ± 0.299
1.529GlnGln: 1.529 ± 0.284
1.329GlnArg: 1.329 ± 0.259
1.728GlnSer: 1.728 ± 0.275
1.662GlnThr: 1.662 ± 0.301
2.127GlnVal: 2.127 ± 0.362
0.266GlnTrp: 0.266 ± 0.126
1.329GlnTyr: 1.329 ± 0.285
0.0GlnXaa: 0.0 ± 0.0
Arg
2.06ArgAla: 2.06 ± 0.41
0.532ArgCys: 0.532 ± 0.214
2.659ArgAsp: 2.659 ± 0.376
2.991ArgGlu: 2.991 ± 0.361
2.26ArgPhe: 2.26 ± 0.454
3.589ArgGly: 3.589 ± 0.483
1.13ArgHis: 1.13 ± 0.274
3.722ArgIle: 3.722 ± 0.485
4.054ArgLys: 4.054 ± 0.627
4.32ArgLeu: 4.32 ± 0.613
1.396ArgMet: 1.396 ± 0.307
2.06ArgAsn: 2.06 ± 0.361
1.063ArgPro: 1.063 ± 0.25
1.595ArgGln: 1.595 ± 0.381
2.659ArgArg: 2.659 ± 0.476
1.728ArgSer: 1.728 ± 0.263
3.19ArgThr: 3.19 ± 0.5
3.39ArgVal: 3.39 ± 0.5
0.665ArgTrp: 0.665 ± 0.272
1.794ArgTyr: 1.794 ± 0.384
0.0ArgXaa: 0.0 ± 0.0
Ser
4.453SerAla: 4.453 ± 0.907
0.133SerCys: 0.133 ± 0.11
4.254SerAsp: 4.254 ± 0.633
3.855SerGlu: 3.855 ± 0.605
2.791SerPhe: 2.791 ± 0.413
4.453SerGly: 4.453 ± 0.542
0.665SerHis: 0.665 ± 0.198
3.39SerIle: 3.39 ± 0.652
3.988SerLys: 3.988 ± 0.522
3.921SerLeu: 3.921 ± 0.595
1.462SerMet: 1.462 ± 0.335
1.927SerAsn: 1.927 ± 0.296
1.529SerPro: 1.529 ± 0.255
1.396SerGln: 1.396 ± 0.333
2.791SerArg: 2.791 ± 0.48
2.659SerSer: 2.659 ± 0.534
3.456SerThr: 3.456 ± 0.5
3.39SerVal: 3.39 ± 0.457
0.798SerTrp: 0.798 ± 0.217
2.06SerTyr: 2.06 ± 0.43
0.0SerXaa: 0.0 ± 0.0
Thr
5.051ThrAla: 5.051 ± 0.695
0.332ThrCys: 0.332 ± 0.163
3.124ThrAsp: 3.124 ± 0.576
4.519ThrGlu: 4.519 ± 0.555
2.526ThrPhe: 2.526 ± 0.39
3.988ThrGly: 3.988 ± 0.524
0.864ThrHis: 0.864 ± 0.236
3.523ThrIle: 3.523 ± 0.483
3.722ThrLys: 3.722 ± 0.484
4.719ThrLeu: 4.719 ± 0.57
1.396ThrMet: 1.396 ± 0.32
2.326ThrAsn: 2.326 ± 0.358
2.526ThrPro: 2.526 ± 0.41
1.263ThrGln: 1.263 ± 0.342
2.526ThrArg: 2.526 ± 0.529
3.057ThrSer: 3.057 ± 0.557
2.725ThrThr: 2.725 ± 0.489
3.988ThrVal: 3.988 ± 0.503
0.598ThrTrp: 0.598 ± 0.207
2.26ThrTyr: 2.26 ± 0.317
0.0ThrXaa: 0.0 ± 0.0
Val
3.523ValAla: 3.523 ± 0.471
0.465ValCys: 0.465 ± 0.172
3.722ValAsp: 3.722 ± 0.535
4.32ValGlu: 4.32 ± 0.609
2.725ValPhe: 2.725 ± 0.434
3.456ValGly: 3.456 ± 0.453
0.93ValHis: 0.93 ± 0.234
5.118ValIle: 5.118 ± 0.509
5.251ValLys: 5.251 ± 0.64
3.921ValLeu: 3.921 ± 0.535
0.93ValMet: 0.93 ± 0.319
4.054ValAsn: 4.054 ± 0.626
2.193ValPro: 2.193 ± 0.412
2.526ValGln: 2.526 ± 0.395
2.459ValArg: 2.459 ± 0.407
4.254ValSer: 4.254 ± 0.59
4.121ValThr: 4.121 ± 0.621
3.124ValVal: 3.124 ± 0.433
1.263ValTrp: 1.263 ± 0.243
2.659ValTyr: 2.659 ± 0.409
0.0ValXaa: 0.0 ± 0.0
Trp
0.665TrpAla: 0.665 ± 0.214
0.266TrpCys: 0.266 ± 0.164
0.731TrpAsp: 0.731 ± 0.239
0.798TrpGlu: 0.798 ± 0.238
0.598TrpPhe: 0.598 ± 0.268
1.063TrpGly: 1.063 ± 0.253
0.598TrpHis: 0.598 ± 0.21
0.93TrpIle: 0.93 ± 0.246
0.798TrpLys: 0.798 ± 0.206
0.665TrpLeu: 0.665 ± 0.22
0.399TrpMet: 0.399 ± 0.17
1.196TrpAsn: 1.196 ± 0.423
0.199TrpPro: 0.199 ± 0.125
0.532TrpGln: 0.532 ± 0.176
0.93TrpArg: 0.93 ± 0.257
0.93TrpSer: 0.93 ± 0.202
0.465TrpThr: 0.465 ± 0.167
0.997TrpVal: 0.997 ± 0.256
0.066TrpTrp: 0.066 ± 0.069
0.465TrpTyr: 0.465 ± 0.172
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.326TyrAla: 2.326 ± 0.399
0.266TyrCys: 0.266 ± 0.15
3.057TyrAsp: 3.057 ± 0.437
3.855TyrGlu: 3.855 ± 0.579
1.595TyrPhe: 1.595 ± 0.288
2.725TyrGly: 2.725 ± 0.347
0.598TyrHis: 0.598 ± 0.18
2.459TyrIle: 2.459 ± 0.424
2.659TyrLys: 2.659 ± 0.525
2.924TyrLeu: 2.924 ± 0.404
1.396TyrMet: 1.396 ± 0.299
1.927TyrAsn: 1.927 ± 0.333
1.13TyrPro: 1.13 ± 0.352
0.997TyrGln: 0.997 ± 0.283
1.662TyrArg: 1.662 ± 0.332
2.26TyrSer: 2.26 ± 0.474
1.728TyrThr: 1.728 ± 0.363
2.193TyrVal: 2.193 ± 0.382
0.399TyrTrp: 0.399 ± 0.174
1.396TyrTyr: 1.396 ± 0.335
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 90 proteins (15047 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski