Amino acid dipepetide frequency for Escherichia phage SSL-2009a

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.706AlaAla: 12.706 ± 1.106
0.913AlaCys: 0.913 ± 0.289
6.739AlaAsp: 6.739 ± 0.664
6.388AlaGlu: 6.388 ± 0.748
4.001AlaPhe: 4.001 ± 0.565
8.424AlaGly: 8.424 ± 0.82
1.685AlaHis: 1.685 ± 0.356
6.458AlaIle: 6.458 ± 0.588
7.16AlaLys: 7.16 ± 1.122
7.933AlaLeu: 7.933 ± 0.792
3.019AlaMet: 3.019 ± 0.522
3.721AlaAsn: 3.721 ± 0.689
2.808AlaPro: 2.808 ± 0.523
4.844AlaGln: 4.844 ± 0.741
6.388AlaArg: 6.388 ± 0.625
5.405AlaSer: 5.405 ± 0.559
5.967AlaThr: 5.967 ± 0.709
7.652AlaVal: 7.652 ± 0.747
1.755AlaTrp: 1.755 ± 0.322
2.668AlaTyr: 2.668 ± 0.364
0.0AlaXaa: 0.0 ± 0.0
Cys
1.264CysAla: 1.264 ± 0.325
0.281CysCys: 0.281 ± 0.146
0.772CysAsp: 0.772 ± 0.231
0.772CysGlu: 0.772 ± 0.235
0.421CysPhe: 0.421 ± 0.181
0.983CysGly: 0.983 ± 0.314
0.211CysHis: 0.211 ± 0.128
0.562CysIle: 0.562 ± 0.166
0.913CysLys: 0.913 ± 0.206
0.913CysLeu: 0.913 ± 0.297
0.211CysMet: 0.211 ± 0.094
0.632CysAsn: 0.632 ± 0.233
0.281CysPro: 0.281 ± 0.124
0.491CysGln: 0.491 ± 0.188
0.632CysArg: 0.632 ± 0.183
0.491CysSer: 0.491 ± 0.166
0.491CysThr: 0.491 ± 0.193
0.702CysVal: 0.702 ± 0.19
0.351CysTrp: 0.351 ± 0.125
0.491CysTyr: 0.491 ± 0.205
0.0CysXaa: 0.0 ± 0.0
Asp
7.301AspAla: 7.301 ± 0.683
0.842AspCys: 0.842 ± 0.228
4.563AspAsp: 4.563 ± 1.195
5.195AspGlu: 5.195 ± 1.203
2.808AspPhe: 2.808 ± 0.45
5.897AspGly: 5.897 ± 0.762
1.193AspHis: 1.193 ± 0.294
2.878AspIle: 2.878 ± 0.482
2.668AspLys: 2.668 ± 0.558
5.265AspLeu: 5.265 ± 0.631
1.825AspMet: 1.825 ± 0.338
2.738AspAsn: 2.738 ± 0.511
3.159AspPro: 3.159 ± 0.535
1.895AspGln: 1.895 ± 0.342
2.808AspArg: 2.808 ± 0.402
2.317AspSer: 2.317 ± 0.384
3.791AspThr: 3.791 ± 0.547
4.563AspVal: 4.563 ± 0.509
1.053AspTrp: 1.053 ± 0.285
1.264AspTyr: 1.264 ± 0.267
0.0AspXaa: 0.0 ± 0.0
Glu
5.686GluAla: 5.686 ± 0.714
0.772GluCys: 0.772 ± 0.243
4.563GluAsp: 4.563 ± 0.806
3.65GluGlu: 3.65 ± 0.558
2.387GluPhe: 2.387 ± 0.373
3.159GluGly: 3.159 ± 0.554
1.404GluHis: 1.404 ± 0.312
4.914GluIle: 4.914 ± 0.53
3.44GluLys: 3.44 ± 0.571
4.352GluLeu: 4.352 ± 0.623
1.895GluMet: 1.895 ± 0.368
2.387GluAsn: 2.387 ± 0.402
1.334GluPro: 1.334 ± 0.506
2.597GluGln: 2.597 ± 0.431
3.931GluArg: 3.931 ± 0.593
3.159GluSer: 3.159 ± 0.518
2.597GluThr: 2.597 ± 0.438
4.072GluVal: 4.072 ± 0.488
0.983GluTrp: 0.983 ± 0.321
3.089GluTyr: 3.089 ± 0.483
0.0GluXaa: 0.0 ± 0.0
Phe
3.51PheAla: 3.51 ± 0.476
0.351PheCys: 0.351 ± 0.136
2.878PheAsp: 2.878 ± 0.511
2.176PheGlu: 2.176 ± 0.335
1.404PhePhe: 1.404 ± 0.332
3.51PheGly: 3.51 ± 0.446
0.842PheHis: 0.842 ± 0.258
1.966PheIle: 1.966 ± 0.422
2.036PheLys: 2.036 ± 0.42
2.036PheLeu: 2.036 ± 0.428
0.702PheMet: 0.702 ± 0.245
1.615PheAsn: 1.615 ± 0.482
1.474PhePro: 1.474 ± 0.381
1.474PheGln: 1.474 ± 0.298
2.738PheArg: 2.738 ± 0.508
1.825PheSer: 1.825 ± 0.342
2.387PheThr: 2.387 ± 0.353
2.738PheVal: 2.738 ± 0.628
0.562PheTrp: 0.562 ± 0.217
0.913PheTyr: 0.913 ± 0.259
0.0PheXaa: 0.0 ± 0.0
Gly
6.529GlyAla: 6.529 ± 0.681
1.264GlyCys: 1.264 ± 0.337
4.844GlyAsp: 4.844 ± 0.701
4.633GlyGlu: 4.633 ± 0.557
3.51GlyPhe: 3.51 ± 0.459
6.599GlyGly: 6.599 ± 0.925
1.334GlyHis: 1.334 ± 0.421
4.282GlyIle: 4.282 ± 0.476
6.529GlyLys: 6.529 ± 0.97
6.318GlyLeu: 6.318 ± 0.645
2.457GlyMet: 2.457 ± 0.486
4.142GlyAsn: 4.142 ± 0.624
1.895GlyPro: 1.895 ± 0.35
2.106GlyGln: 2.106 ± 0.436
4.423GlyArg: 4.423 ± 0.581
3.791GlySer: 3.791 ± 0.538
3.791GlyThr: 3.791 ± 0.594
6.178GlyVal: 6.178 ± 0.645
0.983GlyTrp: 0.983 ± 0.26
2.036GlyTyr: 2.036 ± 0.38
0.0GlyXaa: 0.0 ± 0.0
His
1.404HisAla: 1.404 ± 0.284
0.281HisCys: 0.281 ± 0.16
1.474HisAsp: 1.474 ± 0.372
0.842HisGlu: 0.842 ± 0.219
0.632HisPhe: 0.632 ± 0.207
1.334HisGly: 1.334 ± 0.416
0.562HisHis: 0.562 ± 0.236
1.123HisIle: 1.123 ± 0.252
1.334HisLys: 1.334 ± 0.259
1.123HisLeu: 1.123 ± 0.294
0.351HisMet: 0.351 ± 0.144
0.491HisAsn: 0.491 ± 0.154
1.264HisPro: 1.264 ± 0.282
0.913HisGln: 0.913 ± 0.288
0.702HisArg: 0.702 ± 0.214
0.772HisSer: 0.772 ± 0.219
1.334HisThr: 1.334 ± 0.348
0.983HisVal: 0.983 ± 0.375
0.491HisTrp: 0.491 ± 0.168
0.913HisTyr: 0.913 ± 0.242
0.0HisXaa: 0.0 ± 0.0
Ile
5.616IleAla: 5.616 ± 0.527
0.702IleCys: 0.702 ± 0.22
3.931IleAsp: 3.931 ± 0.479
3.299IleGlu: 3.299 ± 0.444
1.615IlePhe: 1.615 ± 0.361
3.58IleGly: 3.58 ± 0.572
0.772IleHis: 0.772 ± 0.218
2.668IleIle: 2.668 ± 0.435
4.072IleLys: 4.072 ± 0.712
2.808IleLeu: 2.808 ± 0.388
1.685IleMet: 1.685 ± 0.283
3.299IleAsn: 3.299 ± 0.474
2.387IlePro: 2.387 ± 0.463
1.615IleGln: 1.615 ± 0.437
3.159IleArg: 3.159 ± 0.467
2.176IleSer: 2.176 ± 0.42
4.563IleThr: 4.563 ± 0.707
3.299IleVal: 3.299 ± 0.455
1.053IleTrp: 1.053 ± 0.328
1.685IleTyr: 1.685 ± 0.386
0.0IleXaa: 0.0 ± 0.0
Lys
7.231LysAla: 7.231 ± 1.245
0.772LysCys: 0.772 ± 0.245
2.668LysAsp: 2.668 ± 0.423
3.51LysGlu: 3.51 ± 0.601
1.755LysPhe: 1.755 ± 0.281
4.423LysGly: 4.423 ± 0.467
1.404LysHis: 1.404 ± 0.356
2.457LysIle: 2.457 ± 0.449
3.299LysLys: 3.299 ± 0.663
4.703LysLeu: 4.703 ± 0.658
2.457LysMet: 2.457 ± 0.404
2.036LysAsn: 2.036 ± 0.363
2.948LysPro: 2.948 ± 0.58
3.159LysGln: 3.159 ± 0.429
3.791LysArg: 3.791 ± 0.646
3.299LysSer: 3.299 ± 0.712
4.423LysThr: 4.423 ± 0.516
4.352LysVal: 4.352 ± 0.492
1.123LysTrp: 1.123 ± 0.258
1.404LysTyr: 1.404 ± 0.333
0.0LysXaa: 0.0 ± 0.0
Leu
6.599LeuAla: 6.599 ± 0.745
0.983LeuCys: 0.983 ± 0.299
4.282LeuAsp: 4.282 ± 0.574
3.229LeuGlu: 3.229 ± 0.474
3.159LeuPhe: 3.159 ± 0.612
5.476LeuGly: 5.476 ± 0.591
1.404LeuHis: 1.404 ± 0.3
4.212LeuIle: 4.212 ± 0.507
4.212LeuLys: 4.212 ± 0.541
5.054LeuLeu: 5.054 ± 0.536
2.106LeuMet: 2.106 ± 0.402
3.37LeuAsn: 3.37 ± 0.449
3.37LeuPro: 3.37 ± 0.532
2.948LeuGln: 2.948 ± 0.396
5.054LeuArg: 5.054 ± 0.611
4.352LeuSer: 4.352 ± 0.607
5.827LeuThr: 5.827 ± 0.608
5.405LeuVal: 5.405 ± 0.792
0.562LeuTrp: 0.562 ± 0.178
2.176LeuTyr: 2.176 ± 0.371
0.0LeuXaa: 0.0 ± 0.0
Met
3.299MetAla: 3.299 ± 0.55
0.351MetCys: 0.351 ± 0.142
1.404MetAsp: 1.404 ± 0.408
1.474MetGlu: 1.474 ± 0.294
0.913MetPhe: 0.913 ± 0.205
1.685MetGly: 1.685 ± 0.393
0.491MetHis: 0.491 ± 0.139
1.966MetIle: 1.966 ± 0.462
1.966MetLys: 1.966 ± 0.328
2.176MetLeu: 2.176 ± 0.425
0.491MetMet: 0.491 ± 0.212
1.615MetAsn: 1.615 ± 0.418
0.772MetPro: 0.772 ± 0.227
1.404MetGln: 1.404 ± 0.293
1.474MetArg: 1.474 ± 0.348
1.404MetSer: 1.404 ± 0.296
2.036MetThr: 2.036 ± 0.362
1.755MetVal: 1.755 ± 0.4
0.491MetTrp: 0.491 ± 0.171
0.983MetTyr: 0.983 ± 0.239
0.0MetXaa: 0.0 ± 0.0
Asn
4.563AsnAla: 4.563 ± 0.717
0.211AsnCys: 0.211 ± 0.14
2.387AsnAsp: 2.387 ± 0.349
2.527AsnGlu: 2.527 ± 0.555
1.193AsnPhe: 1.193 ± 0.293
4.423AsnGly: 4.423 ± 0.548
0.983AsnHis: 0.983 ± 0.279
1.685AsnIle: 1.685 ± 0.308
1.966AsnLys: 1.966 ± 0.361
2.738AsnLeu: 2.738 ± 0.359
1.404AsnMet: 1.404 ± 0.286
1.755AsnAsn: 1.755 ± 0.444
2.106AsnPro: 2.106 ± 0.389
1.895AsnGln: 1.895 ± 0.359
2.246AsnArg: 2.246 ± 0.341
1.544AsnSer: 1.544 ± 0.341
2.738AsnThr: 2.738 ± 0.41
3.229AsnVal: 3.229 ± 0.453
0.842AsnTrp: 0.842 ± 0.228
1.755AsnTyr: 1.755 ± 0.42
0.0AsnXaa: 0.0 ± 0.0
Pro
4.493ProAla: 4.493 ± 0.605
0.281ProCys: 0.281 ± 0.125
3.299ProAsp: 3.299 ± 0.482
2.457ProGlu: 2.457 ± 0.591
1.053ProPhe: 1.053 ± 0.271
3.159ProGly: 3.159 ± 0.544
0.702ProHis: 0.702 ± 0.218
2.036ProIle: 2.036 ± 0.402
2.036ProLys: 2.036 ± 0.473
3.44ProLeu: 3.44 ± 0.576
0.351ProMet: 0.351 ± 0.148
1.123ProAsn: 1.123 ± 0.242
1.404ProPro: 1.404 ± 0.337
1.053ProGln: 1.053 ± 0.271
2.948ProArg: 2.948 ± 0.553
1.544ProSer: 1.544 ± 0.337
2.527ProThr: 2.527 ± 0.465
3.089ProVal: 3.089 ± 0.351
0.421ProTrp: 0.421 ± 0.188
1.193ProTyr: 1.193 ± 0.34
0.0ProXaa: 0.0 ± 0.0
Gln
4.142GlnAla: 4.142 ± 0.657
0.351GlnCys: 0.351 ± 0.171
1.544GlnAsp: 1.544 ± 0.34
1.544GlnGlu: 1.544 ± 0.385
1.334GlnPhe: 1.334 ± 0.321
2.317GlnGly: 2.317 ± 0.397
0.983GlnHis: 0.983 ± 0.217
3.65GlnIle: 3.65 ± 0.528
2.036GlnLys: 2.036 ± 0.412
3.44GlnLeu: 3.44 ± 0.492
1.474GlnMet: 1.474 ± 0.323
1.123GlnAsn: 1.123 ± 0.282
1.685GlnPro: 1.685 ± 0.301
1.544GlnGln: 1.544 ± 0.372
2.317GlnArg: 2.317 ± 0.43
1.615GlnSer: 1.615 ± 0.443
3.019GlnThr: 3.019 ± 0.446
2.668GlnVal: 2.668 ± 0.412
0.772GlnTrp: 0.772 ± 0.248
0.983GlnTyr: 0.983 ± 0.252
0.0GlnXaa: 0.0 ± 0.0
Arg
6.178ArgAla: 6.178 ± 0.79
0.772ArgCys: 0.772 ± 0.226
3.861ArgAsp: 3.861 ± 0.666
3.229ArgGlu: 3.229 ± 0.524
2.387ArgPhe: 2.387 ± 0.353
4.001ArgGly: 4.001 ± 0.526
1.404ArgHis: 1.404 ± 0.319
3.65ArgIle: 3.65 ± 0.606
4.352ArgLys: 4.352 ± 0.711
4.493ArgLeu: 4.493 ± 0.513
1.544ArgMet: 1.544 ± 0.365
2.597ArgAsn: 2.597 ± 0.411
2.668ArgPro: 2.668 ± 0.512
2.527ArgGln: 2.527 ± 0.411
4.703ArgArg: 4.703 ± 0.695
2.597ArgSer: 2.597 ± 0.477
2.668ArgThr: 2.668 ± 0.357
5.054ArgVal: 5.054 ± 0.599
0.842ArgTrp: 0.842 ± 0.243
1.615ArgTyr: 1.615 ± 0.343
0.0ArgXaa: 0.0 ± 0.0
Ser
4.423SerAla: 4.423 ± 0.62
0.211SerCys: 0.211 ± 0.122
2.948SerAsp: 2.948 ± 0.451
3.65SerGlu: 3.65 ± 0.563
1.895SerPhe: 1.895 ± 0.338
5.546SerGly: 5.546 ± 0.614
0.562SerHis: 0.562 ± 0.183
1.685SerIle: 1.685 ± 0.326
2.948SerLys: 2.948 ± 0.55
3.51SerLeu: 3.51 ± 0.473
1.264SerMet: 1.264 ± 0.286
1.966SerAsn: 1.966 ± 0.375
1.193SerPro: 1.193 ± 0.227
1.474SerGln: 1.474 ± 0.357
1.685SerArg: 1.685 ± 0.357
2.036SerSer: 2.036 ± 0.328
3.299SerThr: 3.299 ± 0.526
4.001SerVal: 4.001 ± 0.565
1.053SerTrp: 1.053 ± 0.306
1.334SerTyr: 1.334 ± 0.264
0.0SerXaa: 0.0 ± 0.0
Thr
7.511ThrAla: 7.511 ± 0.821
0.491ThrCys: 0.491 ± 0.179
3.58ThrAsp: 3.58 ± 0.665
3.65ThrGlu: 3.65 ± 0.58
2.948ThrPhe: 2.948 ± 0.535
5.265ThrGly: 5.265 ± 0.575
0.702ThrHis: 0.702 ± 0.234
2.387ThrIle: 2.387 ± 0.378
3.44ThrLys: 3.44 ± 0.581
5.616ThrLeu: 5.616 ± 0.637
1.685ThrMet: 1.685 ± 0.356
3.089ThrAsn: 3.089 ± 0.59
3.299ThrPro: 3.299 ± 0.451
2.387ThrGln: 2.387 ± 0.512
3.51ThrArg: 3.51 ± 0.474
3.019ThrSer: 3.019 ± 0.577
2.878ThrThr: 2.878 ± 0.543
4.282ThrVal: 4.282 ± 0.567
0.983ThrTrp: 0.983 ± 0.27
1.264ThrTyr: 1.264 ± 0.357
0.0ThrXaa: 0.0 ± 0.0
Val
8.354ValAla: 8.354 ± 0.621
0.983ValCys: 0.983 ± 0.236
5.054ValAsp: 5.054 ± 0.603
5.686ValGlu: 5.686 ± 0.656
2.387ValPhe: 2.387 ± 0.539
4.633ValGly: 4.633 ± 0.608
1.123ValHis: 1.123 ± 0.291
3.861ValIle: 3.861 ± 0.675
4.493ValLys: 4.493 ± 0.587
5.125ValLeu: 5.125 ± 0.606
1.825ValMet: 1.825 ± 0.402
3.019ValAsn: 3.019 ± 0.419
2.387ValPro: 2.387 ± 0.428
2.738ValGln: 2.738 ± 0.398
5.265ValArg: 5.265 ± 0.476
3.159ValSer: 3.159 ± 0.396
4.072ValThr: 4.072 ± 0.565
5.967ValVal: 5.967 ± 0.618
0.772ValTrp: 0.772 ± 0.255
1.825ValTyr: 1.825 ± 0.48
0.0ValXaa: 0.0 ± 0.0
Trp
2.106TrpAla: 2.106 ± 0.486
0.351TrpCys: 0.351 ± 0.118
0.983TrpAsp: 0.983 ± 0.24
0.913TrpGlu: 0.913 ± 0.249
0.562TrpPhe: 0.562 ± 0.225
0.772TrpGly: 0.772 ± 0.23
0.351TrpHis: 0.351 ± 0.145
0.491TrpIle: 0.491 ± 0.187
0.772TrpLys: 0.772 ± 0.212
1.053TrpLeu: 1.053 ± 0.254
0.562TrpMet: 0.562 ± 0.19
0.632TrpAsn: 0.632 ± 0.263
0.772TrpPro: 0.772 ± 0.267
0.421TrpGln: 0.421 ± 0.183
1.334TrpArg: 1.334 ± 0.251
0.632TrpSer: 0.632 ± 0.188
0.983TrpThr: 0.983 ± 0.251
1.544TrpVal: 1.544 ± 0.307
0.491TrpTrp: 0.491 ± 0.174
0.421TrpTyr: 0.421 ± 0.156
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.721TyrAla: 3.721 ± 0.486
0.702TyrCys: 0.702 ± 0.224
2.246TyrAsp: 2.246 ± 0.537
1.544TyrGlu: 1.544 ± 0.329
0.772TyrPhe: 0.772 ± 0.231
2.457TyrGly: 2.457 ± 0.477
0.211TyrHis: 0.211 ± 0.104
1.053TyrIle: 1.053 ± 0.271
1.615TyrLys: 1.615 ± 0.318
1.685TyrLeu: 1.685 ± 0.384
0.772TyrMet: 0.772 ± 0.197
0.842TyrAsn: 0.842 ± 0.265
1.404TyrPro: 1.404 ± 0.46
1.053TyrGln: 1.053 ± 0.314
2.036TyrArg: 2.036 ± 0.485
1.544TyrSer: 1.544 ± 0.322
2.597TyrThr: 2.597 ± 0.394
1.193TyrVal: 1.193 ± 0.255
0.562TyrTrp: 0.562 ± 0.232
0.842TyrTyr: 0.842 ± 0.241
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 67 proteins (14246 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski