Amino acid dipepetide frequency for Escherichia phage pro147

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.471AlaAla: 10.471 ± 2.157
0.691AlaCys: 0.691 ± 0.28
5.334AlaAsp: 5.334 ± 0.869
5.334AlaGlu: 5.334 ± 0.712
3.062AlaPhe: 3.062 ± 0.502
7.409AlaGly: 7.409 ± 1.077
1.778AlaHis: 1.778 ± 0.382
4.643AlaIle: 4.643 ± 0.55
5.433AlaLys: 5.433 ± 0.759
10.274AlaLeu: 10.274 ± 1.278
2.074AlaMet: 2.074 ± 0.423
2.173AlaAsn: 2.173 ± 0.393
4.149AlaPro: 4.149 ± 0.671
3.655AlaGln: 3.655 ± 0.654
4.445AlaArg: 4.445 ± 1.019
7.211AlaSer: 7.211 ± 0.906
6.125AlaThr: 6.125 ± 0.987
7.014AlaVal: 7.014 ± 0.993
1.581AlaTrp: 1.581 ± 0.398
2.074AlaTyr: 2.074 ± 0.449
0.0AlaXaa: 0.0 ± 0.0
Cys
1.087CysAla: 1.087 ± 0.353
0.099CysCys: 0.099 ± 0.091
0.691CysAsp: 0.691 ± 0.274
0.198CysGlu: 0.198 ± 0.134
0.296CysPhe: 0.296 ± 0.171
0.593CysGly: 0.593 ± 0.237
0.198CysHis: 0.198 ± 0.131
0.296CysIle: 0.296 ± 0.17
0.494CysLys: 0.494 ± 0.235
0.691CysLeu: 0.691 ± 0.241
0.296CysMet: 0.296 ± 0.168
0.296CysAsn: 0.296 ± 0.209
0.198CysPro: 0.198 ± 0.136
0.691CysGln: 0.691 ± 0.233
0.889CysArg: 0.889 ± 0.254
0.691CysSer: 0.691 ± 0.249
0.79CysThr: 0.79 ± 0.275
0.691CysVal: 0.691 ± 0.249
0.099CysTrp: 0.099 ± 0.091
0.395CysTyr: 0.395 ± 0.193
0.0CysXaa: 0.0 ± 0.0
Asp
6.421AspAla: 6.421 ± 0.829
0.395AspCys: 0.395 ± 0.152
3.457AspAsp: 3.457 ± 0.656
4.445AspGlu: 4.445 ± 0.849
2.964AspPhe: 2.964 ± 0.573
5.038AspGly: 5.038 ± 0.577
0.395AspHis: 0.395 ± 0.164
4.544AspIle: 4.544 ± 0.752
2.074AspLys: 2.074 ± 0.374
4.84AspLeu: 4.84 ± 0.586
0.494AspMet: 0.494 ± 0.239
1.482AspAsn: 1.482 ± 0.35
1.581AspPro: 1.581 ± 0.446
1.482AspGln: 1.482 ± 0.397
1.877AspArg: 1.877 ± 0.433
3.161AspSer: 3.161 ± 0.516
3.457AspThr: 3.457 ± 0.552
3.556AspVal: 3.556 ± 0.562
0.79AspTrp: 0.79 ± 0.26
2.865AspTyr: 2.865 ± 0.561
0.0AspXaa: 0.0 ± 0.0
Glu
4.544GluAla: 4.544 ± 0.758
0.395GluCys: 0.395 ± 0.198
2.074GluAsp: 2.074 ± 0.488
3.853GluGlu: 3.853 ± 0.52
1.877GluPhe: 1.877 ± 0.394
2.568GluGly: 2.568 ± 0.463
1.383GluHis: 1.383 ± 0.345
2.964GluIle: 2.964 ± 0.618
4.149GluLys: 4.149 ± 0.677
8.1GluLeu: 8.1 ± 0.623
2.568GluMet: 2.568 ± 0.438
3.457GluAsn: 3.457 ± 0.551
2.766GluPro: 2.766 ± 0.596
2.766GluGln: 2.766 ± 0.597
4.544GluArg: 4.544 ± 0.983
3.951GluSer: 3.951 ± 0.545
3.359GluThr: 3.359 ± 0.575
3.754GluVal: 3.754 ± 0.636
1.383GluTrp: 1.383 ± 0.393
2.47GluTyr: 2.47 ± 0.376
0.0GluXaa: 0.0 ± 0.0
Phe
3.062PheAla: 3.062 ± 0.422
0.593PheCys: 0.593 ± 0.233
1.976PheAsp: 1.976 ± 0.39
2.47PheGlu: 2.47 ± 0.481
1.482PhePhe: 1.482 ± 0.411
1.778PheGly: 1.778 ± 0.459
0.593PheHis: 0.593 ± 0.235
1.284PheIle: 1.284 ± 0.433
2.964PheLys: 2.964 ± 0.624
3.359PheLeu: 3.359 ± 0.652
1.087PheMet: 1.087 ± 0.309
1.581PheAsn: 1.581 ± 0.346
1.482PhePro: 1.482 ± 0.438
1.383PheGln: 1.383 ± 0.292
2.074PheArg: 2.074 ± 0.419
2.47PheSer: 2.47 ± 0.507
2.964PheThr: 2.964 ± 0.484
1.383PheVal: 1.383 ± 0.359
0.691PheTrp: 0.691 ± 0.272
1.284PheTyr: 1.284 ± 0.358
0.0PheXaa: 0.0 ± 0.0
Gly
5.137GlyAla: 5.137 ± 0.949
0.593GlyCys: 0.593 ± 0.333
4.544GlyAsp: 4.544 ± 0.639
3.754GlyGlu: 3.754 ± 0.579
2.272GlyPhe: 2.272 ± 0.505
5.038GlyGly: 5.038 ± 0.782
1.185GlyHis: 1.185 ± 0.37
3.26GlyIle: 3.26 ± 0.474
5.73GlyLys: 5.73 ± 0.739
4.939GlyLeu: 4.939 ± 0.663
2.272GlyMet: 2.272 ± 0.556
2.272GlyAsn: 2.272 ± 0.578
0.494GlyPro: 0.494 ± 0.186
1.976GlyGln: 1.976 ± 0.441
4.84GlyArg: 4.84 ± 0.658
3.556GlySer: 3.556 ± 0.488
3.853GlyThr: 3.853 ± 0.793
5.334GlyVal: 5.334 ± 0.745
0.79GlyTrp: 0.79 ± 0.214
1.976GlyTyr: 1.976 ± 0.414
0.0GlyXaa: 0.0 ± 0.0
His
1.581HisAla: 1.581 ± 0.467
0.691HisCys: 0.691 ± 0.307
1.284HisAsp: 1.284 ± 0.305
1.087HisGlu: 1.087 ± 0.329
0.494HisPhe: 0.494 ± 0.318
1.185HisGly: 1.185 ± 0.441
0.593HisHis: 0.593 ± 0.198
1.581HisIle: 1.581 ± 0.318
0.494HisLys: 0.494 ± 0.202
1.482HisLeu: 1.482 ± 0.34
0.395HisMet: 0.395 ± 0.164
0.988HisAsn: 0.988 ± 0.367
0.79HisPro: 0.79 ± 0.259
0.889HisGln: 0.889 ± 0.319
1.087HisArg: 1.087 ± 0.312
1.087HisSer: 1.087 ± 0.4
0.889HisThr: 0.889 ± 0.32
1.284HisVal: 1.284 ± 0.382
0.395HisTrp: 0.395 ± 0.177
0.296HisTyr: 0.296 ± 0.156
0.0HisXaa: 0.0 ± 0.0
Ile
4.149IleAla: 4.149 ± 0.673
0.691IleCys: 0.691 ± 0.349
3.457IleAsp: 3.457 ± 0.585
3.359IleGlu: 3.359 ± 0.494
2.371IlePhe: 2.371 ± 0.509
3.754IleGly: 3.754 ± 0.664
0.593IleHis: 0.593 ± 0.281
3.161IleIle: 3.161 ± 0.464
2.865IleLys: 2.865 ± 0.812
2.766IleLeu: 2.766 ± 0.484
1.383IleMet: 1.383 ± 0.4
3.359IleAsn: 3.359 ± 0.545
2.272IlePro: 2.272 ± 0.506
1.679IleGln: 1.679 ± 0.385
4.248IleArg: 4.248 ± 0.547
5.236IleSer: 5.236 ± 0.671
5.631IleThr: 5.631 ± 0.762
3.26IleVal: 3.26 ± 0.472
0.691IleTrp: 0.691 ± 0.227
1.877IleTyr: 1.877 ± 0.483
0.0IleXaa: 0.0 ± 0.0
Lys
4.939LysAla: 4.939 ± 0.709
0.494LysCys: 0.494 ± 0.198
2.173LysAsp: 2.173 ± 0.435
3.359LysGlu: 3.359 ± 0.531
2.074LysPhe: 2.074 ± 0.368
3.161LysGly: 3.161 ± 0.439
1.482LysHis: 1.482 ± 0.366
3.359LysIle: 3.359 ± 0.965
4.347LysLys: 4.347 ± 1.083
5.927LysLeu: 5.927 ± 0.725
0.988LysMet: 0.988 ± 0.343
4.149LysAsn: 4.149 ± 0.842
2.865LysPro: 2.865 ± 0.606
2.074LysGln: 2.074 ± 0.423
4.544LysArg: 4.544 ± 0.967
3.26LysSer: 3.26 ± 0.597
4.742LysThr: 4.742 ± 0.602
3.655LysVal: 3.655 ± 0.576
0.79LysTrp: 0.79 ± 0.271
2.766LysTyr: 2.766 ± 0.512
0.0LysXaa: 0.0 ± 0.0
Leu
9.385LeuAla: 9.385 ± 1.118
0.691LeuCys: 0.691 ± 0.291
4.939LeuAsp: 4.939 ± 0.801
5.927LeuGlu: 5.927 ± 0.789
3.853LeuPhe: 3.853 ± 0.719
4.742LeuGly: 4.742 ± 0.904
2.272LeuHis: 2.272 ± 0.422
4.939LeuIle: 4.939 ± 0.716
6.421LeuLys: 6.421 ± 0.724
5.038LeuLeu: 5.038 ± 0.663
3.062LeuMet: 3.062 ± 0.514
4.643LeuAsn: 4.643 ± 0.583
3.655LeuPro: 3.655 ± 0.666
3.26LeuGln: 3.26 ± 0.475
4.84LeuArg: 4.84 ± 0.488
6.223LeuSer: 6.223 ± 0.827
8.002LeuThr: 8.002 ± 0.811
4.84LeuVal: 4.84 ± 0.677
0.889LeuTrp: 0.889 ± 0.285
2.964LeuTyr: 2.964 ± 0.586
0.0LeuXaa: 0.0 ± 0.0
Met
3.26MetAla: 3.26 ± 0.502
0.198MetCys: 0.198 ± 0.119
0.79MetAsp: 0.79 ± 0.283
1.087MetGlu: 1.087 ± 0.342
1.284MetPhe: 1.284 ± 0.31
1.087MetGly: 1.087 ± 0.295
0.593MetHis: 0.593 ± 0.212
1.087MetIle: 1.087 ± 0.317
1.284MetLys: 1.284 ± 0.309
2.964MetLeu: 2.964 ± 0.624
1.185MetMet: 1.185 ± 0.317
2.173MetAsn: 2.173 ± 0.439
0.988MetPro: 0.988 ± 0.305
0.79MetGln: 0.79 ± 0.244
1.976MetArg: 1.976 ± 0.496
2.173MetSer: 2.173 ± 0.405
3.26MetThr: 3.26 ± 0.545
1.087MetVal: 1.087 ± 0.28
0.198MetTrp: 0.198 ± 0.123
0.691MetTyr: 0.691 ± 0.217
0.0MetXaa: 0.0 ± 0.0
Asn
3.655AsnAla: 3.655 ± 0.661
0.296AsnCys: 0.296 ± 0.183
2.964AsnAsp: 2.964 ± 0.584
2.568AsnGlu: 2.568 ± 0.535
1.383AsnPhe: 1.383 ± 0.342
3.853AsnGly: 3.853 ± 0.733
0.198AsnHis: 0.198 ± 0.121
3.062AsnIle: 3.062 ± 0.518
2.568AsnLys: 2.568 ± 0.513
3.161AsnLeu: 3.161 ± 0.521
0.889AsnMet: 0.889 ± 0.306
2.272AsnAsn: 2.272 ± 0.421
2.272AsnPro: 2.272 ± 0.55
1.383AsnGln: 1.383 ± 0.286
2.766AsnArg: 2.766 ± 0.656
2.964AsnSer: 2.964 ± 0.675
2.272AsnThr: 2.272 ± 0.452
1.877AsnVal: 1.877 ± 0.338
0.296AsnTrp: 0.296 ± 0.22
1.185AsnTyr: 1.185 ± 0.252
0.0AsnXaa: 0.0 ± 0.0
Pro
3.951ProAla: 3.951 ± 0.729
0.099ProCys: 0.099 ± 0.106
2.865ProAsp: 2.865 ± 0.554
2.964ProGlu: 2.964 ± 0.623
1.383ProPhe: 1.383 ± 0.557
2.074ProGly: 2.074 ± 0.411
0.988ProHis: 0.988 ± 0.392
1.679ProIle: 1.679 ± 0.448
2.47ProLys: 2.47 ± 0.499
3.754ProLeu: 3.754 ± 0.48
0.79ProMet: 0.79 ± 0.267
0.79ProAsn: 0.79 ± 0.281
1.284ProPro: 1.284 ± 0.251
1.482ProGln: 1.482 ± 0.416
2.272ProArg: 2.272 ± 0.628
2.766ProSer: 2.766 ± 0.609
1.778ProThr: 1.778 ± 0.37
4.643ProVal: 4.643 ± 0.719
0.494ProTrp: 0.494 ± 0.252
0.889ProTyr: 0.889 ± 0.283
0.0ProXaa: 0.0 ± 0.0
Gln
3.951GlnAla: 3.951 ± 1.19
0.198GlnCys: 0.198 ± 0.123
1.976GlnAsp: 1.976 ± 0.504
2.667GlnGlu: 2.667 ± 0.522
0.593GlnPhe: 0.593 ± 0.197
1.778GlnGly: 1.778 ± 0.274
0.494GlnHis: 0.494 ± 0.213
2.272GlnIle: 2.272 ± 0.603
2.568GlnLys: 2.568 ± 0.564
3.951GlnLeu: 3.951 ± 0.533
1.284GlnMet: 1.284 ± 0.306
0.79GlnAsn: 0.79 ± 0.243
1.679GlnPro: 1.679 ± 0.439
2.173GlnGln: 2.173 ± 0.599
3.754GlnArg: 3.754 ± 0.649
2.371GlnSer: 2.371 ± 0.53
1.976GlnThr: 1.976 ± 0.34
1.778GlnVal: 1.778 ± 0.386
0.593GlnTrp: 0.593 ± 0.207
0.494GlnTyr: 0.494 ± 0.216
0.0GlnXaa: 0.0 ± 0.0
Arg
5.631ArgAla: 5.631 ± 0.655
0.988ArgCys: 0.988 ± 0.213
3.26ArgAsp: 3.26 ± 0.432
4.544ArgGlu: 4.544 ± 0.583
1.679ArgPhe: 1.679 ± 0.392
3.062ArgGly: 3.062 ± 0.891
1.482ArgHis: 1.482 ± 0.391
4.445ArgIle: 4.445 ± 0.618
3.853ArgLys: 3.853 ± 0.672
5.927ArgLeu: 5.927 ± 0.975
1.581ArgMet: 1.581 ± 0.384
2.47ArgAsn: 2.47 ± 0.508
2.074ArgPro: 2.074 ± 0.431
3.556ArgGln: 3.556 ± 0.782
4.939ArgArg: 4.939 ± 0.695
2.865ArgSer: 2.865 ± 0.516
2.568ArgThr: 2.568 ± 0.416
4.939ArgVal: 4.939 ± 0.955
1.185ArgTrp: 1.185 ± 0.341
2.47ArgTyr: 2.47 ± 0.43
0.0ArgXaa: 0.0 ± 0.0
Ser
5.927SerAla: 5.927 ± 0.969
0.494SerCys: 0.494 ± 0.207
3.556SerAsp: 3.556 ± 0.652
4.05SerGlu: 4.05 ± 0.556
2.47SerPhe: 2.47 ± 0.483
3.655SerGly: 3.655 ± 0.607
1.284SerHis: 1.284 ± 0.48
3.062SerIle: 3.062 ± 0.47
3.951SerLys: 3.951 ± 0.908
6.619SerLeu: 6.619 ± 1.258
1.679SerMet: 1.679 ± 0.404
3.161SerAsn: 3.161 ± 0.664
2.667SerPro: 2.667 ± 0.585
2.371SerGln: 2.371 ± 0.371
4.05SerArg: 4.05 ± 0.483
3.161SerSer: 3.161 ± 0.733
3.754SerThr: 3.754 ± 0.538
5.73SerVal: 5.73 ± 0.828
0.494SerTrp: 0.494 ± 0.166
1.679SerTyr: 1.679 ± 0.451
0.0SerXaa: 0.0 ± 0.0
Thr
7.014ThrAla: 7.014 ± 1.411
0.691ThrCys: 0.691 ± 0.259
3.457ThrAsp: 3.457 ± 0.58
3.161ThrGlu: 3.161 ± 0.632
2.272ThrPhe: 2.272 ± 0.481
6.816ThrGly: 6.816 ± 1.048
1.185ThrHis: 1.185 ± 0.339
3.655ThrIle: 3.655 ± 0.566
3.359ThrLys: 3.359 ± 0.865
7.211ThrLeu: 7.211 ± 0.882
2.074ThrMet: 2.074 ± 0.415
2.074ThrAsn: 2.074 ± 0.521
3.26ThrPro: 3.26 ± 0.407
1.778ThrGln: 1.778 ± 0.542
4.05ThrArg: 4.05 ± 0.537
3.754ThrSer: 3.754 ± 0.392
3.951ThrThr: 3.951 ± 0.704
4.742ThrVal: 4.742 ± 0.579
0.395ThrTrp: 0.395 ± 0.166
0.988ThrTyr: 0.988 ± 0.25
0.0ThrXaa: 0.0 ± 0.0
Val
6.619ValAla: 6.619 ± 0.908
1.087ValCys: 1.087 ± 0.342
4.149ValAsp: 4.149 ± 0.611
4.445ValGlu: 4.445 ± 0.706
2.074ValPhe: 2.074 ± 0.456
4.248ValGly: 4.248 ± 0.827
0.691ValHis: 0.691 ± 0.21
4.149ValIle: 4.149 ± 0.565
4.544ValLys: 4.544 ± 0.683
6.125ValLeu: 6.125 ± 0.74
2.568ValMet: 2.568 ± 0.568
2.568ValAsn: 2.568 ± 0.522
2.074ValPro: 2.074 ± 0.415
2.173ValGln: 2.173 ± 0.376
2.964ValArg: 2.964 ± 0.608
5.137ValSer: 5.137 ± 0.592
4.544ValThr: 4.544 ± 0.854
4.643ValVal: 4.643 ± 0.665
0.395ValTrp: 0.395 ± 0.151
1.284ValTyr: 1.284 ± 0.355
0.0ValXaa: 0.0 ± 0.0
Trp
1.087TrpAla: 1.087 ± 0.241
0.0TrpCys: 0.0 ± 0.0
0.889TrpAsp: 0.889 ± 0.267
0.988TrpGlu: 0.988 ± 0.294
0.593TrpPhe: 0.593 ± 0.186
0.198TrpGly: 0.198 ± 0.182
0.494TrpHis: 0.494 ± 0.236
0.79TrpIle: 0.79 ± 0.23
0.79TrpLys: 0.79 ± 0.252
1.482TrpLeu: 1.482 ± 0.445
0.395TrpMet: 0.395 ± 0.198
0.395TrpAsn: 0.395 ± 0.289
1.087TrpPro: 1.087 ± 0.311
0.296TrpGln: 0.296 ± 0.149
1.383TrpArg: 1.383 ± 0.326
0.494TrpSer: 0.494 ± 0.21
0.198TrpThr: 0.198 ± 0.126
0.494TrpVal: 0.494 ± 0.194
0.593TrpTrp: 0.593 ± 0.205
0.79TrpTyr: 0.79 ± 0.233
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.062TyrAla: 3.062 ± 0.63
0.296TyrCys: 0.296 ± 0.169
1.482TyrAsp: 1.482 ± 0.423
2.667TyrGlu: 2.667 ± 0.451
1.482TyrPhe: 1.482 ± 0.394
1.778TyrGly: 1.778 ± 0.371
0.691TyrHis: 0.691 ± 0.3
2.568TyrIle: 2.568 ± 0.495
0.889TyrLys: 0.889 ± 0.383
1.976TyrLeu: 1.976 ± 0.458
0.988TyrMet: 0.988 ± 0.297
0.79TyrAsn: 0.79 ± 0.218
1.778TyrPro: 1.778 ± 0.484
1.482TyrGln: 1.482 ± 0.441
1.976TyrArg: 1.976 ± 0.5
1.284TyrSer: 1.284 ± 0.382
1.778TyrThr: 1.778 ± 0.429
1.778TyrVal: 1.778 ± 0.449
0.691TyrTrp: 0.691 ± 0.249
0.889TyrTyr: 0.889 ± 0.276
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 44 proteins (10124 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski