Amino acid dipepetide frequency for Escherichia phage CJ19

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.171AlaAla: 8.171 ± 1.115
0.871AlaCys: 0.871 ± 0.269
5.023AlaAsp: 5.023 ± 0.487
5.559AlaGlu: 5.559 ± 0.753
3.215AlaPhe: 3.215 ± 0.539
6.095AlaGly: 6.095 ± 0.527
0.871AlaHis: 0.871 ± 0.224
5.09AlaIle: 5.09 ± 0.479
5.559AlaLys: 5.559 ± 0.75
7.636AlaLeu: 7.636 ± 0.768
2.478AlaMet: 2.478 ± 0.456
3.416AlaAsn: 3.416 ± 0.491
1.34AlaPro: 1.34 ± 0.294
3.416AlaGln: 3.416 ± 0.41
4.756AlaArg: 4.756 ± 0.555
5.023AlaSer: 5.023 ± 0.534
3.885AlaThr: 3.885 ± 0.75
5.626AlaVal: 5.626 ± 0.653
1.474AlaTrp: 1.474 ± 0.269
2.947AlaTyr: 2.947 ± 0.431
0.0AlaXaa: 0.0 ± 0.0
Cys
1.139CysAla: 1.139 ± 0.271
0.201CysCys: 0.201 ± 0.115
1.005CysAsp: 1.005 ± 0.214
0.938CysGlu: 0.938 ± 0.343
0.603CysPhe: 0.603 ± 0.184
1.407CysGly: 1.407 ± 0.345
0.335CysHis: 0.335 ± 0.159
1.139CysIle: 1.139 ± 0.269
1.005CysLys: 1.005 ± 0.229
0.536CysLeu: 0.536 ± 0.175
0.402CysMet: 0.402 ± 0.148
0.335CysAsn: 0.335 ± 0.152
0.335CysPro: 0.335 ± 0.151
0.268CysGln: 0.268 ± 0.151
0.804CysArg: 0.804 ± 0.224
0.603CysSer: 0.603 ± 0.183
0.603CysThr: 0.603 ± 0.192
1.206CysVal: 1.206 ± 0.286
0.268CysTrp: 0.268 ± 0.131
0.268CysTyr: 0.268 ± 0.112
0.0CysXaa: 0.0 ± 0.0
Asp
5.224AspAla: 5.224 ± 0.605
0.603AspCys: 0.603 ± 0.221
3.952AspAsp: 3.952 ± 0.628
5.023AspGlu: 5.023 ± 0.67
2.478AspPhe: 2.478 ± 0.399
7.502AspGly: 7.502 ± 0.789
1.072AspHis: 1.072 ± 0.235
3.416AspIle: 3.416 ± 0.417
4.421AspLys: 4.421 ± 0.568
3.885AspLeu: 3.885 ± 0.522
1.674AspMet: 1.674 ± 0.341
2.746AspAsn: 2.746 ± 0.412
1.875AspPro: 1.875 ± 0.317
1.808AspGln: 1.808 ± 0.305
2.009AspArg: 2.009 ± 0.345
3.282AspSer: 3.282 ± 0.416
3.081AspThr: 3.081 ± 0.378
4.153AspVal: 4.153 ± 0.513
0.938AspTrp: 0.938 ± 0.249
3.081AspTyr: 3.081 ± 0.423
0.067AspXaa: 0.067 ± 0.068
Glu
5.09GluAla: 5.09 ± 0.539
0.67GluCys: 0.67 ± 0.177
3.55GluAsp: 3.55 ± 0.472
3.952GluGlu: 3.952 ± 0.629
3.416GluPhe: 3.416 ± 0.468
3.617GluGly: 3.617 ± 0.405
1.005GluHis: 1.005 ± 0.302
4.823GluIle: 4.823 ± 0.673
4.354GluLys: 4.354 ± 0.571
4.22GluLeu: 4.22 ± 0.516
3.081GluMet: 3.081 ± 0.405
3.349GluAsn: 3.349 ± 0.509
2.143GluPro: 2.143 ± 0.367
3.081GluGln: 3.081 ± 0.597
2.411GluArg: 2.411 ± 0.399
4.287GluSer: 4.287 ± 0.553
2.344GluThr: 2.344 ± 0.411
4.956GluVal: 4.956 ± 0.671
1.072GluTrp: 1.072 ± 0.316
2.545GluTyr: 2.545 ± 0.431
0.0GluXaa: 0.0 ± 0.0
Phe
2.679PheAla: 2.679 ± 0.466
0.536PheCys: 0.536 ± 0.171
3.751PheAsp: 3.751 ± 0.532
2.143PheGlu: 2.143 ± 0.431
0.67PhePhe: 0.67 ± 0.181
3.483PheGly: 3.483 ± 0.582
0.536PheHis: 0.536 ± 0.164
2.478PheIle: 2.478 ± 0.343
2.813PheLys: 2.813 ± 0.355
2.076PheLeu: 2.076 ± 0.415
1.005PheMet: 1.005 ± 0.303
2.746PheAsn: 2.746 ± 0.574
1.674PhePro: 1.674 ± 0.287
1.541PheGln: 1.541 ± 0.312
1.808PheArg: 1.808 ± 0.318
2.277PheSer: 2.277 ± 0.343
1.875PheThr: 1.875 ± 0.366
3.081PheVal: 3.081 ± 0.462
0.871PheTrp: 0.871 ± 0.245
1.273PheTyr: 1.273 ± 0.305
0.0PheXaa: 0.0 ± 0.0
Gly
5.961GlyAla: 5.961 ± 0.737
1.34GlyCys: 1.34 ± 0.327
4.889GlyAsp: 4.889 ± 0.528
4.421GlyGlu: 4.421 ± 0.408
3.349GlyPhe: 3.349 ± 0.561
6.296GlyGly: 6.296 ± 1.065
1.407GlyHis: 1.407 ± 0.325
5.425GlyIle: 5.425 ± 0.693
6.765GlyLys: 6.765 ± 0.735
4.956GlyLeu: 4.956 ± 0.558
3.081GlyMet: 3.081 ± 0.455
3.885GlyAsn: 3.885 ± 0.532
0.804GlyPro: 0.804 ± 0.235
2.009GlyGln: 2.009 ± 0.435
3.416GlyArg: 3.416 ± 0.433
5.76GlySer: 5.76 ± 0.573
3.349GlyThr: 3.349 ± 0.379
6.698GlyVal: 6.698 ± 0.664
0.938GlyTrp: 0.938 ± 0.227
3.081GlyTyr: 3.081 ± 0.411
0.0GlyXaa: 0.0 ± 0.0
His
1.005HisAla: 1.005 ± 0.323
0.268HisCys: 0.268 ± 0.129
1.34HisAsp: 1.34 ± 0.292
1.34HisGlu: 1.34 ± 0.325
0.67HisPhe: 0.67 ± 0.184
1.541HisGly: 1.541 ± 0.321
0.402HisHis: 0.402 ± 0.161
1.273HisIle: 1.273 ± 0.328
1.541HisLys: 1.541 ± 0.341
1.139HisLeu: 1.139 ± 0.293
0.402HisMet: 0.402 ± 0.149
0.536HisAsn: 0.536 ± 0.188
0.938HisPro: 0.938 ± 0.248
0.737HisGln: 0.737 ± 0.223
1.206HisArg: 1.206 ± 0.278
1.072HisSer: 1.072 ± 0.284
1.005HisThr: 1.005 ± 0.304
1.34HisVal: 1.34 ± 0.405
0.134HisTrp: 0.134 ± 0.092
0.67HisTyr: 0.67 ± 0.211
0.0HisXaa: 0.0 ± 0.0
Ile
6.028IleAla: 6.028 ± 0.589
0.67IleCys: 0.67 ± 0.211
4.956IleAsp: 4.956 ± 0.546
4.421IleGlu: 4.421 ± 0.667
2.009IlePhe: 2.009 ± 0.342
4.22IleGly: 4.22 ± 0.441
1.608IleHis: 1.608 ± 0.358
4.019IleIle: 4.019 ± 0.408
5.224IleLys: 5.224 ± 0.55
4.153IleLeu: 4.153 ± 0.641
1.072IleMet: 1.072 ± 0.34
3.148IleAsn: 3.148 ± 0.585
2.612IlePro: 2.612 ± 0.425
2.076IleGln: 2.076 ± 0.3
3.349IleArg: 3.349 ± 0.488
4.622IleSer: 4.622 ± 0.581
4.153IleThr: 4.153 ± 0.528
3.751IleVal: 3.751 ± 0.47
1.273IleTrp: 1.273 ± 0.304
2.21IleTyr: 2.21 ± 0.415
0.0IleXaa: 0.0 ± 0.0
Lys
7.1LysAla: 7.1 ± 0.732
0.67LysCys: 0.67 ± 0.241
4.22LysAsp: 4.22 ± 0.416
5.157LysGlu: 5.157 ± 0.664
2.746LysPhe: 2.746 ± 0.312
4.22LysGly: 4.22 ± 0.555
1.273LysHis: 1.273 ± 0.379
4.421LysIle: 4.421 ± 0.405
4.889LysLys: 4.889 ± 0.581
4.153LysLeu: 4.153 ± 0.534
3.148LysMet: 3.148 ± 0.434
3.014LysAsn: 3.014 ± 0.454
2.344LysPro: 2.344 ± 0.403
2.612LysGln: 2.612 ± 0.349
3.617LysArg: 3.617 ± 0.592
4.153LysSer: 4.153 ± 0.661
3.751LysThr: 3.751 ± 0.352
5.425LysVal: 5.425 ± 0.646
1.206LysTrp: 1.206 ± 0.295
2.612LysTyr: 2.612 ± 0.383
0.067LysXaa: 0.067 ± 0.056
Leu
5.76LeuAla: 5.76 ± 0.673
0.938LeuCys: 0.938 ± 0.21
3.818LeuAsp: 3.818 ± 0.44
3.751LeuGlu: 3.751 ± 0.544
2.21LeuPhe: 2.21 ± 0.44
5.09LeuGly: 5.09 ± 0.527
1.005LeuHis: 1.005 ± 0.385
4.689LeuIle: 4.689 ± 0.47
4.421LeuLys: 4.421 ± 0.526
4.421LeuLeu: 4.421 ± 0.622
1.474LeuMet: 1.474 ± 0.329
3.885LeuAsn: 3.885 ± 0.539
2.947LeuPro: 2.947 ± 0.409
2.411LeuGln: 2.411 ± 0.396
3.215LeuArg: 3.215 ± 0.453
5.425LeuSer: 5.425 ± 0.645
4.421LeuThr: 4.421 ± 0.451
3.55LeuVal: 3.55 ± 0.508
0.603LeuTrp: 0.603 ± 0.217
2.21LeuTyr: 2.21 ± 0.38
0.0LeuXaa: 0.0 ± 0.0
Met
3.014MetAla: 3.014 ± 0.44
0.201MetCys: 0.201 ± 0.117
1.005MetAsp: 1.005 ± 0.273
1.741MetGlu: 1.741 ± 0.314
1.005MetPhe: 1.005 ± 0.258
1.407MetGly: 1.407 ± 0.328
0.737MetHis: 0.737 ± 0.232
2.746MetIle: 2.746 ± 0.378
2.009MetLys: 2.009 ± 0.356
1.875MetLeu: 1.875 ± 0.355
1.005MetMet: 1.005 ± 0.254
1.674MetAsn: 1.674 ± 0.358
1.139MetPro: 1.139 ± 0.255
1.273MetGln: 1.273 ± 0.279
1.608MetArg: 1.608 ± 0.326
2.746MetSer: 2.746 ± 0.434
1.942MetThr: 1.942 ± 0.386
2.143MetVal: 2.143 ± 0.382
0.268MetTrp: 0.268 ± 0.114
0.871MetTyr: 0.871 ± 0.233
0.0MetXaa: 0.0 ± 0.0
Asn
3.684AsnAla: 3.684 ± 0.461
0.737AsnCys: 0.737 ± 0.212
3.483AsnAsp: 3.483 ± 0.472
2.947AsnGlu: 2.947 ± 0.455
2.143AsnPhe: 2.143 ± 0.315
6.095AsnGly: 6.095 ± 0.79
1.206AsnHis: 1.206 ± 0.285
1.674AsnIle: 1.674 ± 0.285
3.148AsnLys: 3.148 ± 0.445
3.081AsnLeu: 3.081 ± 0.453
1.206AsnMet: 1.206 ± 0.281
3.148AsnAsn: 3.148 ± 0.636
2.009AsnPro: 2.009 ± 0.328
1.741AsnGln: 1.741 ± 0.364
1.674AsnArg: 1.674 ± 0.306
3.282AsnSer: 3.282 ± 0.481
2.009AsnThr: 2.009 ± 0.381
3.952AsnVal: 3.952 ± 0.472
0.469AsnTrp: 0.469 ± 0.153
1.808AsnTyr: 1.808 ± 0.322
0.0AsnXaa: 0.0 ± 0.0
Pro
2.21ProAla: 2.21 ± 0.283
0.67ProCys: 0.67 ± 0.225
2.277ProAsp: 2.277 ± 0.359
2.88ProGlu: 2.88 ± 0.605
1.942ProPhe: 1.942 ± 0.426
1.875ProGly: 1.875 ± 0.368
0.804ProHis: 0.804 ± 0.219
2.143ProIle: 2.143 ± 0.389
1.407ProLys: 1.407 ± 0.306
1.005ProLeu: 1.005 ± 0.204
1.005ProMet: 1.005 ± 0.199
1.674ProAsn: 1.674 ± 0.293
0.804ProPro: 0.804 ± 0.275
0.938ProGln: 0.938 ± 0.264
1.474ProArg: 1.474 ± 0.274
1.474ProSer: 1.474 ± 0.261
1.741ProThr: 1.741 ± 0.303
2.679ProVal: 2.679 ± 0.331
0.67ProTrp: 0.67 ± 0.229
0.938ProTyr: 0.938 ± 0.201
0.0ProXaa: 0.0 ± 0.0
Gln
3.952GlnAla: 3.952 ± 0.488
0.335GlnCys: 0.335 ± 0.125
1.541GlnAsp: 1.541 ± 0.303
1.942GlnGlu: 1.942 ± 0.31
1.741GlnPhe: 1.741 ± 0.3
2.076GlnGly: 2.076 ± 0.403
0.737GlnHis: 0.737 ± 0.23
2.746GlnIle: 2.746 ± 0.373
2.478GlnLys: 2.478 ± 0.46
3.349GlnLeu: 3.349 ± 0.369
1.072GlnMet: 1.072 ± 0.262
1.34GlnAsn: 1.34 ± 0.274
1.005GlnPro: 1.005 ± 0.24
2.009GlnGln: 2.009 ± 0.767
1.942GlnArg: 1.942 ± 0.336
2.545GlnSer: 2.545 ± 0.352
1.072GlnThr: 1.072 ± 0.232
3.215GlnVal: 3.215 ± 0.419
0.603GlnTrp: 0.603 ± 0.207
1.942GlnTyr: 1.942 ± 0.398
0.0GlnXaa: 0.0 ± 0.0
Arg
3.55ArgAla: 3.55 ± 0.498
0.804ArgCys: 0.804 ± 0.315
2.21ArgAsp: 2.21 ± 0.387
3.148ArgGlu: 3.148 ± 0.343
1.875ArgPhe: 1.875 ± 0.35
2.88ArgGly: 2.88 ± 0.386
0.67ArgHis: 0.67 ± 0.227
3.215ArgIle: 3.215 ± 0.516
4.153ArgLys: 4.153 ± 0.5
3.483ArgLeu: 3.483 ± 0.47
1.674ArgMet: 1.674 ± 0.293
2.009ArgAsn: 2.009 ± 0.366
1.608ArgPro: 1.608 ± 0.356
2.21ArgGln: 2.21 ± 0.405
2.344ArgArg: 2.344 ± 0.481
3.081ArgSer: 3.081 ± 0.44
2.21ArgThr: 2.21 ± 0.405
3.617ArgVal: 3.617 ± 0.538
0.603ArgTrp: 0.603 ± 0.18
1.741ArgTyr: 1.741 ± 0.339
0.067ArgXaa: 0.067 ± 0.066
Ser
5.224SerAla: 5.224 ± 0.773
0.67SerCys: 0.67 ± 0.222
3.885SerAsp: 3.885 ± 0.521
4.622SerGlu: 4.622 ± 0.491
2.545SerPhe: 2.545 ± 0.365
6.698SerGly: 6.698 ± 0.711
0.938SerHis: 0.938 ± 0.237
4.689SerIle: 4.689 ± 0.678
4.287SerLys: 4.287 ± 0.567
4.756SerLeu: 4.756 ± 0.527
1.206SerMet: 1.206 ± 0.301
3.684SerAsn: 3.684 ± 0.6
1.674SerPro: 1.674 ± 0.316
2.947SerGln: 2.947 ± 0.423
2.88SerArg: 2.88 ± 0.391
3.751SerSer: 3.751 ± 0.566
3.483SerThr: 3.483 ± 0.506
4.555SerVal: 4.555 ± 0.462
1.34SerTrp: 1.34 ± 0.286
2.344SerTyr: 2.344 ± 0.391
0.134SerXaa: 0.134 ± 0.099
Thr
4.086ThrAla: 4.086 ± 0.597
0.536ThrCys: 0.536 ± 0.158
2.813ThrAsp: 2.813 ± 0.393
2.88ThrGlu: 2.88 ± 0.401
2.344ThrPhe: 2.344 ± 0.386
4.956ThrGly: 4.956 ± 0.521
1.005ThrHis: 1.005 ± 0.226
3.215ThrIle: 3.215 ± 0.362
3.282ThrLys: 3.282 ± 0.568
3.751ThrLeu: 3.751 ± 0.44
1.541ThrMet: 1.541 ± 0.302
2.009ThrAsn: 2.009 ± 0.364
2.143ThrPro: 2.143 ± 0.344
2.545ThrGln: 2.545 ± 0.415
1.741ThrArg: 1.741 ± 0.293
3.55ThrSer: 3.55 ± 0.546
2.947ThrThr: 2.947 ± 0.47
4.287ThrVal: 4.287 ± 0.429
0.737ThrTrp: 0.737 ± 0.266
2.076ThrTyr: 2.076 ± 0.364
0.0ThrXaa: 0.0 ± 0.0
Val
4.823ValAla: 4.823 ± 0.501
1.273ValCys: 1.273 ± 0.288
4.756ValAsp: 4.756 ± 0.579
4.153ValGlu: 4.153 ± 0.548
2.344ValPhe: 2.344 ± 0.348
4.555ValGly: 4.555 ± 0.515
1.674ValHis: 1.674 ± 0.345
5.492ValIle: 5.492 ± 0.469
5.425ValLys: 5.425 ± 0.583
4.689ValLeu: 4.689 ± 0.368
2.478ValMet: 2.478 ± 0.435
4.086ValAsn: 4.086 ± 0.545
1.608ValPro: 1.608 ± 0.329
2.411ValGln: 2.411 ± 0.482
3.885ValArg: 3.885 ± 0.638
4.889ValSer: 4.889 ± 0.432
5.358ValThr: 5.358 ± 0.646
5.291ValVal: 5.291 ± 0.78
0.737ValTrp: 0.737 ± 0.206
2.947ValTyr: 2.947 ± 0.495
0.0ValXaa: 0.0 ± 0.0
Trp
1.407TrpAla: 1.407 ± 0.263
0.402TrpCys: 0.402 ± 0.181
0.67TrpAsp: 0.67 ± 0.188
0.871TrpGlu: 0.871 ± 0.219
0.737TrpPhe: 0.737 ± 0.236
0.804TrpGly: 0.804 ± 0.182
0.402TrpHis: 0.402 ± 0.16
0.603TrpIle: 0.603 ± 0.181
1.34TrpLys: 1.34 ± 0.252
1.273TrpLeu: 1.273 ± 0.321
0.536TrpMet: 0.536 ± 0.165
0.469TrpAsn: 0.469 ± 0.154
0.469TrpPro: 0.469 ± 0.201
0.469TrpGln: 0.469 ± 0.174
0.871TrpArg: 0.871 ± 0.196
1.34TrpSer: 1.34 ± 0.322
0.603TrpThr: 0.603 ± 0.279
0.871TrpVal: 0.871 ± 0.229
0.268TrpTrp: 0.268 ± 0.121
0.67TrpTyr: 0.67 ± 0.183
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.277TyrAla: 2.277 ± 0.31
0.938TyrCys: 0.938 ± 0.31
3.215TyrAsp: 3.215 ± 0.458
2.009TyrGlu: 2.009 ± 0.335
1.206TyrPhe: 1.206 ± 0.247
3.014TyrGly: 3.014 ± 0.413
0.871TyrHis: 0.871 ± 0.247
2.277TyrIle: 2.277 ± 0.346
2.277TyrLys: 2.277 ± 0.456
1.808TyrLeu: 1.808 ± 0.316
0.804TyrMet: 0.804 ± 0.204
2.344TyrAsn: 2.344 ± 0.397
1.139TyrPro: 1.139 ± 0.215
1.139TyrGln: 1.139 ± 0.232
2.143TyrArg: 2.143 ± 0.384
3.081TyrSer: 3.081 ± 0.433
2.612TyrThr: 2.612 ± 0.355
2.478TyrVal: 2.478 ± 0.415
0.603TyrTrp: 0.603 ± 0.199
1.005TyrTyr: 1.005 ± 0.273
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.067XaaAla: 0.067 ± 0.056
0.067XaaCys: 0.067 ± 0.066
0.0XaaAsp: 0.0 ± 0.0
0.067XaaGlu: 0.067 ± 0.068
0.067XaaPhe: 0.067 ± 0.08
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.067XaaSer: 0.067 ± 0.066
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 79 proteins (14931 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski