Amino acid dipepetide frequency for Citrobacter phage vB_CroP_CrRp3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.702AlaAla: 8.702 ± 1.201
0.385AlaCys: 0.385 ± 0.2
5.853AlaAsp: 5.853 ± 0.728
7.085AlaGlu: 7.085 ± 1.091
3.08AlaPhe: 3.08 ± 0.615
7.47AlaGly: 7.47 ± 0.872
1.232AlaHis: 1.232 ± 0.289
5.468AlaIle: 5.468 ± 0.542
5.083AlaLys: 5.083 ± 0.607
7.085AlaLeu: 7.085 ± 0.686
2.772AlaMet: 2.772 ± 0.444
3.928AlaAsn: 3.928 ± 0.539
2.849AlaPro: 2.849 ± 0.446
5.006AlaGln: 5.006 ± 0.868
5.391AlaArg: 5.391 ± 0.591
5.237AlaSer: 5.237 ± 0.846
5.16AlaThr: 5.16 ± 0.659
7.162AlaVal: 7.162 ± 0.753
0.77AlaTrp: 0.77 ± 0.194
2.926AlaTyr: 2.926 ± 0.544
0.0AlaXaa: 0.0 ± 0.0
Cys
0.539CysAla: 0.539 ± 0.242
0.077CysCys: 0.077 ± 0.071
0.539CysAsp: 0.539 ± 0.198
0.385CysGlu: 0.385 ± 0.195
0.385CysPhe: 0.385 ± 0.164
1.001CysGly: 1.001 ± 0.344
0.308CysHis: 0.308 ± 0.186
0.308CysIle: 0.308 ± 0.153
0.693CysLys: 0.693 ± 0.257
0.693CysLeu: 0.693 ± 0.289
0.462CysMet: 0.462 ± 0.173
0.539CysAsn: 0.539 ± 0.23
0.462CysPro: 0.462 ± 0.21
0.385CysGln: 0.385 ± 0.172
0.847CysArg: 0.847 ± 0.271
0.539CysSer: 0.539 ± 0.263
0.231CysThr: 0.231 ± 0.152
0.693CysVal: 0.693 ± 0.244
0.077CysTrp: 0.077 ± 0.071
0.77CysTyr: 0.77 ± 0.264
0.0CysXaa: 0.0 ± 0.0
Asp
6.392AspAla: 6.392 ± 0.829
0.693AspCys: 0.693 ± 0.202
3.466AspAsp: 3.466 ± 0.683
4.005AspGlu: 4.005 ± 0.634
2.387AspPhe: 2.387 ± 0.465
4.159AspGly: 4.159 ± 0.612
1.078AspHis: 1.078 ± 0.264
3.62AspIle: 3.62 ± 0.545
4.698AspLys: 4.698 ± 0.569
5.391AspLeu: 5.391 ± 0.698
1.617AspMet: 1.617 ± 0.295
2.926AspAsn: 2.926 ± 0.453
1.848AspPro: 1.848 ± 0.374
0.77AspGln: 0.77 ± 0.269
3.08AspArg: 3.08 ± 0.422
3.62AspSer: 3.62 ± 0.544
4.159AspThr: 4.159 ± 0.526
4.005AspVal: 4.005 ± 0.567
1.232AspTrp: 1.232 ± 0.341
1.925AspTyr: 1.925 ± 0.433
0.0AspXaa: 0.0 ± 0.0
Glu
8.24GluAla: 8.24 ± 0.97
1.001GluCys: 1.001 ± 0.345
4.467GluAsp: 4.467 ± 0.643
7.701GluGlu: 7.701 ± 0.895
2.772GluPhe: 2.772 ± 0.449
6.007GluGly: 6.007 ± 0.691
1.232GluHis: 1.232 ± 0.264
3.08GluIle: 3.08 ± 0.411
3.928GluLys: 3.928 ± 0.451
6.469GluLeu: 6.469 ± 0.687
2.31GluMet: 2.31 ± 0.384
1.771GluAsn: 1.771 ± 0.47
2.464GluPro: 2.464 ± 0.534
3.466GluGln: 3.466 ± 0.461
3.928GluArg: 3.928 ± 0.514
2.695GluSer: 2.695 ± 0.522
3.003GluThr: 3.003 ± 0.462
5.699GluVal: 5.699 ± 0.665
1.155GluTrp: 1.155 ± 0.272
1.617GluTyr: 1.617 ± 0.338
0.0GluXaa: 0.0 ± 0.0
Phe
2.31PheAla: 2.31 ± 0.397
0.231PheCys: 0.231 ± 0.174
3.543PheAsp: 3.543 ± 0.46
2.926PheGlu: 2.926 ± 0.395
1.232PhePhe: 1.232 ± 0.354
2.618PheGly: 2.618 ± 0.485
0.77PheHis: 0.77 ± 0.218
1.925PheIle: 1.925 ± 0.282
2.695PheLys: 2.695 ± 0.47
3.08PheLeu: 3.08 ± 0.513
1.771PheMet: 1.771 ± 0.374
2.31PheAsn: 2.31 ± 0.515
1.078PhePro: 1.078 ± 0.237
1.155PheGln: 1.155 ± 0.285
1.925PheArg: 1.925 ± 0.316
1.694PheSer: 1.694 ± 0.392
2.31PheThr: 2.31 ± 0.378
1.386PheVal: 1.386 ± 0.333
0.154PheTrp: 0.154 ± 0.107
0.924PheTyr: 0.924 ± 0.234
0.0PheXaa: 0.0 ± 0.0
Gly
6.392GlyAla: 6.392 ± 0.549
0.77GlyCys: 0.77 ± 0.28
5.622GlyAsp: 5.622 ± 0.552
4.852GlyGlu: 4.852 ± 0.633
3.389GlyPhe: 3.389 ± 0.647
6.315GlyGly: 6.315 ± 0.947
2.233GlyHis: 2.233 ± 0.417
4.313GlyIle: 4.313 ± 0.554
6.161GlyLys: 6.161 ± 0.822
5.545GlyLeu: 5.545 ± 0.656
1.771GlyMet: 1.771 ± 0.298
3.157GlyAsn: 3.157 ± 0.483
0.231GlyPro: 0.231 ± 0.133
3.62GlyGln: 3.62 ± 0.537
3.851GlyArg: 3.851 ± 0.478
5.545GlySer: 5.545 ± 0.826
4.236GlyThr: 4.236 ± 0.522
5.237GlyVal: 5.237 ± 0.717
1.463GlyTrp: 1.463 ± 0.28
3.08GlyTyr: 3.08 ± 0.462
0.0GlyXaa: 0.0 ± 0.0
His
1.155HisAla: 1.155 ± 0.258
0.308HisCys: 0.308 ± 0.146
1.463HisAsp: 1.463 ± 0.349
1.54HisGlu: 1.54 ± 0.456
1.078HisPhe: 1.078 ± 0.256
1.694HisGly: 1.694 ± 0.352
0.385HisHis: 0.385 ± 0.171
1.232HisIle: 1.232 ± 0.286
1.155HisLys: 1.155 ± 0.266
2.31HisLeu: 2.31 ± 0.389
0.77HisMet: 0.77 ± 0.268
0.847HisAsn: 0.847 ± 0.201
0.616HisPro: 0.616 ± 0.215
0.693HisGln: 0.693 ± 0.227
1.155HisArg: 1.155 ± 0.353
0.847HisSer: 0.847 ± 0.248
1.232HisThr: 1.232 ± 0.433
1.001HisVal: 1.001 ± 0.304
0.154HisTrp: 0.154 ± 0.092
0.693HisTyr: 0.693 ± 0.266
0.0HisXaa: 0.0 ± 0.0
Ile
5.006IleAla: 5.006 ± 0.48
0.154IleCys: 0.154 ± 0.091
3.466IleAsp: 3.466 ± 0.524
4.236IleGlu: 4.236 ± 0.519
1.54IlePhe: 1.54 ± 0.519
3.774IleGly: 3.774 ± 0.461
1.54IleHis: 1.54 ± 0.364
3.774IleIle: 3.774 ± 0.578
3.851IleLys: 3.851 ± 0.528
3.543IleLeu: 3.543 ± 0.411
2.156IleMet: 2.156 ± 0.434
2.387IleAsn: 2.387 ± 0.506
2.31IlePro: 2.31 ± 0.464
2.079IleGln: 2.079 ± 0.395
3.003IleArg: 3.003 ± 0.403
2.772IleSer: 2.772 ± 0.43
2.849IleThr: 2.849 ± 0.669
4.39IleVal: 4.39 ± 0.57
0.924IleTrp: 0.924 ± 0.249
1.617IleTyr: 1.617 ± 0.391
0.0IleXaa: 0.0 ± 0.0
Lys
8.24LysAla: 8.24 ± 0.946
0.847LysCys: 0.847 ± 0.242
3.08LysAsp: 3.08 ± 0.547
4.852LysGlu: 4.852 ± 0.665
1.617LysPhe: 1.617 ± 0.309
5.776LysGly: 5.776 ± 0.724
1.694LysHis: 1.694 ± 0.455
3.157LysIle: 3.157 ± 0.419
4.621LysLys: 4.621 ± 0.821
5.776LysLeu: 5.776 ± 0.738
1.386LysMet: 1.386 ± 0.402
1.925LysAsn: 1.925 ± 0.43
3.157LysPro: 3.157 ± 0.508
2.695LysGln: 2.695 ± 0.426
3.466LysArg: 3.466 ± 0.432
3.235LysSer: 3.235 ± 0.352
2.31LysThr: 2.31 ± 0.401
4.005LysVal: 4.005 ± 0.671
0.693LysTrp: 0.693 ± 0.201
2.233LysTyr: 2.233 ± 0.388
0.0LysXaa: 0.0 ± 0.0
Leu
6.623LeuAla: 6.623 ± 0.874
0.847LeuCys: 0.847 ± 0.246
5.391LeuAsp: 5.391 ± 0.667
4.621LeuGlu: 4.621 ± 0.436
2.926LeuPhe: 2.926 ± 0.53
5.237LeuGly: 5.237 ± 0.652
1.001LeuHis: 1.001 ± 0.326
4.082LeuIle: 4.082 ± 0.512
5.237LeuLys: 5.237 ± 0.674
5.237LeuLeu: 5.237 ± 0.493
2.31LeuMet: 2.31 ± 0.448
3.774LeuAsn: 3.774 ± 0.524
3.466LeuPro: 3.466 ± 0.67
3.851LeuGln: 3.851 ± 0.596
4.544LeuArg: 4.544 ± 0.701
5.006LeuSer: 5.006 ± 0.665
3.928LeuThr: 3.928 ± 0.489
5.622LeuVal: 5.622 ± 0.598
0.616LeuTrp: 0.616 ± 0.226
2.849LeuTyr: 2.849 ± 0.367
0.0LeuXaa: 0.0 ± 0.0
Met
3.466MetAla: 3.466 ± 0.585
0.154MetCys: 0.154 ± 0.099
1.386MetAsp: 1.386 ± 0.348
1.848MetGlu: 1.848 ± 0.324
1.078MetPhe: 1.078 ± 0.283
2.464MetGly: 2.464 ± 0.543
0.539MetHis: 0.539 ± 0.222
1.386MetIle: 1.386 ± 0.298
1.848MetLys: 1.848 ± 0.404
2.926MetLeu: 2.926 ± 0.479
0.77MetMet: 0.77 ± 0.22
1.155MetAsn: 1.155 ± 0.243
1.232MetPro: 1.232 ± 0.252
1.848MetGln: 1.848 ± 0.414
1.771MetArg: 1.771 ± 0.416
2.387MetSer: 2.387 ± 0.352
1.54MetThr: 1.54 ± 0.322
2.079MetVal: 2.079 ± 0.379
0.462MetTrp: 0.462 ± 0.171
0.847MetTyr: 0.847 ± 0.324
0.0MetXaa: 0.0 ± 0.0
Asn
3.157AsnAla: 3.157 ± 0.425
0.308AsnCys: 0.308 ± 0.174
1.771AsnAsp: 1.771 ± 0.326
2.618AsnGlu: 2.618 ± 0.506
1.54AsnPhe: 1.54 ± 0.281
3.003AsnGly: 3.003 ± 0.441
0.847AsnHis: 0.847 ± 0.218
2.541AsnIle: 2.541 ± 0.512
3.312AsnLys: 3.312 ± 0.487
3.62AsnLeu: 3.62 ± 0.561
1.001AsnMet: 1.001 ± 0.231
2.31AsnAsn: 2.31 ± 0.501
1.54AsnPro: 1.54 ± 0.258
2.002AsnGln: 2.002 ± 0.419
3.003AsnArg: 3.003 ± 0.433
1.925AsnSer: 1.925 ± 0.431
3.235AsnThr: 3.235 ± 0.633
3.003AsnVal: 3.003 ± 0.629
0.616AsnTrp: 0.616 ± 0.223
1.925AsnTyr: 1.925 ± 0.349
0.0AsnXaa: 0.0 ± 0.0
Pro
3.543ProAla: 3.543 ± 0.444
0.385ProCys: 0.385 ± 0.167
2.541ProAsp: 2.541 ± 0.448
3.543ProGlu: 3.543 ± 0.457
1.386ProPhe: 1.386 ± 0.393
1.078ProGly: 1.078 ± 0.286
0.847ProHis: 0.847 ± 0.314
1.54ProIle: 1.54 ± 0.383
2.002ProLys: 2.002 ± 0.333
2.31ProLeu: 2.31 ± 0.333
1.386ProMet: 1.386 ± 0.266
1.309ProAsn: 1.309 ± 0.336
0.847ProPro: 0.847 ± 0.234
1.078ProGln: 1.078 ± 0.236
0.924ProArg: 0.924 ± 0.194
2.156ProSer: 2.156 ± 0.436
2.618ProThr: 2.618 ± 0.533
3.08ProVal: 3.08 ± 0.527
0.77ProTrp: 0.77 ± 0.204
1.54ProTyr: 1.54 ± 0.349
0.0ProXaa: 0.0 ± 0.0
Gln
5.006GlnAla: 5.006 ± 0.673
0.693GlnCys: 0.693 ± 0.261
2.233GlnAsp: 2.233 ± 0.292
2.772GlnGlu: 2.772 ± 0.558
1.463GlnPhe: 1.463 ± 0.376
2.772GlnGly: 2.772 ± 0.424
0.462GlnHis: 0.462 ± 0.166
2.618GlnIle: 2.618 ± 0.396
1.617GlnLys: 1.617 ± 0.341
2.926GlnLeu: 2.926 ± 0.48
2.31GlnMet: 2.31 ± 0.454
2.387GlnAsn: 2.387 ± 0.516
0.77GlnPro: 0.77 ± 0.211
2.464GlnGln: 2.464 ± 0.39
2.387GlnArg: 2.387 ± 0.417
2.387GlnSer: 2.387 ± 0.633
1.925GlnThr: 1.925 ± 0.289
2.849GlnVal: 2.849 ± 0.514
0.539GlnTrp: 0.539 ± 0.199
1.617GlnTyr: 1.617 ± 0.349
0.0GlnXaa: 0.0 ± 0.0
Arg
5.16ArgAla: 5.16 ± 0.832
0.539ArgCys: 0.539 ± 0.275
2.926ArgAsp: 2.926 ± 0.386
4.005ArgGlu: 4.005 ± 0.641
1.617ArgPhe: 1.617 ± 0.284
4.698ArgGly: 4.698 ± 0.665
1.386ArgHis: 1.386 ± 0.298
3.851ArgIle: 3.851 ± 0.578
3.235ArgLys: 3.235 ± 0.497
4.544ArgLeu: 4.544 ± 0.508
2.387ArgMet: 2.387 ± 0.378
2.849ArgAsn: 2.849 ± 0.479
1.232ArgPro: 1.232 ± 0.322
2.31ArgGln: 2.31 ± 0.437
2.926ArgArg: 2.926 ± 0.495
2.079ArgSer: 2.079 ± 0.415
2.464ArgThr: 2.464 ± 0.465
3.543ArgVal: 3.543 ± 0.421
0.616ArgTrp: 0.616 ± 0.22
2.31ArgTyr: 2.31 ± 0.525
0.0ArgXaa: 0.0 ± 0.0
Ser
4.313SerAla: 4.313 ± 0.719
0.693SerCys: 0.693 ± 0.279
3.62SerAsp: 3.62 ± 0.44
4.313SerGlu: 4.313 ± 0.608
2.541SerPhe: 2.541 ± 0.389
5.237SerGly: 5.237 ± 0.607
1.232SerHis: 1.232 ± 0.241
3.157SerIle: 3.157 ± 0.522
3.157SerLys: 3.157 ± 0.529
4.159SerLeu: 4.159 ± 0.466
1.771SerMet: 1.771 ± 0.302
2.233SerAsn: 2.233 ± 0.571
1.54SerPro: 1.54 ± 0.333
2.618SerGln: 2.618 ± 0.407
3.466SerArg: 3.466 ± 0.457
3.389SerSer: 3.389 ± 0.712
2.695SerThr: 2.695 ± 0.452
3.466SerVal: 3.466 ± 0.53
0.693SerTrp: 0.693 ± 0.178
1.848SerTyr: 1.848 ± 0.387
0.0SerXaa: 0.0 ± 0.0
Thr
3.851ThrAla: 3.851 ± 0.575
0.616ThrCys: 0.616 ± 0.2
3.235ThrAsp: 3.235 ± 0.766
3.928ThrGlu: 3.928 ± 0.486
2.31ThrPhe: 2.31 ± 0.523
4.852ThrGly: 4.852 ± 0.574
1.232ThrHis: 1.232 ± 0.368
3.312ThrIle: 3.312 ± 0.457
3.62ThrLys: 3.62 ± 0.611
4.313ThrLeu: 4.313 ± 0.542
0.847ThrMet: 0.847 ± 0.255
1.848ThrAsn: 1.848 ± 0.378
3.08ThrPro: 3.08 ± 0.469
1.694ThrGln: 1.694 ± 0.399
3.312ThrArg: 3.312 ± 0.489
3.08ThrSer: 3.08 ± 0.732
3.697ThrThr: 3.697 ± 0.59
3.543ThrVal: 3.543 ± 0.526
0.924ThrTrp: 0.924 ± 0.315
1.309ThrTyr: 1.309 ± 0.414
0.0ThrXaa: 0.0 ± 0.0
Val
6.777ValAla: 6.777 ± 0.634
0.693ValCys: 0.693 ± 0.247
3.774ValAsp: 3.774 ± 0.603
4.39ValGlu: 4.39 ± 0.53
2.002ValPhe: 2.002 ± 0.396
5.16ValGly: 5.16 ± 0.759
1.309ValHis: 1.309 ± 0.347
3.62ValIle: 3.62 ± 0.463
5.083ValLys: 5.083 ± 0.476
3.928ValLeu: 3.928 ± 0.649
2.233ValMet: 2.233 ± 0.401
3.389ValAsn: 3.389 ± 0.38
3.774ValPro: 3.774 ± 0.49
2.695ValGln: 2.695 ± 0.466
3.697ValArg: 3.697 ± 0.631
4.698ValSer: 4.698 ± 0.574
4.236ValThr: 4.236 ± 0.576
4.621ValVal: 4.621 ± 0.628
0.693ValTrp: 0.693 ± 0.204
1.848ValTyr: 1.848 ± 0.352
0.0ValXaa: 0.0 ± 0.0
Trp
0.924TrpAla: 0.924 ± 0.235
0.154TrpCys: 0.154 ± 0.096
0.462TrpAsp: 0.462 ± 0.188
1.001TrpGlu: 1.001 ± 0.263
0.385TrpPhe: 0.385 ± 0.202
1.078TrpGly: 1.078 ± 0.287
0.231TrpHis: 0.231 ± 0.125
0.385TrpIle: 0.385 ± 0.185
1.232TrpLys: 1.232 ± 0.364
1.001TrpLeu: 1.001 ± 0.236
0.231TrpMet: 0.231 ± 0.113
0.462TrpAsn: 0.462 ± 0.163
1.078TrpPro: 1.078 ± 0.315
0.616TrpGln: 0.616 ± 0.187
0.539TrpArg: 0.539 ± 0.184
1.155TrpSer: 1.155 ± 0.382
0.693TrpThr: 0.693 ± 0.212
1.001TrpVal: 1.001 ± 0.296
0.154TrpTrp: 0.154 ± 0.097
0.308TrpTyr: 0.308 ± 0.164
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.849TyrAla: 2.849 ± 0.488
0.385TyrCys: 0.385 ± 0.186
2.233TyrAsp: 2.233 ± 0.33
2.387TyrGlu: 2.387 ± 0.422
1.309TyrPhe: 1.309 ± 0.333
3.389TyrGly: 3.389 ± 0.671
0.77TyrHis: 0.77 ± 0.277
2.002TyrIle: 2.002 ± 0.316
1.54TyrLys: 1.54 ± 0.279
2.31TyrLeu: 2.31 ± 0.432
0.847TyrMet: 0.847 ± 0.282
1.771TyrAsn: 1.771 ± 0.357
1.309TyrPro: 1.309 ± 0.335
1.155TyrGln: 1.155 ± 0.296
1.463TyrArg: 1.463 ± 0.382
1.771TyrSer: 1.771 ± 0.343
2.079TyrThr: 2.079 ± 0.355
2.233TyrVal: 2.233 ± 0.48
0.385TyrTrp: 0.385 ± 0.22
1.001TyrTyr: 1.001 ± 0.287
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (12986 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski