Amino acid dipepetide frequency for Enterobacteria phage cdtI

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.023AlaAla: 11.023 ± 1.73
1.027AlaCys: 1.027 ± 0.294
5.272AlaAsp: 5.272 ± 0.758
5.683AlaGlu: 5.683 ± 0.842
3.56AlaPhe: 3.56 ± 0.44
7.531AlaGly: 7.531 ± 1.136
1.164AlaHis: 1.164 ± 0.25
4.998AlaIle: 4.998 ± 0.649
3.081AlaLys: 3.081 ± 0.41
7.668AlaLeu: 7.668 ± 0.998
3.081AlaMet: 3.081 ± 0.515
2.739AlaAsn: 2.739 ± 0.487
2.67AlaPro: 2.67 ± 0.393
3.081AlaGln: 3.081 ± 0.689
5.34AlaArg: 5.34 ± 0.802
6.641AlaSer: 6.641 ± 0.687
5.203AlaThr: 5.203 ± 0.628
7.257AlaVal: 7.257 ± 0.791
1.78AlaTrp: 1.78 ± 0.414
2.739AlaTyr: 2.739 ± 0.436
0.0AlaXaa: 0.0 ± 0.0
Cys
1.301CysAla: 1.301 ± 0.331
0.274CysCys: 0.274 ± 0.122
0.685CysAsp: 0.685 ± 0.201
0.479CysGlu: 0.479 ± 0.211
0.753CysPhe: 0.753 ± 0.304
1.438CysGly: 1.438 ± 0.351
0.342CysHis: 0.342 ± 0.168
0.548CysIle: 0.548 ± 0.351
0.342CysLys: 0.342 ± 0.158
1.575CysLeu: 1.575 ± 0.373
0.342CysMet: 0.342 ± 0.165
0.479CysAsn: 0.479 ± 0.161
0.616CysPro: 0.616 ± 0.3
0.548CysGln: 0.548 ± 0.206
1.095CysArg: 1.095 ± 0.432
1.301CysSer: 1.301 ± 0.285
0.753CysThr: 0.753 ± 0.209
0.89CysVal: 0.89 ± 0.248
0.342CysTrp: 0.342 ± 0.179
0.274CysTyr: 0.274 ± 0.114
0.0CysXaa: 0.0 ± 0.0
Asp
5.82AspAla: 5.82 ± 0.639
0.548AspCys: 0.548 ± 0.191
3.56AspAsp: 3.56 ± 0.475
2.876AspGlu: 2.876 ± 0.424
2.328AspPhe: 2.328 ± 0.429
5.683AspGly: 5.683 ± 0.664
0.616AspHis: 0.616 ± 0.168
3.56AspIle: 3.56 ± 0.583
2.876AspLys: 2.876 ± 0.478
4.929AspLeu: 4.929 ± 0.674
1.643AspMet: 1.643 ± 0.345
3.423AspAsn: 3.423 ± 0.505
2.259AspPro: 2.259 ± 0.419
1.78AspGln: 1.78 ± 0.365
2.602AspArg: 2.602 ± 0.519
3.423AspSer: 3.423 ± 0.525
2.67AspThr: 2.67 ± 0.432
3.492AspVal: 3.492 ± 0.508
1.78AspTrp: 1.78 ± 0.394
1.575AspTyr: 1.575 ± 0.303
0.0AspXaa: 0.0 ± 0.0
Glu
4.587GluAla: 4.587 ± 0.618
1.095GluCys: 1.095 ± 0.306
2.807GluAsp: 2.807 ± 0.466
2.602GluGlu: 2.602 ± 0.505
1.712GluPhe: 1.712 ± 0.311
2.944GluGly: 2.944 ± 0.392
1.164GluHis: 1.164 ± 0.272
3.286GluIle: 3.286 ± 0.537
3.834GluLys: 3.834 ± 0.441
5.751GluLeu: 5.751 ± 0.683
1.643GluMet: 1.643 ± 0.362
2.944GluAsn: 2.944 ± 0.455
3.081GluPro: 3.081 ± 0.55
3.492GluGln: 3.492 ± 0.708
4.519GluArg: 4.519 ± 0.593
3.355GluSer: 3.355 ± 0.663
3.56GluThr: 3.56 ± 0.593
3.218GluVal: 3.218 ± 0.48
0.753GluTrp: 0.753 ± 0.221
1.369GluTyr: 1.369 ± 0.291
0.0GluXaa: 0.0 ± 0.0
Phe
2.122PheAla: 2.122 ± 0.38
0.89PheCys: 0.89 ± 0.232
2.191PheAsp: 2.191 ± 0.324
1.985PheGlu: 1.985 ± 0.401
1.095PhePhe: 1.095 ± 0.26
1.849PheGly: 1.849 ± 0.374
0.342PheHis: 0.342 ± 0.155
2.396PheIle: 2.396 ± 0.526
2.122PheLys: 2.122 ± 0.342
2.533PheLeu: 2.533 ± 0.505
1.095PheMet: 1.095 ± 0.239
1.643PheAsn: 1.643 ± 0.403
1.575PhePro: 1.575 ± 0.339
1.027PheGln: 1.027 ± 0.24
2.807PheArg: 2.807 ± 0.554
3.697PheSer: 3.697 ± 0.695
1.78PheThr: 1.78 ± 0.326
3.149PheVal: 3.149 ± 0.443
0.753PheTrp: 0.753 ± 0.183
0.685PheTyr: 0.685 ± 0.208
0.0PheXaa: 0.0 ± 0.0
Gly
6.504GlyAla: 6.504 ± 0.779
1.232GlyCys: 1.232 ± 0.3
5.066GlyAsp: 5.066 ± 0.758
4.793GlyGlu: 4.793 ± 0.693
3.012GlyPhe: 3.012 ± 0.571
4.245GlyGly: 4.245 ± 0.574
1.232GlyHis: 1.232 ± 0.282
4.45GlyIle: 4.45 ± 0.613
4.519GlyLys: 4.519 ± 0.659
3.766GlyLeu: 3.766 ± 0.438
1.985GlyMet: 1.985 ± 0.34
4.039GlyAsn: 4.039 ± 0.554
3.149GlyPro: 3.149 ± 1.147
2.944GlyGln: 2.944 ± 0.484
4.176GlyArg: 4.176 ± 0.545
4.724GlySer: 4.724 ± 0.567
3.355GlyThr: 3.355 ± 0.634
5.956GlyVal: 5.956 ± 0.694
1.643GlyTrp: 1.643 ± 0.354
2.191GlyTyr: 2.191 ± 0.415
0.0GlyXaa: 0.0 ± 0.0
His
1.643HisAla: 1.643 ± 0.375
0.411HisCys: 0.411 ± 0.187
1.506HisAsp: 1.506 ± 0.391
1.301HisGlu: 1.301 ± 0.267
1.095HisPhe: 1.095 ± 0.261
1.301HisGly: 1.301 ± 0.351
0.685HisHis: 0.685 ± 0.223
1.232HisIle: 1.232 ± 0.365
0.753HisLys: 0.753 ± 0.204
1.506HisLeu: 1.506 ± 0.309
0.274HisMet: 0.274 ± 0.137
0.548HisAsn: 0.548 ± 0.189
0.548HisPro: 0.548 ± 0.161
0.685HisGln: 0.685 ± 0.192
0.959HisArg: 0.959 ± 0.299
1.506HisSer: 1.506 ± 0.394
0.548HisThr: 0.548 ± 0.178
0.89HisVal: 0.89 ± 0.248
0.616HisTrp: 0.616 ± 0.215
0.411HisTyr: 0.411 ± 0.143
0.0HisXaa: 0.0 ± 0.0
Ile
4.793IleAla: 4.793 ± 0.628
0.89IleCys: 0.89 ± 0.232
3.697IleAsp: 3.697 ± 0.478
2.739IleGlu: 2.739 ± 0.479
1.438IlePhe: 1.438 ± 0.378
3.766IleGly: 3.766 ± 0.543
0.685IleHis: 0.685 ± 0.215
2.807IleIle: 2.807 ± 0.54
2.465IleLys: 2.465 ± 0.367
3.286IleLeu: 3.286 ± 0.618
1.506IleMet: 1.506 ± 0.297
3.492IleAsn: 3.492 ± 0.694
3.149IlePro: 3.149 ± 0.593
1.712IleGln: 1.712 ± 0.381
4.656IleArg: 4.656 ± 0.576
4.998IleSer: 4.998 ± 0.554
4.519IleThr: 4.519 ± 0.637
2.533IleVal: 2.533 ± 0.411
0.342IleTrp: 0.342 ± 0.142
1.232IleTyr: 1.232 ± 0.389
0.0IleXaa: 0.0 ± 0.0
Lys
4.519LysAla: 4.519 ± 0.603
1.027LysCys: 1.027 ± 0.3
2.876LysAsp: 2.876 ± 0.489
3.218LysGlu: 3.218 ± 0.589
1.506LysPhe: 1.506 ± 0.345
4.245LysGly: 4.245 ± 0.597
0.822LysHis: 0.822 ± 0.203
2.739LysIle: 2.739 ± 0.584
3.971LysLys: 3.971 ± 0.753
4.382LysLeu: 4.382 ± 0.618
1.301LysMet: 1.301 ± 0.271
2.122LysAsn: 2.122 ± 0.373
1.985LysPro: 1.985 ± 0.472
3.012LysGln: 3.012 ± 0.371
3.149LysArg: 3.149 ± 0.579
2.67LysSer: 2.67 ± 0.336
3.355LysThr: 3.355 ± 0.442
3.492LysVal: 3.492 ± 0.453
0.753LysTrp: 0.753 ± 0.245
1.78LysTyr: 1.78 ± 0.319
0.0LysXaa: 0.0 ± 0.0
Leu
6.983LeuAla: 6.983 ± 0.699
1.369LeuCys: 1.369 ± 0.446
4.176LeuAsp: 4.176 ± 0.478
4.108LeuGlu: 4.108 ± 0.512
3.149LeuPhe: 3.149 ± 0.614
4.929LeuGly: 4.929 ± 0.629
2.122LeuHis: 2.122 ± 0.401
4.724LeuIle: 4.724 ± 0.797
4.929LeuLys: 4.929 ± 0.708
6.162LeuLeu: 6.162 ± 0.705
1.712LeuMet: 1.712 ± 0.263
4.108LeuAsn: 4.108 ± 0.576
4.45LeuPro: 4.45 ± 0.528
3.012LeuGln: 3.012 ± 0.513
6.23LeuArg: 6.23 ± 0.655
7.12LeuSer: 7.12 ± 0.906
6.093LeuThr: 6.093 ± 0.675
4.587LeuVal: 4.587 ± 0.467
1.643LeuTrp: 1.643 ± 0.473
1.917LeuTyr: 1.917 ± 0.443
0.0LeuXaa: 0.0 ± 0.0
Met
2.602MetAla: 2.602 ± 0.404
0.068MetCys: 0.068 ± 0.064
1.095MetAsp: 1.095 ± 0.202
1.712MetGlu: 1.712 ± 0.384
0.822MetPhe: 0.822 ± 0.252
1.095MetGly: 1.095 ± 0.291
0.342MetHis: 0.342 ± 0.155
1.027MetIle: 1.027 ± 0.345
1.301MetLys: 1.301 ± 0.372
2.602MetLeu: 2.602 ± 0.329
0.342MetMet: 0.342 ± 0.138
1.164MetAsn: 1.164 ± 0.26
1.575MetPro: 1.575 ± 0.293
1.369MetGln: 1.369 ± 0.291
1.643MetArg: 1.643 ± 0.252
2.122MetSer: 2.122 ± 0.377
1.917MetThr: 1.917 ± 0.409
1.643MetVal: 1.643 ± 0.338
0.205MetTrp: 0.205 ± 0.102
0.616MetTyr: 0.616 ± 0.229
0.0MetXaa: 0.0 ± 0.0
Asn
4.245AsnAla: 4.245 ± 0.571
0.342AsnCys: 0.342 ± 0.135
2.122AsnAsp: 2.122 ± 0.274
2.944AsnGlu: 2.944 ± 0.444
1.232AsnPhe: 1.232 ± 0.257
4.382AsnGly: 4.382 ± 0.591
0.89AsnHis: 0.89 ± 0.291
2.807AsnIle: 2.807 ± 0.445
2.122AsnLys: 2.122 ± 0.446
3.355AsnLeu: 3.355 ± 0.495
1.027AsnMet: 1.027 ± 0.252
2.465AsnAsn: 2.465 ± 0.478
2.533AsnPro: 2.533 ± 0.426
1.506AsnGln: 1.506 ± 0.317
2.944AsnArg: 2.944 ± 0.703
3.423AsnSer: 3.423 ± 0.734
2.191AsnThr: 2.191 ± 0.361
2.191AsnVal: 2.191 ± 0.421
0.616AsnTrp: 0.616 ± 0.201
1.232AsnTyr: 1.232 ± 0.288
0.0AsnXaa: 0.0 ± 0.0
Pro
5.066ProAla: 5.066 ± 0.685
0.479ProCys: 0.479 ± 0.205
3.423ProAsp: 3.423 ± 0.579
3.629ProGlu: 3.629 ± 0.629
1.369ProPhe: 1.369 ± 0.342
4.108ProGly: 4.108 ± 0.54
1.369ProHis: 1.369 ± 0.358
1.369ProIle: 1.369 ± 0.342
2.328ProLys: 2.328 ± 0.496
2.876ProLeu: 2.876 ± 0.577
0.822ProMet: 0.822 ± 0.26
1.506ProAsn: 1.506 ± 0.355
2.328ProPro: 2.328 ± 0.442
1.369ProGln: 1.369 ± 0.354
1.917ProArg: 1.917 ± 0.343
2.739ProSer: 2.739 ± 0.476
2.054ProThr: 2.054 ± 0.372
4.382ProVal: 4.382 ± 0.612
0.616ProTrp: 0.616 ± 0.214
1.301ProTyr: 1.301 ± 0.368
0.0ProXaa: 0.0 ± 0.0
Gln
4.587GlnAla: 4.587 ± 0.884
0.753GlnCys: 0.753 ± 0.247
1.027GlnAsp: 1.027 ± 0.225
2.465GlnGlu: 2.465 ± 0.437
1.027GlnPhe: 1.027 ± 0.254
2.465GlnGly: 2.465 ± 0.452
1.027GlnHis: 1.027 ± 0.197
3.149GlnIle: 3.149 ± 0.584
2.191GlnLys: 2.191 ± 0.501
3.971GlnLeu: 3.971 ± 0.8
1.712GlnMet: 1.712 ± 0.331
0.89GlnAsn: 0.89 ± 0.275
1.232GlnPro: 1.232 ± 0.288
2.602GlnGln: 2.602 ± 1.006
2.67GlnArg: 2.67 ± 0.625
2.876GlnSer: 2.876 ± 0.437
2.122GlnThr: 2.122 ± 0.391
2.944GlnVal: 2.944 ± 0.512
0.753GlnTrp: 0.753 ± 0.243
1.301GlnTyr: 1.301 ± 0.298
0.0GlnXaa: 0.0 ± 0.0
Arg
4.382ArgAla: 4.382 ± 0.523
0.753ArgCys: 0.753 ± 0.262
3.012ArgAsp: 3.012 ± 0.466
3.697ArgGlu: 3.697 ± 0.347
2.054ArgPhe: 2.054 ± 0.34
3.834ArgGly: 3.834 ± 0.526
1.369ArgHis: 1.369 ± 0.263
3.149ArgIle: 3.149 ± 0.443
3.903ArgLys: 3.903 ± 0.509
5.683ArgLeu: 5.683 ± 0.658
2.533ArgMet: 2.533 ± 0.4
3.286ArgAsn: 3.286 ± 0.469
2.876ArgPro: 2.876 ± 0.546
3.355ArgGln: 3.355 ± 0.935
4.861ArgArg: 4.861 ± 0.782
2.807ArgSer: 2.807 ± 0.496
3.492ArgThr: 3.492 ± 0.62
3.697ArgVal: 3.697 ± 0.593
1.095ArgTrp: 1.095 ± 0.346
2.944ArgTyr: 2.944 ± 0.379
0.0ArgXaa: 0.0 ± 0.0
Ser
7.463SerAla: 7.463 ± 1.068
0.89SerCys: 0.89 ± 0.474
5.683SerAsp: 5.683 ± 0.592
4.176SerGlu: 4.176 ± 0.567
2.465SerPhe: 2.465 ± 0.484
6.504SerGly: 6.504 ± 0.609
1.027SerHis: 1.027 ± 0.316
2.191SerIle: 2.191 ± 0.428
2.876SerLys: 2.876 ± 0.578
6.367SerLeu: 6.367 ± 0.771
1.301SerMet: 1.301 ± 0.291
2.259SerAsn: 2.259 ± 0.406
3.355SerPro: 3.355 ± 0.607
2.876SerGln: 2.876 ± 0.351
3.492SerArg: 3.492 ± 0.464
4.656SerSer: 4.656 ± 0.785
3.492SerThr: 3.492 ± 0.547
6.915SerVal: 6.915 ± 0.627
0.685SerTrp: 0.685 ± 0.244
1.712SerTyr: 1.712 ± 0.317
0.0SerXaa: 0.0 ± 0.0
Thr
4.929ThrAla: 4.929 ± 0.748
0.822ThrCys: 0.822 ± 0.368
3.423ThrAsp: 3.423 ± 0.606
3.766ThrGlu: 3.766 ± 0.348
2.328ThrPhe: 2.328 ± 0.373
5.203ThrGly: 5.203 ± 0.713
1.301ThrHis: 1.301 ± 0.288
2.876ThrIle: 2.876 ± 0.425
2.396ThrLys: 2.396 ± 0.519
4.382ThrLeu: 4.382 ± 0.566
0.685ThrMet: 0.685 ± 0.261
1.917ThrAsn: 1.917 ± 0.386
3.149ThrPro: 3.149 ± 0.672
2.533ThrGln: 2.533 ± 0.469
3.286ThrArg: 3.286 ± 0.564
3.903ThrSer: 3.903 ± 0.434
3.218ThrThr: 3.218 ± 0.558
4.587ThrVal: 4.587 ± 0.791
1.095ThrTrp: 1.095 ± 0.261
1.985ThrTyr: 1.985 ± 0.338
0.0ThrXaa: 0.0 ± 0.0
Val
5.135ValAla: 5.135 ± 0.739
0.89ValCys: 0.89 ± 0.256
3.081ValAsp: 3.081 ± 0.528
3.423ValGlu: 3.423 ± 0.585
2.465ValPhe: 2.465 ± 0.492
4.45ValGly: 4.45 ± 0.618
1.232ValHis: 1.232 ± 0.336
4.861ValIle: 4.861 ± 0.733
4.793ValLys: 4.793 ± 0.62
7.12ValLeu: 7.12 ± 0.688
1.301ValMet: 1.301 ± 0.276
3.56ValAsn: 3.56 ± 0.539
2.876ValPro: 2.876 ± 0.556
2.533ValGln: 2.533 ± 0.486
3.56ValArg: 3.56 ± 0.485
4.929ValSer: 4.929 ± 0.502
4.45ValThr: 4.45 ± 0.638
3.697ValVal: 3.697 ± 0.509
1.095ValTrp: 1.095 ± 0.292
1.78ValTyr: 1.78 ± 0.354
0.0ValXaa: 0.0 ± 0.0
Trp
1.575TrpAla: 1.575 ± 0.337
0.205TrpCys: 0.205 ± 0.113
0.616TrpAsp: 0.616 ± 0.236
0.616TrpGlu: 0.616 ± 0.193
0.685TrpPhe: 0.685 ± 0.211
1.095TrpGly: 1.095 ± 0.237
0.342TrpHis: 0.342 ± 0.189
0.822TrpIle: 0.822 ± 0.211
1.027TrpLys: 1.027 ± 0.252
2.465TrpLeu: 2.465 ± 0.488
0.685TrpMet: 0.685 ± 0.238
0.753TrpAsn: 0.753 ± 0.282
0.822TrpPro: 0.822 ± 0.331
0.548TrpGln: 0.548 ± 0.171
1.095TrpArg: 1.095 ± 0.314
1.095TrpSer: 1.095 ± 0.235
1.095TrpThr: 1.095 ± 0.3
0.753TrpVal: 0.753 ± 0.283
0.479TrpTrp: 0.479 ± 0.213
0.753TrpTyr: 0.753 ± 0.204
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.259TyrAla: 2.259 ± 0.443
0.411TyrCys: 0.411 ± 0.164
2.122TyrAsp: 2.122 ± 0.319
1.506TyrGlu: 1.506 ± 0.31
1.643TyrPhe: 1.643 ± 0.398
1.917TyrGly: 1.917 ± 0.298
0.342TyrHis: 0.342 ± 0.146
1.712TyrIle: 1.712 ± 0.343
1.095TyrLys: 1.095 ± 0.227
3.149TyrLeu: 3.149 ± 0.42
0.205TyrMet: 0.205 ± 0.133
1.232TyrAsn: 1.232 ± 0.254
0.822TyrPro: 0.822 ± 0.218
1.78TyrGln: 1.78 ± 0.373
1.78TyrArg: 1.78 ± 0.298
2.396TyrSer: 2.396 ± 0.462
1.917TyrThr: 1.917 ± 0.301
0.959TyrVal: 0.959 ± 0.311
0.479TyrTrp: 0.479 ± 0.171
0.548TyrTyr: 0.548 ± 0.194
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 60 proteins (14607 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski