Amino acid dipeptide frequency for Enterobacter phage phiKDA1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each of the 400 dipeptides would have a frequency of 0.25%. As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
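The counting described above can be sketched in a few lines of Python. This is only an illustration, not the database's own code: the function name dipeptide_permille, the toy sequences, the use of overlapping pairs within each protein, and the normalization by the total number of dipeptides are all assumptions, since the page does not state these details. Sequences use one-letter codes, and any non-standard letter is mapped to X (Xaa).

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes

def dipeptide_permille(proteins):
    """Return dipeptide frequencies in per mille, pooled over all proteins.

    Assumption: overlapping pairs are counted inside each protein and
    pairs never span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown letter to 'X' (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

# Uniform expectation for the 400 standard dipeptides: 0.25 % = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400

# Hypothetical toy input, not taken from the phiKDA1 proteome.
toy = ["MKVLAAGA", "MAAK"]
freqs = dipeptide_permille(toy)
for dp, value in sorted(freqs.items()):
    label = "over" if value > UNIFORM_PERMILLE else "under"
    print(f"{dp}: {value:.1f} per mille ({label} the uniform expectation)")
```

With a real proteome, the list of sequences would come from a FASTA file; comparing each value with 2.5 per mille mirrors the over/under-representation idea described above.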

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
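As a worked example of the unit conversion, take the AlaAla value from the table (15.87) and the uniform expectation of 1/400 mentioned above. Treating the uniform expectation as the reference for over- and under-representation is an assumption; the page does not state the exact threshold used for the colours.

```latex
\[
  15.87\ \text{per mille} = 15.87 \times 10^{-3} = 0.01587 = 1.587\,\%
\]
\[
  \frac{1}{400} = 0.0025 = 0.25\,\% = 2.5\ \text{per mille}
\]
```

Under that assumption, AlaAla occurs roughly six times more often than a uniform distribution would predict.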

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 15.87 ± 1.256
AlaCys: 0.951 ± 0.327
AlaAsp: 5.558 ± 0.677
AlaGlu: 5.924 ± 0.893
AlaPhe: 2.706 ± 0.462
AlaGly: 9.141 ± 1.046
AlaHis: 1.609 ± 0.514
AlaIle: 3.583 ± 0.519
AlaLys: 6.289 ± 0.851
AlaLeu: 9.507 ± 0.645
AlaMet: 3.364 ± 0.46
AlaAsn: 2.413 ± 0.489
AlaPro: 5.485 ± 1.062
AlaGln: 5.631 ± 0.753
AlaArg: 4.388 ± 0.593
AlaSer: 6.362 ± 0.808
AlaThr: 5.997 ± 0.796
AlaVal: 7.313 ± 0.801
AlaTrp: 1.609 ± 0.362
AlaTyr: 4.315 ± 0.57
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.951 ± 0.277
CysCys: 0.073 ± 0.07
CysAsp: 0.658 ± 0.203
CysGlu: 0.585 ± 0.21
CysPhe: 0.804 ± 0.296
CysGly: 1.024 ± 0.248
CysHis: 0.293 ± 0.143
CysIle: 1.316 ± 0.384
CysLys: 0.731 ± 0.253
CysLeu: 1.17 ± 0.309
CysMet: 0.731 ± 0.243
CysAsn: 0.512 ± 0.223
CysPro: 0.658 ± 0.255
CysGln: 0.585 ± 0.189
CysArg: 1.024 ± 0.297
CysSer: 0.731 ± 0.241
CysThr: 0.804 ± 0.258
CysVal: 0.731 ± 0.221
CysTrp: 0.073 ± 0.062
CysTyr: 0.512 ± 0.227
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.509 ± 0.771
AspCys: 0.804 ± 0.282
AspAsp: 2.925 ± 0.492
AspGlu: 2.998 ± 0.436
AspPhe: 1.901 ± 0.382
AspGly: 5.046 ± 0.63
AspHis: 0.878 ± 0.252
AspIle: 2.852 ± 0.399
AspLys: 3.145 ± 0.681
AspLeu: 4.315 ± 0.651
AspMet: 1.609 ± 0.347
AspAsn: 2.194 ± 0.344
AspPro: 2.413 ± 0.276
AspGln: 2.048 ± 0.521
AspArg: 2.194 ± 0.51
AspSer: 4.315 ± 0.484
AspThr: 4.754 ± 0.474
AspVal: 3.291 ± 0.423
AspTrp: 1.243 ± 0.239
AspTyr: 2.779 ± 0.576
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.143 ± 0.734
GluCys: 0.585 ± 0.221
GluAsp: 2.998 ± 0.398
GluGlu: 3.51 ± 0.514
GluPhe: 3.072 ± 0.486
GluGly: 3.876 ± 0.398
GluHis: 1.682 ± 0.359
GluIle: 1.901 ± 0.345
GluLys: 2.413 ± 0.386
GluLeu: 6.801 ± 0.73
GluMet: 1.901 ± 0.34
GluAsn: 1.316 ± 0.32
GluPro: 1.243 ± 0.295
GluGln: 4.095 ± 0.699
GluArg: 3.218 ± 0.507
GluSer: 2.486 ± 0.323
GluThr: 2.852 ± 0.459
GluVal: 4.095 ± 0.494
GluTrp: 0.658 ± 0.276
GluTyr: 2.925 ± 0.449
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.56 ± 0.43
PheCys: 0.293 ± 0.137
PheAsp: 1.901 ± 0.379
PheGlu: 1.828 ± 0.423
PhePhe: 0.878 ± 0.251
PheGly: 2.194 ± 0.487
PheHis: 0.512 ± 0.214
PheIle: 1.609 ± 0.488
PheLys: 1.463 ± 0.444
PheLeu: 2.267 ± 0.382
PheMet: 0.731 ± 0.276
PheAsn: 1.463 ± 0.249
PhePro: 1.463 ± 0.299
PheGln: 1.463 ± 0.364
PheArg: 1.463 ± 0.303
PheSer: 1.755 ± 0.342
PheThr: 1.316 ± 0.282
PheVal: 1.024 ± 0.26
PheTrp: 0.439 ± 0.197
PheTyr: 1.536 ± 0.352
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.533 ± 0.807
GlyCys: 1.609 ± 0.49
GlyAsp: 4.095 ± 0.572
GlyGlu: 3.291 ± 0.491
GlyPhe: 2.633 ± 0.51
GlyGly: 5.924 ± 0.825
GlyHis: 0.804 ± 0.303
GlyIle: 3.949 ± 0.49
GlyLys: 4.022 ± 0.627
GlyLeu: 6.509 ± 0.694
GlyMet: 2.56 ± 0.543
GlyAsn: 3.218 ± 0.468
GlyPro: 1.389 ± 0.401
GlyGln: 3.145 ± 0.422
GlyArg: 3.803 ± 0.431
GlySer: 4.973 ± 0.576
GlyThr: 4.534 ± 0.673
GlyVal: 6.07 ± 0.642
GlyTrp: 0.731 ± 0.22
GlyTyr: 3.437 ± 0.383
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.804 ± 0.208
HisCys: 0.512 ± 0.175
HisAsp: 1.316 ± 0.3
HisGlu: 1.097 ± 0.289
HisPhe: 0.293 ± 0.144
HisGly: 1.609 ± 0.382
HisHis: 0.219 ± 0.129
HisIle: 0.878 ± 0.216
HisLys: 0.731 ± 0.197
HisLeu: 1.901 ± 0.345
HisMet: 0.585 ± 0.187
HisAsn: 0.731 ± 0.242
HisPro: 0.804 ± 0.346
HisGln: 0.366 ± 0.166
HisArg: 1.024 ± 0.264
HisSer: 1.024 ± 0.229
HisThr: 0.512 ± 0.189
HisVal: 0.731 ± 0.187
HisTrp: 0.219 ± 0.116
HisTyr: 0.878 ± 0.256
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.315 ± 0.492
IleCys: 0.658 ± 0.263
IleAsp: 3.072 ± 0.392
IleGlu: 2.779 ± 0.556
IlePhe: 0.512 ± 0.2
IleGly: 2.706 ± 0.44
IleHis: 0.439 ± 0.187
IleIle: 1.755 ± 0.297
IleLys: 2.56 ± 0.488
IleLeu: 2.706 ± 0.44
IleMet: 1.389 ± 0.297
IleAsn: 2.852 ± 0.495
IlePro: 2.34 ± 0.459
IleGln: 3.145 ± 0.439
IleArg: 2.34 ± 0.424
IleSer: 2.779 ± 0.531
IleThr: 2.413 ± 0.463
IleVal: 2.486 ± 0.396
IleTrp: 0.219 ± 0.129
IleTyr: 1.316 ± 0.285
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.07 ± 0.745
LysCys: 0.366 ± 0.158
LysAsp: 2.925 ± 0.52
LysGlu: 2.998 ± 0.51
LysPhe: 1.024 ± 0.307
LysGly: 3.291 ± 0.52
LysHis: 0.731 ± 0.212
LysIle: 1.463 ± 0.264
LysLys: 2.413 ± 0.478
LysLeu: 4.607 ± 0.45
LysMet: 1.536 ± 0.336
LysAsn: 1.609 ± 0.331
LysPro: 2.56 ± 0.525
LysGln: 3.657 ± 0.664
LysArg: 4.022 ± 0.658
LysSer: 3.437 ± 0.461
LysThr: 2.486 ± 0.386
LysVal: 3.437 ± 0.541
LysTrp: 0.878 ± 0.255
LysTyr: 2.267 ± 0.482
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.385 ± 0.791
LeuCys: 1.097 ± 0.242
LeuAsp: 6.143 ± 0.722
LeuGlu: 5.265 ± 0.675
LeuPhe: 2.56 ± 0.509
LeuGly: 6.216 ± 0.632
LeuHis: 1.828 ± 0.38
LeuIle: 4.095 ± 0.633
LeuLys: 5.046 ± 0.686
LeuLeu: 7.021 ± 0.653
LeuMet: 2.56 ± 0.33
LeuAsn: 3.583 ± 0.353
LeuPro: 4.022 ± 0.649
LeuGln: 4.607 ± 0.686
LeuArg: 4.827 ± 0.607
LeuSer: 5.265 ± 0.543
LeuThr: 4.607 ± 0.661
LeuVal: 5.924 ± 0.607
LeuTrp: 1.316 ± 0.361
LeuTyr: 3.803 ± 0.558
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.194 ± 0.387
MetCys: 0.366 ± 0.15
MetAsp: 1.975 ± 0.354
MetGlu: 1.097 ± 0.322
MetPhe: 0.804 ± 0.226
MetGly: 1.755 ± 0.363
MetHis: 1.316 ± 0.279
MetIle: 0.878 ± 0.307
MetLys: 1.17 ± 0.224
MetLeu: 4.022 ± 0.565
MetMet: 0.585 ± 0.221
MetAsn: 1.389 ± 0.229
MetPro: 1.609 ± 0.29
MetGln: 2.486 ± 0.439
MetArg: 1.828 ± 0.488
MetSer: 1.901 ± 0.346
MetThr: 1.609 ± 0.346
MetVal: 1.463 ± 0.393
MetTrp: 0.219 ± 0.106
MetTyr: 1.17 ± 0.313
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.925 ± 0.401
AsnCys: 0.658 ± 0.266
AsnAsp: 1.828 ± 0.382
AsnGlu: 1.316 ± 0.321
AsnPhe: 1.097 ± 0.28
AsnGly: 2.998 ± 0.462
AsnHis: 0.219 ± 0.138
AsnIle: 2.34 ± 0.421
AsnLys: 1.755 ± 0.319
AsnLeu: 3.876 ± 0.51
AsnMet: 1.316 ± 0.339
AsnAsn: 1.828 ± 0.442
AsnPro: 2.56 ± 0.373
AsnGln: 1.024 ± 0.254
AsnArg: 1.975 ± 0.388
AsnSer: 3.145 ± 0.523
AsnThr: 2.925 ± 0.469
AsnVal: 3.291 ± 0.436
AsnTrp: 0.878 ± 0.246
AsnTyr: 1.17 ± 0.379
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.973 ± 0.664
ProCys: 0.439 ± 0.159
ProAsp: 2.852 ± 0.545
ProGlu: 4.168 ± 0.566
ProPhe: 0.951 ± 0.254
ProGly: 1.975 ± 0.438
ProHis: 0.219 ± 0.105
ProIle: 1.389 ± 0.367
ProLys: 2.706 ± 0.375
ProLeu: 2.779 ± 0.452
ProMet: 1.17 ± 0.285
ProAsn: 1.389 ± 0.266
ProPro: 1.389 ± 0.28
ProGln: 1.316 ± 0.287
ProArg: 1.609 ± 0.315
ProSer: 2.194 ± 0.333
ProThr: 2.706 ± 0.373
ProVal: 4.242 ± 0.581
ProTrp: 0.512 ± 0.189
ProTyr: 1.389 ± 0.357
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 6.289 ± 0.907
GlnCys: 0.219 ± 0.121
GlnAsp: 1.828 ± 0.411
GlnGlu: 3.73 ± 0.59
GlnPhe: 1.389 ± 0.24
GlnGly: 3.364 ± 0.529
GlnHis: 1.389 ± 0.342
GlnIle: 1.463 ± 0.322
GlnLys: 2.34 ± 0.438
GlnLeu: 5.704 ± 0.671
GlnMet: 1.975 ± 0.477
GlnAsn: 2.34 ± 0.346
GlnPro: 1.828 ± 0.319
GlnGln: 3.803 ± 0.548
GlnArg: 2.706 ± 0.333
GlnSer: 2.779 ± 0.5
GlnThr: 2.779 ± 0.522
GlnVal: 2.706 ± 0.437
GlnTrp: 0.658 ± 0.23
GlnTyr: 2.194 ± 0.358
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.777 ± 1.02
ArgCys: 0.951 ± 0.32
ArgAsp: 2.779 ± 0.388
ArgGlu: 3.583 ± 0.569
ArgPhe: 1.901 ± 0.376
ArgGly: 3.876 ± 0.792
ArgHis: 0.878 ± 0.24
ArgIle: 2.852 ± 0.541
ArgLys: 3.072 ± 0.481
ArgLeu: 4.973 ± 0.584
ArgMet: 1.463 ± 0.312
ArgAsn: 1.755 ± 0.334
ArgPro: 1.316 ± 0.356
ArgGln: 2.34 ± 0.415
ArgArg: 3.583 ± 0.474
ArgSer: 3.145 ± 0.504
ArgThr: 3.73 ± 0.47
ArgVal: 3.291 ± 0.51
ArgTrp: 0.878 ± 0.226
ArgTyr: 2.121 ± 0.306
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.582 ± 0.68
SerCys: 0.951 ± 0.242
SerAsp: 3.218 ± 0.442
SerGlu: 2.925 ± 0.376
SerPhe: 1.755 ± 0.31
SerGly: 5.924 ± 0.717
SerHis: 0.658 ± 0.183
SerIle: 2.925 ± 0.573
SerLys: 3.437 ± 0.537
SerLeu: 5.485 ± 0.779
SerMet: 2.267 ± 0.313
SerAsn: 2.413 ± 0.384
SerPro: 2.267 ± 0.362
SerGln: 2.34 ± 0.448
SerArg: 3.218 ± 0.466
SerSer: 3.949 ± 0.637
SerThr: 4.461 ± 0.542
SerVal: 3.949 ± 0.591
SerTrp: 1.17 ± 0.257
SerTyr: 2.852 ± 0.48
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.606 ± 1.029
ThrCys: 0.878 ± 0.213
ThrAsp: 3.876 ± 0.458
ThrGlu: 3.145 ± 0.466
ThrPhe: 1.389 ± 0.389
ThrGly: 4.9 ± 0.925
ThrHis: 0.585 ± 0.178
ThrIle: 2.413 ± 0.395
ThrLys: 3.364 ± 0.666
ThrLeu: 4.315 ± 0.738
ThrMet: 1.316 ± 0.294
ThrAsn: 2.413 ± 0.431
ThrPro: 2.56 ± 0.408
ThrGln: 2.56 ± 0.404
ThrArg: 2.706 ± 0.42
ThrSer: 4.095 ± 0.791
ThrThr: 3.437 ± 0.977
ThrVal: 4.9 ± 0.539
ThrTrp: 0.951 ± 0.248
ThrTyr: 1.682 ± 0.312
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.704 ± 0.786
ValCys: 1.097 ± 0.261
ValAsp: 4.461 ± 0.651
ValGlu: 3.73 ± 0.481
ValPhe: 1.17 ± 0.378
ValGly: 5.485 ± 0.736
ValHis: 1.389 ± 0.341
ValIle: 2.413 ± 0.393
ValLys: 2.413 ± 0.384
ValLeu: 6.143 ± 0.666
ValMet: 1.828 ± 0.311
ValAsn: 2.998 ± 0.495
ValPro: 2.706 ± 0.31
ValGln: 3.949 ± 0.651
ValArg: 4.315 ± 0.533
ValSer: 4.242 ± 0.66
ValThr: 4.534 ± 0.962
ValVal: 4.168 ± 0.602
ValTrp: 1.024 ± 0.33
ValTyr: 2.56 ± 0.496
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.878 ± 0.217
TrpCys: 0.366 ± 0.171
TrpAsp: 1.097 ± 0.23
TrpGlu: 1.536 ± 0.287
TrpPhe: 0.585 ± 0.231
TrpGly: 0.585 ± 0.2
TrpHis: 0.146 ± 0.111
TrpIle: 0.366 ± 0.173
TrpLys: 1.17 ± 0.241
TrpLeu: 1.609 ± 0.335
TrpMet: 0.0 ± 0.0
TrpAsn: 0.878 ± 0.288
TrpPro: 0.219 ± 0.148
TrpGln: 0.512 ± 0.17
TrpArg: 1.024 ± 0.33
TrpSer: 0.804 ± 0.232
TrpThr: 0.366 ± 0.164
TrpVal: 1.682 ± 0.383
TrpTrp: 0.146 ± 0.101
TrpTyr: 0.512 ± 0.138
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.949 ± 0.436
TyrCys: 1.097 ± 0.265
TyrAsp: 2.706 ± 0.414
TyrGlu: 2.267 ± 0.44
TyrPhe: 0.731 ± 0.248
TyrGly: 2.34 ± 0.612
TyrHis: 0.439 ± 0.154
TyrIle: 2.34 ± 0.467
TyrLys: 1.536 ± 0.241
TyrLeu: 4.534 ± 0.554
TyrMet: 0.878 ± 0.259
TyrAsn: 1.828 ± 0.33
TyrPro: 1.536 ± 0.376
TyrGln: 2.413 ± 0.444
TyrArg: 2.998 ± 0.489
TyrSer: 3.364 ± 0.477
TyrThr: 2.267 ± 0.35
TyrVal: 1.536 ± 0.344
TyrTrp: 0.658 ± 0.25
TyrTyr: 1.536 ± 0.3
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 62 proteins (13675 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
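The page gives no further detail on the bootstrap, so the following is only one plausible reading of it: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the standard deviation across 100 replicates is reported as the error. The function names, the default seed, and the toy sequences are hypothetical, and sequences are given in one-letter codes.

```python
import random
from collections import Counter
from statistics import stdev

def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))

def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Standard deviation (per mille) of one dipeptide's frequency
    over n_boot protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)

# Hypothetical toy proteome, not the phiKDA1 data.
toy = ["MKVLAAGA", "MAAK", "GAGAGA", "MLLKAA"]
print(f"AA error: {bootstrap_error(toy, 'AA'):.3f} per mille")
```

Resampling proteins rather than individual dipeptides keeps residues from the same protein together, so the spread reflects protein-to-protein variation.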

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski