Amino acid dipepetide frequency for Pseudomonas phage LUZ24

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.899AlaAla: 7.899 ± 1.589
0.878AlaCys: 0.878 ± 0.283
4.023AlaAsp: 4.023 ± 0.492
6.217AlaGlu: 6.217 ± 0.953
2.852AlaPhe: 2.852 ± 0.423
7.021AlaGly: 7.021 ± 1.321
1.39AlaHis: 1.39 ± 0.326
4.096AlaIle: 4.096 ± 0.551
4.461AlaLys: 4.461 ± 0.614
7.241AlaLeu: 7.241 ± 0.843
2.706AlaMet: 2.706 ± 0.448
3.657AlaAsn: 3.657 ± 0.556
3.145AlaPro: 3.145 ± 0.59
4.681AlaGln: 4.681 ± 1.07
3.218AlaArg: 3.218 ± 0.562
5.12AlaSer: 5.12 ± 0.724
3.584AlaThr: 3.584 ± 0.596
6.217AlaVal: 6.217 ± 0.849
1.024AlaTrp: 1.024 ± 0.268
2.852AlaTyr: 2.852 ± 0.569
0.0AlaXaa: 0.0 ± 0.0
Cys
0.951CysAla: 0.951 ± 0.321
0.146CysCys: 0.146 ± 0.096
0.878CysAsp: 0.878 ± 0.287
1.243CysGlu: 1.243 ± 0.284
0.146CysPhe: 0.146 ± 0.106
1.024CysGly: 1.024 ± 0.363
0.658CysHis: 0.658 ± 0.209
0.585CysIle: 0.585 ± 0.224
0.293CysLys: 0.293 ± 0.131
0.585CysLeu: 0.585 ± 0.189
0.146CysMet: 0.146 ± 0.109
0.585CysAsn: 0.585 ± 0.196
0.512CysPro: 0.512 ± 0.237
0.366CysGln: 0.366 ± 0.167
0.366CysArg: 0.366 ± 0.17
0.805CysSer: 0.805 ± 0.273
0.585CysThr: 0.585 ± 0.186
0.658CysVal: 0.658 ± 0.302
0.366CysTrp: 0.366 ± 0.196
0.439CysTyr: 0.439 ± 0.165
0.0CysXaa: 0.0 ± 0.0
Asp
5.046AspAla: 5.046 ± 0.612
0.366AspCys: 0.366 ± 0.18
3.145AspAsp: 3.145 ± 0.423
4.169AspGlu: 4.169 ± 0.675
1.975AspPhe: 1.975 ± 0.415
5.12AspGly: 5.12 ± 0.733
1.755AspHis: 1.755 ± 0.481
3.876AspIle: 3.876 ± 0.486
2.999AspLys: 2.999 ± 0.496
6.217AspLeu: 6.217 ± 0.685
1.755AspMet: 1.755 ± 0.335
2.414AspAsn: 2.414 ± 0.41
3.73AspPro: 3.73 ± 0.58
1.536AspGln: 1.536 ± 0.402
3.73AspArg: 3.73 ± 0.495
2.633AspSer: 2.633 ± 0.598
3.511AspThr: 3.511 ± 0.626
3.072AspVal: 3.072 ± 0.364
1.755AspTrp: 1.755 ± 0.372
1.755AspTyr: 1.755 ± 0.337
0.0AspXaa: 0.0 ± 0.0
Glu
6.07GluAla: 6.07 ± 1.073
0.878GluCys: 0.878 ± 0.269
4.534GluAsp: 4.534 ± 0.662
6.802GluGlu: 6.802 ± 1.149
2.706GluPhe: 2.706 ± 0.387
5.12GluGly: 5.12 ± 0.721
1.097GluHis: 1.097 ± 0.312
3.949GluIle: 3.949 ± 0.511
3.73GluLys: 3.73 ± 0.6
6.655GluLeu: 6.655 ± 0.795
2.779GluMet: 2.779 ± 0.506
2.267GluAsn: 2.267 ± 0.315
2.121GluPro: 2.121 ± 0.412
2.34GluGln: 2.34 ± 0.434
4.973GluArg: 4.973 ± 0.706
3.145GluSer: 3.145 ± 0.576
2.999GluThr: 2.999 ± 0.443
6.509GluVal: 6.509 ± 0.77
0.878GluTrp: 0.878 ± 0.214
2.121GluTyr: 2.121 ± 0.43
0.0GluXaa: 0.0 ± 0.0
Phe
1.975PheAla: 1.975 ± 0.394
0.366PheCys: 0.366 ± 0.147
2.34PheAsp: 2.34 ± 0.395
2.194PheGlu: 2.194 ± 0.448
1.536PhePhe: 1.536 ± 0.438
4.461PheGly: 4.461 ± 0.564
0.658PheHis: 0.658 ± 0.206
2.487PheIle: 2.487 ± 0.503
1.902PheLys: 1.902 ± 0.43
3.803PheLeu: 3.803 ± 0.521
0.951PheMet: 0.951 ± 0.197
1.609PheAsn: 1.609 ± 0.39
1.755PhePro: 1.755 ± 0.391
1.536PheGln: 1.536 ± 0.287
1.682PheArg: 1.682 ± 0.381
2.633PheSer: 2.633 ± 0.485
2.194PheThr: 2.194 ± 0.316
2.267PheVal: 2.267 ± 0.438
0.878PheTrp: 0.878 ± 0.301
0.951PheTyr: 0.951 ± 0.298
0.0PheXaa: 0.0 ± 0.0
Gly
7.241GlyAla: 7.241 ± 1.307
1.024GlyCys: 1.024 ± 0.269
5.193GlyAsp: 5.193 ± 0.553
4.534GlyGlu: 4.534 ± 0.709
3.218GlyPhe: 3.218 ± 0.483
7.241GlyGly: 7.241 ± 1.266
1.755GlyHis: 1.755 ± 0.391
4.461GlyIle: 4.461 ± 0.526
5.12GlyLys: 5.12 ± 0.66
5.778GlyLeu: 5.778 ± 0.69
2.194GlyMet: 2.194 ± 0.375
3.803GlyAsn: 3.803 ± 0.638
2.852GlyPro: 2.852 ± 0.477
2.999GlyGln: 2.999 ± 0.46
4.388GlyArg: 4.388 ± 0.529
5.851GlySer: 5.851 ± 0.833
5.339GlyThr: 5.339 ± 0.671
6.363GlyVal: 6.363 ± 0.614
1.316GlyTrp: 1.316 ± 0.33
2.414GlyTyr: 2.414 ± 0.357
0.0GlyXaa: 0.0 ± 0.0
His
1.536HisAla: 1.536 ± 0.361
0.439HisCys: 0.439 ± 0.167
0.658HisAsp: 0.658 ± 0.268
1.463HisGlu: 1.463 ± 0.49
1.097HisPhe: 1.097 ± 0.274
1.682HisGly: 1.682 ± 0.477
0.512HisHis: 0.512 ± 0.214
1.097HisIle: 1.097 ± 0.261
1.316HisLys: 1.316 ± 0.313
2.999HisLeu: 2.999 ± 0.499
0.293HisMet: 0.293 ± 0.155
0.731HisAsn: 0.731 ± 0.225
0.585HisPro: 0.585 ± 0.183
0.658HisGln: 0.658 ± 0.228
1.682HisArg: 1.682 ± 0.317
0.878HisSer: 0.878 ± 0.269
0.731HisThr: 0.731 ± 0.193
1.097HisVal: 1.097 ± 0.254
0.512HisTrp: 0.512 ± 0.272
0.878HisTyr: 0.878 ± 0.255
0.0HisXaa: 0.0 ± 0.0
Ile
4.169IleAla: 4.169 ± 0.495
0.805IleCys: 0.805 ± 0.24
2.925IleAsp: 2.925 ± 0.46
3.072IleGlu: 3.072 ± 0.423
2.194IlePhe: 2.194 ± 0.485
4.534IleGly: 4.534 ± 0.499
0.805IleHis: 0.805 ± 0.251
2.56IleIle: 2.56 ± 0.447
3.218IleLys: 3.218 ± 0.632
4.242IleLeu: 4.242 ± 0.575
1.024IleMet: 1.024 ± 0.338
2.048IleAsn: 2.048 ± 0.307
3.218IlePro: 3.218 ± 0.532
2.487IleGln: 2.487 ± 0.486
3.291IleArg: 3.291 ± 0.508
2.633IleSer: 2.633 ± 0.385
2.852IleThr: 2.852 ± 0.485
2.633IleVal: 2.633 ± 0.493
0.731IleTrp: 0.731 ± 0.202
1.755IleTyr: 1.755 ± 0.32
0.0IleXaa: 0.0 ± 0.0
Lys
4.315LysAla: 4.315 ± 0.762
0.512LysCys: 0.512 ± 0.196
3.73LysAsp: 3.73 ± 0.511
4.169LysGlu: 4.169 ± 0.521
2.048LysPhe: 2.048 ± 0.421
3.949LysGly: 3.949 ± 0.667
1.316LysHis: 1.316 ± 0.337
2.706LysIle: 2.706 ± 0.462
3.218LysLys: 3.218 ± 0.543
4.242LysLeu: 4.242 ± 0.597
1.17LysMet: 1.17 ± 0.292
2.194LysAsn: 2.194 ± 0.437
2.34LysPro: 2.34 ± 0.495
1.609LysGln: 1.609 ± 0.309
3.437LysArg: 3.437 ± 0.594
3.511LysSer: 3.511 ± 0.584
3.364LysThr: 3.364 ± 0.512
4.827LysVal: 4.827 ± 0.656
1.463LysTrp: 1.463 ± 0.285
2.706LysTyr: 2.706 ± 0.485
0.0LysXaa: 0.0 ± 0.0
Leu
6.729LeuAla: 6.729 ± 0.758
0.805LeuCys: 0.805 ± 0.209
5.924LeuAsp: 5.924 ± 0.585
6.363LeuGlu: 6.363 ± 0.867
2.852LeuPhe: 2.852 ± 0.454
7.606LeuGly: 7.606 ± 1.101
1.463LeuHis: 1.463 ± 0.347
3.072LeuIle: 3.072 ± 0.431
6.07LeuLys: 6.07 ± 0.726
7.021LeuLeu: 7.021 ± 0.723
2.194LeuMet: 2.194 ± 0.526
3.511LeuAsn: 3.511 ± 0.494
3.145LeuPro: 3.145 ± 0.543
3.437LeuGln: 3.437 ± 0.614
6.07LeuArg: 6.07 ± 0.596
4.681LeuSer: 4.681 ± 0.786
3.657LeuThr: 3.657 ± 0.471
4.9LeuVal: 4.9 ± 0.58
0.878LeuTrp: 0.878 ± 0.263
1.755LeuTyr: 1.755 ± 0.375
0.0LeuXaa: 0.0 ± 0.0
Met
3.437MetAla: 3.437 ± 0.567
0.219MetCys: 0.219 ± 0.127
1.902MetAsp: 1.902 ± 0.378
1.024MetGlu: 1.024 ± 0.214
1.024MetPhe: 1.024 ± 0.243
1.975MetGly: 1.975 ± 0.44
0.293MetHis: 0.293 ± 0.125
1.39MetIle: 1.39 ± 0.352
2.048MetLys: 2.048 ± 0.377
1.975MetLeu: 1.975 ± 0.322
0.585MetMet: 0.585 ± 0.186
1.024MetAsn: 1.024 ± 0.278
1.097MetPro: 1.097 ± 0.291
1.243MetGln: 1.243 ± 0.258
1.609MetArg: 1.609 ± 0.343
2.048MetSer: 2.048 ± 0.405
1.609MetThr: 1.609 ± 0.385
1.463MetVal: 1.463 ± 0.327
0.512MetTrp: 0.512 ± 0.175
0.731MetTyr: 0.731 ± 0.211
0.0MetXaa: 0.0 ± 0.0
Asn
3.291AsnAla: 3.291 ± 0.539
0.878AsnCys: 0.878 ± 0.251
2.706AsnAsp: 2.706 ± 0.4
2.925AsnGlu: 2.925 ± 0.453
1.609AsnPhe: 1.609 ± 0.351
2.999AsnGly: 2.999 ± 0.627
1.097AsnHis: 1.097 ± 0.283
2.852AsnIle: 2.852 ± 0.52
1.609AsnLys: 1.609 ± 0.361
2.999AsnLeu: 2.999 ± 0.464
1.097AsnMet: 1.097 ± 0.251
1.682AsnAsn: 1.682 ± 0.411
2.925AsnPro: 2.925 ± 0.477
2.414AsnGln: 2.414 ± 0.54
2.852AsnArg: 2.852 ± 0.41
2.048AsnSer: 2.048 ± 0.367
2.852AsnThr: 2.852 ± 0.514
2.487AsnVal: 2.487 ± 0.418
0.878AsnTrp: 0.878 ± 0.201
1.316AsnTyr: 1.316 ± 0.34
0.0AsnXaa: 0.0 ± 0.0
Pro
3.364ProAla: 3.364 ± 0.574
0.366ProCys: 0.366 ± 0.165
3.218ProAsp: 3.218 ± 0.469
4.827ProGlu: 4.827 ± 0.658
2.34ProPhe: 2.34 ± 0.428
3.437ProGly: 3.437 ± 0.519
0.731ProHis: 0.731 ± 0.225
1.463ProIle: 1.463 ± 0.344
2.999ProLys: 2.999 ± 0.391
2.121ProLeu: 2.121 ± 0.342
0.951ProMet: 0.951 ± 0.25
2.048ProAsn: 2.048 ± 0.465
1.682ProPro: 1.682 ± 0.433
2.048ProGln: 2.048 ± 0.417
1.536ProArg: 1.536 ± 0.412
2.194ProSer: 2.194 ± 0.445
2.706ProThr: 2.706 ± 0.374
2.633ProVal: 2.633 ± 0.531
0.731ProTrp: 0.731 ± 0.224
1.755ProTyr: 1.755 ± 0.38
0.0ProXaa: 0.0 ± 0.0
Gln
4.608GlnAla: 4.608 ± 0.856
0.293GlnCys: 0.293 ± 0.147
2.121GlnAsp: 2.121 ± 0.46
3.511GlnGlu: 3.511 ± 0.542
1.17GlnPhe: 1.17 ± 0.315
2.852GlnGly: 2.852 ± 0.529
0.366GlnHis: 0.366 ± 0.144
1.609GlnIle: 1.609 ± 0.377
1.39GlnLys: 1.39 ± 0.285
3.218GlnLeu: 3.218 ± 0.562
1.755GlnMet: 1.755 ± 0.39
1.828GlnAsn: 1.828 ± 0.445
1.097GlnPro: 1.097 ± 0.284
2.048GlnGln: 2.048 ± 0.416
3.072GlnArg: 3.072 ± 0.492
2.267GlnSer: 2.267 ± 0.447
1.755GlnThr: 1.755 ± 0.373
3.657GlnVal: 3.657 ± 0.463
0.512GlnTrp: 0.512 ± 0.222
1.024GlnTyr: 1.024 ± 0.289
0.0GlnXaa: 0.0 ± 0.0
Arg
4.681ArgAla: 4.681 ± 0.795
0.585ArgCys: 0.585 ± 0.212
3.291ArgAsp: 3.291 ± 0.503
3.218ArgGlu: 3.218 ± 0.545
1.902ArgPhe: 1.902 ± 0.427
4.242ArgGly: 4.242 ± 0.577
1.17ArgHis: 1.17 ± 0.275
3.949ArgIle: 3.949 ± 0.532
3.511ArgLys: 3.511 ± 0.588
5.412ArgLeu: 5.412 ± 0.58
1.975ArgMet: 1.975 ± 0.372
3.291ArgAsn: 3.291 ± 0.466
2.34ArgPro: 2.34 ± 0.403
2.706ArgGln: 2.706 ± 0.459
3.657ArgArg: 3.657 ± 0.656
3.291ArgSer: 3.291 ± 0.498
2.048ArgThr: 2.048 ± 0.388
4.827ArgVal: 4.827 ± 0.559
1.097ArgTrp: 1.097 ± 0.302
1.536ArgTyr: 1.536 ± 0.384
0.0ArgXaa: 0.0 ± 0.0
Ser
4.534SerAla: 4.534 ± 0.597
0.658SerCys: 0.658 ± 0.24
3.364SerAsp: 3.364 ± 0.521
4.973SerGlu: 4.973 ± 0.599
2.048SerPhe: 2.048 ± 0.471
5.558SerGly: 5.558 ± 0.509
1.536SerHis: 1.536 ± 0.309
3.364SerIle: 3.364 ± 0.417
3.511SerLys: 3.511 ± 0.557
3.584SerLeu: 3.584 ± 0.618
1.316SerMet: 1.316 ± 0.265
2.414SerAsn: 2.414 ± 0.421
2.56SerPro: 2.56 ± 0.535
2.34SerGln: 2.34 ± 0.466
2.56SerArg: 2.56 ± 0.31
4.169SerSer: 4.169 ± 0.648
2.194SerThr: 2.194 ± 0.442
4.023SerVal: 4.023 ± 0.56
1.243SerTrp: 1.243 ± 0.395
1.902SerTyr: 1.902 ± 0.421
0.0SerXaa: 0.0 ± 0.0
Thr
4.534ThrAla: 4.534 ± 0.512
0.512ThrCys: 0.512 ± 0.227
2.048ThrAsp: 2.048 ± 0.428
3.218ThrGlu: 3.218 ± 0.458
2.487ThrPhe: 2.487 ± 0.499
4.534ThrGly: 4.534 ± 0.692
1.463ThrHis: 1.463 ± 0.296
2.56ThrIle: 2.56 ± 0.498
2.487ThrLys: 2.487 ± 0.4
4.973ThrLeu: 4.973 ± 0.582
1.463ThrMet: 1.463 ± 0.305
2.048ThrAsn: 2.048 ± 0.348
3.584ThrPro: 3.584 ± 0.604
2.414ThrGln: 2.414 ± 0.408
2.999ThrArg: 2.999 ± 0.438
2.56ThrSer: 2.56 ± 0.452
2.779ThrThr: 2.779 ± 0.507
3.584ThrVal: 3.584 ± 0.556
0.731ThrTrp: 0.731 ± 0.251
1.243ThrTyr: 1.243 ± 0.32
0.0ThrXaa: 0.0 ± 0.0
Val
4.608ValAla: 4.608 ± 0.619
0.585ValCys: 0.585 ± 0.203
4.681ValAsp: 4.681 ± 0.492
4.388ValGlu: 4.388 ± 0.615
3.145ValPhe: 3.145 ± 0.425
5.705ValGly: 5.705 ± 0.65
2.048ValHis: 2.048 ± 0.426
2.852ValIle: 2.852 ± 0.497
3.73ValLys: 3.73 ± 0.469
5.339ValLeu: 5.339 ± 0.672
1.609ValMet: 1.609 ± 0.31
3.803ValAsn: 3.803 ± 0.406
2.121ValPro: 2.121 ± 0.395
1.902ValGln: 1.902 ± 0.411
4.534ValArg: 4.534 ± 0.481
4.534ValSer: 4.534 ± 0.517
4.681ValThr: 4.681 ± 0.603
5.558ValVal: 5.558 ± 0.724
0.951ValTrp: 0.951 ± 0.297
3.072ValTyr: 3.072 ± 0.518
0.0ValXaa: 0.0 ± 0.0
Trp
1.463TrpAla: 1.463 ± 0.29
0.366TrpCys: 0.366 ± 0.166
1.243TrpAsp: 1.243 ± 0.29
1.463TrpGlu: 1.463 ± 0.302
0.585TrpPhe: 0.585 ± 0.177
1.243TrpGly: 1.243 ± 0.292
0.0TrpHis: 0.0 ± 0.0
0.585TrpIle: 0.585 ± 0.253
1.682TrpLys: 1.682 ± 0.375
1.243TrpLeu: 1.243 ± 0.291
0.658TrpMet: 0.658 ± 0.196
0.731TrpAsn: 0.731 ± 0.234
0.731TrpPro: 0.731 ± 0.339
0.439TrpGln: 0.439 ± 0.181
1.024TrpArg: 1.024 ± 0.268
1.097TrpSer: 1.097 ± 0.393
1.024TrpThr: 1.024 ± 0.261
0.658TrpVal: 0.658 ± 0.204
0.366TrpTrp: 0.366 ± 0.202
0.585TrpTyr: 0.585 ± 0.213
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.828TyrAla: 1.828 ± 0.36
0.658TyrCys: 0.658 ± 0.219
2.633TyrAsp: 2.633 ± 0.433
1.902TyrGlu: 1.902 ± 0.316
1.243TyrPhe: 1.243 ± 0.258
2.779TyrGly: 2.779 ± 0.407
0.731TyrHis: 0.731 ± 0.29
1.682TyrIle: 1.682 ± 0.288
1.536TyrLys: 1.536 ± 0.335
2.487TyrLeu: 2.487 ± 0.439
0.512TyrMet: 0.512 ± 0.26
1.828TyrAsn: 1.828 ± 0.269
1.609TyrPro: 1.609 ± 0.39
0.951TyrGln: 0.951 ± 0.221
2.048TyrArg: 2.048 ± 0.349
1.828TyrSer: 1.828 ± 0.334
1.755TyrThr: 1.755 ± 0.423
2.414TyrVal: 2.414 ± 0.415
0.366TyrTrp: 0.366 ± 0.17
1.243TyrTyr: 1.243 ± 0.305
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 68 proteins (13674 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski