Amino acid dipepetide frequency for Pseudomonas phage Andromeda

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.108AlaAla: 16.108 ± 2.315
0.797AlaCys: 0.797 ± 0.353
6.699AlaAsp: 6.699 ± 0.811
7.895AlaGlu: 7.895 ± 0.998
3.19AlaPhe: 3.19 ± 0.483
8.852AlaGly: 8.852 ± 0.81
2.073AlaHis: 2.073 ± 0.324
5.981AlaIle: 5.981 ± 0.669
7.177AlaLys: 7.177 ± 0.83
8.772AlaLeu: 8.772 ± 0.839
3.349AlaMet: 3.349 ± 0.644
4.545AlaAsn: 4.545 ± 0.961
2.791AlaPro: 2.791 ± 0.585
5.024AlaGln: 5.024 ± 0.81
6.539AlaArg: 6.539 ± 0.728
5.502AlaSer: 5.502 ± 0.704
6.14AlaThr: 6.14 ± 0.866
7.337AlaVal: 7.337 ± 0.695
1.356AlaTrp: 1.356 ± 0.312
3.19AlaTyr: 3.19 ± 0.547
0.0AlaXaa: 0.0 ± 0.0
Cys
0.957CysAla: 0.957 ± 0.253
0.08CysCys: 0.08 ± 0.073
0.797CysAsp: 0.797 ± 0.318
0.239CysGlu: 0.239 ± 0.142
0.239CysPhe: 0.239 ± 0.132
0.399CysGly: 0.399 ± 0.216
0.399CysHis: 0.399 ± 0.156
0.239CysIle: 0.239 ± 0.134
0.558CysLys: 0.558 ± 0.236
0.957CysLeu: 0.957 ± 0.309
0.638CysMet: 0.638 ± 0.246
0.319CysAsn: 0.319 ± 0.166
0.558CysPro: 0.558 ± 0.255
0.159CysGln: 0.159 ± 0.125
0.797CysArg: 0.797 ± 0.315
0.478CysSer: 0.478 ± 0.255
0.399CysThr: 0.399 ± 0.246
0.718CysVal: 0.718 ± 0.22
0.08CysTrp: 0.08 ± 0.077
0.239CysTyr: 0.239 ± 0.156
0.0CysXaa: 0.0 ± 0.0
Asp
7.416AspAla: 7.416 ± 0.718
0.718AspCys: 0.718 ± 0.297
4.067AspAsp: 4.067 ± 0.532
3.429AspGlu: 3.429 ± 0.532
1.754AspPhe: 1.754 ± 0.385
5.263AspGly: 5.263 ± 0.795
0.877AspHis: 0.877 ± 0.25
2.791AspIle: 2.791 ± 0.462
3.907AspLys: 3.907 ± 0.463
5.742AspLeu: 5.742 ± 0.643
2.073AspMet: 2.073 ± 0.398
2.392AspAsn: 2.392 ± 0.418
3.429AspPro: 3.429 ± 0.583
2.472AspGln: 2.472 ± 0.451
2.951AspArg: 2.951 ± 0.49
3.748AspSer: 3.748 ± 0.549
2.951AspThr: 2.951 ± 0.318
4.785AspVal: 4.785 ± 0.676
0.957AspTrp: 0.957 ± 0.378
1.754AspTyr: 1.754 ± 0.316
0.0AspXaa: 0.0 ± 0.0
Glu
6.3GluAla: 6.3 ± 0.86
0.718GluCys: 0.718 ± 0.225
2.951GluAsp: 2.951 ± 0.447
2.871GluGlu: 2.871 ± 0.451
2.392GluPhe: 2.392 ± 0.385
4.785GluGly: 4.785 ± 0.61
1.595GluHis: 1.595 ± 0.384
1.914GluIle: 1.914 ± 0.387
2.233GluLys: 2.233 ± 0.467
4.944GluLeu: 4.944 ± 0.522
1.675GluMet: 1.675 ± 0.389
1.675GluAsn: 1.675 ± 0.336
2.632GluPro: 2.632 ± 0.508
3.27GluGln: 3.27 ± 0.508
3.589GluArg: 3.589 ± 0.576
3.19GluSer: 3.19 ± 0.455
2.632GluThr: 2.632 ± 0.49
4.306GluVal: 4.306 ± 0.853
1.116GluTrp: 1.116 ± 0.359
1.754GluTyr: 1.754 ± 0.298
0.0GluXaa: 0.0 ± 0.0
Phe
2.472PheAla: 2.472 ± 0.458
0.319PheCys: 0.319 ± 0.172
3.748PheAsp: 3.748 ± 0.506
1.675PheGlu: 1.675 ± 0.321
1.435PhePhe: 1.435 ± 0.365
2.871PheGly: 2.871 ± 0.545
1.037PheHis: 1.037 ± 0.358
2.073PheIle: 2.073 ± 0.328
2.313PheLys: 2.313 ± 0.399
2.233PheLeu: 2.233 ± 0.366
0.638PheMet: 0.638 ± 0.202
2.073PheAsn: 2.073 ± 0.451
1.834PhePro: 1.834 ± 0.427
2.073PheGln: 2.073 ± 0.331
2.233PheArg: 2.233 ± 0.424
1.754PheSer: 1.754 ± 0.442
2.153PheThr: 2.153 ± 0.326
1.994PheVal: 1.994 ± 0.374
0.399PheTrp: 0.399 ± 0.184
0.877PheTyr: 0.877 ± 0.242
0.0PheXaa: 0.0 ± 0.0
Gly
8.134GlyAla: 8.134 ± 1.055
0.877GlyCys: 0.877 ± 0.324
4.864GlyAsp: 4.864 ± 0.531
3.509GlyGlu: 3.509 ± 0.471
1.994GlyPhe: 1.994 ± 0.413
7.895GlyGly: 7.895 ± 0.933
1.834GlyHis: 1.834 ± 0.455
3.429GlyIle: 3.429 ± 0.468
5.901GlyLys: 5.901 ± 0.594
4.625GlyLeu: 4.625 ± 0.658
3.589GlyMet: 3.589 ± 0.821
3.27GlyAsn: 3.27 ± 0.449
2.313GlyPro: 2.313 ± 0.414
4.147GlyGln: 4.147 ± 0.419
4.306GlyArg: 4.306 ± 0.64
4.306GlySer: 4.306 ± 0.558
5.981GlyThr: 5.981 ± 0.954
4.545GlyVal: 4.545 ± 0.459
0.877GlyTrp: 0.877 ± 0.227
1.515GlyTyr: 1.515 ± 0.373
0.0GlyXaa: 0.0 ± 0.0
His
2.632HisAla: 2.632 ± 0.483
0.239HisCys: 0.239 ± 0.139
1.116HisAsp: 1.116 ± 0.326
1.037HisGlu: 1.037 ± 0.267
0.957HisPhe: 0.957 ± 0.337
1.994HisGly: 1.994 ± 0.399
0.558HisHis: 0.558 ± 0.278
1.754HisIle: 1.754 ± 0.359
0.957HisLys: 0.957 ± 0.245
1.356HisLeu: 1.356 ± 0.28
0.718HisMet: 0.718 ± 0.251
0.797HisAsn: 0.797 ± 0.269
1.595HisPro: 1.595 ± 0.305
0.718HisGln: 0.718 ± 0.224
1.196HisArg: 1.196 ± 0.39
0.877HisSer: 0.877 ± 0.261
0.957HisThr: 0.957 ± 0.257
0.957HisVal: 0.957 ± 0.345
0.718HisTrp: 0.718 ± 0.214
0.638HisTyr: 0.638 ± 0.214
0.0HisXaa: 0.0 ± 0.0
Ile
4.147IleAla: 4.147 ± 0.547
0.638IleCys: 0.638 ± 0.209
3.11IleAsp: 3.11 ± 0.491
3.668IleGlu: 3.668 ± 0.45
1.595IlePhe: 1.595 ± 0.508
3.668IleGly: 3.668 ± 0.524
0.718IleHis: 0.718 ± 0.271
2.233IleIle: 2.233 ± 0.347
2.871IleLys: 2.871 ± 0.465
3.828IleLeu: 3.828 ± 0.435
1.595IleMet: 1.595 ± 0.313
2.313IleAsn: 2.313 ± 0.426
2.313IlePro: 2.313 ± 0.48
2.233IleGln: 2.233 ± 0.401
2.951IleArg: 2.951 ± 0.525
2.153IleSer: 2.153 ± 0.356
3.27IleThr: 3.27 ± 0.619
2.392IleVal: 2.392 ± 0.535
0.399IleTrp: 0.399 ± 0.163
0.718IleTyr: 0.718 ± 0.214
0.0IleXaa: 0.0 ± 0.0
Lys
6.061LysAla: 6.061 ± 0.832
0.159LysCys: 0.159 ± 0.118
4.705LysAsp: 4.705 ± 0.638
3.11LysGlu: 3.11 ± 0.551
2.153LysPhe: 2.153 ± 0.433
3.668LysGly: 3.668 ± 0.485
1.754LysHis: 1.754 ± 0.456
2.313LysIle: 2.313 ± 0.446
1.754LysLys: 1.754 ± 0.399
4.944LysLeu: 4.944 ± 0.471
1.037LysMet: 1.037 ± 0.338
2.392LysAsn: 2.392 ± 0.382
2.632LysPro: 2.632 ± 0.502
2.791LysGln: 2.791 ± 0.476
2.871LysArg: 2.871 ± 0.5
1.994LysSer: 1.994 ± 0.436
3.27LysThr: 3.27 ± 0.619
3.668LysVal: 3.668 ± 0.556
1.116LysTrp: 1.116 ± 0.318
1.515LysTyr: 1.515 ± 0.299
0.0LysXaa: 0.0 ± 0.0
Leu
9.649LeuAla: 9.649 ± 0.872
0.558LeuCys: 0.558 ± 0.217
5.343LeuAsp: 5.343 ± 0.551
4.226LeuGlu: 4.226 ± 0.57
2.392LeuPhe: 2.392 ± 0.412
5.263LeuGly: 5.263 ± 0.806
1.515LeuHis: 1.515 ± 0.516
4.545LeuIle: 4.545 ± 0.647
3.668LeuLys: 3.668 ± 0.465
6.778LeuLeu: 6.778 ± 0.648
2.472LeuMet: 2.472 ± 0.634
3.27LeuAsn: 3.27 ± 0.518
4.306LeuPro: 4.306 ± 0.555
3.19LeuGln: 3.19 ± 0.572
5.104LeuArg: 5.104 ± 0.788
5.423LeuSer: 5.423 ± 0.715
4.864LeuThr: 4.864 ± 0.664
5.343LeuVal: 5.343 ± 0.613
0.957LeuTrp: 0.957 ± 0.257
2.153LeuTyr: 2.153 ± 0.432
0.0LeuXaa: 0.0 ± 0.0
Met
3.349MetAla: 3.349 ± 0.704
0.159MetCys: 0.159 ± 0.104
2.153MetAsp: 2.153 ± 0.476
1.515MetGlu: 1.515 ± 0.387
1.356MetPhe: 1.356 ± 0.276
2.313MetGly: 2.313 ± 0.438
0.957MetHis: 0.957 ± 0.247
0.957MetIle: 0.957 ± 0.283
1.435MetLys: 1.435 ± 0.382
2.711MetLeu: 2.711 ± 0.583
0.638MetMet: 0.638 ± 0.221
1.435MetAsn: 1.435 ± 0.384
1.435MetPro: 1.435 ± 0.465
1.675MetGln: 1.675 ± 0.523
1.515MetArg: 1.515 ± 0.292
2.472MetSer: 2.472 ± 0.356
1.196MetThr: 1.196 ± 0.25
1.754MetVal: 1.754 ± 0.364
0.319MetTrp: 0.319 ± 0.157
1.116MetTyr: 1.116 ± 0.272
0.0MetXaa: 0.0 ± 0.0
Asn
4.226AsnAla: 4.226 ± 0.705
0.239AsnCys: 0.239 ± 0.223
2.153AsnAsp: 2.153 ± 0.424
2.472AsnGlu: 2.472 ± 0.442
1.515AsnPhe: 1.515 ± 0.3
2.632AsnGly: 2.632 ± 0.383
0.558AsnHis: 0.558 ± 0.272
1.435AsnIle: 1.435 ± 0.339
2.233AsnLys: 2.233 ± 0.546
3.03AsnLeu: 3.03 ± 0.387
1.037AsnMet: 1.037 ± 0.308
1.276AsnAsn: 1.276 ± 0.358
3.03AsnPro: 3.03 ± 0.43
2.472AsnGln: 2.472 ± 0.525
2.552AsnArg: 2.552 ± 0.532
2.313AsnSer: 2.313 ± 0.473
2.233AsnThr: 2.233 ± 0.536
3.349AsnVal: 3.349 ± 0.67
0.797AsnTrp: 0.797 ± 0.331
1.435AsnTyr: 1.435 ± 0.278
0.0AsnXaa: 0.0 ± 0.0
Pro
4.545ProAla: 4.545 ± 0.847
0.558ProCys: 0.558 ± 0.277
3.509ProAsp: 3.509 ± 0.518
3.828ProGlu: 3.828 ± 0.647
1.914ProPhe: 1.914 ± 0.468
3.987ProGly: 3.987 ± 0.676
0.957ProHis: 0.957 ± 0.284
1.994ProIle: 1.994 ± 0.323
2.472ProLys: 2.472 ± 0.55
3.668ProLeu: 3.668 ± 0.73
1.116ProMet: 1.116 ± 0.275
1.515ProAsn: 1.515 ± 0.471
1.356ProPro: 1.356 ± 0.415
1.994ProGln: 1.994 ± 0.42
1.515ProArg: 1.515 ± 0.373
2.073ProSer: 2.073 ± 0.392
2.472ProThr: 2.472 ± 0.455
2.632ProVal: 2.632 ± 0.557
0.797ProTrp: 0.797 ± 0.231
1.356ProTyr: 1.356 ± 0.21
0.0ProXaa: 0.0 ± 0.0
Gln
6.858GlnAla: 6.858 ± 1.083
0.319GlnCys: 0.319 ± 0.132
1.994GlnAsp: 1.994 ± 0.414
1.834GlnGlu: 1.834 ± 0.35
2.472GlnPhe: 2.472 ± 0.423
2.791GlnGly: 2.791 ± 0.492
1.595GlnHis: 1.595 ± 0.293
2.153GlnIle: 2.153 ± 0.412
2.073GlnLys: 2.073 ± 0.36
3.668GlnLeu: 3.668 ± 0.569
1.914GlnMet: 1.914 ± 0.41
1.595GlnAsn: 1.595 ± 0.312
1.675GlnPro: 1.675 ± 0.36
4.067GlnGln: 4.067 ± 0.955
2.711GlnArg: 2.711 ± 0.5
2.632GlnSer: 2.632 ± 0.519
2.552GlnThr: 2.552 ± 0.388
3.27GlnVal: 3.27 ± 0.693
0.877GlnTrp: 0.877 ± 0.233
1.435GlnTyr: 1.435 ± 0.364
0.0GlnXaa: 0.0 ± 0.0
Arg
5.263ArgAla: 5.263 ± 0.628
0.319ArgCys: 0.319 ± 0.156
3.828ArgAsp: 3.828 ± 0.632
3.828ArgGlu: 3.828 ± 0.772
1.994ArgPhe: 1.994 ± 0.379
3.03ArgGly: 3.03 ± 0.51
1.116ArgHis: 1.116 ± 0.39
2.871ArgIle: 2.871 ± 0.522
3.27ArgLys: 3.27 ± 0.563
4.625ArgLeu: 4.625 ± 0.651
1.595ArgMet: 1.595 ± 0.25
2.791ArgAsn: 2.791 ± 0.398
1.914ArgPro: 1.914 ± 0.402
3.589ArgGln: 3.589 ± 0.543
3.349ArgArg: 3.349 ± 0.603
3.509ArgSer: 3.509 ± 0.478
2.871ArgThr: 2.871 ± 0.457
4.306ArgVal: 4.306 ± 0.721
0.957ArgTrp: 0.957 ± 0.238
1.515ArgTyr: 1.515 ± 0.412
0.0ArgXaa: 0.0 ± 0.0
Ser
6.619SerAla: 6.619 ± 0.766
0.558SerCys: 0.558 ± 0.312
2.791SerAsp: 2.791 ± 0.472
2.392SerGlu: 2.392 ± 0.471
1.595SerPhe: 1.595 ± 0.3
4.306SerGly: 4.306 ± 0.698
1.037SerHis: 1.037 ± 0.194
3.03SerIle: 3.03 ± 0.543
3.589SerLys: 3.589 ± 0.63
4.944SerLeu: 4.944 ± 0.559
1.914SerMet: 1.914 ± 0.415
2.153SerAsn: 2.153 ± 0.5
2.153SerPro: 2.153 ± 0.407
1.994SerGln: 1.994 ± 0.381
3.03SerArg: 3.03 ± 0.383
2.632SerSer: 2.632 ± 0.492
3.668SerThr: 3.668 ± 0.439
3.668SerVal: 3.668 ± 0.491
0.399SerTrp: 0.399 ± 0.184
1.356SerTyr: 1.356 ± 0.407
0.0SerXaa: 0.0 ± 0.0
Thr
7.257ThrAla: 7.257 ± 1.087
0.718ThrCys: 0.718 ± 0.234
3.27ThrAsp: 3.27 ± 0.546
3.349ThrGlu: 3.349 ± 0.495
2.233ThrPhe: 2.233 ± 0.299
5.901ThrGly: 5.901 ± 0.677
0.877ThrHis: 0.877 ± 0.298
2.153ThrIle: 2.153 ± 0.347
2.313ThrLys: 2.313 ± 0.466
5.582ThrLeu: 5.582 ± 0.818
1.515ThrMet: 1.515 ± 0.336
2.552ThrAsn: 2.552 ± 0.394
3.349ThrPro: 3.349 ± 0.44
1.834ThrGln: 1.834 ± 0.413
2.951ThrArg: 2.951 ± 0.41
3.27ThrSer: 3.27 ± 0.553
2.472ThrThr: 2.472 ± 0.533
3.27ThrVal: 3.27 ± 0.505
0.638ThrTrp: 0.638 ± 0.312
1.754ThrTyr: 1.754 ± 0.463
0.0ThrXaa: 0.0 ± 0.0
Val
7.177ValAla: 7.177 ± 0.781
0.319ValCys: 0.319 ± 0.174
3.987ValAsp: 3.987 ± 0.537
3.589ValGlu: 3.589 ± 0.571
2.951ValPhe: 2.951 ± 0.493
4.705ValGly: 4.705 ± 0.604
1.435ValHis: 1.435 ± 0.361
3.11ValIle: 3.11 ± 0.446
3.987ValLys: 3.987 ± 0.472
3.748ValLeu: 3.748 ± 0.491
1.834ValMet: 1.834 ± 0.287
3.19ValAsn: 3.19 ± 0.54
3.19ValPro: 3.19 ± 0.485
2.632ValGln: 2.632 ± 0.55
3.589ValArg: 3.589 ± 0.651
3.429ValSer: 3.429 ± 0.516
4.705ValThr: 4.705 ± 0.582
5.024ValVal: 5.024 ± 0.846
1.037ValTrp: 1.037 ± 0.211
2.313ValTyr: 2.313 ± 0.425
0.0ValXaa: 0.0 ± 0.0
Trp
1.116TrpAla: 1.116 ± 0.243
0.399TrpCys: 0.399 ± 0.158
0.638TrpAsp: 0.638 ± 0.217
0.638TrpGlu: 0.638 ± 0.265
0.558TrpPhe: 0.558 ± 0.257
0.957TrpGly: 0.957 ± 0.277
0.399TrpHis: 0.399 ± 0.205
0.558TrpIle: 0.558 ± 0.187
0.319TrpLys: 0.319 ± 0.144
2.153TrpLeu: 2.153 ± 0.47
0.399TrpMet: 0.399 ± 0.152
0.319TrpAsn: 0.319 ± 0.154
0.399TrpPro: 0.399 ± 0.161
0.797TrpGln: 0.797 ± 0.275
1.356TrpArg: 1.356 ± 0.406
0.319TrpSer: 0.319 ± 0.129
0.957TrpThr: 0.957 ± 0.203
1.276TrpVal: 1.276 ± 0.322
0.159TrpTrp: 0.159 ± 0.092
0.558TrpTyr: 0.558 ± 0.205
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.03TyrAla: 3.03 ± 0.417
0.558TyrCys: 0.558 ± 0.233
1.754TyrAsp: 1.754 ± 0.466
1.435TyrGlu: 1.435 ± 0.318
1.515TyrPhe: 1.515 ± 0.343
2.632TyrGly: 2.632 ± 0.445
0.478TyrHis: 0.478 ± 0.188
1.276TyrIle: 1.276 ± 0.36
0.877TyrLys: 0.877 ± 0.223
2.791TyrLeu: 2.791 ± 0.545
0.638TyrMet: 0.638 ± 0.186
1.116TyrAsn: 1.116 ± 0.284
1.356TyrPro: 1.356 ± 0.41
1.356TyrGln: 1.356 ± 0.363
1.435TyrArg: 1.435 ± 0.303
1.754TyrSer: 1.754 ± 0.309
1.515TyrThr: 1.515 ± 0.306
1.356TyrVal: 1.356 ± 0.373
0.319TyrTrp: 0.319 ± 0.13
0.558TyrTyr: 0.558 ± 0.238
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 46 proteins (12541 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski