Amino acid dipepetide frequency for Salmonella phage SP069

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.562AlaAla: 9.562 ± 1.665
0.81AlaCys: 0.81 ± 0.246
4.781AlaAsp: 4.781 ± 0.764
5.997AlaGlu: 5.997 ± 0.767
2.836AlaPhe: 2.836 ± 0.495
6.321AlaGly: 6.321 ± 0.756
1.621AlaHis: 1.621 ± 0.358
4.295AlaIle: 4.295 ± 0.54
4.295AlaLys: 4.295 ± 0.758
6.564AlaLeu: 6.564 ± 0.934
2.269AlaMet: 2.269 ± 0.364
5.186AlaAsn: 5.186 ± 0.736
2.35AlaPro: 2.35 ± 0.359
3.89AlaGln: 3.89 ± 1.329
3.485AlaArg: 3.485 ± 0.672
5.186AlaSer: 5.186 ± 0.683
4.943AlaThr: 4.943 ± 1.109
4.538AlaVal: 4.538 ± 0.663
0.891AlaTrp: 0.891 ± 0.273
2.431AlaTyr: 2.431 ± 0.38
0.0AlaXaa: 0.0 ± 0.0
Cys
0.81CysAla: 0.81 ± 0.28
0.243CysCys: 0.243 ± 0.228
1.053CysAsp: 1.053 ± 0.382
0.567CysGlu: 0.567 ± 0.223
0.243CysPhe: 0.243 ± 0.13
0.486CysGly: 0.486 ± 0.197
0.162CysHis: 0.162 ± 0.105
0.972CysIle: 0.972 ± 0.265
0.405CysLys: 0.405 ± 0.162
0.567CysLeu: 0.567 ± 0.177
0.243CysMet: 0.243 ± 0.129
0.567CysAsn: 0.567 ± 0.187
0.324CysPro: 0.324 ± 0.194
0.567CysGln: 0.567 ± 0.189
0.648CysArg: 0.648 ± 0.231
0.405CysSer: 0.405 ± 0.231
0.486CysThr: 0.486 ± 0.19
0.729CysVal: 0.729 ± 0.234
0.324CysTrp: 0.324 ± 0.151
0.648CysTyr: 0.648 ± 0.228
0.0CysXaa: 0.0 ± 0.0
Asp
5.267AspAla: 5.267 ± 0.695
0.486AspCys: 0.486 ± 0.247
5.186AspAsp: 5.186 ± 0.677
3.647AspGlu: 3.647 ± 0.488
2.917AspPhe: 2.917 ± 0.429
4.457AspGly: 4.457 ± 0.578
0.972AspHis: 0.972 ± 0.294
3.89AspIle: 3.89 ± 0.543
3.323AspLys: 3.323 ± 0.508
4.619AspLeu: 4.619 ± 0.546
2.107AspMet: 2.107 ± 0.342
3.566AspAsn: 3.566 ± 0.628
2.836AspPro: 2.836 ± 0.52
1.864AspGln: 1.864 ± 0.434
3.647AspArg: 3.647 ± 0.532
2.755AspSer: 2.755 ± 0.374
3.241AspThr: 3.241 ± 0.451
4.943AspVal: 4.943 ± 0.547
1.459AspTrp: 1.459 ± 0.324
2.917AspTyr: 2.917 ± 0.571
0.0AspXaa: 0.0 ± 0.0
Glu
4.295GluAla: 4.295 ± 0.567
0.567GluCys: 0.567 ± 0.195
2.674GluAsp: 2.674 ± 0.531
2.755GluGlu: 2.755 ± 0.468
1.702GluPhe: 1.702 ± 0.397
3.404GluGly: 3.404 ± 0.573
1.459GluHis: 1.459 ± 0.444
4.457GluIle: 4.457 ± 0.565
4.295GluLys: 4.295 ± 0.657
6.402GluLeu: 6.402 ± 0.862
1.54GluMet: 1.54 ± 0.486
2.998GluAsn: 2.998 ± 0.503
2.35GluPro: 2.35 ± 0.563
4.376GluGln: 4.376 ± 0.736
4.295GluArg: 4.295 ± 0.71
2.593GluSer: 2.593 ± 0.549
2.917GluThr: 2.917 ± 0.59
3.323GluVal: 3.323 ± 0.599
1.621GluTrp: 1.621 ± 0.366
2.593GluTyr: 2.593 ± 0.399
0.0GluXaa: 0.0 ± 0.0
Phe
2.674PheAla: 2.674 ± 0.58
0.486PheCys: 0.486 ± 0.203
2.188PheAsp: 2.188 ± 0.36
1.702PheGlu: 1.702 ± 0.343
0.972PhePhe: 0.972 ± 0.27
2.269PheGly: 2.269 ± 0.537
0.324PheHis: 0.324 ± 0.151
1.945PheIle: 1.945 ± 0.4
2.917PheLys: 2.917 ± 0.539
2.107PheLeu: 2.107 ± 0.468
1.053PheMet: 1.053 ± 0.295
1.216PheAsn: 1.216 ± 0.352
1.783PhePro: 1.783 ± 0.322
1.053PheGln: 1.053 ± 0.291
1.135PheArg: 1.135 ± 0.276
2.431PheSer: 2.431 ± 0.482
3.485PheThr: 3.485 ± 0.612
2.998PheVal: 2.998 ± 0.462
0.486PheTrp: 0.486 ± 0.213
1.54PheTyr: 1.54 ± 0.415
0.0PheXaa: 0.0 ± 0.0
Gly
4.538GlyAla: 4.538 ± 0.759
0.486GlyCys: 0.486 ± 0.2
4.619GlyAsp: 4.619 ± 0.553
5.348GlyGlu: 5.348 ± 0.5
3.647GlyPhe: 3.647 ± 0.74
5.916GlyGly: 5.916 ± 1.12
0.972GlyHis: 0.972 ± 0.324
3.971GlyIle: 3.971 ± 0.566
3.89GlyLys: 3.89 ± 0.446
5.511GlyLeu: 5.511 ± 0.674
1.945GlyMet: 1.945 ± 0.387
3.566GlyAsn: 3.566 ± 0.553
1.54GlyPro: 1.54 ± 0.323
2.836GlyGln: 2.836 ± 0.461
3.079GlyArg: 3.079 ± 0.509
5.267GlySer: 5.267 ± 0.56
5.024GlyThr: 5.024 ± 0.669
5.673GlyVal: 5.673 ± 0.64
1.945GlyTrp: 1.945 ± 0.38
2.836GlyTyr: 2.836 ± 0.433
0.0GlyXaa: 0.0 ± 0.0
His
1.135HisAla: 1.135 ± 0.327
0.324HisCys: 0.324 ± 0.149
1.216HisAsp: 1.216 ± 0.408
1.378HisGlu: 1.378 ± 0.319
0.405HisPhe: 0.405 ± 0.15
1.54HisGly: 1.54 ± 0.356
1.053HisHis: 1.053 ± 0.592
1.297HisIle: 1.297 ± 0.303
1.135HisLys: 1.135 ± 0.332
1.216HisLeu: 1.216 ± 0.288
0.0HisMet: 0.0 ± 0.0
0.729HisAsn: 0.729 ± 0.186
0.81HisPro: 0.81 ± 0.247
0.486HisGln: 0.486 ± 0.239
0.972HisArg: 0.972 ± 0.249
1.053HisSer: 1.053 ± 0.302
0.891HisThr: 0.891 ± 0.293
1.378HisVal: 1.378 ± 0.279
0.162HisTrp: 0.162 ± 0.116
0.648HisTyr: 0.648 ± 0.236
0.0HisXaa: 0.0 ± 0.0
Ile
4.862IleAla: 4.862 ± 0.582
0.81IleCys: 0.81 ± 0.306
5.511IleAsp: 5.511 ± 0.669
4.133IleGlu: 4.133 ± 0.641
1.945IlePhe: 1.945 ± 0.4
3.89IleGly: 3.89 ± 0.562
1.378IleHis: 1.378 ± 0.482
3.647IleIle: 3.647 ± 0.581
3.404IleLys: 3.404 ± 0.681
2.917IleLeu: 2.917 ± 0.408
1.621IleMet: 1.621 ± 0.346
4.052IleAsn: 4.052 ± 0.498
2.593IlePro: 2.593 ± 0.505
2.431IleGln: 2.431 ± 0.497
2.593IleArg: 2.593 ± 0.464
3.566IleSer: 3.566 ± 0.519
5.105IleThr: 5.105 ± 0.623
2.998IleVal: 2.998 ± 0.525
0.243IleTrp: 0.243 ± 0.144
2.35IleTyr: 2.35 ± 0.39
0.0IleXaa: 0.0 ± 0.0
Lys
5.348LysAla: 5.348 ± 1.018
0.81LysCys: 0.81 ± 0.246
3.16LysAsp: 3.16 ± 0.513
3.241LysGlu: 3.241 ± 0.476
2.107LysPhe: 2.107 ± 0.417
2.998LysGly: 2.998 ± 0.447
0.81LysHis: 0.81 ± 0.357
3.241LysIle: 3.241 ± 0.438
2.998LysLys: 2.998 ± 0.478
4.781LysLeu: 4.781 ± 0.665
1.54LysMet: 1.54 ± 0.364
2.026LysAsn: 2.026 ± 0.329
2.35LysPro: 2.35 ± 0.438
3.566LysGln: 3.566 ± 0.796
3.241LysArg: 3.241 ± 0.45
2.998LysSer: 2.998 ± 0.519
3.241LysThr: 3.241 ± 0.565
4.457LysVal: 4.457 ± 0.72
0.81LysTrp: 0.81 ± 0.282
1.702LysTyr: 1.702 ± 0.355
0.0LysXaa: 0.0 ± 0.0
Leu
7.536LeuAla: 7.536 ± 0.948
0.648LeuCys: 0.648 ± 0.241
4.052LeuAsp: 4.052 ± 0.461
4.7LeuGlu: 4.7 ± 0.654
2.107LeuPhe: 2.107 ± 0.394
4.052LeuGly: 4.052 ± 0.46
1.135LeuHis: 1.135 ± 0.257
3.566LeuIle: 3.566 ± 0.485
5.916LeuLys: 5.916 ± 0.529
6.321LeuLeu: 6.321 ± 0.772
2.107LeuMet: 2.107 ± 0.44
4.7LeuAsn: 4.7 ± 0.589
3.728LeuPro: 3.728 ± 0.519
3.404LeuGln: 3.404 ± 0.702
3.971LeuArg: 3.971 ± 0.684
6.078LeuSer: 6.078 ± 0.452
6.24LeuThr: 6.24 ± 0.5
4.376LeuVal: 4.376 ± 0.484
1.216LeuTrp: 1.216 ± 0.343
2.431LeuTyr: 2.431 ± 0.423
0.0LeuXaa: 0.0 ± 0.0
Met
2.674MetAla: 2.674 ± 0.385
0.243MetCys: 0.243 ± 0.166
1.459MetAsp: 1.459 ± 0.277
1.378MetGlu: 1.378 ± 0.303
1.135MetPhe: 1.135 ± 0.339
2.188MetGly: 2.188 ± 0.468
0.486MetHis: 0.486 ± 0.192
2.026MetIle: 2.026 ± 0.314
1.621MetLys: 1.621 ± 0.295
1.621MetLeu: 1.621 ± 0.334
0.567MetMet: 0.567 ± 0.22
1.378MetAsn: 1.378 ± 0.278
0.729MetPro: 0.729 ± 0.201
0.81MetGln: 0.81 ± 0.22
1.135MetArg: 1.135 ± 0.33
2.026MetSer: 2.026 ± 0.36
2.026MetThr: 2.026 ± 0.567
1.864MetVal: 1.864 ± 0.371
0.324MetTrp: 0.324 ± 0.19
0.486MetTyr: 0.486 ± 0.191
0.0MetXaa: 0.0 ± 0.0
Asn
4.538AsnAla: 4.538 ± 0.526
0.891AsnCys: 0.891 ± 0.278
2.593AsnAsp: 2.593 ± 0.438
3.404AsnGlu: 3.404 ± 0.461
1.702AsnPhe: 1.702 ± 0.365
4.7AsnGly: 4.7 ± 0.644
1.216AsnHis: 1.216 ± 0.29
3.566AsnIle: 3.566 ± 0.555
2.026AsnLys: 2.026 ± 0.386
3.647AsnLeu: 3.647 ± 0.789
1.378AsnMet: 1.378 ± 0.304
2.755AsnAsn: 2.755 ± 0.519
2.917AsnPro: 2.917 ± 0.465
2.431AsnGln: 2.431 ± 0.717
2.107AsnArg: 2.107 ± 0.344
3.241AsnSer: 3.241 ± 0.432
3.485AsnThr: 3.485 ± 0.725
3.89AsnVal: 3.89 ± 0.59
0.486AsnTrp: 0.486 ± 0.184
2.35AsnTyr: 2.35 ± 0.43
0.0AsnXaa: 0.0 ± 0.0
Pro
2.836ProAla: 2.836 ± 0.618
0.081ProCys: 0.081 ± 0.078
3.809ProAsp: 3.809 ± 0.616
2.593ProGlu: 2.593 ± 0.543
1.135ProPhe: 1.135 ± 0.284
3.971ProGly: 3.971 ± 0.546
0.729ProHis: 0.729 ± 0.317
1.702ProIle: 1.702 ± 0.285
1.621ProLys: 1.621 ± 0.364
2.107ProLeu: 2.107 ± 0.432
1.135ProMet: 1.135 ± 0.318
2.107ProAsn: 2.107 ± 0.433
1.621ProPro: 1.621 ± 0.37
1.135ProGln: 1.135 ± 0.282
2.188ProArg: 2.188 ± 0.393
1.945ProSer: 1.945 ± 0.383
2.998ProThr: 2.998 ± 0.532
4.781ProVal: 4.781 ± 0.713
0.648ProTrp: 0.648 ± 0.228
1.864ProTyr: 1.864 ± 0.394
0.0ProXaa: 0.0 ± 0.0
Gln
4.862GlnAla: 4.862 ± 1.399
0.81GlnCys: 0.81 ± 0.23
2.026GlnAsp: 2.026 ± 0.424
2.107GlnGlu: 2.107 ± 0.468
1.783GlnPhe: 1.783 ± 0.349
2.188GlnGly: 2.188 ± 0.432
0.648GlnHis: 0.648 ± 0.214
2.512GlnIle: 2.512 ± 0.66
1.945GlnLys: 1.945 ± 0.687
5.024GlnLeu: 5.024 ± 0.971
0.891GlnMet: 0.891 ± 0.192
1.945GlnAsn: 1.945 ± 0.47
1.864GlnPro: 1.864 ± 0.396
2.917GlnGln: 2.917 ± 1.254
2.35GlnArg: 2.35 ± 0.43
2.431GlnSer: 2.431 ± 0.844
2.836GlnThr: 2.836 ± 0.406
3.079GlnVal: 3.079 ± 0.837
0.972GlnTrp: 0.972 ± 0.34
1.378GlnTyr: 1.378 ± 0.322
0.0GlnXaa: 0.0 ± 0.0
Arg
4.133ArgAla: 4.133 ± 0.521
0.729ArgCys: 0.729 ± 0.275
3.566ArgAsp: 3.566 ± 0.481
2.188ArgGlu: 2.188 ± 0.452
1.702ArgPhe: 1.702 ± 0.351
2.836ArgGly: 2.836 ± 0.477
0.81ArgHis: 0.81 ± 0.233
2.917ArgIle: 2.917 ± 0.599
2.836ArgLys: 2.836 ± 0.645
4.619ArgLeu: 4.619 ± 0.659
1.54ArgMet: 1.54 ± 0.325
2.836ArgAsn: 2.836 ± 0.421
2.188ArgPro: 2.188 ± 0.457
1.783ArgGln: 1.783 ± 0.478
2.269ArgArg: 2.269 ± 0.496
2.998ArgSer: 2.998 ± 0.448
3.16ArgThr: 3.16 ± 0.588
3.566ArgVal: 3.566 ± 0.491
0.972ArgTrp: 0.972 ± 0.306
1.945ArgTyr: 1.945 ± 0.33
0.0ArgXaa: 0.0 ± 0.0
Ser
5.024SerAla: 5.024 ± 0.817
0.243SerCys: 0.243 ± 0.184
3.485SerAsp: 3.485 ± 0.51
3.079SerGlu: 3.079 ± 0.338
2.107SerPhe: 2.107 ± 0.397
5.429SerGly: 5.429 ± 0.651
0.972SerHis: 0.972 ± 0.254
4.295SerIle: 4.295 ± 0.588
3.079SerLys: 3.079 ± 0.479
4.781SerLeu: 4.781 ± 0.686
1.135SerMet: 1.135 ± 0.361
2.836SerAsn: 2.836 ± 0.417
2.188SerPro: 2.188 ± 0.379
3.16SerGln: 3.16 ± 0.77
2.836SerArg: 2.836 ± 0.447
3.16SerSer: 3.16 ± 0.477
2.998SerThr: 2.998 ± 0.42
4.862SerVal: 4.862 ± 0.536
0.972SerTrp: 0.972 ± 0.283
1.621SerTyr: 1.621 ± 0.431
0.0SerXaa: 0.0 ± 0.0
Thr
4.295ThrAla: 4.295 ± 0.794
0.486ThrCys: 0.486 ± 0.203
4.538ThrAsp: 4.538 ± 0.502
3.728ThrGlu: 3.728 ± 0.591
1.864ThrPhe: 1.864 ± 0.472
7.536ThrGly: 7.536 ± 0.889
0.972ThrHis: 0.972 ± 0.272
4.214ThrIle: 4.214 ± 0.604
2.917ThrLys: 2.917 ± 0.632
5.916ThrLeu: 5.916 ± 0.592
1.621ThrMet: 1.621 ± 0.507
3.485ThrAsn: 3.485 ± 0.518
3.728ThrPro: 3.728 ± 0.58
2.107ThrGln: 2.107 ± 0.478
3.566ThrArg: 3.566 ± 0.549
2.836ThrSer: 2.836 ± 0.331
4.538ThrThr: 4.538 ± 0.699
5.267ThrVal: 5.267 ± 0.885
0.81ThrTrp: 0.81 ± 0.243
2.431ThrTyr: 2.431 ± 0.36
0.0ThrXaa: 0.0 ± 0.0
Val
5.348ValAla: 5.348 ± 0.672
0.567ValCys: 0.567 ± 0.161
4.538ValAsp: 4.538 ± 0.633
4.862ValGlu: 4.862 ± 0.678
2.431ValPhe: 2.431 ± 0.443
5.024ValGly: 5.024 ± 0.81
1.216ValHis: 1.216 ± 0.279
4.376ValIle: 4.376 ± 0.6
3.89ValLys: 3.89 ± 0.498
5.348ValLeu: 5.348 ± 0.57
2.107ValMet: 2.107 ± 0.397
4.862ValAsn: 4.862 ± 0.611
2.674ValPro: 2.674 ± 0.509
2.836ValGln: 2.836 ± 0.466
3.241ValArg: 3.241 ± 0.507
4.052ValSer: 4.052 ± 0.538
6.402ValThr: 6.402 ± 0.868
4.295ValVal: 4.295 ± 0.665
0.648ValTrp: 0.648 ± 0.206
2.593ValTyr: 2.593 ± 0.419
0.0ValXaa: 0.0 ± 0.0
Trp
0.567TrpAla: 0.567 ± 0.269
0.081TrpCys: 0.081 ± 0.071
0.972TrpAsp: 0.972 ± 0.282
1.297TrpGlu: 1.297 ± 0.268
0.891TrpPhe: 0.891 ± 0.327
1.54TrpGly: 1.54 ± 0.334
0.162TrpHis: 0.162 ± 0.105
0.81TrpIle: 0.81 ± 0.244
0.891TrpLys: 0.891 ± 0.377
1.783TrpLeu: 1.783 ± 0.299
0.486TrpMet: 0.486 ± 0.164
0.81TrpAsn: 0.81 ± 0.271
0.567TrpPro: 0.567 ± 0.192
0.567TrpGln: 0.567 ± 0.182
0.81TrpArg: 0.81 ± 0.351
0.972TrpSer: 0.972 ± 0.314
0.648TrpThr: 0.648 ± 0.197
0.972TrpVal: 0.972 ± 0.26
0.243TrpTrp: 0.243 ± 0.128
0.729TrpTyr: 0.729 ± 0.342
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.945TyrAla: 1.945 ± 0.392
0.648TyrCys: 0.648 ± 0.203
2.755TyrAsp: 2.755 ± 0.421
2.512TyrGlu: 2.512 ± 0.419
1.135TyrPhe: 1.135 ± 0.361
1.864TyrGly: 1.864 ± 0.392
0.729TyrHis: 0.729 ± 0.314
2.35TyrIle: 2.35 ± 0.493
2.026TyrLys: 2.026 ± 0.403
2.431TyrLeu: 2.431 ± 0.414
0.81TyrMet: 0.81 ± 0.203
1.702TyrAsn: 1.702 ± 0.338
1.864TyrPro: 1.864 ± 0.414
2.35TyrGln: 2.35 ± 0.488
2.026TyrArg: 2.026 ± 0.497
2.269TyrSer: 2.269 ± 0.335
2.188TyrThr: 2.188 ± 0.481
3.323TyrVal: 3.323 ± 0.465
0.567TyrTrp: 0.567 ± 0.193
1.216TyrTyr: 1.216 ± 0.33
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 64 proteins (12341 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski