Amino acid dipepetide frequency for Salmonella phage STP03

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.386AlaAla: 11.386 ± 1.68
1.357AlaCys: 1.357 ± 0.328
6.258AlaAsp: 6.258 ± 0.68
6.108AlaGlu: 6.108 ± 0.784
4.449AlaPhe: 4.449 ± 0.584
7.465AlaGly: 7.465 ± 0.753
2.187AlaHis: 2.187 ± 0.391
4.6AlaIle: 4.6 ± 0.857
5.806AlaLys: 5.806 ± 0.863
7.39AlaLeu: 7.39 ± 0.708
2.187AlaMet: 2.187 ± 0.388
3.544AlaAsn: 3.544 ± 0.494
3.092AlaPro: 3.092 ± 0.479
3.695AlaGln: 3.695 ± 0.955
4.826AlaArg: 4.826 ± 0.561
6.108AlaSer: 6.108 ± 0.774
5.504AlaThr: 5.504 ± 0.658
7.314AlaVal: 7.314 ± 0.844
1.131AlaTrp: 1.131 ± 0.282
3.318AlaTyr: 3.318 ± 0.406
0.0AlaXaa: 0.0 ± 0.0
Cys
0.754CysAla: 0.754 ± 0.221
0.151CysCys: 0.151 ± 0.111
0.829CysAsp: 0.829 ± 0.229
0.98CysGlu: 0.98 ± 0.329
0.377CysPhe: 0.377 ± 0.188
0.679CysGly: 0.679 ± 0.231
0.226CysHis: 0.226 ± 0.125
0.377CysIle: 0.377 ± 0.163
0.829CysLys: 0.829 ± 0.254
0.603CysLeu: 0.603 ± 0.247
0.226CysMet: 0.226 ± 0.123
0.603CysAsn: 0.603 ± 0.226
0.075CysPro: 0.075 ± 0.066
0.226CysGln: 0.226 ± 0.149
0.528CysArg: 0.528 ± 0.207
0.452CysSer: 0.452 ± 0.252
0.679CysThr: 0.679 ± 0.198
0.98CysVal: 0.98 ± 0.279
0.377CysTrp: 0.377 ± 0.151
0.226CysTyr: 0.226 ± 0.11
0.0CysXaa: 0.0 ± 0.0
Asp
6.786AspAla: 6.786 ± 0.78
0.754AspCys: 0.754 ± 0.234
3.619AspAsp: 3.619 ± 0.501
4.072AspGlu: 4.072 ± 0.565
2.715AspPhe: 2.715 ± 0.342
6.032AspGly: 6.032 ± 0.731
0.754AspHis: 0.754 ± 0.235
3.544AspIle: 3.544 ± 0.369
3.544AspLys: 3.544 ± 0.435
4.75AspLeu: 4.75 ± 0.622
1.357AspMet: 1.357 ± 0.245
2.488AspAsn: 2.488 ± 0.462
2.111AspPro: 2.111 ± 0.373
0.603AspGln: 0.603 ± 0.23
2.715AspArg: 2.715 ± 0.431
3.469AspSer: 3.469 ± 0.502
4.298AspThr: 4.298 ± 0.568
3.469AspVal: 3.469 ± 0.489
0.98AspTrp: 0.98 ± 0.265
2.338AspTyr: 2.338 ± 0.454
0.0AspXaa: 0.0 ± 0.0
Glu
6.635GluAla: 6.635 ± 0.758
0.528GluCys: 0.528 ± 0.212
3.695GluAsp: 3.695 ± 0.463
4.901GluGlu: 4.901 ± 0.875
3.167GluPhe: 3.167 ± 0.619
4.524GluGly: 4.524 ± 0.648
0.905GluHis: 0.905 ± 0.257
3.544GluIle: 3.544 ± 0.515
4.6GluLys: 4.6 ± 0.712
6.032GluLeu: 6.032 ± 0.76
2.639GluMet: 2.639 ± 0.393
2.564GluAsn: 2.564 ± 0.461
2.187GluPro: 2.187 ± 0.566
3.242GluGln: 3.242 ± 0.612
3.469GluArg: 3.469 ± 0.635
3.846GluSer: 3.846 ± 0.583
3.695GluThr: 3.695 ± 0.599
4.524GluVal: 4.524 ± 0.543
0.98GluTrp: 0.98 ± 0.291
2.111GluTyr: 2.111 ± 0.472
0.0GluXaa: 0.0 ± 0.0
Phe
2.941PheAla: 2.941 ± 0.432
0.603PheCys: 0.603 ± 0.184
3.016PheAsp: 3.016 ± 0.442
2.413PheGlu: 2.413 ± 0.443
0.679PhePhe: 0.679 ± 0.188
3.016PheGly: 3.016 ± 0.445
0.603PheHis: 0.603 ± 0.197
2.262PheIle: 2.262 ± 0.444
1.734PheLys: 1.734 ± 0.434
2.036PheLeu: 2.036 ± 0.34
0.528PheMet: 0.528 ± 0.177
1.583PheAsn: 1.583 ± 0.368
1.583PhePro: 1.583 ± 0.416
1.433PheGln: 1.433 ± 0.29
2.262PheArg: 2.262 ± 0.338
2.413PheSer: 2.413 ± 0.492
3.092PheThr: 3.092 ± 0.505
3.016PheVal: 3.016 ± 0.545
0.829PheTrp: 0.829 ± 0.258
1.282PheTyr: 1.282 ± 0.336
0.0PheXaa: 0.0 ± 0.0
Gly
7.239GlyAla: 7.239 ± 0.703
0.829GlyCys: 0.829 ± 0.242
3.77GlyAsp: 3.77 ± 0.561
5.731GlyGlu: 5.731 ± 0.997
2.639GlyPhe: 2.639 ± 0.449
6.409GlyGly: 6.409 ± 0.716
1.659GlyHis: 1.659 ± 0.479
3.016GlyIle: 3.016 ± 0.382
4.977GlyLys: 4.977 ± 0.521
5.278GlyLeu: 5.278 ± 0.472
2.413GlyMet: 2.413 ± 0.61
3.544GlyAsn: 3.544 ± 0.544
1.81GlyPro: 1.81 ± 0.356
2.941GlyGln: 2.941 ± 0.434
4.6GlyArg: 4.6 ± 0.615
4.373GlySer: 4.373 ± 0.759
4.223GlyThr: 4.223 ± 0.612
6.258GlyVal: 6.258 ± 0.665
1.282GlyTrp: 1.282 ± 0.269
2.941GlyTyr: 2.941 ± 0.469
0.0GlyXaa: 0.0 ± 0.0
His
1.282HisAla: 1.282 ± 0.354
0.528HisCys: 0.528 ± 0.189
1.131HisAsp: 1.131 ± 0.34
1.056HisGlu: 1.056 ± 0.268
0.603HisPhe: 0.603 ± 0.212
0.829HisGly: 0.829 ± 0.32
0.754HisHis: 0.754 ± 0.247
1.056HisIle: 1.056 ± 0.246
1.357HisLys: 1.357 ± 0.339
1.206HisLeu: 1.206 ± 0.329
0.452HisMet: 0.452 ± 0.185
0.377HisAsn: 0.377 ± 0.142
1.357HisPro: 1.357 ± 0.395
1.206HisGln: 1.206 ± 0.324
0.754HisArg: 0.754 ± 0.215
0.754HisSer: 0.754 ± 0.239
0.905HisThr: 0.905 ± 0.286
0.905HisVal: 0.905 ± 0.23
0.151HisTrp: 0.151 ± 0.111
0.679HisTyr: 0.679 ± 0.223
0.0HisXaa: 0.0 ± 0.0
Ile
4.977IleAla: 4.977 ± 0.828
0.528IleCys: 0.528 ± 0.197
3.921IleAsp: 3.921 ± 0.499
2.564IleGlu: 2.564 ± 0.44
1.357IlePhe: 1.357 ± 0.348
3.092IleGly: 3.092 ± 0.399
0.829IleHis: 0.829 ± 0.242
2.488IleIle: 2.488 ± 0.488
3.469IleLys: 3.469 ± 0.572
3.092IleLeu: 3.092 ± 0.497
0.829IleMet: 0.829 ± 0.235
2.187IleAsn: 2.187 ± 0.427
2.865IlePro: 2.865 ± 0.458
1.734IleGln: 1.734 ± 0.4
2.639IleArg: 2.639 ± 0.366
3.167IleSer: 3.167 ± 0.645
4.373IleThr: 4.373 ± 0.578
3.619IleVal: 3.619 ± 0.409
0.829IleTrp: 0.829 ± 0.24
1.734IleTyr: 1.734 ± 0.39
0.0IleXaa: 0.0 ± 0.0
Lys
5.731LysAla: 5.731 ± 0.797
0.679LysCys: 0.679 ± 0.266
3.619LysAsp: 3.619 ± 0.553
4.449LysGlu: 4.449 ± 0.778
2.111LysPhe: 2.111 ± 0.266
3.846LysGly: 3.846 ± 0.42
1.131LysHis: 1.131 ± 0.267
2.338LysIle: 2.338 ± 0.436
3.092LysLys: 3.092 ± 0.542
5.806LysLeu: 5.806 ± 0.59
2.413LysMet: 2.413 ± 0.505
2.564LysAsn: 2.564 ± 0.405
2.187LysPro: 2.187 ± 0.408
2.111LysGln: 2.111 ± 0.436
3.996LysArg: 3.996 ± 0.577
3.167LysSer: 3.167 ± 0.536
3.846LysThr: 3.846 ± 0.407
3.318LysVal: 3.318 ± 0.544
0.829LysTrp: 0.829 ± 0.271
2.564LysTyr: 2.564 ± 0.352
0.0LysXaa: 0.0 ± 0.0
Leu
7.088LeuAla: 7.088 ± 0.653
0.679LeuCys: 0.679 ± 0.246
4.223LeuAsp: 4.223 ± 0.408
5.052LeuGlu: 5.052 ± 0.811
2.111LeuPhe: 2.111 ± 0.456
4.901LeuGly: 4.901 ± 0.555
0.905LeuHis: 0.905 ± 0.257
4.6LeuIle: 4.6 ± 0.5
4.6LeuLys: 4.6 ± 0.648
5.504LeuLeu: 5.504 ± 0.714
2.262LeuMet: 2.262 ± 0.394
4.223LeuAsn: 4.223 ± 0.536
3.619LeuPro: 3.619 ± 0.571
3.393LeuGln: 3.393 ± 0.547
5.278LeuArg: 5.278 ± 0.662
4.298LeuSer: 4.298 ± 0.58
5.354LeuThr: 5.354 ± 0.449
5.655LeuVal: 5.655 ± 0.592
1.056LeuTrp: 1.056 ± 0.288
2.564LeuTyr: 2.564 ± 0.347
0.0LeuXaa: 0.0 ± 0.0
Met
2.488MetAla: 2.488 ± 0.326
0.226MetCys: 0.226 ± 0.123
1.056MetAsp: 1.056 ± 0.267
1.282MetGlu: 1.282 ± 0.311
0.754MetPhe: 0.754 ± 0.223
1.357MetGly: 1.357 ± 0.313
0.302MetHis: 0.302 ± 0.13
1.282MetIle: 1.282 ± 0.299
1.659MetLys: 1.659 ± 0.363
2.564MetLeu: 2.564 ± 0.503
0.452MetMet: 0.452 ± 0.185
0.98MetAsn: 0.98 ± 0.217
1.357MetPro: 1.357 ± 0.355
0.754MetGln: 0.754 ± 0.203
1.508MetArg: 1.508 ± 0.253
2.187MetSer: 2.187 ± 0.284
2.262MetThr: 2.262 ± 0.466
2.187MetVal: 2.187 ± 0.439
0.528MetTrp: 0.528 ± 0.2
0.679MetTyr: 0.679 ± 0.242
0.0MetXaa: 0.0 ± 0.0
Asn
4.298AsnAla: 4.298 ± 0.609
0.377AsnCys: 0.377 ± 0.144
2.941AsnAsp: 2.941 ± 0.427
2.865AsnGlu: 2.865 ± 0.479
1.659AsnPhe: 1.659 ± 0.38
3.996AsnGly: 3.996 ± 0.595
0.528AsnHis: 0.528 ± 0.191
2.715AsnIle: 2.715 ± 0.447
2.262AsnLys: 2.262 ± 0.432
3.77AsnLeu: 3.77 ± 0.436
0.754AsnMet: 0.754 ± 0.273
2.715AsnAsn: 2.715 ± 0.463
1.885AsnPro: 1.885 ± 0.348
1.131AsnGln: 1.131 ± 0.286
2.338AsnArg: 2.338 ± 0.355
2.187AsnSer: 2.187 ± 0.372
1.885AsnThr: 1.885 ± 0.347
3.167AsnVal: 3.167 ± 0.398
0.754AsnTrp: 0.754 ± 0.213
1.433AsnTyr: 1.433 ± 0.285
0.0AsnXaa: 0.0 ± 0.0
Pro
2.941ProAla: 2.941 ± 0.535
0.377ProCys: 0.377 ± 0.172
3.469ProAsp: 3.469 ± 0.515
3.092ProGlu: 3.092 ± 0.437
1.885ProPhe: 1.885 ± 0.308
3.393ProGly: 3.393 ± 0.591
0.452ProHis: 0.452 ± 0.175
1.734ProIle: 1.734 ± 0.344
2.639ProLys: 2.639 ± 0.407
3.167ProLeu: 3.167 ± 0.545
0.679ProMet: 0.679 ± 0.202
1.659ProAsn: 1.659 ± 0.475
1.433ProPro: 1.433 ± 0.321
1.282ProGln: 1.282 ± 0.306
1.659ProArg: 1.659 ± 0.287
2.187ProSer: 2.187 ± 0.385
1.433ProThr: 1.433 ± 0.336
4.072ProVal: 4.072 ± 0.538
0.377ProTrp: 0.377 ± 0.179
1.282ProTyr: 1.282 ± 0.399
0.0ProXaa: 0.0 ± 0.0
Gln
4.072GlnAla: 4.072 ± 0.617
0.226GlnCys: 0.226 ± 0.14
1.357GlnAsp: 1.357 ± 0.301
2.187GlnGlu: 2.187 ± 0.502
1.357GlnPhe: 1.357 ± 0.369
2.488GlnGly: 2.488 ± 0.423
0.603GlnHis: 0.603 ± 0.229
2.413GlnIle: 2.413 ± 0.526
2.187GlnLys: 2.187 ± 0.406
3.167GlnLeu: 3.167 ± 0.442
1.282GlnMet: 1.282 ± 0.27
2.036GlnAsn: 2.036 ± 0.408
2.036GlnPro: 2.036 ± 0.374
2.715GlnGln: 2.715 ± 0.779
1.659GlnArg: 1.659 ± 0.297
2.036GlnSer: 2.036 ± 0.352
1.734GlnThr: 1.734 ± 0.368
2.413GlnVal: 2.413 ± 0.425
0.603GlnTrp: 0.603 ± 0.218
1.206GlnTyr: 1.206 ± 0.237
0.0GlnXaa: 0.0 ± 0.0
Arg
5.278ArgAla: 5.278 ± 0.566
0.377ArgCys: 0.377 ± 0.154
3.318ArgAsp: 3.318 ± 0.487
4.147ArgGlu: 4.147 ± 0.593
1.81ArgPhe: 1.81 ± 0.354
3.619ArgGly: 3.619 ± 0.474
1.206ArgHis: 1.206 ± 0.341
2.865ArgIle: 2.865 ± 0.433
3.544ArgLys: 3.544 ± 0.599
4.223ArgLeu: 4.223 ± 0.541
1.96ArgMet: 1.96 ± 0.355
3.092ArgAsn: 3.092 ± 0.462
2.111ArgPro: 2.111 ± 0.386
2.564ArgGln: 2.564 ± 0.397
4.675ArgArg: 4.675 ± 0.712
1.96ArgSer: 1.96 ± 0.408
2.79ArgThr: 2.79 ± 0.506
4.449ArgVal: 4.449 ± 0.481
0.905ArgTrp: 0.905 ± 0.226
1.81ArgTyr: 1.81 ± 0.425
0.0ArgXaa: 0.0 ± 0.0
Ser
6.183SerAla: 6.183 ± 1.032
0.226SerCys: 0.226 ± 0.123
3.167SerAsp: 3.167 ± 0.464
3.846SerGlu: 3.846 ± 0.713
2.187SerPhe: 2.187 ± 0.393
6.409SerGly: 6.409 ± 0.782
0.905SerHis: 0.905 ± 0.234
2.187SerIle: 2.187 ± 0.435
2.79SerLys: 2.79 ± 0.345
4.977SerLeu: 4.977 ± 0.685
1.433SerMet: 1.433 ± 0.315
2.413SerAsn: 2.413 ± 0.319
1.734SerPro: 1.734 ± 0.361
1.96SerGln: 1.96 ± 0.358
3.167SerArg: 3.167 ± 0.51
2.865SerSer: 2.865 ± 0.384
3.921SerThr: 3.921 ± 0.616
4.449SerVal: 4.449 ± 0.681
0.679SerTrp: 0.679 ± 0.195
1.885SerTyr: 1.885 ± 0.377
0.0SerXaa: 0.0 ± 0.0
Thr
6.334ThrAla: 6.334 ± 0.577
0.452ThrCys: 0.452 ± 0.178
4.072ThrAsp: 4.072 ± 0.526
3.619ThrGlu: 3.619 ± 0.551
2.79ThrPhe: 2.79 ± 0.497
5.731ThrGly: 5.731 ± 0.852
1.131ThrHis: 1.131 ± 0.319
3.092ThrIle: 3.092 ± 0.443
3.242ThrLys: 3.242 ± 0.411
4.75ThrLeu: 4.75 ± 0.64
1.131ThrMet: 1.131 ± 0.288
1.81ThrAsn: 1.81 ± 0.348
3.393ThrPro: 3.393 ± 0.455
1.508ThrGln: 1.508 ± 0.354
3.393ThrArg: 3.393 ± 0.428
4.675ThrSer: 4.675 ± 0.633
3.695ThrThr: 3.695 ± 0.534
4.524ThrVal: 4.524 ± 0.722
0.98ThrTrp: 0.98 ± 0.236
2.338ThrTyr: 2.338 ± 0.394
0.0ThrXaa: 0.0 ± 0.0
Val
6.485ValAla: 6.485 ± 0.628
0.754ValCys: 0.754 ± 0.268
3.846ValAsp: 3.846 ± 0.498
6.032ValGlu: 6.032 ± 0.709
2.564ValPhe: 2.564 ± 0.446
4.449ValGly: 4.449 ± 0.579
1.056ValHis: 1.056 ± 0.251
3.921ValIle: 3.921 ± 0.621
4.6ValLys: 4.6 ± 0.756
4.977ValLeu: 4.977 ± 0.62
1.433ValMet: 1.433 ± 0.369
3.393ValAsn: 3.393 ± 0.54
2.564ValPro: 2.564 ± 0.683
2.941ValGln: 2.941 ± 0.388
3.619ValArg: 3.619 ± 0.465
5.127ValSer: 5.127 ± 0.666
5.58ValThr: 5.58 ± 0.693
5.58ValVal: 5.58 ± 0.809
0.905ValTrp: 0.905 ± 0.233
3.092ValTyr: 3.092 ± 0.418
0.0ValXaa: 0.0 ± 0.0
Trp
1.282TrpAla: 1.282 ± 0.357
0.151TrpCys: 0.151 ± 0.106
0.754TrpAsp: 0.754 ± 0.218
0.754TrpGlu: 0.754 ± 0.222
0.679TrpPhe: 0.679 ± 0.294
0.98TrpGly: 0.98 ± 0.254
0.452TrpHis: 0.452 ± 0.188
0.679TrpIle: 0.679 ± 0.254
0.452TrpLys: 0.452 ± 0.16
1.659TrpLeu: 1.659 ± 0.333
0.528TrpMet: 0.528 ± 0.219
0.603TrpAsn: 0.603 ± 0.227
0.528TrpPro: 0.528 ± 0.191
0.754TrpGln: 0.754 ± 0.223
1.056TrpArg: 1.056 ± 0.344
0.302TrpSer: 0.302 ± 0.141
0.98TrpThr: 0.98 ± 0.223
1.282TrpVal: 1.282 ± 0.272
0.452TrpTrp: 0.452 ± 0.195
0.377TrpTyr: 0.377 ± 0.145
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.544TyrAla: 3.544 ± 0.531
0.377TyrCys: 0.377 ± 0.143
2.338TyrAsp: 2.338 ± 0.549
2.639TyrGlu: 2.639 ± 0.471
1.433TyrPhe: 1.433 ± 0.381
2.715TyrGly: 2.715 ± 0.436
0.754TyrHis: 0.754 ± 0.261
1.583TyrIle: 1.583 ± 0.324
2.488TyrLys: 2.488 ± 0.497
2.413TyrLeu: 2.413 ± 0.455
0.829TyrMet: 0.829 ± 0.191
1.282TyrAsn: 1.282 ± 0.239
1.282TyrPro: 1.282 ± 0.304
1.583TyrGln: 1.583 ± 0.379
2.564TyrArg: 2.564 ± 0.505
1.81TyrSer: 1.81 ± 0.386
2.413TyrThr: 2.413 ± 0.338
1.734TyrVal: 1.734 ± 0.383
0.075TyrTrp: 0.075 ± 0.089
1.131TyrTyr: 1.131 ± 0.216
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 70 proteins (13263 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski