Amino acid dipeptide frequency for Listeria phage LP-101

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
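The calculation described above can be sketched in Python as follows. This is only an illustration under stated assumptions, not the code used by Proteome-pI: it assumes that dipeptides are counted as overlapping pairs within each protein, that any non-standard residue is mapped to X (Xaa), and that the reference level is the uniform expectation of 2.5‰. The function names `dipeptide_permille` and `classify` and the toy sequences are hypothetical.

```python
from collections import Counter

# The 20 standard residues (one-letter codes); anything else is treated as X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: non-standard or unknown residues are mapped to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs; a pair never spans two proteins.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, alphabet_size=20):
    """Compare a frequency with the uniform expectation (2.5 per mille for 400 pairs)."""
    expected = 1000.0 / alphabet_size ** 2
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


toy = ["MKKLAAK", "AKKL"]  # made-up sequences, not taken from LP-101
freqs = dipeptide_permille(toy)
```

With the values from the table, `classify(3.917)` (AlaAla) returns "over" and `classify(0.753)` (AlaCys) returns "under", which corresponds to the red and blue marking.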

For more information see sequence space article on Wikipedia.

Second residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Each entry: dipeptide: frequency (‰) ± error, grouped by first residue

Ala
AlaAla: 3.917 ± 1.481
AlaCys: 0.753 ± 0.217
AlaAsp: 4.218 ± 0.675
AlaGlu: 5.423 ± 0.687
AlaPhe: 3.239 ± 0.595
AlaGly: 5.197 ± 1.481
AlaHis: 0.753 ± 0.215
AlaIle: 4.143 ± 0.62
AlaLys: 6.176 ± 0.771
AlaLeu: 4.745 ± 0.77
AlaMet: 1.958 ± 0.342
AlaAsn: 3.841 ± 0.675
AlaPro: 1.732 ± 0.357
AlaGln: 2.41 ± 0.451
AlaArg: 2.184 ± 0.455
AlaSer: 5.046 ± 0.761
AlaThr: 4.143 ± 0.484
AlaVal: 4.896 ± 0.809
AlaTrp: 1.732 ± 0.36
AlaTyr: 2.184 ± 0.28
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.452 ± 0.156
CysCys: 0.226 ± 0.148
CysAsp: 0.904 ± 0.228
CysGlu: 0.829 ± 0.244
CysPhe: 0.452 ± 0.202
CysGly: 0.829 ± 0.297
CysHis: 0.151 ± 0.12
CysIle: 0.301 ± 0.13
CysLys: 0.603 ± 0.246
CysLeu: 0.678 ± 0.264
CysMet: 0.151 ± 0.106
CysAsn: 0.527 ± 0.221
CysPro: 0.226 ± 0.127
CysGln: 0.301 ± 0.163
CysArg: 0.151 ± 0.114
CysSer: 0.301 ± 0.137
CysThr: 0.226 ± 0.109
CysVal: 0.753 ± 0.241
CysTrp: 0.226 ± 0.128
CysTyr: 0.301 ± 0.139
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.239 ± 0.44
AspCys: 0.829 ± 0.236
AspAsp: 4.82 ± 0.56
AspGlu: 5.649 ± 0.876
AspPhe: 3.088 ± 0.454
AspGly: 3.992 ± 0.575
AspHis: 0.678 ± 0.242
AspIle: 5.498 ± 0.692
AspLys: 5.272 ± 0.681
AspLeu: 5.724 ± 0.579
AspMet: 1.958 ± 0.431
AspAsn: 4.218 ± 0.481
AspPro: 1.356 ± 0.353
AspGln: 1.28 ± 0.346
AspArg: 1.732 ± 0.341
AspSer: 2.335 ± 0.543
AspThr: 3.239 ± 0.411
AspVal: 4.594 ± 0.481
AspTrp: 0.979 ± 0.3
AspTyr: 3.088 ± 0.523
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.745 ± 0.57
GluCys: 1.28 ± 0.316
GluAsp: 4.82 ± 0.847
GluGlu: 6.779 ± 1.101
GluPhe: 3.088 ± 0.614
GluGly: 4.218 ± 0.63
GluHis: 1.506 ± 0.329
GluIle: 5.498 ± 0.736
GluLys: 6.553 ± 0.7
GluLeu: 7.833 ± 0.679
GluMet: 2.862 ± 0.385
GluAsn: 4.368 ± 0.554
GluPro: 1.28 ± 0.275
GluGln: 2.561 ± 0.539
GluArg: 3.54 ± 0.694
GluSer: 4.218 ± 0.629
GluThr: 3.992 ± 0.512
GluVal: 5.724 ± 0.741
GluTrp: 1.054 ± 0.288
GluTyr: 3.992 ± 0.651
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.615 ± 0.431
PheCys: 0.603 ± 0.238
PheAsp: 3.013 ± 0.45
PheGlu: 3.389 ± 0.467
PhePhe: 1.657 ± 0.283
PheGly: 3.013 ± 0.404
PheHis: 0.753 ± 0.257
PheIle: 2.787 ± 0.583
PheLys: 4.067 ± 0.479
PheLeu: 3.239 ± 0.523
PheMet: 1.054 ± 0.358
PheAsn: 2.636 ± 0.44
PhePro: 1.356 ± 0.293
PheGln: 1.431 ± 0.321
PheArg: 1.732 ± 0.379
PheSer: 2.937 ± 0.42
PheThr: 2.41 ± 0.543
PheVal: 2.561 ± 0.56
PheTrp: 0.377 ± 0.169
PheTyr: 1.356 ± 0.328
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.067 ± 1.037
GlyCys: 0.377 ± 0.171
GlyAsp: 3.615 ± 0.477
GlyGlu: 4.594 ± 0.597
GlyPhe: 3.314 ± 0.493
GlyGly: 3.841 ± 0.667
GlyHis: 0.753 ± 0.227
GlyIle: 3.465 ± 0.445
GlyLys: 7.005 ± 0.702
GlyLeu: 5.875 ± 0.872
GlyMet: 1.506 ± 0.297
GlyAsn: 3.465 ± 0.53
GlyPro: 0.753 ± 0.218
GlyGln: 2.335 ± 0.39
GlyArg: 2.109 ± 0.415
GlySer: 3.54 ± 0.76
GlyThr: 3.615 ± 0.596
GlyVal: 3.314 ± 0.494
GlyTrp: 0.678 ± 0.221
GlyTyr: 2.41 ± 0.44
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.678 ± 0.253
HisCys: 0.151 ± 0.095
HisAsp: 0.904 ± 0.277
HisGlu: 1.205 ± 0.312
HisPhe: 0.452 ± 0.19
HisGly: 0.603 ± 0.238
HisHis: 0.603 ± 0.23
HisIle: 1.431 ± 0.326
HisLys: 1.205 ± 0.318
HisLeu: 1.808 ± 0.49
HisMet: 0.151 ± 0.119
HisAsn: 0.904 ± 0.355
HisPro: 0.603 ± 0.197
HisGln: 0.377 ± 0.17
HisArg: 0.753 ± 0.231
HisSer: 0.979 ± 0.247
HisThr: 0.753 ± 0.251
HisVal: 1.28 ± 0.354
HisTrp: 0.151 ± 0.119
HisTyr: 0.527 ± 0.241
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.745 ± 0.475
IleCys: 0.753 ± 0.249
IleAsp: 4.67 ± 0.769
IleGlu: 5.875 ± 0.895
IlePhe: 2.937 ± 0.493
IleGly: 3.54 ± 0.587
IleHis: 1.205 ± 0.259
IleIle: 5.498 ± 0.832
IleLys: 6.025 ± 0.649
IleLeu: 4.745 ± 0.708
IleMet: 0.979 ± 0.238
IleAsn: 5.348 ± 0.624
IlePro: 1.958 ± 0.403
IleGln: 2.711 ± 0.498
IleArg: 2.636 ± 0.532
IleSer: 4.218 ± 0.581
IleThr: 3.841 ± 0.51
IleVal: 3.163 ± 0.526
IleTrp: 0.829 ± 0.246
IleTyr: 2.862 ± 0.525
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.306 ± 1.236
LysCys: 0.452 ± 0.221
LysAsp: 5.272 ± 0.538
LysGlu: 7.532 ± 0.907
LysPhe: 3.088 ± 0.414
LysGly: 4.293 ± 0.819
LysHis: 1.657 ± 0.478
LysIle: 5.649 ± 0.589
LysLys: 7.984 ± 0.934
LysLeu: 8.21 ± 0.784
LysMet: 2.937 ± 0.592
LysAsn: 6.402 ± 0.542
LysPro: 2.26 ± 0.405
LysGln: 3.239 ± 0.504
LysArg: 4.218 ± 0.553
LysSer: 4.444 ± 0.59
LysThr: 6.025 ± 0.639
LysVal: 5.8 ± 0.601
LysTrp: 1.13 ± 0.264
LysTyr: 3.841 ± 0.525
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.724 ± 0.811
LeuCys: 0.678 ± 0.232
LeuAsp: 5.272 ± 0.566
LeuGlu: 6.251 ± 0.855
LeuPhe: 3.841 ± 0.693
LeuGly: 4.67 ± 0.618
LeuHis: 1.205 ± 0.322
LeuIle: 5.423 ± 0.759
LeuLys: 7.758 ± 0.943
LeuLeu: 6.101 ± 0.79
LeuMet: 1.506 ± 0.353
LeuAsn: 6.176 ± 0.619
LeuPro: 2.636 ± 0.394
LeuGln: 2.862 ± 0.549
LeuArg: 2.636 ± 0.536
LeuSer: 6.628 ± 1.492
LeuThr: 5.122 ± 0.851
LeuVal: 3.917 ± 0.692
LeuTrp: 0.829 ± 0.258
LeuTyr: 3.314 ± 0.529
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.034 ± 0.32
MetCys: 0.151 ± 0.107
MetAsp: 1.582 ± 0.358
MetGlu: 1.28 ± 0.326
MetPhe: 1.054 ± 0.287
MetGly: 0.904 ± 0.233
MetHis: 0.301 ± 0.125
MetIle: 1.808 ± 0.337
MetLys: 3.389 ± 0.478
MetLeu: 1.582 ± 0.339
MetMet: 0.301 ± 0.197
MetAsn: 1.356 ± 0.24
MetPro: 0.904 ± 0.337
MetGln: 0.753 ± 0.292
MetArg: 1.205 ± 0.304
MetSer: 1.28 ± 0.323
MetThr: 1.657 ± 0.364
MetVal: 0.527 ± 0.205
MetTrp: 0.452 ± 0.17
MetTyr: 0.904 ± 0.295
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.594 ± 0.764
AsnCys: 0.075 ± 0.082
AsnAsp: 4.067 ± 0.444
AsnGlu: 5.272 ± 0.615
AsnPhe: 1.808 ± 0.387
AsnGly: 5.574 ± 0.602
AsnHis: 0.904 ± 0.211
AsnIle: 3.992 ± 0.586
AsnLys: 5.574 ± 0.744
AsnLeu: 4.896 ± 0.537
AsnMet: 1.205 ± 0.29
AsnAsn: 3.992 ± 0.567
AsnPro: 2.26 ± 0.509
AsnGln: 2.26 ± 0.409
AsnArg: 1.732 ± 0.398
AsnSer: 4.368 ± 0.779
AsnThr: 4.218 ± 0.645
AsnVal: 3.239 ± 0.53
AsnTrp: 0.979 ± 0.307
AsnTyr: 2.335 ± 0.455
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.26 ± 0.435
ProCys: 0.0 ± 0.0
ProAsp: 1.506 ± 0.357
ProGlu: 3.239 ± 0.498
ProPhe: 1.13 ± 0.336
ProGly: 0.904 ± 0.306
ProHis: 0.603 ± 0.243
ProIle: 0.753 ± 0.176
ProLys: 1.356 ± 0.282
ProLeu: 2.41 ± 0.456
ProMet: 0.301 ± 0.144
ProAsn: 0.904 ± 0.266
ProPro: 0.753 ± 0.28
ProGln: 1.054 ± 0.226
ProArg: 1.28 ± 0.37
ProSer: 1.28 ± 0.286
ProThr: 1.356 ± 0.293
ProVal: 2.636 ± 0.414
ProTrp: 0.226 ± 0.098
ProTyr: 0.829 ± 0.234
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.561 ± 0.583
GlnCys: 0.301 ± 0.15
GlnAsp: 1.28 ± 0.31
GlnGlu: 2.41 ± 0.34
GlnPhe: 1.657 ± 0.377
GlnGly: 1.582 ± 0.423
GlnHis: 0.829 ± 0.235
GlnIle: 3.163 ± 0.619
GlnLys: 2.561 ± 0.538
GlnLeu: 3.088 ± 0.496
GlnMet: 0.603 ± 0.202
GlnAsn: 1.657 ± 0.333
GlnPro: 0.527 ± 0.21
GlnGln: 1.732 ± 0.464
GlnArg: 1.205 ± 0.291
GlnSer: 3.013 ± 0.446
GlnThr: 2.335 ± 0.556
GlnVal: 1.732 ± 0.375
GlnTrp: 0.226 ± 0.141
GlnTyr: 1.28 ± 0.291
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.034 ± 0.42
ArgCys: 0.301 ± 0.173
ArgAsp: 1.28 ± 0.358
ArgGlu: 2.862 ± 0.477
ArgPhe: 2.26 ± 0.422
ArgGly: 2.26 ± 0.385
ArgHis: 0.753 ± 0.296
ArgIle: 2.109 ± 0.539
ArgLys: 3.465 ± 0.555
ArgLeu: 2.636 ± 0.585
ArgMet: 1.205 ± 0.289
ArgAsn: 1.958 ± 0.429
ArgPro: 0.678 ± 0.208
ArgGln: 1.356 ± 0.36
ArgArg: 1.431 ± 0.462
ArgSer: 2.26 ± 0.442
ArgThr: 2.636 ± 0.499
ArgVal: 1.657 ± 0.345
ArgTrp: 0.226 ± 0.12
ArgTyr: 1.958 ± 0.36
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.465 ± 0.922
SerCys: 0.452 ± 0.175
SerAsp: 4.519 ± 0.544
SerGlu: 4.594 ± 0.553
SerPhe: 3.465 ± 0.659
SerGly: 4.594 ± 0.824
SerHis: 0.452 ± 0.206
SerIle: 5.046 ± 0.644
SerLys: 5.348 ± 0.887
SerLeu: 5.046 ± 0.853
SerMet: 0.678 ± 0.199
SerAsn: 4.067 ± 0.471
SerPro: 1.205 ± 0.29
SerGln: 2.034 ± 0.286
SerArg: 1.28 ± 0.309
SerSer: 3.841 ± 0.624
SerThr: 3.615 ± 0.547
SerVal: 4.368 ± 0.596
SerTrp: 0.603 ± 0.208
SerTyr: 2.937 ± 0.386
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.519 ± 0.602
ThrCys: 0.452 ± 0.167
ThrAsp: 3.239 ± 0.541
ThrGlu: 4.143 ± 0.624
ThrPhe: 1.883 ± 0.41
ThrGly: 3.992 ± 0.605
ThrHis: 0.979 ± 0.285
ThrIle: 3.992 ± 0.595
ThrLys: 6.477 ± 0.801
ThrLeu: 5.649 ± 0.592
ThrMet: 0.904 ± 0.29
ThrAsn: 3.992 ± 0.626
ThrPro: 1.808 ± 0.355
ThrGln: 1.28 ± 0.265
ThrArg: 1.28 ± 0.249
ThrSer: 3.917 ± 0.442
ThrThr: 3.615 ± 0.618
ThrVal: 3.54 ± 0.518
ThrTrp: 1.054 ± 0.274
ThrTyr: 1.883 ± 0.514
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.122 ± 0.894
ValCys: 0.301 ± 0.131
ValAsp: 4.519 ± 0.631
ValGlu: 4.067 ± 0.595
ValPhe: 3.389 ± 0.657
ValGly: 3.239 ± 0.441
ValHis: 0.603 ± 0.208
ValIle: 3.841 ± 0.396
ValLys: 5.498 ± 0.685
ValLeu: 4.143 ± 0.577
ValMet: 1.582 ± 0.343
ValAsn: 4.444 ± 0.538
ValPro: 1.657 ± 0.398
ValGln: 2.109 ± 0.462
ValArg: 2.26 ± 0.406
ValSer: 3.766 ± 0.648
ValThr: 3.088 ± 0.572
ValVal: 4.594 ± 0.73
ValTrp: 1.13 ± 0.354
ValTyr: 2.109 ± 0.47
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.054 ± 0.35
TrpCys: 0.151 ± 0.093
TrpAsp: 1.205 ± 0.267
TrpGlu: 1.13 ± 0.352
TrpPhe: 0.678 ± 0.178
TrpGly: 0.603 ± 0.21
TrpHis: 0.226 ± 0.139
TrpIle: 0.603 ± 0.214
TrpLys: 1.431 ± 0.406
TrpLeu: 1.28 ± 0.378
TrpMet: 0.527 ± 0.188
TrpAsn: 0.603 ± 0.263
TrpPro: 0.0 ± 0.0
TrpGln: 0.452 ± 0.218
TrpArg: 0.603 ± 0.214
TrpSer: 0.979 ± 0.309
TrpThr: 0.377 ± 0.177
TrpVal: 0.904 ± 0.295
TrpTrp: 0.075 ± 0.083
TrpTyr: 0.377 ± 0.17
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.561 ± 0.477
TyrCys: 0.301 ± 0.18
TyrAsp: 2.862 ± 0.485
TyrGlu: 3.389 ± 0.565
TyrPhe: 1.506 ± 0.406
TyrGly: 2.787 ± 0.496
TyrHis: 0.603 ± 0.222
TyrIle: 3.54 ± 0.565
TyrLys: 3.992 ± 0.503
TyrLeu: 2.862 ± 0.428
TyrMet: 0.979 ± 0.221
TyrAsn: 2.486 ± 0.383
TyrPro: 0.829 ± 0.243
TyrGln: 1.28 ± 0.273
TyrArg: 1.356 ± 0.328
TyrSer: 2.26 ± 0.399
TyrThr: 2.335 ± 0.421
TyrVal: 2.26 ± 0.472
TyrTrp: 0.301 ± 0.155
TyrTyr: 2.26 ± 0.463
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 70 proteins (13278 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
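A bootstrap at the protein level can be sketched as below. The page does not specify the exact procedure, so this is an assumption-laden illustration: whole proteins are resampled with replacement, the frequency is recomputed for each of the 100 replicates, and the reported error is taken to be the standard deviation across replicates. The names `pair_permille` and `bootstrap_error`, the one-letter dipeptide codes, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency across resamples of whole proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample proteins (not residues) with replacement, keeping the sample size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    # Assumption: the "error" is the spread (sample std. dev.) of the replicates.
    return statistics.stdev(estimates)


toy = ["MKKLAAK", "AKKL", "GAAKKE", "LLKKA"]  # made-up sequences
err = bootstrap_error(toy, "KK")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.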

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski