Amino acid dipepetide frequency for Lactobacillus phage phiadh

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.453AlaAla: 2.453 ± 0.507
0.446AlaCys: 0.446 ± 0.166
4.461AlaAsp: 4.461 ± 0.582
3.643AlaGlu: 3.643 ± 0.567
3.048AlaPhe: 3.048 ± 0.627
5.13AlaGly: 5.13 ± 0.838
1.041AlaHis: 1.041 ± 0.198
6.022AlaIle: 6.022 ± 0.643
6.245AlaLys: 6.245 ± 0.83
5.204AlaLeu: 5.204 ± 0.651
1.933AlaMet: 1.933 ± 0.333
4.832AlaAsn: 4.832 ± 0.612
0.818AlaPro: 0.818 ± 0.276
2.751AlaGln: 2.751 ± 0.598
3.345AlaArg: 3.345 ± 0.438
3.42AlaSer: 3.42 ± 0.533
3.792AlaThr: 3.792 ± 0.507
2.899AlaVal: 2.899 ± 0.473
1.19AlaTrp: 1.19 ± 0.282
2.528AlaTyr: 2.528 ± 0.504
0.0AlaXaa: 0.0 ± 0.0
Cys
0.297CysAla: 0.297 ± 0.146
0.297CysCys: 0.297 ± 0.131
0.52CysAsp: 0.52 ± 0.193
0.372CysGlu: 0.372 ± 0.203
0.446CysPhe: 0.446 ± 0.166
0.52CysGly: 0.52 ± 0.196
0.297CysHis: 0.297 ± 0.179
0.149CysIle: 0.149 ± 0.112
0.372CysLys: 0.372 ± 0.163
0.595CysLeu: 0.595 ± 0.228
0.074CysMet: 0.074 ± 0.072
0.52CysAsn: 0.52 ± 0.22
0.595CysPro: 0.595 ± 0.251
0.223CysGln: 0.223 ± 0.14
0.372CysArg: 0.372 ± 0.225
0.892CysSer: 0.892 ± 0.276
0.595CysThr: 0.595 ± 0.216
0.074CysVal: 0.074 ± 0.073
0.149CysTrp: 0.149 ± 0.118
0.372CysTyr: 0.372 ± 0.146
0.0CysXaa: 0.0 ± 0.0
Asp
3.643AspAla: 3.643 ± 0.653
0.669AspCys: 0.669 ± 0.215
3.122AspAsp: 3.122 ± 0.589
5.13AspGlu: 5.13 ± 0.714
3.494AspPhe: 3.494 ± 0.495
5.055AspGly: 5.055 ± 0.665
0.818AspHis: 0.818 ± 0.281
5.055AspIle: 5.055 ± 0.685
5.576AspLys: 5.576 ± 0.626
6.468AspLeu: 6.468 ± 0.879
1.561AspMet: 1.561 ± 0.325
3.569AspAsn: 3.569 ± 0.666
2.974AspPro: 2.974 ± 0.405
2.825AspGln: 2.825 ± 0.536
2.528AspArg: 2.528 ± 0.397
3.643AspSer: 3.643 ± 0.512
3.048AspThr: 3.048 ± 0.394
3.345AspVal: 3.345 ± 0.486
1.71AspTrp: 1.71 ± 0.464
3.122AspTyr: 3.122 ± 0.511
0.0AspXaa: 0.0 ± 0.0
Glu
4.758GluAla: 4.758 ± 0.763
0.223GluCys: 0.223 ± 0.126
3.569GluAsp: 3.569 ± 0.587
2.602GluGlu: 2.602 ± 0.622
2.528GluPhe: 2.528 ± 0.532
3.494GluGly: 3.494 ± 0.46
0.446GluHis: 0.446 ± 0.183
3.122GluIle: 3.122 ± 0.471
4.386GluLys: 4.386 ± 0.455
5.055GluLeu: 5.055 ± 0.802
1.71GluMet: 1.71 ± 0.38
2.974GluAsn: 2.974 ± 0.482
1.71GluPro: 1.71 ± 0.368
2.676GluGln: 2.676 ± 0.53
2.676GluArg: 2.676 ± 0.43
2.899GluSer: 2.899 ± 0.528
3.494GluThr: 3.494 ± 0.492
3.494GluVal: 3.494 ± 0.542
1.115GluTrp: 1.115 ± 0.279
2.676GluTyr: 2.676 ± 0.562
0.0GluXaa: 0.0 ± 0.0
Phe
2.379PheAla: 2.379 ± 0.318
0.223PheCys: 0.223 ± 0.119
2.825PheAsp: 2.825 ± 0.401
2.156PheGlu: 2.156 ± 0.406
1.933PhePhe: 1.933 ± 0.477
3.792PheGly: 3.792 ± 1.042
0.743PheHis: 0.743 ± 0.262
2.751PheIle: 2.751 ± 0.417
3.345PheLys: 3.345 ± 0.492
2.676PheLeu: 2.676 ± 0.541
0.595PheMet: 0.595 ± 0.222
2.528PheAsn: 2.528 ± 0.39
1.115PhePro: 1.115 ± 0.291
1.487PheGln: 1.487 ± 0.319
2.082PheArg: 2.082 ± 0.421
2.156PheSer: 2.156 ± 0.311
2.23PheThr: 2.23 ± 0.465
1.859PheVal: 1.859 ± 0.344
0.818PheTrp: 0.818 ± 0.219
1.338PheTyr: 1.338 ± 0.322
0.0PheXaa: 0.0 ± 0.0
Gly
3.494GlyAla: 3.494 ± 0.863
0.297GlyCys: 0.297 ± 0.146
3.643GlyAsp: 3.643 ± 0.604
3.569GlyGlu: 3.569 ± 0.45
2.156GlyPhe: 2.156 ± 0.403
4.461GlyGly: 4.461 ± 1.236
1.487GlyHis: 1.487 ± 0.299
4.832GlyIle: 4.832 ± 0.746
7.286GlyLys: 7.286 ± 1.54
5.278GlyLeu: 5.278 ± 0.681
1.933GlyMet: 1.933 ± 0.336
4.386GlyAsn: 4.386 ± 0.687
1.636GlyPro: 1.636 ± 0.458
2.453GlyGln: 2.453 ± 0.549
2.23GlyArg: 2.23 ± 0.423
4.312GlySer: 4.312 ± 0.601
3.94GlyThr: 3.94 ± 0.718
3.569GlyVal: 3.569 ± 0.591
1.561GlyTrp: 1.561 ± 0.301
2.751GlyTyr: 2.751 ± 0.463
0.0GlyXaa: 0.0 ± 0.0
His
1.264HisAla: 1.264 ± 0.304
0.149HisCys: 0.149 ± 0.11
0.743HisAsp: 0.743 ± 0.249
1.041HisGlu: 1.041 ± 0.311
0.818HisPhe: 0.818 ± 0.221
1.338HisGly: 1.338 ± 0.329
0.52HisHis: 0.52 ± 0.201
1.041HisIle: 1.041 ± 0.295
1.71HisLys: 1.71 ± 0.37
1.19HisLeu: 1.19 ± 0.302
0.52HisMet: 0.52 ± 0.183
1.115HisAsn: 1.115 ± 0.296
0.446HisPro: 0.446 ± 0.197
0.446HisGln: 0.446 ± 0.222
0.743HisArg: 0.743 ± 0.239
1.784HisSer: 1.784 ± 0.273
1.19HisThr: 1.19 ± 0.278
1.041HisVal: 1.041 ± 0.305
0.446HisTrp: 0.446 ± 0.176
0.743HisTyr: 0.743 ± 0.253
0.0HisXaa: 0.0 ± 0.0
Ile
4.386IleAla: 4.386 ± 0.6
0.446IleCys: 0.446 ± 0.222
5.427IleAsp: 5.427 ± 0.729
3.792IleGlu: 3.792 ± 0.591
2.305IlePhe: 2.305 ± 0.286
3.792IleGly: 3.792 ± 0.761
1.041IleHis: 1.041 ± 0.216
3.792IleIle: 3.792 ± 0.677
4.535IleLys: 4.535 ± 0.594
4.163IleLeu: 4.163 ± 0.521
1.933IleMet: 1.933 ± 0.466
4.832IleAsn: 4.832 ± 0.505
2.676IlePro: 2.676 ± 0.533
3.717IleGln: 3.717 ± 0.565
2.751IleArg: 2.751 ± 0.478
5.501IleSer: 5.501 ± 0.691
5.576IleThr: 5.576 ± 0.88
3.643IleVal: 3.643 ± 0.522
0.966IleTrp: 0.966 ± 0.197
2.602IleTyr: 2.602 ± 0.511
0.0IleXaa: 0.0 ± 0.0
Lys
5.948LysAla: 5.948 ± 0.571
0.669LysCys: 0.669 ± 0.251
5.13LysAsp: 5.13 ± 0.7
5.576LysGlu: 5.576 ± 0.815
3.271LysPhe: 3.271 ± 0.552
4.163LysGly: 4.163 ± 0.992
1.561LysHis: 1.561 ± 0.445
5.948LysIle: 5.948 ± 0.677
7.063LysLys: 7.063 ± 1.226
7.434LysLeu: 7.434 ± 0.872
2.305LysMet: 2.305 ± 0.419
6.468LysAsn: 6.468 ± 0.908
2.974LysPro: 2.974 ± 0.499
4.386LysGln: 4.386 ± 0.399
3.345LysArg: 3.345 ± 0.594
5.724LysSer: 5.724 ± 0.934
4.758LysThr: 4.758 ± 0.522
4.089LysVal: 4.089 ± 0.442
1.636LysTrp: 1.636 ± 0.425
3.792LysTyr: 3.792 ± 0.502
0.0LysXaa: 0.0 ± 0.0
Leu
5.724LeuAla: 5.724 ± 0.581
0.669LeuCys: 0.669 ± 0.241
4.981LeuAsp: 4.981 ± 0.573
3.792LeuGlu: 3.792 ± 0.458
2.825LeuPhe: 2.825 ± 0.424
4.684LeuGly: 4.684 ± 0.641
1.636LeuHis: 1.636 ± 0.357
5.13LeuIle: 5.13 ± 0.855
7.509LeuLys: 7.509 ± 0.683
5.724LeuLeu: 5.724 ± 0.791
1.933LeuMet: 1.933 ± 0.378
6.542LeuAsn: 6.542 ± 0.735
1.933LeuPro: 1.933 ± 0.332
3.792LeuGln: 3.792 ± 0.417
2.899LeuArg: 2.899 ± 0.466
6.022LeuSer: 6.022 ± 0.58
5.278LeuThr: 5.278 ± 0.607
5.055LeuVal: 5.055 ± 0.673
0.818LeuTrp: 0.818 ± 0.216
2.602LeuTyr: 2.602 ± 0.347
0.0LeuXaa: 0.0 ± 0.0
Met
1.784MetAla: 1.784 ± 0.331
0.149MetCys: 0.149 ± 0.116
1.413MetAsp: 1.413 ± 0.27
1.71MetGlu: 1.71 ± 0.39
0.966MetPhe: 0.966 ± 0.32
0.966MetGly: 0.966 ± 0.25
0.297MetHis: 0.297 ± 0.142
1.487MetIle: 1.487 ± 0.329
2.528MetLys: 2.528 ± 0.444
1.636MetLeu: 1.636 ± 0.299
0.743MetMet: 0.743 ± 0.214
2.007MetAsn: 2.007 ± 0.347
1.264MetPro: 1.264 ± 0.361
1.264MetGln: 1.264 ± 0.325
1.487MetArg: 1.487 ± 0.316
2.007MetSer: 2.007 ± 0.405
1.636MetThr: 1.636 ± 0.329
1.413MetVal: 1.413 ± 0.302
0.149MetTrp: 0.149 ± 0.085
0.595MetTyr: 0.595 ± 0.229
0.0MetXaa: 0.0 ± 0.0
Asn
4.609AsnAla: 4.609 ± 0.571
0.669AsnCys: 0.669 ± 0.228
4.015AsnAsp: 4.015 ± 0.504
3.122AsnGlu: 3.122 ± 0.625
2.751AsnPhe: 2.751 ± 0.56
4.461AsnGly: 4.461 ± 0.67
1.784AsnHis: 1.784 ± 0.294
4.089AsnIle: 4.089 ± 0.484
5.055AsnLys: 5.055 ± 0.478
5.204AsnLeu: 5.204 ± 0.613
1.933AsnMet: 1.933 ± 0.334
4.238AsnAsn: 4.238 ± 0.654
2.007AsnPro: 2.007 ± 0.401
3.197AsnGln: 3.197 ± 0.602
2.453AsnArg: 2.453 ± 0.352
4.386AsnSer: 4.386 ± 0.636
4.609AsnThr: 4.609 ± 0.624
4.015AsnVal: 4.015 ± 0.512
1.413AsnTrp: 1.413 ± 0.324
2.751AsnTyr: 2.751 ± 0.541
0.0AsnXaa: 0.0 ± 0.0
Pro
2.305ProAla: 2.305 ± 0.329
0.074ProCys: 0.074 ± 0.072
3.271ProAsp: 3.271 ± 0.481
2.007ProGlu: 2.007 ± 0.385
0.966ProPhe: 0.966 ± 0.37
2.379ProGly: 2.379 ± 0.456
0.223ProHis: 0.223 ± 0.114
2.156ProIle: 2.156 ± 0.441
2.156ProLys: 2.156 ± 0.436
2.751ProLeu: 2.751 ± 0.465
0.595ProMet: 0.595 ± 0.174
1.933ProAsn: 1.933 ± 0.333
0.52ProPro: 0.52 ± 0.186
1.413ProGln: 1.413 ± 0.233
0.595ProArg: 0.595 ± 0.186
1.933ProSer: 1.933 ± 0.298
2.379ProThr: 2.379 ± 0.386
2.305ProVal: 2.305 ± 0.34
0.074ProTrp: 0.074 ± 0.067
1.338ProTyr: 1.338 ± 0.304
0.0ProXaa: 0.0 ± 0.0
Gln
3.717GlnAla: 3.717 ± 0.824
0.074GlnCys: 0.074 ± 0.07
2.899GlnAsp: 2.899 ± 0.417
2.602GlnGlu: 2.602 ± 0.548
1.041GlnPhe: 1.041 ± 0.273
2.453GlnGly: 2.453 ± 0.459
0.743GlnHis: 0.743 ± 0.275
2.825GlnIle: 2.825 ± 0.534
4.461GlnLys: 4.461 ± 0.623
3.122GlnLeu: 3.122 ± 0.527
0.966GlnMet: 0.966 ± 0.228
2.23GlnAsn: 2.23 ± 0.494
1.338GlnPro: 1.338 ± 0.261
2.379GlnGln: 2.379 ± 0.518
1.71GlnArg: 1.71 ± 0.402
3.122GlnSer: 3.122 ± 0.565
2.602GlnThr: 2.602 ± 0.455
2.528GlnVal: 2.528 ± 0.412
1.041GlnTrp: 1.041 ± 0.41
2.23GlnTyr: 2.23 ± 0.366
0.0GlnXaa: 0.0 ± 0.0
Arg
3.122ArgAla: 3.122 ± 0.507
0.372ArgCys: 0.372 ± 0.161
2.825ArgAsp: 2.825 ± 0.456
1.784ArgGlu: 1.784 ± 0.392
1.784ArgPhe: 1.784 ± 0.421
1.561ArgGly: 1.561 ± 0.275
0.892ArgHis: 0.892 ± 0.242
3.271ArgIle: 3.271 ± 0.529
3.494ArgLys: 3.494 ± 0.581
3.42ArgLeu: 3.42 ± 0.541
1.115ArgMet: 1.115 ± 0.254
2.751ArgAsn: 2.751 ± 0.466
1.041ArgPro: 1.041 ± 0.296
0.966ArgGln: 0.966 ± 0.228
2.007ArgArg: 2.007 ± 0.445
2.379ArgSer: 2.379 ± 0.335
1.71ArgThr: 1.71 ± 0.38
2.379ArgVal: 2.379 ± 0.449
0.892ArgTrp: 0.892 ± 0.225
1.859ArgTyr: 1.859 ± 0.383
0.0ArgXaa: 0.0 ± 0.0
Ser
4.015SerAla: 4.015 ± 0.7
0.669SerCys: 0.669 ± 0.3
5.13SerAsp: 5.13 ± 0.736
3.494SerGlu: 3.494 ± 0.698
2.899SerPhe: 2.899 ± 0.472
4.386SerGly: 4.386 ± 0.505
1.041SerHis: 1.041 ± 0.33
4.684SerIle: 4.684 ± 0.618
4.832SerLys: 4.832 ± 0.714
5.873SerLeu: 5.873 ± 0.677
1.561SerMet: 1.561 ± 0.322
5.427SerAsn: 5.427 ± 0.636
2.602SerPro: 2.602 ± 0.433
3.122SerGln: 3.122 ± 0.495
2.23SerArg: 2.23 ± 0.315
4.386SerSer: 4.386 ± 0.744
2.974SerThr: 2.974 ± 0.351
2.825SerVal: 2.825 ± 0.409
1.338SerTrp: 1.338 ± 0.287
3.42SerTyr: 3.42 ± 0.564
0.0SerXaa: 0.0 ± 0.0
Thr
4.015ThrAla: 4.015 ± 0.495
0.297ThrCys: 0.297 ± 0.159
4.758ThrAsp: 4.758 ± 0.688
2.825ThrGlu: 2.825 ± 0.506
2.676ThrPhe: 2.676 ± 0.388
4.907ThrGly: 4.907 ± 0.687
1.115ThrHis: 1.115 ± 0.263
4.684ThrIle: 4.684 ± 0.572
4.832ThrLys: 4.832 ± 0.689
4.609ThrLeu: 4.609 ± 0.494
1.933ThrMet: 1.933 ± 0.478
3.569ThrAsn: 3.569 ± 0.734
2.007ThrPro: 2.007 ± 0.391
2.082ThrGln: 2.082 ± 0.385
1.784ThrArg: 1.784 ± 0.399
4.089ThrSer: 4.089 ± 0.652
3.717ThrThr: 3.717 ± 0.606
3.792ThrVal: 3.792 ± 0.709
0.966ThrTrp: 0.966 ± 0.227
2.082ThrTyr: 2.082 ± 0.508
0.0ThrXaa: 0.0 ± 0.0
Val
4.015ValAla: 4.015 ± 0.523
0.595ValCys: 0.595 ± 0.221
4.163ValAsp: 4.163 ± 0.501
2.453ValGlu: 2.453 ± 0.386
1.561ValPhe: 1.561 ± 0.354
3.569ValGly: 3.569 ± 0.554
0.818ValHis: 0.818 ± 0.223
3.42ValIle: 3.42 ± 0.423
5.576ValLys: 5.576 ± 0.552
3.42ValLeu: 3.42 ± 0.454
1.115ValMet: 1.115 ± 0.304
3.792ValAsn: 3.792 ± 0.427
1.933ValPro: 1.933 ± 0.43
2.007ValGln: 2.007 ± 0.406
1.487ValArg: 1.487 ± 0.455
4.089ValSer: 4.089 ± 0.522
4.089ValThr: 4.089 ± 0.524
2.751ValVal: 2.751 ± 0.5
0.892ValTrp: 0.892 ± 0.255
2.082ValTyr: 2.082 ± 0.371
0.0ValXaa: 0.0 ± 0.0
Trp
1.041TrpAla: 1.041 ± 0.276
0.074TrpCys: 0.074 ± 0.066
1.413TrpAsp: 1.413 ± 0.299
1.041TrpGlu: 1.041 ± 0.243
0.52TrpPhe: 0.52 ± 0.182
1.19TrpGly: 1.19 ± 0.318
0.446TrpHis: 0.446 ± 0.183
1.041TrpIle: 1.041 ± 0.367
1.71TrpLys: 1.71 ± 0.382
1.561TrpLeu: 1.561 ± 0.324
0.223TrpMet: 0.223 ± 0.127
1.041TrpAsn: 1.041 ± 0.343
0.372TrpPro: 0.372 ± 0.156
1.041TrpGln: 1.041 ± 0.259
0.818TrpArg: 0.818 ± 0.173
1.636TrpSer: 1.636 ± 0.467
0.892TrpThr: 0.892 ± 0.205
0.52TrpVal: 0.52 ± 0.197
0.074TrpTrp: 0.074 ± 0.065
0.966TrpTyr: 0.966 ± 0.289
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.23TyrAla: 2.23 ± 0.429
0.669TyrCys: 0.669 ± 0.31
3.048TyrAsp: 3.048 ± 0.61
2.899TyrGlu: 2.899 ± 0.644
1.041TyrPhe: 1.041 ± 0.218
2.974TyrGly: 2.974 ± 0.619
1.338TyrHis: 1.338 ± 0.304
2.379TyrIle: 2.379 ± 0.406
3.643TyrLys: 3.643 ± 0.483
4.163TyrLeu: 4.163 ± 0.649
0.743TyrMet: 0.743 ± 0.212
2.007TyrAsn: 2.007 ± 0.444
1.413TyrPro: 1.413 ± 0.362
1.859TyrGln: 1.859 ± 0.388
2.082TyrArg: 2.082 ± 0.336
2.528TyrSer: 2.528 ± 0.446
2.23TyrThr: 2.23 ± 0.511
2.23TyrVal: 2.23 ± 0.333
0.372TyrTrp: 0.372 ± 0.144
1.561TyrTyr: 1.561 ± 0.316
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 63 proteins (13452 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski