Amino acid dipeptide frequency for Listeria phage PSA

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of them would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
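The counting described above can be sketched in Python. This is only an illustration under stated assumptions, not the Proteome-pI implementation: the function name `dipeptide_permille`, the use of one-letter codes, the mapping of every non-standard letter to `X` (Xaa), and the choice to count overlapping pairs that never span two proteins are all assumptions made here.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (the table uses three-letter codes).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation from the text: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over all given protein sequences."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard letters is treated as X (Xaa).
        clean = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping adjacent pairs; pairs are never formed across two proteins.
        counts.update(clean[i:i + 2] for i in range(len(clean) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input, not taken from the phage proteome.
toy = ["MKKLA", "MKA"]
freqs = dipeptide_permille(toy)
```

With real data, `sequences` would be the 59 protein sequences of the proteome; the resulting values could then be compared against `EXPECTED_PERMILLE`.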

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
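As a small worked example of the unit conversion and of the over/under-representation idea, the sketch below converts three values taken from the table to fractions and compares them with the uniform expectation of 2.5‰. The helper names are hypothetical, and the simple comparison against 2.5‰ is an assumption based on the description above; the exact rule used for coloring the table is not stated on this page.

```python
# Uniform expectation: 1/400 of all dipeptides, expressed in per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PERMILLE):
    """Label a per mille value relative to the uniform expectation (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Three values copied from the table (in per mille).
table_values = {"AlaAla": 3.348, "AlaCys": 0.419, "LeuLys": 8.454}

labels = {name: classify(v) for name, v in table_values.items()}
for name, v in table_values.items():
    print(name, permille_to_fraction(v), labels[name])
```

Under this rule AlaAla and LeuLys fall above the uniform expectation and AlaCys below it.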

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.348 ± 0.795
AlaCys: 0.419 ± 0.18
AlaAsp: 4.018 ± 0.611
AlaGlu: 5.943 ± 0.718
AlaPhe: 3.516 ± 0.497
AlaGly: 3.934 ± 0.801
AlaHis: 0.837 ± 0.28
AlaIle: 5.106 ± 0.638
AlaLys: 6.11 ± 0.718
AlaLeu: 4.353 ± 0.94
AlaMet: 2.427 ± 0.433
AlaAsn: 3.683 ± 0.507
AlaPro: 1.256 ± 0.32
AlaGln: 2.511 ± 0.507
AlaArg: 2.427 ± 0.451
AlaSer: 3.348 ± 0.519
AlaThr: 5.943 ± 0.985
AlaVal: 5.106 ± 0.708
AlaTrp: 1.339 ± 0.368
AlaTyr: 2.009 ± 0.3
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.004 ± 0.304
CysCys: 0.167 ± 0.099
CysAsp: 0.586 ± 0.263
CysGlu: 1.004 ± 0.285
CysPhe: 0.251 ± 0.146
CysGly: 0.335 ± 0.155
CysHis: 0.084 ± 0.079
CysIle: 0.167 ± 0.122
CysLys: 0.67 ± 0.304
CysLeu: 0.335 ± 0.176
CysMet: 0.251 ± 0.152
CysAsn: 1.088 ± 0.266
CysPro: 0.167 ± 0.146
CysGln: 0.419 ± 0.252
CysArg: 0.084 ± 0.092
CysSer: 0.502 ± 0.199
CysThr: 0.335 ± 0.199
CysVal: 0.837 ± 0.292
CysTrp: 0.0 ± 0.0
CysTyr: 0.502 ± 0.188
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.269 ± 0.59
AspCys: 0.586 ± 0.232
AspAsp: 4.018 ± 0.563
AspGlu: 6.027 ± 0.632
AspPhe: 2.26 ± 0.429
AspGly: 4.018 ± 0.552
AspHis: 0.753 ± 0.263
AspIle: 5.859 ± 0.755
AspLys: 4.52 ± 0.514
AspLeu: 5.943 ± 0.911
AspMet: 1.841 ± 0.38
AspAsn: 4.101 ± 0.536
AspPro: 1.088 ± 0.354
AspGln: 1.256 ± 0.317
AspArg: 1.841 ± 0.32
AspSer: 3.264 ± 0.533
AspThr: 3.013 ± 0.614
AspVal: 4.436 ± 0.455
AspTrp: 0.753 ± 0.31
AspTyr: 2.678 ± 0.574
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.11 ± 0.719
GluCys: 0.586 ± 0.266
GluAsp: 3.599 ± 0.542
GluGlu: 6.027 ± 1.141
GluPhe: 3.599 ± 0.631
GluGly: 4.604 ± 0.608
GluHis: 1.674 ± 0.304
GluIle: 6.278 ± 0.806
GluLys: 6.194 ± 0.762
GluLeu: 7.282 ± 0.678
GluMet: 3.264 ± 0.484
GluAsn: 4.938 ± 0.604
GluPro: 1.172 ± 0.294
GluGln: 4.018 ± 0.514
GluArg: 3.348 ± 0.621
GluSer: 3.432 ± 0.548
GluThr: 4.938 ± 0.582
GluVal: 5.357 ± 0.707
GluTrp: 1.339 ± 0.315
GluTyr: 3.097 ± 0.454
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.432 ± 0.514
PheCys: 0.419 ± 0.19
PheAsp: 2.93 ± 0.417
PheGlu: 3.181 ± 0.466
PhePhe: 1.339 ± 0.251
PheGly: 1.841 ± 0.41
PheHis: 0.837 ± 0.208
PheIle: 3.013 ± 0.588
PheLys: 3.767 ± 0.561
PheLeu: 3.767 ± 0.519
PheMet: 1.59 ± 0.303
PheAsn: 2.427 ± 0.568
PhePro: 1.088 ± 0.322
PheGln: 0.837 ± 0.224
PheArg: 1.758 ± 0.368
PheSer: 2.762 ± 0.458
PheThr: 3.432 ± 0.495
PheVal: 2.678 ± 0.352
PheTrp: 0.586 ± 0.257
PheTyr: 1.423 ± 0.348
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.85 ± 0.572
GlyCys: 0.502 ± 0.212
GlyAsp: 3.683 ± 0.613
GlyGlu: 3.683 ± 0.486
GlyPhe: 3.432 ± 0.404
GlyGly: 2.93 ± 0.58
GlyHis: 0.753 ± 0.274
GlyIle: 4.269 ± 0.53
GlyLys: 5.776 ± 0.555
GlyLeu: 6.027 ± 0.758
GlyMet: 1.507 ± 0.396
GlyAsn: 2.595 ± 0.523
GlyPro: 1.088 ± 0.285
GlyGln: 1.758 ± 0.469
GlyArg: 2.26 ± 0.409
GlySer: 3.181 ± 0.452
GlyThr: 2.427 ± 0.526
GlyVal: 3.767 ± 0.633
GlyTrp: 0.586 ± 0.24
GlyTyr: 2.009 ± 0.452
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.586 ± 0.212
HisCys: 0.084 ± 0.073
HisAsp: 1.339 ± 0.393
HisGlu: 0.67 ± 0.259
HisPhe: 0.419 ± 0.157
HisGly: 0.335 ± 0.139
HisHis: 0.586 ± 0.291
HisIle: 1.674 ± 0.402
HisLys: 0.921 ± 0.271
HisLeu: 1.841 ± 0.5
HisMet: 0.084 ± 0.079
HisAsn: 1.088 ± 0.358
HisPro: 0.837 ± 0.28
HisGln: 0.419 ± 0.2
HisArg: 0.753 ± 0.251
HisSer: 0.753 ± 0.291
HisThr: 1.088 ± 0.414
HisVal: 1.423 ± 0.384
HisTrp: 0.167 ± 0.121
HisTyr: 1.256 ± 0.339
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.273 ± 0.591
IleCys: 1.256 ± 0.347
IleAsp: 6.361 ± 0.775
IleGlu: 5.524 ± 0.732
IlePhe: 3.348 ± 0.516
IleGly: 3.599 ± 0.64
IleHis: 1.339 ± 0.36
IleIle: 5.524 ± 0.701
IleLys: 6.11 ± 0.709
IleLeu: 4.269 ± 0.622
IleMet: 0.753 ± 0.266
IleAsn: 5.357 ± 0.712
IlePro: 2.176 ± 0.44
IleGln: 3.181 ± 0.535
IleArg: 2.344 ± 0.452
IleSer: 3.767 ± 0.446
IleThr: 3.767 ± 0.73
IleVal: 4.687 ± 0.587
IleTrp: 0.502 ± 0.223
IleTyr: 2.762 ± 0.438
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.859 ± 0.668
LysCys: 0.753 ± 0.306
LysAsp: 4.687 ± 0.608
LysGlu: 7.198 ± 0.623
LysPhe: 3.264 ± 0.545
LysGly: 4.101 ± 0.698
LysHis: 1.256 ± 0.382
LysIle: 6.445 ± 0.547
LysLys: 6.445 ± 0.937
LysLeu: 7.198 ± 0.729
LysMet: 3.516 ± 0.561
LysAsn: 6.529 ± 0.751
LysPro: 1.758 ± 0.329
LysGln: 4.185 ± 0.557
LysArg: 3.097 ± 0.564
LysSer: 4.101 ± 0.531
LysThr: 5.776 ± 0.634
LysVal: 6.027 ± 0.648
LysTrp: 1.674 ± 0.356
LysTyr: 4.436 ± 0.667
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.52 ± 0.664
LeuCys: 0.921 ± 0.275
LeuAsp: 6.11 ± 0.834
LeuGlu: 7.952 ± 0.834
LeuPhe: 3.348 ± 0.716
LeuGly: 4.52 ± 0.706
LeuHis: 1.256 ± 0.367
LeuIle: 5.524 ± 0.913
LeuLys: 8.454 ± 0.876
LeuLeu: 6.278 ± 0.857
LeuMet: 2.093 ± 0.483
LeuAsn: 5.776 ± 0.694
LeuPro: 2.678 ± 0.505
LeuGln: 2.595 ± 0.569
LeuArg: 2.427 ± 0.483
LeuSer: 5.524 ± 0.557
LeuThr: 5.776 ± 0.754
LeuVal: 3.348 ± 0.499
LeuTrp: 1.088 ± 0.34
LeuTyr: 3.599 ± 0.49
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.841 ± 0.424
MetCys: 0.084 ± 0.114
MetAsp: 1.423 ± 0.34
MetGlu: 1.841 ± 0.509
MetPhe: 1.256 ± 0.428
MetGly: 1.004 ± 0.27
MetHis: 0.251 ± 0.14
MetIle: 1.59 ± 0.371
MetLys: 2.846 ± 0.524
MetLeu: 2.427 ± 0.429
MetMet: 0.419 ± 0.203
MetAsn: 2.176 ± 0.401
MetPro: 0.921 ± 0.274
MetGln: 1.004 ± 0.311
MetArg: 1.59 ± 0.364
MetSer: 1.339 ± 0.307
MetThr: 2.009 ± 0.353
MetVal: 1.423 ± 0.29
MetTrp: 0.084 ± 0.073
MetTyr: 1.59 ± 0.404
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.934 ± 0.645
AsnCys: 0.419 ± 0.203
AsnAsp: 3.599 ± 0.626
AsnGlu: 5.608 ± 0.912
AsnPhe: 2.176 ± 0.407
AsnGly: 5.273 ± 0.696
AsnHis: 1.172 ± 0.421
AsnIle: 4.269 ± 0.64
AsnLys: 5.608 ± 0.771
AsnLeu: 4.436 ± 0.577
AsnMet: 1.59 ± 0.433
AsnAsn: 4.018 ± 0.812
AsnPro: 2.26 ± 0.425
AsnGln: 1.925 ± 0.43
AsnArg: 2.093 ± 0.389
AsnSer: 3.767 ± 0.601
AsnThr: 4.185 ± 0.637
AsnVal: 4.353 ± 0.65
AsnTrp: 1.004 ± 0.348
AsnTyr: 2.009 ± 0.42
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.176 ± 0.453
ProCys: 0.084 ± 0.085
ProAsp: 1.256 ± 0.4
ProGlu: 2.093 ± 0.376
ProPhe: 1.256 ± 0.279
ProGly: 1.339 ± 0.321
ProHis: 0.335 ± 0.146
ProIle: 1.841 ± 0.355
ProLys: 1.423 ± 0.32
ProLeu: 3.097 ± 0.717
ProMet: 0.67 ± 0.222
ProAsn: 0.837 ± 0.28
ProPro: 0.837 ± 0.325
ProGln: 0.921 ± 0.251
ProArg: 0.586 ± 0.209
ProSer: 1.507 ± 0.343
ProThr: 1.507 ± 0.263
ProVal: 2.595 ± 0.585
ProTrp: 0.0 ± 0.0
ProTyr: 0.921 ± 0.294
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.009 ± 0.398
GlnCys: 0.251 ± 0.157
GlnAsp: 1.925 ± 0.372
GlnGlu: 2.093 ± 0.417
GlnPhe: 1.925 ± 0.474
GlnGly: 1.59 ± 0.326
GlnHis: 0.837 ± 0.392
GlnIle: 2.427 ± 0.349
GlnLys: 3.013 ± 0.605
GlnLeu: 3.348 ± 0.552
GlnMet: 0.502 ± 0.182
GlnAsn: 1.925 ± 0.407
GlnPro: 0.921 ± 0.244
GlnGln: 1.674 ± 0.399
GlnArg: 1.172 ± 0.387
GlnSer: 3.013 ± 0.48
GlnThr: 1.758 ± 0.328
GlnVal: 2.511 ± 0.43
GlnTrp: 0.419 ± 0.176
GlnTyr: 1.423 ± 0.307
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.507 ± 0.319
ArgCys: 0.335 ± 0.182
ArgAsp: 2.344 ± 0.353
ArgGlu: 2.26 ± 0.408
ArgPhe: 1.841 ± 0.468
ArgGly: 1.758 ± 0.429
ArgHis: 0.753 ± 0.291
ArgIle: 2.176 ± 0.583
ArgLys: 4.101 ± 0.579
ArgLeu: 3.767 ± 0.518
ArgMet: 1.507 ± 0.357
ArgAsn: 2.009 ± 0.522
ArgPro: 0.837 ± 0.323
ArgGln: 1.256 ± 0.338
ArgArg: 0.753 ± 0.31
ArgSer: 1.59 ± 0.469
ArgThr: 1.507 ± 0.312
ArgVal: 1.59 ± 0.369
ArgTrp: 0.502 ± 0.24
ArgTyr: 1.59 ± 0.455
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.595 ± 0.581
SerCys: 0.502 ± 0.215
SerAsp: 4.604 ± 0.689
SerGlu: 5.441 ± 0.477
SerPhe: 2.344 ± 0.521
SerGly: 3.432 ± 0.498
SerHis: 0.837 ± 0.287
SerIle: 4.687 ± 0.875
SerLys: 4.855 ± 0.5
SerLeu: 4.604 ± 0.779
SerMet: 1.004 ± 0.352
SerAsn: 3.348 ± 0.539
SerPro: 1.256 ± 0.26
SerGln: 1.423 ± 0.341
SerArg: 1.674 ± 0.429
SerSer: 4.101 ± 0.632
SerThr: 3.767 ± 0.523
SerVal: 3.683 ± 0.614
SerTrp: 1.088 ± 0.311
SerTyr: 1.841 ± 0.415
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.613 ± 1.009
ThrCys: 0.251 ± 0.158
ThrAsp: 3.181 ± 0.651
ThrGlu: 5.022 ± 0.568
ThrPhe: 1.507 ± 0.412
ThrGly: 5.022 ± 0.701
ThrHis: 0.586 ± 0.276
ThrIle: 3.181 ± 0.482
ThrLys: 6.027 ± 0.768
ThrLeu: 6.027 ± 0.636
ThrMet: 1.004 ± 0.253
ThrAsn: 3.516 ± 0.51
ThrPro: 2.344 ± 0.706
ThrGln: 1.256 ± 0.261
ThrArg: 1.507 ± 0.388
ThrSer: 3.85 ± 0.642
ThrThr: 2.93 ± 0.524
ThrVal: 4.269 ± 0.603
ThrTrp: 0.837 ± 0.28
ThrTyr: 1.758 ± 0.394
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.19 ± 0.736
ValCys: 0.502 ± 0.189
ValAsp: 3.683 ± 0.709
ValGlu: 6.027 ± 0.829
ValPhe: 2.93 ± 0.542
ValGly: 3.516 ± 0.513
ValHis: 1.004 ± 0.309
ValIle: 3.934 ± 0.52
ValLys: 5.524 ± 0.721
ValLeu: 4.353 ± 0.65
ValMet: 1.507 ± 0.329
ValAsn: 5.106 ± 0.726
ValPro: 1.674 ± 0.358
ValGln: 1.925 ± 0.438
ValArg: 3.013 ± 0.491
ValSer: 4.018 ± 0.562
ValThr: 4.436 ± 0.747
ValVal: 2.846 ± 0.583
ValTrp: 1.004 ± 0.252
ValTyr: 1.841 ± 0.322
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.172 ± 0.346
TrpCys: 0.335 ± 0.134
TrpAsp: 1.088 ± 0.385
TrpGlu: 0.502 ± 0.301
TrpPhe: 0.67 ± 0.254
TrpGly: 0.753 ± 0.24
TrpHis: 0.586 ± 0.202
TrpIle: 1.004 ± 0.307
TrpLys: 1.088 ± 0.301
TrpLeu: 1.59 ± 0.466
TrpMet: 0.502 ± 0.174
TrpAsn: 0.837 ± 0.329
TrpPro: 0.084 ± 0.088
TrpGln: 0.419 ± 0.276
TrpArg: 0.167 ± 0.108
TrpSer: 0.921 ± 0.239
TrpThr: 0.502 ± 0.189
TrpVal: 0.502 ± 0.219
TrpTrp: 0.335 ± 0.195
TrpTyr: 1.004 ± 0.477
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.344 ± 0.48
TyrCys: 0.335 ± 0.183
TyrAsp: 2.093 ± 0.425
TyrGlu: 2.846 ± 0.551
TyrPhe: 2.427 ± 0.456
TyrGly: 2.176 ± 0.498
TyrHis: 0.586 ± 0.198
TyrIle: 2.93 ± 0.453
TyrLys: 4.855 ± 0.692
TyrLeu: 2.846 ± 0.639
TyrMet: 1.004 ± 0.283
TyrAsn: 2.344 ± 0.471
TyrPro: 0.837 ± 0.363
TyrGln: 1.758 ± 0.378
TyrArg: 1.088 ± 0.282
TyrSer: 2.344 ± 0.435
TyrThr: 1.59 ± 0.352
TyrVal: 2.595 ± 0.465
TyrTrp: 0.753 ± 0.245
TyrTyr: 2.26 ± 0.501
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 59 proteins (11948 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
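One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for every resample, and the spread of those estimates is taken as the error. This is an assumption about the procedure, not the documented Proteome-pI method; the function names, the fixed seed, the use of the sample standard deviation, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(sequences, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in sequences:
        # Overlapping adjacent pairs within each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not the 59 phage proteins.
toy = ["MKKLAEE", "MKALLK", "MAKKE", "MLKEA"]
err = bootstrap_error(toy, "KK")
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, which is presumably why the error is described as being estimated at the protein level.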

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski