Amino acid dipepetide frequency for Lactococcus phage phi7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.513AlaAla: 0.513 ± 0.284
0.103AlaCys: 0.103 ± 0.106
2.772AlaAsp: 2.772 ± 0.587
4.311AlaGlu: 4.311 ± 0.731
4.003AlaPhe: 4.003 ± 0.803
4.106AlaGly: 4.106 ± 0.761
0.616AlaHis: 0.616 ± 0.254
4.414AlaIle: 4.414 ± 0.797
5.03AlaLys: 5.03 ± 0.84
6.262AlaLeu: 6.262 ± 0.972
1.95AlaMet: 1.95 ± 0.567
4.209AlaAsn: 4.209 ± 0.818
0.821AlaPro: 0.821 ± 0.31
2.258AlaGln: 2.258 ± 0.538
1.95AlaArg: 1.95 ± 0.448
2.464AlaSer: 2.464 ± 0.533
3.285AlaThr: 3.285 ± 0.647
3.593AlaVal: 3.593 ± 0.871
1.95AlaTrp: 1.95 ± 0.729
1.437AlaTyr: 1.437 ± 0.335
0.0AlaXaa: 0.0 ± 0.0
Cys
0.513CysAla: 0.513 ± 0.263
0.103CysCys: 0.103 ± 0.105
0.205CysAsp: 0.205 ± 0.131
0.411CysGlu: 0.411 ± 0.199
0.411CysPhe: 0.411 ± 0.263
0.719CysGly: 0.719 ± 0.299
0.103CysHis: 0.103 ± 0.097
0.411CysIle: 0.411 ± 0.213
0.616CysLys: 0.616 ± 0.253
0.513CysLeu: 0.513 ± 0.26
0.103CysMet: 0.103 ± 0.084
0.719CysAsn: 0.719 ± 0.274
0.205CysPro: 0.205 ± 0.135
0.308CysGln: 0.308 ± 0.15
0.411CysArg: 0.411 ± 0.185
0.411CysSer: 0.411 ± 0.217
0.103CysThr: 0.103 ± 0.104
0.308CysVal: 0.308 ± 0.168
0.308CysTrp: 0.308 ± 0.182
0.411CysTyr: 0.411 ± 0.219
0.0CysXaa: 0.0 ± 0.0
Asp
1.437AspAla: 1.437 ± 0.483
0.205AspCys: 0.205 ± 0.118
3.285AspAsp: 3.285 ± 0.699
3.387AspGlu: 3.387 ± 0.7
3.798AspPhe: 3.798 ± 0.567
3.798AspGly: 3.798 ± 0.723
0.719AspHis: 0.719 ± 0.257
4.619AspIle: 4.619 ± 0.645
5.954AspLys: 5.954 ± 0.603
5.748AspLeu: 5.748 ± 0.706
1.026AspMet: 1.026 ± 0.348
4.003AspAsn: 4.003 ± 0.561
1.437AspPro: 1.437 ± 0.393
0.616AspGln: 0.616 ± 0.27
2.053AspArg: 2.053 ± 0.484
3.593AspSer: 3.593 ± 0.653
3.798AspThr: 3.798 ± 0.547
2.669AspVal: 2.669 ± 0.447
1.026AspTrp: 1.026 ± 0.269
2.874AspTyr: 2.874 ± 0.663
0.0AspXaa: 0.0 ± 0.0
Glu
3.695GluAla: 3.695 ± 0.554
0.513GluCys: 0.513 ± 0.25
3.182GluAsp: 3.182 ± 0.585
5.543GluGlu: 5.543 ± 1.062
3.695GluPhe: 3.695 ± 0.643
2.464GluGly: 2.464 ± 0.388
1.232GluHis: 1.232 ± 0.374
5.748GluIle: 5.748 ± 0.794
6.364GluLys: 6.364 ± 1.257
9.752GluLeu: 9.752 ± 1.363
2.156GluMet: 2.156 ± 0.386
5.543GluAsn: 5.543 ± 0.771
1.129GluPro: 1.129 ± 0.376
4.619GluGln: 4.619 ± 0.881
3.079GluArg: 3.079 ± 0.656
4.106GluSer: 4.106 ± 0.629
4.619GluThr: 4.619 ± 0.653
5.132GluVal: 5.132 ± 0.691
0.821GluTrp: 0.821 ± 0.285
3.49GluTyr: 3.49 ± 0.704
0.0GluXaa: 0.0 ± 0.0
Phe
2.361PheAla: 2.361 ± 0.566
0.308PheCys: 0.308 ± 0.162
3.798PheAsp: 3.798 ± 0.654
2.772PheGlu: 2.772 ± 0.523
1.95PhePhe: 1.95 ± 0.497
2.156PheGly: 2.156 ± 0.417
0.205PheHis: 0.205 ± 0.163
3.387PheIle: 3.387 ± 0.566
3.901PheLys: 3.901 ± 0.759
2.977PheLeu: 2.977 ± 0.549
0.924PheMet: 0.924 ± 0.25
2.669PheAsn: 2.669 ± 0.604
0.719PhePro: 0.719 ± 0.236
1.129PheGln: 1.129 ± 0.401
1.437PheArg: 1.437 ± 0.334
3.695PheSer: 3.695 ± 0.687
4.106PheThr: 4.106 ± 0.634
2.464PheVal: 2.464 ± 0.406
0.205PheTrp: 0.205 ± 0.148
1.95PheTyr: 1.95 ± 0.336
0.0PheXaa: 0.0 ± 0.0
Gly
4.003GlyAla: 4.003 ± 1.178
0.308GlyCys: 0.308 ± 0.187
2.874GlyAsp: 2.874 ± 0.506
4.619GlyGlu: 4.619 ± 0.522
2.156GlyPhe: 2.156 ± 0.462
4.517GlyGly: 4.517 ± 0.698
1.026GlyHis: 1.026 ± 0.396
4.209GlyIle: 4.209 ± 1.16
5.851GlyLys: 5.851 ± 0.713
5.851GlyLeu: 5.851 ± 1.169
1.745GlyMet: 1.745 ± 0.496
3.49GlyAsn: 3.49 ± 0.676
0.103GlyPro: 0.103 ± 0.104
2.464GlyGln: 2.464 ± 0.493
1.95GlyArg: 1.95 ± 0.434
4.517GlySer: 4.517 ± 0.929
3.49GlyThr: 3.49 ± 0.78
4.722GlyVal: 4.722 ± 0.913
1.232GlyTrp: 1.232 ± 0.304
3.901GlyTyr: 3.901 ± 0.718
0.0GlyXaa: 0.0 ± 0.0
His
0.616HisAla: 0.616 ± 0.22
0.513HisCys: 0.513 ± 0.266
0.719HisAsp: 0.719 ± 0.301
0.513HisGlu: 0.513 ± 0.248
0.411HisPhe: 0.411 ± 0.19
1.129HisGly: 1.129 ± 0.329
0.0HisHis: 0.0 ± 0.0
1.54HisIle: 1.54 ± 0.313
0.821HisLys: 0.821 ± 0.342
1.026HisLeu: 1.026 ± 0.38
0.103HisMet: 0.103 ± 0.084
1.54HisAsn: 1.54 ± 0.37
0.103HisPro: 0.103 ± 0.084
0.308HisGln: 0.308 ± 0.185
0.411HisArg: 0.411 ± 0.25
0.103HisSer: 0.103 ± 0.084
1.026HisThr: 1.026 ± 0.317
1.026HisVal: 1.026 ± 0.439
0.103HisTrp: 0.103 ± 0.112
0.513HisTyr: 0.513 ± 0.235
0.0HisXaa: 0.0 ± 0.0
Ile
4.414IleAla: 4.414 ± 0.612
0.205IleCys: 0.205 ± 0.146
4.209IleAsp: 4.209 ± 0.628
6.672IleGlu: 6.672 ± 0.885
2.566IlePhe: 2.566 ± 0.625
3.285IleGly: 3.285 ± 0.684
0.821IleHis: 0.821 ± 0.314
4.619IleIle: 4.619 ± 0.625
7.083IleLys: 7.083 ± 0.771
5.851IleLeu: 5.851 ± 0.901
1.54IleMet: 1.54 ± 0.376
4.619IleAsn: 4.619 ± 0.479
2.156IlePro: 2.156 ± 0.634
2.977IleGln: 2.977 ± 0.506
1.334IleArg: 1.334 ± 0.366
4.414IleSer: 4.414 ± 0.867
4.927IleThr: 4.927 ± 0.673
5.543IleVal: 5.543 ± 0.744
1.129IleTrp: 1.129 ± 0.294
2.669IleTyr: 2.669 ± 0.534
0.0IleXaa: 0.0 ± 0.0
Lys
6.056LysAla: 6.056 ± 0.809
0.411LysCys: 0.411 ± 0.187
4.927LysAsp: 4.927 ± 0.552
9.033LysGlu: 9.033 ± 1.283
2.361LysPhe: 2.361 ± 0.489
6.159LysGly: 6.159 ± 0.989
1.026LysHis: 1.026 ± 0.356
5.44LysIle: 5.44 ± 0.838
9.033LysLys: 9.033 ± 1.293
7.288LysLeu: 7.288 ± 0.953
2.977LysMet: 2.977 ± 0.479
5.748LysAsn: 5.748 ± 0.598
1.95LysPro: 1.95 ± 0.537
3.285LysGln: 3.285 ± 0.56
3.695LysArg: 3.695 ± 0.637
5.132LysSer: 5.132 ± 0.67
5.132LysThr: 5.132 ± 0.532
5.748LysVal: 5.748 ± 0.837
1.334LysTrp: 1.334 ± 0.379
3.49LysTyr: 3.49 ± 0.588
0.0LysXaa: 0.0 ± 0.0
Leu
4.619LeuAla: 4.619 ± 0.617
0.513LeuCys: 0.513 ± 0.194
5.235LeuAsp: 5.235 ± 0.758
7.185LeuGlu: 7.185 ± 0.9
3.798LeuPhe: 3.798 ± 0.617
4.722LeuGly: 4.722 ± 0.835
1.334LeuHis: 1.334 ± 0.405
7.288LeuIle: 7.288 ± 0.931
8.622LeuLys: 8.622 ± 1.095
6.467LeuLeu: 6.467 ± 1.007
1.437LeuMet: 1.437 ± 0.347
4.927LeuAsn: 4.927 ± 0.72
2.977LeuPro: 2.977 ± 0.585
3.49LeuGln: 3.49 ± 0.689
2.772LeuArg: 2.772 ± 0.495
4.824LeuSer: 4.824 ± 0.734
6.159LeuThr: 6.159 ± 0.736
5.748LeuVal: 5.748 ± 0.747
1.745LeuTrp: 1.745 ± 0.434
4.824LeuTyr: 4.824 ± 0.817
0.0LeuXaa: 0.0 ± 0.0
Met
1.848MetAla: 1.848 ± 0.475
0.103MetCys: 0.103 ± 0.108
1.232MetAsp: 1.232 ± 0.392
1.54MetGlu: 1.54 ± 0.467
0.513MetPhe: 0.513 ± 0.254
1.026MetGly: 1.026 ± 0.268
0.205MetHis: 0.205 ± 0.16
2.464MetIle: 2.464 ± 0.421
2.669MetLys: 2.669 ± 0.537
1.745MetLeu: 1.745 ± 0.44
0.308MetMet: 0.308 ± 0.21
1.745MetAsn: 1.745 ± 0.406
0.513MetPro: 0.513 ± 0.231
1.232MetGln: 1.232 ± 0.305
0.513MetArg: 0.513 ± 0.242
2.053MetSer: 2.053 ± 0.39
1.745MetThr: 1.745 ± 0.481
1.848MetVal: 1.848 ± 0.396
0.103MetTrp: 0.103 ± 0.11
1.334MetTyr: 1.334 ± 0.339
0.0MetXaa: 0.0 ± 0.0
Asn
4.619AsnAla: 4.619 ± 1.057
0.411AsnCys: 0.411 ± 0.233
4.722AsnAsp: 4.722 ± 0.664
5.132AsnGlu: 5.132 ± 0.67
2.361AsnPhe: 2.361 ± 0.582
5.646AsnGly: 5.646 ± 0.622
1.026AsnHis: 1.026 ± 0.33
4.209AsnIle: 4.209 ± 0.641
5.646AsnLys: 5.646 ± 0.89
6.159AsnLeu: 6.159 ± 0.813
1.334AsnMet: 1.334 ± 0.333
4.003AsnAsn: 4.003 ± 0.607
2.258AsnPro: 2.258 ± 0.475
2.156AsnGln: 2.156 ± 0.469
1.437AsnArg: 1.437 ± 0.304
4.722AsnSer: 4.722 ± 0.775
3.593AsnThr: 3.593 ± 0.69
4.209AsnVal: 4.209 ± 0.604
1.232AsnTrp: 1.232 ± 0.33
2.258AsnTyr: 2.258 ± 0.503
0.0AsnXaa: 0.0 ± 0.0
Pro
1.334ProAla: 1.334 ± 0.428
0.205ProCys: 0.205 ± 0.126
1.642ProAsp: 1.642 ± 0.439
1.745ProGlu: 1.745 ± 0.414
1.026ProPhe: 1.026 ± 0.275
0.308ProGly: 0.308 ± 0.15
0.308ProHis: 0.308 ± 0.171
1.848ProIle: 1.848 ± 0.451
2.156ProLys: 2.156 ± 0.599
2.156ProLeu: 2.156 ± 0.54
0.513ProMet: 0.513 ± 0.218
1.848ProAsn: 1.848 ± 0.617
0.719ProPro: 0.719 ± 0.285
0.719ProGln: 0.719 ± 0.3
0.411ProArg: 0.411 ± 0.169
1.54ProSer: 1.54 ± 0.48
2.053ProThr: 2.053 ± 0.404
1.232ProVal: 1.232 ± 0.396
0.103ProTrp: 0.103 ± 0.108
0.924ProTyr: 0.924 ± 0.361
0.0ProXaa: 0.0 ± 0.0
Gln
3.182GlnAla: 3.182 ± 0.648
0.411GlnCys: 0.411 ± 0.177
1.848GlnAsp: 1.848 ± 0.513
2.464GlnGlu: 2.464 ± 0.541
0.924GlnPhe: 0.924 ± 0.293
3.079GlnGly: 3.079 ± 0.523
0.308GlnHis: 0.308 ± 0.179
1.642GlnIle: 1.642 ± 0.337
2.156GlnLys: 2.156 ± 0.466
4.003GlnLeu: 4.003 ± 0.582
1.129GlnMet: 1.129 ± 0.263
2.053GlnAsn: 2.053 ± 0.354
1.334GlnPro: 1.334 ± 0.306
1.745GlnGln: 1.745 ± 0.464
1.95GlnArg: 1.95 ± 0.43
2.464GlnSer: 2.464 ± 0.461
2.361GlnThr: 2.361 ± 0.459
1.95GlnVal: 1.95 ± 0.431
0.821GlnTrp: 0.821 ± 0.222
1.334GlnTyr: 1.334 ± 0.33
0.0GlnXaa: 0.0 ± 0.0
Arg
2.156ArgAla: 2.156 ± 0.466
0.308ArgCys: 0.308 ± 0.176
1.334ArgAsp: 1.334 ± 0.357
2.772ArgGlu: 2.772 ± 0.551
0.719ArgPhe: 0.719 ± 0.276
1.95ArgGly: 1.95 ± 0.434
0.924ArgHis: 0.924 ± 0.305
1.848ArgIle: 1.848 ± 0.463
4.209ArgLys: 4.209 ± 0.745
3.182ArgLeu: 3.182 ± 0.563
0.513ArgMet: 0.513 ± 0.256
2.361ArgAsn: 2.361 ± 0.537
0.513ArgPro: 0.513 ± 0.212
1.437ArgGln: 1.437 ± 0.34
1.54ArgArg: 1.54 ± 0.412
1.95ArgSer: 1.95 ± 0.418
1.54ArgThr: 1.54 ± 0.321
2.258ArgVal: 2.258 ± 0.558
0.411ArgTrp: 0.411 ± 0.226
1.745ArgTyr: 1.745 ± 0.417
0.0ArgXaa: 0.0 ± 0.0
Ser
4.003SerAla: 4.003 ± 0.971
0.719SerCys: 0.719 ± 0.277
3.49SerAsp: 3.49 ± 0.542
4.003SerGlu: 4.003 ± 0.593
2.772SerPhe: 2.772 ± 0.511
5.44SerGly: 5.44 ± 1.074
0.821SerHis: 0.821 ± 0.243
4.414SerIle: 4.414 ± 0.612
5.235SerLys: 5.235 ± 0.812
5.543SerLeu: 5.543 ± 0.709
1.95SerMet: 1.95 ± 0.435
4.209SerAsn: 4.209 ± 0.587
1.437SerPro: 1.437 ± 0.433
1.642SerGln: 1.642 ± 0.384
2.464SerArg: 2.464 ± 0.451
5.132SerSer: 5.132 ± 0.686
3.285SerThr: 3.285 ± 0.677
3.695SerVal: 3.695 ± 0.693
0.616SerTrp: 0.616 ± 0.259
2.874SerTyr: 2.874 ± 0.572
0.0SerXaa: 0.0 ± 0.0
Thr
4.517ThrAla: 4.517 ± 0.677
0.308ThrCys: 0.308 ± 0.173
3.387ThrAsp: 3.387 ± 0.61
5.748ThrGlu: 5.748 ± 0.54
3.49ThrPhe: 3.49 ± 0.644
4.619ThrGly: 4.619 ± 0.677
0.103ThrHis: 0.103 ± 0.084
3.695ThrIle: 3.695 ± 0.656
4.209ThrLys: 4.209 ± 0.574
5.646ThrLeu: 5.646 ± 0.686
1.334ThrMet: 1.334 ± 0.285
4.414ThrAsn: 4.414 ± 0.596
1.642ThrPro: 1.642 ± 0.301
2.464ThrGln: 2.464 ± 0.586
2.053ThrArg: 2.053 ± 0.473
4.517ThrSer: 4.517 ± 0.756
4.722ThrThr: 4.722 ± 0.636
3.901ThrVal: 3.901 ± 0.757
1.129ThrTrp: 1.129 ± 0.413
2.258ThrTyr: 2.258 ± 0.445
0.0ThrXaa: 0.0 ± 0.0
Val
3.695ValAla: 3.695 ± 0.494
0.616ValCys: 0.616 ± 0.272
3.593ValAsp: 3.593 ± 0.703
4.209ValGlu: 4.209 ± 0.767
2.977ValPhe: 2.977 ± 0.485
3.593ValGly: 3.593 ± 0.539
0.616ValHis: 0.616 ± 0.257
5.132ValIle: 5.132 ± 0.623
6.672ValLys: 6.672 ± 0.678
3.285ValLeu: 3.285 ± 0.688
2.156ValMet: 2.156 ± 0.451
3.695ValAsn: 3.695 ± 0.705
1.437ValPro: 1.437 ± 0.365
1.95ValGln: 1.95 ± 0.593
2.772ValArg: 2.772 ± 0.583
5.235ValSer: 5.235 ± 1.095
4.414ValThr: 4.414 ± 0.698
2.566ValVal: 2.566 ± 0.595
0.513ValTrp: 0.513 ± 0.216
2.772ValTyr: 2.772 ± 0.467
0.0ValXaa: 0.0 ± 0.0
Trp
0.924TrpAla: 0.924 ± 0.287
0.411TrpCys: 0.411 ± 0.195
1.026TrpAsp: 1.026 ± 0.452
0.719TrpGlu: 0.719 ± 0.263
1.026TrpPhe: 1.026 ± 0.396
1.026TrpGly: 1.026 ± 0.253
0.205TrpHis: 0.205 ± 0.125
0.924TrpIle: 0.924 ± 0.303
1.129TrpLys: 1.129 ± 0.452
1.334TrpLeu: 1.334 ± 0.445
0.308TrpMet: 0.308 ± 0.176
1.232TrpAsn: 1.232 ± 0.308
0.308TrpPro: 0.308 ± 0.179
0.821TrpGln: 0.821 ± 0.311
0.308TrpArg: 0.308 ± 0.23
0.821TrpSer: 0.821 ± 0.274
0.719TrpThr: 0.719 ± 0.245
0.821TrpVal: 0.821 ± 0.289
0.205TrpTrp: 0.205 ± 0.147
1.232TrpTyr: 1.232 ± 0.363
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.54TyrAla: 1.54 ± 0.452
0.616TyrCys: 0.616 ± 0.3
2.566TyrAsp: 2.566 ± 0.551
4.106TyrGlu: 4.106 ± 0.63
2.156TyrPhe: 2.156 ± 0.438
3.285TyrGly: 3.285 ± 0.567
0.924TyrHis: 0.924 ± 0.328
3.182TyrIle: 3.182 ± 0.585
2.874TyrLys: 2.874 ± 0.753
3.901TyrLeu: 3.901 ± 0.747
1.129TyrMet: 1.129 ± 0.309
4.106TyrAsn: 4.106 ± 0.601
1.026TyrPro: 1.026 ± 0.376
1.54TyrGln: 1.54 ± 0.477
1.129TyrArg: 1.129 ± 0.339
2.053TyrSer: 2.053 ± 0.503
3.079TyrThr: 3.079 ± 0.578
2.566TyrVal: 2.566 ± 0.589
0.513TyrTrp: 0.513 ± 0.235
2.053TyrTyr: 2.053 ± 0.503
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (9743 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski