Amino acid dipepetide frequency for Lactococcus phage 936 group phage PhiA1127

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.034AlaAla: 1.034 ± 0.448
0.23AlaCys: 0.23 ± 0.153
3.446AlaAsp: 3.446 ± 0.703
5.284AlaGlu: 5.284 ± 0.906
3.791AlaPhe: 3.791 ± 0.874
4.365AlaGly: 4.365 ± 0.668
0.919AlaHis: 0.919 ± 0.327
5.169AlaIle: 5.169 ± 1.025
6.318AlaLys: 6.318 ± 0.801
5.744AlaLeu: 5.744 ± 0.894
2.757AlaMet: 2.757 ± 0.481
4.71AlaAsn: 4.71 ± 1.023
0.689AlaPro: 0.689 ± 0.282
2.642AlaGln: 2.642 ± 0.686
2.527AlaArg: 2.527 ± 0.498
3.331AlaSer: 3.331 ± 0.896
4.021AlaThr: 4.021 ± 0.949
4.021AlaVal: 4.021 ± 0.847
1.723AlaTrp: 1.723 ± 0.51
2.068AlaTyr: 2.068 ± 0.46
0.0AlaXaa: 0.0 ± 0.0
Cys
0.345CysAla: 0.345 ± 0.182
0.115CysCys: 0.115 ± 0.11
0.23CysAsp: 0.23 ± 0.161
0.23CysGlu: 0.23 ± 0.14
0.23CysPhe: 0.23 ± 0.166
0.804CysGly: 0.804 ± 0.34
0.115CysHis: 0.115 ± 0.149
0.46CysIle: 0.46 ± 0.238
0.689CysLys: 0.689 ± 0.39
0.23CysLeu: 0.23 ± 0.164
0.115CysMet: 0.115 ± 0.117
0.804CysAsn: 0.804 ± 0.304
0.115CysPro: 0.115 ± 0.135
0.46CysGln: 0.46 ± 0.288
0.345CysArg: 0.345 ± 0.186
0.23CysSer: 0.23 ± 0.164
0.23CysThr: 0.23 ± 0.147
0.23CysVal: 0.23 ± 0.18
0.0CysTrp: 0.0 ± 0.0
0.115CysTyr: 0.115 ± 0.119
0.0CysXaa: 0.0 ± 0.0
Asp
2.068AspAla: 2.068 ± 0.595
0.23AspCys: 0.23 ± 0.147
3.217AspAsp: 3.217 ± 0.679
4.021AspGlu: 4.021 ± 0.88
3.217AspPhe: 3.217 ± 0.617
4.71AspGly: 4.71 ± 0.775
0.689AspHis: 0.689 ± 0.296
3.791AspIle: 3.791 ± 0.659
4.94AspLys: 4.94 ± 0.651
6.203AspLeu: 6.203 ± 0.892
1.034AspMet: 1.034 ± 0.288
4.136AspAsn: 4.136 ± 0.672
1.723AspPro: 1.723 ± 0.439
0.574AspGln: 0.574 ± 0.273
1.608AspArg: 1.608 ± 0.483
2.642AspSer: 2.642 ± 0.573
3.561AspThr: 3.561 ± 0.678
3.102AspVal: 3.102 ± 0.602
0.804AspTrp: 0.804 ± 0.266
3.676AspTyr: 3.676 ± 0.656
0.0AspXaa: 0.0 ± 0.0
Glu
5.055GluAla: 5.055 ± 0.797
0.46GluCys: 0.46 ± 0.238
2.987GluAsp: 2.987 ± 0.583
5.055GluGlu: 5.055 ± 0.932
3.676GluPhe: 3.676 ± 0.47
2.298GluGly: 2.298 ± 0.525
1.379GluHis: 1.379 ± 0.407
5.744GluIle: 5.744 ± 0.833
6.203GluLys: 6.203 ± 1.146
10.454GluLeu: 10.454 ± 1.331
2.527GluMet: 2.527 ± 0.468
4.48GluAsn: 4.48 ± 0.788
1.149GluPro: 1.149 ± 0.341
4.136GluGln: 4.136 ± 0.784
3.217GluArg: 3.217 ± 0.609
3.676GluSer: 3.676 ± 0.497
5.284GluThr: 5.284 ± 0.923
3.906GluVal: 3.906 ± 0.67
0.919GluTrp: 0.919 ± 0.356
2.987GluTyr: 2.987 ± 0.684
0.0GluXaa: 0.0 ± 0.0
Phe
3.217PheAla: 3.217 ± 0.736
0.23PheCys: 0.23 ± 0.163
2.987PheAsp: 2.987 ± 0.562
3.217PheGlu: 3.217 ± 0.763
1.723PhePhe: 1.723 ± 0.501
1.723PheGly: 1.723 ± 0.46
0.0PheHis: 0.0 ± 0.0
3.446PheIle: 3.446 ± 0.643
3.791PheLys: 3.791 ± 0.749
2.183PheLeu: 2.183 ± 0.489
0.689PheMet: 0.689 ± 0.255
3.331PheAsn: 3.331 ± 0.779
1.149PhePro: 1.149 ± 0.445
1.379PheGln: 1.379 ± 0.4
1.493PheArg: 1.493 ± 0.353
3.446PheSer: 3.446 ± 0.804
2.757PheThr: 2.757 ± 0.482
2.642PheVal: 2.642 ± 0.566
0.23PheTrp: 0.23 ± 0.142
1.608PheTyr: 1.608 ± 0.39
0.0PheXaa: 0.0 ± 0.0
Gly
4.021GlyAla: 4.021 ± 1.115
0.345GlyCys: 0.345 ± 0.212
2.872GlyAsp: 2.872 ± 0.731
4.136GlyGlu: 4.136 ± 0.658
2.298GlyPhe: 2.298 ± 0.434
4.365GlyGly: 4.365 ± 0.913
0.46GlyHis: 0.46 ± 0.206
4.136GlyIle: 4.136 ± 0.792
6.318GlyLys: 6.318 ± 0.819
5.744GlyLeu: 5.744 ± 1.196
1.379GlyMet: 1.379 ± 0.432
3.217GlyAsn: 3.217 ± 0.782
0.115GlyPro: 0.115 ± 0.112
2.068GlyGln: 2.068 ± 0.433
1.608GlyArg: 1.608 ± 0.266
4.25GlySer: 4.25 ± 0.865
3.676GlyThr: 3.676 ± 0.883
5.514GlyVal: 5.514 ± 1.1
1.264GlyTrp: 1.264 ± 0.351
2.987GlyTyr: 2.987 ± 0.525
0.0GlyXaa: 0.0 ± 0.0
His
1.034HisAla: 1.034 ± 0.313
0.46HisCys: 0.46 ± 0.24
0.574HisAsp: 0.574 ± 0.323
0.804HisGlu: 0.804 ± 0.312
0.345HisPhe: 0.345 ± 0.207
1.838HisGly: 1.838 ± 0.451
0.0HisHis: 0.0 ± 0.0
0.689HisIle: 0.689 ± 0.26
0.46HisLys: 0.46 ± 0.212
0.919HisLeu: 0.919 ± 0.34
0.0HisMet: 0.0 ± 0.0
1.608HisAsn: 1.608 ± 0.412
0.23HisPro: 0.23 ± 0.14
0.23HisGln: 0.23 ± 0.157
0.345HisArg: 0.345 ± 0.178
0.574HisSer: 0.574 ± 0.347
0.804HisThr: 0.804 ± 0.31
0.689HisVal: 0.689 ± 0.262
0.115HisTrp: 0.115 ± 0.083
0.574HisTyr: 0.574 ± 0.253
0.0HisXaa: 0.0 ± 0.0
Ile
5.284IleAla: 5.284 ± 0.666
0.23IleCys: 0.23 ± 0.163
4.825IleAsp: 4.825 ± 0.662
6.203IleGlu: 6.203 ± 0.963
2.642IlePhe: 2.642 ± 0.662
3.791IleGly: 3.791 ± 0.906
0.919IleHis: 0.919 ± 0.303
4.48IleIle: 4.48 ± 0.711
6.318IleLys: 6.318 ± 0.733
5.284IleLeu: 5.284 ± 0.846
1.264IleMet: 1.264 ± 0.367
5.399IleAsn: 5.399 ± 0.618
2.068IlePro: 2.068 ± 0.422
2.412IleGln: 2.412 ± 0.486
1.723IleArg: 1.723 ± 0.502
3.676IleSer: 3.676 ± 0.8
4.71IleThr: 4.71 ± 0.649
4.48IleVal: 4.48 ± 0.587
0.804IleTrp: 0.804 ± 0.258
2.757IleTyr: 2.757 ± 0.546
0.0IleXaa: 0.0 ± 0.0
Lys
6.778LysAla: 6.778 ± 0.988
0.345LysCys: 0.345 ± 0.205
5.399LysAsp: 5.399 ± 0.601
7.122LysGlu: 7.122 ± 1.281
1.608LysPhe: 1.608 ± 0.454
5.055LysGly: 5.055 ± 0.694
1.379LysHis: 1.379 ± 0.538
5.744LysIle: 5.744 ± 0.932
8.96LysLys: 8.96 ± 1.155
8.041LysLeu: 8.041 ± 0.756
3.102LysMet: 3.102 ± 0.622
5.974LysAsn: 5.974 ± 0.793
1.838LysPro: 1.838 ± 0.553
3.791LysGln: 3.791 ± 0.707
3.791LysArg: 3.791 ± 0.826
5.169LysSer: 5.169 ± 0.725
4.825LysThr: 4.825 ± 0.762
6.088LysVal: 6.088 ± 0.751
1.379LysTrp: 1.379 ± 0.305
3.791LysTyr: 3.791 ± 0.694
0.0LysXaa: 0.0 ± 0.0
Leu
5.399LeuAla: 5.399 ± 0.587
0.23LeuCys: 0.23 ± 0.189
4.71LeuAsp: 4.71 ± 0.583
5.399LeuGlu: 5.399 ± 0.846
3.446LeuPhe: 3.446 ± 0.633
4.595LeuGly: 4.595 ± 0.919
1.379LeuHis: 1.379 ± 0.353
6.203LeuIle: 6.203 ± 0.864
8.616LeuLys: 8.616 ± 0.854
6.318LeuLeu: 6.318 ± 0.958
1.608LeuMet: 1.608 ± 0.46
5.514LeuAsn: 5.514 ± 0.86
3.217LeuPro: 3.217 ± 0.613
3.217LeuGln: 3.217 ± 0.583
3.102LeuArg: 3.102 ± 0.455
5.169LeuSer: 5.169 ± 0.82
6.548LeuThr: 6.548 ± 0.825
5.399LeuVal: 5.399 ± 0.779
1.493LeuTrp: 1.493 ± 0.387
4.595LeuTyr: 4.595 ± 0.758
0.0LeuXaa: 0.0 ± 0.0
Met
2.068MetAla: 2.068 ± 0.478
0.115MetCys: 0.115 ± 0.128
1.149MetAsp: 1.149 ± 0.386
2.068MetGlu: 2.068 ± 0.532
0.689MetPhe: 0.689 ± 0.294
0.919MetGly: 0.919 ± 0.31
0.345MetHis: 0.345 ± 0.212
2.642MetIle: 2.642 ± 0.685
2.412MetLys: 2.412 ± 0.485
1.034MetLeu: 1.034 ± 0.382
0.46MetMet: 0.46 ± 0.257
2.298MetAsn: 2.298 ± 0.517
0.574MetPro: 0.574 ± 0.288
1.838MetGln: 1.838 ± 0.348
0.574MetArg: 0.574 ± 0.237
1.838MetSer: 1.838 ± 0.417
1.838MetThr: 1.838 ± 0.529
0.919MetVal: 0.919 ± 0.276
0.115MetTrp: 0.115 ± 0.1
1.034MetTyr: 1.034 ± 0.352
0.0MetXaa: 0.0 ± 0.0
Asn
5.284AsnAla: 5.284 ± 1.137
0.115AsnCys: 0.115 ± 0.119
4.48AsnAsp: 4.48 ± 0.742
6.203AsnGlu: 6.203 ± 1.004
2.183AsnPhe: 2.183 ± 0.485
6.088AsnGly: 6.088 ± 0.915
0.804AsnHis: 0.804 ± 0.351
4.021AsnIle: 4.021 ± 0.629
5.859AsnLys: 5.859 ± 1.114
6.088AsnLeu: 6.088 ± 0.932
1.493AsnMet: 1.493 ± 0.375
3.561AsnAsn: 3.561 ± 0.589
1.723AsnPro: 1.723 ± 0.49
2.183AsnGln: 2.183 ± 0.564
2.298AsnArg: 2.298 ± 0.453
4.595AsnSer: 4.595 ± 0.706
4.595AsnThr: 4.595 ± 0.899
4.136AsnVal: 4.136 ± 0.728
0.804AsnTrp: 0.804 ± 0.354
2.642AsnTyr: 2.642 ± 0.672
0.0AsnXaa: 0.0 ± 0.0
Pro
1.953ProAla: 1.953 ± 0.376
0.23ProCys: 0.23 ± 0.151
1.608ProAsp: 1.608 ± 0.446
1.838ProGlu: 1.838 ± 0.505
0.919ProPhe: 0.919 ± 0.3
0.345ProGly: 0.345 ± 0.189
0.115ProHis: 0.115 ± 0.134
1.493ProIle: 1.493 ± 0.337
2.068ProLys: 2.068 ± 0.543
1.838ProLeu: 1.838 ± 0.405
0.689ProMet: 0.689 ± 0.233
2.527ProAsn: 2.527 ± 0.831
0.689ProPro: 0.689 ± 0.296
0.804ProGln: 0.804 ± 0.282
0.574ProArg: 0.574 ± 0.216
0.689ProSer: 0.689 ± 0.25
2.298ProThr: 2.298 ± 0.458
1.838ProVal: 1.838 ± 0.485
0.345ProTrp: 0.345 ± 0.181
0.345ProTyr: 0.345 ± 0.168
0.0ProXaa: 0.0 ± 0.0
Gln
3.102GlnAla: 3.102 ± 0.673
0.23GlnCys: 0.23 ± 0.174
1.838GlnAsp: 1.838 ± 0.473
2.757GlnGlu: 2.757 ± 0.501
1.264GlnPhe: 1.264 ± 0.335
2.642GlnGly: 2.642 ± 0.448
0.46GlnHis: 0.46 ± 0.194
1.493GlnIle: 1.493 ± 0.361
3.561GlnLys: 3.561 ± 0.675
2.872GlnLeu: 2.872 ± 0.677
0.919GlnMet: 0.919 ± 0.265
2.183GlnAsn: 2.183 ± 0.359
1.379GlnPro: 1.379 ± 0.379
1.838GlnGln: 1.838 ± 0.535
1.608GlnArg: 1.608 ± 0.434
2.987GlnSer: 2.987 ± 0.483
1.608GlnThr: 1.608 ± 0.42
2.642GlnVal: 2.642 ± 0.602
0.574GlnTrp: 0.574 ± 0.25
1.608GlnTyr: 1.608 ± 0.377
0.0GlnXaa: 0.0 ± 0.0
Arg
2.412ArgAla: 2.412 ± 0.621
0.345ArgCys: 0.345 ± 0.195
1.723ArgAsp: 1.723 ± 0.47
1.953ArgGlu: 1.953 ± 0.441
0.804ArgPhe: 0.804 ± 0.291
1.953ArgGly: 1.953 ± 0.507
0.919ArgHis: 0.919 ± 0.297
2.183ArgIle: 2.183 ± 0.532
4.71ArgLys: 4.71 ± 0.951
3.676ArgLeu: 3.676 ± 0.706
0.804ArgMet: 0.804 ± 0.331
2.068ArgAsn: 2.068 ± 0.455
0.689ArgPro: 0.689 ± 0.268
1.608ArgGln: 1.608 ± 0.385
2.068ArgArg: 2.068 ± 0.583
2.068ArgSer: 2.068 ± 0.427
2.068ArgThr: 2.068 ± 0.523
2.183ArgVal: 2.183 ± 0.573
0.345ArgTrp: 0.345 ± 0.195
1.608ArgTyr: 1.608 ± 0.442
0.0ArgXaa: 0.0 ± 0.0
Ser
4.825SerAla: 4.825 ± 1.098
0.574SerCys: 0.574 ± 0.294
3.331SerAsp: 3.331 ± 0.579
3.906SerGlu: 3.906 ± 0.791
3.217SerPhe: 3.217 ± 0.512
4.595SerGly: 4.595 ± 1.436
0.574SerHis: 0.574 ± 0.278
5.169SerIle: 5.169 ± 0.71
4.48SerLys: 4.48 ± 0.969
6.088SerLeu: 6.088 ± 0.895
1.493SerMet: 1.493 ± 0.298
4.365SerAsn: 4.365 ± 0.841
1.379SerPro: 1.379 ± 0.428
1.723SerGln: 1.723 ± 0.498
2.298SerArg: 2.298 ± 0.536
4.48SerSer: 4.48 ± 0.821
3.102SerThr: 3.102 ± 0.827
4.136SerVal: 4.136 ± 0.726
0.689SerTrp: 0.689 ± 0.362
1.493SerTyr: 1.493 ± 0.404
0.0SerXaa: 0.0 ± 0.0
Thr
4.825ThrAla: 4.825 ± 0.679
0.345ThrCys: 0.345 ± 0.212
3.906ThrAsp: 3.906 ± 0.666
5.744ThrGlu: 5.744 ± 0.603
2.527ThrPhe: 2.527 ± 0.578
4.136ThrGly: 4.136 ± 0.596
0.0ThrHis: 0.0 ± 0.0
4.71ThrIle: 4.71 ± 0.921
4.825ThrLys: 4.825 ± 0.565
5.284ThrLeu: 5.284 ± 0.704
1.264ThrMet: 1.264 ± 0.344
4.365ThrAsn: 4.365 ± 0.767
1.723ThrPro: 1.723 ± 0.345
2.412ThrGln: 2.412 ± 0.613
1.838ThrArg: 1.838 ± 0.485
5.284ThrSer: 5.284 ± 0.723
4.365ThrThr: 4.365 ± 0.679
4.825ThrVal: 4.825 ± 0.877
0.919ThrTrp: 0.919 ± 0.32
2.298ThrTyr: 2.298 ± 0.533
0.0ThrXaa: 0.0 ± 0.0
Val
3.906ValAla: 3.906 ± 0.525
0.23ValCys: 0.23 ± 0.163
3.791ValAsp: 3.791 ± 0.786
5.284ValGlu: 5.284 ± 0.741
3.102ValPhe: 3.102 ± 0.687
2.872ValGly: 2.872 ± 0.391
0.46ValHis: 0.46 ± 0.254
4.365ValIle: 4.365 ± 0.573
5.744ValLys: 5.744 ± 0.747
3.561ValLeu: 3.561 ± 0.556
1.838ValMet: 1.838 ± 0.453
3.906ValAsn: 3.906 ± 0.968
1.264ValPro: 1.264 ± 0.453
1.953ValGln: 1.953 ± 0.397
3.102ValArg: 3.102 ± 0.734
4.94ValSer: 4.94 ± 0.901
5.974ValThr: 5.974 ± 0.849
3.217ValVal: 3.217 ± 0.721
0.46ValTrp: 0.46 ± 0.24
3.217ValTyr: 3.217 ± 0.597
0.0ValXaa: 0.0 ± 0.0
Trp
0.46TrpAla: 0.46 ± 0.211
0.345TrpCys: 0.345 ± 0.203
0.919TrpAsp: 0.919 ± 0.401
0.574TrpGlu: 0.574 ± 0.249
0.919TrpPhe: 0.919 ± 0.404
0.689TrpGly: 0.689 ± 0.304
0.23TrpHis: 0.23 ± 0.161
0.46TrpIle: 0.46 ± 0.229
1.034TrpLys: 1.034 ± 0.356
1.493TrpLeu: 1.493 ± 0.403
0.345TrpMet: 0.345 ± 0.199
1.264TrpAsn: 1.264 ± 0.369
0.0TrpPro: 0.0 ± 0.0
0.919TrpGln: 0.919 ± 0.268
0.46TrpArg: 0.46 ± 0.308
1.034TrpSer: 1.034 ± 0.264
0.689TrpThr: 0.689 ± 0.229
0.689TrpVal: 0.689 ± 0.314
0.0TrpTrp: 0.0 ± 0.0
0.919TrpTyr: 0.919 ± 0.346
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.723TyrAla: 1.723 ± 0.5
0.689TyrCys: 0.689 ± 0.325
1.953TyrAsp: 1.953 ± 0.578
3.791TyrGlu: 3.791 ± 0.772
2.642TyrPhe: 2.642 ± 0.536
2.987TyrGly: 2.987 ± 0.706
1.149TyrHis: 1.149 ± 0.307
2.872TyrIle: 2.872 ± 0.649
2.987TyrLys: 2.987 ± 0.663
3.102TyrLeu: 3.102 ± 0.826
1.034TyrMet: 1.034 ± 0.444
3.446TyrAsn: 3.446 ± 0.653
1.493TyrPro: 1.493 ± 0.435
1.379TyrGln: 1.379 ± 0.589
1.608TyrArg: 1.608 ± 0.392
1.838TyrSer: 1.838 ± 0.528
2.642TyrThr: 2.642 ± 0.675
2.527TyrVal: 2.527 ± 0.485
0.46TyrTrp: 0.46 ± 0.2
2.183TyrTyr: 2.183 ± 0.526
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 52 proteins (8706 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski