Amino acid dipepetide frequency for Lactococcus phage 98201

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.066AlaAla: 4.066 ± 0.874
0.177AlaCys: 0.177 ± 0.12
4.331AlaAsp: 4.331 ± 0.574
3.094AlaGlu: 3.094 ± 0.478
1.856AlaPhe: 1.856 ± 0.344
5.392AlaGly: 5.392 ± 0.814
1.414AlaHis: 1.414 ± 0.459
4.243AlaIle: 4.243 ± 0.86
5.215AlaLys: 5.215 ± 0.659
6.452AlaLeu: 6.452 ± 1.07
1.414AlaMet: 1.414 ± 0.266
3.889AlaAsn: 3.889 ± 0.673
2.386AlaPro: 2.386 ± 0.389
2.563AlaGln: 2.563 ± 0.377
2.033AlaArg: 2.033 ± 0.502
4.154AlaSer: 4.154 ± 0.774
4.773AlaThr: 4.773 ± 0.754
4.154AlaVal: 4.154 ± 0.545
0.884AlaTrp: 0.884 ± 0.287
2.121AlaTyr: 2.121 ± 0.513
0.0AlaXaa: 0.0 ± 0.0
Cys
0.088CysAla: 0.088 ± 0.092
0.0CysCys: 0.0 ± 0.0
0.442CysAsp: 0.442 ± 0.194
0.795CysGlu: 0.795 ± 0.263
0.53CysPhe: 0.53 ± 0.26
0.707CysGly: 0.707 ± 0.252
0.53CysHis: 0.53 ± 0.301
0.265CysIle: 0.265 ± 0.143
0.265CysLys: 0.265 ± 0.149
0.177CysLeu: 0.177 ± 0.118
0.088CysMet: 0.088 ± 0.089
0.0CysAsn: 0.0 ± 0.0
0.088CysPro: 0.088 ± 0.088
0.088CysGln: 0.088 ± 0.079
0.354CysArg: 0.354 ± 0.147
0.354CysSer: 0.354 ± 0.213
0.442CysThr: 0.442 ± 0.204
0.0CysVal: 0.0 ± 0.0
0.088CysTrp: 0.088 ± 0.089
0.265CysTyr: 0.265 ± 0.141
0.0CysXaa: 0.0 ± 0.0
Asp
3.094AspAla: 3.094 ± 0.583
0.53AspCys: 0.53 ± 0.258
4.154AspAsp: 4.154 ± 0.709
5.303AspGlu: 5.303 ± 0.886
3.359AspPhe: 3.359 ± 0.551
6.01AspGly: 6.01 ± 1.163
0.354AspHis: 0.354 ± 0.155
4.066AspIle: 4.066 ± 0.432
5.568AspLys: 5.568 ± 0.753
4.331AspLeu: 4.331 ± 0.566
1.679AspMet: 1.679 ± 0.423
3.712AspAsn: 3.712 ± 0.546
1.414AspPro: 1.414 ± 0.38
1.149AspGln: 1.149 ± 0.304
2.121AspArg: 2.121 ± 0.314
4.419AspSer: 4.419 ± 0.664
3.27AspThr: 3.27 ± 0.673
3.977AspVal: 3.977 ± 0.634
0.53AspTrp: 0.53 ± 0.224
3.005AspTyr: 3.005 ± 0.471
0.0AspXaa: 0.0 ± 0.0
Glu
3.712GluAla: 3.712 ± 0.77
0.354GluCys: 0.354 ± 0.171
3.182GluAsp: 3.182 ± 0.486
6.187GluGlu: 6.187 ± 1.171
3.801GluPhe: 3.801 ± 0.697
2.298GluGly: 2.298 ± 0.395
1.237GluHis: 1.237 ± 0.3
6.275GluIle: 6.275 ± 0.896
5.392GluLys: 5.392 ± 1.114
7.071GluLeu: 7.071 ± 1.221
1.856GluMet: 1.856 ± 0.403
3.624GluAsn: 3.624 ± 0.769
1.944GluPro: 1.944 ± 0.466
3.094GluGln: 3.094 ± 0.685
3.005GluArg: 3.005 ± 0.703
3.712GluSer: 3.712 ± 0.663
4.508GluThr: 4.508 ± 0.666
4.154GluVal: 4.154 ± 0.653
1.503GluTrp: 1.503 ± 0.448
3.094GluTyr: 3.094 ± 0.522
0.0GluXaa: 0.0 ± 0.0
Phe
2.828PheAla: 2.828 ± 0.479
0.177PheCys: 0.177 ± 0.12
2.652PheAsp: 2.652 ± 0.529
3.801PheGlu: 3.801 ± 0.69
2.298PhePhe: 2.298 ± 0.425
2.386PheGly: 2.386 ± 0.363
0.53PheHis: 0.53 ± 0.233
3.535PheIle: 3.535 ± 0.522
4.154PheLys: 4.154 ± 0.521
2.652PheLeu: 2.652 ± 0.458
1.503PheMet: 1.503 ± 0.412
2.21PheAsn: 2.21 ± 0.381
1.237PhePro: 1.237 ± 0.3
0.972PheGln: 0.972 ± 0.235
1.944PheArg: 1.944 ± 0.532
3.005PheSer: 3.005 ± 0.471
2.828PheThr: 2.828 ± 0.501
2.563PheVal: 2.563 ± 0.541
0.795PheTrp: 0.795 ± 0.25
2.386PheTyr: 2.386 ± 0.492
0.0PheXaa: 0.0 ± 0.0
Gly
2.917GlyAla: 2.917 ± 0.585
0.354GlyCys: 0.354 ± 0.167
3.712GlyAsp: 3.712 ± 0.373
3.27GlyGlu: 3.27 ± 0.662
4.243GlyPhe: 4.243 ± 0.631
4.95GlyGly: 4.95 ± 0.728
1.149GlyHis: 1.149 ± 0.343
6.629GlyIle: 6.629 ± 0.779
5.833GlyLys: 5.833 ± 0.743
5.126GlyLeu: 5.126 ± 0.805
1.149GlyMet: 1.149 ± 0.295
4.596GlyAsn: 4.596 ± 0.651
1.061GlyPro: 1.061 ± 0.385
3.094GlyGln: 3.094 ± 0.452
2.033GlyArg: 2.033 ± 0.394
5.48GlySer: 5.48 ± 0.891
5.833GlyThr: 5.833 ± 0.93
3.359GlyVal: 3.359 ± 0.496
1.326GlyTrp: 1.326 ± 0.379
2.386GlyTyr: 2.386 ± 0.309
0.0GlyXaa: 0.0 ± 0.0
His
0.795HisAla: 0.795 ± 0.351
0.265HisCys: 0.265 ± 0.152
0.972HisAsp: 0.972 ± 0.317
1.237HisGlu: 1.237 ± 0.406
0.265HisPhe: 0.265 ± 0.152
1.326HisGly: 1.326 ± 0.45
0.53HisHis: 0.53 ± 0.196
0.884HisIle: 0.884 ± 0.272
0.707HisLys: 0.707 ± 0.261
0.884HisLeu: 0.884 ± 0.234
0.265HisMet: 0.265 ± 0.143
0.53HisAsn: 0.53 ± 0.229
0.354HisPro: 0.354 ± 0.217
0.442HisGln: 0.442 ± 0.211
0.354HisArg: 0.354 ± 0.159
0.972HisSer: 0.972 ± 0.358
0.884HisThr: 0.884 ± 0.27
0.795HisVal: 0.795 ± 0.315
0.088HisTrp: 0.088 ± 0.079
0.795HisTyr: 0.795 ± 0.35
0.0HisXaa: 0.0 ± 0.0
Ile
4.243IleAla: 4.243 ± 0.798
0.265IleCys: 0.265 ± 0.162
5.657IleAsp: 5.657 ± 0.545
4.331IleGlu: 4.331 ± 0.772
1.768IlePhe: 1.768 ± 0.491
4.243IleGly: 4.243 ± 0.723
1.061IleHis: 1.061 ± 0.365
5.568IleIle: 5.568 ± 1.405
6.187IleLys: 6.187 ± 0.688
4.331IleLeu: 4.331 ± 0.683
1.944IleMet: 1.944 ± 0.467
4.596IleAsn: 4.596 ± 0.607
3.801IlePro: 3.801 ± 0.649
3.094IleGln: 3.094 ± 0.446
2.563IleArg: 2.563 ± 0.481
6.717IleSer: 6.717 ± 0.869
4.773IleThr: 4.773 ± 0.806
4.154IleVal: 4.154 ± 0.582
0.619IleTrp: 0.619 ± 0.263
1.237IleTyr: 1.237 ± 0.367
0.0IleXaa: 0.0 ± 0.0
Lys
6.364LysAla: 6.364 ± 0.942
0.619LysCys: 0.619 ± 0.224
4.331LysAsp: 4.331 ± 0.655
6.01LysGlu: 6.01 ± 1.001
3.624LysPhe: 3.624 ± 0.715
5.48LysGly: 5.48 ± 0.62
0.707LysHis: 0.707 ± 0.248
4.596LysIle: 4.596 ± 0.594
6.541LysLys: 6.541 ± 1.083
6.541LysLeu: 6.541 ± 0.828
2.563LysMet: 2.563 ± 0.427
4.684LysAsn: 4.684 ± 0.675
2.298LysPro: 2.298 ± 0.507
3.535LysGln: 3.535 ± 0.617
3.005LysArg: 3.005 ± 0.433
5.392LysSer: 5.392 ± 0.842
5.657LysThr: 5.657 ± 0.76
4.243LysVal: 4.243 ± 0.504
1.326LysTrp: 1.326 ± 0.397
2.121LysTyr: 2.121 ± 0.397
0.0LysXaa: 0.0 ± 0.0
Leu
5.48LeuAla: 5.48 ± 0.766
0.265LeuCys: 0.265 ± 0.163
3.27LeuAsp: 3.27 ± 0.644
5.745LeuGlu: 5.745 ± 0.865
3.624LeuPhe: 3.624 ± 0.497
4.95LeuGly: 4.95 ± 0.791
0.177LeuHis: 0.177 ± 0.147
5.303LeuIle: 5.303 ± 0.979
5.657LeuLys: 5.657 ± 0.71
4.419LeuLeu: 4.419 ± 0.825
2.21LeuMet: 2.21 ± 0.548
3.977LeuAsn: 3.977 ± 0.569
4.331LeuPro: 4.331 ± 0.9
2.917LeuGln: 2.917 ± 0.608
3.182LeuArg: 3.182 ± 0.57
6.275LeuSer: 6.275 ± 0.707
6.717LeuThr: 6.717 ± 0.901
4.243LeuVal: 4.243 ± 0.857
1.061LeuTrp: 1.061 ± 0.357
2.121LeuTyr: 2.121 ± 0.389
0.0LeuXaa: 0.0 ± 0.0
Met
1.856MetAla: 1.856 ± 0.404
0.177MetCys: 0.177 ± 0.177
1.414MetAsp: 1.414 ± 0.368
1.944MetGlu: 1.944 ± 0.39
0.972MetPhe: 0.972 ± 0.317
1.768MetGly: 1.768 ± 0.411
0.177MetHis: 0.177 ± 0.125
1.326MetIle: 1.326 ± 0.405
2.121MetLys: 2.121 ± 0.46
1.326MetLeu: 1.326 ± 0.302
0.53MetMet: 0.53 ± 0.222
1.503MetAsn: 1.503 ± 0.34
0.354MetPro: 0.354 ± 0.142
1.414MetGln: 1.414 ± 0.375
1.237MetArg: 1.237 ± 0.29
1.679MetSer: 1.679 ± 0.397
2.652MetThr: 2.652 ± 0.442
0.795MetVal: 0.795 ± 0.26
0.088MetTrp: 0.088 ± 0.081
0.265MetTyr: 0.265 ± 0.153
0.0MetXaa: 0.0 ± 0.0
Asn
3.801AsnAla: 3.801 ± 0.814
0.265AsnCys: 0.265 ± 0.145
4.154AsnAsp: 4.154 ± 0.552
2.652AsnGlu: 2.652 ± 0.589
3.359AsnPhe: 3.359 ± 0.605
7.071AsnGly: 7.071 ± 0.914
0.53AsnHis: 0.53 ± 0.191
4.331AsnIle: 4.331 ± 0.472
4.066AsnLys: 4.066 ± 0.571
3.889AsnLeu: 3.889 ± 0.582
0.972AsnMet: 0.972 ± 0.295
3.801AsnAsn: 3.801 ± 0.833
1.856AsnPro: 1.856 ± 0.482
2.475AsnGln: 2.475 ± 0.454
2.475AsnArg: 2.475 ± 0.401
3.977AsnSer: 3.977 ± 0.728
2.917AsnThr: 2.917 ± 0.544
4.243AsnVal: 4.243 ± 0.54
0.53AsnTrp: 0.53 ± 0.182
2.74AsnTyr: 2.74 ± 0.47
0.0AsnXaa: 0.0 ± 0.0
Pro
1.149ProAla: 1.149 ± 0.335
0.0ProCys: 0.0 ± 0.0
1.944ProAsp: 1.944 ± 0.471
1.768ProGlu: 1.768 ± 0.425
1.061ProPhe: 1.061 ± 0.293
1.149ProGly: 1.149 ± 0.393
0.619ProHis: 0.619 ± 0.2
1.944ProIle: 1.944 ± 0.299
2.475ProLys: 2.475 ± 0.54
2.475ProLeu: 2.475 ± 0.626
0.619ProMet: 0.619 ± 0.206
1.944ProAsn: 1.944 ± 0.393
1.237ProPro: 1.237 ± 0.296
2.917ProGln: 2.917 ± 0.525
0.972ProArg: 0.972 ± 0.382
3.005ProSer: 3.005 ± 0.441
2.652ProThr: 2.652 ± 0.637
1.768ProVal: 1.768 ± 0.33
0.265ProTrp: 0.265 ± 0.166
0.884ProTyr: 0.884 ± 0.233
0.0ProXaa: 0.0 ± 0.0
Gln
3.889GlnAla: 3.889 ± 0.77
0.177GlnCys: 0.177 ± 0.122
1.768GlnAsp: 1.768 ± 0.384
3.27GlnGlu: 3.27 ± 0.596
2.033GlnPhe: 2.033 ± 0.456
2.298GlnGly: 2.298 ± 0.509
0.707GlnHis: 0.707 ± 0.201
2.563GlnIle: 2.563 ± 0.604
3.182GlnLys: 3.182 ± 0.499
4.154GlnLeu: 4.154 ± 0.756
1.503GlnMet: 1.503 ± 0.364
2.121GlnAsn: 2.121 ± 0.346
1.061GlnPro: 1.061 ± 0.308
1.768GlnGln: 1.768 ± 0.316
1.503GlnArg: 1.503 ± 0.435
2.386GlnSer: 2.386 ± 0.381
2.563GlnThr: 2.563 ± 0.41
2.475GlnVal: 2.475 ± 0.467
0.177GlnTrp: 0.177 ± 0.115
1.326GlnTyr: 1.326 ± 0.337
0.0GlnXaa: 0.0 ± 0.0
Arg
2.21ArgAla: 2.21 ± 0.53
0.354ArgCys: 0.354 ± 0.222
3.005ArgAsp: 3.005 ± 0.474
3.535ArgGlu: 3.535 ± 0.683
2.033ArgPhe: 2.033 ± 0.594
1.503ArgGly: 1.503 ± 0.418
0.707ArgHis: 0.707 ± 0.275
2.21ArgIle: 2.21 ± 0.411
2.917ArgLys: 2.917 ± 0.376
3.005ArgLeu: 3.005 ± 0.619
1.061ArgMet: 1.061 ± 0.299
2.298ArgAsn: 2.298 ± 0.441
0.884ArgPro: 0.884 ± 0.315
1.326ArgGln: 1.326 ± 0.316
1.856ArgArg: 1.856 ± 0.376
1.326ArgSer: 1.326 ± 0.387
2.121ArgThr: 2.121 ± 0.435
2.298ArgVal: 2.298 ± 0.44
0.619ArgTrp: 0.619 ± 0.212
1.414ArgTyr: 1.414 ± 0.401
0.0ArgXaa: 0.0 ± 0.0
Ser
4.596SerAla: 4.596 ± 0.635
0.177SerCys: 0.177 ± 0.106
4.95SerAsp: 4.95 ± 0.637
4.95SerGlu: 4.95 ± 0.703
3.27SerPhe: 3.27 ± 0.397
6.099SerGly: 6.099 ± 1.061
0.619SerHis: 0.619 ± 0.24
3.977SerIle: 3.977 ± 0.663
6.099SerLys: 6.099 ± 0.87
4.684SerLeu: 4.684 ± 0.697
1.503SerMet: 1.503 ± 0.354
5.568SerAsn: 5.568 ± 0.635
1.326SerPro: 1.326 ± 0.358
1.856SerGln: 1.856 ± 0.387
2.21SerArg: 2.21 ± 0.466
6.717SerSer: 6.717 ± 1.253
3.977SerThr: 3.977 ± 0.696
5.126SerVal: 5.126 ± 0.829
0.795SerTrp: 0.795 ± 0.364
2.917SerTyr: 2.917 ± 0.42
0.0SerXaa: 0.0 ± 0.0
Thr
6.187ThrAla: 6.187 ± 0.977
0.795ThrCys: 0.795 ± 0.328
3.801ThrAsp: 3.801 ± 0.777
4.419ThrGlu: 4.419 ± 0.636
2.121ThrPhe: 2.121 ± 0.494
5.745ThrGly: 5.745 ± 0.814
1.149ThrHis: 1.149 ± 0.337
5.126ThrIle: 5.126 ± 0.784
4.95ThrLys: 4.95 ± 0.599
5.657ThrLeu: 5.657 ± 0.891
0.795ThrMet: 0.795 ± 0.213
5.392ThrAsn: 5.392 ± 0.98
2.917ThrPro: 2.917 ± 0.485
2.828ThrGln: 2.828 ± 0.496
2.475ThrArg: 2.475 ± 0.39
3.977ThrSer: 3.977 ± 0.771
6.099ThrThr: 6.099 ± 1.392
4.95ThrVal: 4.95 ± 0.93
0.795ThrTrp: 0.795 ± 0.277
2.298ThrTyr: 2.298 ± 0.535
0.0ThrXaa: 0.0 ± 0.0
Val
4.154ValAla: 4.154 ± 0.507
0.177ValCys: 0.177 ± 0.12
4.419ValAsp: 4.419 ± 0.662
4.773ValGlu: 4.773 ± 0.642
1.944ValPhe: 1.944 ± 0.3
3.182ValGly: 3.182 ± 0.474
0.53ValHis: 0.53 ± 0.221
5.126ValIle: 5.126 ± 0.591
4.684ValLys: 4.684 ± 0.729
4.066ValLeu: 4.066 ± 0.784
1.061ValMet: 1.061 ± 0.251
3.182ValAsn: 3.182 ± 0.695
1.414ValPro: 1.414 ± 0.407
2.74ValGln: 2.74 ± 0.752
1.591ValArg: 1.591 ± 0.413
4.596ValSer: 4.596 ± 0.545
5.745ValThr: 5.745 ± 0.697
4.154ValVal: 4.154 ± 0.702
0.442ValTrp: 0.442 ± 0.176
2.121ValTyr: 2.121 ± 0.411
0.0ValXaa: 0.0 ± 0.0
Trp
0.707TrpAla: 0.707 ± 0.353
0.177TrpCys: 0.177 ± 0.116
1.061TrpAsp: 1.061 ± 0.319
0.619TrpGlu: 0.619 ± 0.238
0.619TrpPhe: 0.619 ± 0.244
0.265TrpGly: 0.265 ± 0.15
0.177TrpHis: 0.177 ± 0.105
0.884TrpIle: 0.884 ± 0.215
0.884TrpLys: 0.884 ± 0.29
1.326TrpLeu: 1.326 ± 0.414
0.442TrpMet: 0.442 ± 0.182
0.795TrpAsn: 0.795 ± 0.248
0.088TrpPro: 0.088 ± 0.093
0.795TrpGln: 0.795 ± 0.224
0.354TrpArg: 0.354 ± 0.174
0.795TrpSer: 0.795 ± 0.281
1.237TrpThr: 1.237 ± 0.688
0.707TrpVal: 0.707 ± 0.205
0.442TrpTrp: 0.442 ± 0.23
0.53TrpTyr: 0.53 ± 0.19
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.475TyrAla: 2.475 ± 0.523
0.265TyrCys: 0.265 ± 0.146
2.828TyrAsp: 2.828 ± 0.555
2.298TyrGlu: 2.298 ± 0.585
1.679TyrPhe: 1.679 ± 0.37
1.679TyrGly: 1.679 ± 0.448
0.442TyrHis: 0.442 ± 0.227
2.298TyrIle: 2.298 ± 0.52
2.828TyrLys: 2.828 ± 0.516
3.359TyrLeu: 3.359 ± 0.664
0.354TyrMet: 0.354 ± 0.161
1.679TyrAsn: 1.679 ± 0.394
0.795TyrPro: 0.795 ± 0.204
1.856TyrGln: 1.856 ± 0.344
1.414TyrArg: 1.414 ± 0.345
2.475TyrSer: 2.475 ± 0.45
2.828TyrThr: 2.828 ± 0.542
1.944TyrVal: 1.944 ± 0.41
0.53TyrTrp: 0.53 ± 0.223
1.679TyrTyr: 1.679 ± 0.366
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 52 proteins (11315 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski