Amino acid dipepetide frequency for Lactobacillus phage JNU_P9

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.945AlaAla: 7.945 ± 1.217
0.482AlaCys: 0.482 ± 0.19
5.939AlaAsp: 5.939 ± 1.045
4.574AlaGlu: 4.574 ± 0.581
2.006AlaPhe: 2.006 ± 0.352
6.5AlaGly: 6.5 ± 0.91
1.364AlaHis: 1.364 ± 0.371
6.581AlaIle: 6.581 ± 0.719
7.062AlaLys: 7.062 ± 1.056
7.062AlaLeu: 7.062 ± 0.739
2.006AlaMet: 2.006 ± 0.425
5.939AlaAsn: 5.939 ± 0.702
2.408AlaPro: 2.408 ± 0.488
3.21AlaGln: 3.21 ± 0.722
2.648AlaArg: 2.648 ± 0.482
5.778AlaSer: 5.778 ± 0.666
4.895AlaThr: 4.895 ± 0.565
4.815AlaVal: 4.815 ± 0.987
2.327AlaTrp: 2.327 ± 0.548
3.29AlaTyr: 3.29 ± 0.535
0.0AlaXaa: 0.0 ± 0.0
Cys
0.401CysAla: 0.401 ± 0.193
0.161CysCys: 0.161 ± 0.116
0.241CysAsp: 0.241 ± 0.138
0.321CysGlu: 0.321 ± 0.156
0.401CysPhe: 0.401 ± 0.176
0.642CysGly: 0.642 ± 0.261
0.08CysHis: 0.08 ± 0.083
0.241CysIle: 0.241 ± 0.124
0.562CysLys: 0.562 ± 0.248
0.321CysLeu: 0.321 ± 0.163
0.08CysMet: 0.08 ± 0.078
0.161CysAsn: 0.161 ± 0.114
0.562CysPro: 0.562 ± 0.205
0.241CysGln: 0.241 ± 0.145
0.161CysArg: 0.161 ± 0.126
0.161CysSer: 0.161 ± 0.125
0.401CysThr: 0.401 ± 0.171
0.161CysVal: 0.161 ± 0.107
0.0CysTrp: 0.0 ± 0.0
0.321CysTyr: 0.321 ± 0.158
0.0CysXaa: 0.0 ± 0.0
Asp
6.019AspAla: 6.019 ± 0.771
0.321AspCys: 0.321 ± 0.177
5.297AspAsp: 5.297 ± 0.823
4.253AspGlu: 4.253 ± 0.73
2.408AspPhe: 2.408 ± 0.468
5.457AspGly: 5.457 ± 0.496
1.043AspHis: 1.043 ± 0.268
4.494AspIle: 4.494 ± 0.602
3.852AspLys: 3.852 ± 0.588
6.099AspLeu: 6.099 ± 0.596
1.445AspMet: 1.445 ± 0.297
3.13AspAsn: 3.13 ± 0.622
2.408AspPro: 2.408 ± 0.444
2.809AspGln: 2.809 ± 0.392
2.969AspArg: 2.969 ± 0.432
5.377AspSer: 5.377 ± 0.643
2.889AspThr: 2.889 ± 0.42
4.655AspVal: 4.655 ± 0.648
1.043AspTrp: 1.043 ± 0.247
3.21AspTyr: 3.21 ± 0.607
0.0AspXaa: 0.0 ± 0.0
Glu
4.655GluAla: 4.655 ± 0.677
0.482GluCys: 0.482 ± 0.165
3.13GluAsp: 3.13 ± 0.6
2.729GluGlu: 2.729 ± 0.626
1.525GluPhe: 1.525 ± 0.527
2.408GluGly: 2.408 ± 0.404
0.803GluHis: 0.803 ± 0.286
2.408GluIle: 2.408 ± 0.455
4.334GluLys: 4.334 ± 0.593
4.013GluLeu: 4.013 ± 0.637
2.167GluMet: 2.167 ± 0.43
2.327GluAsn: 2.327 ± 0.522
1.846GluPro: 1.846 ± 0.518
2.488GluGln: 2.488 ± 0.447
2.809GluArg: 2.809 ± 0.548
3.932GluSer: 3.932 ± 0.62
3.531GluThr: 3.531 ± 0.585
3.852GluVal: 3.852 ± 0.693
0.803GluTrp: 0.803 ± 0.267
1.846GluTyr: 1.846 ± 0.397
0.0GluXaa: 0.0 ± 0.0
Phe
2.568PheAla: 2.568 ± 0.432
0.241PheCys: 0.241 ± 0.134
2.408PheAsp: 2.408 ± 0.31
2.087PheGlu: 2.087 ± 0.416
0.883PhePhe: 0.883 ± 0.294
3.13PheGly: 3.13 ± 0.452
0.642PheHis: 0.642 ± 0.194
1.685PheIle: 1.685 ± 0.446
2.167PheLys: 2.167 ± 0.412
2.809PheLeu: 2.809 ± 0.542
0.883PheMet: 0.883 ± 0.236
1.685PheAsn: 1.685 ± 0.302
1.284PhePro: 1.284 ± 0.315
1.043PheGln: 1.043 ± 0.236
0.963PheArg: 0.963 ± 0.258
2.648PheSer: 2.648 ± 0.419
2.087PheThr: 2.087 ± 0.457
2.247PheVal: 2.247 ± 0.509
0.642PheTrp: 0.642 ± 0.227
0.562PheTyr: 0.562 ± 0.19
0.0PheXaa: 0.0 ± 0.0
Gly
4.735GlyAla: 4.735 ± 0.917
0.401GlyCys: 0.401 ± 0.189
5.858GlyAsp: 5.858 ± 0.522
2.327GlyGlu: 2.327 ± 0.407
3.29GlyPhe: 3.29 ± 0.48
4.735GlyGly: 4.735 ± 0.81
1.685GlyHis: 1.685 ± 0.477
4.655GlyIle: 4.655 ± 0.927
5.939GlyLys: 5.939 ± 0.791
4.815GlyLeu: 4.815 ± 1.017
2.247GlyMet: 2.247 ± 0.511
3.611GlyAsn: 3.611 ± 0.774
1.766GlyPro: 1.766 ± 0.443
2.568GlyGln: 2.568 ± 0.401
2.006GlyArg: 2.006 ± 0.511
4.334GlySer: 4.334 ± 0.61
4.735GlyThr: 4.735 ± 0.615
3.772GlyVal: 3.772 ± 0.633
1.124GlyTrp: 1.124 ± 0.256
3.692GlyTyr: 3.692 ± 0.655
0.0GlyXaa: 0.0 ± 0.0
His
1.204HisAla: 1.204 ± 0.35
0.08HisCys: 0.08 ± 0.083
1.043HisAsp: 1.043 ± 0.241
0.883HisGlu: 0.883 ± 0.301
1.043HisPhe: 1.043 ± 0.273
1.284HisGly: 1.284 ± 0.286
0.241HisHis: 0.241 ± 0.176
1.364HisIle: 1.364 ± 0.326
1.525HisLys: 1.525 ± 0.368
1.204HisLeu: 1.204 ± 0.276
0.401HisMet: 0.401 ± 0.169
1.124HisAsn: 1.124 ± 0.357
0.722HisPro: 0.722 ± 0.22
1.043HisGln: 1.043 ± 0.324
1.124HisArg: 1.124 ± 0.3
1.525HisSer: 1.525 ± 0.339
0.883HisThr: 0.883 ± 0.321
1.204HisVal: 1.204 ± 0.355
0.321HisTrp: 0.321 ± 0.155
0.722HisTyr: 0.722 ± 0.226
0.0HisXaa: 0.0 ± 0.0
Ile
6.179IleAla: 6.179 ± 0.775
0.241IleCys: 0.241 ± 0.138
5.216IleAsp: 5.216 ± 0.687
3.29IleGlu: 3.29 ± 0.53
1.605IlePhe: 1.605 ± 0.463
3.05IleGly: 3.05 ± 0.49
1.525IleHis: 1.525 ± 0.35
3.29IleIle: 3.29 ± 0.603
5.297IleLys: 5.297 ± 0.644
2.729IleLeu: 2.729 ± 0.412
1.525IleMet: 1.525 ± 0.301
3.531IleAsn: 3.531 ± 0.604
1.846IlePro: 1.846 ± 0.418
3.451IleGln: 3.451 ± 0.537
1.926IleArg: 1.926 ± 0.456
4.976IleSer: 4.976 ± 0.619
4.895IleThr: 4.895 ± 0.85
3.531IleVal: 3.531 ± 0.483
0.803IleTrp: 0.803 ± 0.259
2.488IleTyr: 2.488 ± 0.428
0.0IleXaa: 0.0 ± 0.0
Lys
7.062LysAla: 7.062 ± 0.989
0.321LysCys: 0.321 ± 0.208
4.093LysAsp: 4.093 ± 0.608
3.531LysGlu: 3.531 ± 0.563
2.167LysPhe: 2.167 ± 0.399
3.451LysGly: 3.451 ± 0.549
1.766LysHis: 1.766 ± 0.396
3.451LysIle: 3.451 ± 0.559
5.216LysLys: 5.216 ± 0.79
4.815LysLeu: 4.815 ± 0.728
2.408LysMet: 2.408 ± 0.392
2.969LysAsn: 2.969 ± 0.582
2.729LysPro: 2.729 ± 0.407
4.895LysGln: 4.895 ± 0.679
4.013LysArg: 4.013 ± 0.65
5.698LysSer: 5.698 ± 1.03
5.136LysThr: 5.136 ± 0.632
5.216LysVal: 5.216 ± 0.673
0.722LysTrp: 0.722 ± 0.249
3.29LysTyr: 3.29 ± 0.545
0.0LysXaa: 0.0 ± 0.0
Leu
7.624LeuAla: 7.624 ± 0.704
0.642LeuCys: 0.642 ± 0.233
4.414LeuAsp: 4.414 ± 0.633
4.253LeuGlu: 4.253 ± 0.691
1.846LeuPhe: 1.846 ± 0.396
5.297LeuGly: 5.297 ± 0.798
1.445LeuHis: 1.445 ± 0.266
4.253LeuIle: 4.253 ± 0.634
5.698LeuLys: 5.698 ± 0.672
5.618LeuLeu: 5.618 ± 0.982
2.408LeuMet: 2.408 ± 0.416
4.414LeuAsn: 4.414 ± 0.761
2.969LeuPro: 2.969 ± 0.52
2.568LeuGln: 2.568 ± 0.412
2.408LeuArg: 2.408 ± 0.447
6.179LeuSer: 6.179 ± 0.762
4.414LeuThr: 4.414 ± 0.675
5.136LeuVal: 5.136 ± 0.569
0.642LeuTrp: 0.642 ± 0.277
2.327LeuTyr: 2.327 ± 0.396
0.0LeuXaa: 0.0 ± 0.0
Met
2.006MetAla: 2.006 ± 0.534
0.241MetCys: 0.241 ± 0.146
1.284MetAsp: 1.284 ± 0.346
1.043MetGlu: 1.043 ± 0.365
1.124MetPhe: 1.124 ± 0.258
1.204MetGly: 1.204 ± 0.297
0.161MetHis: 0.161 ± 0.103
1.445MetIle: 1.445 ± 0.289
2.006MetLys: 2.006 ± 0.458
1.846MetLeu: 1.846 ± 0.417
1.124MetMet: 1.124 ± 0.362
2.568MetAsn: 2.568 ± 0.456
1.043MetPro: 1.043 ± 0.297
1.204MetGln: 1.204 ± 0.23
1.204MetArg: 1.204 ± 0.384
2.167MetSer: 2.167 ± 0.385
2.327MetThr: 2.327 ± 0.425
1.124MetVal: 1.124 ± 0.239
0.482MetTrp: 0.482 ± 0.211
1.685MetTyr: 1.685 ± 0.362
0.0MetXaa: 0.0 ± 0.0
Asn
4.093AsnAla: 4.093 ± 0.603
0.08AsnCys: 0.08 ± 0.083
3.932AsnAsp: 3.932 ± 0.501
3.21AsnGlu: 3.21 ± 0.52
1.445AsnPhe: 1.445 ± 0.403
5.457AsnGly: 5.457 ± 0.931
1.124AsnHis: 1.124 ± 0.315
3.451AsnIle: 3.451 ± 0.616
2.568AsnLys: 2.568 ± 0.482
3.932AsnLeu: 3.932 ± 0.599
1.364AsnMet: 1.364 ± 0.313
2.889AsnAsn: 2.889 ± 0.587
2.006AsnPro: 2.006 ± 0.439
2.889AsnGln: 2.889 ± 0.573
2.648AsnArg: 2.648 ± 0.484
2.809AsnSer: 2.809 ± 0.558
3.05AsnThr: 3.05 ± 0.592
3.531AsnVal: 3.531 ± 0.494
1.043AsnTrp: 1.043 ± 0.274
1.364AsnTyr: 1.364 ± 0.308
0.0AsnXaa: 0.0 ± 0.0
Pro
2.969ProAla: 2.969 ± 0.488
0.0ProCys: 0.0 ± 0.0
2.247ProAsp: 2.247 ± 0.412
2.568ProGlu: 2.568 ± 0.479
1.284ProPhe: 1.284 ± 0.324
1.605ProGly: 1.605 ± 0.37
0.642ProHis: 0.642 ± 0.256
2.488ProIle: 2.488 ± 0.49
2.327ProLys: 2.327 ± 0.468
2.247ProLeu: 2.247 ± 0.368
0.241ProMet: 0.241 ± 0.145
1.766ProAsn: 1.766 ± 0.402
0.963ProPro: 0.963 ± 0.373
1.445ProGln: 1.445 ± 0.427
0.883ProArg: 0.883 ± 0.224
3.05ProSer: 3.05 ± 0.493
2.809ProThr: 2.809 ± 0.561
2.006ProVal: 2.006 ± 0.469
0.722ProTrp: 0.722 ± 0.28
1.605ProTyr: 1.605 ± 0.33
0.0ProXaa: 0.0 ± 0.0
Gln
5.216GlnAla: 5.216 ± 0.737
0.161GlnCys: 0.161 ± 0.113
2.247GlnAsp: 2.247 ± 0.498
1.766GlnGlu: 1.766 ± 0.419
1.284GlnPhe: 1.284 ± 0.3
2.809GlnGly: 2.809 ± 0.774
0.562GlnHis: 0.562 ± 0.189
2.648GlnIle: 2.648 ± 0.48
2.889GlnLys: 2.889 ± 0.382
4.013GlnLeu: 4.013 ± 0.647
1.445GlnMet: 1.445 ± 0.356
1.605GlnAsn: 1.605 ± 0.312
1.766GlnPro: 1.766 ± 0.347
3.772GlnGln: 3.772 ± 0.683
1.685GlnArg: 1.685 ± 0.419
3.692GlnSer: 3.692 ± 0.528
2.969GlnThr: 2.969 ± 0.509
2.889GlnVal: 2.889 ± 0.451
0.963GlnTrp: 0.963 ± 0.302
2.488GlnTyr: 2.488 ± 0.559
0.0GlnXaa: 0.0 ± 0.0
Arg
2.488ArgAla: 2.488 ± 0.442
0.241ArgCys: 0.241 ± 0.149
2.488ArgAsp: 2.488 ± 0.576
2.648ArgGlu: 2.648 ± 0.435
1.445ArgPhe: 1.445 ± 0.283
2.247ArgGly: 2.247 ± 0.417
0.883ArgHis: 0.883 ± 0.269
1.766ArgIle: 1.766 ± 0.437
2.408ArgLys: 2.408 ± 0.518
3.611ArgLeu: 3.611 ± 0.631
0.722ArgMet: 0.722 ± 0.274
2.327ArgAsn: 2.327 ± 0.514
1.284ArgPro: 1.284 ± 0.338
1.766ArgGln: 1.766 ± 0.418
1.204ArgArg: 1.204 ± 0.342
2.729ArgSer: 2.729 ± 0.483
2.327ArgThr: 2.327 ± 0.425
3.05ArgVal: 3.05 ± 0.61
0.562ArgTrp: 0.562 ± 0.192
1.926ArgTyr: 1.926 ± 0.505
0.0ArgXaa: 0.0 ± 0.0
Ser
5.778SerAla: 5.778 ± 0.978
0.401SerCys: 0.401 ± 0.211
6.019SerAsp: 6.019 ± 0.788
3.29SerGlu: 3.29 ± 0.543
2.648SerPhe: 2.648 ± 0.469
6.42SerGly: 6.42 ± 1.126
1.685SerHis: 1.685 ± 0.391
4.574SerIle: 4.574 ± 0.528
5.698SerLys: 5.698 ± 0.848
4.976SerLeu: 4.976 ± 0.588
2.006SerMet: 2.006 ± 0.395
4.494SerAsn: 4.494 ± 0.609
2.247SerPro: 2.247 ± 0.47
3.13SerGln: 3.13 ± 0.585
2.568SerArg: 2.568 ± 0.389
5.136SerSer: 5.136 ± 0.932
4.173SerThr: 4.173 ± 0.653
4.895SerVal: 4.895 ± 0.561
1.043SerTrp: 1.043 ± 0.327
2.167SerTyr: 2.167 ± 0.378
0.0SerXaa: 0.0 ± 0.0
Thr
5.939ThrAla: 5.939 ± 0.633
0.161ThrCys: 0.161 ± 0.113
4.815ThrAsp: 4.815 ± 0.626
2.006ThrGlu: 2.006 ± 0.337
2.167ThrPhe: 2.167 ± 0.337
5.377ThrGly: 5.377 ± 0.688
1.525ThrHis: 1.525 ± 0.388
4.976ThrIle: 4.976 ± 0.884
4.253ThrLys: 4.253 ± 0.504
4.655ThrLeu: 4.655 ± 0.645
1.846ThrMet: 1.846 ± 0.406
3.531ThrAsn: 3.531 ± 0.565
2.568ThrPro: 2.568 ± 0.639
2.087ThrGln: 2.087 ± 0.328
2.488ThrArg: 2.488 ± 0.325
3.852ThrSer: 3.852 ± 0.543
4.334ThrThr: 4.334 ± 0.684
4.173ThrVal: 4.173 ± 0.59
0.963ThrTrp: 0.963 ± 0.287
2.809ThrTyr: 2.809 ± 0.732
0.0ThrXaa: 0.0 ± 0.0
Val
5.457ValAla: 5.457 ± 0.63
0.321ValCys: 0.321 ± 0.156
4.895ValAsp: 4.895 ± 0.678
3.772ValGlu: 3.772 ± 0.498
2.006ValPhe: 2.006 ± 0.307
4.013ValGly: 4.013 ± 0.673
0.722ValHis: 0.722 ± 0.247
4.253ValIle: 4.253 ± 0.553
4.494ValLys: 4.494 ± 0.567
5.537ValLeu: 5.537 ± 0.629
1.926ValMet: 1.926 ± 0.447
2.327ValAsn: 2.327 ± 0.511
1.926ValPro: 1.926 ± 0.399
2.167ValGln: 2.167 ± 0.526
2.087ValArg: 2.087 ± 0.45
5.297ValSer: 5.297 ± 0.801
4.253ValThr: 4.253 ± 0.596
4.895ValVal: 4.895 ± 0.7
0.722ValTrp: 0.722 ± 0.256
2.648ValTyr: 2.648 ± 0.479
0.0ValXaa: 0.0 ± 0.0
Trp
1.525TrpAla: 1.525 ± 0.327
0.161TrpCys: 0.161 ± 0.113
1.445TrpAsp: 1.445 ± 0.438
1.043TrpGlu: 1.043 ± 0.301
0.482TrpPhe: 0.482 ± 0.181
0.401TrpGly: 0.401 ± 0.198
0.241TrpHis: 0.241 ± 0.129
1.364TrpIle: 1.364 ± 0.4
1.525TrpLys: 1.525 ± 0.615
1.204TrpLeu: 1.204 ± 0.261
0.321TrpMet: 0.321 ± 0.202
0.642TrpAsn: 0.642 ± 0.183
0.241TrpPro: 0.241 ± 0.153
1.284TrpGln: 1.284 ± 0.281
0.722TrpArg: 0.722 ± 0.197
0.883TrpSer: 0.883 ± 0.322
1.043TrpThr: 1.043 ± 0.29
0.722TrpVal: 0.722 ± 0.17
0.401TrpTrp: 0.401 ± 0.2
0.482TrpTyr: 0.482 ± 0.175
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.969TyrAla: 2.969 ± 0.584
0.482TyrCys: 0.482 ± 0.219
2.167TyrAsp: 2.167 ± 0.403
2.488TyrGlu: 2.488 ± 0.494
1.605TyrPhe: 1.605 ± 0.492
2.809TyrGly: 2.809 ± 0.578
0.803TyrHis: 0.803 ± 0.291
2.006TyrIle: 2.006 ± 0.478
3.13TyrLys: 3.13 ± 0.511
3.05TyrLeu: 3.05 ± 0.439
0.642TyrMet: 0.642 ± 0.184
1.846TyrAsn: 1.846 ± 0.397
1.284TyrPro: 1.284 ± 0.272
2.809TyrGln: 2.809 ± 0.38
1.685TyrArg: 1.685 ± 0.447
3.13TyrSer: 3.13 ± 0.763
3.371TyrThr: 3.371 ± 0.647
1.846TyrVal: 1.846 ± 0.287
0.803TyrTrp: 0.803 ± 0.281
1.846TyrTyr: 1.846 ± 0.468
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 56 proteins (12462 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski