Amino acid dipeptide frequency for Lactobacillus phage JNU_P2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if amino acids were paired at random with equal probability, each of the 400 dipeptides would occur with a frequency of 0.25%. Since this is not the case in nature, dipeptides that are more frequent than expected are marked in red in the table, and underrepresented ones in blue, for better visibility.
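As a rough illustration of the uniform expectation, the sketch below compares a few values copied from the table with 1000/400 = 2.5 ‰ (that is, 0.25%). The threshold and the helper name `classify` are assumptions made for illustration only; the page does not state the exact rule used for the colouring.

```python
# Uniform expectation for the 400 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000 / 400  # 2.5 per mille = 0.25 %

# A few values copied from the table (per mille).
observed = {"AlaAla": 7.406, "LysLys": 7.821, "CysCys": 0.0, "TrpTrp": 0.277}


def classify(value, expected=EXPECTED_PER_MILLE):
    """Hypothetical helper: label a value relative to the uniform expectation."""
    if value > expected:
        return "over"    # would correspond to red in the table
    if value < expected:
        return "under"   # would correspond to blue in the table
    return "expected"


for name, value in observed.items():
    print(f"{name}: {value} -> {classify(value)}")
```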

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
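A minimal sketch of how such per mille values could be computed from a set of protein sequences is shown here. It is not the database's own code: the function name `dipeptide_per_mille`, the use of one-letter codes (with `X` standing in for Xaa), and the normalisation by the total number of adjacent pairs are all assumptions, and the exact denominator used for the table is not stated on the page.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Pairs are counted within each protein only, never across protein
    boundaries. Non-standard or unknown residues are mapped to 'X'.
    """
    counts = Counter()
    for seq in proteins:
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


# Toy input, not real phage proteins.
freqs = dipeptide_per_mille(["MKKLA", "AKUA"])
fraction_kk = freqs["KK"] * 1e-3  # per mille -> fraction
```

Multiplying by `1e-3` at the end is the conversion described above.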

For more information, see the sequence space article on Wikipedia.

Dipeptide: frequency ± error (‰), grouped by first residue; second residue in the order Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.406 ± 1.151
AlaCys: 0.069 ± 0.084
AlaAsp: 5.676 ± 0.887
AlaGlu: 5.122 ± 0.699
AlaPhe: 2.284 ± 0.417
AlaGly: 5.745 ± 1.026
AlaHis: 1.246 ± 0.317
AlaIle: 5.468 ± 0.525
AlaLys: 8.306 ± 1.25
AlaLeu: 7.544 ± 0.796
AlaMet: 2.63 ± 0.481
AlaAsn: 4.153 ± 0.541
AlaPro: 1.453 ± 0.384
AlaGln: 3.461 ± 0.444
AlaArg: 2.769 ± 0.451
AlaSer: 4.845 ± 0.796
AlaThr: 5.676 ± 0.682
AlaVal: 5.676 ± 0.564
AlaTrp: 0.9 ± 0.261
AlaTyr: 2.907 ± 0.358
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.069 ± 0.065
CysCys: 0.0 ± 0.0
CysAsp: 0.208 ± 0.122
CysGlu: 0.208 ± 0.134
CysPhe: 0.277 ± 0.141
CysGly: 0.138 ± 0.11
CysHis: 0.069 ± 0.064
CysIle: 0.138 ± 0.083
CysLys: 0.0 ± 0.0
CysLeu: 0.623 ± 0.191
CysMet: 0.138 ± 0.103
CysAsn: 0.277 ± 0.15
CysPro: 0.138 ± 0.121
CysGln: 0.208 ± 0.117
CysArg: 0.208 ± 0.12
CysSer: 0.277 ± 0.128
CysThr: 0.069 ± 0.069
CysVal: 0.208 ± 0.134
CysTrp: 0.069 ± 0.061
CysTyr: 0.208 ± 0.123
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.36 ± 0.499
AspCys: 0.277 ± 0.161
AspAsp: 6.022 ± 1.047
AspGlu: 4.014 ± 0.778
AspPhe: 2.353 ± 0.351
AspGly: 6.16 ± 0.664
AspHis: 1.384 ± 0.309
AspIle: 4.222 ± 0.488
AspLys: 4.707 ± 0.555
AspLeu: 4.43 ± 0.718
AspMet: 2.492 ± 0.391
AspAsn: 2.769 ± 0.542
AspPro: 2.007 ± 0.428
AspGln: 2.492 ± 0.365
AspArg: 2.769 ± 0.427
AspSer: 4.914 ± 0.485
AspThr: 4.43 ± 0.539
AspVal: 4.084 ± 0.558
AspTrp: 1.177 ± 0.239
AspTyr: 2.63 ± 0.459
AspXaa: 0.069 ± 0.061
Glu
GluAla: 4.845 ± 0.495
GluCys: 0.208 ± 0.108
GluAsp: 2.838 ± 0.518
GluGlu: 2.699 ± 0.53
GluPhe: 2.076 ± 0.425
GluGly: 2.146 ± 0.298
GluHis: 1.177 ± 0.357
GluIle: 2.353 ± 0.46
GluLys: 4.499 ± 0.677
GluLeu: 5.191 ± 0.622
GluMet: 1.246 ± 0.347
GluAsn: 3.322 ± 0.711
GluPro: 1.938 ± 0.479
GluGln: 3.253 ± 0.467
GluArg: 2.422 ± 0.371
GluSer: 3.461 ± 0.53
GluThr: 2.699 ± 0.408
GluVal: 3.53 ± 0.427
GluTrp: 0.969 ± 0.276
GluTyr: 2.146 ± 0.381
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.699 ± 0.403
PheCys: 0.069 ± 0.071
PheAsp: 3.184 ± 0.511
PheGlu: 2.215 ± 0.467
PhePhe: 1.177 ± 0.293
PheGly: 2.838 ± 0.537
PheHis: 0.623 ± 0.229
PheIle: 1.384 ± 0.262
PheLys: 4.153 ± 0.479
PheLeu: 2.492 ± 0.392
PheMet: 0.692 ± 0.2
PheAsn: 1.869 ± 0.326
PhePro: 0.969 ± 0.306
PheGln: 1.177 ± 0.247
PheArg: 1.246 ± 0.296
PheSer: 2.838 ± 0.49
PheThr: 2.492 ± 0.369
PheVal: 1.8 ± 0.424
PheTrp: 0.346 ± 0.15
PheTyr: 0.969 ± 0.261
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.983 ± 0.833
GlyCys: 0.484 ± 0.17
GlyAsp: 4.153 ± 0.615
GlyGlu: 3.53 ± 0.475
GlyPhe: 2.63 ± 0.42
GlyGly: 5.606 ± 1.196
GlyHis: 2.146 ± 0.294
GlyIle: 3.668 ± 0.45
GlyLys: 5.537 ± 0.845
GlyLeu: 5.883 ± 0.777
GlyMet: 1.384 ± 0.341
GlyAsn: 2.976 ± 0.454
GlyPro: 1.177 ± 0.351
GlyGln: 1.938 ± 0.458
GlyArg: 2.769 ± 0.519
GlySer: 5.329 ± 1.05
GlyThr: 6.506 ± 0.875
GlyVal: 4.845 ± 0.514
GlyTrp: 1.315 ± 0.312
GlyTyr: 3.322 ± 0.578
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.315 ± 0.271
HisCys: 0.138 ± 0.109
HisAsp: 1.8 ± 0.37
HisGlu: 1.384 ± 0.333
HisPhe: 0.969 ± 0.285
HisGly: 1.661 ± 0.353
HisHis: 0.277 ± 0.165
HisIle: 1.246 ± 0.334
HisLys: 0.9 ± 0.236
HisLeu: 1.038 ± 0.237
HisMet: 0.554 ± 0.201
HisAsn: 0.831 ± 0.223
HisPro: 0.554 ± 0.179
HisGln: 0.9 ± 0.263
HisArg: 1.177 ± 0.371
HisSer: 1.592 ± 0.325
HisThr: 1.8 ± 0.452
HisVal: 2.007 ± 0.379
HisTrp: 0.346 ± 0.174
HisTyr: 0.9 ± 0.229
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.568 ± 0.53
IleCys: 0.415 ± 0.176
IleAsp: 4.36 ± 0.579
IleGlu: 3.045 ± 0.46
IlePhe: 1.73 ± 0.461
IleGly: 3.668 ± 0.668
IleHis: 1.038 ± 0.264
IleIle: 2.284 ± 0.429
IleLys: 4.776 ± 0.487
IleLeu: 3.253 ± 0.525
IleMet: 1.107 ± 0.225
IleAsn: 3.045 ± 0.457
IlePro: 2.492 ± 0.409
IleGln: 1.661 ± 0.32
IleArg: 2.561 ± 0.348
IleSer: 4.707 ± 0.603
IleThr: 3.391 ± 0.502
IleVal: 4.222 ± 0.675
IleTrp: 0.415 ± 0.152
IleTyr: 2.769 ± 0.412
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.852 ± 0.719
LysCys: 0.138 ± 0.089
LysAsp: 3.53 ± 0.455
LysGlu: 3.876 ± 0.603
LysPhe: 2.492 ± 0.405
LysGly: 4.222 ± 0.615
LysHis: 1.453 ± 0.304
LysIle: 4.36 ± 0.603
LysLys: 7.821 ± 1.488
LysLeu: 6.921 ± 0.72
LysMet: 2.63 ± 0.393
LysAsn: 4.153 ± 0.552
LysPro: 3.184 ± 0.569
LysGln: 3.738 ± 0.673
LysArg: 4.568 ± 0.578
LysSer: 6.714 ± 1.102
LysThr: 4.776 ± 0.633
LysVal: 5.606 ± 0.616
LysTrp: 1.177 ± 0.271
LysTyr: 2.907 ± 0.581
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.575 ± 0.643
LeuCys: 0.138 ± 0.114
LeuAsp: 5.676 ± 0.65
LeuGlu: 4.014 ± 0.542
LeuPhe: 2.907 ± 0.442
LeuGly: 4.36 ± 0.581
LeuHis: 0.692 ± 0.247
LeuIle: 4.291 ± 0.573
LeuLys: 5.122 ± 0.61
LeuLeu: 6.229 ± 0.651
LeuMet: 1.8 ± 0.306
LeuAsn: 4.914 ± 0.601
LeuPro: 3.115 ± 0.476
LeuGln: 3.668 ± 0.472
LeuArg: 3.461 ± 0.509
LeuSer: 5.329 ± 0.429
LeuThr: 5.745 ± 0.675
LeuVal: 4.499 ± 0.581
LeuTrp: 0.484 ± 0.159
LeuTyr: 2.422 ± 0.415
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.738 ± 0.53
MetCys: 0.138 ± 0.085
MetAsp: 1.384 ± 0.291
MetGlu: 1.661 ± 0.308
MetPhe: 0.554 ± 0.157
MetGly: 1.523 ± 0.398
MetHis: 0.623 ± 0.158
MetIle: 1.938 ± 0.331
MetLys: 1.8 ± 0.364
MetLeu: 1.246 ± 0.261
MetMet: 0.761 ± 0.197
MetAsn: 1.246 ± 0.393
MetPro: 0.969 ± 0.302
MetGln: 0.831 ± 0.289
MetArg: 1.315 ± 0.297
MetSer: 1.869 ± 0.343
MetThr: 2.561 ± 0.386
MetVal: 1.73 ± 0.315
MetTrp: 0.554 ± 0.2
MetTyr: 1.038 ± 0.236
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.014 ± 0.403
AsnCys: 0.138 ± 0.102
AsnAsp: 4.222 ± 0.56
AsnGlu: 2.215 ± 0.357
AsnPhe: 2.492 ± 0.428
AsnGly: 5.676 ± 0.738
AsnHis: 1.177 ± 0.3
AsnIle: 2.769 ± 0.416
AsnLys: 3.461 ± 0.536
AsnLeu: 3.391 ± 0.548
AsnMet: 1.592 ± 0.33
AsnAsn: 1.869 ± 0.366
AsnPro: 2.215 ± 0.344
AsnGln: 2.492 ± 0.493
AsnArg: 2.146 ± 0.425
AsnSer: 3.738 ± 0.521
AsnThr: 2.907 ± 0.461
AsnVal: 2.838 ± 0.455
AsnTrp: 0.623 ± 0.19
AsnTyr: 2.007 ± 0.364
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.045 ± 0.533
ProCys: 0.0 ± 0.0
ProAsp: 2.838 ± 0.571
ProGlu: 2.146 ± 0.52
ProPhe: 1.107 ± 0.266
ProGly: 1.8 ± 0.414
ProHis: 1.592 ± 0.364
ProIle: 1.73 ± 0.322
ProLys: 3.115 ± 0.444
ProLeu: 2.284 ± 0.41
ProMet: 0.623 ± 0.209
ProAsn: 1.384 ± 0.268
ProPro: 1.177 ± 0.387
ProGln: 1.107 ± 0.307
ProArg: 0.969 ± 0.299
ProSer: 2.422 ± 0.393
ProThr: 2.699 ± 0.561
ProVal: 2.146 ± 0.373
ProTrp: 0.484 ± 0.172
ProTyr: 1.661 ± 0.326
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.807 ± 0.546
GlnCys: 0.208 ± 0.118
GlnAsp: 2.146 ± 0.356
GlnGlu: 2.284 ± 0.377
GlnPhe: 1.661 ± 0.467
GlnGly: 2.353 ± 0.495
GlnHis: 0.623 ± 0.213
GlnIle: 2.976 ± 0.383
GlnLys: 2.699 ± 0.56
GlnLeu: 3.115 ± 0.523
GlnMet: 1.038 ± 0.292
GlnAsn: 2.007 ± 0.314
GlnPro: 1.8 ± 0.358
GlnGln: 2.284 ± 0.526
GlnArg: 2.146 ± 0.43
GlnSer: 2.492 ± 0.419
GlnThr: 1.592 ± 0.332
GlnVal: 3.322 ± 0.433
GlnTrp: 0.277 ± 0.112
GlnTyr: 1.315 ± 0.317
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.738 ± 0.645
ArgCys: 0.208 ± 0.114
ArgAsp: 2.769 ± 0.409
ArgGlu: 2.561 ± 0.422
ArgPhe: 1.384 ± 0.41
ArgGly: 2.769 ± 0.557
ArgHis: 1.038 ± 0.224
ArgIle: 2.422 ± 0.404
ArgLys: 3.461 ± 0.549
ArgLeu: 3.738 ± 0.457
ArgMet: 1.523 ± 0.336
ArgAsn: 2.353 ± 0.341
ArgPro: 1.384 ± 0.269
ArgGln: 1.592 ± 0.278
ArgArg: 2.076 ± 0.334
ArgSer: 3.184 ± 0.338
ArgThr: 2.492 ± 0.381
ArgVal: 2.699 ± 0.444
ArgTrp: 0.415 ± 0.168
ArgTyr: 1.73 ± 0.334
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.814 ± 0.553
SerCys: 0.138 ± 0.122
SerAsp: 4.845 ± 0.548
SerGlu: 3.184 ± 0.436
SerPhe: 2.076 ± 0.391
SerGly: 6.506 ± 0.705
SerHis: 1.315 ± 0.242
SerIle: 3.738 ± 0.376
SerLys: 7.198 ± 0.98
SerLeu: 4.637 ± 0.732
SerMet: 3.045 ± 0.402
SerAsn: 4.776 ± 0.496
SerPro: 2.007 ± 0.344
SerGln: 2.076 ± 0.429
SerArg: 3.253 ± 0.434
SerSer: 4.776 ± 0.555
SerThr: 4.084 ± 0.519
SerVal: 4.707 ± 0.637
SerTrp: 1.177 ± 0.236
SerTyr: 2.63 ± 0.494
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.814 ± 0.699
ThrCys: 0.0 ± 0.0
ThrAsp: 5.053 ± 0.644
ThrGlu: 3.391 ± 0.563
ThrPhe: 2.838 ± 0.462
ThrGly: 4.568 ± 0.532
ThrHis: 1.73 ± 0.406
ThrIle: 3.945 ± 0.585
ThrLys: 5.26 ± 0.831
ThrLeu: 4.084 ± 0.52
ThrMet: 1.315 ± 0.287
ThrAsn: 3.391 ± 0.419
ThrPro: 3.668 ± 0.556
ThrGln: 1.938 ± 0.382
ThrArg: 3.115 ± 0.352
ThrSer: 4.43 ± 0.469
ThrThr: 5.329 ± 0.751
ThrVal: 4.291 ± 0.533
ThrTrp: 0.484 ± 0.187
ThrTyr: 2.215 ± 0.365
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.468 ± 0.59
ValCys: 0.346 ± 0.156
ValAsp: 3.876 ± 0.513
ValGlu: 3.184 ± 0.458
ValPhe: 2.007 ± 0.306
ValGly: 4.845 ± 0.554
ValHis: 2.007 ± 0.314
ValIle: 3.876 ± 0.491
ValLys: 5.191 ± 0.6
ValLeu: 4.153 ± 0.515
ValMet: 1.661 ± 0.331
ValAsn: 3.668 ± 0.541
ValPro: 2.492 ± 0.439
ValGln: 2.353 ± 0.46
ValArg: 2.769 ± 0.537
ValSer: 5.745 ± 0.634
ValThr: 4.499 ± 0.537
ValVal: 4.43 ± 0.613
ValTrp: 1.038 ± 0.244
ValTyr: 1.869 ± 0.368
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.177 ± 0.27
TrpCys: 0.277 ± 0.143
TrpAsp: 0.484 ± 0.15
TrpGlu: 0.692 ± 0.211
TrpPhe: 0.554 ± 0.234
TrpGly: 0.623 ± 0.186
TrpHis: 0.415 ± 0.162
TrpIle: 0.831 ± 0.222
TrpLys: 0.761 ± 0.178
TrpLeu: 1.592 ± 0.328
TrpMet: 0.138 ± 0.102
TrpAsn: 0.831 ± 0.217
TrpPro: 0.208 ± 0.124
TrpGln: 0.9 ± 0.273
TrpArg: 0.208 ± 0.099
TrpSer: 0.9 ± 0.217
TrpThr: 0.831 ± 0.285
TrpVal: 0.623 ± 0.239
TrpTrp: 0.277 ± 0.134
TrpTyr: 0.761 ± 0.221
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.045 ± 0.4
TyrCys: 0.138 ± 0.089
TyrAsp: 2.699 ± 0.52
TyrGlu: 1.592 ± 0.361
TyrPhe: 1.661 ± 0.369
TyrGly: 2.976 ± 0.511
TyrHis: 0.761 ± 0.271
TyrIle: 1.869 ± 0.289
TyrLys: 2.353 ± 0.416
TyrLeu: 3.599 ± 0.532
TyrMet: 1.038 ± 0.266
TyrAsn: 2.146 ± 0.375
TyrPro: 1.384 ± 0.372
TyrGln: 2.076 ± 0.411
TyrArg: 1.592 ± 0.438
TyrSer: 2.422 ± 0.339
TyrThr: 2.284 ± 0.325
TyrVal: 2.215 ± 0.409
TyrTrp: 0.554 ± 0.241
TyrTyr: 1.938 ± 0.543
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.069 ± 0.061
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.208 ± 0.182
Statistics based on 71 proteins (14449 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
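Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the recomputed values serves as the error. This is only an illustration under stated assumptions: the function names, the toy sequences, the fixed seed, and the use of the standard deviation as the error measure are not taken from the source, which states only that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all adjacent pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for a, b in zip(seq, seq[1:]):
            total += 1
            hits += (a + b == pair)
    return 1000 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples.

    Assumption: the error is taken as the standard deviation of the
    resampled estimates; the source does not say which statistic is used.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


# Toy one-letter sequences, not real phage proteins.
toy = ["AKAK", "GGGG", "AKGG"]
err = bootstrap_error(toy, "AK")
print(f"AK: {pair_per_mille(toy, 'AK'):.3f} ± {err:.3f} per mille")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.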

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski