Amino acid dipeptide frequency for Lactobacillus phage P1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random, with every amino acid equally frequent, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility, dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
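The counting described above can be sketched in Python. This is a minimal illustration, not the database's own code: the function names, the one-letter input format, the mapping of every non-standard letter to X (Xaa), and the normalisation by the total number of overlapping pairs are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids
ALPHABET = STANDARD + "X"          # X stands for Xaa (assumed catch-all symbol)
UNIFORM_PERMILLE = 1000.0 / 400    # 2.5 per mille, i.e. 0.25%


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Overlapping pairs are counted within each protein only, never
    across protein boundaries. Normalising by the total number of
    pairs is an assumption of this sketch.
    """
    counts = Counter()
    for seq in proteins:
        # Replace any non-standard letter by X before counting.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def colour(freq, expected=UNIFORM_PERMILLE):
    """Mimic the table's marking: red above expectation, blue below."""
    if freq > expected:
        return "red"
    if freq < expected:
        return "blue"
    return "none"
```

For example, `dipeptide_permille(["ACAC", "AAB"])` counts the pairs AC, CA, AC, AA and AX (B is treated as Xaa), so AC is reported at 400‰ and would be coloured red against the 2.5‰ uniform expectation.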

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
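Because the expected value is quoted in percent while the table uses per mille, a small conversion helper may be useful. The sketch below uses the AlaLeu value from the table (6.704‰); the function names are hypothetical and chosen only for this example.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_leu = 6.704            # AlaLeu frequency from the table, in per mille
uniform = 1000.0 / 400     # uniform expectation: 2.5 per mille = 0.25%

fraction = permille_to_fraction(ala_leu)   # about 0.006704
percent = permille_to_percent(ala_leu)     # about 0.6704 %
ratio = ala_leu / uniform                  # about 2.68 times the uniform expectation
```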

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.774 ± 0.915
AlaCys: 0.298 ± 0.116
AlaAsp: 3.973 ± 0.399
AlaGlu: 3.824 ± 0.4
AlaPhe: 2.135 ± 0.312
AlaGly: 4.966 ± 0.829
AlaHis: 0.745 ± 0.138
AlaIle: 4.519 ± 0.707
AlaLys: 5.313 ± 1.058
AlaLeu: 6.704 ± 0.711
AlaMet: 1.639 ± 0.304
AlaAsn: 3.873 ± 0.611
AlaPro: 1.688 ± 0.318
AlaGln: 2.632 ± 0.428
AlaArg: 2.731 ± 0.396
AlaSer: 5.264 ± 1.18
AlaThr: 4.171 ± 0.625
AlaVal: 5.512 ± 0.698
AlaTrp: 1.241 ± 0.27
AlaTyr: 3.277 ± 0.35
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.348 ± 0.13
CysCys: 0.0 ± 0.0
CysAsp: 0.546 ± 0.173
CysGlu: 0.348 ± 0.125
CysPhe: 0.745 ± 0.241
CysGly: 0.348 ± 0.132
CysHis: 0.149 ± 0.086
CysIle: 0.348 ± 0.132
CysLys: 0.348 ± 0.12
CysLeu: 0.646 ± 0.188
CysMet: 0.05 ± 0.048
CysAsn: 0.447 ± 0.157
CysPro: 0.149 ± 0.09
CysGln: 0.248 ± 0.111
CysArg: 0.397 ± 0.149
CysSer: 0.298 ± 0.166
CysThr: 0.397 ± 0.142
CysVal: 0.447 ± 0.142
CysTrp: 0.199 ± 0.1
CysTyr: 0.447 ± 0.151
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.029 ± 0.322
AspCys: 0.795 ± 0.226
AspAsp: 5.562 ± 0.819
AspGlu: 5.363 ± 0.613
AspPhe: 2.681 ± 0.364
AspGly: 6.406 ± 0.721
AspHis: 1.043 ± 0.201
AspIle: 4.072 ± 0.411
AspLys: 6.108 ± 0.568
AspLeu: 4.817 ± 0.691
AspMet: 1.738 ± 0.317
AspAsn: 4.618 ± 0.559
AspPro: 1.291 ± 0.325
AspGln: 1.39 ± 0.257
AspArg: 2.632 ± 0.441
AspSer: 5.611 ± 0.686
AspThr: 3.873 ± 0.526
AspVal: 3.774 ± 0.553
AspTrp: 0.894 ± 0.176
AspTyr: 4.122 ± 0.544
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.817 ± 0.469
GluCys: 0.397 ± 0.191
GluAsp: 4.866 ± 0.651
GluGlu: 2.83 ± 0.374
GluPhe: 2.235 ± 0.347
GluGly: 2.483 ± 0.383
GluHis: 1.341 ± 0.337
GluIle: 4.221 ± 0.414
GluLys: 4.717 ± 0.548
GluLeu: 6.853 ± 0.742
GluMet: 1.937 ± 0.362
GluAsn: 2.83 ± 0.435
GluPro: 1.539 ± 0.27
GluGln: 2.681 ± 0.373
GluArg: 2.483 ± 0.557
GluSer: 3.228 ± 0.334
GluThr: 3.079 ± 0.349
GluVal: 4.171 ± 0.656
GluTrp: 0.894 ± 0.196
GluTyr: 2.384 ± 0.341
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.334 ± 0.33
PheCys: 0.298 ± 0.12
PheAsp: 3.377 ± 0.46
PheGlu: 1.688 ± 0.372
PhePhe: 1.639 ± 0.342
PheGly: 3.426 ± 0.333
PheHis: 0.596 ± 0.189
PheIle: 2.334 ± 0.354
PheLys: 3.079 ± 0.532
PheLeu: 2.88 ± 0.519
PheMet: 0.795 ± 0.191
PheAsn: 2.185 ± 0.343
PhePro: 0.844 ± 0.193
PheGln: 1.092 ± 0.207
PheArg: 1.788 ± 0.259
PheSer: 2.93 ± 0.341
PheThr: 2.582 ± 0.477
PheVal: 2.83 ± 0.393
PheTrp: 0.497 ± 0.164
PheTyr: 1.639 ± 0.277
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.271 ± 0.711
GlyCys: 0.149 ± 0.092
GlyAsp: 3.824 ± 0.384
GlyGlu: 3.625 ± 0.46
GlyPhe: 3.277 ± 0.509
GlyGly: 4.42 ± 1.018
GlyHis: 1.44 ± 0.268
GlyIle: 4.519 ± 0.685
GlyLys: 5.611 ± 1.177
GlyLeu: 6.505 ± 0.863
GlyMet: 1.241 ± 0.331
GlyAsn: 4.271 ± 0.412
GlyPro: 0.894 ± 0.192
GlyGln: 2.781 ± 0.338
GlyArg: 3.426 ± 0.371
GlySer: 4.668 ± 0.658
GlyThr: 5.115 ± 0.781
GlyVal: 4.221 ± 0.404
GlyTrp: 1.092 ± 0.245
GlyTyr: 3.973 ± 0.31
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.142 ± 0.247
HisCys: 0.248 ± 0.103
HisAsp: 1.192 ± 0.27
HisGlu: 1.092 ± 0.313
HisPhe: 0.844 ± 0.204
HisGly: 1.589 ± 0.294
HisHis: 0.795 ± 0.209
HisIle: 1.39 ± 0.288
HisLys: 1.341 ± 0.234
HisLeu: 0.993 ± 0.209
HisMet: 0.397 ± 0.125
HisAsn: 1.241 ± 0.224
HisPro: 0.447 ± 0.137
HisGln: 0.248 ± 0.097
HisArg: 0.894 ± 0.288
HisSer: 0.993 ± 0.196
HisThr: 0.993 ± 0.198
HisVal: 0.993 ± 0.248
HisTrp: 0.248 ± 0.104
HisTyr: 1.092 ± 0.214
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.313 ± 0.552
IleCys: 0.447 ± 0.153
IleAsp: 4.42 ± 0.593
IleGlu: 3.575 ± 0.415
IlePhe: 1.887 ± 0.342
IleGly: 4.469 ± 1.254
IleHis: 0.943 ± 0.198
IleIle: 3.476 ± 0.457
IleLys: 5.81 ± 0.711
IleLeu: 4.072 ± 0.502
IleMet: 1.291 ± 0.314
IleAsn: 5.065 ± 0.509
IlePro: 2.334 ± 0.317
IleGln: 1.837 ± 0.286
IleArg: 1.986 ± 0.285
IleSer: 4.37 ± 0.548
IleThr: 4.42 ± 0.692
IleVal: 4.519 ± 0.566
IleTrp: 0.447 ± 0.133
IleTyr: 2.284 ± 0.332
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.164 ± 0.853
LysCys: 0.497 ± 0.177
LysAsp: 4.866 ± 0.451
LysGlu: 5.363 ± 0.746
LysPhe: 2.781 ± 0.267
LysGly: 4.966 ± 0.83
LysHis: 1.44 ± 0.25
LysIle: 4.966 ± 0.576
LysLys: 6.505 ± 0.707
LysLeu: 6.604 ± 0.617
LysMet: 2.88 ± 0.563
LysAsn: 4.817 ± 0.417
LysPro: 2.135 ± 0.34
LysGln: 3.327 ± 0.385
LysArg: 4.469 ± 0.491
LysSer: 4.42 ± 0.777
LysThr: 5.711 ± 0.71
LysVal: 5.065 ± 0.55
LysTrp: 0.894 ± 0.319
LysTyr: 3.824 ± 0.464
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.214 ± 0.577
LeuCys: 0.546 ± 0.183
LeuAsp: 6.108 ± 0.781
LeuGlu: 5.462 ± 0.559
LeuPhe: 2.979 ± 0.398
LeuGly: 5.661 ± 0.674
LeuHis: 1.291 ± 0.319
LeuIle: 4.866 ± 0.494
LeuLys: 7.002 ± 0.51
LeuLeu: 6.009 ± 0.708
LeuMet: 2.433 ± 0.431
LeuAsn: 3.923 ± 0.479
LeuPro: 3.228 ± 0.418
LeuGln: 2.384 ± 0.269
LeuArg: 3.277 ± 0.408
LeuSer: 6.803 ± 0.572
LeuThr: 5.661 ± 0.489
LeuVal: 4.916 ± 0.588
LeuTrp: 0.943 ± 0.205
LeuTyr: 3.079 ± 0.535
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.036 ± 0.331
MetCys: 0.05 ± 0.049
MetAsp: 1.142 ± 0.209
MetGlu: 1.539 ± 0.278
MetPhe: 0.695 ± 0.202
MetGly: 0.894 ± 0.201
MetHis: 0.248 ± 0.099
MetIle: 1.738 ± 0.277
MetLys: 2.384 ± 0.345
MetLeu: 2.384 ± 0.422
MetMet: 0.447 ± 0.138
MetAsn: 1.192 ± 0.195
MetPro: 0.596 ± 0.173
MetGln: 1.192 ± 0.298
MetArg: 0.795 ± 0.207
MetSer: 1.738 ± 0.265
MetThr: 1.589 ± 0.264
MetVal: 1.49 ± 0.287
MetTrp: 0.099 ± 0.074
MetTyr: 0.795 ± 0.181
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.32 ± 0.522
AsnCys: 0.348 ± 0.148
AsnAsp: 4.122 ± 0.49
AsnGlu: 4.072 ± 0.452
AsnPhe: 2.135 ± 0.321
AsnGly: 5.959 ± 0.701
AsnHis: 1.241 ± 0.29
AsnIle: 3.625 ± 0.456
AsnLys: 5.462 ± 0.579
AsnLeu: 4.37 ± 0.496
AsnMet: 1.44 ± 0.311
AsnAsn: 3.824 ± 0.518
AsnPro: 1.986 ± 0.297
AsnGln: 2.582 ± 0.471
AsnArg: 1.986 ± 0.308
AsnSer: 3.426 ± 0.383
AsnThr: 3.178 ± 0.507
AsnVal: 3.277 ± 0.416
AsnTrp: 0.943 ± 0.163
AsnTyr: 2.731 ± 0.504
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.384 ± 0.367
ProCys: 0.149 ± 0.102
ProAsp: 2.533 ± 0.345
ProGlu: 2.185 ± 0.416
ProPhe: 0.943 ± 0.235
ProGly: 0.894 ± 0.245
ProHis: 0.199 ± 0.084
ProIle: 1.887 ± 0.375
ProLys: 2.086 ± 0.295
ProLeu: 1.738 ± 0.311
ProMet: 0.447 ± 0.125
ProAsn: 1.837 ± 0.209
ProPro: 0.298 ± 0.127
ProGln: 1.142 ± 0.357
ProArg: 0.596 ± 0.182
ProSer: 2.284 ± 0.371
ProThr: 1.44 ± 0.286
ProVal: 2.681 ± 0.346
ProTrp: 0.248 ± 0.095
ProTyr: 1.44 ± 0.223
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.483 ± 0.496
GlnCys: 0.149 ± 0.086
GlnAsp: 2.384 ± 0.338
GlnGlu: 1.589 ± 0.345
GlnPhe: 1.688 ± 0.293
GlnGly: 2.036 ± 0.426
GlnHis: 0.844 ± 0.213
GlnIle: 2.483 ± 0.332
GlnLys: 2.284 ± 0.334
GlnLeu: 2.83 ± 0.37
GlnMet: 0.993 ± 0.188
GlnAsn: 2.185 ± 0.312
GlnPro: 1.639 ± 0.436
GlnGln: 1.49 ± 0.458
GlnArg: 1.639 ± 0.323
GlnSer: 2.632 ± 0.395
GlnThr: 2.83 ± 0.481
GlnVal: 2.185 ± 0.451
GlnTrp: 0.199 ± 0.085
GlnTyr: 1.639 ± 0.298
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.93 ± 0.424
ArgCys: 0.199 ± 0.102
ArgAsp: 2.334 ± 0.397
ArgGlu: 2.88 ± 0.439
ArgPhe: 1.688 ± 0.262
ArgGly: 2.185 ± 0.352
ArgHis: 0.943 ± 0.189
ArgIle: 1.986 ± 0.353
ArgLys: 3.426 ± 0.482
ArgLeu: 3.029 ± 0.288
ArgMet: 0.795 ± 0.234
ArgAsn: 2.185 ± 0.366
ArgPro: 1.192 ± 0.232
ArgGln: 1.291 ± 0.286
ArgArg: 1.341 ± 0.229
ArgSer: 2.384 ± 0.369
ArgThr: 2.632 ± 0.458
ArgVal: 2.93 ± 0.498
ArgTrp: 0.695 ± 0.203
ArgTyr: 1.589 ± 0.277
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.909 ± 1.077
SerCys: 0.199 ± 0.091
SerAsp: 5.164 ± 0.547
SerGlu: 4.022 ± 0.525
SerPhe: 2.433 ± 0.333
SerGly: 5.065 ± 0.723
SerHis: 1.44 ± 0.272
SerIle: 4.37 ± 0.447
SerLys: 4.618 ± 0.839
SerLeu: 6.455 ± 0.613
SerMet: 1.49 ± 0.292
SerAsn: 5.015 ± 0.689
SerPro: 1.788 ± 0.331
SerGln: 2.88 ± 0.534
SerArg: 2.284 ± 0.295
SerSer: 5.81 ± 0.63
SerThr: 3.625 ± 0.505
SerVal: 4.271 ± 0.425
SerTrp: 0.844 ± 0.18
SerTyr: 2.83 ± 0.469
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.115 ± 0.956
ThrCys: 0.596 ± 0.178
ThrAsp: 4.271 ± 0.401
ThrGlu: 3.575 ± 0.418
ThrPhe: 2.681 ± 0.491
ThrGly: 5.164 ± 0.83
ThrHis: 0.894 ± 0.254
ThrIle: 4.271 ± 0.426
ThrLys: 5.015 ± 0.425
ThrLeu: 4.717 ± 0.518
ThrMet: 0.844 ± 0.195
ThrAsn: 3.526 ± 0.517
ThrPro: 1.986 ± 0.33
ThrGln: 2.433 ± 0.481
ThrArg: 1.639 ± 0.329
ThrSer: 4.271 ± 0.74
ThrThr: 4.122 ± 0.64
ThrVal: 5.015 ± 0.639
ThrTrp: 0.298 ± 0.117
ThrTyr: 3.128 ± 0.416
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.519 ± 0.465
ValCys: 0.546 ± 0.187
ValAsp: 4.469 ± 0.53
ValGlu: 4.271 ± 0.521
ValPhe: 2.582 ± 0.425
ValGly: 4.221 ± 0.488
ValHis: 1.49 ± 0.264
ValIle: 4.568 ± 0.544
ValLys: 5.512 ± 0.581
ValLeu: 5.214 ± 0.585
ValMet: 1.192 ± 0.245
ValAsn: 4.469 ± 0.42
ValPro: 1.837 ± 0.264
ValGln: 2.135 ± 0.377
ValArg: 1.986 ± 0.383
ValSer: 5.065 ± 0.474
ValThr: 4.916 ± 0.416
ValVal: 4.42 ± 0.537
ValTrp: 0.943 ± 0.198
ValTyr: 1.738 ± 0.344
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.596 ± 0.181
TrpCys: 0.248 ± 0.115
TrpAsp: 0.646 ± 0.161
TrpGlu: 0.447 ± 0.141
TrpPhe: 0.993 ± 0.268
TrpGly: 0.646 ± 0.125
TrpHis: 0.149 ± 0.096
TrpIle: 0.894 ± 0.266
TrpLys: 0.943 ± 0.262
TrpLeu: 1.738 ± 0.314
TrpMet: 0.099 ± 0.062
TrpAsn: 0.596 ± 0.161
TrpPro: 0.149 ± 0.08
TrpGln: 0.596 ± 0.183
TrpArg: 0.546 ± 0.141
TrpSer: 0.795 ± 0.198
TrpThr: 0.844 ± 0.162
TrpVal: 0.695 ± 0.205
TrpTrp: 0.149 ± 0.104
TrpTyr: 0.497 ± 0.179
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.582 ± 0.345
TyrCys: 0.745 ± 0.197
TyrAsp: 4.022 ± 0.614
TyrGlu: 2.284 ± 0.511
TyrPhe: 1.788 ± 0.332
TyrGly: 3.526 ± 0.506
TyrHis: 0.943 ± 0.248
TyrIle: 2.433 ± 0.386
TyrLys: 3.128 ± 0.38
TyrLeu: 3.228 ± 0.479
TyrMet: 0.844 ± 0.177
TyrAsn: 2.93 ± 0.501
TyrPro: 1.39 ± 0.249
TyrGln: 1.837 ± 0.357
TyrArg: 1.738 ± 0.263
TyrSer: 3.526 ± 0.52
TyrThr: 2.384 ± 0.434
TyrVal: 2.681 ± 0.499
TyrTrp: 0.497 ± 0.13
TyrTyr: 1.986 ± 0.409
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 86 proteins (20139 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
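A protein-level bootstrap of this kind can be sketched as follows. The page does not describe the exact procedure, so the details here are assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the standard deviation over replicates is reported as the error. The function names and the fixed random seed are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each round draws len(proteins) proteins with replacement, so the
    resampling unit is the protein, not the individual residue.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    per_protein = [dipeptide_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)
```

Calling `bootstrap_error(sequences, "AL")` on a list of one-letter protein sequences would give a value and spread comparable in form to the "AlaLeu: 6.704 ± 0.711" entries, although the numbers will only match the table if the database uses the same normalisation and resampling scheme.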

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski