Amino acid dipeptide frequency for Lactobacillus phage ATCC 8014-B2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones are marked in blue in the table.
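The counting described above can be sketched in Python. This is only an illustration under stated assumptions, not the database's actual pipeline: it assumes overlapping pairs are counted within each protein (never across protein boundaries), that any non-standard letter is mapped to X (Xaa), and that frequencies are normalised by the total number of pairs. The function names and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(freq_permille, expected=1000.0 / 400):
    """Compare a frequency with the uniform expectation (2.5 per mille)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


toy_proteins = ["MKLLA", "MKXLA"]  # hypothetical sequences, not from this proteome
freqs = dipeptide_frequencies(toy_proteins)
```

With real data, `classify(3.197)` (the AlaAla value from the table) gives "over" and `classify(0.346)` (AlaCys) gives "under", which corresponds to the red and blue marking described above.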

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
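As a small worked example of the unit conversion, the snippet below turns a per mille value into a fraction and a percentage and compares it with the uniform expectation of 1/400. The helper names are hypothetical; the AlaAla value is taken from the table.

```python
def permille_to_fraction(value_permille):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert per mille to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 3.197          # AlaAla frequency from the table, in per mille
uniform = 1000.0 / 400   # uniform expectation for 400 dipeptides, in per mille
ratio = ala_ala / uniform  # values above 1 mean more frequent than expected
```

Here the uniform expectation of 2.5‰ is the same quantity as the 0.25% quoted in the text.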

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error), grouped by the first amino acid of the dipeptide:
Ala
AlaAla: 3.197 ± 0.73
AlaCys: 0.346 ± 0.151
AlaAsp: 3.499 ± 0.362
AlaGlu: 2.894 ± 0.386
AlaPhe: 2.246 ± 0.231
AlaGly: 4.752 ± 0.77
AlaHis: 0.95 ± 0.181
AlaIle: 4.968 ± 0.64
AlaLys: 5.356 ± 0.799
AlaLeu: 6.048 ± 0.542
AlaMet: 1.555 ± 0.32
AlaAsn: 3.629 ± 0.412
AlaPro: 1.123 ± 0.28
AlaGln: 2.333 ± 0.386
AlaArg: 2.333 ± 0.371
AlaSer: 4.104 ± 0.878
AlaThr: 3.844 ± 0.492
AlaVal: 4.233 ± 0.636
AlaTrp: 1.166 ± 0.254
AlaTyr: 2.678 ± 0.366
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.302 ± 0.129
CysCys: 0.13 ± 0.08
CysAsp: 0.562 ± 0.179
CysGlu: 0.518 ± 0.155
CysPhe: 0.259 ± 0.099
CysGly: 0.648 ± 0.166
CysHis: 0.216 ± 0.103
CysIle: 0.346 ± 0.133
CysLys: 0.432 ± 0.132
CysLeu: 0.95 ± 0.235
CysMet: 0.216 ± 0.116
CysAsn: 0.518 ± 0.19
CysPro: 0.518 ± 0.199
CysGln: 0.216 ± 0.106
CysArg: 0.432 ± 0.141
CysSer: 0.389 ± 0.138
CysThr: 0.216 ± 0.106
CysVal: 0.562 ± 0.265
CysTrp: 0.086 ± 0.048
CysTyr: 0.475 ± 0.152
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.981 ± 0.442
AspCys: 0.518 ± 0.147
AspAsp: 6.523 ± 0.745
AspGlu: 5.443 ± 0.62
AspPhe: 2.808 ± 0.351
AspGly: 5.875 ± 0.568
AspHis: 0.648 ± 0.145
AspIle: 4.881 ± 0.57
AspLys: 5.616 ± 0.424
AspLeu: 4.924 ± 0.5
AspMet: 2.333 ± 0.364
AspAsn: 4.795 ± 0.441
AspPro: 1.425 ± 0.31
AspGln: 0.821 ± 0.18
AspArg: 2.203 ± 0.33
AspSer: 5.918 ± 0.576
AspThr: 3.758 ± 0.422
AspVal: 3.844 ± 0.536
AspTrp: 0.907 ± 0.151
AspTyr: 3.931 ± 0.473
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.406 ± 0.468
GluCys: 0.605 ± 0.195
GluAsp: 4.492 ± 0.491
GluGlu: 5.184 ± 0.503
GluPhe: 2.203 ± 0.367
GluGly: 2.635 ± 0.346
GluHis: 1.382 ± 0.226
GluIle: 4.665 ± 0.477
GluLys: 4.449 ± 0.488
GluLeu: 7.3 ± 0.68
GluMet: 2.333 ± 0.341
GluAsn: 3.369 ± 0.448
GluPro: 1.944 ± 0.27
GluGln: 2.549 ± 0.297
GluArg: 2.16 ± 0.315
GluSer: 3.499 ± 0.323
GluThr: 3.369 ± 0.442
GluVal: 3.758 ± 0.382
GluTrp: 0.994 ± 0.21
GluTyr: 2.376 ± 0.297
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.944 ± 0.266
PheCys: 0.302 ± 0.114
PheAsp: 2.981 ± 0.408
PheGlu: 2.289 ± 0.353
PhePhe: 2.333 ± 0.368
PheGly: 2.808 ± 0.355
PheHis: 0.389 ± 0.143
PheIle: 2.678 ± 0.366
PheLys: 3.499 ± 0.392
PheLeu: 2.592 ± 0.337
PheMet: 0.95 ± 0.207
PheAsn: 2.808 ± 0.395
PhePro: 0.95 ± 0.206
PheGln: 0.821 ± 0.202
PheArg: 1.685 ± 0.229
PheSer: 3.024 ± 0.426
PheThr: 2.203 ± 0.331
PheVal: 2.678 ± 0.329
PheTrp: 0.259 ± 0.117
PheTyr: 1.901 ± 0.288
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.449 ± 0.539
GlyCys: 0.173 ± 0.087
GlyAsp: 4.147 ± 0.395
GlyGlu: 3.499 ± 0.375
GlyPhe: 3.11 ± 0.361
GlyGly: 4.838 ± 0.875
GlyHis: 1.296 ± 0.22
GlyIle: 4.32 ± 0.576
GlyLys: 5.356 ± 0.813
GlyLeu: 6.609 ± 0.749
GlyMet: 1.771 ± 0.281
GlyAsn: 4.32 ± 0.412
GlyPro: 0.734 ± 0.183
GlyGln: 2.462 ± 0.346
GlyArg: 2.592 ± 0.335
GlySer: 4.881 ± 0.638
GlyThr: 4.622 ± 0.526
GlyVal: 3.931 ± 0.41
GlyTrp: 1.037 ± 0.2
GlyTyr: 3.499 ± 0.365
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.734 ± 0.162
HisCys: 0.086 ± 0.06
HisAsp: 1.296 ± 0.214
HisGlu: 0.95 ± 0.233
HisPhe: 0.821 ± 0.204
HisGly: 1.598 ± 0.232
HisHis: 0.605 ± 0.171
HisIle: 1.339 ± 0.253
HisLys: 1.08 ± 0.254
HisLeu: 0.994 ± 0.228
HisMet: 0.691 ± 0.174
HisAsn: 1.339 ± 0.247
HisPro: 0.605 ± 0.18
HisGln: 0.302 ± 0.101
HisArg: 0.648 ± 0.157
HisSer: 1.037 ± 0.209
HisThr: 0.864 ± 0.187
HisVal: 1.08 ± 0.241
HisTrp: 0.259 ± 0.122
HisTyr: 0.95 ± 0.212
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.32 ± 0.382
IleCys: 0.389 ± 0.134
IleAsp: 6.004 ± 0.603
IleGlu: 3.715 ± 0.351
IlePhe: 1.987 ± 0.399
IleGly: 4.881 ± 0.969
IleHis: 1.296 ± 0.226
IleIle: 4.233 ± 0.482
IleLys: 6.048 ± 0.596
IleLeu: 4.449 ± 0.627
IleMet: 2.073 ± 0.353
IleAsn: 6.479 ± 0.492
IlePro: 1.728 ± 0.292
IleGln: 1.857 ± 0.291
IleArg: 1.771 ± 0.255
IleSer: 5.4 ± 0.465
IleThr: 3.715 ± 0.457
IleVal: 4.622 ± 0.525
IleTrp: 0.346 ± 0.131
IleTyr: 2.505 ± 0.321
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.492 ± 0.808
LysCys: 0.605 ± 0.17
LysAsp: 5.4 ± 0.514
LysGlu: 6.177 ± 0.696
LysPhe: 3.067 ± 0.324
LysGly: 4.622 ± 0.614
LysHis: 1.728 ± 0.318
LysIle: 5.572 ± 0.587
LysLys: 7.127 ± 0.527
LysLeu: 7.171 ± 0.497
LysMet: 3.024 ± 0.406
LysAsn: 4.924 ± 0.618
LysPro: 1.987 ± 0.32
LysGln: 3.283 ± 0.434
LysArg: 3.456 ± 0.335
LysSer: 4.276 ± 0.584
LysThr: 5.011 ± 0.476
LysVal: 4.19 ± 0.453
LysTrp: 0.907 ± 0.229
LysTyr: 4.406 ± 0.434
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.184 ± 0.429
LeuCys: 0.907 ± 0.248
LeuAsp: 5.313 ± 0.53
LeuGlu: 6.307 ± 0.603
LeuPhe: 3.24 ± 0.34
LeuGly: 4.536 ± 0.576
LeuHis: 1.166 ± 0.222
LeuIle: 5.184 ± 0.43
LeuLys: 6.35 ± 0.473
LeuLeu: 7.559 ± 0.579
LeuMet: 2.549 ± 0.344
LeuAsn: 5.054 ± 0.442
LeuPro: 2.765 ± 0.3
LeuGln: 2.592 ± 0.285
LeuArg: 2.678 ± 0.373
LeuSer: 7.041 ± 0.594
LeuThr: 5.4 ± 0.484
LeuVal: 4.363 ± 0.443
LeuTrp: 0.734 ± 0.191
LeuTyr: 3.499 ± 0.5
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.771 ± 0.253
MetCys: 0.216 ± 0.127
MetAsp: 1.166 ± 0.221
MetGlu: 1.728 ± 0.276
MetPhe: 1.08 ± 0.23
MetGly: 1.037 ± 0.203
MetHis: 0.346 ± 0.126
MetIle: 1.987 ± 0.285
MetLys: 2.894 ± 0.352
MetLeu: 2.721 ± 0.375
MetMet: 0.562 ± 0.155
MetAsn: 1.296 ± 0.219
MetPro: 0.562 ± 0.171
MetGln: 1.253 ± 0.281
MetArg: 0.95 ± 0.227
MetSer: 2.678 ± 0.338
MetThr: 2.117 ± 0.29
MetVal: 1.857 ± 0.274
MetTrp: 0.259 ± 0.089
MetTyr: 0.994 ± 0.225
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.715 ± 0.492
AsnCys: 0.605 ± 0.189
AsnAsp: 4.276 ± 0.319
AsnGlu: 3.931 ± 0.355
AsnPhe: 2.246 ± 0.321
AsnGly: 5.313 ± 0.56
AsnHis: 1.123 ± 0.264
AsnIle: 4.968 ± 0.456
AsnLys: 6.22 ± 0.547
AsnLeu: 3.672 ± 0.423
AsnMet: 1.598 ± 0.236
AsnAsn: 4.881 ± 0.579
AsnPro: 2.03 ± 0.289
AsnGln: 2.419 ± 0.416
AsnArg: 2.117 ± 0.278
AsnSer: 4.276 ± 0.393
AsnThr: 3.456 ± 0.396
AsnVal: 4.406 ± 0.345
AsnTrp: 0.734 ± 0.139
AsnTyr: 2.937 ± 0.416
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.549 ± 0.352
ProCys: 0.13 ± 0.085
ProAsp: 2.117 ± 0.303
ProGlu: 1.944 ± 0.361
ProPhe: 0.605 ± 0.163
ProGly: 1.253 ± 0.293
ProHis: 0.518 ± 0.142
ProIle: 1.598 ± 0.248
ProLys: 1.598 ± 0.285
ProLeu: 1.901 ± 0.284
ProMet: 0.216 ± 0.085
ProAsn: 1.555 ± 0.223
ProPro: 0.259 ± 0.14
ProGln: 0.778 ± 0.188
ProArg: 0.778 ± 0.214
ProSer: 1.944 ± 0.289
ProThr: 1.253 ± 0.267
ProVal: 2.376 ± 0.307
ProTrp: 0.216 ± 0.087
ProTyr: 1.512 ± 0.255
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.944 ± 0.427
GlnCys: 0.13 ± 0.074
GlnAsp: 1.685 ± 0.234
GlnGlu: 2.03 ± 0.241
GlnPhe: 1.425 ± 0.26
GlnGly: 1.641 ± 0.278
GlnHis: 0.562 ± 0.122
GlnIle: 2.289 ± 0.33
GlnLys: 2.203 ± 0.313
GlnLeu: 2.678 ± 0.335
GlnMet: 1.037 ± 0.174
GlnAsn: 2.16 ± 0.339
GlnPro: 0.821 ± 0.228
GlnGln: 1.08 ± 0.166
GlnArg: 1.253 ± 0.235
GlnSer: 2.721 ± 0.419
GlnThr: 2.851 ± 0.514
GlnVal: 2.073 ± 0.347
GlnTrp: 0.302 ± 0.092
GlnTyr: 1.987 ± 0.358
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.376 ± 0.266
ArgCys: 0.259 ± 0.104
ArgAsp: 2.721 ± 0.409
ArgGlu: 1.987 ± 0.326
ArgPhe: 1.814 ± 0.287
ArgGly: 2.03 ± 0.297
ArgHis: 0.648 ± 0.173
ArgIle: 1.512 ± 0.289
ArgLys: 2.635 ± 0.333
ArgLeu: 2.981 ± 0.307
ArgMet: 0.778 ± 0.242
ArgAsn: 1.555 ± 0.296
ArgPro: 0.648 ± 0.166
ArgGln: 1.123 ± 0.217
ArgArg: 1.08 ± 0.225
ArgSer: 2.419 ± 0.333
ArgThr: 1.771 ± 0.25
ArgVal: 2.808 ± 0.424
ArgTrp: 0.518 ± 0.196
ArgTyr: 1.728 ± 0.283
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.443 ± 0.921
SerCys: 0.475 ± 0.142
SerAsp: 4.924 ± 0.426
SerGlu: 5.443 ± 0.652
SerPhe: 2.894 ± 0.389
SerGly: 5.875 ± 0.611
SerHis: 1.469 ± 0.229
SerIle: 4.147 ± 0.434
SerLys: 5.918 ± 0.51
SerLeu: 6.048 ± 0.532
SerMet: 1.901 ± 0.287
SerAsn: 4.536 ± 0.545
SerPro: 1.857 ± 0.278
SerGln: 2.376 ± 0.522
SerArg: 1.814 ± 0.233
SerSer: 6.479 ± 0.608
SerThr: 3.801 ± 0.54
SerVal: 4.795 ± 0.514
SerTrp: 0.907 ± 0.181
SerTyr: 3.456 ± 0.415
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.06 ± 0.685
ThrCys: 0.605 ± 0.185
ThrAsp: 4.19 ± 0.493
ThrGlu: 2.765 ± 0.328
ThrPhe: 2.462 ± 0.429
ThrGly: 4.665 ± 0.789
ThrHis: 1.037 ± 0.261
ThrIle: 4.665 ± 0.387
ThrLys: 4.579 ± 0.43
ThrLeu: 4.104 ± 0.398
ThrMet: 1.123 ± 0.181
ThrAsn: 3.499 ± 0.424
ThrPro: 2.03 ± 0.3
ThrGln: 2.073 ± 0.327
ThrArg: 1.641 ± 0.261
ThrSer: 4.492 ± 0.66
ThrThr: 3.456 ± 0.532
ThrVal: 4.838 ± 0.567
ThrTrp: 0.562 ± 0.129
ThrTyr: 2.937 ± 0.341
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.06 ± 0.553
ValCys: 0.778 ± 0.223
ValAsp: 4.492 ± 0.579
ValGlu: 3.801 ± 0.358
ValPhe: 2.16 ± 0.3
ValGly: 4.406 ± 0.551
ValHis: 1.08 ± 0.189
ValIle: 4.19 ± 0.536
ValLys: 5.745 ± 0.456
ValLeu: 4.924 ± 0.511
ValMet: 1.598 ± 0.345
ValAsn: 4.32 ± 0.43
ValPro: 1.641 ± 0.19
ValGln: 1.728 ± 0.325
ValArg: 1.598 ± 0.294
ValSer: 5.572 ± 0.506
ValThr: 4.536 ± 0.5
ValVal: 4.536 ± 0.513
ValTrp: 0.734 ± 0.174
ValTyr: 2.376 ± 0.375
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.778 ± 0.204
TrpCys: 0.043 ± 0.041
TrpAsp: 0.648 ± 0.163
TrpGlu: 0.605 ± 0.14
TrpPhe: 0.734 ± 0.261
TrpGly: 0.734 ± 0.189
TrpHis: 0.173 ± 0.09
TrpIle: 0.821 ± 0.201
TrpLys: 0.648 ± 0.175
TrpLeu: 1.296 ± 0.31
TrpMet: 0.086 ± 0.061
TrpAsn: 0.648 ± 0.171
TrpPro: 0.086 ± 0.056
TrpGln: 0.605 ± 0.159
TrpArg: 0.605 ± 0.134
TrpSer: 1.08 ± 0.223
TrpThr: 0.821 ± 0.168
TrpVal: 0.432 ± 0.131
TrpTrp: 0.302 ± 0.09
TrpTyr: 0.691 ± 0.171
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.549 ± 0.316
TyrCys: 0.691 ± 0.165
TyrAsp: 3.715 ± 0.534
TyrGlu: 2.462 ± 0.399
TyrPhe: 1.598 ± 0.292
TyrGly: 3.499 ± 0.5
TyrHis: 0.648 ± 0.184
TyrIle: 3.456 ± 0.446
TyrLys: 3.672 ± 0.435
TyrLeu: 3.499 ± 0.411
TyrMet: 1.037 ± 0.171
TyrAsn: 3.24 ± 0.315
TyrPro: 1.382 ± 0.206
TyrGln: 2.203 ± 0.377
TyrArg: 1.598 ± 0.271
TyrSer: 3.326 ± 0.512
TyrThr: 2.721 ± 0.445
TyrVal: 2.937 ± 0.391
TyrTrp: 0.605 ± 0.155
TyrTyr: 2.203 ± 0.386
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 127 proteins (23151 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
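One way such a protein-level bootstrap could look is sketched below. The source states only that bootstrapping was done 100 times at the protein level; resampling whole proteins with replacement, recomputing the frequency for each replicate, and reporting the sample standard deviation as the error are assumptions of this sketch, as are the function names and the toy sequences.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency across bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins (not single residues) with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


toy = ["MKLLA", "MKKLA", "AKLMK", "LLLLA"]  # hypothetical sequences
err = bootstrap_error(toy, "KL")
```

Resampling at the protein level keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins happen to be in the proteome.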

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski