Amino acid dipeptide frequency for Lactobacillus phage A2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
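The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used to build this page: it assumes overlapping dipeptides are counted within each protein, that frequencies are normalized by the total number of dipeptides, and that "expected" means the uniform 1/400 value. The function names and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

# 20 standard residues in one-letter code; any other symbol is pooled as "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_counts(proteins):
    """Count overlapping dipeptides inside each protein (never across proteins)."""
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    return counts


def per_mille(counts):
    """Convert raw counts to per mille (‰) of all observed dipeptides."""
    total = sum(counts.values())
    return {dp: 1000.0 * n / total for dp, n in counts.items()} if total else {}


def classify(freqs, expected=1000.0 / 400):
    """Label each of the 400 standard dipeptides against the uniform expectation (2.5‰)."""
    labels = {}
    for a, b in product(STANDARD, repeat=2):
        f = freqs.get(a + b, 0.0)
        labels[a + b] = "over" if f > expected else "under" if f < expected else "equal"
    return labels


# Made-up toy sequences, not phage A2 proteins.
toy = ["MKAAL", "AAKXA"]
freqs = per_mille(dipeptide_counts(toy))
labels = classify(freqs)
```

Multiplying a per mille value by 10^-3 gives the plain fraction, so a table entry of 9.22 corresponds to a fraction of 0.00922.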

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.22 ± 2.359
AlaCys: 0.457 ± 0.191
AlaAsp: 6.325 ± 0.78
AlaGlu: 5.715 ± 0.478
AlaPhe: 2.591 ± 0.48
AlaGly: 5.41 ± 0.8
AlaHis: 0.838 ± 0.239
AlaIle: 4.801 ± 0.875
AlaLys: 6.706 ± 0.784
AlaLeu: 6.325 ± 0.857
AlaMet: 2.819 ± 0.527
AlaAsn: 5.029 ± 0.623
AlaPro: 2.362 ± 0.378
AlaGln: 2.896 ± 0.493
AlaArg: 3.277 ± 0.383
AlaSer: 5.487 ± 0.933
AlaThr: 4.039 ± 0.502
AlaVal: 6.096 ± 0.744
AlaTrp: 0.686 ± 0.192
AlaTyr: 3.048 ± 0.511
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.305 ± 0.143
CysCys: 0.305 ± 0.137
CysAsp: 0.457 ± 0.168
CysGlu: 0.61 ± 0.241
CysPhe: 0.0 ± 0.0
CysGly: 0.914 ± 0.294
CysHis: 0.381 ± 0.176
CysIle: 0.229 ± 0.142
CysLys: 0.381 ± 0.176
CysLeu: 0.305 ± 0.172
CysMet: 0.076 ± 0.08
CysAsn: 0.229 ± 0.119
CysPro: 0.457 ± 0.217
CysGln: 0.305 ± 0.151
CysArg: 0.381 ± 0.181
CysSer: 0.0 ± 0.0
CysThr: 0.381 ± 0.187
CysVal: 0.076 ± 0.068
CysTrp: 0.0 ± 0.0
CysTyr: 0.152 ± 0.123
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.334 ± 0.624
AspCys: 0.533 ± 0.217
AspAsp: 6.02 ± 0.8
AspGlu: 4.039 ± 0.541
AspPhe: 2.667 ± 0.445
AspGly: 5.868 ± 0.892
AspHis: 1.524 ± 0.45
AspIle: 3.124 ± 0.442
AspLys: 4.344 ± 0.625
AspLeu: 6.172 ± 0.522
AspMet: 1.829 ± 0.296
AspAsn: 3.277 ± 0.492
AspPro: 3.429 ± 0.451
AspGln: 2.438 ± 0.426
AspArg: 3.81 ± 0.592
AspSer: 4.115 ± 0.537
AspThr: 4.496 ± 0.524
AspVal: 4.572 ± 0.623
AspTrp: 1.143 ± 0.236
AspTyr: 3.124 ± 0.471
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.344 ± 0.549
GluCys: 0.229 ± 0.14
GluAsp: 3.429 ± 0.586
GluGlu: 3.581 ± 0.637
GluPhe: 1.981 ± 0.368
GluGly: 3.2 ± 0.45
GluHis: 1.067 ± 0.213
GluIle: 2.972 ± 0.452
GluLys: 4.877 ± 0.807
GluLeu: 5.791 ± 0.566
GluMet: 1.295 ± 0.352
GluAsn: 2.362 ± 0.522
GluPro: 2.286 ± 0.477
GluGln: 2.438 ± 0.411
GluArg: 2.438 ± 0.45
GluSer: 3.581 ± 0.481
GluThr: 3.963 ± 0.571
GluVal: 3.658 ± 0.496
GluTrp: 1.372 ± 0.29
GluTyr: 1.829 ± 0.401
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.819 ± 0.423
PheCys: 0.076 ± 0.081
PheAsp: 2.362 ± 0.355
PheGlu: 2.438 ± 0.563
PhePhe: 0.762 ± 0.296
PheGly: 3.505 ± 0.425
PheHis: 0.533 ± 0.201
PheIle: 2.134 ± 0.51
PheLys: 2.819 ± 0.383
PheLeu: 2.362 ± 0.497
PheMet: 1.143 ± 0.24
PheAsn: 1.753 ± 0.348
PhePro: 0.914 ± 0.271
PheGln: 1.143 ± 0.282
PheArg: 1.143 ± 0.304
PheSer: 2.667 ± 0.56
PheThr: 1.981 ± 0.426
PheVal: 2.591 ± 0.532
PheTrp: 0.762 ± 0.243
PheTyr: 1.219 ± 0.311
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.572 ± 0.778
GlyCys: 0.229 ± 0.131
GlyAsp: 4.344 ± 0.617
GlyGlu: 3.81 ± 0.468
GlyPhe: 2.819 ± 0.449
GlyGly: 4.725 ± 0.565
GlyHis: 1.295 ± 0.334
GlyIle: 4.496 ± 0.58
GlyLys: 5.791 ± 0.68
GlyLeu: 4.648 ± 0.645
GlyMet: 2.515 ± 0.394
GlyAsn: 2.972 ± 0.373
GlyPro: 2.057 ± 0.344
GlyGln: 1.448 ± 0.362
GlyArg: 3.658 ± 0.433
GlySer: 4.953 ± 0.541
GlyThr: 5.791 ± 0.798
GlyVal: 4.42 ± 0.505
GlyTrp: 1.6 ± 0.282
GlyTyr: 3.048 ± 0.449
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.753 ± 0.358
HisCys: 0.152 ± 0.119
HisAsp: 1.067 ± 0.255
HisGlu: 0.61 ± 0.228
HisPhe: 0.838 ± 0.254
HisGly: 1.524 ± 0.283
HisHis: 0.381 ± 0.232
HisIle: 1.143 ± 0.273
HisLys: 1.219 ± 0.3
HisLeu: 2.057 ± 0.46
HisMet: 0.152 ± 0.113
HisAsn: 0.152 ± 0.098
HisPro: 0.61 ± 0.235
HisGln: 0.762 ± 0.24
HisArg: 1.219 ± 0.331
HisSer: 1.372 ± 0.279
HisThr: 1.219 ± 0.343
HisVal: 0.991 ± 0.305
HisTrp: 0.457 ± 0.193
HisTyr: 0.686 ± 0.222
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.258 ± 0.5
IleCys: 0.457 ± 0.204
IleAsp: 4.496 ± 0.761
IleGlu: 4.039 ± 0.497
IlePhe: 1.676 ± 0.389
IleGly: 3.277 ± 0.437
IleHis: 0.991 ± 0.231
IleIle: 3.658 ± 0.567
IleLys: 5.563 ± 0.608
IleLeu: 4.267 ± 0.515
IleMet: 1.448 ± 0.312
IleAsn: 3.81 ± 0.486
IlePro: 2.896 ± 0.415
IleGln: 2.21 ± 0.407
IleArg: 2.667 ± 0.53
IleSer: 4.191 ± 0.573
IleThr: 3.886 ± 0.507
IleVal: 2.362 ± 0.416
IleTrp: 0.61 ± 0.198
IleTyr: 1.905 ± 0.466
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.011 ± 0.697
LysCys: 0.229 ± 0.12
LysAsp: 4.115 ± 0.489
LysGlu: 4.039 ± 0.616
LysPhe: 2.362 ± 0.417
LysGly: 4.877 ± 0.812
LysHis: 1.6 ± 0.346
LysIle: 4.801 ± 0.664
LysLys: 4.572 ± 0.635
LysLeu: 6.172 ± 0.779
LysMet: 1.753 ± 0.389
LysAsn: 4.115 ± 0.642
LysPro: 2.896 ± 0.528
LysGln: 4.039 ± 0.589
LysArg: 4.115 ± 0.501
LysSer: 5.258 ± 0.47
LysThr: 4.039 ± 0.672
LysVal: 4.344 ± 0.599
LysTrp: 1.524 ± 0.283
LysTyr: 2.896 ± 0.424
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.553 ± 0.556
LeuCys: 0.457 ± 0.167
LeuAsp: 5.715 ± 0.746
LeuGlu: 3.886 ± 0.464
LeuPhe: 2.438 ± 0.382
LeuGly: 5.029 ± 0.595
LeuHis: 1.676 ± 0.307
LeuIle: 5.868 ± 0.663
LeuLys: 7.544 ± 0.882
LeuLeu: 5.41 ± 0.565
LeuMet: 1.829 ± 0.386
LeuAsn: 3.505 ± 0.487
LeuPro: 3.886 ± 0.38
LeuGln: 3.429 ± 0.461
LeuArg: 3.429 ± 0.399
LeuSer: 5.182 ± 0.847
LeuThr: 4.953 ± 0.511
LeuVal: 4.344 ± 0.524
LeuTrp: 0.838 ± 0.267
LeuTyr: 2.515 ± 0.484
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.753 ± 0.377
MetCys: 0.076 ± 0.073
MetAsp: 2.286 ± 0.371
MetGlu: 0.838 ± 0.215
MetPhe: 1.295 ± 0.265
MetGly: 1.448 ± 0.316
MetHis: 0.381 ± 0.141
MetIle: 1.219 ± 0.271
MetLys: 2.362 ± 0.395
MetLeu: 1.448 ± 0.272
MetMet: 0.305 ± 0.155
MetAsn: 1.524 ± 0.289
MetPro: 1.219 ± 0.272
MetGln: 1.219 ± 0.322
MetArg: 1.067 ± 0.286
MetSer: 2.134 ± 0.433
MetThr: 2.743 ± 0.502
MetVal: 1.676 ± 0.371
MetTrp: 0.076 ± 0.073
MetTyr: 0.686 ± 0.212
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.115 ± 0.649
AsnCys: 0.152 ± 0.111
AsnAsp: 3.658 ± 0.527
AsnGlu: 2.819 ± 0.417
AsnPhe: 1.219 ± 0.352
AsnGly: 5.258 ± 0.538
AsnHis: 1.143 ± 0.2
AsnIle: 2.286 ± 0.428
AsnLys: 3.429 ± 0.43
AsnLeu: 3.124 ± 0.469
AsnMet: 1.295 ± 0.328
AsnAsn: 2.057 ± 0.343
AsnPro: 2.057 ± 0.29
AsnGln: 2.286 ± 0.456
AsnArg: 2.896 ± 0.397
AsnSer: 2.819 ± 0.704
AsnThr: 2.667 ± 0.38
AsnVal: 3.2 ± 0.451
AsnTrp: 0.914 ± 0.29
AsnTyr: 2.057 ± 0.381
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.277 ± 0.484
ProCys: 0.076 ± 0.076
ProAsp: 3.581 ± 0.558
ProGlu: 4.039 ± 0.649
ProPhe: 1.372 ± 0.276
ProGly: 1.448 ± 0.273
ProHis: 1.067 ± 0.271
ProIle: 2.438 ± 0.386
ProLys: 2.667 ± 0.481
ProLeu: 2.743 ± 0.474
ProMet: 0.914 ± 0.293
ProAsn: 1.981 ± 0.412
ProPro: 1.067 ± 0.333
ProGln: 1.372 ± 0.301
ProArg: 1.372 ± 0.38
ProSer: 2.972 ± 0.484
ProThr: 2.438 ± 0.425
ProVal: 2.743 ± 0.426
ProTrp: 0.61 ± 0.218
ProTyr: 1.372 ± 0.351
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.344 ± 0.723
GlnCys: 0.381 ± 0.211
GlnAsp: 1.829 ± 0.322
GlnGlu: 1.6 ± 0.353
GlnPhe: 1.905 ± 0.367
GlnGly: 1.829 ± 0.303
GlnHis: 0.762 ± 0.233
GlnIle: 3.124 ± 0.477
GlnLys: 2.972 ± 0.506
GlnLeu: 3.963 ± 0.6
GlnMet: 1.067 ± 0.281
GlnAsn: 1.372 ± 0.346
GlnPro: 1.295 ± 0.324
GlnGln: 1.981 ± 0.446
GlnArg: 2.057 ± 0.326
GlnSer: 2.743 ± 0.412
GlnThr: 2.591 ± 0.415
GlnVal: 2.286 ± 0.349
GlnTrp: 0.914 ± 0.262
GlnTyr: 1.676 ± 0.358
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.658 ± 0.478
ArgCys: 0.457 ± 0.19
ArgAsp: 3.353 ± 0.546
ArgGlu: 2.743 ± 0.527
ArgPhe: 1.981 ± 0.414
ArgGly: 2.362 ± 0.371
ArgHis: 0.838 ± 0.204
ArgIle: 3.277 ± 0.457
ArgLys: 2.972 ± 0.504
ArgLeu: 4.496 ± 0.618
ArgMet: 1.829 ± 0.338
ArgAsn: 1.829 ± 0.321
ArgPro: 1.676 ± 0.459
ArgGln: 2.21 ± 0.34
ArgArg: 2.438 ± 0.453
ArgSer: 3.124 ± 0.45
ArgThr: 2.667 ± 0.464
ArgVal: 2.972 ± 0.454
ArgTrp: 0.686 ± 0.239
ArgTyr: 1.372 ± 0.269
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.182 ± 0.74
SerCys: 0.152 ± 0.109
SerAsp: 4.344 ± 0.619
SerGlu: 3.429 ± 0.607
SerPhe: 2.286 ± 0.327
SerGly: 6.63 ± 0.826
SerHis: 0.914 ± 0.301
SerIle: 3.353 ± 0.383
SerLys: 4.496 ± 0.728
SerLeu: 4.877 ± 0.39
SerMet: 2.286 ± 0.392
SerAsn: 3.658 ± 0.441
SerPro: 2.438 ± 0.57
SerGln: 3.124 ± 0.48
SerArg: 2.438 ± 0.466
SerSer: 4.725 ± 0.653
SerThr: 3.886 ± 0.44
SerVal: 4.953 ± 0.584
SerTrp: 1.067 ± 0.273
SerTyr: 2.515 ± 0.507
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.172 ± 0.962
ThrCys: 0.305 ± 0.138
ThrAsp: 5.182 ± 0.566
ThrGlu: 3.048 ± 0.427
ThrPhe: 2.438 ± 0.354
ThrGly: 4.648 ± 0.726
ThrHis: 0.762 ± 0.427
ThrIle: 4.572 ± 0.487
ThrLys: 3.886 ± 0.565
ThrLeu: 4.344 ± 0.574
ThrMet: 1.372 ± 0.286
ThrAsn: 3.353 ± 0.537
ThrPro: 3.277 ± 0.487
ThrGln: 2.057 ± 0.398
ThrArg: 2.667 ± 0.465
ThrSer: 3.886 ± 0.616
ThrThr: 3.734 ± 0.495
ThrVal: 3.505 ± 0.483
ThrTrp: 0.61 ± 0.197
ThrTyr: 2.515 ± 0.459
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.963 ± 0.603
ValCys: 0.457 ± 0.174
ValAsp: 5.868 ± 0.741
ValGlu: 3.124 ± 0.52
ValPhe: 2.515 ± 0.423
ValGly: 3.81 ± 0.427
ValHis: 0.914 ± 0.293
ValIle: 3.81 ± 0.391
ValLys: 4.648 ± 0.572
ValLeu: 5.258 ± 0.636
ValMet: 1.372 ± 0.294
ValAsn: 3.734 ± 0.496
ValPro: 2.819 ± 0.467
ValGln: 2.286 ± 0.458
ValArg: 3.277 ± 0.599
ValSer: 4.344 ± 0.607
ValThr: 3.658 ± 0.563
ValVal: 4.496 ± 0.578
ValTrp: 0.991 ± 0.272
ValTyr: 1.448 ± 0.38
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.838 ± 0.236
TrpCys: 0.152 ± 0.108
TrpAsp: 1.143 ± 0.274
TrpGlu: 0.762 ± 0.224
TrpPhe: 0.381 ± 0.192
TrpGly: 0.838 ± 0.259
TrpHis: 0.61 ± 0.217
TrpIle: 1.067 ± 0.257
TrpLys: 1.143 ± 0.312
TrpLeu: 1.981 ± 0.403
TrpMet: 0.0 ± 0.0
TrpAsn: 0.991 ± 0.522
TrpPro: 0.533 ± 0.172
TrpGln: 0.533 ± 0.224
TrpArg: 0.991 ± 0.325
TrpSer: 0.991 ± 0.306
TrpThr: 1.067 ± 0.249
TrpVal: 1.067 ± 0.341
TrpTrp: 0.305 ± 0.176
TrpTyr: 0.381 ± 0.172
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.658 ± 0.443
TyrCys: 0.61 ± 0.254
TyrAsp: 2.286 ± 0.354
TyrGlu: 1.524 ± 0.376
TyrPhe: 1.6 ± 0.277
TyrGly: 2.591 ± 0.402
TyrHis: 0.686 ± 0.211
TyrIle: 1.295 ± 0.349
TyrLys: 2.438 ± 0.606
TyrLeu: 2.743 ± 0.573
TyrMet: 0.381 ± 0.195
TyrAsn: 1.753 ± 0.321
TyrPro: 1.372 ± 0.291
TyrGln: 2.515 ± 0.416
TyrArg: 1.6 ± 0.316
TyrSer: 2.21 ± 0.407
TyrThr: 2.134 ± 0.301
TyrVal: 2.438 ± 0.512
TyrTrp: 0.61 ± 0.214
TyrTyr: 1.295 ± 0.381
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 61 proteins (13124 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level

You can download the above dipeptide statistics (among other statistics for this proteome) from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski