Amino acid dipeptide frequency for Lactobacillus phage T25

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
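As a rough illustration of the idea, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. The function name, the toy sequences, and the choice to normalize by the total number of counted dipeptides are assumptions made for this example; the exact normalization used by Proteome-pI is not stated on this page.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs inside one protein; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalize by the total number of dipeptides counted.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical one-letter sequences, not taken from the T25 proteome).
freqs = dipeptide_frequencies(["MKAAL", "AAK"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f} per mille")
```

The table on this page uses three-letter codes (AlaAla, AlaCys, ...); the one-letter codes here are only for brevity.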

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
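The arithmetic behind the expected value can be sketched as follows. The helper names are hypothetical, and the simple "above or below the uniform expectation" rule is an assumption: the page does not say whether the colouring uses the 400 or the 441 baseline, or whether any tolerance is applied.

```python
def uniform_expectation_per_mille(alphabet_size):
    """Expected per-mille frequency if every ordered pair were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected_per_mille:
        return "over"   # would be marked red
    if observed_per_mille < expected_per_mille:
        return "under"  # would be marked blue
    return "as expected"


expected_20 = uniform_expectation_per_mille(20)  # 400 dipeptides
expected_21 = uniform_expectation_per_mille(21)  # 441 dipeptides, with Xaa

# Two values taken from the table: AlaAla and CysCys.
print(expected_20, round(expected_21, 3))
print("AlaAla", classify(7.91, expected_20))
print("CysCys", classify(0.252, expected_20))
```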

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
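For readers who want the unit conversion spelled out, a minimal sketch (the function names are made up for this example):

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


# AlaAla is listed as 7.91 per mille in the table.
alaala_fraction = per_mille_to_fraction(7.91)
alaala_percent = per_mille_to_percent(7.91)
print(f"{alaala_fraction:.5f} as a fraction, {alaala_percent:.3f}%")
```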

For more information, see the sequence space article on Wikipedia.

Columns (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.91 ± 1.083
AlaCys: 0.589 ± 0.218
AlaAsp: 6.479 ± 0.937
AlaGlu: 4.376 ± 0.584
AlaPhe: 2.693 ± 0.469
AlaGly: 6.732 ± 1.062
AlaHis: 1.262 ± 0.335
AlaIle: 6.648 ± 0.701
AlaLys: 7.657 ± 0.904
AlaLeu: 6.563 ± 0.792
AlaMet: 1.851 ± 0.344
AlaAsn: 5.89 ± 0.757
AlaPro: 2.104 ± 0.378
AlaGln: 3.787 ± 0.861
AlaArg: 3.198 ± 0.464
AlaSer: 4.544 ± 0.663
AlaThr: 4.965 ± 0.587
AlaVal: 4.712 ± 0.854
AlaTrp: 1.346 ± 0.422
AlaTyr: 2.945 ± 0.488
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.673 ± 0.249
CysCys: 0.252 ± 0.138
CysAsp: 0.168 ± 0.122
CysGlu: 0.421 ± 0.167
CysPhe: 0.168 ± 0.113
CysGly: 0.421 ± 0.199
CysHis: 0.084 ± 0.091
CysIle: 0.252 ± 0.134
CysLys: 0.421 ± 0.23
CysLeu: 0.168 ± 0.113
CysMet: 0.0 ± 0.0
CysAsn: 0.084 ± 0.095
CysPro: 0.505 ± 0.217
CysGln: 0.168 ± 0.115
CysArg: 0.168 ± 0.113
CysSer: 0.337 ± 0.175
CysThr: 0.421 ± 0.173
CysVal: 0.168 ± 0.106
CysTrp: 0.084 ± 0.106
CysTyr: 0.589 ± 0.209
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.984 ± 0.703
AspCys: 0.252 ± 0.117
AspAsp: 4.376 ± 0.558
AspGlu: 4.123 ± 0.727
AspPhe: 2.188 ± 0.457
AspGly: 6.311 ± 0.693
AspHis: 1.515 ± 0.357
AspIle: 3.787 ± 0.566
AspLys: 4.46 ± 0.614
AspLeu: 5.638 ± 0.631
AspMet: 2.104 ± 0.419
AspAsn: 3.113 ± 0.557
AspPro: 2.44 ± 0.435
AspGln: 2.777 ± 0.421
AspArg: 3.45 ± 0.548
AspSer: 4.628 ± 0.492
AspThr: 3.366 ± 0.593
AspVal: 5.806 ± 0.822
AspTrp: 1.178 ± 0.435
AspTyr: 2.861 ± 0.689
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.702 ± 0.699
GluCys: 0.168 ± 0.091
GluAsp: 3.45 ± 0.555
GluGlu: 2.356 ± 0.453
GluPhe: 2.02 ± 0.48
GluGly: 2.945 ± 0.466
GluHis: 1.094 ± 0.278
GluIle: 2.861 ± 0.559
GluLys: 3.618 ± 0.642
GluLeu: 4.291 ± 0.552
GluMet: 2.272 ± 0.418
GluAsn: 1.851 ± 0.445
GluPro: 2.02 ± 0.46
GluGln: 2.777 ± 0.454
GluArg: 1.683 ± 0.393
GluSer: 2.693 ± 0.42
GluThr: 3.282 ± 0.49
GluVal: 3.534 ± 0.565
GluTrp: 0.757 ± 0.28
GluTyr: 2.104 ± 0.555
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.609 ± 0.396
PheCys: 0.421 ± 0.201
PheAsp: 2.02 ± 0.33
PheGlu: 2.188 ± 0.388
PhePhe: 1.43 ± 0.378
PheGly: 3.029 ± 0.453
PheHis: 0.589 ± 0.259
PheIle: 2.188 ± 0.525
PheLys: 2.693 ± 0.404
PheLeu: 2.272 ± 0.469
PheMet: 1.094 ± 0.254
PheAsn: 2.356 ± 0.421
PhePro: 1.262 ± 0.313
PheGln: 0.757 ± 0.33
PheArg: 1.262 ± 0.371
PheSer: 2.777 ± 0.516
PheThr: 1.935 ± 0.335
PheVal: 1.851 ± 0.427
PheTrp: 1.094 ± 0.433
PheTyr: 0.757 ± 0.203
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.376 ± 0.692
GlyCys: 0.337 ± 0.158
GlyAsp: 5.554 ± 0.526
GlyGlu: 2.693 ± 0.514
GlyPhe: 3.45 ± 0.405
GlyGly: 4.46 ± 1.022
GlyHis: 1.43 ± 0.41
GlyIle: 5.133 ± 0.624
GlyLys: 5.722 ± 0.933
GlyLeu: 5.385 ± 0.884
GlyMet: 2.188 ± 0.418
GlyAsn: 4.291 ± 0.663
GlyPro: 1.515 ± 0.422
GlyGln: 2.777 ± 0.46
GlyArg: 2.272 ± 0.489
GlySer: 5.385 ± 0.649
GlyThr: 4.881 ± 0.708
GlyVal: 4.881 ± 0.663
GlyTrp: 2.104 ± 0.496
GlyTyr: 3.618 ± 0.495
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.178 ± 0.371
HisCys: 0.084 ± 0.106
HisAsp: 1.346 ± 0.256
HisGlu: 1.01 ± 0.297
HisPhe: 1.178 ± 0.35
HisGly: 0.841 ± 0.224
HisHis: 0.421 ± 0.269
HisIle: 1.178 ± 0.361
HisLys: 1.767 ± 0.338
HisLeu: 0.757 ± 0.273
HisMet: 0.168 ± 0.106
HisAsn: 1.346 ± 0.372
HisPro: 0.757 ± 0.229
HisGln: 0.757 ± 0.307
HisArg: 0.841 ± 0.325
HisSer: 1.599 ± 0.568
HisThr: 0.926 ± 0.295
HisVal: 1.346 ± 0.437
HisTrp: 0.337 ± 0.171
HisTyr: 0.841 ± 0.24
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.395 ± 0.717
IleCys: 0.252 ± 0.137
IleAsp: 5.049 ± 0.572
IleGlu: 3.113 ± 0.586
IlePhe: 1.599 ± 0.431
IleGly: 3.702 ± 0.584
IleHis: 1.515 ± 0.387
IleIle: 3.198 ± 0.649
IleLys: 4.796 ± 0.603
IleLeu: 2.945 ± 0.451
IleMet: 1.851 ± 0.406
IleAsn: 3.198 ± 0.554
IlePro: 2.272 ± 0.458
IleGln: 2.693 ± 0.468
IleArg: 2.356 ± 0.56
IleSer: 4.796 ± 0.519
IleThr: 3.702 ± 0.642
IleVal: 3.534 ± 0.611
IleTrp: 1.262 ± 0.347
IleTyr: 2.524 ± 0.531
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.742 ± 0.991
LysCys: 0.421 ± 0.225
LysAsp: 4.796 ± 0.596
LysGlu: 3.366 ± 0.474
LysPhe: 2.272 ± 0.394
LysGly: 3.787 ± 0.553
LysHis: 1.43 ± 0.412
LysIle: 4.207 ± 0.563
LysLys: 5.974 ± 0.794
LysLeu: 5.89 ± 0.656
LysMet: 2.609 ± 0.549
LysAsn: 3.702 ± 0.769
LysPro: 3.029 ± 0.497
LysGln: 4.291 ± 0.823
LysArg: 2.861 ± 0.665
LysSer: 5.974 ± 0.942
LysThr: 4.965 ± 0.586
LysVal: 4.039 ± 0.544
LysTrp: 0.926 ± 0.258
LysTyr: 3.198 ± 0.538
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.152 ± 0.562
LeuCys: 0.337 ± 0.176
LeuAsp: 5.806 ± 0.531
LeuGlu: 3.366 ± 0.514
LeuPhe: 2.609 ± 0.613
LeuGly: 5.806 ± 0.721
LeuHis: 1.683 ± 0.375
LeuIle: 4.376 ± 0.612
LeuLys: 5.722 ± 0.622
LeuLeu: 5.049 ± 0.814
LeuMet: 2.104 ± 0.492
LeuAsn: 4.544 ± 0.84
LeuPro: 2.861 ± 0.452
LeuGln: 2.188 ± 0.434
LeuArg: 3.029 ± 0.61
LeuSer: 5.89 ± 0.725
LeuThr: 4.291 ± 0.659
LeuVal: 4.628 ± 0.687
LeuTrp: 0.673 ± 0.226
LeuTyr: 2.524 ± 0.523
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.02 ± 0.549
MetCys: 0.168 ± 0.113
MetAsp: 1.43 ± 0.324
MetGlu: 1.01 ± 0.273
MetPhe: 0.841 ± 0.247
MetGly: 1.178 ± 0.31
MetHis: 0.421 ± 0.172
MetIle: 1.515 ± 0.295
MetLys: 1.935 ± 0.355
MetLeu: 1.851 ± 0.417
MetMet: 1.094 ± 0.356
MetAsn: 1.599 ± 0.333
MetPro: 1.178 ± 0.364
MetGln: 1.43 ± 0.272
MetArg: 1.094 ± 0.318
MetSer: 2.104 ± 0.357
MetThr: 2.693 ± 0.368
MetVal: 2.272 ± 0.418
MetTrp: 0.673 ± 0.213
MetTyr: 1.767 ± 0.337
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.955 ± 0.878
AsnCys: 0.168 ± 0.149
AsnAsp: 4.544 ± 0.548
AsnGlu: 3.198 ± 0.475
AsnPhe: 1.094 ± 0.255
AsnGly: 6.563 ± 1.024
AsnHis: 1.178 ± 0.271
AsnIle: 3.029 ± 0.414
AsnLys: 2.777 ± 0.595
AsnLeu: 3.787 ± 0.587
AsnMet: 1.767 ± 0.341
AsnAsn: 2.693 ± 0.519
AsnPro: 2.02 ± 0.403
AsnGln: 2.44 ± 0.442
AsnArg: 3.198 ± 0.543
AsnSer: 2.693 ± 0.515
AsnThr: 2.609 ± 0.483
AsnVal: 2.861 ± 0.413
AsnTrp: 1.094 ± 0.3
AsnTyr: 1.094 ± 0.323
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.029 ± 0.691
ProCys: 0.084 ± 0.074
ProAsp: 2.356 ± 0.412
ProGlu: 2.104 ± 0.466
ProPhe: 1.178 ± 0.343
ProGly: 1.935 ± 0.349
ProHis: 0.337 ± 0.188
ProIle: 1.515 ± 0.369
ProLys: 3.198 ± 0.517
ProLeu: 2.693 ± 0.437
ProMet: 0.337 ± 0.169
ProAsn: 2.02 ± 0.38
ProPro: 1.178 ± 0.439
ProGln: 1.599 ± 0.447
ProArg: 1.01 ± 0.299
ProSer: 2.861 ± 0.612
ProThr: 2.693 ± 0.545
ProVal: 2.609 ± 0.377
ProTrp: 0.841 ± 0.262
ProTyr: 1.094 ± 0.196
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.217 ± 0.624
GlnCys: 0.505 ± 0.277
GlnAsp: 2.272 ± 0.546
GlnGlu: 2.02 ± 0.349
GlnPhe: 1.515 ± 0.329
GlnGly: 1.767 ± 0.46
GlnHis: 0.841 ± 0.232
GlnIle: 2.609 ± 0.454
GlnLys: 2.693 ± 0.571
GlnLeu: 3.955 ± 0.556
GlnMet: 1.683 ± 0.398
GlnAsn: 1.262 ± 0.298
GlnPro: 2.02 ± 0.461
GlnGln: 4.46 ± 1.035
GlnArg: 1.43 ± 0.337
GlnSer: 3.113 ± 0.558
GlnThr: 2.777 ± 0.393
GlnVal: 2.356 ± 0.38
GlnTrp: 1.01 ± 0.289
GlnTyr: 1.935 ± 0.358
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.861 ± 0.393
ArgCys: 0.252 ± 0.197
ArgAsp: 2.356 ± 0.642
ArgGlu: 1.935 ± 0.352
ArgPhe: 1.683 ± 0.354
ArgGly: 2.524 ± 0.487
ArgHis: 0.505 ± 0.23
ArgIle: 2.524 ± 0.606
ArgLys: 3.029 ± 0.802
ArgLeu: 3.787 ± 0.759
ArgMet: 0.841 ± 0.242
ArgAsn: 1.851 ± 0.312
ArgPro: 1.262 ± 0.333
ArgGln: 1.935 ± 0.367
ArgArg: 1.683 ± 0.479
ArgSer: 2.188 ± 0.365
ArgThr: 2.693 ± 0.431
ArgVal: 2.945 ± 0.456
ArgTrp: 0.589 ± 0.199
ArgTyr: 1.43 ± 0.432
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.89 ± 0.95
SerCys: 0.084 ± 0.087
SerAsp: 5.385 ± 0.675
SerGlu: 2.861 ± 0.556
SerPhe: 2.44 ± 0.347
SerGly: 5.89 ± 1.199
SerHis: 1.43 ± 0.369
SerIle: 4.881 ± 0.512
SerLys: 4.965 ± 0.604
SerLeu: 5.217 ± 0.572
SerMet: 2.104 ± 0.462
SerAsn: 4.376 ± 0.515
SerPro: 2.188 ± 0.466
SerGln: 2.693 ± 0.607
SerArg: 2.356 ± 0.425
SerSer: 3.955 ± 0.54
SerThr: 3.871 ± 0.446
SerVal: 4.544 ± 0.542
SerTrp: 1.094 ± 0.306
SerTyr: 2.609 ± 0.556
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.544 ± 0.505
ThrCys: 0.337 ± 0.172
ThrAsp: 4.796 ± 0.602
ThrGlu: 1.935 ± 0.428
ThrPhe: 2.44 ± 0.449
ThrGly: 4.712 ± 0.648
ThrHis: 0.841 ± 0.191
ThrIle: 4.376 ± 0.763
ThrLys: 4.544 ± 0.634
ThrLeu: 5.301 ± 0.643
ThrMet: 1.43 ± 0.338
ThrAsn: 3.618 ± 0.634
ThrPro: 2.02 ± 0.435
ThrGln: 2.693 ± 0.352
ThrArg: 2.524 ± 0.383
ThrSer: 4.207 ± 0.582
ThrThr: 3.702 ± 0.493
ThrVal: 4.544 ± 0.645
ThrTrp: 0.841 ± 0.247
ThrTyr: 2.356 ± 0.503
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.049 ± 0.627
ValCys: 0.168 ± 0.1
ValAsp: 5.217 ± 0.707
ValGlu: 4.46 ± 0.688
ValPhe: 1.851 ± 0.229
ValGly: 5.554 ± 0.68
ValHis: 1.262 ± 0.306
ValIle: 3.787 ± 0.588
ValLys: 5.638 ± 0.687
ValLeu: 4.628 ± 0.658
ValMet: 1.599 ± 0.331
ValAsn: 2.609 ± 0.497
ValPro: 2.356 ± 0.425
ValGln: 1.683 ± 0.359
ValArg: 1.851 ± 0.33
ValSer: 4.628 ± 0.651
ValThr: 3.871 ± 0.667
ValVal: 4.712 ± 0.69
ValTrp: 0.589 ± 0.207
ValTyr: 2.693 ± 0.497
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.935 ± 0.321
TrpCys: 0.252 ± 0.15
TrpAsp: 1.262 ± 0.589
TrpGlu: 1.01 ± 0.307
TrpPhe: 0.673 ± 0.221
TrpGly: 0.421 ± 0.224
TrpHis: 0.168 ± 0.13
TrpIle: 0.926 ± 0.214
TrpLys: 1.094 ± 0.323
TrpLeu: 1.599 ± 0.284
TrpMet: 0.505 ± 0.19
TrpAsn: 0.757 ± 0.378
TrpPro: 0.252 ± 0.148
TrpGln: 1.01 ± 0.271
TrpArg: 1.43 ± 0.294
TrpSer: 1.262 ± 0.363
TrpThr: 1.43 ± 0.277
TrpVal: 0.505 ± 0.148
TrpTrp: 0.252 ± 0.135
TrpTyr: 0.505 ± 0.182
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.029 ± 0.681
TyrCys: 0.421 ± 0.239
TyrAsp: 2.693 ± 0.475
TyrGlu: 1.935 ± 0.548
TyrPhe: 1.262 ± 0.339
TyrGly: 3.871 ± 0.847
TyrHis: 0.673 ± 0.284
TyrIle: 1.767 ± 0.461
TyrLys: 2.777 ± 0.522
TyrLeu: 2.945 ± 0.42
TyrMet: 0.589 ± 0.151
TyrAsn: 1.599 ± 0.367
TyrPro: 1.43 ± 0.284
TyrGln: 2.524 ± 0.472
TyrArg: 1.178 ± 0.341
TyrSer: 3.198 ± 0.566
TyrThr: 2.693 ± 0.583
TyrVal: 2.356 ± 0.438
TyrTrp: 0.505 ± 0.202
TyrTyr: 1.262 ± 0.375
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 50 proteins (11885 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
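A protein-level bootstrap can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those estimates is taken as the error. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the reported error are assumptions for illustration; the page only states that the bootstrap was run 100 times at the protein level.

```python
import random
import statistics


def dipeptide_per_mille(proteins, pair):
    """Per-mille frequency of one dipeptide over a list of sequences."""
    count = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency across resampled proteomes (assumed: std dev)."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical one-letter sequences).
proteins = ["MKAAL", "AAKAA", "MLLK", "GAAG"]
error = bootstrap_error(proteins, "AA")
print(f"AA: {dipeptide_per_mille(proteins, 'AA'):.1f} ± {error:.1f} per mille")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the error reflects variation between proteins.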

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski