Amino acid dipeptide frequency for Lactococcus phage C41431

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with every amino acid equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.
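The calculation described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the code used by Proteome-pI: the function names are hypothetical, dipeptides are counted as overlapping pairs within each protein, frequencies are normalised by the total number of dipeptides counted, and the uniform expectation of 2.5‰ is assumed to be the threshold for "over" and "under" representation. The database may make different choices on each of these points.

```python
from collections import Counter

# The 20 standard amino acids (one-letter codes); anything else is mapped to X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Expected frequency of each dipeptide under a uniform model, in per mille.
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5


def dipeptide_counts(proteins):
    """Count overlapping dipeptides within each protein (never across proteins)."""
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    return counts


def dipeptide_per_mille(proteins):
    """Return dipeptide frequencies in per mille of all dipeptides counted."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the (assumed) uniform expectation."""
    if per_mille > expected:
        return "over"
    if per_mille < expected:
        return "under"
    return "expected"
```

For example, `classify(3.359)` labels the AlaAla value from the table as overrepresented, while `classify(0.65)` labels AlaCys as underrepresented under this assumed threshold.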

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
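As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and into an approximate raw count. The helper names are hypothetical, and the use of the total amino acid count reported on this page (9230) as the denominator is an assumption. It is suggested by the smallest non-zero table value, 0.108‰, being close to 1/9230, but it is not stated by the source.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def approx_count(value, total):
    """Estimate the raw count behind a per mille value for an assumed total."""
    return round(per_mille_to_fraction(value) * total)


# AlaAla is listed as 3.359 per mille, i.e. a fraction of about 0.003359.
ala_ala_fraction = per_mille_to_fraction(3.359)

# Assumption: frequencies are relative to the 9230 amino acids reported for this proteome.
ala_ala_count = approx_count(3.359, 9230)
```

Under that assumption, the AlaAla value corresponds to roughly 31 occurrences, and a value of 0.108‰ to a single occurrence.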

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.359AlaAla: 3.359 ± 0.773
0.65AlaCys: 0.65 ± 0.267
4.659AlaAsp: 4.659 ± 0.715
4.334AlaGlu: 4.334 ± 0.587
2.492AlaPhe: 2.492 ± 0.479
4.443AlaGly: 4.443 ± 1.104
0.758AlaHis: 0.758 ± 0.272
4.551AlaIle: 4.551 ± 0.89
5.418AlaLys: 5.418 ± 0.774
5.418AlaLeu: 5.418 ± 1.054
1.625AlaMet: 1.625 ± 0.337
3.901AlaAsn: 3.901 ± 0.56
1.625AlaPro: 1.625 ± 0.539
2.817AlaGln: 2.817 ± 0.489
2.167AlaArg: 2.167 ± 0.579
3.467AlaSer: 3.467 ± 0.61
3.684AlaThr: 3.684 ± 0.554
3.684AlaVal: 3.684 ± 0.637
0.975AlaTrp: 0.975 ± 0.36
2.059AlaTyr: 2.059 ± 0.478
0.0AlaXaa: 0.0 ± 0.0
Cys
0.542CysAla: 0.542 ± 0.265
0.217CysCys: 0.217 ± 0.148
0.758CysAsp: 0.758 ± 0.259
0.65CysGlu: 0.65 ± 0.334
0.217CysPhe: 0.217 ± 0.142
0.542CysGly: 0.542 ± 0.211
0.325CysHis: 0.325 ± 0.184
0.433CysIle: 0.433 ± 0.23
0.542CysLys: 0.542 ± 0.334
0.542CysLeu: 0.542 ± 0.258
0.0CysMet: 0.0 ± 0.0
0.108CysAsn: 0.108 ± 0.106
0.325CysPro: 0.325 ± 0.218
0.325CysGln: 0.325 ± 0.16
0.542CysArg: 0.542 ± 0.371
0.867CysSer: 0.867 ± 0.332
0.108CysThr: 0.108 ± 0.1
0.217CysVal: 0.217 ± 0.141
0.108CysTrp: 0.108 ± 0.094
0.325CysTyr: 0.325 ± 0.169
0.0CysXaa: 0.0 ± 0.0
Asp
2.926AspAla: 2.926 ± 0.488
0.217AspCys: 0.217 ± 0.15
3.034AspAsp: 3.034 ± 0.586
5.093AspGlu: 5.093 ± 0.896
2.926AspPhe: 2.926 ± 0.671
6.285AspGly: 6.285 ± 1.239
0.542AspHis: 0.542 ± 0.215
3.901AspIle: 3.901 ± 0.647
4.117AspLys: 4.117 ± 0.526
4.117AspLeu: 4.117 ± 0.738
2.492AspMet: 2.492 ± 0.467
4.009AspAsn: 4.009 ± 0.558
1.95AspPro: 1.95 ± 0.43
0.542AspGln: 0.542 ± 0.21
2.275AspArg: 2.275 ± 0.417
5.093AspSer: 5.093 ± 0.536
2.817AspThr: 2.817 ± 0.561
3.359AspVal: 3.359 ± 0.623
1.084AspTrp: 1.084 ± 0.355
2.709AspTyr: 2.709 ± 0.541
0.0AspXaa: 0.0 ± 0.0
Glu
4.334GluAla: 4.334 ± 0.682
0.542GluCys: 0.542 ± 0.233
4.009GluAsp: 4.009 ± 0.757
3.576GluGlu: 3.576 ± 0.888
4.117GluPhe: 4.117 ± 0.488
2.384GluGly: 2.384 ± 0.498
1.084GluHis: 1.084 ± 0.354
4.551GluIle: 4.551 ± 0.764
6.826GluLys: 6.826 ± 1.091
6.718GluLeu: 6.718 ± 0.842
2.384GluMet: 2.384 ± 0.543
4.117GluAsn: 4.117 ± 0.692
2.059GluPro: 2.059 ± 0.522
3.359GluGln: 3.359 ± 0.603
3.684GluArg: 3.684 ± 0.663
3.901GluSer: 3.901 ± 0.571
4.117GluThr: 4.117 ± 0.777
4.009GluVal: 4.009 ± 0.783
0.65GluTrp: 0.65 ± 0.296
2.926GluTyr: 2.926 ± 0.532
0.0GluXaa: 0.0 ± 0.0
Phe
2.6PheAla: 2.6 ± 0.577
0.217PheCys: 0.217 ± 0.162
3.251PheAsp: 3.251 ± 0.573
3.251PheGlu: 3.251 ± 0.596
1.3PhePhe: 1.3 ± 0.342
2.926PheGly: 2.926 ± 0.661
0.867PheHis: 0.867 ± 0.323
2.817PheIle: 2.817 ± 0.633
3.792PheLys: 3.792 ± 0.517
2.384PheLeu: 2.384 ± 0.422
1.192PheMet: 1.192 ± 0.338
3.251PheAsn: 3.251 ± 0.617
1.3PhePro: 1.3 ± 0.359
1.625PheGln: 1.625 ± 0.544
1.3PheArg: 1.3 ± 0.374
3.792PheSer: 3.792 ± 0.638
3.901PheThr: 3.901 ± 0.581
2.6PheVal: 2.6 ± 0.478
0.542PheTrp: 0.542 ± 0.271
1.842PheTyr: 1.842 ± 0.511
0.0PheXaa: 0.0 ± 0.0
Gly
3.359GlyAla: 3.359 ± 0.722
0.217GlyCys: 0.217 ± 0.151
2.926GlyAsp: 2.926 ± 0.539
4.659GlyGlu: 4.659 ± 0.691
3.901GlyPhe: 3.901 ± 0.568
4.659GlyGly: 4.659 ± 0.941
0.65GlyHis: 0.65 ± 0.337
6.068GlyIle: 6.068 ± 1.315
5.526GlyLys: 5.526 ± 0.662
5.309GlyLeu: 5.309 ± 0.75
1.192GlyMet: 1.192 ± 0.269
4.443GlyAsn: 4.443 ± 0.695
0.758GlyPro: 0.758 ± 0.436
3.684GlyGln: 3.684 ± 0.678
1.517GlyArg: 1.517 ± 0.417
4.009GlySer: 4.009 ± 0.654
5.526GlyThr: 5.526 ± 1.273
4.334GlyVal: 4.334 ± 0.868
0.758GlyTrp: 0.758 ± 0.247
3.251GlyTyr: 3.251 ± 0.595
0.0GlyXaa: 0.0 ± 0.0
His
0.975HisAla: 0.975 ± 0.362
0.108HisCys: 0.108 ± 0.103
0.975HisAsp: 0.975 ± 0.303
1.3HisGlu: 1.3 ± 0.328
0.433HisPhe: 0.433 ± 0.233
0.867HisGly: 0.867 ± 0.372
0.217HisHis: 0.217 ± 0.144
0.65HisIle: 0.65 ± 0.273
0.867HisLys: 0.867 ± 0.29
0.975HisLeu: 0.975 ± 0.335
0.108HisMet: 0.108 ± 0.112
0.65HisAsn: 0.65 ± 0.22
0.217HisPro: 0.217 ± 0.134
0.542HisGln: 0.542 ± 0.245
0.325HisArg: 0.325 ± 0.179
0.867HisSer: 0.867 ± 0.325
0.433HisThr: 0.433 ± 0.236
0.975HisVal: 0.975 ± 0.326
0.217HisTrp: 0.217 ± 0.14
0.542HisTyr: 0.542 ± 0.238
0.0HisXaa: 0.0 ± 0.0
Ile
4.443IleAla: 4.443 ± 0.546
0.433IleCys: 0.433 ± 0.253
4.659IleAsp: 4.659 ± 0.656
5.201IleGlu: 5.201 ± 0.796
2.275IlePhe: 2.275 ± 0.55
3.901IleGly: 3.901 ± 0.733
0.65IleHis: 0.65 ± 0.237
4.984IleIle: 4.984 ± 0.814
5.418IleLys: 5.418 ± 0.652
4.659IleLeu: 4.659 ± 0.886
1.192IleMet: 1.192 ± 0.305
5.743IleAsn: 5.743 ± 0.797
2.275IlePro: 2.275 ± 0.432
2.492IleGln: 2.492 ± 0.419
1.734IleArg: 1.734 ± 0.467
7.26IleSer: 7.26 ± 0.712
4.551IleThr: 4.551 ± 0.854
2.6IleVal: 2.6 ± 0.493
0.108IleTrp: 0.108 ± 0.1
2.275IleTyr: 2.275 ± 0.486
0.0IleXaa: 0.0 ± 0.0
Lys
6.176LysAla: 6.176 ± 0.839
0.542LysCys: 0.542 ± 0.221
5.634LysAsp: 5.634 ± 0.695
5.526LysGlu: 5.526 ± 1.135
2.709LysPhe: 2.709 ± 0.512
5.526LysGly: 5.526 ± 0.762
1.409LysHis: 1.409 ± 0.562
4.984LysIle: 4.984 ± 0.792
9.21LysLys: 9.21 ± 1.402
7.368LysLeu: 7.368 ± 0.751
1.517LysMet: 1.517 ± 0.42
6.176LysAsn: 6.176 ± 0.941
2.709LysPro: 2.709 ± 0.649
2.817LysGln: 2.817 ± 0.496
4.117LysArg: 4.117 ± 0.834
4.334LysSer: 4.334 ± 0.742
4.768LysThr: 4.768 ± 0.704
5.851LysVal: 5.851 ± 0.931
1.409LysTrp: 1.409 ± 0.419
3.467LysTyr: 3.467 ± 0.776
0.0LysXaa: 0.0 ± 0.0
Leu
5.418LeuAla: 5.418 ± 0.859
0.217LeuCys: 0.217 ± 0.148
4.117LeuAsp: 4.117 ± 0.682
5.309LeuGlu: 5.309 ± 0.587
3.901LeuPhe: 3.901 ± 0.595
5.093LeuGly: 5.093 ± 0.897
0.758LeuHis: 0.758 ± 0.295
4.443LeuIle: 4.443 ± 0.45
8.777LeuLys: 8.777 ± 1.343
5.309LeuLeu: 5.309 ± 1.027
2.059LeuMet: 2.059 ± 0.498
4.984LeuAsn: 4.984 ± 0.776
3.467LeuPro: 3.467 ± 0.45
3.251LeuGln: 3.251 ± 0.751
2.492LeuArg: 2.492 ± 0.739
7.368LeuSer: 7.368 ± 0.82
5.526LeuThr: 5.526 ± 0.647
3.684LeuVal: 3.684 ± 0.562
1.409LeuTrp: 1.409 ± 0.404
1.192LeuTyr: 1.192 ± 0.372
0.0LeuXaa: 0.0 ± 0.0
Met
1.95MetAla: 1.95 ± 0.561
0.542MetCys: 0.542 ± 0.218
1.084MetAsp: 1.084 ± 0.362
2.275MetGlu: 2.275 ± 0.403
0.217MetPhe: 0.217 ± 0.152
0.867MetGly: 0.867 ± 0.327
0.217MetHis: 0.217 ± 0.163
1.192MetIle: 1.192 ± 0.325
2.709MetLys: 2.709 ± 0.466
1.3MetLeu: 1.3 ± 0.418
0.217MetMet: 0.217 ± 0.132
1.409MetAsn: 1.409 ± 0.361
1.084MetPro: 1.084 ± 0.35
0.975MetGln: 0.975 ± 0.347
0.975MetArg: 0.975 ± 0.29
1.409MetSer: 1.409 ± 0.35
2.492MetThr: 2.492 ± 0.498
1.084MetVal: 1.084 ± 0.371
0.433MetTrp: 0.433 ± 0.23
0.758MetTyr: 0.758 ± 0.272
0.0MetXaa: 0.0 ± 0.0
Asn
3.576AsnAla: 3.576 ± 0.524
0.542AsnCys: 0.542 ± 0.251
3.576AsnAsp: 3.576 ± 0.482
2.817AsnGlu: 2.817 ± 0.49
3.792AsnPhe: 3.792 ± 0.761
4.984AsnGly: 4.984 ± 1.352
0.975AsnHis: 0.975 ± 0.372
4.443AsnIle: 4.443 ± 0.762
4.334AsnLys: 4.334 ± 0.642
3.467AsnLeu: 3.467 ± 0.56
0.975AsnMet: 0.975 ± 0.363
5.093AsnAsn: 5.093 ± 0.742
2.709AsnPro: 2.709 ± 0.61
4.443AsnGln: 4.443 ± 0.971
2.059AsnArg: 2.059 ± 0.429
5.418AsnSer: 5.418 ± 0.683
3.467AsnThr: 3.467 ± 0.771
4.226AsnVal: 4.226 ± 0.652
0.433AsnTrp: 0.433 ± 0.191
3.251AsnTyr: 3.251 ± 0.842
0.0AsnXaa: 0.0 ± 0.0
Pro
1.625ProAla: 1.625 ± 0.499
0.0ProCys: 0.0 ± 0.0
1.625ProAsp: 1.625 ± 0.49
2.384ProGlu: 2.384 ± 0.55
1.409ProPhe: 1.409 ± 0.361
0.975ProGly: 0.975 ± 0.455
0.108ProHis: 0.108 ± 0.11
2.059ProIle: 2.059 ± 0.431
2.6ProLys: 2.6 ± 0.555
4.117ProLeu: 4.117 ± 0.654
0.867ProMet: 0.867 ± 0.285
1.734ProAsn: 1.734 ± 0.479
1.192ProPro: 1.192 ± 0.312
1.95ProGln: 1.95 ± 0.468
1.084ProArg: 1.084 ± 0.361
2.167ProSer: 2.167 ± 0.466
2.492ProThr: 2.492 ± 0.54
2.275ProVal: 2.275 ± 0.426
0.325ProTrp: 0.325 ± 0.172
1.192ProTyr: 1.192 ± 0.298
0.0ProXaa: 0.0 ± 0.0
Gln
3.034GlnAla: 3.034 ± 0.52
0.217GlnCys: 0.217 ± 0.141
1.3GlnAsp: 1.3 ± 0.434
3.359GlnGlu: 3.359 ± 0.703
1.842GlnPhe: 1.842 ± 0.443
2.6GlnGly: 2.6 ± 0.655
0.217GlnHis: 0.217 ± 0.156
2.059GlnIle: 2.059 ± 0.538
3.684GlnLys: 3.684 ± 0.681
3.576GlnLeu: 3.576 ± 0.547
0.542GlnMet: 0.542 ± 0.222
2.6GlnAsn: 2.6 ± 0.884
1.3GlnPro: 1.3 ± 0.403
3.901GlnGln: 3.901 ± 1.629
2.492GlnArg: 2.492 ± 0.619
3.576GlnSer: 3.576 ± 0.569
1.842GlnThr: 1.842 ± 0.554
2.275GlnVal: 2.275 ± 0.394
0.65GlnTrp: 0.65 ± 0.238
2.275GlnTyr: 2.275 ± 0.443
0.0GlnXaa: 0.0 ± 0.0
Arg
2.709ArgAla: 2.709 ± 0.52
0.975ArgCys: 0.975 ± 0.268
2.384ArgAsp: 2.384 ± 0.532
2.492ArgGlu: 2.492 ± 0.732
1.409ArgPhe: 1.409 ± 0.372
1.084ArgGly: 1.084 ± 0.362
0.217ArgHis: 0.217 ± 0.142
3.034ArgIle: 3.034 ± 0.599
3.467ArgLys: 3.467 ± 0.969
4.443ArgLeu: 4.443 ± 0.591
0.758ArgMet: 0.758 ± 0.266
2.275ArgAsn: 2.275 ± 0.449
1.3ArgPro: 1.3 ± 0.407
1.734ArgGln: 1.734 ± 0.43
1.734ArgArg: 1.734 ± 0.592
1.842ArgSer: 1.842 ± 0.542
1.842ArgThr: 1.842 ± 0.453
1.409ArgVal: 1.409 ± 0.296
0.433ArgTrp: 0.433 ± 0.215
2.275ArgTyr: 2.275 ± 0.544
0.0ArgXaa: 0.0 ± 0.0
Ser
5.201SerAla: 5.201 ± 1.05
0.433SerCys: 0.433 ± 0.277
5.309SerAsp: 5.309 ± 0.807
4.551SerGlu: 4.551 ± 0.685
3.142SerPhe: 3.142 ± 0.48
6.501SerGly: 6.501 ± 1.071
1.084SerHis: 1.084 ± 0.3
4.334SerIle: 4.334 ± 0.644
6.393SerLys: 6.393 ± 0.749
5.959SerLeu: 5.959 ± 0.916
2.6SerMet: 2.6 ± 0.604
4.768SerAsn: 4.768 ± 0.722
1.409SerPro: 1.409 ± 0.398
2.384SerGln: 2.384 ± 0.615
2.926SerArg: 2.926 ± 0.474
4.551SerSer: 4.551 ± 0.779
3.576SerThr: 3.576 ± 0.805
4.443SerVal: 4.443 ± 0.621
0.433SerTrp: 0.433 ± 0.204
2.817SerTyr: 2.817 ± 0.52
0.0SerXaa: 0.0 ± 0.0
Thr
3.684ThrAla: 3.684 ± 0.731
0.325ThrCys: 0.325 ± 0.178
3.792ThrAsp: 3.792 ± 0.63
4.226ThrGlu: 4.226 ± 0.53
2.6ThrPhe: 2.6 ± 0.419
5.851ThrGly: 5.851 ± 0.961
1.192ThrHis: 1.192 ± 0.286
4.984ThrIle: 4.984 ± 0.778
5.201ThrLys: 5.201 ± 0.691
4.768ThrLeu: 4.768 ± 0.579
0.975ThrMet: 0.975 ± 0.391
3.467ThrAsn: 3.467 ± 0.779
2.384ThrPro: 2.384 ± 0.459
2.167ThrGln: 2.167 ± 0.512
1.517ThrArg: 1.517 ± 0.395
3.684ThrSer: 3.684 ± 0.682
3.576ThrThr: 3.576 ± 1.09
5.959ThrVal: 5.959 ± 1.123
1.084ThrTrp: 1.084 ± 0.299
0.975ThrTyr: 0.975 ± 0.322
0.0ThrXaa: 0.0 ± 0.0
Val
3.251ValAla: 3.251 ± 0.71
0.433ValCys: 0.433 ± 0.242
3.034ValAsp: 3.034 ± 0.598
5.309ValGlu: 5.309 ± 0.805
3.251ValPhe: 3.251 ± 0.581
4.334ValGly: 4.334 ± 0.95
0.217ValHis: 0.217 ± 0.157
3.792ValIle: 3.792 ± 0.751
4.226ValLys: 4.226 ± 0.686
4.117ValLeu: 4.117 ± 0.83
1.3ValMet: 1.3 ± 0.339
2.6ValAsn: 2.6 ± 0.58
2.926ValPro: 2.926 ± 0.639
1.409ValGln: 1.409 ± 0.338
2.275ValArg: 2.275 ± 0.488
4.876ValSer: 4.876 ± 0.599
4.551ValThr: 4.551 ± 0.679
4.117ValVal: 4.117 ± 0.505
0.758ValTrp: 0.758 ± 0.231
2.059ValTyr: 2.059 ± 0.397
0.0ValXaa: 0.0 ± 0.0
Trp
0.867TrpAla: 0.867 ± 0.296
0.433TrpCys: 0.433 ± 0.195
0.325TrpAsp: 0.325 ± 0.25
0.975TrpGlu: 0.975 ± 0.349
0.542TrpPhe: 0.542 ± 0.24
0.65TrpGly: 0.65 ± 0.38
0.108TrpHis: 0.108 ± 0.107
0.867TrpIle: 0.867 ± 0.305
0.975TrpLys: 0.975 ± 0.338
0.975TrpLeu: 0.975 ± 0.279
0.108TrpMet: 0.108 ± 0.092
1.3TrpAsn: 1.3 ± 0.316
0.325TrpPro: 0.325 ± 0.242
0.542TrpGln: 0.542 ± 0.24
0.65TrpArg: 0.65 ± 0.292
1.084TrpSer: 1.084 ± 0.338
0.542TrpThr: 0.542 ± 0.253
0.65TrpVal: 0.65 ± 0.228
0.108TrpTrp: 0.108 ± 0.101
0.542TrpTyr: 0.542 ± 0.25
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.059TyrAla: 2.059 ± 0.532
0.433TyrCys: 0.433 ± 0.223
3.034TyrAsp: 3.034 ± 0.615
2.059TyrGlu: 2.059 ± 0.483
2.275TyrPhe: 2.275 ± 0.55
2.275TyrGly: 2.275 ± 0.456
0.65TyrHis: 0.65 ± 0.247
2.709TyrIle: 2.709 ± 0.476
2.167TyrLys: 2.167 ± 0.496
3.359TyrLeu: 3.359 ± 0.632
0.975TyrMet: 0.975 ± 0.3
1.95TyrAsn: 1.95 ± 0.45
0.867TyrPro: 0.867 ± 0.335
2.384TyrGln: 2.384 ± 0.551
1.95TyrArg: 1.95 ± 0.387
3.251TyrSer: 3.251 ± 0.606
2.6TyrThr: 2.6 ± 0.593
1.084TyrVal: 1.084 ± 0.334
0.65TyrTrp: 0.65 ± 0.265
1.3TyrTyr: 1.3 ± 0.371
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (9230 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
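Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of those estimates is taken as the error. This is a minimal sketch under assumptions, not the database's actual procedure: the function names are hypothetical, the error is taken to be the sample standard deviation over the replicates, and frequencies are normalised by the number of dipeptides counted.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille of all dipeptides counted."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the set size unchanged.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimated error reflects variation between proteins in the proteome.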

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski