Amino acid dipeptide frequency for Lactococcus phage 5171F

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
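As a rough illustration of the idea, the sketch below counts overlapping dipeptides in a set of protein sequences. The function names and the toy sequences are hypothetical, one-letter residue codes are used for brevity, and normalising by the total number of dipeptides is an assumption; the database may normalise differently.

```python
from collections import Counter


def dipeptide_counts(sequences):
    """Count overlapping dipeptides within each protein sequence."""
    counts = Counter()
    for seq in sequences:
        # Pair each residue with its successor; pairs never span two proteins.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    return counts


def dipeptide_frequencies(sequences):
    """Return each dipeptide's share of all counted dipeptides (as a fraction)."""
    counts = dipeptide_counts(sequences)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Toy input, not taken from the phage proteome.
example = ["MKKLAE", "MAEL"]
freqs = dipeptide_frequencies(example)
print(freqs["AE"])
```

Dipeptides are ordered, so "AE" and "EA" are counted separately, which is why the table has 20 x 20 (or 21 x 21) cells rather than half that number.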

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random, with all amino acids equally frequent, each would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility overrepresented dipeptides are marked in red in the table and underrepresented ones in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
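The arithmetic above can be sketched as follows: 1/400 corresponds to 2.5‰, and a table value is converted to a fraction by multiplying by 10⁻³. The helper names are hypothetical, and treating the uniform 2.5‰ as the threshold for "over" and "under" is an assumption based on the text; the colouring rule actually used on the page may differ (for example, it might take the error into account).

```python
N_STANDARD = 20
# Uniform expectation for one of the 400 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000 / (N_STANDARD ** 2)  # 2.5


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the (assumed) uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# A few values copied from the table (per mille).
table_values = {"GluLeu": 7.892, "LysGlu": 8.858, "CysCys": 0.161, "AlaPhe": 2.577}
labels = {pair: classify(v) for pair, v in table_values.items()}
print(labels)
```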

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.161 ± 0.12
AlaCys: 0.161 ± 0.2
AlaAsp: 3.543 ± 0.518
AlaGlu: 5.315 ± 1.085
AlaPhe: 2.577 ± 0.517
AlaGly: 3.221 ± 0.744
AlaHis: 0.0 ± 0.0
AlaIle: 4.832 ± 0.984
AlaLys: 5.959 ± 1.198
AlaLeu: 6.925 ± 1.474
AlaMet: 1.933 ± 0.45
AlaAsn: 4.671 ± 0.627
AlaPro: 0.805 ± 0.396
AlaGln: 3.06 ± 0.851
AlaArg: 1.933 ± 0.531
AlaSer: 3.221 ± 0.627
AlaThr: 4.349 ± 1.108
AlaVal: 4.51 ± 1.292
AlaTrp: 0.966 ± 0.399
AlaTyr: 3.06 ± 0.793
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.483 ± 0.355
CysCys: 0.161 ± 0.195
CysAsp: 0.161 ± 0.148
CysGlu: 0.483 ± 0.265
CysPhe: 0.322 ± 0.223
CysGly: 0.644 ± 0.313
CysHis: 0.644 ± 0.456
CysIle: 0.161 ± 0.169
CysLys: 1.127 ± 0.547
CysLeu: 0.644 ± 0.299
CysMet: 0.161 ± 0.188
CysAsn: 0.483 ± 0.354
CysPro: 0.322 ± 0.195
CysGln: 0.0 ± 0.0
CysArg: 0.161 ± 0.122
CysSer: 0.644 ± 0.278
CysThr: 0.966 ± 0.406
CysVal: 0.161 ± 0.146
CysTrp: 0.322 ± 0.183
CysTyr: 0.483 ± 0.235
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.382 ± 0.752
AspCys: 0.161 ± 0.122
AspAsp: 4.026 ± 0.805
AspGlu: 4.51 ± 0.804
AspPhe: 4.671 ± 0.908
AspGly: 4.832 ± 1.003
AspHis: 0.161 ± 0.182
AspIle: 3.865 ± 0.929
AspLys: 5.798 ± 0.717
AspLeu: 5.315 ± 0.711
AspMet: 2.255 ± 0.598
AspAsn: 3.704 ± 0.913
AspPro: 1.288 ± 0.339
AspGln: 1.127 ± 0.444
AspArg: 1.45 ± 0.489
AspSer: 3.382 ± 0.853
AspThr: 3.221 ± 0.604
AspVal: 1.933 ± 0.432
AspTrp: 1.288 ± 0.518
AspTyr: 4.026 ± 0.991
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.865 ± 0.713
GluCys: 1.288 ± 0.459
GluAsp: 3.704 ± 0.675
GluGlu: 5.798 ± 1.336
GluPhe: 4.187 ± 0.903
GluGly: 3.221 ± 0.68
GluHis: 1.45 ± 0.576
GluIle: 4.671 ± 0.657
GluLys: 4.349 ± 0.892
GluLeu: 7.892 ± 1.142
GluMet: 3.865 ± 0.773
GluAsn: 4.187 ± 0.61
GluPro: 0.966 ± 0.398
GluGln: 4.026 ± 0.86
GluArg: 4.026 ± 0.839
GluSer: 2.255 ± 0.49
GluThr: 5.798 ± 0.866
GluVal: 4.671 ± 0.924
GluTrp: 1.127 ± 0.423
GluTyr: 2.738 ± 0.937
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.577 ± 0.61
PheCys: 0.644 ± 0.27
PheAsp: 3.06 ± 0.694
PheGlu: 2.416 ± 0.503
PhePhe: 1.127 ± 0.473
PheGly: 3.704 ± 0.708
PheHis: 0.322 ± 0.161
PheIle: 4.187 ± 1.053
PheLys: 5.637 ± 1.042
PheLeu: 2.255 ± 0.461
PheMet: 1.45 ± 0.418
PheAsn: 3.704 ± 0.703
PhePro: 1.127 ± 0.396
PheGln: 0.966 ± 0.311
PheArg: 1.611 ± 0.36
PheSer: 2.577 ± 0.57
PheThr: 3.382 ± 0.656
PheVal: 2.416 ± 0.661
PheTrp: 0.805 ± 0.374
PheTyr: 2.416 ± 0.567
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.993 ± 1.279
GlyCys: 0.161 ± 0.195
GlyAsp: 3.543 ± 0.825
GlyGlu: 3.221 ± 0.79
GlyPhe: 4.671 ± 1.171
GlyGly: 4.671 ± 0.86
GlyHis: 0.483 ± 0.244
GlyIle: 4.187 ± 0.802
GlyLys: 4.51 ± 0.822
GlyLeu: 6.281 ± 1.04
GlyMet: 2.094 ± 0.72
GlyAsn: 3.543 ± 0.689
GlyPro: 0.644 ± 0.256
GlyGln: 2.899 ± 0.597
GlyArg: 3.06 ± 0.639
GlySer: 5.476 ± 1.008
GlyThr: 6.12 ± 1.255
GlyVal: 5.315 ± 0.829
GlyTrp: 0.644 ± 0.277
GlyTyr: 3.221 ± 0.809
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.483 ± 0.252
HisCys: 0.161 ± 0.139
HisAsp: 0.805 ± 0.384
HisGlu: 0.805 ± 0.368
HisPhe: 0.805 ± 0.365
HisGly: 0.322 ± 0.197
HisHis: 0.161 ± 0.139
HisIle: 1.127 ± 0.33
HisLys: 0.966 ± 0.426
HisLeu: 1.611 ± 0.583
HisMet: 0.161 ± 0.174
HisAsn: 0.805 ± 0.382
HisPro: 0.161 ± 0.159
HisGln: 0.644 ± 0.348
HisArg: 0.161 ± 0.154
HisSer: 0.805 ± 0.359
HisThr: 0.161 ± 0.162
HisVal: 0.644 ± 0.26
HisTrp: 0.483 ± 0.236
HisTyr: 1.127 ± 0.339
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.543 ± 0.666
IleCys: 0.483 ± 0.244
IleAsp: 5.798 ± 1.023
IleGlu: 5.798 ± 0.992
IlePhe: 2.094 ± 0.473
IleGly: 4.832 ± 0.85
IleHis: 0.966 ± 0.394
IleIle: 4.671 ± 0.865
IleLys: 5.476 ± 0.951
IleLeu: 3.704 ± 1.043
IleMet: 1.772 ± 0.524
IleAsn: 5.476 ± 0.993
IlePro: 2.094 ± 0.388
IleGln: 2.255 ± 0.444
IleArg: 2.094 ± 0.486
IleSer: 2.899 ± 0.532
IleThr: 4.187 ± 0.715
IleVal: 2.416 ± 0.536
IleTrp: 0.805 ± 0.39
IleTyr: 2.899 ± 0.605
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.409 ± 0.859
LysCys: 0.966 ± 0.394
LysAsp: 5.315 ± 1.162
LysGlu: 8.858 ± 1.379
LysPhe: 3.221 ± 0.6
LysGly: 7.248 ± 0.876
LysHis: 1.288 ± 0.457
LysIle: 4.671 ± 1.208
LysLys: 7.409 ± 1.404
LysLeu: 6.281 ± 1.032
LysMet: 3.06 ± 0.739
LysAsn: 5.315 ± 0.771
LysPro: 2.738 ± 0.637
LysGln: 4.349 ± 0.731
LysArg: 3.382 ± 0.584
LysSer: 3.704 ± 0.665
LysThr: 4.187 ± 0.919
LysVal: 6.925 ± 1.204
LysTrp: 0.966 ± 0.418
LysTyr: 3.221 ± 0.784
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.349 ± 0.932
LeuCys: 0.805 ± 0.311
LeuAsp: 3.865 ± 0.727
LeuGlu: 7.409 ± 1.286
LeuPhe: 3.382 ± 0.697
LeuGly: 5.959 ± 1.428
LeuHis: 1.611 ± 0.694
LeuIle: 4.832 ± 0.883
LeuLys: 7.731 ± 1.154
LeuLeu: 7.086 ± 1.173
LeuMet: 2.738 ± 1.16
LeuAsn: 5.476 ± 1.215
LeuPro: 2.738 ± 0.602
LeuGln: 4.187 ± 0.909
LeuArg: 2.577 ± 0.622
LeuSer: 4.832 ± 0.876
LeuThr: 5.637 ± 0.998
LeuVal: 4.671 ± 0.9
LeuTrp: 0.161 ± 0.198
LeuTyr: 3.382 ± 0.797
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.543 ± 0.658
MetCys: 0.161 ± 0.148
MetAsp: 1.611 ± 0.399
MetGlu: 2.255 ± 0.699
MetPhe: 1.288 ± 0.553
MetGly: 1.45 ± 0.414
MetHis: 0.161 ± 0.159
MetIle: 2.899 ± 0.701
MetLys: 2.899 ± 0.616
MetLeu: 2.416 ± 0.673
MetMet: 0.322 ± 0.154
MetAsn: 1.288 ± 0.447
MetPro: 0.805 ± 0.374
MetGln: 1.127 ± 0.38
MetArg: 1.127 ± 0.512
MetSer: 1.45 ± 0.564
MetThr: 1.772 ± 0.567
MetVal: 1.45 ± 0.474
MetTrp: 0.161 ± 0.145
MetTyr: 0.966 ± 0.388
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.832 ± 1.191
AsnCys: 0.805 ± 0.402
AsnAsp: 3.704 ± 0.867
AsnGlu: 5.476 ± 0.838
AsnPhe: 2.899 ± 0.55
AsnGly: 5.315 ± 0.606
AsnHis: 0.322 ± 0.198
AsnIle: 3.543 ± 0.736
AsnLys: 6.764 ± 1.167
AsnLeu: 4.671 ± 0.903
AsnMet: 1.933 ± 0.534
AsnAsn: 4.349 ± 0.741
AsnPro: 2.577 ± 0.906
AsnGln: 2.094 ± 0.602
AsnArg: 1.772 ± 0.468
AsnSer: 2.899 ± 0.69
AsnThr: 4.349 ± 0.779
AsnVal: 3.865 ± 0.577
AsnTrp: 0.966 ± 0.307
AsnTyr: 2.577 ± 0.535
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.966 ± 0.38
ProCys: 0.0 ± 0.0
ProAsp: 2.094 ± 0.639
ProGlu: 1.288 ± 0.415
ProPhe: 1.288 ± 0.538
ProGly: 0.322 ± 0.244
ProHis: 0.0 ± 0.0
ProIle: 1.611 ± 0.444
ProLys: 2.255 ± 0.538
ProLeu: 2.738 ± 0.455
ProMet: 0.161 ± 0.159
ProAsn: 2.577 ± 0.923
ProPro: 0.966 ± 0.372
ProGln: 1.772 ± 0.543
ProArg: 1.127 ± 0.403
ProSer: 1.772 ± 0.576
ProThr: 1.288 ± 0.402
ProVal: 1.611 ± 0.443
ProTrp: 0.0 ± 0.0
ProTyr: 1.45 ± 0.436
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.543 ± 0.943
GlnCys: 0.322 ± 0.201
GlnAsp: 1.611 ± 0.515
GlnGlu: 2.094 ± 0.538
GlnPhe: 1.933 ± 0.576
GlnGly: 2.738 ± 0.485
GlnHis: 0.483 ± 0.253
GlnIle: 3.221 ± 0.596
GlnLys: 4.026 ± 0.981
GlnLeu: 3.06 ± 0.548
GlnMet: 1.611 ± 0.47
GlnAsn: 1.611 ± 0.433
GlnPro: 0.644 ± 0.267
GlnGln: 1.933 ± 0.642
GlnArg: 1.772 ± 0.522
GlnSer: 2.577 ± 0.709
GlnThr: 2.577 ± 0.52
GlnVal: 1.611 ± 0.384
GlnTrp: 0.644 ± 0.263
GlnTyr: 1.611 ± 0.641
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.933 ± 0.438
ArgCys: 0.483 ± 0.245
ArgAsp: 1.772 ± 0.457
ArgGlu: 4.51 ± 0.994
ArgPhe: 1.772 ± 0.587
ArgGly: 2.416 ± 0.418
ArgHis: 0.322 ± 0.233
ArgIle: 1.288 ± 0.352
ArgLys: 3.865 ± 0.794
ArgLeu: 1.933 ± 0.483
ArgMet: 0.805 ± 0.43
ArgAsn: 2.255 ± 0.728
ArgPro: 0.966 ± 0.483
ArgGln: 0.966 ± 0.326
ArgArg: 1.127 ± 0.548
ArgSer: 1.772 ± 0.623
ArgThr: 1.933 ± 0.534
ArgVal: 2.416 ± 0.698
ArgTrp: 0.966 ± 0.392
ArgTyr: 1.288 ± 0.489
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.06 ± 0.754
SerCys: 0.0 ± 0.0
SerAsp: 3.704 ± 0.748
SerGlu: 3.382 ± 0.721
SerPhe: 2.255 ± 0.436
SerGly: 4.671 ± 0.912
SerHis: 0.805 ± 0.323
SerIle: 3.221 ± 0.673
SerLys: 5.315 ± 0.678
SerLeu: 5.476 ± 0.9
SerMet: 0.966 ± 0.298
SerAsn: 3.382 ± 0.593
SerPro: 0.805 ± 0.387
SerGln: 1.127 ± 0.429
SerArg: 1.45 ± 0.409
SerSer: 3.865 ± 0.763
SerThr: 3.221 ± 0.686
SerVal: 3.865 ± 0.68
SerTrp: 0.966 ± 0.505
SerTyr: 2.577 ± 0.594
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.704 ± 0.765
ThrCys: 0.161 ± 0.154
ThrAsp: 4.993 ± 1.02
ThrGlu: 3.382 ± 0.863
ThrPhe: 2.899 ± 0.731
ThrGly: 6.12 ± 1.401
ThrHis: 0.644 ± 0.37
ThrIle: 4.187 ± 0.937
ThrLys: 7.248 ± 1.014
ThrLeu: 7.248 ± 0.841
ThrMet: 0.644 ± 0.313
ThrAsn: 2.899 ± 0.832
ThrPro: 2.416 ± 0.616
ThrGln: 2.416 ± 0.504
ThrArg: 1.127 ± 0.387
ThrSer: 3.543 ± 0.775
ThrThr: 2.738 ± 0.778
ThrVal: 3.704 ± 0.725
ThrTrp: 0.644 ± 0.322
ThrTyr: 2.577 ± 0.702
ThrXaa: 0.161 ± 0.163
Val
ValAla: 3.06 ± 0.541
ValCys: 0.322 ± 0.226
ValAsp: 4.187 ± 0.536
ValGlu: 4.026 ± 0.938
ValPhe: 2.094 ± 0.563
ValGly: 2.416 ± 0.683
ValHis: 0.966 ± 0.405
ValIle: 3.865 ± 0.83
ValLys: 4.832 ± 0.76
ValLeu: 3.704 ± 0.623
ValMet: 1.611 ± 0.53
ValAsn: 4.349 ± 0.939
ValPro: 1.933 ± 0.565
ValGln: 2.738 ± 0.657
ValArg: 3.382 ± 0.737
ValSer: 4.026 ± 0.878
ValThr: 3.704 ± 0.707
ValVal: 2.899 ± 0.67
ValTrp: 1.45 ± 0.454
ValTyr: 2.416 ± 0.662
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.611 ± 0.465
TrpCys: 0.483 ± 0.26
TrpAsp: 0.644 ± 0.29
TrpGlu: 0.322 ± 0.237
TrpPhe: 0.805 ± 0.305
TrpGly: 1.45 ± 0.448
TrpHis: 0.322 ± 0.206
TrpIle: 0.483 ± 0.279
TrpLys: 0.966 ± 0.404
TrpLeu: 2.094 ± 0.547
TrpMet: 0.161 ± 0.14
TrpAsn: 1.288 ± 0.577
TrpPro: 0.0 ± 0.0
TrpGln: 0.483 ± 0.264
TrpArg: 0.322 ± 0.235
TrpSer: 0.644 ± 0.318
TrpThr: 1.127 ± 0.416
TrpVal: 0.644 ± 0.337
TrpTrp: 0.322 ± 0.186
TrpTyr: 0.161 ± 0.154
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.704 ± 0.541
TyrCys: 0.805 ± 0.289
TyrAsp: 2.255 ± 0.51
TyrGlu: 2.738 ± 0.598
TyrPhe: 2.094 ± 0.65
TyrGly: 4.187 ± 1.038
TyrHis: 1.288 ± 0.516
TyrIle: 2.577 ± 0.759
TyrLys: 3.382 ± 0.684
TyrLeu: 2.094 ± 0.681
TyrMet: 1.288 ± 0.381
TyrAsn: 4.349 ± 0.765
TyrPro: 1.288 ± 0.535
TyrGln: 1.45 ± 0.422
TyrArg: 1.288 ± 0.452
TyrSer: 1.772 ± 0.519
TyrThr: 2.738 ± 0.638
TyrVal: 2.255 ± 0.449
TyrTrp: 0.644 ± 0.315
TyrTyr: 2.255 ± 0.765
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.161 ± 0.163
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 37 proteins (6210 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
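A minimal sketch of what protein-level bootstrapping could look like: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is taken as the error. The function names, the use of the standard deviation as the error measure, the fixed seed, and the toy sequences are all assumptions for illustration; the database's actual procedure may differ in these details.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(sequences, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in sequences:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome, not the phage data.
proteins = ["MKKLAE", "MAELKK", "MGGSAE", "MLLKAE"]
print(dipeptide_per_mille(proteins, "AE"), bootstrap_error(proteins, "AE"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.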

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski