Amino acid dipepetide frequency for Lactococcus phage PLgW-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.664AlaAla: 0.664 ± 0.358
0.221AlaCys: 0.221 ± 0.143
3.097AlaAsp: 3.097 ± 0.506
4.978AlaGlu: 4.978 ± 1.301
2.765AlaPhe: 2.765 ± 0.613
3.319AlaGly: 3.319 ± 1.023
0.996AlaHis: 0.996 ± 0.377
6.637AlaIle: 6.637 ± 1.244
4.646AlaLys: 4.646 ± 0.725
6.084AlaLeu: 6.084 ± 0.9
2.434AlaMet: 2.434 ± 0.937
3.429AlaAsn: 3.429 ± 0.518
0.996AlaPro: 0.996 ± 0.34
2.544AlaGln: 2.544 ± 0.501
3.65AlaArg: 3.65 ± 0.551
5.863AlaSer: 5.863 ± 1.205
3.54AlaThr: 3.54 ± 0.904
5.31AlaVal: 5.31 ± 0.923
1.327AlaTrp: 1.327 ± 0.496
2.212AlaTyr: 2.212 ± 0.521
0.0AlaXaa: 0.0 ± 0.0
Cys
0.332CysAla: 0.332 ± 0.237
0.111CysCys: 0.111 ± 0.124
0.442CysAsp: 0.442 ± 0.203
0.442CysGlu: 0.442 ± 0.206
0.442CysPhe: 0.442 ± 0.226
0.332CysGly: 0.332 ± 0.201
0.111CysHis: 0.111 ± 0.111
0.332CysIle: 0.332 ± 0.19
0.774CysLys: 0.774 ± 0.328
0.221CysLeu: 0.221 ± 0.159
0.0CysMet: 0.0 ± 0.0
0.332CysAsn: 0.332 ± 0.182
0.111CysPro: 0.111 ± 0.096
0.221CysGln: 0.221 ± 0.157
0.221CysArg: 0.221 ± 0.16
0.664CysSer: 0.664 ± 0.321
0.111CysThr: 0.111 ± 0.124
0.774CysVal: 0.774 ± 0.319
0.0CysTrp: 0.0 ± 0.0
0.332CysTyr: 0.332 ± 0.181
0.0CysXaa: 0.0 ± 0.0
Asp
2.876AspAla: 2.876 ± 0.564
0.221AspCys: 0.221 ± 0.16
3.208AspAsp: 3.208 ± 0.742
4.535AspGlu: 4.535 ± 0.91
3.208AspPhe: 3.208 ± 0.598
2.434AspGly: 2.434 ± 0.509
0.553AspHis: 0.553 ± 0.257
4.646AspIle: 4.646 ± 0.522
4.204AspLys: 4.204 ± 0.622
5.31AspLeu: 5.31 ± 0.567
1.659AspMet: 1.659 ± 0.354
4.978AspAsn: 4.978 ± 0.692
1.549AspPro: 1.549 ± 0.487
0.885AspGln: 0.885 ± 0.315
1.991AspArg: 1.991 ± 0.588
2.765AspSer: 2.765 ± 0.529
4.425AspThr: 4.425 ± 0.556
3.872AspVal: 3.872 ± 0.592
1.106AspTrp: 1.106 ± 0.356
2.212AspTyr: 2.212 ± 0.579
0.0AspXaa: 0.0 ± 0.0
Glu
4.204GluAla: 4.204 ± 0.857
0.332GluCys: 0.332 ± 0.216
4.425GluAsp: 4.425 ± 1.027
5.088GluGlu: 5.088 ± 0.947
3.761GluPhe: 3.761 ± 0.636
3.65GluGly: 3.65 ± 0.527
0.553GluHis: 0.553 ± 0.226
5.973GluIle: 5.973 ± 0.717
6.969GluLys: 6.969 ± 1.156
6.969GluLeu: 6.969 ± 1.092
2.544GluMet: 2.544 ± 0.417
4.978GluAsn: 4.978 ± 0.633
1.991GluPro: 1.991 ± 0.568
4.535GluGln: 4.535 ± 0.739
2.434GluArg: 2.434 ± 0.641
3.097GluSer: 3.097 ± 0.648
4.314GluThr: 4.314 ± 0.71
5.31GluVal: 5.31 ± 0.713
1.217GluTrp: 1.217 ± 0.489
3.761GluTyr: 3.761 ± 0.785
0.0GluXaa: 0.0 ± 0.0
Phe
2.212PheAla: 2.212 ± 0.688
0.332PheCys: 0.332 ± 0.189
2.323PheAsp: 2.323 ± 0.48
3.54PheGlu: 3.54 ± 0.565
2.655PhePhe: 2.655 ± 1.048
2.987PheGly: 2.987 ± 0.65
0.111PheHis: 0.111 ± 0.101
3.54PheIle: 3.54 ± 0.724
3.982PheLys: 3.982 ± 0.548
2.434PheLeu: 2.434 ± 0.46
1.438PheMet: 1.438 ± 0.425
2.876PheAsn: 2.876 ± 0.839
1.217PhePro: 1.217 ± 0.382
1.549PheGln: 1.549 ± 0.427
1.106PheArg: 1.106 ± 0.29
3.54PheSer: 3.54 ± 0.996
3.319PheThr: 3.319 ± 0.654
2.987PheVal: 2.987 ± 0.482
0.553PheTrp: 0.553 ± 0.235
1.549PheTyr: 1.549 ± 0.406
0.0PheXaa: 0.0 ± 0.0
Gly
3.65GlyAla: 3.65 ± 0.747
0.774GlyCys: 0.774 ± 0.239
2.987GlyAsp: 2.987 ± 0.558
4.757GlyGlu: 4.757 ± 0.759
2.102GlyPhe: 2.102 ± 0.55
3.761GlyGly: 3.761 ± 0.678
0.332GlyHis: 0.332 ± 0.195
4.314GlyIle: 4.314 ± 1.612
5.088GlyLys: 5.088 ± 0.833
6.195GlyLeu: 6.195 ± 0.83
1.217GlyMet: 1.217 ± 0.367
2.987GlyAsn: 2.987 ± 0.458
0.111GlyPro: 0.111 ± 0.111
2.102GlyGln: 2.102 ± 0.611
2.434GlyArg: 2.434 ± 0.507
3.982GlySer: 3.982 ± 0.567
3.872GlyThr: 3.872 ± 0.613
3.65GlyVal: 3.65 ± 0.89
0.996GlyTrp: 0.996 ± 0.507
3.65GlyTyr: 3.65 ± 0.679
0.0GlyXaa: 0.0 ± 0.0
His
0.774HisAla: 0.774 ± 0.249
0.221HisCys: 0.221 ± 0.148
0.664HisAsp: 0.664 ± 0.256
0.664HisGlu: 0.664 ± 0.271
0.332HisPhe: 0.332 ± 0.15
1.327HisGly: 1.327 ± 0.421
0.442HisHis: 0.442 ± 0.263
0.996HisIle: 0.996 ± 0.399
1.217HisLys: 1.217 ± 0.399
0.885HisLeu: 0.885 ± 0.313
0.442HisMet: 0.442 ± 0.215
0.332HisAsn: 0.332 ± 0.2
0.332HisPro: 0.332 ± 0.178
0.332HisGln: 0.332 ± 0.182
0.442HisArg: 0.442 ± 0.202
0.221HisSer: 0.221 ± 0.153
0.553HisThr: 0.553 ± 0.214
1.217HisVal: 1.217 ± 0.498
0.111HisTrp: 0.111 ± 0.084
0.332HisTyr: 0.332 ± 0.183
0.0HisXaa: 0.0 ± 0.0
Ile
5.199IleAla: 5.199 ± 0.925
0.221IleCys: 0.221 ± 0.156
5.531IleAsp: 5.531 ± 0.771
7.743IleGlu: 7.743 ± 0.883
2.765IlePhe: 2.765 ± 0.551
4.535IleGly: 4.535 ± 1.052
0.774IleHis: 0.774 ± 0.341
3.982IleIle: 3.982 ± 0.863
6.195IleLys: 6.195 ± 0.996
4.204IleLeu: 4.204 ± 0.861
1.77IleMet: 1.77 ± 0.387
3.54IleAsn: 3.54 ± 0.519
1.327IlePro: 1.327 ± 0.357
2.765IleGln: 2.765 ± 0.574
1.881IleArg: 1.881 ± 0.452
4.093IleSer: 4.093 ± 0.771
5.088IleThr: 5.088 ± 0.753
5.973IleVal: 5.973 ± 1.06
2.212IleTrp: 2.212 ± 1.173
2.987IleTyr: 2.987 ± 0.575
0.0IleXaa: 0.0 ± 0.0
Lys
7.08LysAla: 7.08 ± 0.938
0.332LysCys: 0.332 ± 0.178
4.646LysAsp: 4.646 ± 0.795
7.743LysGlu: 7.743 ± 1.256
2.544LysPhe: 2.544 ± 0.49
5.088LysGly: 5.088 ± 0.688
0.996LysHis: 0.996 ± 0.36
5.752LysIle: 5.752 ± 0.873
8.96LysLys: 8.96 ± 1.37
7.19LysLeu: 7.19 ± 0.964
2.544LysMet: 2.544 ± 0.532
4.867LysAsn: 4.867 ± 0.739
2.323LysPro: 2.323 ± 0.679
3.54LysGln: 3.54 ± 0.61
4.204LysArg: 4.204 ± 0.731
4.646LysSer: 4.646 ± 0.735
5.863LysThr: 5.863 ± 0.764
4.867LysVal: 4.867 ± 0.882
1.549LysTrp: 1.549 ± 0.353
2.987LysTyr: 2.987 ± 0.671
0.0LysXaa: 0.0 ± 0.0
Leu
4.757LeuAla: 4.757 ± 0.622
0.442LeuCys: 0.442 ± 0.228
4.757LeuAsp: 4.757 ± 0.609
5.863LeuGlu: 5.863 ± 0.815
2.987LeuPhe: 2.987 ± 0.711
4.093LeuGly: 4.093 ± 0.586
1.327LeuHis: 1.327 ± 0.366
5.42LeuIle: 5.42 ± 1.015
8.186LeuLys: 8.186 ± 1.159
5.088LeuLeu: 5.088 ± 0.841
2.102LeuMet: 2.102 ± 0.501
4.978LeuAsn: 4.978 ± 0.706
2.544LeuPro: 2.544 ± 0.589
2.765LeuGln: 2.765 ± 0.608
1.659LeuArg: 1.659 ± 0.341
5.199LeuSer: 5.199 ± 0.833
5.642LeuThr: 5.642 ± 0.787
6.637LeuVal: 6.637 ± 0.876
1.217LeuTrp: 1.217 ± 0.368
3.208LeuTyr: 3.208 ± 0.718
0.0LeuXaa: 0.0 ± 0.0
Met
2.655MetAla: 2.655 ± 0.476
0.332MetCys: 0.332 ± 0.151
1.106MetAsp: 1.106 ± 0.352
2.102MetGlu: 2.102 ± 0.434
0.885MetPhe: 0.885 ± 0.387
1.327MetGly: 1.327 ± 0.351
0.0MetHis: 0.0 ± 0.0
2.655MetIle: 2.655 ± 0.569
3.208MetLys: 3.208 ± 0.612
2.102MetLeu: 2.102 ± 0.514
0.332MetMet: 0.332 ± 0.197
2.212MetAsn: 2.212 ± 0.445
0.442MetPro: 0.442 ± 0.193
0.885MetGln: 0.885 ± 0.361
1.438MetArg: 1.438 ± 0.383
1.327MetSer: 1.327 ± 0.428
3.097MetThr: 3.097 ± 0.426
1.659MetVal: 1.659 ± 0.485
0.442MetTrp: 0.442 ± 0.263
0.664MetTyr: 0.664 ± 0.292
0.0MetXaa: 0.0 ± 0.0
Asn
4.646AsnAla: 4.646 ± 0.961
0.221AsnCys: 0.221 ± 0.158
2.987AsnAsp: 2.987 ± 0.575
5.642AsnGlu: 5.642 ± 0.825
3.097AsnPhe: 3.097 ± 0.534
4.204AsnGly: 4.204 ± 0.598
0.553AsnHis: 0.553 ± 0.274
4.757AsnIle: 4.757 ± 0.795
5.42AsnLys: 5.42 ± 0.807
4.646AsnLeu: 4.646 ± 0.612
1.438AsnMet: 1.438 ± 0.274
3.54AsnAsn: 3.54 ± 0.63
2.434AsnPro: 2.434 ± 0.461
2.434AsnGln: 2.434 ± 0.535
1.217AsnArg: 1.217 ± 0.424
3.65AsnSer: 3.65 ± 0.504
3.097AsnThr: 3.097 ± 0.769
2.987AsnVal: 2.987 ± 0.552
1.106AsnTrp: 1.106 ± 0.316
2.987AsnTyr: 2.987 ± 0.759
0.0AsnXaa: 0.0 ± 0.0
Pro
1.881ProAla: 1.881 ± 0.439
0.221ProCys: 0.221 ± 0.172
1.438ProAsp: 1.438 ± 0.595
2.434ProGlu: 2.434 ± 0.461
1.77ProPhe: 1.77 ± 0.443
0.442ProGly: 0.442 ± 0.203
0.0ProHis: 0.0 ± 0.0
1.549ProIle: 1.549 ± 0.433
2.212ProLys: 2.212 ± 0.702
2.434ProLeu: 2.434 ± 0.449
0.996ProMet: 0.996 ± 0.314
0.996ProAsn: 0.996 ± 0.305
0.442ProPro: 0.442 ± 0.211
0.885ProGln: 0.885 ± 0.33
0.774ProArg: 0.774 ± 0.288
1.881ProSer: 1.881 ± 0.497
2.212ProThr: 2.212 ± 0.52
1.77ProVal: 1.77 ± 0.404
0.332ProTrp: 0.332 ± 0.187
0.332ProTyr: 0.332 ± 0.178
0.0ProXaa: 0.0 ± 0.0
Gln
2.876GlnAla: 2.876 ± 0.54
0.111GlnCys: 0.111 ± 0.096
1.659GlnAsp: 1.659 ± 0.442
2.655GlnGlu: 2.655 ± 0.537
1.881GlnPhe: 1.881 ± 0.341
2.212GlnGly: 2.212 ± 0.425
0.221GlnHis: 0.221 ± 0.177
1.77GlnIle: 1.77 ± 0.438
3.761GlnLys: 3.761 ± 0.854
3.872GlnLeu: 3.872 ± 0.502
1.438GlnMet: 1.438 ± 0.56
1.106GlnAsn: 1.106 ± 0.337
1.327GlnPro: 1.327 ± 0.411
1.106GlnGln: 1.106 ± 0.376
1.881GlnArg: 1.881 ± 0.434
2.765GlnSer: 2.765 ± 0.418
2.323GlnThr: 2.323 ± 0.54
1.991GlnVal: 1.991 ± 0.498
0.442GlnTrp: 0.442 ± 0.224
1.881GlnTyr: 1.881 ± 0.472
0.0GlnXaa: 0.0 ± 0.0
Arg
2.987ArgAla: 2.987 ± 0.537
0.111ArgCys: 0.111 ± 0.1
2.544ArgAsp: 2.544 ± 0.643
2.544ArgGlu: 2.544 ± 0.581
0.774ArgPhe: 0.774 ± 0.26
1.77ArgGly: 1.77 ± 0.365
1.438ArgHis: 1.438 ± 0.418
1.659ArgIle: 1.659 ± 0.485
3.54ArgLys: 3.54 ± 0.649
2.434ArgLeu: 2.434 ± 0.577
0.885ArgMet: 0.885 ± 0.256
2.323ArgAsn: 2.323 ± 0.543
0.996ArgPro: 0.996 ± 0.334
1.659ArgGln: 1.659 ± 0.494
1.881ArgArg: 1.881 ± 0.512
1.77ArgSer: 1.77 ± 0.447
1.659ArgThr: 1.659 ± 0.463
2.434ArgVal: 2.434 ± 0.526
0.442ArgTrp: 0.442 ± 0.217
2.987ArgTyr: 2.987 ± 0.767
0.0ArgXaa: 0.0 ± 0.0
Ser
4.204SerAla: 4.204 ± 1.259
0.442SerCys: 0.442 ± 0.231
2.987SerAsp: 2.987 ± 0.486
3.982SerGlu: 3.982 ± 0.498
3.429SerPhe: 3.429 ± 0.604
5.531SerGly: 5.531 ± 1.063
0.664SerHis: 0.664 ± 0.228
3.982SerIle: 3.982 ± 0.689
4.757SerLys: 4.757 ± 0.839
4.978SerLeu: 4.978 ± 0.696
2.323SerMet: 2.323 ± 0.481
3.65SerAsn: 3.65 ± 0.715
1.659SerPro: 1.659 ± 0.297
2.212SerGln: 2.212 ± 0.493
1.881SerArg: 1.881 ± 0.399
3.872SerSer: 3.872 ± 0.738
3.54SerThr: 3.54 ± 0.685
4.978SerVal: 4.978 ± 0.59
0.664SerTrp: 0.664 ± 0.227
2.323SerTyr: 2.323 ± 0.697
0.0SerXaa: 0.0 ± 0.0
Thr
5.863ThrAla: 5.863 ± 1.038
0.442ThrCys: 0.442 ± 0.225
4.757ThrAsp: 4.757 ± 0.69
3.208ThrGlu: 3.208 ± 0.59
2.765ThrPhe: 2.765 ± 0.562
5.31ThrGly: 5.31 ± 0.841
0.774ThrHis: 0.774 ± 0.3
5.199ThrIle: 5.199 ± 0.745
4.646ThrLys: 4.646 ± 0.748
4.867ThrLeu: 4.867 ± 0.849
1.77ThrMet: 1.77 ± 0.371
3.761ThrAsn: 3.761 ± 0.609
1.659ThrPro: 1.659 ± 0.493
2.655ThrGln: 2.655 ± 0.461
2.434ThrArg: 2.434 ± 0.616
4.535ThrSer: 4.535 ± 0.788
3.982ThrThr: 3.982 ± 0.953
4.204ThrVal: 4.204 ± 0.798
0.996ThrTrp: 0.996 ± 0.349
2.212ThrTyr: 2.212 ± 0.564
0.0ThrXaa: 0.0 ± 0.0
Val
4.425ValAla: 4.425 ± 0.814
0.553ValCys: 0.553 ± 0.274
3.761ValAsp: 3.761 ± 0.696
4.314ValGlu: 4.314 ± 0.683
3.319ValPhe: 3.319 ± 0.926
3.097ValGly: 3.097 ± 0.509
0.774ValHis: 0.774 ± 0.261
5.199ValIle: 5.199 ± 0.805
5.752ValLys: 5.752 ± 0.868
4.093ValLeu: 4.093 ± 0.569
1.77ValMet: 1.77 ± 0.411
4.867ValAsn: 4.867 ± 0.674
2.102ValPro: 2.102 ± 0.477
2.102ValGln: 2.102 ± 0.532
3.097ValArg: 3.097 ± 0.544
5.088ValSer: 5.088 ± 0.882
5.752ValThr: 5.752 ± 0.898
4.204ValVal: 4.204 ± 0.591
0.664ValTrp: 0.664 ± 0.317
2.655ValTyr: 2.655 ± 0.563
0.0ValXaa: 0.0 ± 0.0
Trp
0.996TrpAla: 0.996 ± 0.288
0.332TrpCys: 0.332 ± 0.189
0.885TrpAsp: 0.885 ± 0.392
0.664TrpGlu: 0.664 ± 0.291
1.217TrpPhe: 1.217 ± 0.557
0.553TrpGly: 0.553 ± 0.219
0.221TrpHis: 0.221 ± 0.161
0.664TrpIle: 0.664 ± 0.262
0.885TrpLys: 0.885 ± 0.244
1.549TrpLeu: 1.549 ± 0.392
0.442TrpMet: 0.442 ± 0.199
2.212TrpAsn: 2.212 ± 0.929
0.0TrpPro: 0.0 ± 0.0
0.885TrpGln: 0.885 ± 0.356
0.664TrpArg: 0.664 ± 0.348
1.106TrpSer: 1.106 ± 0.33
0.664TrpThr: 0.664 ± 0.282
1.217TrpVal: 1.217 ± 0.285
0.442TrpTrp: 0.442 ± 0.17
0.332TrpTyr: 0.332 ± 0.193
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.434TyrAla: 2.434 ± 0.481
0.332TyrCys: 0.332 ± 0.181
2.434TyrAsp: 2.434 ± 0.733
3.208TyrGlu: 3.208 ± 0.748
1.549TyrPhe: 1.549 ± 0.436
2.987TyrGly: 2.987 ± 0.628
0.996TyrHis: 0.996 ± 0.329
3.872TyrIle: 3.872 ± 0.866
3.319TyrLys: 3.319 ± 0.717
3.319TyrLeu: 3.319 ± 0.69
1.217TyrMet: 1.217 ± 0.403
3.097TyrAsn: 3.097 ± 0.623
1.438TyrPro: 1.438 ± 0.453
1.217TyrGln: 1.217 ± 0.469
1.549TyrArg: 1.549 ± 0.445
1.991TyrSer: 1.991 ± 0.5
2.987TyrThr: 2.987 ± 0.565
1.549TyrVal: 1.549 ± 0.429
0.111TyrTrp: 0.111 ± 0.111
1.549TyrTyr: 1.549 ± 0.442
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 56 proteins (9041 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski