Amino acid dipepetide frequency for Lactococcus phage CaseusJM1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.526AlaAla: 0.526 ± 0.27
0.105AlaCys: 0.105 ± 0.106
3.365AlaAsp: 3.365 ± 0.487
4.206AlaGlu: 4.206 ± 0.735
3.89AlaPhe: 3.89 ± 0.742
4.311AlaGly: 4.311 ± 0.934
0.841AlaHis: 0.841 ± 0.217
4.311AlaIle: 4.311 ± 0.732
5.993AlaLys: 5.993 ± 0.954
6.624AlaLeu: 6.624 ± 0.958
1.998AlaMet: 1.998 ± 0.592
4.521AlaAsn: 4.521 ± 1.004
0.841AlaPro: 0.841 ± 0.267
2.208AlaGln: 2.208 ± 0.525
2.523AlaArg: 2.523 ± 0.475
3.365AlaSer: 3.365 ± 0.868
4.311AlaThr: 4.311 ± 0.973
4.416AlaVal: 4.416 ± 1.015
1.682AlaTrp: 1.682 ± 0.742
1.577AlaTyr: 1.577 ± 0.345
0.0AlaXaa: 0.0 ± 0.0
Cys
0.105CysAla: 0.105 ± 0.091
0.105CysCys: 0.105 ± 0.105
0.315CysAsp: 0.315 ± 0.188
0.21CysGlu: 0.21 ± 0.132
0.315CysPhe: 0.315 ± 0.175
0.736CysGly: 0.736 ± 0.386
0.21CysHis: 0.21 ± 0.183
0.421CysIle: 0.421 ± 0.207
0.736CysLys: 0.736 ± 0.365
0.315CysLeu: 0.315 ± 0.178
0.0CysMet: 0.0 ± 0.0
0.631CysAsn: 0.631 ± 0.26
0.21CysPro: 0.21 ± 0.125
0.21CysGln: 0.21 ± 0.131
0.736CysArg: 0.736 ± 0.293
0.315CysSer: 0.315 ± 0.162
0.105CysThr: 0.105 ± 0.109
0.315CysVal: 0.315 ± 0.214
0.105CysTrp: 0.105 ± 0.101
0.21CysTyr: 0.21 ± 0.16
0.0CysXaa: 0.0 ± 0.0
Asp
2.103AspAla: 2.103 ± 0.474
0.105AspCys: 0.105 ± 0.108
3.47AspAsp: 3.47 ± 0.743
3.785AspGlu: 3.785 ± 0.87
3.785AspPhe: 3.785 ± 0.56
3.68AspGly: 3.68 ± 0.505
1.051AspHis: 1.051 ± 0.427
4.731AspIle: 4.731 ± 0.971
5.362AspLys: 5.362 ± 0.691
6.939AspLeu: 6.939 ± 0.928
1.682AspMet: 1.682 ± 0.37
3.89AspAsn: 3.89 ± 0.665
1.472AspPro: 1.472 ± 0.41
0.421AspGln: 0.421 ± 0.244
1.893AspArg: 1.893 ± 0.433
3.259AspSer: 3.259 ± 0.538
4.416AspThr: 4.416 ± 0.691
2.944AspVal: 2.944 ± 0.555
0.946AspTrp: 0.946 ± 0.309
3.049AspTyr: 3.049 ± 0.597
0.0AspXaa: 0.0 ± 0.0
Glu
3.575GluAla: 3.575 ± 0.471
0.526GluCys: 0.526 ± 0.215
3.049GluAsp: 3.049 ± 0.71
5.152GluGlu: 5.152 ± 0.905
3.89GluPhe: 3.89 ± 0.695
2.629GluGly: 2.629 ± 0.532
0.946GluHis: 0.946 ± 0.317
6.098GluIle: 6.098 ± 0.861
5.993GluLys: 5.993 ± 1.173
9.778GluLeu: 9.778 ± 1.373
2.418GluMet: 2.418 ± 0.448
5.152GluAsn: 5.152 ± 0.678
1.157GluPro: 1.157 ± 0.405
3.995GluGln: 3.995 ± 0.867
2.944GluArg: 2.944 ± 0.629
4.311GluSer: 4.311 ± 0.576
5.257GluThr: 5.257 ± 0.801
4.731GluVal: 4.731 ± 0.687
0.736GluTrp: 0.736 ± 0.257
2.839GluTyr: 2.839 ± 0.659
0.0GluXaa: 0.0 ± 0.0
Phe
2.839PheAla: 2.839 ± 0.715
0.315PheCys: 0.315 ± 0.186
3.575PheAsp: 3.575 ± 0.56
2.944PheGlu: 2.944 ± 0.596
1.682PhePhe: 1.682 ± 0.659
2.208PheGly: 2.208 ± 0.443
0.841PheHis: 0.841 ± 0.365
2.734PheIle: 2.734 ± 0.505
3.575PheLys: 3.575 ± 0.595
2.313PheLeu: 2.313 ± 0.468
1.051PheMet: 1.051 ± 0.31
3.575PheAsn: 3.575 ± 0.711
0.946PhePro: 0.946 ± 0.375
1.262PheGln: 1.262 ± 0.304
1.367PheArg: 1.367 ± 0.325
4.101PheSer: 4.101 ± 0.782
2.208PheThr: 2.208 ± 0.444
2.523PheVal: 2.523 ± 0.428
0.21PheTrp: 0.21 ± 0.143
1.682PheTyr: 1.682 ± 0.386
0.0PheXaa: 0.0 ± 0.0
Gly
3.785GlyAla: 3.785 ± 1.16
0.315GlyCys: 0.315 ± 0.159
3.365GlyAsp: 3.365 ± 0.581
3.89GlyGlu: 3.89 ± 0.604
1.998GlyPhe: 1.998 ± 0.433
3.995GlyGly: 3.995 ± 0.835
1.157GlyHis: 1.157 ± 0.365
3.995GlyIle: 3.995 ± 1.202
6.729GlyLys: 6.729 ± 0.824
5.783GlyLeu: 5.783 ± 0.935
1.367GlyMet: 1.367 ± 0.437
3.47GlyAsn: 3.47 ± 0.414
0.315GlyPro: 0.315 ± 0.253
2.103GlyGln: 2.103 ± 0.436
1.682GlyArg: 1.682 ± 0.354
4.942GlySer: 4.942 ± 1.045
3.365GlyThr: 3.365 ± 0.771
6.203GlyVal: 6.203 ± 0.985
1.157GlyTrp: 1.157 ± 0.379
3.049GlyTyr: 3.049 ± 0.55
0.0GlyXaa: 0.0 ± 0.0
His
1.051HisAla: 1.051 ± 0.384
0.421HisCys: 0.421 ± 0.218
0.841HisAsp: 0.841 ± 0.313
0.736HisGlu: 0.736 ± 0.245
0.421HisPhe: 0.421 ± 0.275
1.577HisGly: 1.577 ± 0.462
0.315HisHis: 0.315 ± 0.33
1.157HisIle: 1.157 ± 0.371
0.526HisLys: 0.526 ± 0.206
0.736HisLeu: 0.736 ± 0.268
0.0HisMet: 0.0 ± 0.0
1.472HisAsn: 1.472 ± 0.41
0.315HisPro: 0.315 ± 0.177
0.315HisGln: 0.315 ± 0.178
0.315HisArg: 0.315 ± 0.193
0.21HisSer: 0.21 ± 0.131
1.367HisThr: 1.367 ± 0.445
0.631HisVal: 0.631 ± 0.335
0.105HisTrp: 0.105 ± 0.11
0.526HisTyr: 0.526 ± 0.278
0.0HisXaa: 0.0 ± 0.0
Ile
4.942IleAla: 4.942 ± 0.644
0.21IleCys: 0.21 ± 0.137
3.995IleAsp: 3.995 ± 0.566
7.36IleGlu: 7.36 ± 1.152
2.734IlePhe: 2.734 ± 0.536
3.995IleGly: 3.995 ± 0.759
0.736IleHis: 0.736 ± 0.283
5.783IleIle: 5.783 ± 0.654
6.308IleLys: 6.308 ± 0.809
5.993IleLeu: 5.993 ± 1.092
1.472IleMet: 1.472 ± 0.332
5.783IleAsn: 5.783 ± 0.598
1.682IlePro: 1.682 ± 0.35
2.313IleGln: 2.313 ± 0.466
1.577IleArg: 1.577 ± 0.401
3.995IleSer: 3.995 ± 0.78
4.837IleThr: 4.837 ± 0.511
4.837IleVal: 4.837 ± 0.808
1.157IleTrp: 1.157 ± 0.374
2.523IleTyr: 2.523 ± 0.503
0.0IleXaa: 0.0 ± 0.0
Lys
7.15LysAla: 7.15 ± 0.917
0.631LysCys: 0.631 ± 0.311
4.942LysAsp: 4.942 ± 0.74
8.411LysGlu: 8.411 ± 1.642
2.523LysPhe: 2.523 ± 0.547
6.414LysGly: 6.414 ± 0.986
1.051LysHis: 1.051 ± 0.414
5.467LysIle: 5.467 ± 0.807
8.832LysLys: 8.832 ± 1.092
7.36LysLeu: 7.36 ± 0.841
3.47LysMet: 3.47 ± 0.485
4.731LysAsn: 4.731 ± 0.737
1.682LysPro: 1.682 ± 0.48
3.47LysGln: 3.47 ± 0.717
3.68LysArg: 3.68 ± 0.721
4.731LysSer: 4.731 ± 0.863
5.572LysThr: 5.572 ± 0.639
5.783LysVal: 5.783 ± 0.808
1.262LysTrp: 1.262 ± 0.36
3.785LysTyr: 3.785 ± 0.745
0.0LysXaa: 0.0 ± 0.0
Leu
5.047LeuAla: 5.047 ± 0.612
0.421LeuCys: 0.421 ± 0.237
5.152LeuAsp: 5.152 ± 0.966
6.519LeuGlu: 6.519 ± 0.742
3.049LeuPhe: 3.049 ± 0.541
4.416LeuGly: 4.416 ± 0.726
1.157LeuHis: 1.157 ± 0.365
6.834LeuIle: 6.834 ± 0.975
9.252LeuLys: 9.252 ± 0.89
6.098LeuLeu: 6.098 ± 1.042
1.262LeuMet: 1.262 ± 0.362
5.152LeuAsn: 5.152 ± 0.882
2.944LeuPro: 2.944 ± 0.524
3.049LeuGln: 3.049 ± 0.451
2.734LeuArg: 2.734 ± 0.476
5.047LeuSer: 5.047 ± 0.698
5.783LeuThr: 5.783 ± 0.727
5.993LeuVal: 5.993 ± 0.726
1.262LeuTrp: 1.262 ± 0.422
4.206LeuTyr: 4.206 ± 0.707
0.0LeuXaa: 0.0 ± 0.0
Met
2.629MetAla: 2.629 ± 0.439
0.105MetCys: 0.105 ± 0.112
1.472MetAsp: 1.472 ± 0.506
1.577MetGlu: 1.577 ± 0.499
0.526MetPhe: 0.526 ± 0.278
1.051MetGly: 1.051 ± 0.296
0.21MetHis: 0.21 ± 0.164
2.103MetIle: 2.103 ± 0.511
2.418MetLys: 2.418 ± 0.606
1.262MetLeu: 1.262 ± 0.359
0.421MetMet: 0.421 ± 0.19
2.103MetAsn: 2.103 ± 0.525
0.736MetPro: 0.736 ± 0.3
1.682MetGln: 1.682 ± 0.325
0.21MetArg: 0.21 ± 0.14
1.577MetSer: 1.577 ± 0.361
1.893MetThr: 1.893 ± 0.499
1.577MetVal: 1.577 ± 0.345
0.21MetTrp: 0.21 ± 0.145
1.157MetTyr: 1.157 ± 0.376
0.0MetXaa: 0.0 ± 0.0
Asn
4.626AsnAla: 4.626 ± 1.016
0.315AsnCys: 0.315 ± 0.195
4.311AsnAsp: 4.311 ± 0.659
5.362AsnGlu: 5.362 ± 0.837
2.418AsnPhe: 2.418 ± 0.589
5.783AsnGly: 5.783 ± 0.724
0.841AsnHis: 0.841 ± 0.28
4.942AsnIle: 4.942 ± 0.654
6.519AsnLys: 6.519 ± 1.124
5.783AsnLeu: 5.783 ± 0.873
1.682AsnMet: 1.682 ± 0.426
3.89AsnAsn: 3.89 ± 0.588
1.787AsnPro: 1.787 ± 0.388
2.103AsnGln: 2.103 ± 0.426
2.313AsnArg: 2.313 ± 0.396
4.521AsnSer: 4.521 ± 0.728
4.206AsnThr: 4.206 ± 0.734
3.47AsnVal: 3.47 ± 0.642
1.262AsnTrp: 1.262 ± 0.356
2.208AsnTyr: 2.208 ± 0.506
0.0AsnXaa: 0.0 ± 0.0
Pro
1.577ProAla: 1.577 ± 0.367
0.21ProCys: 0.21 ± 0.152
1.893ProAsp: 1.893 ± 0.579
1.472ProGlu: 1.472 ± 0.397
0.841ProPhe: 0.841 ± 0.26
0.21ProGly: 0.21 ± 0.137
0.315ProHis: 0.315 ± 0.187
1.998ProIle: 1.998 ± 0.524
2.418ProLys: 2.418 ± 0.633
1.787ProLeu: 1.787 ± 0.431
0.526ProMet: 0.526 ± 0.223
1.998ProAsn: 1.998 ± 0.665
0.631ProPro: 0.631 ± 0.293
0.526ProGln: 0.526 ± 0.284
0.421ProArg: 0.421 ± 0.201
0.946ProSer: 0.946 ± 0.368
1.787ProThr: 1.787 ± 0.366
1.472ProVal: 1.472 ± 0.496
0.21ProTrp: 0.21 ± 0.15
0.526ProTyr: 0.526 ± 0.253
0.0ProXaa: 0.0 ± 0.0
Gln
3.259GlnAla: 3.259 ± 0.665
0.105GlnCys: 0.105 ± 0.113
2.313GlnAsp: 2.313 ± 0.503
2.629GlnGlu: 2.629 ± 0.521
1.051GlnPhe: 1.051 ± 0.349
2.839GlnGly: 2.839 ± 0.473
0.315GlnHis: 0.315 ± 0.176
1.787GlnIle: 1.787 ± 0.355
2.523GlnLys: 2.523 ± 0.601
2.734GlnLeu: 2.734 ± 0.512
0.841GlnMet: 0.841 ± 0.252
2.313GlnAsn: 2.313 ± 0.436
1.051GlnPro: 1.051 ± 0.342
1.472GlnGln: 1.472 ± 0.475
1.577GlnArg: 1.577 ± 0.448
2.418GlnSer: 2.418 ± 0.523
2.523GlnThr: 2.523 ± 0.485
1.787GlnVal: 1.787 ± 0.465
0.631GlnTrp: 0.631 ± 0.217
1.051GlnTyr: 1.051 ± 0.322
0.0GlnXaa: 0.0 ± 0.0
Arg
2.734ArgAla: 2.734 ± 0.509
0.421ArgCys: 0.421 ± 0.23
1.787ArgAsp: 1.787 ± 0.414
2.103ArgGlu: 2.103 ± 0.429
1.262ArgPhe: 1.262 ± 0.405
1.998ArgGly: 1.998 ± 0.42
0.736ArgHis: 0.736 ± 0.299
1.893ArgIle: 1.893 ± 0.372
4.206ArgLys: 4.206 ± 0.86
3.47ArgLeu: 3.47 ± 0.545
0.736ArgMet: 0.736 ± 0.318
2.103ArgAsn: 2.103 ± 0.559
0.526ArgPro: 0.526 ± 0.231
1.367ArgGln: 1.367 ± 0.308
1.577ArgArg: 1.577 ± 0.365
1.682ArgSer: 1.682 ± 0.363
2.103ArgThr: 2.103 ± 0.405
1.367ArgVal: 1.367 ± 0.429
0.21ArgTrp: 0.21 ± 0.131
2.103ArgTyr: 2.103 ± 0.487
0.0ArgXaa: 0.0 ± 0.0
Ser
4.837SerAla: 4.837 ± 1.234
0.421SerCys: 0.421 ± 0.246
3.995SerAsp: 3.995 ± 0.792
4.206SerGlu: 4.206 ± 0.742
3.049SerPhe: 3.049 ± 0.63
5.362SerGly: 5.362 ± 1.181
0.526SerHis: 0.526 ± 0.221
4.521SerIle: 4.521 ± 0.75
5.362SerLys: 5.362 ± 0.733
5.152SerLeu: 5.152 ± 1.087
1.787SerMet: 1.787 ± 0.371
4.206SerAsn: 4.206 ± 0.809
0.946SerPro: 0.946 ± 0.313
2.313SerGln: 2.313 ± 0.585
2.418SerArg: 2.418 ± 0.391
5.362SerSer: 5.362 ± 1.006
3.154SerThr: 3.154 ± 0.942
3.995SerVal: 3.995 ± 0.668
0.736SerTrp: 0.736 ± 0.316
2.103SerTyr: 2.103 ± 0.527
0.0SerXaa: 0.0 ± 0.0
Thr
5.047ThrAla: 5.047 ± 0.818
0.21ThrCys: 0.21 ± 0.171
3.89ThrAsp: 3.89 ± 0.774
5.993ThrGlu: 5.993 ± 0.711
2.418ThrPhe: 2.418 ± 0.514
3.995ThrGly: 3.995 ± 0.612
0.105ThrHis: 0.105 ± 0.113
4.626ThrIle: 4.626 ± 0.997
4.942ThrLys: 4.942 ± 0.734
5.362ThrLeu: 5.362 ± 0.566
1.157ThrMet: 1.157 ± 0.329
4.521ThrAsn: 4.521 ± 0.601
1.577ThrPro: 1.577 ± 0.288
2.629ThrGln: 2.629 ± 0.55
2.313ThrArg: 2.313 ± 0.403
4.731ThrSer: 4.731 ± 0.784
3.785ThrThr: 3.785 ± 0.625
4.521ThrVal: 4.521 ± 0.684
1.051ThrTrp: 1.051 ± 0.335
1.893ThrTyr: 1.893 ± 0.446
0.0ThrXaa: 0.0 ± 0.0
Val
3.365ValAla: 3.365 ± 0.615
0.736ValCys: 0.736 ± 0.277
4.101ValAsp: 4.101 ± 0.764
4.521ValGlu: 4.521 ± 0.533
3.049ValPhe: 3.049 ± 0.682
3.575ValGly: 3.575 ± 0.549
0.421ValHis: 0.421 ± 0.199
4.521ValIle: 4.521 ± 0.587
5.783ValLys: 5.783 ± 0.926
3.575ValLeu: 3.575 ± 0.504
1.998ValMet: 1.998 ± 0.374
3.68ValAsn: 3.68 ± 0.533
1.577ValPro: 1.577 ± 0.437
1.998ValGln: 1.998 ± 0.507
2.523ValArg: 2.523 ± 0.689
5.888ValSer: 5.888 ± 1.381
5.047ValThr: 5.047 ± 0.832
4.101ValVal: 4.101 ± 0.761
0.421ValTrp: 0.421 ± 0.194
2.734ValTyr: 2.734 ± 0.463
0.0ValXaa: 0.0 ± 0.0
Trp
0.736TrpAla: 0.736 ± 0.212
0.21TrpCys: 0.21 ± 0.154
0.841TrpAsp: 0.841 ± 0.398
0.736TrpGlu: 0.736 ± 0.288
0.841TrpPhe: 0.841 ± 0.306
0.946TrpGly: 0.946 ± 0.322
0.105TrpHis: 0.105 ± 0.106
0.946TrpIle: 0.946 ± 0.331
1.051TrpLys: 1.051 ± 0.297
0.946TrpLeu: 0.946 ± 0.307
0.315TrpMet: 0.315 ± 0.172
1.682TrpAsn: 1.682 ± 0.502
0.105TrpPro: 0.105 ± 0.105
0.736TrpGln: 0.736 ± 0.269
0.315TrpArg: 0.315 ± 0.232
0.841TrpSer: 0.841 ± 0.243
0.421TrpThr: 0.421 ± 0.194
0.736TrpVal: 0.736 ± 0.293
0.21TrpTrp: 0.21 ± 0.147
1.051TrpTyr: 1.051 ± 0.335
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.682TyrAla: 1.682 ± 0.524
0.421TyrCys: 0.421 ± 0.293
2.313TyrAsp: 2.313 ± 0.635
3.365TyrGlu: 3.365 ± 0.709
2.208TyrPhe: 2.208 ± 0.433
2.944TyrGly: 2.944 ± 0.583
1.157TyrHis: 1.157 ± 0.368
3.365TyrIle: 3.365 ± 0.696
2.418TyrLys: 2.418 ± 0.613
3.47TyrLeu: 3.47 ± 0.877
0.631TyrMet: 0.631 ± 0.263
3.47TyrAsn: 3.47 ± 0.646
1.262TyrPro: 1.262 ± 0.375
1.051TyrGln: 1.051 ± 0.451
1.262TyrArg: 1.262 ± 0.382
2.208TyrSer: 2.208 ± 0.537
2.629TyrThr: 2.629 ± 0.699
2.208TyrVal: 2.208 ± 0.478
0.21TyrTrp: 0.21 ± 0.152
2.208TyrTyr: 2.208 ± 0.538
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (9512 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski