Amino acid dipepetide frequency for Lactococcus phage P162

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.725AlaAla: 5.725 ± 1.354
0.229AlaCys: 0.229 ± 0.107
3.435AlaAsp: 3.435 ± 0.557
2.691AlaGlu: 2.691 ± 0.429
2.347AlaPhe: 2.347 ± 0.348
4.408AlaGly: 4.408 ± 1.077
0.687AlaHis: 0.687 ± 0.201
5.153AlaIle: 5.153 ± 0.919
5.267AlaLys: 5.267 ± 0.633
5.782AlaLeu: 5.782 ± 0.929
2.691AlaMet: 2.691 ± 0.572
3.492AlaAsn: 3.492 ± 0.505
2.004AlaPro: 2.004 ± 0.344
1.66AlaGln: 1.66 ± 0.473
2.576AlaArg: 2.576 ± 0.422
4.179AlaSer: 4.179 ± 0.559
4.809AlaThr: 4.809 ± 0.6
4.58AlaVal: 4.58 ± 0.56
0.859AlaTrp: 0.859 ± 0.238
2.519AlaTyr: 2.519 ± 0.399
0.0AlaXaa: 0.0 ± 0.0
Cys
0.172CysAla: 0.172 ± 0.086
0.057CysCys: 0.057 ± 0.064
0.286CysAsp: 0.286 ± 0.152
0.458CysGlu: 0.458 ± 0.159
0.229CysPhe: 0.229 ± 0.114
0.344CysGly: 0.344 ± 0.119
0.0CysHis: 0.0 ± 0.0
0.687CysIle: 0.687 ± 0.252
0.458CysLys: 0.458 ± 0.152
0.515CysLeu: 0.515 ± 0.144
0.057CysMet: 0.057 ± 0.053
0.057CysAsn: 0.057 ± 0.052
0.344CysPro: 0.344 ± 0.141
0.057CysGln: 0.057 ± 0.053
0.229CysArg: 0.229 ± 0.117
0.286CysSer: 0.286 ± 0.122
0.286CysThr: 0.286 ± 0.127
0.172CysVal: 0.172 ± 0.096
0.057CysTrp: 0.057 ± 0.045
0.515CysTyr: 0.515 ± 0.185
0.0CysXaa: 0.0 ± 0.0
Asp
3.893AspAla: 3.893 ± 0.41
0.229AspCys: 0.229 ± 0.095
4.466AspAsp: 4.466 ± 0.7
4.523AspGlu: 4.523 ± 0.547
2.92AspPhe: 2.92 ± 0.378
4.58AspGly: 4.58 ± 0.623
0.802AspHis: 0.802 ± 0.21
5.439AspIle: 5.439 ± 0.385
5.267AspLys: 5.267 ± 0.796
5.725AspLeu: 5.725 ± 0.687
2.118AspMet: 2.118 ± 0.424
4.408AspAsn: 4.408 ± 0.485
2.176AspPro: 2.176 ± 0.457
2.004AspGln: 2.004 ± 0.302
1.718AspArg: 1.718 ± 0.339
4.122AspSer: 4.122 ± 0.472
4.58AspThr: 4.58 ± 0.611
4.466AspVal: 4.466 ± 0.446
0.744AspTrp: 0.744 ± 0.186
3.55AspTyr: 3.55 ± 0.552
0.0AspXaa: 0.0 ± 0.0
Glu
3.893GluAla: 3.893 ± 0.509
0.401GluCys: 0.401 ± 0.196
4.695GluAsp: 4.695 ± 0.643
5.897GluGlu: 5.897 ± 0.831
2.29GluPhe: 2.29 ± 0.407
4.008GluGly: 4.008 ± 0.462
1.431GluHis: 1.431 ± 0.381
4.008GluIle: 4.008 ± 0.516
4.294GluLys: 4.294 ± 0.571
4.523GluLeu: 4.523 ± 0.5
1.832GluMet: 1.832 ± 0.388
3.55GluAsn: 3.55 ± 0.539
1.546GluPro: 1.546 ± 0.261
1.889GluGln: 1.889 ± 0.312
2.576GluArg: 2.576 ± 0.354
3.206GluSer: 3.206 ± 0.451
3.664GluThr: 3.664 ± 0.496
4.58GluVal: 4.58 ± 0.585
1.202GluTrp: 1.202 ± 0.273
3.55GluTyr: 3.55 ± 0.455
0.0GluXaa: 0.0 ± 0.0
Phe
1.66PheAla: 1.66 ± 0.354
0.286PheCys: 0.286 ± 0.122
4.008PheAsp: 4.008 ± 0.488
2.347PheGlu: 2.347 ± 0.402
1.317PhePhe: 1.317 ± 0.236
3.092PheGly: 3.092 ± 0.38
0.515PheHis: 0.515 ± 0.167
2.748PheIle: 2.748 ± 0.383
3.721PheLys: 3.721 ± 0.443
2.347PheLeu: 2.347 ± 0.596
0.63PheMet: 0.63 ± 0.179
3.206PheAsn: 3.206 ± 0.492
1.031PhePro: 1.031 ± 0.214
0.63PheGln: 0.63 ± 0.192
1.489PheArg: 1.489 ± 0.253
1.546PheSer: 1.546 ± 0.28
2.576PheThr: 2.576 ± 0.404
1.775PheVal: 1.775 ± 0.246
0.344PheTrp: 0.344 ± 0.122
1.603PheTyr: 1.603 ± 0.286
0.0PheXaa: 0.0 ± 0.0
Gly
4.294GlyAla: 4.294 ± 0.959
0.229GlyCys: 0.229 ± 0.105
3.607GlyAsp: 3.607 ± 0.562
3.492GlyGlu: 3.492 ± 0.484
2.29GlyPhe: 2.29 ± 0.428
3.378GlyGly: 3.378 ± 0.501
1.374GlyHis: 1.374 ± 0.251
5.095GlyIle: 5.095 ± 0.814
4.809GlyLys: 4.809 ± 0.656
5.21GlyLeu: 5.21 ± 1.064
1.603GlyMet: 1.603 ± 0.269
4.065GlyAsn: 4.065 ± 0.592
1.374GlyPro: 1.374 ± 0.24
1.947GlyGln: 1.947 ± 0.405
2.519GlyArg: 2.519 ± 0.421
4.981GlySer: 4.981 ± 0.628
4.58GlyThr: 4.58 ± 0.597
4.523GlyVal: 4.523 ± 0.433
0.916GlyTrp: 0.916 ± 0.175
3.779GlyTyr: 3.779 ± 0.481
0.0GlyXaa: 0.0 ± 0.0
His
0.573HisAla: 0.573 ± 0.159
0.115HisCys: 0.115 ± 0.079
1.088HisAsp: 1.088 ± 0.287
0.802HisGlu: 0.802 ± 0.219
0.401HisPhe: 0.401 ± 0.186
1.26HisGly: 1.26 ± 0.356
0.344HisHis: 0.344 ± 0.139
1.374HisIle: 1.374 ± 0.309
1.317HisLys: 1.317 ± 0.294
1.202HisLeu: 1.202 ± 0.302
0.401HisMet: 0.401 ± 0.165
1.031HisAsn: 1.031 ± 0.284
0.573HisPro: 0.573 ± 0.137
0.344HisGln: 0.344 ± 0.156
0.458HisArg: 0.458 ± 0.161
1.26HisSer: 1.26 ± 0.28
1.317HisThr: 1.317 ± 0.288
1.145HisVal: 1.145 ± 0.263
0.057HisTrp: 0.057 ± 0.045
0.973HisTyr: 0.973 ± 0.257
0.0HisXaa: 0.0 ± 0.0
Ile
5.439IleAla: 5.439 ± 0.706
0.458IleCys: 0.458 ± 0.184
6.527IleAsp: 6.527 ± 0.457
4.466IleGlu: 4.466 ± 0.506
1.718IlePhe: 1.718 ± 0.272
4.466IleGly: 4.466 ± 0.967
1.546IleHis: 1.546 ± 0.29
5.897IleIle: 5.897 ± 0.743
7.042IleLys: 7.042 ± 0.614
5.267IleLeu: 5.267 ± 0.686
2.29IleMet: 2.29 ± 0.419
3.95IleAsn: 3.95 ± 0.515
2.748IlePro: 2.748 ± 0.432
2.519IleGln: 2.519 ± 0.399
2.29IleArg: 2.29 ± 0.369
4.981IleSer: 4.981 ± 0.587
5.21IleThr: 5.21 ± 0.635
5.382IleVal: 5.382 ± 0.613
0.744IleTrp: 0.744 ± 0.243
2.805IleTyr: 2.805 ± 0.36
0.0IleXaa: 0.0 ± 0.0
Lys
4.695LysAla: 4.695 ± 0.644
0.401LysCys: 0.401 ± 0.196
5.439LysAsp: 5.439 ± 0.689
6.584LysGlu: 6.584 ± 0.791
3.664LysPhe: 3.664 ± 0.436
4.351LysGly: 4.351 ± 0.593
2.118LysHis: 2.118 ± 0.387
6.011LysIle: 6.011 ± 0.653
5.897LysLys: 5.897 ± 0.743
7.443LysLeu: 7.443 ± 0.547
2.977LysMet: 2.977 ± 0.414
4.008LysAsn: 4.008 ± 0.495
2.634LysPro: 2.634 ± 0.408
2.863LysGln: 2.863 ± 0.503
3.149LysArg: 3.149 ± 0.514
4.294LysSer: 4.294 ± 0.648
5.954LysThr: 5.954 ± 0.453
4.351LysVal: 4.351 ± 0.453
1.374LysTrp: 1.374 ± 0.26
3.378LysTyr: 3.378 ± 0.548
0.0LysXaa: 0.0 ± 0.0
Leu
6.698LeuAla: 6.698 ± 1.193
0.286LeuCys: 0.286 ± 0.132
4.523LeuAsp: 4.523 ± 0.446
5.153LeuGlu: 5.153 ± 0.61
2.92LeuPhe: 2.92 ± 0.446
5.267LeuGly: 5.267 ± 0.761
0.916LeuHis: 0.916 ± 0.212
5.84LeuIle: 5.84 ± 0.901
7.214LeuLys: 7.214 ± 0.814
5.496LeuLeu: 5.496 ± 0.583
1.718LeuMet: 1.718 ± 0.26
4.466LeuAsn: 4.466 ± 0.496
2.176LeuPro: 2.176 ± 0.365
3.034LeuGln: 3.034 ± 0.389
3.435LeuArg: 3.435 ± 0.451
6.87LeuSer: 6.87 ± 0.738
4.981LeuThr: 4.981 ± 0.533
4.466LeuVal: 4.466 ± 0.491
0.573LeuTrp: 0.573 ± 0.221
3.149LeuTyr: 3.149 ± 0.597
0.0LeuXaa: 0.0 ± 0.0
Met
2.233MetAla: 2.233 ± 0.511
0.229MetCys: 0.229 ± 0.111
2.004MetAsp: 2.004 ± 0.343
1.889MetGlu: 1.889 ± 0.374
0.973MetPhe: 0.973 ± 0.192
1.832MetGly: 1.832 ± 0.555
0.286MetHis: 0.286 ± 0.125
1.832MetIle: 1.832 ± 0.34
3.55MetLys: 3.55 ± 0.535
2.004MetLeu: 2.004 ± 0.371
0.573MetMet: 0.573 ± 0.22
2.061MetAsn: 2.061 ± 0.362
0.859MetPro: 0.859 ± 0.221
0.802MetGln: 0.802 ± 0.195
1.031MetArg: 1.031 ± 0.267
2.061MetSer: 2.061 ± 0.281
1.431MetThr: 1.431 ± 0.274
2.118MetVal: 2.118 ± 0.317
0.229MetTrp: 0.229 ± 0.107
0.859MetTyr: 0.859 ± 0.234
0.0MetXaa: 0.0 ± 0.0
Asn
3.664AsnAla: 3.664 ± 0.393
0.344AsnCys: 0.344 ± 0.17
3.893AsnAsp: 3.893 ± 0.628
3.55AsnGlu: 3.55 ± 0.517
2.576AsnPhe: 2.576 ± 0.388
5.153AsnGly: 5.153 ± 0.562
0.63AsnHis: 0.63 ± 0.219
5.954AsnIle: 5.954 ± 0.452
6.126AsnLys: 6.126 ± 0.603
4.637AsnLeu: 4.637 ± 0.399
2.061AsnMet: 2.061 ± 0.286
3.55AsnAsn: 3.55 ± 0.642
2.691AsnPro: 2.691 ± 0.425
2.462AsnGln: 2.462 ± 0.371
2.176AsnArg: 2.176 ± 0.446
3.321AsnSer: 3.321 ± 0.641
3.263AsnThr: 3.263 ± 0.505
3.779AsnVal: 3.779 ± 0.573
0.687AsnTrp: 0.687 ± 0.229
2.176AsnTyr: 2.176 ± 0.394
0.0AsnXaa: 0.0 ± 0.0
Pro
1.718ProAla: 1.718 ± 0.335
0.115ProCys: 0.115 ± 0.085
2.118ProAsp: 2.118 ± 0.355
2.176ProGlu: 2.176 ± 0.441
1.317ProPhe: 1.317 ± 0.256
1.431ProGly: 1.431 ± 0.279
0.515ProHis: 0.515 ± 0.16
2.233ProIle: 2.233 ± 0.303
2.863ProLys: 2.863 ± 0.508
2.805ProLeu: 2.805 ± 0.488
1.088ProMet: 1.088 ± 0.165
2.405ProAsn: 2.405 ± 0.402
0.802ProPro: 0.802 ± 0.242
1.088ProGln: 1.088 ± 0.26
1.145ProArg: 1.145 ± 0.3
2.748ProSer: 2.748 ± 0.414
2.519ProThr: 2.519 ± 0.471
2.347ProVal: 2.347 ± 0.361
0.515ProTrp: 0.515 ± 0.17
1.374ProTyr: 1.374 ± 0.284
0.0ProXaa: 0.0 ± 0.0
Gln
2.233GlnAla: 2.233 ± 0.39
0.057GlnCys: 0.057 ± 0.053
2.004GlnAsp: 2.004 ± 0.422
1.489GlnGlu: 1.489 ± 0.298
1.031GlnPhe: 1.031 ± 0.198
2.176GlnGly: 2.176 ± 0.448
0.458GlnHis: 0.458 ± 0.157
1.832GlnIle: 1.832 ± 0.357
2.118GlnLys: 2.118 ± 0.387
2.691GlnLeu: 2.691 ± 0.394
0.916GlnMet: 0.916 ± 0.246
1.889GlnAsn: 1.889 ± 0.376
1.202GlnPro: 1.202 ± 0.3
1.603GlnGln: 1.603 ± 0.332
1.031GlnArg: 1.031 ± 0.271
2.347GlnSer: 2.347 ± 0.328
2.405GlnThr: 2.405 ± 0.317
1.832GlnVal: 1.832 ± 0.369
0.458GlnTrp: 0.458 ± 0.112
1.832GlnTyr: 1.832 ± 0.333
0.0GlnXaa: 0.0 ± 0.0
Arg
2.004ArgAla: 2.004 ± 0.339
0.229ArgCys: 0.229 ± 0.125
2.405ArgAsp: 2.405 ± 0.374
2.233ArgGlu: 2.233 ± 0.373
1.718ArgPhe: 1.718 ± 0.267
1.775ArgGly: 1.775 ± 0.263
0.63ArgHis: 0.63 ± 0.174
3.378ArgIle: 3.378 ± 0.432
3.263ArgLys: 3.263 ± 0.416
3.034ArgLeu: 3.034 ± 0.524
1.202ArgMet: 1.202 ± 0.269
2.347ArgAsn: 2.347 ± 0.457
1.431ArgPro: 1.431 ± 0.249
1.603ArgGln: 1.603 ± 0.299
1.374ArgArg: 1.374 ± 0.41
2.462ArgSer: 2.462 ± 0.322
2.347ArgThr: 2.347 ± 0.37
2.004ArgVal: 2.004 ± 0.336
0.401ArgTrp: 0.401 ± 0.127
1.317ArgTyr: 1.317 ± 0.305
0.0ArgXaa: 0.0 ± 0.0
Ser
3.092SerAla: 3.092 ± 0.534
0.172SerCys: 0.172 ± 0.086
4.122SerAsp: 4.122 ± 0.408
3.721SerGlu: 3.721 ± 0.396
2.405SerPhe: 2.405 ± 0.465
5.038SerGly: 5.038 ± 0.514
0.859SerHis: 0.859 ± 0.218
5.038SerIle: 5.038 ± 0.62
5.267SerLys: 5.267 ± 0.72
5.21SerLeu: 5.21 ± 0.612
1.603SerMet: 1.603 ± 0.333
4.58SerAsn: 4.58 ± 0.436
2.233SerPro: 2.233 ± 0.331
2.004SerGln: 2.004 ± 0.354
2.176SerArg: 2.176 ± 0.337
3.836SerSer: 3.836 ± 0.546
4.637SerThr: 4.637 ± 0.553
3.893SerVal: 3.893 ± 0.554
1.374SerTrp: 1.374 ± 0.399
2.977SerTyr: 2.977 ± 0.524
0.0SerXaa: 0.0 ± 0.0
Thr
5.21ThrAla: 5.21 ± 0.677
0.802ThrCys: 0.802 ± 0.216
3.836ThrAsp: 3.836 ± 0.4
4.58ThrGlu: 4.58 ± 0.479
2.863ThrPhe: 2.863 ± 0.412
4.466ThrGly: 4.466 ± 0.519
0.973ThrHis: 0.973 ± 0.208
5.095ThrIle: 5.095 ± 0.601
4.351ThrLys: 4.351 ± 0.583
5.324ThrLeu: 5.324 ± 0.422
1.775ThrMet: 1.775 ± 0.257
4.008ThrAsn: 4.008 ± 0.521
2.977ThrPro: 2.977 ± 0.407
1.603ThrGln: 1.603 ± 0.368
2.462ThrArg: 2.462 ± 0.406
3.893ThrSer: 3.893 ± 0.508
4.466ThrThr: 4.466 ± 0.622
5.267ThrVal: 5.267 ± 0.575
0.63ThrTrp: 0.63 ± 0.17
2.634ThrTyr: 2.634 ± 0.463
0.0ThrXaa: 0.0 ± 0.0
Val
4.122ValAla: 4.122 ± 0.633
0.172ValCys: 0.172 ± 0.099
4.866ValAsp: 4.866 ± 0.508
3.664ValGlu: 3.664 ± 0.51
1.832ValPhe: 1.832 ± 0.282
4.122ValGly: 4.122 ± 0.437
0.973ValHis: 0.973 ± 0.231
4.637ValIle: 4.637 ± 0.505
4.924ValLys: 4.924 ± 0.55
4.695ValLeu: 4.695 ± 0.656
1.66ValMet: 1.66 ± 0.278
4.695ValAsn: 4.695 ± 0.504
2.462ValPro: 2.462 ± 0.388
1.889ValGln: 1.889 ± 0.32
2.748ValArg: 2.748 ± 0.403
3.779ValSer: 3.779 ± 0.393
5.324ValThr: 5.324 ± 0.703
4.351ValVal: 4.351 ± 0.401
0.458ValTrp: 0.458 ± 0.141
2.29ValTyr: 2.29 ± 0.423
0.0ValXaa: 0.0 ± 0.0
Trp
0.973TrpAla: 0.973 ± 0.212
0.0TrpCys: 0.0 ± 0.0
1.317TrpAsp: 1.317 ± 0.274
0.63TrpGlu: 0.63 ± 0.196
0.515TrpPhe: 0.515 ± 0.182
0.573TrpGly: 0.573 ± 0.192
0.229TrpHis: 0.229 ± 0.135
0.573TrpIle: 0.573 ± 0.185
0.401TrpLys: 0.401 ± 0.157
1.317TrpLeu: 1.317 ± 0.22
0.401TrpMet: 0.401 ± 0.179
0.916TrpAsn: 0.916 ± 0.256
0.344TrpPro: 0.344 ± 0.168
0.286TrpGln: 0.286 ± 0.114
0.344TrpArg: 0.344 ± 0.124
1.145TrpSer: 1.145 ± 0.245
0.802TrpThr: 0.802 ± 0.207
0.573TrpVal: 0.573 ± 0.239
0.172TrpTrp: 0.172 ± 0.107
0.687TrpTyr: 0.687 ± 0.233
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.805TyrAla: 2.805 ± 0.455
0.458TyrCys: 0.458 ± 0.177
3.092TyrAsp: 3.092 ± 0.609
2.691TyrGlu: 2.691 ± 0.391
1.546TyrPhe: 1.546 ± 0.254
2.347TyrGly: 2.347 ± 0.374
0.744TyrHis: 0.744 ± 0.243
2.92TyrIle: 2.92 ± 0.535
3.492TyrLys: 3.492 ± 0.515
3.95TyrLeu: 3.95 ± 0.572
1.202TyrMet: 1.202 ± 0.228
4.008TyrAsn: 4.008 ± 0.68
1.718TyrPro: 1.718 ± 0.337
1.317TyrGln: 1.317 ± 0.329
2.233TyrArg: 2.233 ± 0.39
2.92TyrSer: 2.92 ± 0.506
2.061TyrThr: 2.061 ± 0.397
2.061TyrVal: 2.061 ± 0.313
0.401TyrTrp: 0.401 ± 0.141
2.519TyrTyr: 2.519 ± 0.48
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 82 proteins (17468 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski