Amino acid dipepetide frequency for Lactococcus phage P078

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.247AlaAla: 6.247 ± 1.565
0.177AlaCys: 0.177 ± 0.102
3.123AlaAsp: 3.123 ± 0.552
2.947AlaGlu: 2.947 ± 0.488
2.239AlaPhe: 2.239 ± 0.379
4.479AlaGly: 4.479 ± 1.256
0.766AlaHis: 0.766 ± 0.214
5.127AlaIle: 5.127 ± 0.917
5.54AlaLys: 5.54 ± 0.875
5.716AlaLeu: 5.716 ± 1.024
2.534AlaMet: 2.534 ± 0.588
3.359AlaAsn: 3.359 ± 0.469
2.063AlaPro: 2.063 ± 0.362
2.063AlaGln: 2.063 ± 0.463
2.534AlaArg: 2.534 ± 0.466
4.361AlaSer: 4.361 ± 0.643
4.538AlaThr: 4.538 ± 0.68
4.538AlaVal: 4.538 ± 0.552
0.943AlaTrp: 0.943 ± 0.257
2.652AlaTyr: 2.652 ± 0.437
0.0AlaXaa: 0.0 ± 0.0
Cys
0.177CysAla: 0.177 ± 0.093
0.059CysCys: 0.059 ± 0.053
0.354CysAsp: 0.354 ± 0.174
0.413CysGlu: 0.413 ± 0.137
0.177CysPhe: 0.177 ± 0.088
0.354CysGly: 0.354 ± 0.141
0.118CysHis: 0.118 ± 0.09
0.825CysIle: 0.825 ± 0.283
0.589CysLys: 0.589 ± 0.193
0.471CysLeu: 0.471 ± 0.162
0.059CysMet: 0.059 ± 0.055
0.177CysAsn: 0.177 ± 0.101
0.354CysPro: 0.354 ± 0.142
0.059CysGln: 0.059 ± 0.056
0.177CysArg: 0.177 ± 0.097
0.295CysSer: 0.295 ± 0.143
0.177CysThr: 0.177 ± 0.113
0.177CysVal: 0.177 ± 0.105
0.059CysTrp: 0.059 ± 0.048
0.471CysTyr: 0.471 ± 0.173
0.0CysXaa: 0.0 ± 0.0
Asp
3.772AspAla: 3.772 ± 0.442
0.354AspCys: 0.354 ± 0.137
4.479AspAsp: 4.479 ± 0.728
4.656AspGlu: 4.656 ± 0.584
3.005AspPhe: 3.005 ± 0.387
4.42AspGly: 4.42 ± 0.54
0.884AspHis: 0.884 ± 0.185
5.54AspIle: 5.54 ± 0.546
5.009AspLys: 5.009 ± 0.798
5.952AspLeu: 5.952 ± 0.724
2.18AspMet: 2.18 ± 0.351
4.302AspAsn: 4.302 ± 0.439
2.122AspPro: 2.122 ± 0.377
2.004AspGln: 2.004 ± 0.31
1.945AspArg: 1.945 ± 0.397
4.302AspSer: 4.302 ± 0.562
4.656AspThr: 4.656 ± 0.672
4.184AspVal: 4.184 ± 0.545
0.766AspTrp: 0.766 ± 0.207
3.654AspTyr: 3.654 ± 0.574
0.0AspXaa: 0.0 ± 0.0
Glu
3.889GluAla: 3.889 ± 0.495
0.354GluCys: 0.354 ± 0.18
4.538GluAsp: 4.538 ± 0.632
5.952GluGlu: 5.952 ± 0.819
2.004GluPhe: 2.004 ± 0.298
3.713GluGly: 3.713 ± 0.475
1.414GluHis: 1.414 ± 0.338
3.948GluIle: 3.948 ± 0.578
4.066GluLys: 4.066 ± 0.686
4.656GluLeu: 4.656 ± 0.603
1.945GluMet: 1.945 ± 0.414
3.123GluAsn: 3.123 ± 0.475
1.355GluPro: 1.355 ± 0.277
2.122GluGln: 2.122 ± 0.384
2.593GluArg: 2.593 ± 0.404
3.359GluSer: 3.359 ± 0.438
3.536GluThr: 3.536 ± 0.453
4.597GluVal: 4.597 ± 0.615
1.061GluTrp: 1.061 ± 0.23
3.772GluTyr: 3.772 ± 0.566
0.0GluXaa: 0.0 ± 0.0
Phe
1.827PheAla: 1.827 ± 0.278
0.354PheCys: 0.354 ± 0.15
3.654PheAsp: 3.654 ± 0.445
2.239PheGlu: 2.239 ± 0.426
1.532PhePhe: 1.532 ± 0.311
3.123PheGly: 3.123 ± 0.411
0.53PheHis: 0.53 ± 0.187
2.652PheIle: 2.652 ± 0.355
3.536PheLys: 3.536 ± 0.465
2.298PheLeu: 2.298 ± 0.437
0.648PheMet: 0.648 ± 0.214
3.241PheAsn: 3.241 ± 0.549
0.943PhePro: 0.943 ± 0.208
0.707PheGln: 0.707 ± 0.21
1.65PheArg: 1.65 ± 0.282
1.709PheSer: 1.709 ± 0.388
2.829PheThr: 2.829 ± 0.55
1.827PheVal: 1.827 ± 0.286
0.354PheTrp: 0.354 ± 0.132
1.591PheTyr: 1.591 ± 0.324
0.0PheXaa: 0.0 ± 0.0
Gly
4.361GlyAla: 4.361 ± 1.178
0.236GlyCys: 0.236 ± 0.124
3.595GlyAsp: 3.595 ± 0.522
3.241GlyGlu: 3.241 ± 0.461
2.593GlyPhe: 2.593 ± 0.493
3.359GlyGly: 3.359 ± 0.578
1.296GlyHis: 1.296 ± 0.288
5.068GlyIle: 5.068 ± 0.691
4.714GlyLys: 4.714 ± 0.699
5.304GlyLeu: 5.304 ± 1.177
1.591GlyMet: 1.591 ± 0.341
4.125GlyAsn: 4.125 ± 0.556
1.414GlyPro: 1.414 ± 0.246
2.004GlyGln: 2.004 ± 0.404
2.475GlyArg: 2.475 ± 0.43
4.832GlySer: 4.832 ± 0.809
4.656GlyThr: 4.656 ± 0.559
4.479GlyVal: 4.479 ± 0.456
0.943GlyTrp: 0.943 ± 0.2
3.654GlyTyr: 3.654 ± 0.459
0.0GlyXaa: 0.0 ± 0.0
His
0.471HisAla: 0.471 ± 0.13
0.118HisCys: 0.118 ± 0.08
1.238HisAsp: 1.238 ± 0.277
0.825HisGlu: 0.825 ± 0.215
0.236HisPhe: 0.236 ± 0.135
1.532HisGly: 1.532 ± 0.339
0.236HisHis: 0.236 ± 0.115
1.179HisIle: 1.179 ± 0.281
1.414HisLys: 1.414 ± 0.352
1.355HisLeu: 1.355 ± 0.376
0.53HisMet: 0.53 ± 0.156
1.061HisAsn: 1.061 ± 0.29
0.648HisPro: 0.648 ± 0.173
0.354HisGln: 0.354 ± 0.118
0.471HisArg: 0.471 ± 0.175
1.414HisSer: 1.414 ± 0.272
1.12HisThr: 1.12 ± 0.237
1.12HisVal: 1.12 ± 0.232
0.059HisTrp: 0.059 ± 0.048
0.943HisTyr: 0.943 ± 0.255
0.0HisXaa: 0.0 ± 0.0
Ile
5.363IleAla: 5.363 ± 0.741
0.413IleCys: 0.413 ± 0.175
6.718IleAsp: 6.718 ± 0.487
4.538IleGlu: 4.538 ± 0.559
1.532IlePhe: 1.532 ± 0.267
4.538IleGly: 4.538 ± 1.119
1.473IleHis: 1.473 ± 0.232
5.893IleIle: 5.893 ± 0.698
6.836IleLys: 6.836 ± 0.465
4.95IleLeu: 4.95 ± 0.69
2.063IleMet: 2.063 ± 0.331
3.831IleAsn: 3.831 ± 0.507
2.711IlePro: 2.711 ± 0.422
2.534IleGln: 2.534 ± 0.357
2.18IleArg: 2.18 ± 0.34
4.95IleSer: 4.95 ± 0.603
5.481IleThr: 5.481 ± 0.539
5.245IleVal: 5.245 ± 0.463
0.648IleTrp: 0.648 ± 0.265
2.829IleTyr: 2.829 ± 0.425
0.0IleXaa: 0.0 ± 0.0
Lys
4.538LysAla: 4.538 ± 0.599
0.471LysCys: 0.471 ± 0.23
5.716LysAsp: 5.716 ± 0.633
6.365LysGlu: 6.365 ± 1.059
3.713LysPhe: 3.713 ± 0.458
4.773LysGly: 4.773 ± 0.653
1.65LysHis: 1.65 ± 0.45
5.893LysIle: 5.893 ± 0.686
6.011LysLys: 6.011 ± 0.971
6.836LysLeu: 6.836 ± 0.574
3.123LysMet: 3.123 ± 0.44
4.007LysAsn: 4.007 ± 0.561
2.416LysPro: 2.416 ± 0.471
2.888LysGln: 2.888 ± 0.494
3.182LysArg: 3.182 ± 0.512
4.066LysSer: 4.066 ± 0.466
5.598LysThr: 5.598 ± 0.469
4.714LysVal: 4.714 ± 0.494
1.414LysTrp: 1.414 ± 0.285
3.3LysTyr: 3.3 ± 0.582
0.0LysXaa: 0.0 ± 0.0
Leu
6.836LeuAla: 6.836 ± 1.242
0.354LeuCys: 0.354 ± 0.159
4.597LeuAsp: 4.597 ± 0.493
4.832LeuGlu: 4.832 ± 0.667
2.947LeuPhe: 2.947 ± 0.403
5.54LeuGly: 5.54 ± 0.899
0.884LeuHis: 0.884 ± 0.218
5.775LeuIle: 5.775 ± 0.885
6.836LeuLys: 6.836 ± 0.837
5.304LeuLeu: 5.304 ± 0.581
1.532LeuMet: 1.532 ± 0.259
4.832LeuAsn: 4.832 ± 0.554
2.18LeuPro: 2.18 ± 0.343
3.182LeuGln: 3.182 ± 0.452
3.3LeuArg: 3.3 ± 0.423
6.659LeuSer: 6.659 ± 0.761
5.422LeuThr: 5.422 ± 0.507
4.656LeuVal: 4.656 ± 0.541
0.53LeuTrp: 0.53 ± 0.197
3.123LeuTyr: 3.123 ± 0.592
0.0LeuXaa: 0.0 ± 0.0
Met
2.18MetAla: 2.18 ± 0.607
0.177MetCys: 0.177 ± 0.102
2.239MetAsp: 2.239 ± 0.284
1.768MetGlu: 1.768 ± 0.38
1.061MetPhe: 1.061 ± 0.232
1.709MetGly: 1.709 ± 0.468
0.236MetHis: 0.236 ± 0.107
1.945MetIle: 1.945 ± 0.323
3.713MetLys: 3.713 ± 0.473
2.004MetLeu: 2.004 ± 0.39
0.589MetMet: 0.589 ± 0.222
1.886MetAsn: 1.886 ± 0.378
0.766MetPro: 0.766 ± 0.21
1.002MetGln: 1.002 ± 0.213
1.061MetArg: 1.061 ± 0.28
2.063MetSer: 2.063 ± 0.317
1.532MetThr: 1.532 ± 0.268
1.709MetVal: 1.709 ± 0.289
0.177MetTrp: 0.177 ± 0.09
0.943MetTyr: 0.943 ± 0.229
0.0MetXaa: 0.0 ± 0.0
Asn
3.772AsnAla: 3.772 ± 0.46
0.413AsnCys: 0.413 ± 0.187
3.948AsnAsp: 3.948 ± 0.593
3.005AsnGlu: 3.005 ± 0.504
2.534AsnPhe: 2.534 ± 0.334
5.127AsnGly: 5.127 ± 0.599
0.825AsnHis: 0.825 ± 0.247
5.598AsnIle: 5.598 ± 0.529
6.07AsnLys: 6.07 ± 0.889
4.714AsnLeu: 4.714 ± 0.395
1.945AsnMet: 1.945 ± 0.307
3.831AsnAsn: 3.831 ± 0.563
2.77AsnPro: 2.77 ± 0.487
2.357AsnGln: 2.357 ± 0.322
2.357AsnArg: 2.357 ± 0.47
3.182AsnSer: 3.182 ± 0.427
3.595AsnThr: 3.595 ± 0.578
3.654AsnVal: 3.654 ± 0.519
0.648AsnTrp: 0.648 ± 0.184
2.004AsnTyr: 2.004 ± 0.356
0.0AsnXaa: 0.0 ± 0.0
Pro
1.827ProAla: 1.827 ± 0.362
0.118ProCys: 0.118 ± 0.079
2.239ProAsp: 2.239 ± 0.429
2.18ProGlu: 2.18 ± 0.371
1.473ProPhe: 1.473 ± 0.303
1.65ProGly: 1.65 ± 0.269
0.354ProHis: 0.354 ± 0.134
2.239ProIle: 2.239 ± 0.305
2.77ProLys: 2.77 ± 0.567
2.829ProLeu: 2.829 ± 0.47
1.002ProMet: 1.002 ± 0.186
2.475ProAsn: 2.475 ± 0.44
0.766ProPro: 0.766 ± 0.224
1.179ProGln: 1.179 ± 0.262
1.179ProArg: 1.179 ± 0.264
2.593ProSer: 2.593 ± 0.377
2.357ProThr: 2.357 ± 0.376
1.945ProVal: 1.945 ± 0.343
0.53ProTrp: 0.53 ± 0.197
1.355ProTyr: 1.355 ± 0.306
0.0ProXaa: 0.0 ± 0.0
Gln
2.122GlnAla: 2.122 ± 0.379
0.059GlnCys: 0.059 ± 0.06
2.122GlnAsp: 2.122 ± 0.446
1.65GlnGlu: 1.65 ± 0.287
1.296GlnPhe: 1.296 ± 0.213
2.298GlnGly: 2.298 ± 0.426
0.53GlnHis: 0.53 ± 0.173
1.709GlnIle: 1.709 ± 0.374
2.122GlnLys: 2.122 ± 0.375
2.947GlnLeu: 2.947 ± 0.534
1.061GlnMet: 1.061 ± 0.316
1.768GlnAsn: 1.768 ± 0.35
1.238GlnPro: 1.238 ± 0.256
1.768GlnGln: 1.768 ± 0.301
1.179GlnArg: 1.179 ± 0.268
2.416GlnSer: 2.416 ± 0.385
2.593GlnThr: 2.593 ± 0.42
2.122GlnVal: 2.122 ± 0.445
0.354GlnTrp: 0.354 ± 0.131
1.886GlnTyr: 1.886 ± 0.345
0.0GlnXaa: 0.0 ± 0.0
Arg
2.298ArgAla: 2.298 ± 0.494
0.236ArgCys: 0.236 ± 0.138
2.534ArgAsp: 2.534 ± 0.395
2.122ArgGlu: 2.122 ± 0.398
1.532ArgPhe: 1.532 ± 0.302
1.414ArgGly: 1.414 ± 0.241
0.648ArgHis: 0.648 ± 0.213
3.182ArgIle: 3.182 ± 0.436
3.359ArgLys: 3.359 ± 0.497
3.123ArgLeu: 3.123 ± 0.537
1.238ArgMet: 1.238 ± 0.307
2.298ArgAsn: 2.298 ± 0.445
1.473ArgPro: 1.473 ± 0.277
1.591ArgGln: 1.591 ± 0.338
1.532ArgArg: 1.532 ± 0.423
2.416ArgSer: 2.416 ± 0.343
2.357ArgThr: 2.357 ± 0.484
2.122ArgVal: 2.122 ± 0.342
0.413ArgTrp: 0.413 ± 0.138
1.296ArgTyr: 1.296 ± 0.321
0.0ArgXaa: 0.0 ± 0.0
Ser
3.713SerAla: 3.713 ± 0.732
0.177SerCys: 0.177 ± 0.097
4.007SerAsp: 4.007 ± 0.385
3.831SerGlu: 3.831 ± 0.413
2.534SerPhe: 2.534 ± 0.431
4.773SerGly: 4.773 ± 0.572
1.061SerHis: 1.061 ± 0.266
4.95SerIle: 4.95 ± 0.712
5.481SerLys: 5.481 ± 0.611
5.186SerLeu: 5.186 ± 0.665
1.591SerMet: 1.591 ± 0.373
4.42SerAsn: 4.42 ± 0.44
2.239SerPro: 2.239 ± 0.355
1.945SerGln: 1.945 ± 0.323
2.239SerArg: 2.239 ± 0.304
3.948SerSer: 3.948 ± 0.651
4.538SerThr: 4.538 ± 0.52
3.831SerVal: 3.831 ± 0.502
1.238SerTrp: 1.238 ± 0.379
2.947SerTyr: 2.947 ± 0.477
0.0SerXaa: 0.0 ± 0.0
Thr
4.95ThrAla: 4.95 ± 0.588
0.766ThrCys: 0.766 ± 0.258
4.007ThrAsp: 4.007 ± 0.388
4.538ThrGlu: 4.538 ± 0.627
2.829ThrPhe: 2.829 ± 0.417
4.42ThrGly: 4.42 ± 0.452
1.238ThrHis: 1.238 ± 0.286
5.068ThrIle: 5.068 ± 0.767
4.125ThrLys: 4.125 ± 0.535
5.422ThrLeu: 5.422 ± 0.468
1.65ThrMet: 1.65 ± 0.269
3.889ThrAsn: 3.889 ± 0.453
3.123ThrPro: 3.123 ± 0.44
1.827ThrGln: 1.827 ± 0.385
2.239ThrArg: 2.239 ± 0.319
4.007ThrSer: 4.007 ± 0.597
4.832ThrThr: 4.832 ± 0.593
5.363ThrVal: 5.363 ± 0.697
0.766ThrTrp: 0.766 ± 0.19
2.298ThrTyr: 2.298 ± 0.394
0.0ThrXaa: 0.0 ± 0.0
Val
3.713ValAla: 3.713 ± 0.635
0.236ValCys: 0.236 ± 0.128
4.891ValAsp: 4.891 ± 0.503
3.654ValGlu: 3.654 ± 0.543
1.945ValPhe: 1.945 ± 0.279
3.948ValGly: 3.948 ± 0.506
1.12ValHis: 1.12 ± 0.252
4.714ValIle: 4.714 ± 0.516
4.832ValLys: 4.832 ± 0.49
5.009ValLeu: 5.009 ± 0.747
1.827ValMet: 1.827 ± 0.276
4.597ValAsn: 4.597 ± 0.428
2.534ValPro: 2.534 ± 0.404
2.004ValGln: 2.004 ± 0.363
2.416ValArg: 2.416 ± 0.4
4.361ValSer: 4.361 ± 0.44
4.95ValThr: 4.95 ± 0.647
4.538ValVal: 4.538 ± 0.429
0.471ValTrp: 0.471 ± 0.139
2.004ValTyr: 2.004 ± 0.42
0.0ValXaa: 0.0 ± 0.0
Trp
1.002TrpAla: 1.002 ± 0.249
0.0TrpCys: 0.0 ± 0.0
1.238TrpAsp: 1.238 ± 0.294
0.648TrpGlu: 0.648 ± 0.219
0.589TrpPhe: 0.589 ± 0.212
0.53TrpGly: 0.53 ± 0.228
0.177TrpHis: 0.177 ± 0.108
0.413TrpIle: 0.413 ± 0.164
0.295TrpLys: 0.295 ± 0.165
1.296TrpLeu: 1.296 ± 0.179
0.413TrpMet: 0.413 ± 0.19
0.943TrpAsn: 0.943 ± 0.246
0.295TrpPro: 0.295 ± 0.149
0.354TrpGln: 0.354 ± 0.141
0.413TrpArg: 0.413 ± 0.145
1.002TrpSer: 1.002 ± 0.25
0.707TrpThr: 0.707 ± 0.198
0.707TrpVal: 0.707 ± 0.227
0.177TrpTrp: 0.177 ± 0.106
0.766TrpTyr: 0.766 ± 0.222
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.888TyrAla: 2.888 ± 0.447
0.471TyrCys: 0.471 ± 0.169
2.947TyrAsp: 2.947 ± 0.654
2.593TyrGlu: 2.593 ± 0.448
1.473TyrPhe: 1.473 ± 0.276
2.239TyrGly: 2.239 ± 0.388
0.884TyrHis: 0.884 ± 0.249
3.005TyrIle: 3.005 ± 0.539
3.359TyrLys: 3.359 ± 0.541
4.007TyrLeu: 4.007 ± 0.593
1.179TyrMet: 1.179 ± 0.228
4.184TyrAsn: 4.184 ± 0.666
1.65TyrPro: 1.65 ± 0.298
1.296TyrGln: 1.296 ± 0.31
2.063TyrArg: 2.063 ± 0.354
2.711TyrSer: 2.711 ± 0.505
1.886TyrThr: 1.886 ± 0.405
2.239TyrVal: 2.239 ± 0.378
0.413TyrTrp: 0.413 ± 0.14
2.298TyrTyr: 2.298 ± 0.516
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 78 proteins (16970 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski