Amino acid dipeptide frequency for Enterobacteria phage CUS-3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random in proteins, with all 20 amino acids equally frequent, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
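As an illustration of how such frequencies can be obtained, here is a minimal Python sketch. It is not the code used by Proteome-pI: the function name `dipeptide_permille`, the use of overlapping windows, the mapping of every non-standard letter to `X` (Xaa), and the normalization by the total number of dipeptide windows are all assumptions made for this example.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins.

    Hypothetical helper: counting and normalization conventions are assumptions.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to 'X' (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; a window never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (not real phage proteins), with one non-standard residue 'U'.
freqs = dipeptide_permille(["AAC", "ACU"])
```

For the toy input there are four windows (AA, AC, AC, CX), so AC accounts for half of them. A real run would read the protein sequences of the proteome from a FASTA file instead.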

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
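The unit conversion and the comparison with the uniform expectation can be sketched in a few lines of Python. The helper names are hypothetical, and the exact rule the page uses to color a cell red or blue is not stated here, so the ratio below is only one plausible way to compare a value with the 2.5‰ baseline.

```python
# Uniform expectation for 400 equally likely dipeptides, in per mille.
UNIFORM_PERMILLE = 1000.0 / 400  # 2.5

def permille_to_fraction(value_permille):
    """Convert a per mille value from the table to a plain fraction."""
    return value_permille * 1e-3

def fold_vs_uniform(value_permille):
    """Ratio of an observed per mille value to the uniform expectation."""
    return value_permille / UNIFORM_PERMILLE

# AlaAla and CysCys values taken from the table.
ala_ala_fraction = permille_to_fraction(10.583)
ala_ala_fold = fold_vs_uniform(10.583)
cys_cys_fold = fold_vs_uniform(0.167)
```

With these numbers AlaAla is roughly four times more frequent than the uniform expectation, while CysCys is far below it.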

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.583 ± 1.483
AlaCys: 0.917 ± 0.311
AlaAsp: 5.833 ± 0.821
AlaGlu: 8.75 ± 1.561
AlaPhe: 3.667 ± 0.602
AlaGly: 6.917 ± 1.543
AlaHis: 2.083 ± 0.658
AlaIle: 6.583 ± 0.809
AlaLys: 6.833 ± 0.856
AlaLeu: 7.5 ± 0.93
AlaMet: 3.583 ± 0.633
AlaAsn: 5.0 ± 0.713
AlaPro: 1.833 ± 0.348
AlaGln: 5.083 ± 0.94
AlaArg: 6.417 ± 0.688
AlaSer: 4.583 ± 0.691
AlaThr: 5.917 ± 0.748
AlaVal: 4.75 ± 0.634
AlaTrp: 2.167 ± 0.406
AlaTyr: 2.083 ± 0.406
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.083 ± 0.335
CysCys: 0.167 ± 0.126
CysAsp: 0.75 ± 0.315
CysGlu: 0.417 ± 0.182
CysPhe: 0.25 ± 0.118
CysGly: 1.0 ± 0.287
CysHis: 0.167 ± 0.123
CysIle: 0.417 ± 0.195
CysLys: 0.75 ± 0.286
CysLeu: 0.75 ± 0.27
CysMet: 0.583 ± 0.245
CysAsn: 0.833 ± 0.269
CysPro: 0.417 ± 0.19
CysGln: 0.333 ± 0.166
CysArg: 0.417 ± 0.173
CysSer: 0.917 ± 0.277
CysThr: 0.167 ± 0.129
CysVal: 0.833 ± 0.239
CysTrp: 0.167 ± 0.123
CysTyr: 0.333 ± 0.162
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.417 ± 0.855
AspCys: 0.5 ± 0.23
AspAsp: 4.5 ± 0.642
AspGlu: 3.75 ± 0.724
AspPhe: 2.0 ± 0.546
AspGly: 5.0 ± 0.75
AspHis: 0.833 ± 0.199
AspIle: 4.417 ± 0.535
AspLys: 2.5 ± 0.453
AspLeu: 4.25 ± 0.554
AspMet: 1.75 ± 0.464
AspAsn: 3.417 ± 0.626
AspPro: 2.0 ± 0.398
AspGln: 1.583 ± 0.282
AspArg: 3.083 ± 0.504
AspSer: 3.083 ± 0.514
AspThr: 2.167 ± 0.434
AspVal: 5.083 ± 0.594
AspTrp: 1.083 ± 0.263
AspTyr: 2.167 ± 0.386
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.417 ± 1.278
GluCys: 0.5 ± 0.198
GluAsp: 2.917 ± 0.512
GluGlu: 5.667 ± 0.91
GluPhe: 2.083 ± 0.369
GluGly: 4.083 ± 0.466
GluHis: 1.583 ± 0.339
GluIle: 4.167 ± 0.645
GluLys: 4.333 ± 0.547
GluLeu: 6.833 ± 0.728
GluMet: 1.917 ± 0.355
GluAsn: 3.917 ± 0.493
GluPro: 2.167 ± 0.412
GluGln: 3.917 ± 0.859
GluArg: 5.25 ± 0.909
GluSer: 4.0 ± 0.652
GluThr: 2.417 ± 0.601
GluVal: 4.083 ± 0.513
GluTrp: 1.667 ± 0.41
GluTyr: 2.333 ± 0.53
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.167 ± 0.528
PheCys: 0.333 ± 0.184
PheAsp: 2.0 ± 0.427
PheGlu: 2.25 ± 0.386
PhePhe: 0.25 ± 0.134
PheGly: 2.083 ± 0.349
PheHis: 0.25 ± 0.111
PheIle: 1.75 ± 0.484
PheLys: 1.833 ± 0.406
PheLeu: 2.0 ± 0.375
PheMet: 0.917 ± 0.304
PheAsn: 1.667 ± 0.329
PhePro: 0.667 ± 0.21
PheGln: 0.917 ± 0.322
PheArg: 1.833 ± 0.43
PheSer: 2.167 ± 0.407
PheThr: 2.167 ± 0.413
PheVal: 2.167 ± 0.478
PheTrp: 0.417 ± 0.237
PheTyr: 1.333 ± 0.355
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.167 ± 0.934
GlyCys: 0.75 ± 0.243
GlyAsp: 3.75 ± 0.595
GlyGlu: 4.667 ± 0.557
GlyPhe: 1.833 ± 0.385
GlyGly: 4.417 ± 0.924
GlyHis: 1.083 ± 0.281
GlyIle: 5.083 ± 0.744
GlyLys: 4.25 ± 0.604
GlyLeu: 4.417 ± 0.538
GlyMet: 2.5 ± 0.537
GlyAsn: 2.75 ± 0.491
GlyPro: 1.083 ± 0.254
GlyGln: 4.0 ± 0.753
GlyArg: 4.583 ± 0.533
GlySer: 4.75 ± 0.708
GlyThr: 3.167 ± 0.553
GlyVal: 4.583 ± 0.649
GlyTrp: 1.0 ± 0.281
GlyTyr: 2.25 ± 0.397
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.333 ± 0.36
HisCys: 0.333 ± 0.156
HisAsp: 1.25 ± 0.278
HisGlu: 1.667 ± 0.68
HisPhe: 0.833 ± 0.286
HisGly: 2.0 ± 0.452
HisHis: 0.333 ± 0.185
HisIle: 0.75 ± 0.257
HisLys: 1.167 ± 0.28
HisLeu: 1.0 ± 0.312
HisMet: 0.583 ± 0.209
HisAsn: 0.667 ± 0.257
HisPro: 0.5 ± 0.204
HisGln: 0.667 ± 0.218
HisArg: 0.917 ± 0.234
HisSer: 0.917 ± 0.335
HisThr: 0.25 ± 0.153
HisVal: 0.833 ± 0.278
HisTrp: 0.167 ± 0.103
HisTyr: 0.583 ± 0.199
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.667 ± 0.649
IleCys: 0.917 ± 0.281
IleAsp: 4.167 ± 0.489
IleGlu: 5.667 ± 0.684
IlePhe: 1.083 ± 0.282
IleGly: 4.083 ± 0.498
IleHis: 1.083 ± 0.341
IleIle: 3.5 ± 0.63
IleLys: 3.083 ± 0.466
IleLeu: 3.667 ± 0.456
IleMet: 1.25 ± 0.363
IleAsn: 3.0 ± 0.559
IlePro: 3.75 ± 0.499
IleGln: 2.583 ± 0.488
IleArg: 3.75 ± 0.569
IleSer: 4.083 ± 0.582
IleThr: 5.0 ± 0.601
IleVal: 3.333 ± 0.465
IleTrp: 0.5 ± 0.234
IleTyr: 1.917 ± 0.398
IleXaa: 0.0 ± 0.0
Lys
LysAla: 8.0 ± 1.284
LysCys: 0.667 ± 0.287
LysAsp: 3.083 ± 0.524
LysGlu: 4.5 ± 0.76
LysPhe: 1.583 ± 0.278
LysGly: 3.333 ± 0.529
LysHis: 0.667 ± 0.202
LysIle: 3.667 ± 0.471
LysLys: 3.833 ± 0.676
LysLeu: 5.333 ± 0.73
LysMet: 2.25 ± 0.495
LysAsn: 3.0 ± 0.441
LysPro: 3.5 ± 0.66
LysGln: 3.083 ± 0.568
LysArg: 4.083 ± 0.796
LysSer: 2.917 ± 0.587
LysThr: 3.0 ± 0.514
LysVal: 3.0 ± 0.45
LysTrp: 0.667 ± 0.214
LysTyr: 2.333 ± 0.441
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.0 ± 0.88
LeuCys: 0.75 ± 0.246
LeuAsp: 4.333 ± 0.581
LeuGlu: 5.667 ± 0.61
LeuPhe: 2.583 ± 0.588
LeuGly: 4.083 ± 0.695
LeuHis: 0.583 ± 0.307
LeuIle: 5.083 ± 0.59
LeuLys: 5.417 ± 0.872
LeuLeu: 5.75 ± 0.749
LeuMet: 2.167 ± 0.446
LeuAsn: 3.917 ± 0.608
LeuPro: 3.25 ± 0.489
LeuGln: 3.083 ± 0.435
LeuArg: 4.75 ± 0.604
LeuSer: 5.583 ± 0.721
LeuThr: 4.0 ± 0.522
LeuVal: 3.333 ± 0.542
LeuTrp: 1.167 ± 0.295
LeuTyr: 2.667 ± 0.512
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.667 ± 0.538
MetCys: 0.667 ± 0.252
MetAsp: 1.917 ± 0.584
MetGlu: 1.5 ± 0.297
MetPhe: 0.667 ± 0.211
MetGly: 1.0 ± 0.322
MetHis: 0.25 ± 0.14
MetIle: 1.917 ± 0.361
MetLys: 1.833 ± 0.394
MetLeu: 2.5 ± 0.535
MetMet: 0.5 ± 0.214
MetAsn: 1.083 ± 0.343
MetPro: 1.417 ± 0.339
MetGln: 1.0 ± 0.322
MetArg: 2.083 ± 0.43
MetSer: 2.5 ± 0.506
MetThr: 2.25 ± 0.345
MetVal: 1.833 ± 0.426
MetTrp: 0.25 ± 0.149
MetTyr: 0.917 ± 0.318
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.667 ± 0.66
AsnCys: 0.417 ± 0.205
AsnAsp: 2.167 ± 0.433
AsnGlu: 3.0 ± 0.433
AsnPhe: 0.75 ± 0.225
AsnGly: 4.833 ± 0.627
AsnHis: 1.5 ± 0.354
AsnIle: 2.083 ± 0.487
AsnLys: 3.083 ± 0.532
AsnLeu: 3.333 ± 0.498
AsnMet: 0.917 ± 0.274
AsnAsn: 3.0 ± 0.666
AsnPro: 2.167 ± 0.506
AsnGln: 2.5 ± 0.435
AsnArg: 3.0 ± 0.473
AsnSer: 2.667 ± 0.452
AsnThr: 1.583 ± 0.415
AsnVal: 2.75 ± 0.568
AsnTrp: 0.5 ± 0.194
AsnTyr: 2.25 ± 0.414
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.083 ± 0.438
ProCys: 0.25 ± 0.139
ProAsp: 2.667 ± 0.499
ProGlu: 3.5 ± 0.527
ProPhe: 1.417 ± 0.327
ProGly: 2.083 ± 0.371
ProHis: 0.5 ± 0.207
ProIle: 1.417 ± 0.331
ProLys: 2.583 ± 0.398
ProLeu: 2.667 ± 0.396
ProMet: 1.25 ± 0.347
ProAsn: 1.25 ± 0.317
ProPro: 1.583 ± 0.347
ProGln: 1.583 ± 0.485
ProArg: 1.75 ± 0.357
ProSer: 2.167 ± 0.39
ProThr: 1.833 ± 0.36
ProVal: 3.25 ± 0.714
ProTrp: 0.5 ± 0.26
ProTyr: 1.167 ± 0.269
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.417 ± 1.042
GlnCys: 0.25 ± 0.153
GlnAsp: 1.917 ± 0.45
GlnGlu: 3.0 ± 0.641
GlnPhe: 1.417 ± 0.286
GlnGly: 2.833 ± 0.661
GlnHis: 0.583 ± 0.244
GlnIle: 3.667 ± 0.395
GlnLys: 2.833 ± 0.485
GlnLeu: 3.417 ± 0.507
GlnMet: 1.333 ± 0.3
GlnAsn: 2.167 ± 0.376
GlnPro: 1.667 ± 0.325
GlnGln: 4.333 ± 1.018
GlnArg: 3.0 ± 0.548
GlnSer: 2.833 ± 0.5
GlnThr: 1.417 ± 0.4
GlnVal: 2.917 ± 0.539
GlnTrp: 0.75 ± 0.218
GlnTyr: 1.917 ± 0.438
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.417 ± 0.663
ArgCys: 0.5 ± 0.199
ArgAsp: 3.583 ± 0.51
ArgGlu: 4.333 ± 0.702
ArgPhe: 2.0 ± 0.376
ArgGly: 3.5 ± 0.535
ArgHis: 1.333 ± 0.373
ArgIle: 4.417 ± 0.671
ArgLys: 5.333 ± 0.697
ArgLeu: 5.583 ± 0.739
ArgMet: 2.167 ± 0.474
ArgAsn: 3.167 ± 0.522
ArgPro: 1.583 ± 0.393
ArgGln: 2.583 ± 0.441
ArgArg: 4.5 ± 0.665
ArgSer: 3.583 ± 0.458
ArgThr: 1.917 ± 0.372
ArgVal: 3.167 ± 0.567
ArgTrp: 1.083 ± 0.302
ArgTyr: 2.083 ± 0.457
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.083 ± 0.638
SerCys: 0.333 ± 0.166
SerAsp: 4.333 ± 0.543
SerGlu: 3.5 ± 0.608
SerPhe: 2.833 ± 0.554
SerGly: 5.667 ± 0.75
SerHis: 1.083 ± 0.259
SerIle: 3.833 ± 0.595
SerLys: 2.583 ± 0.526
SerLeu: 5.833 ± 0.588
SerMet: 2.25 ± 0.43
SerAsn: 2.75 ± 0.378
SerPro: 1.917 ± 0.353
SerGln: 3.5 ± 0.615
SerArg: 2.833 ± 0.473
SerSer: 3.75 ± 0.531
SerThr: 2.667 ± 0.504
SerVal: 3.417 ± 0.607
SerTrp: 0.833 ± 0.236
SerTyr: 1.833 ± 0.396
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.417 ± 0.787
ThrCys: 0.75 ± 0.218
ThrAsp: 3.667 ± 0.648
ThrGlu: 2.917 ± 0.524
ThrPhe: 1.417 ± 0.457
ThrGly: 4.0 ± 0.631
ThrHis: 0.417 ± 0.157
ThrIle: 3.5 ± 0.565
ThrLys: 2.667 ± 0.416
ThrLeu: 2.75 ± 0.622
ThrMet: 1.167 ± 0.336
ThrAsn: 1.5 ± 0.302
ThrPro: 3.5 ± 0.62
ThrGln: 2.25 ± 0.478
ThrArg: 1.667 ± 0.324
ThrSer: 3.0 ± 0.52
ThrThr: 3.417 ± 0.643
ThrVal: 4.25 ± 0.552
ThrTrp: 0.5 ± 0.197
ThrTyr: 1.083 ± 0.288
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.083 ± 0.7
ValCys: 0.75 ± 0.235
ValAsp: 3.5 ± 0.569
ValGlu: 4.583 ± 0.633
ValPhe: 2.0 ± 0.382
ValGly: 3.917 ± 0.623
ValHis: 1.25 ± 0.365
ValIle: 4.5 ± 0.592
ValLys: 4.0 ± 0.576
ValLeu: 4.083 ± 0.614
ValMet: 1.333 ± 0.332
ValAsn: 3.083 ± 0.649
ValPro: 1.583 ± 0.431
ValGln: 2.0 ± 0.394
ValArg: 3.167 ± 0.572
ValSer: 4.833 ± 0.71
ValThr: 3.75 ± 0.608
ValVal: 4.083 ± 0.668
ValTrp: 0.417 ± 0.191
ValTyr: 2.167 ± 0.698
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.667 ± 0.324
TrpCys: 0.25 ± 0.135
TrpAsp: 0.583 ± 0.207
TrpGlu: 0.417 ± 0.149
TrpPhe: 0.333 ± 0.175
TrpGly: 0.75 ± 0.276
TrpHis: 0.333 ± 0.148
TrpIle: 0.5 ± 0.242
TrpLys: 1.25 ± 0.326
TrpLeu: 2.333 ± 0.347
TrpMet: 0.667 ± 0.249
TrpAsn: 0.667 ± 0.267
TrpPro: 0.167 ± 0.106
TrpGln: 0.417 ± 0.172
TrpArg: 0.917 ± 0.234
TrpSer: 1.0 ± 0.253
TrpThr: 0.917 ± 0.28
TrpVal: 0.667 ± 0.242
TrpTrp: 0.25 ± 0.139
TrpTyr: 0.5 ± 0.201
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.833 ± 0.537
TyrCys: 0.75 ± 0.212
TyrAsp: 2.667 ± 0.607
TyrGlu: 1.5 ± 0.36
TyrPhe: 1.167 ± 0.327
TyrGly: 1.5 ± 0.35
TyrHis: 0.833 ± 0.239
TyrIle: 1.833 ± 0.375
TyrLys: 2.333 ± 0.442
TyrLeu: 2.167 ± 0.431
TyrMet: 0.583 ± 0.192
TyrAsn: 1.333 ± 0.325
TyrPro: 1.75 ± 0.443
TyrGln: 2.25 ± 0.607
TyrArg: 2.833 ± 0.509
TyrSer: 1.5 ± 0.332
TyrThr: 1.833 ± 0.322
TyrVal: 1.75 ± 0.365
TyrTrp: 0.5 ± 0.187
TyrTyr: 1.417 ± 0.365
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 62 proteins (12001 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
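To make the note concrete, here is a minimal Python sketch of bootstrapping at the protein level: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the replicates is reported as the error. The function names, the fixed seed, and the choice of the sample standard deviation as the error measure are assumptions for illustration; the page does not specify these details.

```python
import random
import statistics

def pair_permille(proteins, pair):
    """Frequency of one dipeptide (per mille) over overlapping windows."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of the frequency over n_boot protein-level resamples.

    Hypothetical helper: the error is taken as the sample standard
    deviation of the bootstrap replicates (an assumption).
    """
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_permille(sample, pair))
    return statistics.stdev(replicates)

# Toy sequences, one-letter codes (not real phage proteins).
toy = ["AAC", "ACAA", "GGAAK", "MKT"]
err_aa = bootstrap_error(toy, "AA")
err_absent = bootstrap_error(toy, "XX")
```

A dipeptide that never occurs gives 0.0 in every replicate, so its error is also 0.0, which matches the `0.0 ± 0.0` entries in the Xaa rows and columns.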

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski