Amino acid dipeptide frequency for Bacteriophage Lily

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies.
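As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping pairs of adjacent residues and reports them in per mille. It rests on assumptions not stated on this page: pairs are counted within each protein only (never across two proteins), the denominator is the total number of pairs, and any letter outside the 20 standard one-letter codes is mapped to Xaa. The function names and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

# Three-letter codes in the same residue order as the table (Xaa is added last).
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def to_three(residue):
    """Map a one-letter code to a three-letter code; unknown letters become Xaa."""
    return ONE_TO_THREE.get(residue.upper(), "Xaa")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: pairs are taken within one protein; none spans two proteins.
        for first, second in zip(seq, seq[1:]):
            counts[to_three(first) + to_three(second)] += 1
    total = sum(counts.values())
    codes = list(ONE_TO_THREE.values()) + ["Xaa"]
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(codes, repeat=2)
    }


# Hypothetical toy sequences, not taken from the Lily proteome.
toy = ["MKKLAE", "AEKK"]
freqs = dipeptide_per_mille(toy)
print(freqs["LysLys"], freqs["AlaGlu"])
```

The toy input contains 8 pairs in total, of which 2 are LysLys and 2 are AlaGlu.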

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each one would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
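The expected value follows from 1/400 = 0.0025, that is 0.25% or 2.5‰ (about 2.27‰ if all 441 pairs are counted). A minimal sketch of the comparison is shown below; it assumes that over- and underrepresentation are decided by a plain comparison with this uniform expectation, which may be simpler than the rule actually used for the colours. The function names are hypothetical; the three example values are copied from the table.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (alphabet_size ** 2)


def classify(observed_per_mille, alphabet_size=20):
    """Compare an observed frequency with the uniform expectation."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# Values copied from the table for this proteome (per mille).
examples = {"GluLys": 8.506, "LysPro": 2.42, "CysHis": 0.073}
labels = {name: classify(value) for name, value in examples.items()}
print(expected_per_mille(), labels)
```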

All values are given in per mille (‰); multiply by 10^-3 to obtain fractions.
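For readers who prefer fractions or percentages, the unit conversion is a one-liner. The helper names below are hypothetical, and the example value is the GluLys entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


glu_lys = 8.506  # GluLys, per mille, copied from the table
fraction = per_mille_to_fraction(glu_lys)
percent = per_mille_to_percent(glu_lys)
print(fraction, percent)
```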

For more information, see the sequence space article on Wikipedia.

Residue order (first and second position): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Entries are grouped by the first residue of the dipeptide and given as frequency ± error (‰).
Ala
AlaAla: 4.913 ± 0.694
AlaCys: 0.88 ± 0.202
AlaAsp: 3.446 ± 0.447
AlaGlu: 6.159 ± 0.721
AlaPhe: 3.153 ± 0.693
AlaGly: 4.839 ± 0.731
AlaHis: 0.44 ± 0.169
AlaIle: 4.766 ± 0.571
AlaLys: 5.793 ± 0.6
AlaLeu: 6.526 ± 0.882
AlaMet: 1.686 ± 0.395
AlaAsn: 2.126 ± 0.368
AlaPro: 2.64 ± 0.499
AlaGln: 3.006 ± 0.501
AlaArg: 2.566 ± 0.377
AlaSer: 4.399 ± 0.569
AlaThr: 3.886 ± 0.552
AlaVal: 5.279 ± 0.767
AlaTrp: 1.027 ± 0.334
AlaTyr: 2.126 ± 0.504
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.22 ± 0.124
CysCys: 0.22 ± 0.117
CysAsp: 0.513 ± 0.204
CysGlu: 0.733 ± 0.209
CysPhe: 0.367 ± 0.158
CysGly: 0.587 ± 0.179
CysHis: 0.073 ± 0.066
CysIle: 0.587 ± 0.214
CysLys: 0.953 ± 0.358
CysLeu: 0.733 ± 0.291
CysMet: 0.147 ± 0.09
CysAsn: 0.22 ± 0.134
CysPro: 0.587 ± 0.228
CysGln: 0.513 ± 0.199
CysArg: 0.587 ± 0.185
CysSer: 0.587 ± 0.194
CysThr: 0.293 ± 0.149
CysVal: 0.733 ± 0.223
CysTrp: 0.147 ± 0.11
CysTyr: 0.367 ± 0.166
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.52 ± 0.597
AspCys: 0.293 ± 0.15
AspAsp: 4.033 ± 0.574
AspGlu: 5.499 ± 0.683
AspPhe: 2.713 ± 0.438
AspGly: 3.886 ± 0.506
AspHis: 1.027 ± 0.253
AspIle: 2.786 ± 0.441
AspLys: 3.886 ± 0.491
AspLeu: 5.426 ± 0.56
AspMet: 1.686 ± 0.364
AspAsn: 2.053 ± 0.354
AspPro: 3.006 ± 0.481
AspGln: 2.42 ± 0.574
AspArg: 2.64 ± 0.437
AspSer: 4.106 ± 0.526
AspThr: 2.126 ± 0.368
AspVal: 4.179 ± 0.681
AspTrp: 1.027 ± 0.311
AspTyr: 1.833 ± 0.327
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.426 ± 0.545
GluCys: 0.367 ± 0.168
GluAsp: 4.326 ± 0.55
GluGlu: 7.332 ± 0.962
GluPhe: 2.566 ± 0.375
GluGly: 4.179 ± 0.581
GluHis: 1.393 ± 0.312
GluIle: 7.259 ± 1.002
GluLys: 8.506 ± 1.126
GluLeu: 7.112 ± 0.877
GluMet: 3.226 ± 0.54
GluAsn: 2.86 ± 0.449
GluPro: 2.346 ± 0.42
GluGln: 3.74 ± 0.578
GluArg: 3.666 ± 0.47
GluSer: 3.446 ± 0.448
GluThr: 3.886 ± 0.542
GluVal: 5.279 ± 0.656
GluTrp: 1.32 ± 0.364
GluTyr: 3.3 ± 0.548
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.346 ± 0.446
PheCys: 0.587 ± 0.22
PheAsp: 2.713 ± 0.565
PheGlu: 2.346 ± 0.444
PhePhe: 1.173 ± 0.293
PheGly: 2.493 ± 0.579
PheHis: 0.66 ± 0.305
PheIle: 2.42 ± 0.402
PheLys: 2.86 ± 0.498
PheLeu: 3.08 ± 0.447
PheMet: 0.807 ± 0.298
PheAsn: 1.613 ± 0.393
PhePro: 1.54 ± 0.35
PheGln: 0.953 ± 0.276
PheArg: 1.906 ± 0.369
PheSer: 3.3 ± 0.455
PheThr: 2.713 ± 0.459
PheVal: 2.493 ± 0.466
PheTrp: 0.44 ± 0.139
PheTyr: 1.1 ± 0.282
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.106 ± 0.664
GlyCys: 0.807 ± 0.24
GlyAsp: 3.74 ± 0.686
GlyGlu: 4.766 ± 0.622
GlyPhe: 2.713 ± 0.41
GlyGly: 5.353 ± 0.763
GlyHis: 0.953 ± 0.234
GlyIle: 5.793 ± 0.797
GlyLys: 8.066 ± 0.712
GlyLeu: 4.546 ± 0.558
GlyMet: 1.686 ± 0.453
GlyAsn: 3.226 ± 0.445
GlyPro: 1.76 ± 0.436
GlyGln: 2.713 ± 0.462
GlyArg: 3.08 ± 0.386
GlySer: 4.033 ± 0.637
GlyThr: 2.713 ± 0.42
GlyVal: 4.253 ± 0.512
GlyTrp: 1.32 ± 0.301
GlyTyr: 2.64 ± 0.513
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.953 ± 0.281
HisCys: 0.367 ± 0.202
HisAsp: 0.587 ± 0.222
HisGlu: 1.247 ± 0.313
HisPhe: 0.88 ± 0.272
HisGly: 0.953 ± 0.237
HisHis: 0.44 ± 0.173
HisIle: 1.54 ± 0.384
HisLys: 1.173 ± 0.427
HisLeu: 2.053 ± 0.417
HisMet: 0.44 ± 0.202
HisAsn: 0.733 ± 0.206
HisPro: 0.733 ± 0.268
HisGln: 0.367 ± 0.185
HisArg: 0.807 ± 0.268
HisSer: 0.733 ± 0.196
HisThr: 0.293 ± 0.162
HisVal: 1.393 ± 0.327
HisTrp: 0.44 ± 0.195
HisTyr: 0.66 ± 0.213
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.3 ± 0.493
IleCys: 0.807 ± 0.227
IleAsp: 4.253 ± 0.469
IleGlu: 5.426 ± 0.707
IlePhe: 2.713 ± 0.435
IleGly: 4.766 ± 0.608
IleHis: 1.247 ± 0.294
IleIle: 3.226 ± 0.619
IleLys: 4.766 ± 0.556
IleLeu: 4.913 ± 0.62
IleMet: 1.686 ± 0.312
IleAsn: 2.933 ± 0.509
IlePro: 3.593 ± 0.462
IleGln: 3.373 ± 0.414
IleArg: 3.813 ± 0.473
IleSer: 4.033 ± 0.63
IleThr: 3.74 ± 0.543
IleVal: 4.179 ± 0.539
IleTrp: 0.66 ± 0.259
IleTyr: 2.64 ± 0.525
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.893 ± 0.735
LysCys: 0.513 ± 0.189
LysAsp: 4.619 ± 0.508
LysGlu: 8.359 ± 0.833
LysPhe: 2.64 ± 0.392
LysGly: 5.279 ± 0.706
LysHis: 1.613 ± 0.271
LysIle: 4.693 ± 0.679
LysLys: 6.893 ± 0.866
LysLeu: 5.646 ± 0.681
LysMet: 2.2 ± 0.399
LysAsn: 2.566 ± 0.376
LysPro: 2.42 ± 0.425
LysGln: 4.473 ± 0.654
LysArg: 4.546 ± 0.715
LysSer: 4.326 ± 0.506
LysThr: 4.986 ± 0.524
LysVal: 5.353 ± 0.533
LysTrp: 1.393 ± 0.264
LysTyr: 2.933 ± 0.592
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.186 ± 0.694
LeuCys: 0.88 ± 0.25
LeuAsp: 5.206 ± 0.533
LeuGlu: 5.866 ± 0.734
LeuPhe: 3.08 ± 0.561
LeuGly: 6.746 ± 0.723
LeuHis: 1.613 ± 0.306
LeuIle: 4.546 ± 0.59
LeuLys: 6.673 ± 0.678
LeuLeu: 5.646 ± 0.765
LeuMet: 1.686 ± 0.313
LeuAsn: 3.373 ± 0.554
LeuPro: 3.886 ± 0.488
LeuGln: 3.153 ± 0.32
LeuArg: 3.74 ± 0.469
LeuSer: 5.866 ± 0.95
LeuThr: 4.693 ± 0.557
LeuVal: 4.399 ± 0.446
LeuTrp: 0.953 ± 0.292
LeuTyr: 3.08 ± 0.461
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.566 ± 0.368
MetCys: 0.147 ± 0.113
MetAsp: 1.54 ± 0.326
MetGlu: 1.833 ± 0.37
MetPhe: 1.1 ± 0.345
MetGly: 1.54 ± 0.363
MetHis: 0.22 ± 0.126
MetIle: 1.466 ± 0.354
MetLys: 2.786 ± 0.388
MetLeu: 2.346 ± 0.443
MetMet: 0.66 ± 0.225
MetAsn: 2.346 ± 0.448
MetPro: 0.953 ± 0.329
MetGln: 1.027 ± 0.357
MetArg: 1.393 ± 0.366
MetSer: 1.613 ± 0.321
MetThr: 1.466 ± 0.321
MetVal: 1.247 ± 0.259
MetTrp: 0.293 ± 0.17
MetTyr: 0.733 ± 0.257
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.786 ± 0.526
AsnCys: 0.44 ± 0.183
AsnAsp: 1.32 ± 0.343
AsnGlu: 4.253 ± 0.541
AsnPhe: 1.247 ± 0.277
AsnGly: 3.373 ± 0.454
AsnHis: 0.513 ± 0.193
AsnIle: 2.713 ± 0.509
AsnLys: 3.153 ± 0.473
AsnLeu: 3.593 ± 0.537
AsnMet: 1.027 ± 0.23
AsnAsn: 2.346 ± 0.478
AsnPro: 2.126 ± 0.338
AsnGln: 1.54 ± 0.309
AsnArg: 2.346 ± 0.45
AsnSer: 2.126 ± 0.388
AsnThr: 1.98 ± 0.27
AsnVal: 2.493 ± 0.396
AsnTrp: 0.367 ± 0.147
AsnTyr: 1.247 ± 0.278
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.786 ± 0.485
ProCys: 0.367 ± 0.158
ProAsp: 2.786 ± 0.434
ProGlu: 2.933 ± 0.403
ProPhe: 1.54 ± 0.352
ProGly: 2.933 ± 0.534
ProHis: 0.66 ± 0.245
ProIle: 2.2 ± 0.379
ProLys: 3.226 ± 0.457
ProLeu: 2.86 ± 0.48
ProMet: 1.027 ± 0.312
ProAsn: 1.686 ± 0.312
ProPro: 1.54 ± 0.343
ProGln: 1.393 ± 0.291
ProArg: 1.173 ± 0.309
ProSer: 3.006 ± 0.462
ProThr: 1.906 ± 0.375
ProVal: 2.933 ± 0.571
ProTrp: 0.953 ± 0.247
ProTyr: 1.466 ± 0.342
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.786 ± 0.576
GlnCys: 0.0 ± 0.0
GlnAsp: 2.273 ± 0.407
GlnGlu: 3.74 ± 0.583
GlnPhe: 1.54 ± 0.306
GlnGly: 1.54 ± 0.323
GlnHis: 0.513 ± 0.217
GlnIle: 2.713 ± 0.479
GlnLys: 3.446 ± 0.567
GlnLeu: 3.96 ± 0.559
GlnMet: 1.393 ± 0.28
GlnAsn: 2.273 ± 0.416
GlnPro: 1.466 ± 0.466
GlnGln: 1.906 ± 0.407
GlnArg: 2.566 ± 0.372
GlnSer: 1.76 ± 0.289
GlnThr: 1.76 ± 0.311
GlnVal: 3.08 ± 0.557
GlnTrp: 0.147 ± 0.113
GlnTyr: 1.76 ± 0.382
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.593 ± 0.598
ArgCys: 0.293 ± 0.145
ArgAsp: 2.713 ± 0.435
ArgGlu: 4.033 ± 0.503
ArgPhe: 1.906 ± 0.37
ArgGly: 2.933 ± 0.365
ArgHis: 1.027 ± 0.242
ArgIle: 4.179 ± 0.531
ArgLys: 4.473 ± 0.713
ArgLeu: 4.033 ± 0.58
ArgMet: 1.833 ± 0.349
ArgAsn: 1.833 ± 0.381
ArgPro: 2.053 ± 0.324
ArgGln: 1.98 ± 0.405
ArgArg: 2.126 ± 0.394
ArgSer: 2.42 ± 0.515
ArgThr: 2.2 ± 0.436
ArgVal: 3.08 ± 0.542
ArgTrp: 0.513 ± 0.198
ArgTyr: 1.247 ± 0.371
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.059 ± 0.692
SerCys: 0.44 ± 0.174
SerAsp: 3.666 ± 0.489
SerGlu: 3.886 ± 0.605
SerPhe: 2.2 ± 0.435
SerGly: 5.059 ± 0.684
SerHis: 0.66 ± 0.208
SerIle: 4.839 ± 0.53
SerLys: 4.106 ± 0.48
SerLeu: 4.473 ± 0.636
SerMet: 1.54 ± 0.345
SerAsn: 2.273 ± 0.423
SerPro: 2.713 ± 0.44
SerGln: 2.42 ± 0.459
SerArg: 3.446 ± 0.545
SerSer: 4.546 ± 0.725
SerThr: 3.226 ± 0.546
SerVal: 4.913 ± 0.65
SerTrp: 0.88 ± 0.274
SerTyr: 1.54 ± 0.376
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.52 ± 0.64
ThrCys: 0.22 ± 0.111
ThrAsp: 3.08 ± 0.556
ThrGlu: 4.179 ± 0.481
ThrPhe: 1.686 ± 0.309
ThrGly: 4.986 ± 0.57
ThrHis: 1.32 ± 0.36
ThrIle: 3.593 ± 0.505
ThrLys: 3.006 ± 0.462
ThrLeu: 5.133 ± 0.662
ThrMet: 1.686 ± 0.351
ThrAsn: 1.613 ± 0.387
ThrPro: 1.613 ± 0.308
ThrGln: 1.613 ± 0.34
ThrArg: 2.2 ± 0.391
ThrSer: 3.153 ± 0.548
ThrThr: 2.933 ± 0.527
ThrVal: 4.619 ± 0.697
ThrTrp: 0.66 ± 0.195
ThrTyr: 2.126 ± 0.487
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.766 ± 0.669
ValCys: 0.66 ± 0.307
ValAsp: 4.766 ± 0.459
ValGlu: 4.839 ± 0.615
ValPhe: 2.346 ± 0.384
ValGly: 3.74 ± 0.653
ValHis: 0.88 ± 0.26
ValIle: 3.813 ± 0.45
ValLys: 4.913 ± 0.765
ValLeu: 6.159 ± 0.779
ValMet: 1.686 ± 0.307
ValAsn: 2.786 ± 0.426
ValPro: 3.08 ± 0.473
ValGln: 2.053 ± 0.508
ValArg: 3.153 ± 0.464
ValSer: 5.646 ± 0.695
ValThr: 4.619 ± 0.655
ValVal: 4.766 ± 0.582
ValTrp: 0.953 ± 0.297
ValTyr: 2.2 ± 0.418
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.953 ± 0.281
TrpCys: 0.147 ± 0.098
TrpAsp: 0.293 ± 0.152
TrpGlu: 1.247 ± 0.284
TrpPhe: 0.66 ± 0.234
TrpGly: 0.66 ± 0.2
TrpHis: 0.513 ± 0.208
TrpIle: 0.733 ± 0.228
TrpLys: 1.247 ± 0.289
TrpLeu: 1.1 ± 0.307
TrpMet: 0.513 ± 0.186
TrpAsn: 0.953 ± 0.257
TrpPro: 0.293 ± 0.155
TrpGln: 0.66 ± 0.199
TrpArg: 0.587 ± 0.182
TrpSer: 0.807 ± 0.226
TrpThr: 0.807 ± 0.198
TrpVal: 1.393 ± 0.343
TrpTrp: 0.293 ± 0.145
TrpTyr: 0.293 ± 0.13
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.98 ± 0.419
TyrCys: 0.733 ± 0.206
TyrAsp: 2.346 ± 0.395
TyrGlu: 2.86 ± 0.441
TyrPhe: 1.32 ± 0.271
TyrGly: 2.42 ± 0.41
TyrHis: 1.027 ± 0.305
TyrIle: 2.2 ± 0.446
TyrLys: 2.42 ± 0.35
TyrLeu: 2.933 ± 0.579
TyrMet: 0.587 ± 0.215
TyrAsn: 1.173 ± 0.252
TyrPro: 1.027 ± 0.272
TyrGln: 1.32 ± 0.31
TyrArg: 2.126 ± 0.413
TyrSer: 1.98 ± 0.365
TyrThr: 2.713 ± 0.69
TyrVal: 1.76 ± 0.374
TyrTrp: 0.367 ± 0.171
TyrTyr: 1.393 ± 0.38
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 74 proteins (13639 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
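Bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates serves as the error. This is only a sketch under assumptions: the page does not say which statistic is reported as the error, so the sample standard deviation over 100 replicates is used here, and pairs are counted within each protein only. All function names and the toy sequences (one-letter codes) are hypothetical.

```python
import random
import statistics


def count_pair(seq, pair):
    """Count overlapping occurrences of a two-letter pair in one sequence."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == pair)


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide over all adjacent pairs, in per mille."""
    total = sum(max(len(s) - 1, 0) for s in proteins)
    hits = sum(count_pair(s, pair) for s in proteins)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not taken from the Lily proteome.
toy = ["MKKLAEKK", "AEKKGG", "MGGLLA", "KKAEKL"]
observed = pair_per_mille(toy, "KK")
error = bootstrap_error(toy, "KK")
print(f"KK: {observed:.3f} ± {error:.3f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.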

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski