Amino acid dipepetide frequency for Cronobacter phage Dev2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.791AlaAla: 9.791 ± 1.225
0.766AlaCys: 0.766 ± 0.288
5.449AlaAsp: 5.449 ± 0.743
5.874AlaGlu: 5.874 ± 0.859
3.235AlaPhe: 3.235 ± 0.397
8.854AlaGly: 8.854 ± 1.003
1.107AlaHis: 1.107 ± 0.352
5.619AlaIle: 5.619 ± 0.576
7.151AlaLys: 7.151 ± 0.585
7.492AlaLeu: 7.492 ± 0.958
2.639AlaMet: 2.639 ± 0.449
3.576AlaAsn: 3.576 ± 0.478
2.384AlaPro: 2.384 ± 0.511
2.895AlaGln: 2.895 ± 0.617
3.576AlaArg: 3.576 ± 0.595
5.108AlaSer: 5.108 ± 0.681
3.576AlaThr: 3.576 ± 0.673
6.641AlaVal: 6.641 ± 1.131
1.277AlaTrp: 1.277 ± 0.371
2.639AlaTyr: 2.639 ± 0.512
0.0AlaXaa: 0.0 ± 0.0
Cys
0.511CysAla: 0.511 ± 0.149
0.0CysCys: 0.0 ± 0.0
0.766CysAsp: 0.766 ± 0.343
0.681CysGlu: 0.681 ± 0.294
0.596CysPhe: 0.596 ± 0.237
0.766CysGly: 0.766 ± 0.301
0.341CysHis: 0.341 ± 0.163
0.341CysIle: 0.341 ± 0.183
0.596CysLys: 0.596 ± 0.299
0.851CysLeu: 0.851 ± 0.334
0.255CysMet: 0.255 ± 0.197
0.085CysAsn: 0.085 ± 0.079
0.511CysPro: 0.511 ± 0.217
0.085CysGln: 0.085 ± 0.072
0.681CysArg: 0.681 ± 0.326
0.341CysSer: 0.341 ± 0.185
0.426CysThr: 0.426 ± 0.18
0.341CysVal: 0.341 ± 0.183
0.085CysTrp: 0.085 ± 0.082
0.17CysTyr: 0.17 ± 0.127
0.0CysXaa: 0.0 ± 0.0
Asp
6.215AspAla: 6.215 ± 0.76
0.511AspCys: 0.511 ± 0.294
4.172AspAsp: 4.172 ± 0.822
4.257AspGlu: 4.257 ± 0.599
1.958AspPhe: 1.958 ± 0.481
6.385AspGly: 6.385 ± 0.69
1.192AspHis: 1.192 ± 0.345
4.001AspIle: 4.001 ± 0.36
3.661AspLys: 3.661 ± 0.694
5.108AspLeu: 5.108 ± 0.511
1.788AspMet: 1.788 ± 0.383
1.958AspAsn: 1.958 ± 0.489
2.639AspPro: 2.639 ± 0.573
2.299AspGln: 2.299 ± 0.347
2.128AspArg: 2.128 ± 0.406
3.15AspSer: 3.15 ± 0.358
3.576AspThr: 3.576 ± 0.611
4.172AspVal: 4.172 ± 0.587
1.192AspTrp: 1.192 ± 0.379
2.299AspTyr: 2.299 ± 0.364
0.0AspXaa: 0.0 ± 0.0
Glu
7.151GluAla: 7.151 ± 0.946
0.511GluCys: 0.511 ± 0.231
4.512GluAsp: 4.512 ± 0.708
4.001GluGlu: 4.001 ± 0.62
2.469GluPhe: 2.469 ± 0.494
5.364GluGly: 5.364 ± 0.76
1.022GluHis: 1.022 ± 0.278
2.639GluIle: 2.639 ± 0.364
2.639GluLys: 2.639 ± 0.463
5.874GluLeu: 5.874 ± 0.763
2.043GluMet: 2.043 ± 0.461
2.384GluAsn: 2.384 ± 0.575
2.469GluPro: 2.469 ± 0.489
3.916GluGln: 3.916 ± 0.578
4.172GluArg: 4.172 ± 0.524
3.916GluSer: 3.916 ± 0.691
4.086GluThr: 4.086 ± 0.532
4.768GluVal: 4.768 ± 0.844
1.362GluTrp: 1.362 ± 0.324
2.895GluTyr: 2.895 ± 0.558
0.0GluXaa: 0.0 ± 0.0
Phe
2.895PheAla: 2.895 ± 0.53
0.426PheCys: 0.426 ± 0.199
2.809PheAsp: 2.809 ± 0.508
1.873PheGlu: 1.873 ± 0.369
1.277PhePhe: 1.277 ± 0.333
2.469PheGly: 2.469 ± 0.556
0.681PheHis: 0.681 ± 0.246
1.873PheIle: 1.873 ± 0.409
3.405PheLys: 3.405 ± 0.637
3.15PheLeu: 3.15 ± 0.448
0.851PheMet: 0.851 ± 0.251
2.639PheAsn: 2.639 ± 0.482
1.532PhePro: 1.532 ± 0.384
0.766PheGln: 0.766 ± 0.294
1.447PheArg: 1.447 ± 0.307
2.214PheSer: 2.214 ± 0.33
2.469PheThr: 2.469 ± 0.4
2.98PheVal: 2.98 ± 0.572
0.255PheTrp: 0.255 ± 0.132
1.532PheTyr: 1.532 ± 0.349
0.0PheXaa: 0.0 ± 0.0
Gly
6.555GlyAla: 6.555 ± 1.058
0.511GlyCys: 0.511 ± 0.249
5.534GlyAsp: 5.534 ± 0.932
5.193GlyGlu: 5.193 ± 0.649
2.128GlyPhe: 2.128 ± 0.301
5.534GlyGly: 5.534 ± 0.842
1.022GlyHis: 1.022 ± 0.281
4.001GlyIle: 4.001 ± 0.62
6.896GlyLys: 6.896 ± 0.709
6.811GlyLeu: 6.811 ± 0.907
2.214GlyMet: 2.214 ± 0.473
2.384GlyAsn: 2.384 ± 0.408
1.703GlyPro: 1.703 ± 0.385
3.32GlyGln: 3.32 ± 0.483
5.364GlyArg: 5.364 ± 0.567
6.13GlySer: 6.13 ± 0.853
4.257GlyThr: 4.257 ± 0.632
5.193GlyVal: 5.193 ± 0.718
1.362GlyTrp: 1.362 ± 0.317
4.172GlyTyr: 4.172 ± 0.667
0.0GlyXaa: 0.0 ± 0.0
His
0.766HisAla: 0.766 ± 0.241
0.255HisCys: 0.255 ± 0.164
1.022HisAsp: 1.022 ± 0.419
1.447HisGlu: 1.447 ± 0.408
0.596HisPhe: 0.596 ± 0.244
1.022HisGly: 1.022 ± 0.295
0.341HisHis: 0.341 ± 0.158
0.851HisIle: 0.851 ± 0.233
1.022HisLys: 1.022 ± 0.304
1.788HisLeu: 1.788 ± 0.391
0.511HisMet: 0.511 ± 0.195
0.426HisAsn: 0.426 ± 0.163
0.511HisPro: 0.511 ± 0.18
0.511HisGln: 0.511 ± 0.221
0.936HisArg: 0.936 ± 0.226
0.681HisSer: 0.681 ± 0.207
1.107HisThr: 1.107 ± 0.226
0.936HisVal: 0.936 ± 0.214
0.511HisTrp: 0.511 ± 0.213
0.681HisTyr: 0.681 ± 0.259
0.0HisXaa: 0.0 ± 0.0
Ile
3.576IleAla: 3.576 ± 0.601
0.681IleCys: 0.681 ± 0.286
3.491IleAsp: 3.491 ± 0.558
2.809IleGlu: 2.809 ± 0.391
0.851IlePhe: 0.851 ± 0.257
4.342IleGly: 4.342 ± 0.59
1.022IleHis: 1.022 ± 0.29
2.043IleIle: 2.043 ± 0.387
3.32IleLys: 3.32 ± 0.582
3.405IleLeu: 3.405 ± 0.543
1.362IleMet: 1.362 ± 0.349
2.639IleAsn: 2.639 ± 0.608
2.214IlePro: 2.214 ± 0.492
1.873IleGln: 1.873 ± 0.667
3.32IleArg: 3.32 ± 0.508
2.469IleSer: 2.469 ± 0.494
3.405IleThr: 3.405 ± 0.458
3.746IleVal: 3.746 ± 0.583
0.596IleTrp: 0.596 ± 0.223
1.532IleTyr: 1.532 ± 0.306
0.0IleXaa: 0.0 ± 0.0
Lys
7.747LysAla: 7.747 ± 0.777
0.596LysCys: 0.596 ± 0.297
3.576LysAsp: 3.576 ± 0.469
4.172LysGlu: 4.172 ± 0.516
2.724LysPhe: 2.724 ± 0.572
4.342LysGly: 4.342 ± 0.666
1.192LysHis: 1.192 ± 0.397
2.214LysIle: 2.214 ± 0.406
4.086LysLys: 4.086 ± 0.938
4.768LysLeu: 4.768 ± 0.483
1.958LysMet: 1.958 ± 0.36
2.724LysAsn: 2.724 ± 0.447
2.895LysPro: 2.895 ± 0.565
2.469LysGln: 2.469 ± 0.576
3.831LysArg: 3.831 ± 0.598
4.938LysSer: 4.938 ± 0.576
3.405LysThr: 3.405 ± 0.494
5.023LysVal: 5.023 ± 0.59
1.192LysTrp: 1.192 ± 0.273
2.469LysTyr: 2.469 ± 0.418
0.0LysXaa: 0.0 ± 0.0
Leu
7.918LeuAla: 7.918 ± 0.915
0.255LeuCys: 0.255 ± 0.13
4.257LeuAsp: 4.257 ± 0.527
6.726LeuGlu: 6.726 ± 0.775
2.724LeuPhe: 2.724 ± 0.414
5.023LeuGly: 5.023 ± 0.626
1.192LeuHis: 1.192 ± 0.308
3.15LeuIle: 3.15 ± 0.5
6.726LeuLys: 6.726 ± 0.839
4.853LeuLeu: 4.853 ± 0.698
2.469LeuMet: 2.469 ± 0.474
4.427LeuAsn: 4.427 ± 0.89
3.32LeuPro: 3.32 ± 0.425
4.172LeuGln: 4.172 ± 0.608
4.172LeuArg: 4.172 ± 0.467
4.768LeuSer: 4.768 ± 0.64
5.278LeuThr: 5.278 ± 0.813
5.619LeuVal: 5.619 ± 0.582
0.681LeuTrp: 0.681 ± 0.247
2.128LeuTyr: 2.128 ± 0.491
0.0LeuXaa: 0.0 ± 0.0
Met
3.065MetAla: 3.065 ± 0.477
0.341MetCys: 0.341 ± 0.196
1.277MetAsp: 1.277 ± 0.318
2.043MetGlu: 2.043 ± 0.27
0.936MetPhe: 0.936 ± 0.326
2.809MetGly: 2.809 ± 0.446
0.341MetHis: 0.341 ± 0.173
1.192MetIle: 1.192 ± 0.271
1.022MetLys: 1.022 ± 0.243
2.384MetLeu: 2.384 ± 0.436
0.596MetMet: 0.596 ± 0.169
1.022MetAsn: 1.022 ± 0.295
0.851MetPro: 0.851 ± 0.28
0.936MetGln: 0.936 ± 0.313
1.277MetArg: 1.277 ± 0.258
1.958MetSer: 1.958 ± 0.4
2.299MetThr: 2.299 ± 0.367
2.554MetVal: 2.554 ± 0.442
0.255MetTrp: 0.255 ± 0.136
0.766MetTyr: 0.766 ± 0.22
0.0MetXaa: 0.0 ± 0.0
Asn
3.916AsnAla: 3.916 ± 0.857
0.426AsnCys: 0.426 ± 0.223
2.469AsnAsp: 2.469 ± 0.507
2.299AsnGlu: 2.299 ± 0.464
1.703AsnPhe: 1.703 ± 0.277
4.172AsnGly: 4.172 ± 0.537
0.511AsnHis: 0.511 ± 0.172
2.214AsnIle: 2.214 ± 0.381
1.873AsnLys: 1.873 ± 0.382
3.15AsnLeu: 3.15 ± 0.669
1.277AsnMet: 1.277 ± 0.367
1.873AsnAsn: 1.873 ± 0.362
2.384AsnPro: 2.384 ± 0.455
1.532AsnGln: 1.532 ± 0.351
2.214AsnArg: 2.214 ± 0.512
3.065AsnSer: 3.065 ± 0.544
1.958AsnThr: 1.958 ± 0.519
2.98AsnVal: 2.98 ± 0.42
0.17AsnTrp: 0.17 ± 0.113
1.277AsnTyr: 1.277 ± 0.352
0.0AsnXaa: 0.0 ± 0.0
Pro
2.895ProAla: 2.895 ± 0.5
0.426ProCys: 0.426 ± 0.217
2.128ProAsp: 2.128 ± 0.346
3.491ProGlu: 3.491 ± 0.595
1.618ProPhe: 1.618 ± 0.283
1.958ProGly: 1.958 ± 0.36
0.681ProHis: 0.681 ± 0.22
1.873ProIle: 1.873 ± 0.421
3.15ProLys: 3.15 ± 0.723
2.384ProLeu: 2.384 ± 0.53
1.022ProMet: 1.022 ± 0.257
2.469ProAsn: 2.469 ± 0.401
0.936ProPro: 0.936 ± 0.331
2.043ProGln: 2.043 ± 0.493
1.703ProArg: 1.703 ± 0.357
2.639ProSer: 2.639 ± 0.395
2.895ProThr: 2.895 ± 0.364
3.065ProVal: 3.065 ± 0.416
0.766ProTrp: 0.766 ± 0.261
1.022ProTyr: 1.022 ± 0.285
0.0ProXaa: 0.0 ± 0.0
Gln
3.831GlnAla: 3.831 ± 0.725
0.17GlnCys: 0.17 ± 0.133
4.001GlnAsp: 4.001 ± 0.696
2.469GlnGlu: 2.469 ± 0.539
1.788GlnPhe: 1.788 ± 0.366
2.895GlnGly: 2.895 ± 0.459
0.341GlnHis: 0.341 ± 0.185
1.362GlnIle: 1.362 ± 0.3
2.128GlnLys: 2.128 ± 0.575
5.108GlnLeu: 5.108 ± 0.912
1.447GlnMet: 1.447 ± 0.419
1.873GlnAsn: 1.873 ± 0.532
1.447GlnPro: 1.447 ± 0.402
1.618GlnGln: 1.618 ± 0.453
2.98GlnArg: 2.98 ± 0.677
2.384GlnSer: 2.384 ± 0.487
2.128GlnThr: 2.128 ± 0.53
2.299GlnVal: 2.299 ± 0.429
0.681GlnTrp: 0.681 ± 0.206
1.107GlnTyr: 1.107 ± 0.385
0.0GlnXaa: 0.0 ± 0.0
Arg
4.853ArgAla: 4.853 ± 0.795
0.681ArgCys: 0.681 ± 0.254
3.831ArgAsp: 3.831 ± 0.473
4.342ArgGlu: 4.342 ± 0.587
2.895ArgPhe: 2.895 ± 0.388
3.491ArgGly: 3.491 ± 0.41
0.596ArgHis: 0.596 ± 0.236
3.065ArgIle: 3.065 ± 0.672
2.895ArgLys: 2.895 ± 0.614
5.449ArgLeu: 5.449 ± 0.677
0.936ArgMet: 0.936 ± 0.321
2.043ArgAsn: 2.043 ± 0.396
1.618ArgPro: 1.618 ± 0.31
2.384ArgGln: 2.384 ± 0.46
1.958ArgArg: 1.958 ± 0.276
3.32ArgSer: 3.32 ± 0.576
2.299ArgThr: 2.299 ± 0.394
3.235ArgVal: 3.235 ± 0.692
1.192ArgTrp: 1.192 ± 0.371
1.618ArgTyr: 1.618 ± 0.381
0.0ArgXaa: 0.0 ± 0.0
Ser
4.938SerAla: 4.938 ± 0.685
0.681SerCys: 0.681 ± 0.251
4.682SerAsp: 4.682 ± 0.544
3.32SerGlu: 3.32 ± 0.51
3.065SerPhe: 3.065 ± 0.504
6.13SerGly: 6.13 ± 1.004
1.958SerHis: 1.958 ± 0.374
2.554SerIle: 2.554 ± 0.597
3.065SerLys: 3.065 ± 0.469
3.491SerLeu: 3.491 ± 0.57
1.618SerMet: 1.618 ± 0.491
1.618SerAsn: 1.618 ± 0.345
2.98SerPro: 2.98 ± 0.6
2.724SerGln: 2.724 ± 0.609
3.32SerArg: 3.32 ± 0.539
3.576SerSer: 3.576 ± 0.555
3.661SerThr: 3.661 ± 0.614
4.172SerVal: 4.172 ± 0.592
1.022SerTrp: 1.022 ± 0.35
2.639SerTyr: 2.639 ± 0.514
0.0SerXaa: 0.0 ± 0.0
Thr
3.32ThrAla: 3.32 ± 0.649
0.255ThrCys: 0.255 ± 0.127
3.065ThrAsp: 3.065 ± 0.561
5.278ThrGlu: 5.278 ± 0.748
2.214ThrPhe: 2.214 ± 0.376
5.364ThrGly: 5.364 ± 0.667
0.511ThrHis: 0.511 ± 0.195
3.491ThrIle: 3.491 ± 0.513
3.746ThrLys: 3.746 ± 0.661
4.597ThrLeu: 4.597 ± 0.599
1.277ThrMet: 1.277 ± 0.346
2.128ThrAsn: 2.128 ± 0.48
3.576ThrPro: 3.576 ± 0.451
2.128ThrGln: 2.128 ± 0.448
2.554ThrArg: 2.554 ± 0.49
2.384ThrSer: 2.384 ± 0.506
3.32ThrThr: 3.32 ± 0.464
5.278ThrVal: 5.278 ± 0.696
0.511ThrTrp: 0.511 ± 0.133
1.703ThrTyr: 1.703 ± 0.311
0.0ThrXaa: 0.0 ± 0.0
Val
5.193ValAla: 5.193 ± 0.791
0.341ValCys: 0.341 ± 0.175
3.15ValAsp: 3.15 ± 0.44
5.364ValGlu: 5.364 ± 0.762
2.809ValPhe: 2.809 ± 0.555
5.193ValGly: 5.193 ± 0.729
1.022ValHis: 1.022 ± 0.475
3.916ValIle: 3.916 ± 0.512
5.534ValLys: 5.534 ± 0.758
4.853ValLeu: 4.853 ± 0.618
2.384ValMet: 2.384 ± 0.431
2.98ValAsn: 2.98 ± 0.567
3.15ValPro: 3.15 ± 0.513
3.916ValGln: 3.916 ± 0.633
3.746ValArg: 3.746 ± 0.622
5.278ValSer: 5.278 ± 0.73
4.086ValThr: 4.086 ± 0.574
5.874ValVal: 5.874 ± 0.91
0.851ValTrp: 0.851 ± 0.289
2.299ValTyr: 2.299 ± 0.427
0.0ValXaa: 0.0 ± 0.0
Trp
0.511TrpAla: 0.511 ± 0.175
0.255TrpCys: 0.255 ± 0.165
0.511TrpAsp: 0.511 ± 0.181
0.936TrpGlu: 0.936 ± 0.287
0.596TrpPhe: 0.596 ± 0.227
1.192TrpGly: 1.192 ± 0.35
0.426TrpHis: 0.426 ± 0.173
0.511TrpIle: 0.511 ± 0.213
1.362TrpLys: 1.362 ± 0.36
2.128TrpLeu: 2.128 ± 0.405
0.085TrpMet: 0.085 ± 0.101
0.851TrpAsn: 0.851 ± 0.265
0.255TrpPro: 0.255 ± 0.135
0.681TrpGln: 0.681 ± 0.308
0.851TrpArg: 0.851 ± 0.245
1.022TrpSer: 1.022 ± 0.406
0.766TrpThr: 0.766 ± 0.237
1.277TrpVal: 1.277 ± 0.321
0.255TrpTrp: 0.255 ± 0.129
0.341TrpTyr: 0.341 ± 0.161
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.001TyrAla: 4.001 ± 0.649
0.341TyrCys: 0.341 ± 0.176
1.958TyrAsp: 1.958 ± 0.362
1.618TyrGlu: 1.618 ± 0.45
1.362TyrPhe: 1.362 ± 0.353
2.98TyrGly: 2.98 ± 0.44
0.426TyrHis: 0.426 ± 0.163
1.788TyrIle: 1.788 ± 0.503
2.043TyrLys: 2.043 ± 0.366
2.214TyrLeu: 2.214 ± 0.408
1.107TyrMet: 1.107 ± 0.288
1.362TyrAsn: 1.362 ± 0.39
1.788TyrPro: 1.788 ± 0.484
1.788TyrGln: 1.788 ± 0.526
2.469TyrArg: 2.469 ± 0.477
1.873TyrSer: 1.873 ± 0.416
1.788TyrThr: 1.788 ± 0.431
1.788TyrVal: 1.788 ± 0.484
0.596TyrTrp: 0.596 ± 0.244
1.192TyrTyr: 1.192 ± 0.327
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 45 proteins (11747 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski