Amino acid dipeptide frequency for Escherichia phage ECBP1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
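As an illustration, a minimal Python sketch of such a calculation is given below. It is not the code used to build this page. It assumes that dipeptides are counted as overlapping pairs of neighbouring residues within each protein, that any residue outside the 20 standard one-letter codes is mapped to X (Xaa), and that frequencies are normalised by the total number of counted pairs. The function names and the two toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_counts(proteins):
    """Count overlapping adjacent residue pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Pairs never span two proteins because each sequence is handled alone.
        for first, second in zip(cleaned, cleaned[1:]):
            counts[first + second] += 1
    return counts


def dipeptide_per_mille(proteins):
    """Return all 441 dipeptide frequencies in per mille."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input (hypothetical sequences, not taken from this proteome).
freqs = dipeptide_per_mille(["MAAK", "MKAA"])
print(len(freqs), round(freqs["AA"], 3))
```

The result is a dictionary keyed by one-letter dipeptides such as "AA"; dipeptides that never occur are reported as 0.0, which mirrors the all-zero Xaa row and column in the table.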

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
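The arithmetic behind the expected value and the per mille unit can be sketched in a few lines of Python. The helper names are hypothetical, and the comparison ignores the error estimates; it only checks each value against the uniform expectation of 1/400. The three example values are taken from the table on this page.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def uniform_expectation_per_mille(n_residue_types=20):
    """Expected frequency if every ordered pair were equally likely."""
    return 1000.0 / (n_residue_types ** 2)


def classify(observed, expected):
    """Label a dipeptide relative to the uniform expectation."""
    if observed > expected:
        return "over"
    if observed < expected:
        return "under"
    return "as expected"


expected = uniform_expectation_per_mille()  # 1000 / 400 = 2.5 per mille
examples = {"AlaAla": 7.991, "CysCys": 0.047, "LeuTyr": 2.491}
labels = {name: classify(value, expected) for name, value in examples.items()}
print(expected, labels)
```

With 21 residue types (Xaa included) the same function gives 1000/441, roughly 2.27‰.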

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.991 ± 0.965
AlaCys: 0.564 ± 0.174
AlaAsp: 5.03 ± 0.549
AlaGlu: 5.5 ± 0.459
AlaPhe: 2.256 ± 0.352
AlaGly: 6.346 ± 0.75
AlaHis: 1.081 ± 0.194
AlaIle: 4.56 ± 0.524
AlaLys: 5.782 ± 0.617
AlaLeu: 7.897 ± 1.134
AlaMet: 2.82 ± 0.507
AlaAsn: 4.983 ± 0.514
AlaPro: 2.397 ± 0.439
AlaGln: 4.137 ± 0.732
AlaArg: 3.713 ± 0.512
AlaSer: 4.795 ± 0.431
AlaThr: 5.782 ± 0.689
AlaVal: 5.97 ± 0.5
AlaTrp: 0.893 ± 0.161
AlaTyr: 3.478 ± 0.371
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.611 ± 0.161
CysCys: 0.047 ± 0.049
CysAsp: 0.423 ± 0.131
CysGlu: 0.564 ± 0.186
CysPhe: 0.423 ± 0.127
CysGly: 0.423 ± 0.16
CysHis: 0.188 ± 0.088
CysIle: 0.752 ± 0.2
CysLys: 0.752 ± 0.224
CysLeu: 0.846 ± 0.287
CysMet: 0.235 ± 0.089
CysAsn: 0.517 ± 0.186
CysPro: 0.282 ± 0.124
CysGln: 0.141 ± 0.086
CysArg: 0.141 ± 0.093
CysSer: 0.423 ± 0.146
CysThr: 0.517 ± 0.196
CysVal: 0.846 ± 0.268
CysTrp: 0.141 ± 0.077
CysTyr: 0.188 ± 0.088
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.936 ± 0.689
AspCys: 0.752 ± 0.242
AspAsp: 3.666 ± 0.453
AspGlu: 3.525 ± 0.475
AspPhe: 2.068 ± 0.332
AspGly: 3.008 ± 0.387
AspHis: 0.658 ± 0.164
AspIle: 4.513 ± 0.46
AspLys: 3.29 ± 0.377
AspLeu: 4.842 ± 0.594
AspMet: 1.645 ± 0.305
AspAsn: 2.35 ± 0.283
AspPro: 3.055 ± 0.336
AspGln: 2.021 ± 0.425
AspArg: 2.538 ± 0.315
AspSer: 4.184 ± 0.312
AspThr: 3.901 ± 0.443
AspVal: 3.572 ± 0.454
AspTrp: 0.705 ± 0.202
AspTyr: 2.115 ± 0.342
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.816 ± 0.927
GluCys: 0.47 ± 0.189
GluAsp: 3.854 ± 0.519
GluGlu: 5.641 ± 0.765
GluPhe: 2.726 ± 0.363
GluGly: 4.137 ± 0.471
GluHis: 0.846 ± 0.182
GluIle: 3.525 ± 0.387
GluLys: 4.089 ± 0.552
GluLeu: 6.158 ± 0.508
GluMet: 2.162 ± 0.243
GluAsn: 3.055 ± 0.354
GluPro: 3.29 ± 0.44
GluGln: 3.008 ± 0.487
GluArg: 2.068 ± 0.337
GluSer: 3.055 ± 0.375
GluThr: 3.008 ± 0.394
GluVal: 4.701 ± 0.619
GluTrp: 0.705 ± 0.129
GluTyr: 2.679 ± 0.378
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.679 ± 0.35
PheCys: 0.517 ± 0.182
PheAsp: 2.397 ± 0.328
PheGlu: 1.833 ± 0.299
PhePhe: 0.987 ± 0.221
PheGly: 2.585 ± 0.291
PheHis: 0.47 ± 0.132
PheIle: 2.726 ± 0.391
PheLys: 1.88 ± 0.37
PheLeu: 2.632 ± 0.431
PheMet: 1.551 ± 0.283
PheAsn: 2.726 ± 0.415
PhePro: 1.128 ± 0.218
PheGln: 1.457 ± 0.253
PheArg: 1.927 ± 0.265
PheSer: 2.209 ± 0.317
PheThr: 2.914 ± 0.462
PheVal: 1.927 ± 0.374
PheTrp: 0.376 ± 0.147
PheTyr: 1.457 ± 0.219
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.218 ± 0.58
GlyCys: 0.94 ± 0.24
GlyAsp: 2.961 ± 0.376
GlyGlu: 3.666 ± 0.311
GlyPhe: 3.102 ± 0.504
GlyGly: 3.854 ± 0.482
GlyHis: 1.081 ± 0.213
GlyIle: 3.901 ± 0.331
GlyLys: 6.252 ± 0.522
GlyLeu: 5.077 ± 0.56
GlyMet: 2.35 ± 0.28
GlyAsn: 4.748 ± 0.479
GlyPro: 1.034 ± 0.203
GlyGln: 2.256 ± 0.367
GlyArg: 2.068 ± 0.384
GlySer: 4.748 ± 0.549
GlyThr: 4.56 ± 0.435
GlyVal: 4.278 ± 0.481
GlyTrp: 1.081 ± 0.26
GlyTyr: 3.008 ± 0.43
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.269 ± 0.19
HisCys: 0.047 ± 0.046
HisAsp: 1.081 ± 0.233
HisGlu: 1.128 ± 0.24
HisPhe: 0.47 ± 0.16
HisGly: 0.658 ± 0.213
HisHis: 0.564 ± 0.182
HisIle: 1.081 ± 0.241
HisLys: 1.692 ± 0.408
HisLeu: 2.209 ± 0.34
HisMet: 0.376 ± 0.121
HisAsn: 0.893 ± 0.159
HisPro: 0.705 ± 0.209
HisGln: 0.423 ± 0.159
HisArg: 0.564 ± 0.193
HisSer: 1.41 ± 0.323
HisThr: 0.799 ± 0.201
HisVal: 0.799 ± 0.204
HisTrp: 0.376 ± 0.13
HisTyr: 1.034 ± 0.183
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.701 ± 0.495
IleCys: 0.47 ± 0.144
IleAsp: 3.666 ± 0.514
IleGlu: 3.901 ± 0.383
IlePhe: 1.645 ± 0.326
IleGly: 3.478 ± 0.381
IleHis: 1.269 ± 0.264
IleIle: 3.102 ± 0.542
IleLys: 3.901 ± 0.428
IleLeu: 3.995 ± 0.463
IleMet: 1.222 ± 0.19
IleAsn: 3.384 ± 0.435
IlePro: 3.196 ± 0.488
IleGln: 2.397 ± 0.348
IleArg: 3.008 ± 0.392
IleSer: 3.29 ± 0.332
IleThr: 4.466 ± 0.459
IleVal: 3.196 ± 0.423
IleTrp: 0.423 ± 0.141
IleTyr: 2.397 ± 0.352
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.628 ± 0.762
LysCys: 0.47 ± 0.155
LysAsp: 3.196 ± 0.349
LysGlu: 4.842 ± 0.531
LysPhe: 2.491 ± 0.342
LysGly: 3.713 ± 0.439
LysHis: 1.457 ± 0.347
LysIle: 2.773 ± 0.358
LysLys: 3.384 ± 0.488
LysLeu: 6.769 ± 0.513
LysMet: 1.88 ± 0.287
LysAsn: 3.478 ± 0.317
LysPro: 3.055 ± 0.523
LysGln: 3.102 ± 0.382
LysArg: 2.632 ± 0.349
LysSer: 4.466 ± 0.479
LysThr: 4.607 ± 0.354
LysVal: 4.231 ± 0.341
LysTrp: 0.658 ± 0.202
LysTyr: 2.162 ± 0.421
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.709 ± 0.615
LeuCys: 0.611 ± 0.157
LeuAsp: 4.889 ± 0.494
LeuGlu: 4.466 ± 0.486
LeuPhe: 3.008 ± 0.367
LeuGly: 6.487 ± 0.557
LeuHis: 1.551 ± 0.307
LeuIle: 4.137 ± 0.401
LeuLys: 5.171 ± 0.553
LeuLeu: 5.829 ± 0.489
LeuMet: 2.679 ± 0.343
LeuAsn: 5.171 ± 0.53
LeuPro: 4.56 ± 0.541
LeuGln: 3.008 ± 0.454
LeuArg: 3.807 ± 0.513
LeuSer: 5.171 ± 0.554
LeuThr: 5.547 ± 0.452
LeuVal: 6.017 ± 0.602
LeuTrp: 0.846 ± 0.207
LeuTyr: 2.491 ± 0.379
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.773 ± 0.479
MetCys: 0.094 ± 0.07
MetAsp: 1.269 ± 0.262
MetGlu: 2.115 ± 0.346
MetPhe: 0.517 ± 0.153
MetGly: 1.692 ± 0.258
MetHis: 0.47 ± 0.152
MetIle: 1.786 ± 0.349
MetLys: 2.585 ± 0.406
MetLeu: 2.35 ± 0.35
MetMet: 0.705 ± 0.185
MetAsn: 2.303 ± 0.275
MetPro: 1.081 ± 0.23
MetGln: 1.551 ± 0.291
MetArg: 0.94 ± 0.185
MetSer: 2.538 ± 0.358
MetThr: 2.021 ± 0.347
MetVal: 1.739 ± 0.318
MetTrp: 0.235 ± 0.092
MetTyr: 0.799 ± 0.164
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.466 ± 0.634
AsnCys: 0.282 ± 0.14
AsnAsp: 2.726 ± 0.278
AsnGlu: 3.525 ± 0.462
AsnPhe: 2.068 ± 0.373
AsnGly: 3.995 ± 0.522
AsnHis: 1.363 ± 0.274
AsnIle: 3.149 ± 0.305
AsnLys: 4.372 ± 0.348
AsnLeu: 4.513 ± 0.496
AsnMet: 1.457 ± 0.228
AsnAsn: 3.243 ± 0.356
AsnPro: 3.149 ± 0.407
AsnGln: 3.29 ± 0.398
AsnArg: 3.196 ± 0.306
AsnSer: 2.961 ± 0.435
AsnThr: 3.995 ± 0.649
AsnVal: 3.149 ± 0.393
AsnTrp: 0.94 ± 0.232
AsnTyr: 1.786 ± 0.364
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.713 ± 0.512
ProCys: 0.329 ± 0.134
ProAsp: 2.538 ± 0.428
ProGlu: 4.089 ± 0.566
ProPhe: 2.115 ± 0.332
ProGly: 2.773 ± 0.38
ProHis: 0.376 ± 0.159
ProIle: 2.35 ± 0.31
ProLys: 1.739 ± 0.237
ProLeu: 3.055 ± 0.381
ProMet: 1.316 ± 0.226
ProAsn: 1.833 ± 0.269
ProPro: 1.081 ± 0.285
ProGln: 1.316 ± 0.219
ProArg: 1.363 ± 0.333
ProSer: 2.538 ± 0.307
ProThr: 3.196 ± 0.356
ProVal: 3.854 ± 0.371
ProTrp: 0.705 ± 0.179
ProTyr: 1.41 ± 0.255
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.513 ± 0.633
GlnCys: 0.235 ± 0.101
GlnAsp: 1.927 ± 0.324
GlnGlu: 2.867 ± 0.419
GlnPhe: 1.645 ± 0.25
GlnGly: 2.867 ± 0.42
GlnHis: 0.423 ± 0.129
GlnIle: 1.974 ± 0.245
GlnLys: 2.867 ± 0.466
GlnLeu: 3.76 ± 0.468
GlnMet: 1.269 ± 0.225
GlnAsn: 2.256 ± 0.435
GlnPro: 0.94 ± 0.241
GlnGln: 2.021 ± 0.398
GlnArg: 1.551 ± 0.222
GlnSer: 2.538 ± 0.321
GlnThr: 2.585 ± 0.332
GlnVal: 3.478 ± 0.382
GlnTrp: 0.47 ± 0.153
GlnTyr: 1.692 ± 0.252
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.384 ± 0.543
ArgCys: 0.329 ± 0.15
ArgAsp: 2.397 ± 0.322
ArgGlu: 3.149 ± 0.388
ArgPhe: 1.504 ± 0.231
ArgGly: 2.491 ± 0.293
ArgHis: 0.752 ± 0.231
ArgIle: 2.867 ± 0.503
ArgLys: 3.243 ± 0.301
ArgLeu: 3.854 ± 0.389
ArgMet: 1.316 ± 0.272
ArgAsn: 2.961 ± 0.436
ArgPro: 1.504 ± 0.291
ArgGln: 1.88 ± 0.284
ArgArg: 2.021 ± 0.308
ArgSer: 2.397 ± 0.376
ArgThr: 2.256 ± 0.293
ArgVal: 2.256 ± 0.288
ArgTrp: 0.517 ± 0.143
ArgTyr: 1.269 ± 0.199
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.184 ± 0.434
SerCys: 0.658 ± 0.163
SerAsp: 3.572 ± 0.361
SerGlu: 4.466 ± 0.443
SerPhe: 2.256 ± 0.371
SerGly: 4.936 ± 0.459
SerHis: 0.799 ± 0.19
SerIle: 3.478 ± 0.482
SerLys: 3.76 ± 0.51
SerLeu: 6.064 ± 0.524
SerMet: 1.598 ± 0.316
SerAsn: 2.82 ± 0.347
SerPro: 2.867 ± 0.455
SerGln: 2.115 ± 0.426
SerArg: 2.914 ± 0.382
SerSer: 3.807 ± 0.568
SerThr: 4.184 ± 0.535
SerVal: 3.995 ± 0.496
SerTrp: 0.611 ± 0.174
SerTyr: 2.115 ± 0.35
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.654 ± 0.603
ThrCys: 0.329 ± 0.139
ThrAsp: 3.995 ± 0.353
ThrGlu: 3.76 ± 0.418
ThrPhe: 3.008 ± 0.296
ThrGly: 4.795 ± 0.601
ThrHis: 1.269 ± 0.292
ThrIle: 4.419 ± 0.507
ThrLys: 3.854 ± 0.509
ThrLeu: 5.359 ± 0.446
ThrMet: 1.222 ± 0.244
ThrAsn: 4.231 ± 0.515
ThrPro: 3.337 ± 0.43
ThrGln: 2.35 ± 0.287
ThrArg: 2.585 ± 0.313
ThrSer: 3.807 ± 0.392
ThrThr: 3.384 ± 0.549
ThrVal: 5.077 ± 0.501
ThrTrp: 0.658 ± 0.197
ThrTyr: 2.444 ± 0.414
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.158 ± 0.588
ValCys: 0.564 ± 0.177
ValAsp: 4.278 ± 0.388
ValGlu: 4.278 ± 0.501
ValPhe: 2.162 ± 0.31
ValGly: 4.842 ± 0.559
ValHis: 1.692 ± 0.264
ValIle: 3.29 ± 0.481
ValLys: 3.807 ± 0.431
ValLeu: 4.231 ± 0.459
ValMet: 2.303 ± 0.295
ValAsn: 4.137 ± 0.425
ValPro: 3.102 ± 0.482
ValGln: 3.29 ± 0.473
ValArg: 3.337 ± 0.411
ValSer: 3.854 ± 0.466
ValThr: 4.748 ± 0.659
ValVal: 5.265 ± 0.631
ValTrp: 0.658 ± 0.155
ValTyr: 2.679 ± 0.397
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.799 ± 0.197
TrpCys: 0.188 ± 0.088
TrpAsp: 0.987 ± 0.259
TrpGlu: 0.517 ± 0.213
TrpPhe: 0.329 ± 0.116
TrpGly: 0.47 ± 0.119
TrpHis: 0.376 ± 0.15
TrpIle: 0.517 ± 0.137
TrpLys: 0.658 ± 0.178
TrpLeu: 1.222 ± 0.204
TrpMet: 0.282 ± 0.123
TrpAsn: 0.517 ± 0.192
TrpPro: 0.47 ± 0.14
TrpGln: 0.47 ± 0.123
TrpArg: 0.564 ± 0.171
TrpSer: 0.752 ± 0.189
TrpThr: 0.329 ± 0.113
TrpVal: 1.457 ± 0.278
TrpTrp: 0.047 ± 0.04
TrpTyr: 0.47 ± 0.154
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.773 ± 0.297
TyrCys: 0.564 ± 0.213
TyrAsp: 2.491 ± 0.316
TyrGlu: 2.162 ± 0.327
TyrPhe: 1.598 ± 0.358
TyrGly: 2.632 ± 0.409
TyrHis: 0.893 ± 0.235
TyrIle: 2.256 ± 0.339
TyrLys: 2.914 ± 0.49
TyrLeu: 2.491 ± 0.319
TyrMet: 1.128 ± 0.187
TyrAsn: 2.021 ± 0.316
TyrPro: 1.457 ± 0.327
TyrGln: 1.645 ± 0.226
TyrArg: 1.41 ± 0.297
TyrSer: 2.256 ± 0.366
TyrThr: 1.739 ± 0.261
TyrVal: 2.867 ± 0.394
TyrTrp: 0.376 ± 0.124
TyrTyr: 1.175 ± 0.32
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 82 proteins (21275 amino acids)

Note: The error was estimated by bootstrapping (100 resamplings) at the protein level.
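The page does not describe the bootstrap procedure in detail, so the following Python sketch is only one plausible reading of it. It assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each of 100 resampled proteomes, and that the reported error is the standard deviation of those estimates. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    count = 0
    total = 0
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            total += 1
            if first + second == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences, not taken from this proteome).
toy = ["MAAK", "MKAA", "MAKA", "AAAA"]
print(round(pair_per_mille(toy, "AA"), 3), round(bootstrap_error(toy, "AA"), 3))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much the frequency depends on which proteins happen to be in the set.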

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski