Amino acid dipepetide frequency for Erwinia phage vB_EhrS_49

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.78AlaAla: 12.78 ± 2.101
0.702AlaCys: 0.702 ± 0.235
7.022AlaAsp: 7.022 ± 0.65
8.005AlaGlu: 8.005 ± 1.377
3.792AlaPhe: 3.792 ± 0.546
7.865AlaGly: 7.865 ± 0.97
1.124AlaHis: 1.124 ± 0.288
6.53AlaIle: 6.53 ± 0.595
6.109AlaLys: 6.109 ± 0.828
7.654AlaLeu: 7.654 ± 0.814
3.651AlaMet: 3.651 ± 0.577
3.3AlaAsn: 3.3 ± 0.395
2.247AlaPro: 2.247 ± 0.435
5.828AlaGln: 5.828 ± 1.069
4.354AlaArg: 4.354 ± 0.581
5.828AlaSer: 5.828 ± 1.017
6.109AlaThr: 6.109 ± 0.779
6.25AlaVal: 6.25 ± 0.722
2.247AlaTrp: 2.247 ± 0.437
2.387AlaTyr: 2.387 ± 0.468
0.0AlaXaa: 0.0 ± 0.0
Cys
1.334CysAla: 1.334 ± 0.346
0.421CysCys: 0.421 ± 0.159
0.913CysAsp: 0.913 ± 0.257
0.913CysGlu: 0.913 ± 0.298
0.211CysPhe: 0.211 ± 0.129
0.913CysGly: 0.913 ± 0.262
0.351CysHis: 0.351 ± 0.16
0.772CysIle: 0.772 ± 0.214
1.053CysLys: 1.053 ± 0.352
0.913CysLeu: 0.913 ± 0.25
0.281CysMet: 0.281 ± 0.129
0.702CysAsn: 0.702 ± 0.258
0.702CysPro: 0.702 ± 0.248
0.281CysGln: 0.281 ± 0.131
0.772CysArg: 0.772 ± 0.257
1.124CysSer: 1.124 ± 0.304
0.562CysThr: 0.562 ± 0.182
0.421CysVal: 0.421 ± 0.146
0.281CysTrp: 0.281 ± 0.131
0.281CysTyr: 0.281 ± 0.127
0.0CysXaa: 0.0 ± 0.0
Asp
6.952AspAla: 6.952 ± 0.648
1.053AspCys: 1.053 ± 0.366
4.354AspAsp: 4.354 ± 0.672
4.213AspGlu: 4.213 ± 0.577
1.404AspPhe: 1.404 ± 0.356
4.635AspGly: 4.635 ± 0.623
0.843AspHis: 0.843 ± 0.233
3.792AspIle: 3.792 ± 0.399
4.003AspLys: 4.003 ± 0.613
5.126AspLeu: 5.126 ± 0.581
1.685AspMet: 1.685 ± 0.302
2.879AspAsn: 2.879 ± 0.542
2.387AspPro: 2.387 ± 0.469
1.826AspGln: 1.826 ± 0.395
3.792AspArg: 3.792 ± 0.503
4.003AspSer: 4.003 ± 0.477
3.23AspThr: 3.23 ± 0.538
3.019AspVal: 3.019 ± 0.526
1.545AspTrp: 1.545 ± 0.349
2.739AspTyr: 2.739 ± 0.504
0.0AspXaa: 0.0 ± 0.0
Glu
7.514GluAla: 7.514 ± 1.065
0.843GluCys: 0.843 ± 0.244
2.739GluAsp: 2.739 ± 0.501
3.862GluGlu: 3.862 ± 0.588
2.317GluPhe: 2.317 ± 0.377
4.003GluGly: 4.003 ± 0.67
1.264GluHis: 1.264 ± 0.315
3.792GluIle: 3.792 ± 0.51
3.792GluLys: 3.792 ± 0.603
3.792GluLeu: 3.792 ± 0.478
1.966GluMet: 1.966 ± 0.374
3.09GluAsn: 3.09 ± 0.548
1.755GluPro: 1.755 ± 0.364
4.354GluGln: 4.354 ± 0.709
4.494GluArg: 4.494 ± 0.847
3.371GluSer: 3.371 ± 0.462
3.09GluThr: 3.09 ± 0.796
4.213GluVal: 4.213 ± 0.428
1.404GluTrp: 1.404 ± 0.299
1.615GluTyr: 1.615 ± 0.299
0.0GluXaa: 0.0 ± 0.0
Phe
2.317PheAla: 2.317 ± 0.405
0.562PheCys: 0.562 ± 0.182
2.739PheAsp: 2.739 ± 0.379
2.879PheGlu: 2.879 ± 0.449
1.264PhePhe: 1.264 ± 0.413
2.107PheGly: 2.107 ± 0.383
0.632PheHis: 0.632 ± 0.178
1.826PheIle: 1.826 ± 0.387
1.826PheLys: 1.826 ± 0.374
1.755PheLeu: 1.755 ± 0.385
0.632PheMet: 0.632 ± 0.22
1.545PheAsn: 1.545 ± 0.389
0.632PhePro: 0.632 ± 0.194
1.545PheGln: 1.545 ± 0.293
3.019PheArg: 3.019 ± 0.57
1.896PheSer: 1.896 ± 0.334
2.949PheThr: 2.949 ± 0.532
2.317PheVal: 2.317 ± 0.343
0.492PheTrp: 0.492 ± 0.157
1.264PheTyr: 1.264 ± 0.237
0.0PheXaa: 0.0 ± 0.0
Gly
5.407GlyAla: 5.407 ± 0.598
1.264GlyCys: 1.264 ± 0.347
3.371GlyAsp: 3.371 ± 0.329
4.354GlyGlu: 4.354 ± 0.407
3.16GlyPhe: 3.16 ± 0.431
4.986GlyGly: 4.986 ± 0.674
1.053GlyHis: 1.053 ± 0.268
4.143GlyIle: 4.143 ± 0.476
5.196GlyLys: 5.196 ± 0.638
5.196GlyLeu: 5.196 ± 0.73
1.545GlyMet: 1.545 ± 0.407
3.09GlyAsn: 3.09 ± 0.469
1.334GlyPro: 1.334 ± 0.29
2.317GlyGln: 2.317 ± 0.333
4.003GlyArg: 4.003 ± 0.519
4.354GlySer: 4.354 ± 0.663
5.056GlyThr: 5.056 ± 0.835
5.407GlyVal: 5.407 ± 0.6
0.913GlyTrp: 0.913 ± 0.238
3.019GlyTyr: 3.019 ± 0.581
0.0GlyXaa: 0.0 ± 0.0
His
1.545HisAla: 1.545 ± 0.337
0.421HisCys: 0.421 ± 0.207
1.124HisAsp: 1.124 ± 0.293
0.913HisGlu: 0.913 ± 0.266
0.562HisPhe: 0.562 ± 0.174
0.913HisGly: 0.913 ± 0.272
0.281HisHis: 0.281 ± 0.16
0.772HisIle: 0.772 ± 0.238
0.562HisLys: 0.562 ± 0.194
1.615HisLeu: 1.615 ± 0.321
0.14HisMet: 0.14 ± 0.097
0.211HisAsn: 0.211 ± 0.11
0.913HisPro: 0.913 ± 0.268
0.702HisGln: 0.702 ± 0.279
0.772HisArg: 0.772 ± 0.224
0.913HisSer: 0.913 ± 0.317
0.702HisThr: 0.702 ± 0.352
0.843HisVal: 0.843 ± 0.229
0.562HisTrp: 0.562 ± 0.198
0.632HisTyr: 0.632 ± 0.205
0.0HisXaa: 0.0 ± 0.0
Ile
6.039IleAla: 6.039 ± 0.721
0.632IleCys: 0.632 ± 0.212
3.932IleAsp: 3.932 ± 0.526
4.003IleGlu: 4.003 ± 0.576
1.615IlePhe: 1.615 ± 0.304
4.986IleGly: 4.986 ± 0.601
0.632IleHis: 0.632 ± 0.221
3.792IleIle: 3.792 ± 0.485
3.441IleLys: 3.441 ± 0.499
3.16IleLeu: 3.16 ± 0.546
0.421IleMet: 0.421 ± 0.218
3.3IleAsn: 3.3 ± 0.614
1.966IlePro: 1.966 ± 0.306
2.458IleGln: 2.458 ± 0.385
2.317IleArg: 2.317 ± 0.406
4.915IleSer: 4.915 ± 0.655
5.337IleThr: 5.337 ± 0.642
2.598IleVal: 2.598 ± 0.49
0.913IleTrp: 0.913 ± 0.257
0.983IleTyr: 0.983 ± 0.274
0.0IleXaa: 0.0 ± 0.0
Lys
7.303LysAla: 7.303 ± 0.806
0.913LysCys: 0.913 ± 0.368
3.932LysAsp: 3.932 ± 0.581
3.23LysGlu: 3.23 ± 0.477
1.826LysPhe: 1.826 ± 0.379
3.371LysGly: 3.371 ± 0.427
1.124LysHis: 1.124 ± 0.309
1.685LysIle: 1.685 ± 0.378
4.564LysLys: 4.564 ± 0.778
4.705LysLeu: 4.705 ± 0.543
1.053LysMet: 1.053 ± 0.305
2.879LysAsn: 2.879 ± 0.375
3.511LysPro: 3.511 ± 0.618
3.581LysGln: 3.581 ± 0.474
3.932LysArg: 3.932 ± 0.609
4.143LysSer: 4.143 ± 0.535
2.879LysThr: 2.879 ± 0.505
3.722LysVal: 3.722 ± 0.494
0.772LysTrp: 0.772 ± 0.229
1.966LysTyr: 1.966 ± 0.421
0.0LysXaa: 0.0 ± 0.0
Leu
8.426LeuAla: 8.426 ± 1.062
0.913LeuCys: 0.913 ± 0.237
4.775LeuAsp: 4.775 ± 0.537
3.932LeuGlu: 3.932 ± 0.403
1.896LeuPhe: 1.896 ± 0.381
4.213LeuGly: 4.213 ± 0.509
0.913LeuHis: 0.913 ± 0.219
4.283LeuIle: 4.283 ± 0.528
3.932LeuLys: 3.932 ± 0.546
5.407LeuLeu: 5.407 ± 0.633
2.528LeuMet: 2.528 ± 0.398
4.283LeuAsn: 4.283 ± 0.551
2.387LeuPro: 2.387 ± 0.499
3.09LeuGln: 3.09 ± 0.47
4.143LeuArg: 4.143 ± 0.471
5.547LeuSer: 5.547 ± 0.817
5.828LeuThr: 5.828 ± 0.571
4.283LeuVal: 4.283 ± 0.628
0.702LeuTrp: 0.702 ± 0.261
2.317LeuTyr: 2.317 ± 0.375
0.0LeuXaa: 0.0 ± 0.0
Met
2.739MetAla: 2.739 ± 0.404
0.492MetCys: 0.492 ± 0.177
1.896MetAsp: 1.896 ± 0.413
1.404MetGlu: 1.404 ± 0.308
0.632MetPhe: 0.632 ± 0.23
1.124MetGly: 1.124 ± 0.29
0.281MetHis: 0.281 ± 0.142
1.053MetIle: 1.053 ± 0.257
2.739MetLys: 2.739 ± 0.373
1.966MetLeu: 1.966 ± 0.379
0.913MetMet: 0.913 ± 0.301
1.053MetAsn: 1.053 ± 0.302
1.194MetPro: 1.194 ± 0.323
0.843MetGln: 0.843 ± 0.218
0.983MetArg: 0.983 ± 0.229
2.668MetSer: 2.668 ± 0.318
2.387MetThr: 2.387 ± 0.499
1.334MetVal: 1.334 ± 0.274
0.211MetTrp: 0.211 ± 0.14
0.492MetTyr: 0.492 ± 0.18
0.0MetXaa: 0.0 ± 0.0
Asn
5.056AsnAla: 5.056 ± 0.694
0.632AsnCys: 0.632 ± 0.242
2.809AsnAsp: 2.809 ± 0.371
2.317AsnGlu: 2.317 ± 0.307
1.053AsnPhe: 1.053 ± 0.274
4.354AsnGly: 4.354 ± 0.619
0.492AsnHis: 0.492 ± 0.203
2.668AsnIle: 2.668 ± 0.434
2.387AsnLys: 2.387 ± 0.453
2.387AsnLeu: 2.387 ± 0.345
0.843AsnMet: 0.843 ± 0.287
1.896AsnAsn: 1.896 ± 0.412
2.036AsnPro: 2.036 ± 0.297
2.036AsnGln: 2.036 ± 0.439
2.458AsnArg: 2.458 ± 0.399
2.879AsnSer: 2.879 ± 0.434
2.879AsnThr: 2.879 ± 0.488
2.668AsnVal: 2.668 ± 0.653
0.492AsnTrp: 0.492 ± 0.192
1.194AsnTyr: 1.194 ± 0.411
0.0AsnXaa: 0.0 ± 0.0
Pro
2.739ProAla: 2.739 ± 0.389
0.492ProCys: 0.492 ± 0.2
3.09ProAsp: 3.09 ± 0.518
2.387ProGlu: 2.387 ± 0.425
1.615ProPhe: 1.615 ± 0.354
2.317ProGly: 2.317 ± 0.435
0.562ProHis: 0.562 ± 0.202
2.177ProIle: 2.177 ± 0.504
1.826ProLys: 1.826 ± 0.48
2.879ProLeu: 2.879 ± 0.421
1.053ProMet: 1.053 ± 0.303
1.545ProAsn: 1.545 ± 0.285
1.264ProPro: 1.264 ± 0.31
1.334ProGln: 1.334 ± 0.295
1.826ProArg: 1.826 ± 0.302
2.387ProSer: 2.387 ± 0.442
2.107ProThr: 2.107 ± 0.391
2.458ProVal: 2.458 ± 0.349
0.14ProTrp: 0.14 ± 0.098
1.194ProTyr: 1.194 ± 0.333
0.0ProXaa: 0.0 ± 0.0
Gln
4.635GlnAla: 4.635 ± 0.895
0.913GlnCys: 0.913 ± 0.194
1.545GlnAsp: 1.545 ± 0.365
2.528GlnGlu: 2.528 ± 0.527
2.036GlnPhe: 2.036 ± 0.396
2.528GlnGly: 2.528 ± 0.546
0.702GlnHis: 0.702 ± 0.216
3.23GlnIle: 3.23 ± 0.421
3.019GlnLys: 3.019 ± 0.595
3.651GlnLeu: 3.651 ± 0.504
1.475GlnMet: 1.475 ± 0.372
1.334GlnAsn: 1.334 ± 0.331
1.966GlnPro: 1.966 ± 0.413
2.879GlnGln: 2.879 ± 0.721
2.528GlnArg: 2.528 ± 0.491
2.809GlnSer: 2.809 ± 0.428
2.247GlnThr: 2.247 ± 0.495
3.3GlnVal: 3.3 ± 0.444
0.632GlnTrp: 0.632 ± 0.233
1.896GlnTyr: 1.896 ± 0.421
0.0GlnXaa: 0.0 ± 0.0
Arg
5.266ArgAla: 5.266 ± 0.43
0.772ArgCys: 0.772 ± 0.213
3.371ArgAsp: 3.371 ± 0.401
4.354ArgGlu: 4.354 ± 0.802
1.685ArgPhe: 1.685 ± 0.28
3.09ArgGly: 3.09 ± 0.461
1.124ArgHis: 1.124 ± 0.274
3.441ArgIle: 3.441 ± 0.576
4.073ArgLys: 4.073 ± 0.576
5.126ArgLeu: 5.126 ± 0.627
2.107ArgMet: 2.107 ± 0.424
2.107ArgAsn: 2.107 ± 0.419
1.826ArgPro: 1.826 ± 0.357
3.16ArgGln: 3.16 ± 0.685
3.581ArgArg: 3.581 ± 0.572
3.019ArgSer: 3.019 ± 0.474
3.09ArgThr: 3.09 ± 0.511
3.019ArgVal: 3.019 ± 0.502
0.632ArgTrp: 0.632 ± 0.188
1.826ArgTyr: 1.826 ± 0.341
0.0ArgXaa: 0.0 ± 0.0
Ser
7.022SerAla: 7.022 ± 0.932
0.562SerCys: 0.562 ± 0.197
4.705SerAsp: 4.705 ± 0.562
4.003SerGlu: 4.003 ± 0.507
2.949SerPhe: 2.949 ± 0.518
5.547SerGly: 5.547 ± 0.909
1.194SerHis: 1.194 ± 0.286
3.23SerIle: 3.23 ± 0.501
2.949SerLys: 2.949 ± 0.516
4.986SerLeu: 4.986 ± 0.762
2.107SerMet: 2.107 ± 0.352
2.668SerAsn: 2.668 ± 0.545
2.387SerPro: 2.387 ± 0.4
2.879SerGln: 2.879 ± 0.495
3.651SerArg: 3.651 ± 0.436
4.073SerSer: 4.073 ± 0.707
3.371SerThr: 3.371 ± 0.552
5.547SerVal: 5.547 ± 0.527
1.264SerTrp: 1.264 ± 0.232
1.404SerTyr: 1.404 ± 0.359
0.0SerXaa: 0.0 ± 0.0
Thr
7.022ThrAla: 7.022 ± 0.828
0.07ThrCys: 0.07 ± 0.066
4.775ThrAsp: 4.775 ± 0.703
3.792ThrGlu: 3.792 ± 0.568
2.247ThrPhe: 2.247 ± 0.393
5.758ThrGly: 5.758 ± 0.678
0.772ThrHis: 0.772 ± 0.248
3.862ThrIle: 3.862 ± 0.648
3.441ThrLys: 3.441 ± 0.535
4.986ThrLeu: 4.986 ± 0.652
1.194ThrMet: 1.194 ± 0.332
2.809ThrAsn: 2.809 ± 0.534
2.879ThrPro: 2.879 ± 0.558
2.809ThrGln: 2.809 ± 0.408
3.16ThrArg: 3.16 ± 0.397
3.511ThrSer: 3.511 ± 0.619
4.073ThrThr: 4.073 ± 0.789
2.949ThrVal: 2.949 ± 0.461
1.264ThrTrp: 1.264 ± 0.359
2.247ThrTyr: 2.247 ± 0.52
0.0ThrXaa: 0.0 ± 0.0
Val
5.547ValAla: 5.547 ± 0.731
0.843ValCys: 0.843 ± 0.205
3.441ValAsp: 3.441 ± 0.489
3.3ValGlu: 3.3 ± 0.456
2.036ValPhe: 2.036 ± 0.365
4.143ValGly: 4.143 ± 0.488
0.702ValHis: 0.702 ± 0.209
4.283ValIle: 4.283 ± 0.496
3.932ValLys: 3.932 ± 0.438
4.143ValLeu: 4.143 ± 0.443
1.755ValMet: 1.755 ± 0.311
3.09ValAsn: 3.09 ± 0.67
2.458ValPro: 2.458 ± 0.425
2.107ValGln: 2.107 ± 0.376
3.441ValArg: 3.441 ± 0.537
5.266ValSer: 5.266 ± 0.474
4.775ValThr: 4.775 ± 0.742
3.581ValVal: 3.581 ± 0.518
0.702ValTrp: 0.702 ± 0.225
1.545ValTyr: 1.545 ± 0.295
0.0ValXaa: 0.0 ± 0.0
Trp
1.264TrpAla: 1.264 ± 0.298
0.07TrpCys: 0.07 ± 0.082
1.053TrpAsp: 1.053 ± 0.25
0.772TrpGlu: 0.772 ± 0.21
0.281TrpPhe: 0.281 ± 0.132
0.843TrpGly: 0.843 ± 0.212
0.492TrpHis: 0.492 ± 0.198
0.562TrpIle: 0.562 ± 0.215
1.124TrpLys: 1.124 ± 0.307
1.966TrpLeu: 1.966 ± 0.331
0.281TrpMet: 0.281 ± 0.144
0.702TrpAsn: 0.702 ± 0.25
0.421TrpPro: 0.421 ± 0.207
0.913TrpGln: 0.913 ± 0.254
1.264TrpArg: 1.264 ± 0.297
0.983TrpSer: 0.983 ± 0.248
1.264TrpThr: 1.264 ± 0.354
0.843TrpVal: 0.843 ± 0.224
0.562TrpTrp: 0.562 ± 0.223
0.421TrpTyr: 0.421 ± 0.155
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.949TyrAla: 2.949 ± 0.475
0.632TyrCys: 0.632 ± 0.202
1.896TyrAsp: 1.896 ± 0.292
2.107TyrGlu: 2.107 ± 0.331
1.615TyrPhe: 1.615 ± 0.427
1.475TyrGly: 1.475 ± 0.343
0.562TyrHis: 0.562 ± 0.196
1.545TyrIle: 1.545 ± 0.433
1.124TyrLys: 1.124 ± 0.223
2.387TyrLeu: 2.387 ± 0.459
0.702TyrMet: 0.702 ± 0.174
1.053TyrAsn: 1.053 ± 0.286
1.124TyrPro: 1.124 ± 0.302
0.913TyrGln: 0.913 ± 0.248
2.177TyrArg: 2.177 ± 0.366
2.809TyrSer: 2.809 ± 0.471
1.685TyrThr: 1.685 ± 0.351
2.317TyrVal: 2.317 ± 0.424
0.351TyrTrp: 0.351 ± 0.17
0.913TyrTyr: 0.913 ± 0.302
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 80 proteins (14242 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski