Amino acid dipeptide frequency for Erwinia phage pEp_SNUABM_09

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
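As an illustration of what such a calculation involves, here is a minimal Python sketch that counts overlapping adjacent residue pairs across a set of protein sequences. The function name `dipeptide_permille`, the one-letter residue codes, and the choice to normalize by the total number of pairs are assumptions made for this example; the page does not state which denominator the database uses.

```python
from collections import Counter

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for overlapping adjacent pairs.

    Hypothetical helper: normalizing by the total number of pairs is an
    assumption of this sketch, not a documented property of the database.
    """
    counts = Counter()
    for seq in proteins:
        # A protein of length n contributes n - 1 overlapping pairs;
        # pairs never span two different proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (not real phage proteins): pairs are MA, AA, AK, AA, AG.
toy = ["MAAK", "AAG"]
freqs = dipeptide_permille(toy)
```

In the toy input, the pair `AA` occurs twice among five pairs, so it accounts for two fifths of all dipeptides.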

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each would be present at a frequency of 0.25%. This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
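The arithmetic behind the uniform expectation and the unit conversion can be sketched as follows. The names `classify` and `permille_to_fraction` are hypothetical, and comparing each value directly against the uniform expectation is an assumption; the page does not specify the exact rule used for the red and blue marking.

```python
N_STANDARD = 20  # standard amino acids
N_WITH_XAA = 21  # including Xaa

# Uniform expectation per dipeptide, expressed in per mille.
expected_permille = 1000.0 / N_STANDARD ** 2       # 1/400 = 0.25% = 2.5 per mille
expected_permille_xaa = 1000.0 / N_WITH_XAA ** 2   # 1/441, roughly 2.27 per mille

def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3

def classify(observed_permille, expected=expected_permille):
    """Label a value relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "expected"

# Two values taken from the table: AlaAla and CysCys.
ala_ala = classify(10.252)
cys_cys = classify(0.172)
```

With these definitions, the AlaAla value of 10.252‰ lies above the 2.5‰ expectation and the CysCys value of 0.172‰ lies below it.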

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
10.252AlaAla: 10.252 ± 1.084
0.689AlaCys: 0.689 ± 0.278
6.72AlaAsp: 6.72 ± 0.762
6.375AlaGlu: 6.375 ± 0.697
3.188AlaPhe: 3.188 ± 0.493
7.926AlaGly: 7.926 ± 0.985
1.292AlaHis: 1.292 ± 0.34
3.446AlaIle: 3.446 ± 0.646
7.409AlaLys: 7.409 ± 0.662
8.529AlaLeu: 8.529 ± 0.749
2.843AlaMet: 2.843 ± 0.554
3.963AlaAsn: 3.963 ± 0.513
2.671AlaPro: 2.671 ± 0.644
4.652AlaGln: 4.652 ± 0.687
5.772AlaArg: 5.772 ± 0.681
5.686AlaSer: 5.686 ± 1.029
3.705AlaThr: 3.705 ± 0.69
5.859AlaVal: 5.859 ± 0.888
1.292AlaTrp: 1.292 ± 0.41
2.843AlaTyr: 2.843 ± 0.741
0.0AlaXaa: 0.0 ± 0.0
Cys
0.431CysAla: 0.431 ± 0.187
0.172CysCys: 0.172 ± 0.147
0.603CysAsp: 0.603 ± 0.381
0.431CysGlu: 0.431 ± 0.185
0.431CysPhe: 0.431 ± 0.214
0.345CysGly: 0.345 ± 0.164
0.431CysHis: 0.431 ± 0.22
0.258CysIle: 0.258 ± 0.139
0.431CysLys: 0.431 ± 0.232
0.948CysLeu: 0.948 ± 0.408
0.258CysMet: 0.258 ± 0.156
0.345CysAsn: 0.345 ± 0.191
0.345CysPro: 0.345 ± 0.153
0.258CysGln: 0.258 ± 0.134
0.689CysArg: 0.689 ± 0.252
0.345CysSer: 0.345 ± 0.167
0.258CysThr: 0.258 ± 0.183
0.431CysVal: 0.431 ± 0.181
0.172CysTrp: 0.172 ± 0.118
0.086CysTyr: 0.086 ± 0.107
0.0CysXaa: 0.0 ± 0.0
Asp
6.203AspAla: 6.203 ± 0.625
0.258AspCys: 0.258 ± 0.14
3.877AspAsp: 3.877 ± 0.561
5.169AspGlu: 5.169 ± 0.604
3.188AspPhe: 3.188 ± 0.459
6.289AspGly: 6.289 ± 0.605
1.034AspHis: 1.034 ± 0.235
3.532AspIle: 3.532 ± 0.409
2.843AspLys: 2.843 ± 0.433
3.36AspLeu: 3.36 ± 0.46
1.723AspMet: 1.723 ± 0.364
1.982AspAsn: 1.982 ± 0.304
2.929AspPro: 2.929 ± 0.62
1.723AspGln: 1.723 ± 0.357
2.326AspArg: 2.326 ± 0.386
3.877AspSer: 3.877 ± 0.719
4.308AspThr: 4.308 ± 0.58
3.791AspVal: 3.791 ± 0.556
0.775AspTrp: 0.775 ± 0.199
2.843AspTyr: 2.843 ± 0.576
0.0AspXaa: 0.0 ± 0.0
Glu
7.065GluAla: 7.065 ± 0.824
0.775GluCys: 0.775 ± 0.282
3.705GluAsp: 3.705 ± 0.571
5.945GluGlu: 5.945 ± 0.988
2.585GluPhe: 2.585 ± 0.436
6.031GluGly: 6.031 ± 0.782
1.034GluHis: 1.034 ± 0.351
2.843GluIle: 2.843 ± 0.389
3.446GluLys: 3.446 ± 0.465
6.117GluLeu: 6.117 ± 0.729
2.154GluMet: 2.154 ± 0.514
2.843GluAsn: 2.843 ± 0.57
2.412GluPro: 2.412 ± 0.571
3.619GluGln: 3.619 ± 0.697
3.705GluArg: 3.705 ± 0.497
5.169GluSer: 5.169 ± 0.866
3.188GluThr: 3.188 ± 0.634
4.566GluVal: 4.566 ± 0.49
1.12GluTrp: 1.12 ± 0.346
2.929GluTyr: 2.929 ± 0.446
0.0GluXaa: 0.0 ± 0.0
Phe
3.188PheAla: 3.188 ± 0.379
0.258PheCys: 0.258 ± 0.17
3.446PheAsp: 3.446 ± 0.557
1.982PheGlu: 1.982 ± 0.32
1.378PhePhe: 1.378 ± 0.35
3.188PheGly: 3.188 ± 0.624
0.948PheHis: 0.948 ± 0.377
1.637PheIle: 1.637 ± 0.474
2.671PheLys: 2.671 ± 0.502
3.188PheLeu: 3.188 ± 0.544
1.206PheMet: 1.206 ± 0.237
1.895PheAsn: 1.895 ± 0.4
1.637PhePro: 1.637 ± 0.478
1.292PheGln: 1.292 ± 0.349
1.982PheArg: 1.982 ± 0.448
1.637PheSer: 1.637 ± 0.515
2.757PheThr: 2.757 ± 0.538
2.498PheVal: 2.498 ± 0.564
0.172PheTrp: 0.172 ± 0.11
0.689PheTyr: 0.689 ± 0.237
0.0PheXaa: 0.0 ± 0.0
Gly
7.926GlyAla: 7.926 ± 1.176
0.603GlyCys: 0.603 ± 0.262
5.428GlyAsp: 5.428 ± 0.682
4.652GlyGlu: 4.652 ± 0.695
2.498GlyPhe: 2.498 ± 0.7
6.462GlyGly: 6.462 ± 0.913
1.034GlyHis: 1.034 ± 0.25
4.997GlyIle: 4.997 ± 0.602
5.859GlyLys: 5.859 ± 0.745
7.065GlyLeu: 7.065 ± 0.751
2.154GlyMet: 2.154 ± 0.539
3.36GlyAsn: 3.36 ± 0.664
1.12GlyPro: 1.12 ± 0.286
2.671GlyGln: 2.671 ± 0.427
4.825GlyArg: 4.825 ± 0.535
5.6GlySer: 5.6 ± 0.855
4.308GlyThr: 4.308 ± 0.448
4.049GlyVal: 4.049 ± 0.639
1.723GlyTrp: 1.723 ± 0.462
3.188GlyTyr: 3.188 ± 0.713
0.0GlyXaa: 0.0 ± 0.0
His
1.292HisAla: 1.292 ± 0.278
0.345HisCys: 0.345 ± 0.182
0.862HisAsp: 0.862 ± 0.281
1.206HisGlu: 1.206 ± 0.345
0.775HisPhe: 0.775 ± 0.264
1.206HisGly: 1.206 ± 0.275
0.431HisHis: 0.431 ± 0.192
1.12HisIle: 1.12 ± 0.271
1.034HisLys: 1.034 ± 0.321
1.465HisLeu: 1.465 ± 0.252
0.517HisMet: 0.517 ± 0.177
0.517HisAsn: 0.517 ± 0.206
0.517HisPro: 0.517 ± 0.191
0.086HisGln: 0.086 ± 0.082
0.689HisArg: 0.689 ± 0.208
0.948HisSer: 0.948 ± 0.362
1.465HisThr: 1.465 ± 0.468
1.206HisVal: 1.206 ± 0.254
0.258HisTrp: 0.258 ± 0.171
0.517HisTyr: 0.517 ± 0.188
0.0HisXaa: 0.0 ± 0.0
Ile
4.48IleAla: 4.48 ± 0.627
0.517IleCys: 0.517 ± 0.198
3.877IleAsp: 3.877 ± 0.537
3.015IleGlu: 3.015 ± 0.487
1.378IlePhe: 1.378 ± 0.34
3.188IleGly: 3.188 ± 0.395
1.206IleHis: 1.206 ± 0.34
2.412IleIle: 2.412 ± 0.498
2.929IleLys: 2.929 ± 0.36
3.619IleLeu: 3.619 ± 0.499
1.378IleMet: 1.378 ± 0.42
2.154IleAsn: 2.154 ± 0.568
2.068IlePro: 2.068 ± 0.519
1.637IleGln: 1.637 ± 0.378
3.188IleArg: 3.188 ± 0.596
2.412IleSer: 2.412 ± 0.465
2.757IleThr: 2.757 ± 0.495
3.705IleVal: 3.705 ± 0.453
0.775IleTrp: 0.775 ± 0.25
1.551IleTyr: 1.551 ± 0.319
0.0IleXaa: 0.0 ± 0.0
Lys
7.409LysAla: 7.409 ± 0.925
0.431LysCys: 0.431 ± 0.176
3.446LysAsp: 3.446 ± 0.655
4.222LysGlu: 4.222 ± 0.655
2.671LysPhe: 2.671 ± 0.36
4.825LysGly: 4.825 ± 0.533
1.034LysHis: 1.034 ± 0.27
1.723LysIle: 1.723 ± 0.405
4.222LysLys: 4.222 ± 0.651
6.203LysLeu: 6.203 ± 0.865
1.637LysMet: 1.637 ± 0.297
2.24LysAsn: 2.24 ± 0.326
2.326LysPro: 2.326 ± 0.533
3.446LysGln: 3.446 ± 0.669
2.671LysArg: 2.671 ± 0.441
2.757LysSer: 2.757 ± 0.504
4.308LysThr: 4.308 ± 0.652
5.342LysVal: 5.342 ± 0.763
0.862LysTrp: 0.862 ± 0.28
2.498LysTyr: 2.498 ± 0.423
0.0LysXaa: 0.0 ± 0.0
Leu
7.926LeuAla: 7.926 ± 1.004
0.258LeuCys: 0.258 ± 0.22
4.566LeuAsp: 4.566 ± 0.584
6.117LeuGlu: 6.117 ± 1.15
2.498LeuPhe: 2.498 ± 0.506
4.308LeuGly: 4.308 ± 0.441
0.948LeuHis: 0.948 ± 0.276
3.705LeuIle: 3.705 ± 0.635
6.203LeuLys: 6.203 ± 0.728
4.739LeuLeu: 4.739 ± 0.835
2.326LeuMet: 2.326 ± 0.438
4.135LeuAsn: 4.135 ± 0.515
3.102LeuPro: 3.102 ± 0.63
4.049LeuGln: 4.049 ± 0.799
5.945LeuArg: 5.945 ± 0.683
4.825LeuSer: 4.825 ± 0.641
5.514LeuThr: 5.514 ± 0.784
5.083LeuVal: 5.083 ± 0.82
1.551LeuTrp: 1.551 ± 0.464
2.326LeuTyr: 2.326 ± 0.525
0.0LeuXaa: 0.0 ± 0.0
Met
2.498MetAla: 2.498 ± 0.422
0.172MetCys: 0.172 ± 0.133
2.068MetAsp: 2.068 ± 0.537
1.292MetGlu: 1.292 ± 0.37
1.034MetPhe: 1.034 ± 0.25
1.637MetGly: 1.637 ± 0.339
0.172MetHis: 0.172 ± 0.122
1.637MetIle: 1.637 ± 0.442
1.292MetLys: 1.292 ± 0.52
2.757MetLeu: 2.757 ± 0.486
0.775MetMet: 0.775 ± 0.226
1.292MetAsn: 1.292 ± 0.296
1.12MetPro: 1.12 ± 0.264
1.465MetGln: 1.465 ± 0.385
1.551MetArg: 1.551 ± 0.428
2.585MetSer: 2.585 ± 0.632
1.637MetThr: 1.637 ± 0.334
1.723MetVal: 1.723 ± 0.394
0.258MetTrp: 0.258 ± 0.132
0.431MetTyr: 0.431 ± 0.186
0.0MetXaa: 0.0 ± 0.0
Asn
4.222AsnAla: 4.222 ± 0.864
0.258AsnCys: 0.258 ± 0.191
2.843AsnAsp: 2.843 ± 0.432
3.532AsnGlu: 3.532 ± 0.456
1.465AsnPhe: 1.465 ± 0.374
4.48AsnGly: 4.48 ± 0.705
0.689AsnHis: 0.689 ± 0.257
2.24AsnIle: 2.24 ± 0.465
1.465AsnLys: 1.465 ± 0.385
4.135AsnLeu: 4.135 ± 0.634
1.034AsnMet: 1.034 ± 0.354
2.326AsnAsn: 2.326 ± 0.778
1.809AsnPro: 1.809 ± 0.357
1.895AsnGln: 1.895 ± 0.404
1.809AsnArg: 1.809 ± 0.604
2.585AsnSer: 2.585 ± 0.589
2.326AsnThr: 2.326 ± 0.575
2.929AsnVal: 2.929 ± 0.544
0.775AsnTrp: 0.775 ± 0.219
1.809AsnTyr: 1.809 ± 0.359
0.0AsnXaa: 0.0 ± 0.0
Pro
3.963ProAla: 3.963 ± 0.605
0.172ProCys: 0.172 ± 0.11
2.068ProAsp: 2.068 ± 0.47
3.446ProGlu: 3.446 ± 0.863
1.637ProPhe: 1.637 ± 0.399
2.498ProGly: 2.498 ± 0.389
0.689ProHis: 0.689 ± 0.172
1.292ProIle: 1.292 ± 0.323
2.24ProLys: 2.24 ± 0.546
2.154ProLeu: 2.154 ± 0.387
0.948ProMet: 0.948 ± 0.299
2.068ProAsn: 2.068 ± 0.433
0.948ProPro: 0.948 ± 0.27
1.034ProGln: 1.034 ± 0.276
1.982ProArg: 1.982 ± 0.445
1.723ProSer: 1.723 ± 0.372
1.551ProThr: 1.551 ± 0.448
2.154ProVal: 2.154 ± 0.451
0.948ProTrp: 0.948 ± 0.22
1.809ProTyr: 1.809 ± 0.469
0.0ProXaa: 0.0 ± 0.0
Gln
4.135GlnAla: 4.135 ± 0.729
0.172GlnCys: 0.172 ± 0.142
1.809GlnAsp: 1.809 ± 0.424
3.446GlnGlu: 3.446 ± 0.706
2.24GlnPhe: 2.24 ± 0.385
3.015GlnGly: 3.015 ± 0.424
0.862GlnHis: 0.862 ± 0.261
2.326GlnIle: 2.326 ± 0.493
1.982GlnLys: 1.982 ± 0.569
3.705GlnLeu: 3.705 ± 0.561
0.948GlnMet: 0.948 ± 0.317
1.292GlnAsn: 1.292 ± 0.267
1.982GlnPro: 1.982 ± 0.355
2.068GlnGln: 2.068 ± 0.446
2.326GlnArg: 2.326 ± 0.458
2.24GlnSer: 2.24 ± 0.453
1.982GlnThr: 1.982 ± 0.455
2.843GlnVal: 2.843 ± 0.621
1.206GlnTrp: 1.206 ± 0.338
1.034GlnTyr: 1.034 ± 0.35
0.0GlnXaa: 0.0 ± 0.0
Arg
4.394ArgAla: 4.394 ± 0.471
0.948ArgCys: 0.948 ± 0.261
3.877ArgAsp: 3.877 ± 0.615
4.566ArgGlu: 4.566 ± 0.81
1.982ArgPhe: 1.982 ± 0.339
3.619ArgGly: 3.619 ± 0.493
0.689ArgHis: 0.689 ± 0.211
3.015ArgIle: 3.015 ± 0.474
4.394ArgLys: 4.394 ± 0.819
4.308ArgLeu: 4.308 ± 0.559
1.465ArgMet: 1.465 ± 0.368
2.671ArgAsn: 2.671 ± 0.599
2.068ArgPro: 2.068 ± 0.524
1.723ArgGln: 1.723 ± 0.291
2.757ArgArg: 2.757 ± 0.406
3.963ArgSer: 3.963 ± 0.536
3.446ArgThr: 3.446 ± 0.436
3.532ArgVal: 3.532 ± 0.64
0.517ArgTrp: 0.517 ± 0.206
1.551ArgTyr: 1.551 ± 0.309
0.0ArgXaa: 0.0 ± 0.0
Ser
6.117SerAla: 6.117 ± 1.061
0.345SerCys: 0.345 ± 0.249
4.48SerAsp: 4.48 ± 0.694
3.705SerGlu: 3.705 ± 0.788
3.015SerPhe: 3.015 ± 0.521
5.6SerGly: 5.6 ± 0.78
1.034SerHis: 1.034 ± 0.309
2.929SerIle: 2.929 ± 0.478
4.308SerLys: 4.308 ± 0.53
3.274SerLeu: 3.274 ± 0.466
1.465SerMet: 1.465 ± 0.411
1.809SerAsn: 1.809 ± 0.378
1.723SerPro: 1.723 ± 0.448
1.982SerGln: 1.982 ± 0.463
2.757SerArg: 2.757 ± 0.56
3.877SerSer: 3.877 ± 0.461
3.36SerThr: 3.36 ± 0.443
4.049SerVal: 4.049 ± 0.704
0.948SerTrp: 0.948 ± 0.257
2.585SerTyr: 2.585 ± 0.598
0.0SerXaa: 0.0 ± 0.0
Thr
3.877ThrAla: 3.877 ± 0.558
0.517ThrCys: 0.517 ± 0.23
3.446ThrAsp: 3.446 ± 0.705
4.308ThrGlu: 4.308 ± 0.598
1.895ThrPhe: 1.895 ± 0.383
5.514ThrGly: 5.514 ± 0.626
1.034ThrHis: 1.034 ± 0.271
3.619ThrIle: 3.619 ± 0.713
4.566ThrLys: 4.566 ± 0.516
5.169ThrLeu: 5.169 ± 0.684
1.378ThrMet: 1.378 ± 0.435
3.102ThrAsn: 3.102 ± 0.62
2.585ThrPro: 2.585 ± 0.478
1.982ThrGln: 1.982 ± 0.428
2.498ThrArg: 2.498 ± 0.532
2.757ThrSer: 2.757 ± 0.502
3.015ThrThr: 3.015 ± 0.63
4.049ThrVal: 4.049 ± 0.619
0.775ThrTrp: 0.775 ± 0.245
1.378ThrTyr: 1.378 ± 0.328
0.0ThrXaa: 0.0 ± 0.0
Val
5.514ValAla: 5.514 ± 0.729
0.345ValCys: 0.345 ± 0.153
2.24ValAsp: 2.24 ± 0.425
4.825ValGlu: 4.825 ± 0.741
2.585ValPhe: 2.585 ± 0.693
5.6ValGly: 5.6 ± 0.75
1.034ValHis: 1.034 ± 0.335
3.532ValIle: 3.532 ± 0.769
4.48ValLys: 4.48 ± 0.604
4.997ValLeu: 4.997 ± 0.808
1.723ValMet: 1.723 ± 0.377
3.36ValAsn: 3.36 ± 0.552
2.585ValPro: 2.585 ± 0.45
3.446ValGln: 3.446 ± 0.579
4.652ValArg: 4.652 ± 0.708
4.049ValSer: 4.049 ± 0.613
4.308ValThr: 4.308 ± 0.932
5.083ValVal: 5.083 ± 0.702
0.689ValTrp: 0.689 ± 0.201
1.982ValTyr: 1.982 ± 0.368
0.0ValXaa: 0.0 ± 0.0
Trp
1.12TrpAla: 1.12 ± 0.282
0.258TrpCys: 0.258 ± 0.167
0.517TrpAsp: 0.517 ± 0.251
1.12TrpGlu: 1.12 ± 0.348
0.345TrpPhe: 0.345 ± 0.155
0.775TrpGly: 0.775 ± 0.296
0.345TrpHis: 0.345 ± 0.196
0.431TrpIle: 0.431 ± 0.181
1.12TrpLys: 1.12 ± 0.324
1.465TrpLeu: 1.465 ± 0.418
0.517TrpMet: 0.517 ± 0.206
1.12TrpAsn: 1.12 ± 0.282
0.258TrpPro: 0.258 ± 0.128
0.603TrpGln: 0.603 ± 0.211
0.689TrpArg: 0.689 ± 0.226
1.034TrpSer: 1.034 ± 0.302
1.723TrpThr: 1.723 ± 0.364
1.292TrpVal: 1.292 ± 0.452
0.517TrpTrp: 0.517 ± 0.247
0.517TrpTyr: 0.517 ± 0.208
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.188TyrAla: 3.188 ± 0.523
0.172TyrCys: 0.172 ± 0.124
2.154TyrAsp: 2.154 ± 0.407
1.895TyrGlu: 1.895 ± 0.465
0.862TyrPhe: 0.862 ± 0.396
3.015TyrGly: 3.015 ± 0.381
0.517TyrHis: 0.517 ± 0.245
1.551TyrIle: 1.551 ± 0.333
1.378TyrLys: 1.378 ± 0.24
2.929TyrLeu: 2.929 ± 0.551
0.862TyrMet: 0.862 ± 0.275
2.154TyrAsn: 2.154 ± 0.395
1.12TyrPro: 1.12 ± 0.32
1.982TyrGln: 1.982 ± 0.588
2.671TyrArg: 2.671 ± 0.376
1.292TyrSer: 1.292 ± 0.329
1.551TyrThr: 1.551 ± 0.409
2.843TyrVal: 2.843 ± 0.544
0.431TyrTrp: 0.431 ± 0.2
1.034TyrTyr: 1.034 ± 0.306
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 49 proteins (11608 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
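To make the idea of protein-level bootstrapping concrete, here is a hedged Python sketch: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. The helper names, the fixed seed, and the use of the standard deviation as the reported error are assumptions of this example, not a description of the actual Proteome-pI pipeline.

```python
import random
import statistics
from collections import Counter

def pair_frequency(proteins, pair):
    """Per mille frequency of one dipeptide over overlapping adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Hypothetical helper: each resample draws len(proteins) whole proteins
    with replacement, so residues within a protein stay together.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)

# Toy input (not real phage proteins).
toy = ["MAAK", "AAG", "GGK"]
err = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual residues keeps adjacent pairs intact, which is why the error reflects variation between proteins.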

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski