Amino acid dipepetide frequency for Escherichia phage vB_EcoM_ECOO78

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.239AlaAla: 11.239 ± 1.14
1.1AlaCys: 1.1 ± 0.293
6.681AlaAsp: 6.681 ± 0.835
7.545AlaGlu: 7.545 ± 0.919
3.694AlaPhe: 3.694 ± 0.572
9.117AlaGly: 9.117 ± 1.17
1.572AlaHis: 1.572 ± 0.423
6.917AlaIle: 6.917 ± 0.942
5.03AlaLys: 5.03 ± 0.772
6.681AlaLeu: 6.681 ± 1.025
2.515AlaMet: 2.515 ± 0.478
6.681AlaAsn: 6.681 ± 0.656
3.615AlaPro: 3.615 ± 0.55
4.401AlaGln: 4.401 ± 0.595
5.423AlaArg: 5.423 ± 0.663
5.187AlaSer: 5.187 ± 0.577
5.659AlaThr: 5.659 ± 0.823
6.366AlaVal: 6.366 ± 0.806
1.493AlaTrp: 1.493 ± 0.325
3.065AlaTyr: 3.065 ± 0.468
0.0AlaXaa: 0.0 ± 0.0
Cys
0.393CysAla: 0.393 ± 0.168
0.314CysCys: 0.314 ± 0.161
0.786CysAsp: 0.786 ± 0.277
0.707CysGlu: 0.707 ± 0.252
0.472CysPhe: 0.472 ± 0.19
0.865CysGly: 0.865 ± 0.267
0.157CysHis: 0.157 ± 0.119
0.629CysIle: 0.629 ± 0.19
0.55CysLys: 0.55 ± 0.242
1.1CysLeu: 1.1 ± 0.297
0.0CysMet: 0.0 ± 0.0
0.157CysAsn: 0.157 ± 0.094
0.865CysPro: 0.865 ± 0.302
0.472CysGln: 0.472 ± 0.22
0.55CysArg: 0.55 ± 0.207
1.022CysSer: 1.022 ± 0.41
0.943CysThr: 0.943 ± 0.282
0.786CysVal: 0.786 ± 0.237
0.314CysTrp: 0.314 ± 0.153
0.314CysTyr: 0.314 ± 0.192
0.0CysXaa: 0.0 ± 0.0
Asp
5.58AspAla: 5.58 ± 0.654
0.943AspCys: 0.943 ± 0.319
4.559AspAsp: 4.559 ± 0.552
3.93AspGlu: 3.93 ± 0.639
2.672AspPhe: 2.672 ± 0.502
6.681AspGly: 6.681 ± 0.918
1.336AspHis: 1.336 ± 0.384
3.065AspIle: 3.065 ± 0.45
3.38AspLys: 3.38 ± 0.563
3.773AspLeu: 3.773 ± 0.509
1.415AspMet: 1.415 ± 0.282
2.515AspAsn: 2.515 ± 0.377
2.594AspPro: 2.594 ± 0.435
2.201AspGln: 2.201 ± 0.4
2.751AspArg: 2.751 ± 0.473
2.279AspSer: 2.279 ± 0.375
3.38AspThr: 3.38 ± 0.53
5.266AspVal: 5.266 ± 0.69
1.1AspTrp: 1.1 ± 0.315
2.437AspTyr: 2.437 ± 0.39
0.0AspXaa: 0.0 ± 0.0
Glu
5.03GluAla: 5.03 ± 0.649
0.943GluCys: 0.943 ± 0.248
1.179GluAsp: 1.179 ± 0.279
3.458GluGlu: 3.458 ± 0.761
2.044GluPhe: 2.044 ± 0.44
4.323GluGly: 4.323 ± 0.518
1.258GluHis: 1.258 ± 0.364
4.087GluIle: 4.087 ± 0.592
3.458GluLys: 3.458 ± 0.615
6.209GluLeu: 6.209 ± 0.642
1.729GluMet: 1.729 ± 0.358
1.572GluAsn: 1.572 ± 0.288
2.279GluPro: 2.279 ± 0.438
3.773GluGln: 3.773 ± 0.653
4.244GluArg: 4.244 ± 0.59
3.694GluSer: 3.694 ± 0.54
3.458GluThr: 3.458 ± 0.578
4.48GluVal: 4.48 ± 0.462
2.122GluTrp: 2.122 ± 0.361
1.179GluTyr: 1.179 ± 0.316
0.0GluXaa: 0.0 ± 0.0
Phe
3.223PheAla: 3.223 ± 0.495
0.393PheCys: 0.393 ± 0.155
2.437PheAsp: 2.437 ± 0.485
2.122PheGlu: 2.122 ± 0.353
1.1PhePhe: 1.1 ± 0.295
2.83PheGly: 2.83 ± 0.431
0.393PheHis: 0.393 ± 0.192
2.279PheIle: 2.279 ± 0.502
1.729PheLys: 1.729 ± 0.335
2.122PheLeu: 2.122 ± 0.424
0.943PheMet: 0.943 ± 0.235
2.044PheAsn: 2.044 ± 0.507
0.865PhePro: 0.865 ± 0.29
1.258PheGln: 1.258 ± 0.431
1.886PheArg: 1.886 ± 0.407
2.358PheSer: 2.358 ± 0.533
2.044PheThr: 2.044 ± 0.314
1.493PheVal: 1.493 ± 0.344
0.707PheTrp: 0.707 ± 0.234
0.786PheTyr: 0.786 ± 0.302
0.0PheXaa: 0.0 ± 0.0
Gly
7.703GlyAla: 7.703 ± 0.854
0.865GlyCys: 0.865 ± 0.3
4.794GlyAsp: 4.794 ± 0.663
5.266GlyGlu: 5.266 ± 0.5
3.38GlyPhe: 3.38 ± 0.519
7.231GlyGly: 7.231 ± 0.848
1.1GlyHis: 1.1 ± 0.29
4.401GlyIle: 4.401 ± 0.597
5.109GlyLys: 5.109 ± 0.546
4.952GlyLeu: 4.952 ± 0.682
1.886GlyMet: 1.886 ± 0.405
3.773GlyAsn: 3.773 ± 0.611
2.201GlyPro: 2.201 ± 0.371
3.458GlyGln: 3.458 ± 0.564
4.323GlyArg: 4.323 ± 0.722
5.109GlySer: 5.109 ± 0.859
6.052GlyThr: 6.052 ± 0.761
5.816GlyVal: 5.816 ± 0.779
1.336GlyTrp: 1.336 ± 0.398
3.223GlyTyr: 3.223 ± 0.403
0.0GlyXaa: 0.0 ± 0.0
His
1.336HisAla: 1.336 ± 0.374
0.707HisCys: 0.707 ± 0.25
0.786HisAsp: 0.786 ± 0.221
1.258HisGlu: 1.258 ± 0.294
0.943HisPhe: 0.943 ± 0.28
1.572HisGly: 1.572 ± 0.366
0.786HisHis: 0.786 ± 0.267
0.472HisIle: 0.472 ± 0.214
0.786HisLys: 0.786 ± 0.302
1.729HisLeu: 1.729 ± 0.447
0.393HisMet: 0.393 ± 0.175
0.472HisAsn: 0.472 ± 0.164
0.786HisPro: 0.786 ± 0.211
1.1HisGln: 1.1 ± 0.321
0.943HisArg: 0.943 ± 0.266
0.629HisSer: 0.629 ± 0.212
0.707HisThr: 0.707 ± 0.217
0.943HisVal: 0.943 ± 0.253
0.393HisTrp: 0.393 ± 0.206
0.472HisTyr: 0.472 ± 0.18
0.0HisXaa: 0.0 ± 0.0
Ile
6.288IleAla: 6.288 ± 0.835
0.472IleCys: 0.472 ± 0.221
4.401IleAsp: 4.401 ± 0.499
3.065IleGlu: 3.065 ± 0.625
1.572IlePhe: 1.572 ± 0.381
5.502IleGly: 5.502 ± 0.763
0.786IleHis: 0.786 ± 0.237
2.672IleIle: 2.672 ± 0.568
2.358IleLys: 2.358 ± 0.518
3.223IleLeu: 3.223 ± 0.551
1.179IleMet: 1.179 ± 0.284
2.987IleAsn: 2.987 ± 0.494
1.965IlePro: 1.965 ± 0.497
1.729IleGln: 1.729 ± 0.408
3.38IleArg: 3.38 ± 0.596
3.223IleSer: 3.223 ± 0.443
4.873IleThr: 4.873 ± 0.593
2.83IleVal: 2.83 ± 0.429
0.707IleTrp: 0.707 ± 0.225
0.707IleTyr: 0.707 ± 0.221
0.0IleXaa: 0.0 ± 0.0
Lys
4.716LysAla: 4.716 ± 0.74
0.786LysCys: 0.786 ± 0.264
3.301LysAsp: 3.301 ± 0.426
1.808LysGlu: 1.808 ± 0.447
1.965LysPhe: 1.965 ± 0.362
3.773LysGly: 3.773 ± 0.607
1.022LysHis: 1.022 ± 0.298
2.358LysIle: 2.358 ± 0.477
2.751LysLys: 2.751 ± 0.573
4.637LysLeu: 4.637 ± 0.701
1.651LysMet: 1.651 ± 0.483
2.358LysAsn: 2.358 ± 0.374
3.851LysPro: 3.851 ± 0.757
2.908LysGln: 2.908 ± 0.579
3.615LysArg: 3.615 ± 0.663
3.144LysSer: 3.144 ± 0.447
3.615LysThr: 3.615 ± 0.408
3.458LysVal: 3.458 ± 0.606
0.629LysTrp: 0.629 ± 0.192
1.415LysTyr: 1.415 ± 0.315
0.0LysXaa: 0.0 ± 0.0
Leu
9.353LeuAla: 9.353 ± 0.851
0.943LeuCys: 0.943 ± 0.233
4.952LeuAsp: 4.952 ± 0.527
4.873LeuGlu: 4.873 ± 0.616
2.201LeuPhe: 2.201 ± 0.402
4.873LeuGly: 4.873 ± 0.685
1.572LeuHis: 1.572 ± 0.294
3.458LeuIle: 3.458 ± 0.483
3.851LeuLys: 3.851 ± 0.563
5.502LeuLeu: 5.502 ± 0.611
1.493LeuMet: 1.493 ± 0.311
2.594LeuAsn: 2.594 ± 0.478
4.637LeuPro: 4.637 ± 0.57
3.223LeuGln: 3.223 ± 0.58
5.816LeuArg: 5.816 ± 0.723
4.48LeuSer: 4.48 ± 0.713
4.401LeuThr: 4.401 ± 0.666
5.738LeuVal: 5.738 ± 0.649
1.336LeuTrp: 1.336 ± 0.36
1.336LeuTyr: 1.336 ± 0.356
0.0LeuXaa: 0.0 ± 0.0
Met
4.008MetAla: 4.008 ± 0.634
0.0MetCys: 0.0 ± 0.0
1.022MetAsp: 1.022 ± 0.446
1.415MetGlu: 1.415 ± 0.365
0.393MetPhe: 0.393 ± 0.163
2.279MetGly: 2.279 ± 0.458
0.393MetHis: 0.393 ± 0.171
0.707MetIle: 0.707 ± 0.227
2.044MetLys: 2.044 ± 0.49
1.808MetLeu: 1.808 ± 0.345
1.022MetMet: 1.022 ± 0.28
1.258MetAsn: 1.258 ± 0.289
1.258MetPro: 1.258 ± 0.244
1.336MetGln: 1.336 ± 0.331
1.808MetArg: 1.808 ± 0.326
0.707MetSer: 0.707 ± 0.201
1.651MetThr: 1.651 ± 0.354
1.415MetVal: 1.415 ± 0.291
0.472MetTrp: 0.472 ± 0.165
0.079MetTyr: 0.079 ± 0.075
0.0MetXaa: 0.0 ± 0.0
Asn
5.266AsnAla: 5.266 ± 0.679
0.786AsnCys: 0.786 ± 0.225
3.615AsnAsp: 3.615 ± 0.535
2.358AsnGlu: 2.358 ± 0.431
1.258AsnPhe: 1.258 ± 0.354
3.93AsnGly: 3.93 ± 0.653
0.707AsnHis: 0.707 ± 0.216
2.987AsnIle: 2.987 ± 0.386
2.751AsnLys: 2.751 ± 0.469
3.301AsnLeu: 3.301 ± 0.427
1.179AsnMet: 1.179 ± 0.336
1.886AsnAsn: 1.886 ± 0.467
2.044AsnPro: 2.044 ± 0.456
1.572AsnGln: 1.572 ± 0.356
2.122AsnArg: 2.122 ± 0.449
2.279AsnSer: 2.279 ± 0.734
1.729AsnThr: 1.729 ± 0.366
2.594AsnVal: 2.594 ± 0.548
0.629AsnTrp: 0.629 ± 0.25
1.179AsnTyr: 1.179 ± 0.265
0.0AsnXaa: 0.0 ± 0.0
Pro
5.109ProAla: 5.109 ± 0.748
0.157ProCys: 0.157 ± 0.106
3.38ProAsp: 3.38 ± 0.434
2.594ProGlu: 2.594 ± 0.462
1.651ProPhe: 1.651 ± 0.248
3.773ProGly: 3.773 ± 0.574
0.55ProHis: 0.55 ± 0.2
1.572ProIle: 1.572 ± 0.374
2.83ProLys: 2.83 ± 0.565
3.144ProLeu: 3.144 ± 0.432
0.943ProMet: 0.943 ± 0.262
1.965ProAsn: 1.965 ± 0.291
2.044ProPro: 2.044 ± 0.472
1.965ProGln: 1.965 ± 0.429
2.122ProArg: 2.122 ± 0.385
2.515ProSer: 2.515 ± 0.568
2.594ProThr: 2.594 ± 0.548
3.38ProVal: 3.38 ± 0.56
0.55ProTrp: 0.55 ± 0.192
1.336ProTyr: 1.336 ± 0.322
0.0ProXaa: 0.0 ± 0.0
Gln
4.716GlnAla: 4.716 ± 0.708
0.314GlnCys: 0.314 ± 0.191
2.672GlnAsp: 2.672 ± 0.467
2.437GlnGlu: 2.437 ± 0.446
1.415GlnPhe: 1.415 ± 0.403
2.201GlnGly: 2.201 ± 0.445
1.258GlnHis: 1.258 ± 0.376
1.886GlnIle: 1.886 ± 0.328
2.358GlnLys: 2.358 ± 0.437
2.987GlnLeu: 2.987 ± 0.442
1.415GlnMet: 1.415 ± 0.287
0.707GlnAsn: 0.707 ± 0.209
2.515GlnPro: 2.515 ± 0.512
3.851GlnGln: 3.851 ± 0.755
3.93GlnArg: 3.93 ± 0.598
2.201GlnSer: 2.201 ± 0.333
2.122GlnThr: 2.122 ± 0.351
3.301GlnVal: 3.301 ± 0.454
0.865GlnTrp: 0.865 ± 0.236
1.336GlnTyr: 1.336 ± 0.308
0.0GlnXaa: 0.0 ± 0.0
Arg
6.681ArgAla: 6.681 ± 0.83
0.55ArgCys: 0.55 ± 0.198
3.38ArgAsp: 3.38 ± 0.584
4.087ArgGlu: 4.087 ± 0.561
1.886ArgPhe: 1.886 ± 0.351
4.087ArgGly: 4.087 ± 0.535
1.1ArgHis: 1.1 ± 0.294
3.223ArgIle: 3.223 ± 0.481
3.38ArgLys: 3.38 ± 0.589
5.58ArgLeu: 5.58 ± 0.619
1.572ArgMet: 1.572 ± 0.416
2.908ArgAsn: 2.908 ± 0.361
2.279ArgPro: 2.279 ± 0.385
2.672ArgGln: 2.672 ± 0.459
4.244ArgArg: 4.244 ± 0.665
3.065ArgSer: 3.065 ± 0.487
3.301ArgThr: 3.301 ± 0.525
3.144ArgVal: 3.144 ± 0.565
1.179ArgTrp: 1.179 ± 0.317
1.493ArgTyr: 1.493 ± 0.3
0.0ArgXaa: 0.0 ± 0.0
Ser
4.559SerAla: 4.559 ± 0.655
0.157SerCys: 0.157 ± 0.092
4.166SerAsp: 4.166 ± 0.624
3.301SerGlu: 3.301 ± 0.398
1.022SerPhe: 1.022 ± 0.381
4.952SerGly: 4.952 ± 0.833
0.786SerHis: 0.786 ± 0.216
2.908SerIle: 2.908 ± 0.533
2.908SerLys: 2.908 ± 0.45
5.738SerLeu: 5.738 ± 0.738
1.179SerMet: 1.179 ± 0.287
2.83SerAsn: 2.83 ± 0.77
2.751SerPro: 2.751 ± 0.524
2.201SerGln: 2.201 ± 0.386
2.672SerArg: 2.672 ± 0.388
2.751SerSer: 2.751 ± 0.448
2.672SerThr: 2.672 ± 0.479
4.401SerVal: 4.401 ± 0.786
0.865SerTrp: 0.865 ± 0.285
1.651SerTyr: 1.651 ± 0.416
0.0SerXaa: 0.0 ± 0.0
Thr
7.781ThrAla: 7.781 ± 0.95
0.393ThrCys: 0.393 ± 0.161
2.672ThrAsp: 2.672 ± 0.368
3.93ThrGlu: 3.93 ± 0.567
1.965ThrPhe: 1.965 ± 0.368
6.131ThrGly: 6.131 ± 0.475
0.629ThrHis: 0.629 ± 0.197
4.48ThrIle: 4.48 ± 0.774
2.672ThrLys: 2.672 ± 0.505
5.187ThrLeu: 5.187 ± 0.668
1.415ThrMet: 1.415 ± 0.336
2.201ThrAsn: 2.201 ± 0.389
2.908ThrPro: 2.908 ± 0.482
2.122ThrGln: 2.122 ± 0.443
2.279ThrArg: 2.279 ± 0.61
3.38ThrSer: 3.38 ± 0.429
3.615ThrThr: 3.615 ± 0.748
4.873ThrVal: 4.873 ± 0.788
1.022ThrTrp: 1.022 ± 0.292
1.336ThrTyr: 1.336 ± 0.314
0.0ThrXaa: 0.0 ± 0.0
Val
6.917ValAla: 6.917 ± 0.788
0.786ValCys: 0.786 ± 0.244
4.166ValAsp: 4.166 ± 0.587
4.323ValGlu: 4.323 ± 0.71
1.651ValPhe: 1.651 ± 0.367
4.794ValGly: 4.794 ± 0.527
0.55ValHis: 0.55 ± 0.198
3.773ValIle: 3.773 ± 0.619
4.008ValLys: 4.008 ± 0.605
5.58ValLeu: 5.58 ± 0.77
2.358ValMet: 2.358 ± 0.44
3.615ValAsn: 3.615 ± 0.456
2.594ValPro: 2.594 ± 0.486
2.044ValGln: 2.044 ± 0.293
4.244ValArg: 4.244 ± 0.77
3.301ValSer: 3.301 ± 0.647
5.502ValThr: 5.502 ± 0.709
6.052ValVal: 6.052 ± 0.897
1.022ValTrp: 1.022 ± 0.233
1.886ValTyr: 1.886 ± 0.547
0.0ValXaa: 0.0 ± 0.0
Trp
1.493TrpAla: 1.493 ± 0.386
0.157TrpCys: 0.157 ± 0.118
1.022TrpAsp: 1.022 ± 0.272
0.865TrpGlu: 0.865 ± 0.294
0.707TrpPhe: 0.707 ± 0.229
1.179TrpGly: 1.179 ± 0.359
0.629TrpHis: 0.629 ± 0.22
0.707TrpIle: 0.707 ± 0.254
0.629TrpLys: 0.629 ± 0.221
2.044TrpLeu: 2.044 ± 0.444
0.472TrpMet: 0.472 ± 0.168
0.786TrpAsn: 0.786 ± 0.23
0.629TrpPro: 0.629 ± 0.213
0.943TrpGln: 0.943 ± 0.23
1.258TrpArg: 1.258 ± 0.307
1.258TrpSer: 1.258 ± 0.303
1.022TrpThr: 1.022 ± 0.289
0.943TrpVal: 0.943 ± 0.237
0.314TrpTrp: 0.314 ± 0.182
0.236TrpTyr: 0.236 ± 0.128
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.594TyrAla: 2.594 ± 0.402
0.629TyrCys: 0.629 ± 0.259
1.886TyrAsp: 1.886 ± 0.427
1.179TyrGlu: 1.179 ± 0.302
1.022TyrPhe: 1.022 ± 0.311
1.886TyrGly: 1.886 ± 0.371
0.629TyrHis: 0.629 ± 0.224
1.336TyrIle: 1.336 ± 0.361
1.179TyrLys: 1.179 ± 0.309
1.336TyrLeu: 1.336 ± 0.338
0.236TyrMet: 0.236 ± 0.142
0.865TyrAsn: 0.865 ± 0.27
1.493TyrPro: 1.493 ± 0.302
1.336TyrGln: 1.336 ± 0.274
2.201TyrArg: 2.201 ± 0.396
1.886TyrSer: 1.886 ± 0.312
1.651TyrThr: 1.651 ± 0.338
1.886TyrVal: 1.886 ± 0.419
0.236TyrTrp: 0.236 ± 0.138
0.629TyrTyr: 0.629 ± 0.213
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 56 proteins (12724 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski