Amino acid dipepetide frequency for Escherichia phage YDC107_1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.923AlaAla: 9.923 ± 1.628
1.145AlaCys: 1.145 ± 0.261
5.038AlaAsp: 5.038 ± 0.695
5.725AlaGlu: 5.725 ± 0.689
3.893AlaPhe: 3.893 ± 0.438
6.106AlaGly: 6.106 ± 0.87
1.145AlaHis: 1.145 ± 0.224
5.496AlaIle: 5.496 ± 0.604
4.427AlaLys: 4.427 ± 0.47
9.16AlaLeu: 9.16 ± 0.696
2.672AlaMet: 2.672 ± 0.438
2.366AlaAsn: 2.366 ± 0.448
2.214AlaPro: 2.214 ± 0.385
4.427AlaGln: 4.427 ± 0.823
6.717AlaArg: 6.717 ± 0.752
5.267AlaSer: 5.267 ± 0.636
6.259AlaThr: 6.259 ± 0.969
6.106AlaVal: 6.106 ± 0.803
1.832AlaTrp: 1.832 ± 0.352
2.366AlaTyr: 2.366 ± 0.391
0.0AlaXaa: 0.0 ± 0.0
Cys
0.916CysAla: 0.916 ± 0.268
0.229CysCys: 0.229 ± 0.19
0.611CysAsp: 0.611 ± 0.201
0.763CysGlu: 0.763 ± 0.302
0.611CysPhe: 0.611 ± 0.235
1.069CysGly: 1.069 ± 0.252
0.458CysHis: 0.458 ± 0.188
0.84CysIle: 0.84 ± 0.306
0.229CysLys: 0.229 ± 0.137
0.992CysLeu: 0.992 ± 0.245
0.611CysMet: 0.611 ± 0.232
0.229CysAsn: 0.229 ± 0.121
0.763CysPro: 0.763 ± 0.204
0.382CysGln: 0.382 ± 0.184
0.687CysArg: 0.687 ± 0.223
0.916CysSer: 0.916 ± 0.243
0.229CysThr: 0.229 ± 0.125
1.145CysVal: 1.145 ± 0.269
0.382CysTrp: 0.382 ± 0.134
0.229CysTyr: 0.229 ± 0.149
0.0CysXaa: 0.0 ± 0.0
Asp
4.961AspAla: 4.961 ± 0.671
0.763AspCys: 0.763 ± 0.238
4.656AspAsp: 4.656 ± 1.065
4.961AspGlu: 4.961 ± 0.596
2.672AspPhe: 2.672 ± 0.37
4.961AspGly: 4.961 ± 0.577
0.687AspHis: 0.687 ± 0.303
3.588AspIle: 3.588 ± 0.477
3.053AspLys: 3.053 ± 0.371
4.656AspLeu: 4.656 ± 0.607
0.916AspMet: 0.916 ± 0.287
2.29AspAsn: 2.29 ± 0.364
3.053AspPro: 3.053 ± 0.655
0.84AspGln: 0.84 ± 0.277
3.511AspArg: 3.511 ± 0.556
3.817AspSer: 3.817 ± 0.418
2.519AspThr: 2.519 ± 0.667
5.343AspVal: 5.343 ± 0.589
0.687AspTrp: 0.687 ± 0.238
1.908AspTyr: 1.908 ± 0.377
0.0AspXaa: 0.0 ± 0.0
Glu
6.106GluAla: 6.106 ± 0.646
0.534GluCys: 0.534 ± 0.202
2.595GluAsp: 2.595 ± 0.383
6.412GluGlu: 6.412 ± 1.188
2.061GluPhe: 2.061 ± 0.42
2.672GluGly: 2.672 ± 0.464
1.298GluHis: 1.298 ± 0.347
3.969GluIle: 3.969 ± 0.649
3.893GluLys: 3.893 ± 0.528
8.091GluLeu: 8.091 ± 0.846
2.366GluMet: 2.366 ± 0.402
2.748GluAsn: 2.748 ± 0.533
1.679GluPro: 1.679 ± 0.348
4.885GluGln: 4.885 ± 0.611
5.572GluArg: 5.572 ± 0.658
3.282GluSer: 3.282 ± 0.382
3.206GluThr: 3.206 ± 0.567
3.664GluVal: 3.664 ± 0.514
0.305GluTrp: 0.305 ± 0.15
1.298GluTyr: 1.298 ± 0.347
0.0GluXaa: 0.0 ± 0.0
Phe
3.053PheAla: 3.053 ± 0.479
0.84PheCys: 0.84 ± 0.255
2.901PheAsp: 2.901 ± 0.507
1.832PheGlu: 1.832 ± 0.326
1.374PhePhe: 1.374 ± 0.42
2.137PheGly: 2.137 ± 0.36
0.84PheHis: 0.84 ± 0.238
1.679PheIle: 1.679 ± 0.358
2.137PheLys: 2.137 ± 0.509
2.595PheLeu: 2.595 ± 0.404
1.145PheMet: 1.145 ± 0.226
2.443PheAsn: 2.443 ± 0.385
1.374PhePro: 1.374 ± 0.341
0.84PheGln: 0.84 ± 0.284
2.29PheArg: 2.29 ± 0.411
3.359PheSer: 3.359 ± 0.556
2.595PheThr: 2.595 ± 0.444
2.977PheVal: 2.977 ± 0.387
0.229PheTrp: 0.229 ± 0.132
1.298PheTyr: 1.298 ± 0.331
0.0PheXaa: 0.0 ± 0.0
Gly
4.961GlyAla: 4.961 ± 0.946
0.382GlyCys: 0.382 ± 0.152
4.427GlyAsp: 4.427 ± 0.532
4.656GlyGlu: 4.656 ± 0.634
2.214GlyPhe: 2.214 ± 0.454
4.885GlyGly: 4.885 ± 1.026
0.687GlyHis: 0.687 ± 0.201
4.045GlyIle: 4.045 ± 0.573
4.122GlyLys: 4.122 ± 0.577
5.343GlyLeu: 5.343 ± 0.74
2.672GlyMet: 2.672 ± 0.422
2.901GlyAsn: 2.901 ± 0.477
1.45GlyPro: 1.45 ± 0.312
3.282GlyGln: 3.282 ± 0.57
4.732GlyArg: 4.732 ± 0.508
4.045GlySer: 4.045 ± 0.512
3.435GlyThr: 3.435 ± 0.634
3.817GlyVal: 3.817 ± 0.462
1.374GlyTrp: 1.374 ± 0.286
2.672GlyTyr: 2.672 ± 0.367
0.0GlyXaa: 0.0 ± 0.0
His
1.145HisAla: 1.145 ± 0.279
0.153HisCys: 0.153 ± 0.114
1.298HisAsp: 1.298 ± 0.258
0.763HisGlu: 0.763 ± 0.217
1.527HisPhe: 1.527 ± 0.259
0.382HisGly: 0.382 ± 0.188
0.534HisHis: 0.534 ± 0.201
1.298HisIle: 1.298 ± 0.275
1.069HisLys: 1.069 ± 0.24
1.298HisLeu: 1.298 ± 0.241
0.305HisMet: 0.305 ± 0.169
0.992HisAsn: 0.992 ± 0.304
0.687HisPro: 0.687 ± 0.204
0.611HisGln: 0.611 ± 0.168
1.298HisArg: 1.298 ± 0.322
1.756HisSer: 1.756 ± 0.41
0.611HisThr: 0.611 ± 0.213
0.687HisVal: 0.687 ± 0.2
0.305HisTrp: 0.305 ± 0.15
0.916HisTyr: 0.916 ± 0.28
0.0HisXaa: 0.0 ± 0.0
Ile
3.893IleAla: 3.893 ± 0.591
1.374IleCys: 1.374 ± 0.322
3.511IleAsp: 3.511 ± 0.437
3.13IleGlu: 3.13 ± 0.524
1.145IlePhe: 1.145 ± 0.292
4.351IleGly: 4.351 ± 0.41
1.221IleHis: 1.221 ± 0.343
1.832IleIle: 1.832 ± 0.377
3.435IleLys: 3.435 ± 0.552
3.74IleLeu: 3.74 ± 0.525
1.527IleMet: 1.527 ± 0.355
2.519IleAsn: 2.519 ± 0.485
1.908IlePro: 1.908 ± 0.4
1.374IleGln: 1.374 ± 0.336
3.435IleArg: 3.435 ± 0.471
3.511IleSer: 3.511 ± 0.425
3.359IleThr: 3.359 ± 0.544
3.435IleVal: 3.435 ± 0.496
0.611IleTrp: 0.611 ± 0.229
1.679IleTyr: 1.679 ± 0.413
0.0IleXaa: 0.0 ± 0.0
Lys
5.267LysAla: 5.267 ± 0.666
0.534LysCys: 0.534 ± 0.169
2.29LysAsp: 2.29 ± 0.458
3.664LysGlu: 3.664 ± 0.509
2.137LysPhe: 2.137 ± 0.417
4.045LysGly: 4.045 ± 0.691
1.145LysHis: 1.145 ± 0.315
2.901LysIle: 2.901 ± 0.446
3.817LysLys: 3.817 ± 0.522
4.198LysLeu: 4.198 ± 0.728
0.992LysMet: 0.992 ± 0.326
2.29LysAsn: 2.29 ± 0.422
2.29LysPro: 2.29 ± 0.373
2.366LysGln: 2.366 ± 0.385
4.274LysArg: 4.274 ± 0.622
3.817LysSer: 3.817 ± 0.413
3.588LysThr: 3.588 ± 0.469
3.511LysVal: 3.511 ± 0.423
0.84LysTrp: 0.84 ± 0.238
2.214LysTyr: 2.214 ± 0.403
0.0LysXaa: 0.0 ± 0.0
Leu
8.167LeuAla: 8.167 ± 0.79
1.069LeuCys: 1.069 ± 0.211
4.427LeuAsp: 4.427 ± 0.508
5.343LeuGlu: 5.343 ± 0.75
2.595LeuPhe: 2.595 ± 0.43
3.74LeuGly: 3.74 ± 0.476
1.985LeuHis: 1.985 ± 0.424
3.664LeuIle: 3.664 ± 0.437
6.259LeuLys: 6.259 ± 0.768
6.717LeuLeu: 6.717 ± 0.749
1.908LeuMet: 1.908 ± 0.345
3.969LeuAsn: 3.969 ± 0.504
4.045LeuPro: 4.045 ± 0.528
3.206LeuGln: 3.206 ± 0.485
7.022LeuArg: 7.022 ± 1.011
7.099LeuSer: 7.099 ± 0.869
5.954LeuThr: 5.954 ± 0.634
4.122LeuVal: 4.122 ± 0.568
1.679LeuTrp: 1.679 ± 0.348
1.908LeuTyr: 1.908 ± 0.374
0.0LeuXaa: 0.0 ± 0.0
Met
2.901MetAla: 2.901 ± 0.564
0.076MetCys: 0.076 ± 0.076
0.992MetAsp: 0.992 ± 0.262
1.145MetGlu: 1.145 ± 0.227
1.298MetPhe: 1.298 ± 0.287
1.527MetGly: 1.527 ± 0.281
0.763MetHis: 0.763 ± 0.316
1.145MetIle: 1.145 ± 0.247
2.061MetLys: 2.061 ± 0.426
2.824MetLeu: 2.824 ± 0.387
0.916MetMet: 0.916 ± 0.187
1.145MetAsn: 1.145 ± 0.313
1.679MetPro: 1.679 ± 0.406
1.221MetGln: 1.221 ± 0.38
1.985MetArg: 1.985 ± 0.408
1.985MetSer: 1.985 ± 0.432
3.817MetThr: 3.817 ± 0.677
1.756MetVal: 1.756 ± 0.318
0.153MetTrp: 0.153 ± 0.105
0.687MetTyr: 0.687 ± 0.197
0.0MetXaa: 0.0 ± 0.0
Asn
2.366AsnAla: 2.366 ± 0.372
0.611AsnCys: 0.611 ± 0.232
2.595AsnAsp: 2.595 ± 0.545
2.214AsnGlu: 2.214 ± 0.426
0.84AsnPhe: 0.84 ± 0.213
3.206AsnGly: 3.206 ± 0.524
0.458AsnHis: 0.458 ± 0.204
2.672AsnIle: 2.672 ± 0.486
2.519AsnLys: 2.519 ± 0.449
2.137AsnLeu: 2.137 ± 0.382
1.908AsnMet: 1.908 ± 0.386
2.061AsnAsn: 2.061 ± 0.37
2.061AsnPro: 2.061 ± 0.32
1.298AsnGln: 1.298 ± 0.269
1.756AsnArg: 1.756 ± 0.371
2.595AsnSer: 2.595 ± 0.47
2.977AsnThr: 2.977 ± 0.637
2.214AsnVal: 2.214 ± 0.374
0.534AsnTrp: 0.534 ± 0.162
1.069AsnTyr: 1.069 ± 0.317
0.0AsnXaa: 0.0 ± 0.0
Pro
4.351ProAla: 4.351 ± 0.536
0.229ProCys: 0.229 ± 0.168
2.901ProAsp: 2.901 ± 0.617
3.053ProGlu: 3.053 ± 0.456
1.679ProPhe: 1.679 ± 0.386
3.74ProGly: 3.74 ± 0.489
0.534ProHis: 0.534 ± 0.261
1.298ProIle: 1.298 ± 0.273
1.603ProLys: 1.603 ± 0.342
3.282ProLeu: 3.282 ± 0.384
0.687ProMet: 0.687 ± 0.214
1.374ProAsn: 1.374 ± 0.276
1.145ProPro: 1.145 ± 0.271
1.298ProGln: 1.298 ± 0.284
2.137ProArg: 2.137 ± 0.449
2.748ProSer: 2.748 ± 0.405
1.221ProThr: 1.221 ± 0.264
4.122ProVal: 4.122 ± 0.444
0.458ProTrp: 0.458 ± 0.165
1.221ProTyr: 1.221 ± 0.353
0.0ProXaa: 0.0 ± 0.0
Gln
4.351GlnAla: 4.351 ± 0.663
0.305GlnCys: 0.305 ± 0.182
1.374GlnAsp: 1.374 ± 0.304
2.519GlnGlu: 2.519 ± 0.504
1.527GlnPhe: 1.527 ± 0.426
2.443GlnGly: 2.443 ± 0.331
1.221GlnHis: 1.221 ± 0.304
1.756GlnIle: 1.756 ± 0.379
2.137GlnLys: 2.137 ± 0.38
4.656GlnLeu: 4.656 ± 0.685
1.679GlnMet: 1.679 ± 0.373
1.45GlnAsn: 1.45 ± 0.34
1.908GlnPro: 1.908 ± 0.386
2.29GlnGln: 2.29 ± 0.461
3.359GlnArg: 3.359 ± 0.631
2.672GlnSer: 2.672 ± 0.549
2.214GlnThr: 2.214 ± 0.351
1.908GlnVal: 1.908 ± 0.429
0.992GlnTrp: 0.992 ± 0.252
1.221GlnTyr: 1.221 ± 0.37
0.0GlnXaa: 0.0 ± 0.0
Arg
5.114ArgAla: 5.114 ± 0.654
0.916ArgCys: 0.916 ± 0.328
4.274ArgAsp: 4.274 ± 0.725
4.809ArgGlu: 4.809 ± 0.791
2.901ArgPhe: 2.901 ± 0.361
5.19ArgGly: 5.19 ± 0.677
1.298ArgHis: 1.298 ± 0.304
4.58ArgIle: 4.58 ± 0.556
4.503ArgLys: 4.503 ± 0.566
5.954ArgLeu: 5.954 ± 0.677
2.672ArgMet: 2.672 ± 0.464
2.214ArgAsn: 2.214 ± 0.403
2.061ArgPro: 2.061 ± 0.322
3.74ArgGln: 3.74 ± 0.481
7.557ArgArg: 7.557 ± 0.955
3.588ArgSer: 3.588 ± 0.496
3.206ArgThr: 3.206 ± 0.391
4.198ArgVal: 4.198 ± 0.702
1.374ArgTrp: 1.374 ± 0.33
2.29ArgTyr: 2.29 ± 0.358
0.0ArgXaa: 0.0 ± 0.0
Ser
7.633SerAla: 7.633 ± 0.557
0.84SerCys: 0.84 ± 0.243
4.961SerAsp: 4.961 ± 0.608
4.274SerGlu: 4.274 ± 0.547
2.29SerPhe: 2.29 ± 0.335
6.106SerGly: 6.106 ± 0.607
1.145SerHis: 1.145 ± 0.235
3.13SerIle: 3.13 ± 0.488
1.985SerLys: 1.985 ± 0.348
4.427SerLeu: 4.427 ± 0.571
1.45SerMet: 1.45 ± 0.324
1.603SerAsn: 1.603 ± 0.366
3.435SerPro: 3.435 ± 0.575
2.366SerGln: 2.366 ± 0.391
5.343SerArg: 5.343 ± 0.827
3.359SerSer: 3.359 ± 0.416
3.435SerThr: 3.435 ± 0.449
4.809SerVal: 4.809 ± 0.644
1.756SerTrp: 1.756 ± 0.406
1.298SerTyr: 1.298 ± 0.309
0.0SerXaa: 0.0 ± 0.0
Thr
6.641ThrAla: 6.641 ± 0.85
0.992ThrCys: 0.992 ± 0.228
3.511ThrAsp: 3.511 ± 0.493
3.664ThrGlu: 3.664 ± 0.807
2.824ThrPhe: 2.824 ± 0.398
4.503ThrGly: 4.503 ± 0.696
0.611ThrHis: 0.611 ± 0.232
2.29ThrIle: 2.29 ± 0.441
1.45ThrLys: 1.45 ± 0.307
5.648ThrLeu: 5.648 ± 0.607
1.374ThrMet: 1.374 ± 0.306
1.679ThrAsn: 1.679 ± 0.309
2.901ThrPro: 2.901 ± 0.512
2.595ThrGln: 2.595 ± 0.395
3.588ThrArg: 3.588 ± 0.439
2.977ThrSer: 2.977 ± 0.554
2.901ThrThr: 2.901 ± 0.606
3.817ThrVal: 3.817 ± 0.813
0.916ThrTrp: 0.916 ± 0.284
2.214ThrTyr: 2.214 ± 0.379
0.0ThrXaa: 0.0 ± 0.0
Val
5.877ValAla: 5.877 ± 0.581
0.763ValCys: 0.763 ± 0.312
4.656ValAsp: 4.656 ± 0.536
4.351ValGlu: 4.351 ± 0.417
2.672ValPhe: 2.672 ± 0.394
2.901ValGly: 2.901 ± 0.561
1.145ValHis: 1.145 ± 0.296
3.053ValIle: 3.053 ± 0.361
4.198ValLys: 4.198 ± 0.512
5.19ValLeu: 5.19 ± 0.608
2.824ValMet: 2.824 ± 0.393
2.366ValAsn: 2.366 ± 0.411
2.443ValPro: 2.443 ± 0.341
2.214ValGln: 2.214 ± 0.425
4.427ValArg: 4.427 ± 0.486
5.19ValSer: 5.19 ± 0.782
3.13ValThr: 3.13 ± 0.556
5.19ValVal: 5.19 ± 0.675
0.992ValTrp: 0.992 ± 0.273
1.908ValTyr: 1.908 ± 0.408
0.0ValXaa: 0.0 ± 0.0
Trp
1.145TrpAla: 1.145 ± 0.203
0.229TrpCys: 0.229 ± 0.122
1.069TrpAsp: 1.069 ± 0.29
1.069TrpGlu: 1.069 ± 0.284
0.534TrpPhe: 0.534 ± 0.189
0.611TrpGly: 0.611 ± 0.237
0.153TrpHis: 0.153 ± 0.097
0.763TrpIle: 0.763 ± 0.278
0.84TrpLys: 0.84 ± 0.284
2.214TrpLeu: 2.214 ± 0.456
0.382TrpMet: 0.382 ± 0.161
0.229TrpAsn: 0.229 ± 0.136
0.458TrpPro: 0.458 ± 0.23
0.992TrpGln: 0.992 ± 0.249
1.298TrpArg: 1.298 ± 0.387
1.221TrpSer: 1.221 ± 0.29
0.84TrpThr: 0.84 ± 0.326
1.145TrpVal: 1.145 ± 0.282
0.153TrpTrp: 0.153 ± 0.106
0.382TrpTyr: 0.382 ± 0.196
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.511TyrAla: 3.511 ± 0.446
0.305TyrCys: 0.305 ± 0.158
1.756TyrAsp: 1.756 ± 0.281
2.29TyrGlu: 2.29 ± 0.328
1.145TyrPhe: 1.145 ± 0.382
1.527TyrGly: 1.527 ± 0.323
0.305TyrHis: 0.305 ± 0.149
0.992TyrIle: 0.992 ± 0.315
2.443TyrLys: 2.443 ± 0.398
1.756TyrLeu: 1.756 ± 0.442
0.992TyrMet: 0.992 ± 0.302
1.221TyrAsn: 1.221 ± 0.312
1.527TyrPro: 1.527 ± 0.306
1.832TyrGln: 1.832 ± 0.266
1.45TyrArg: 1.45 ± 0.274
2.443TyrSer: 2.443 ± 0.49
1.603TyrThr: 1.603 ± 0.246
1.527TyrVal: 1.527 ± 0.3
0.153TyrTrp: 0.153 ± 0.117
0.84TyrTyr: 0.84 ± 0.287
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 60 proteins (13102 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski