Amino acid dipeptide frequency for Streptomyces phage ZL12

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all over-represented dipeptides are marked in red and all under-represented ones in blue in the table.
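The counting described above can be sketched in a few lines of Python. This is only an illustration, not the database's own code: the function names, the one-letter alphabet, the choice to normalise by the number of dipeptides counted, and the simple "above or below 2.5‰" rule are all assumptions made for the example.

```python
from collections import Counter

# 20 standard residues as one-letter codes (assumed input alphabet).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation: 1 of 400 dipeptides, expressed in per mille.
EXPECTED_UNIFORM = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a dipeptide never spans two proteins.
        for i in range(len(seq) - 1):
            pair = seq[i:i + 2]
            # Anything outside the 20 standard residues is mapped to X (Xaa).
            pair = "".join(c if c in AMINO_ACIDS else "X" for c in pair)
            counts[pair] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the total number of dipeptides counted.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq):
    """Label a per-mille frequency relative to the uniform expectation."""
    if freq > EXPECTED_UNIFORM:
        return "over"
    if freq < EXPECTED_UNIFORM:
        return "under"
    return "expected"
```

With the two toy sequences `["AAAC", "AC"]` the function counts AA twice and AC twice, so both come out at 500‰. The real table may use a different denominator or colouring threshold.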

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
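As a worked example of the unit conversion, using the AlaAla value from the table and the uniform expectation for 400 (or 441) dipeptides:

```latex
\begin{align*}
f_{\mathrm{AlaAla}} &= 24.094 \times 10^{-3} = 0.024094 \approx 2.41\,\% \\
f_{\mathrm{uniform}} &= \tfrac{1}{400} = 0.0025 = 0.25\,\% = 2.5 \times 10^{-3} \\
f_{\mathrm{uniform,\,Xaa}} &= \tfrac{1}{441} \approx 2.27 \times 10^{-3} \\
\frac{f_{\mathrm{AlaAla}}}{f_{\mathrm{uniform}}} &= \frac{24.094}{2.5} \approx 9.6
\end{align*}
```

So AlaAla occurs roughly 9.6 times more often than it would if all 400 dipeptides were equally likely.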

For more information, see the sequence space article on Wikipedia.

Dipeptides are grouped by first residue; each entry is frequency ± error in ‰. Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 24.094 ± 2.167
AlaCys: 1.272 ± 0.245
AlaAsp: 9.098 ± 0.599
AlaGlu: 7.556 ± 0.627
AlaPhe: 2.506 ± 0.357
AlaGly: 10.64 ± 0.616
AlaHis: 2.968 ± 0.33
AlaIle: 4.665 ± 0.6
AlaLys: 3.624 ± 0.455
AlaLeu: 10.447 ± 0.81
AlaMet: 3.045 ± 0.288
AlaAsn: 3.123 ± 0.392
AlaPro: 6.746 ± 0.728
AlaGln: 4.202 ± 0.406
AlaArg: 10.794 ± 0.752
AlaSer: 6.515 ± 0.565
AlaThr: 8.674 ± 0.604
AlaVal: 10.64 ± 0.834
AlaTrp: 2.12 ± 0.294
AlaTyr: 2.776 ± 0.337
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.079 ± 0.215
CysCys: 0.116 ± 0.07
CysAsp: 0.617 ± 0.136
CysGlu: 0.424 ± 0.119
CysPhe: 0.116 ± 0.069
CysGly: 1.118 ± 0.241
CysHis: 0.193 ± 0.087
CysIle: 0.193 ± 0.086
CysLys: 0.27 ± 0.118
CysLeu: 0.386 ± 0.17
CysMet: 0.116 ± 0.064
CysAsn: 0.077 ± 0.054
CysPro: 0.925 ± 0.242
CysGln: 0.424 ± 0.137
CysArg: 1.041 ± 0.243
CysSer: 0.732 ± 0.217
CysThr: 0.887 ± 0.199
CysVal: 0.54 ± 0.148
CysTrp: 0.154 ± 0.08
CysTyr: 0.308 ± 0.097
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.941 ± 0.56
AspCys: 0.617 ± 0.16
AspAsp: 5.089 ± 0.428
AspGlu: 3.778 ± 0.387
AspPhe: 1.118 ± 0.253
AspGly: 5.86 ± 0.546
AspHis: 1.503 ± 0.265
AspIle: 1.696 ± 0.268
AspLys: 1.311 ± 0.263
AspLeu: 6.091 ± 0.476
AspMet: 1.311 ± 0.189
AspAsn: 1.542 ± 0.226
AspPro: 5.898 ± 0.508
AspGln: 2.467 ± 0.272
AspArg: 4.973 ± 0.435
AspSer: 2.506 ± 0.278
AspThr: 4.048 ± 0.403
AspVal: 3.624 ± 0.404
AspTrp: 1.426 ± 0.201
AspTyr: 1.542 ± 0.217
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.402 ± 0.516
GluCys: 0.386 ± 0.114
GluAsp: 3.662 ± 0.414
GluGlu: 3.277 ± 0.36
GluPhe: 1.465 ± 0.238
GluGly: 4.009 ± 0.373
GluHis: 1.503 ± 0.284
GluIle: 1.581 ± 0.261
GluLys: 1.735 ± 0.276
GluLeu: 4.934 ± 0.511
GluMet: 0.694 ± 0.142
GluAsn: 1.503 ± 0.314
GluPro: 3.855 ± 0.477
GluGln: 2.93 ± 0.344
GluArg: 3.778 ± 0.412
GluSer: 1.349 ± 0.229
GluThr: 3.585 ± 0.328
GluVal: 3.045 ± 0.403
GluTrp: 0.771 ± 0.155
GluTyr: 1.465 ± 0.245
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.966 ± 0.295
PheCys: 0.27 ± 0.097
PheAsp: 1.388 ± 0.259
PheGlu: 0.732 ± 0.196
PhePhe: 0.424 ± 0.138
PheGly: 2.39 ± 0.342
PheHis: 0.578 ± 0.143
PheIle: 1.002 ± 0.194
PheLys: 0.308 ± 0.094
PheLeu: 1.503 ± 0.278
PheMet: 0.231 ± 0.119
PheAsn: 0.54 ± 0.112
PhePro: 0.925 ± 0.215
PheGln: 0.848 ± 0.189
PheArg: 1.388 ± 0.196
PheSer: 1.311 ± 0.161
PheThr: 1.658 ± 0.227
PheVal: 1.041 ± 0.212
PheTrp: 0.463 ± 0.125
PheTyr: 0.617 ± 0.166
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.71 ± 0.611
GlyCys: 0.771 ± 0.186
GlyAsp: 5.359 ± 0.345
GlyGlu: 4.202 ± 0.393
GlyPhe: 2.197 ± 0.27
GlyGly: 7.71 ± 0.873
GlyHis: 2.814 ± 0.375
GlyIle: 2.776 ± 0.286
GlyLys: 3.161 ± 0.38
GlyLeu: 6.554 ± 0.547
GlyMet: 2.082 ± 0.264
GlyAsn: 2.236 ± 0.357
GlyPro: 5.397 ± 0.477
GlyGln: 2.66 ± 0.252
GlyArg: 7.672 ± 0.646
GlySer: 6.052 ± 0.762
GlyThr: 6.168 ± 0.682
GlyVal: 5.204 ± 0.408
GlyTrp: 2.467 ± 0.287
GlyTyr: 2.429 ± 0.337
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.39 ± 0.349
HisCys: 0.386 ± 0.129
HisAsp: 1.426 ± 0.209
HisGlu: 0.964 ± 0.187
HisPhe: 0.578 ± 0.16
HisGly: 2.236 ± 0.317
HisHis: 1.349 ± 0.315
HisIle: 0.655 ± 0.149
HisLys: 0.501 ± 0.119
HisLeu: 2.39 ± 0.26
HisMet: 0.424 ± 0.129
HisAsn: 0.694 ± 0.194
HisPro: 2.082 ± 0.372
HisGln: 1.041 ± 0.201
HisArg: 2.313 ± 0.348
HisSer: 1.002 ± 0.187
HisThr: 1.966 ± 0.262
HisVal: 1.503 ± 0.273
HisTrp: 0.655 ± 0.138
HisTyr: 0.54 ± 0.154
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.973 ± 0.437
IleCys: 0.463 ± 0.137
IleAsp: 3.161 ± 0.377
IleGlu: 1.812 ± 0.282
IlePhe: 0.54 ± 0.172
IleGly: 2.699 ± 0.377
IleHis: 0.964 ± 0.198
IleIle: 0.655 ± 0.142
IleLys: 0.887 ± 0.185
IleLeu: 2.197 ± 0.314
IleMet: 0.347 ± 0.101
IleAsn: 1.542 ± 0.343
IlePro: 2.39 ± 0.332
IleGln: 1.234 ± 0.18
IleArg: 3.007 ± 0.372
IleSer: 1.118 ± 0.2
IleThr: 3.007 ± 0.508
IleVal: 2.274 ± 0.363
IleTrp: 0.463 ± 0.12
IleTyr: 0.617 ± 0.152
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.395 ± 0.579
LysCys: 0.116 ± 0.068
LysAsp: 1.272 ± 0.28
LysGlu: 1.118 ± 0.187
LysPhe: 0.694 ± 0.18
LysGly: 2.621 ± 0.438
LysHis: 0.617 ± 0.207
LysIle: 0.81 ± 0.262
LysLys: 1.118 ± 0.311
LysLeu: 2.197 ± 0.348
LysMet: 0.308 ± 0.092
LysAsn: 0.694 ± 0.248
LysPro: 2.506 ± 0.386
LysGln: 1.157 ± 0.195
LysArg: 2.043 ± 0.349
LysSer: 1.234 ± 0.214
LysThr: 2.236 ± 0.401
LysVal: 1.465 ± 0.274
LysTrp: 0.116 ± 0.064
LysTyr: 0.463 ± 0.158
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.678 ± 0.698
LeuCys: 0.81 ± 0.21
LeuAsp: 6.361 ± 0.491
LeuGlu: 4.241 ± 0.385
LeuPhe: 1.619 ± 0.271
LeuGly: 6.515 ± 0.553
LeuHis: 1.889 ± 0.295
LeuIle: 3.161 ± 0.289
LeuLys: 2.005 ± 0.302
LeuLeu: 6.361 ± 0.497
LeuMet: 1.503 ± 0.223
LeuAsn: 2.467 ± 0.313
LeuPro: 5.436 ± 0.534
LeuGln: 1.928 ± 0.264
LeuArg: 7.247 ± 0.632
LeuSer: 3.739 ± 0.38
LeuThr: 6.168 ± 0.476
LeuVal: 5.127 ± 0.465
LeuTrp: 1.041 ± 0.236
LeuTyr: 1.735 ± 0.273
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.583 ± 0.319
MetCys: 0.231 ± 0.094
MetAsp: 1.118 ± 0.224
MetGlu: 0.848 ± 0.2
MetPhe: 0.386 ± 0.135
MetGly: 1.234 ± 0.194
MetHis: 0.308 ± 0.1
MetIle: 0.694 ± 0.143
MetLys: 0.386 ± 0.125
MetLeu: 1.157 ± 0.231
MetMet: 0.193 ± 0.086
MetAsn: 0.617 ± 0.127
MetPro: 1.889 ± 0.237
MetGln: 0.655 ± 0.13
MetArg: 1.311 ± 0.211
MetSer: 1.311 ± 0.215
MetThr: 1.889 ± 0.231
MetVal: 1.349 ± 0.227
MetTrp: 0.308 ± 0.11
MetTyr: 0.347 ± 0.124
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.547 ± 0.41
AsnCys: 0.193 ± 0.09
AsnAsp: 1.426 ± 0.221
AsnGlu: 1.349 ± 0.224
AsnPhe: 0.501 ± 0.122
AsnGly: 2.776 ± 0.42
AsnHis: 0.54 ± 0.151
AsnIle: 0.925 ± 0.232
AsnLys: 0.617 ± 0.2
AsnLeu: 2.39 ± 0.311
AsnMet: 0.463 ± 0.132
AsnAsn: 1.041 ± 0.19
AsnPro: 2.159 ± 0.325
AsnGln: 1.388 ± 0.203
AsnArg: 1.928 ± 0.259
AsnSer: 1.195 ± 0.334
AsnThr: 2.429 ± 0.273
AsnVal: 1.503 ± 0.284
AsnTrp: 0.424 ± 0.132
AsnTyr: 0.501 ± 0.124
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 9.985 ± 0.807
ProCys: 0.694 ± 0.157
ProAsp: 4.973 ± 0.513
ProGlu: 4.549 ± 0.501
ProPhe: 1.272 ± 0.243
ProGly: 6.476 ± 0.65
ProHis: 1.85 ± 0.352
ProIle: 2.082 ± 0.328
ProLys: 2.313 ± 0.339
ProLeu: 4.241 ± 0.332
ProMet: 1.195 ± 0.237
ProAsn: 1.928 ± 0.293
ProPro: 4.742 ± 0.462
ProGln: 1.889 ± 0.268
ProArg: 4.395 ± 0.6
ProSer: 3.662 ± 0.462
ProThr: 5.86 ± 0.651
ProVal: 5.012 ± 0.508
ProTrp: 1.041 ± 0.177
ProTyr: 1.619 ± 0.253
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.626 ± 0.408
GlnCys: 0.347 ± 0.115
GlnAsp: 1.542 ± 0.277
GlnGlu: 2.043 ± 0.274
GlnPhe: 0.501 ± 0.158
GlnGly: 2.66 ± 0.359
GlnHis: 1.272 ± 0.201
GlnIle: 1.426 ± 0.219
GlnLys: 0.925 ± 0.199
GlnLeu: 3.701 ± 0.453
GlnMet: 0.81 ± 0.185
GlnAsn: 0.964 ± 0.198
GlnPro: 2.699 ± 0.34
GlnGln: 2.159 ± 0.295
GlnArg: 3.007 ± 0.389
GlnSer: 1.311 ± 0.262
GlnThr: 2.274 ± 0.337
GlnVal: 2.968 ± 0.352
GlnTrp: 0.578 ± 0.125
GlnTyr: 0.732 ± 0.139
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 10.987 ± 0.825
ArgCys: 0.925 ± 0.188
ArgAsp: 4.279 ± 0.417
ArgGlu: 3.971 ± 0.446
ArgPhe: 1.619 ± 0.241
ArgGly: 5.474 ± 0.491
ArgHis: 2.506 ± 0.363
ArgIle: 3.624 ± 0.363
ArgLys: 1.619 ± 0.298
ArgLeu: 7.556 ± 0.603
ArgMet: 1.928 ± 0.278
ArgAsn: 2.313 ± 0.321
ArgPro: 5.783 ± 0.603
ArgGln: 3.007 ± 0.377
ArgArg: 6.978 ± 0.673
ArgSer: 4.009 ± 0.354
ArgThr: 5.59 ± 0.444
ArgVal: 4.78 ± 0.419
ArgTrp: 1.85 ± 0.255
ArgTyr: 1.542 ± 0.251
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.975 ± 0.432
SerCys: 0.308 ± 0.099
SerAsp: 2.66 ± 0.32
SerGlu: 1.966 ± 0.248
SerPhe: 1.002 ± 0.168
SerGly: 5.436 ± 0.6
SerHis: 0.771 ± 0.173
SerIle: 1.619 ± 0.309
SerLys: 1.85 ± 0.286
SerLeu: 3.547 ± 0.55
SerMet: 0.964 ± 0.154
SerAsn: 1.157 ± 0.258
SerPro: 4.009 ± 0.515
SerGln: 1.658 ± 0.231
SerArg: 3.855 ± 0.365
SerSer: 3.123 ± 0.528
SerThr: 4.78 ± 0.577
SerVal: 2.891 ± 0.328
SerTrp: 1.079 ± 0.202
SerTyr: 1.195 ± 0.225
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 11.025 ± 0.821
ThrCys: 0.771 ± 0.164
ThrAsp: 4.279 ± 0.418
ThrGlu: 3.701 ± 0.386
ThrPhe: 1.002 ± 0.183
ThrGly: 7.479 ± 0.644
ThrHis: 1.272 ± 0.216
ThrIle: 2.66 ± 0.459
ThrLys: 2.159 ± 0.438
ThrLeu: 5.281 ± 0.473
ThrMet: 1.465 ± 0.218
ThrAsn: 1.735 ± 0.363
ThrPro: 5.86 ± 0.603
ThrGln: 2.313 ± 0.365
ThrArg: 5.32 ± 0.408
ThrSer: 3.932 ± 0.516
ThrThr: 7.093 ± 0.652
ThrVal: 5.821 ± 0.514
ThrTrp: 1.195 ± 0.199
ThrTyr: 1.503 ± 0.299
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.867 ± 0.586
ValCys: 0.54 ± 0.157
ValAsp: 4.163 ± 0.315
ValGlu: 3.701 ± 0.381
ValPhe: 1.118 ± 0.182
ValGly: 4.934 ± 0.54
ValHis: 1.426 ± 0.28
ValIle: 2.853 ± 0.321
ValLys: 1.311 ± 0.306
ValLeu: 5.937 ± 0.588
ValMet: 1.234 ± 0.206
ValAsn: 1.696 ± 0.329
ValPro: 4.433 ± 0.467
ValGln: 2.891 ± 0.271
ValArg: 5.474 ± 0.499
ValSer: 3.624 ± 0.383
ValThr: 4.973 ± 0.665
ValVal: 5.127 ± 0.562
ValTrp: 1.079 ± 0.211
ValTyr: 1.426 ± 0.219
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.236 ± 0.274
TrpCys: 0.193 ± 0.086
TrpAsp: 1.311 ± 0.23
TrpGlu: 1.195 ± 0.204
TrpPhe: 0.463 ± 0.137
TrpGly: 1.157 ± 0.223
TrpHis: 0.54 ± 0.136
TrpIle: 0.655 ± 0.164
TrpLys: 0.386 ± 0.14
TrpLeu: 1.234 ± 0.224
TrpMet: 0.193 ± 0.084
TrpAsn: 0.617 ± 0.168
TrpPro: 1.157 ± 0.214
TrpGln: 0.925 ± 0.186
TrpArg: 1.619 ± 0.236
TrpSer: 1.041 ± 0.198
TrpThr: 1.118 ± 0.203
TrpVal: 1.272 ± 0.289
TrpTrp: 0.308 ± 0.102
TrpTyr: 0.27 ± 0.112
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.045 ± 0.325
TyrCys: 0.308 ± 0.152
TyrAsp: 1.157 ± 0.223
TyrGlu: 1.503 ± 0.265
TyrPhe: 0.463 ± 0.137
TyrGly: 2.197 ± 0.274
TyrHis: 0.231 ± 0.104
TyrIle: 0.617 ± 0.136
TyrLys: 0.694 ± 0.17
TyrLeu: 1.966 ± 0.279
TyrMet: 0.27 ± 0.105
TyrAsn: 0.81 ± 0.171
TyrPro: 0.887 ± 0.19
TyrGln: 0.771 ± 0.167
TyrArg: 2.197 ± 0.345
TyrSer: 1.079 ± 0.189
TyrThr: 1.465 ± 0.226
TyrVal: 1.619 ± 0.258
TyrTrp: 0.347 ± 0.114
TyrTyr: 0.848 ± 0.165
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 112 proteins (25941 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
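A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's implementation: the function names are hypothetical, the default of 100 rounds simply mirrors the note, and reporting the standard deviation of the resampled estimates as the error is an assumption, since the source does not say which spread statistic is used.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Return (mean, spread) of the per-mille frequency of `pair`.

    Whole proteins are resampled with replacement, so each round draws
    as many proteins as the original set contains.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    per_protein = [dipeptide_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the reported error is the standard deviation over rounds.
    return statistics.mean(estimates), statistics.stdev(estimates)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins happen to be in the proteome.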

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski