Amino acid dipeptide frequency for Escherichia phage wV8

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
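As an illustration only, the following Python sketch counts dipeptides in a set of protein sequences and reports them in per mille. The toy sequences, the function names, the use of overlapping windows within each protein, and the normalization by the total number of counted dipeptides are all assumptions made for this example; the exact procedure used to produce the table on this page is not described here.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)


def dipeptide_counts(proteins):
    """Count overlapping dipeptides inside each protein (never across proteins)."""
    counts = Counter()
    for seq in proteins:
        # Assumption: every residue outside the 20 standard ones is mapped to X.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    return counts


def dipeptide_permille(proteins):
    """Return all 441 dipeptide frequencies in per mille."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Hypothetical toy input, not taken from the phage proteome.
toy = ["MKVLAA", "MKXA"]
freqs = dipeptide_permille(toy)
print(len(freqs), freqs["MK"], freqs["KX"])
```

With real data, the list of sequences would be read from the proteome's FASTA file instead of the toy list.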

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred randomly and with equal probability in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
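The comparison against the uniform expectation can be sketched in a few lines of Python. This is a minimal example that assumes the expectation is simply 1/400 for every dipeptide and that any value above or below it counts as over- or underrepresented; the page does not state the exact threshold behind the colouring, and the function name `classify` is hypothetical. The four example values are taken from the table.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def classify(observed_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if observed_permille > expected:
        return "over"    # would be marked in red
    if observed_permille < expected:
        return "under"   # would be marked in blue
    return "expected"


# Values in per mille copied from the table on this page.
examples = {"LysVal": 6.567, "AlaAla": 4.982, "GlyPro": 0.075, "TrpHis": 0.038}
for name, value in examples.items():
    ratio = value / EXPECTED_PERMILLE
    print(f"{name}: {value} per mille, {classify(value)}, {ratio:.2f}x expected")
```

A more careful baseline would use the product of the two single amino acid frequencies rather than a flat 1/400, but that goes beyond what the text above describes.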

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 4.982 ± 0.606 | 0.642 ± 0.157 | 3.925 ± 0.41 | 3.888 ± 0.377 | 2.453 ± 0.337 | 5.209 ± 0.506 | 1.472 ± 0.253 | 4.454 ± 0.356 | 5.171 ± 0.488 | 5.775 ± 0.553 | 2.491 ± 0.281 | 3.661 ± 0.418 | 1.812 ± 0.294 | 2.453 ± 0.321 | 2.453 ± 0.328 | 4.189 ± 0.565 | 4.378 ± 0.639 | 5.209 ± 0.521 | 0.755 ± 0.145 | 2.868 ± 0.332 | 0.0 ± 0.0
Cys | 0.566 ± 0.183 | 0.226 ± 0.117 | 0.566 ± 0.147 | 1.057 ± 0.239 | 0.793 ± 0.159 | 0.981 ± 0.222 | 0.528 ± 0.129 | 0.528 ± 0.126 | 1.434 ± 0.255 | 1.095 ± 0.161 | 0.34 ± 0.102 | 0.83 ± 0.171 | 0.604 ± 0.152 | 0.453 ± 0.146 | 0.793 ± 0.197 | 1.208 ± 0.247 | 0.642 ± 0.159 | 1.095 ± 0.211 | 0.113 ± 0.057 | 0.679 ± 0.147 | 0.0 ± 0.0
Asp | 4.605 ± 0.398 | 0.642 ± 0.167 | 3.699 ± 0.398 | 4.491 ± 0.475 | 3.359 ± 0.346 | 4.718 ± 0.416 | 0.604 ± 0.16 | 4.227 ± 0.418 | 4.303 ± 0.359 | 5.209 ± 0.432 | 1.774 ± 0.257 | 3.208 ± 0.403 | 1.698 ± 0.287 | 0.528 ± 0.113 | 1.887 ± 0.239 | 4.265 ± 0.34 | 4.038 ± 0.374 | 4.34 ± 0.459 | 1.283 ± 0.24 | 2.529 ± 0.312 | 0.0 ± 0.0
Glu | 4.944 ± 0.588 | 0.793 ± 0.152 | 4.265 ± 0.422 | 5.51 ± 0.535 | 2.831 ± 0.33 | 4.152 ± 0.435 | 1.623 ± 0.25 | 4.491 ± 0.477 | 5.058 ± 0.593 | 5.699 ± 0.45 | 2.868 ± 0.305 | 3.435 ± 0.321 | 1.698 ± 0.29 | 2.378 ± 0.345 | 2.755 ± 0.336 | 3.85 ± 0.35 | 3.397 ± 0.332 | 4.378 ± 0.333 | 0.755 ± 0.183 | 2.567 ± 0.316 | 0.0 ± 0.0
Phe | 2.529 ± 0.393 | 0.566 ± 0.145 | 3.208 ± 0.333 | 3.359 ± 0.356 | 1.321 ± 0.203 | 2.982 ± 0.33 | 1.019 ± 0.211 | 2.642 ± 0.32 | 3.661 ± 0.356 | 2.906 ± 0.346 | 1.095 ± 0.197 | 2.567 ± 0.379 | 0.981 ± 0.215 | 1.321 ± 0.214 | 1.585 ± 0.218 | 2.982 ± 0.298 | 2.68 ± 0.386 | 2.604 ± 0.298 | 0.377 ± 0.115 | 2.151 ± 0.313 | 0.0 ± 0.0
Gly | 4.605 ± 0.469 | 1.132 ± 0.204 | 3.17 ± 0.345 | 4.34 ± 0.513 | 3.359 ± 0.336 | 4.152 ± 0.524 | 1.283 ± 0.211 | 4.642 ± 0.427 | 6.077 ± 0.528 | 4.718 ± 0.433 | 0.868 ± 0.195 | 3.284 ± 0.443 | 0.075 ± 0.046 | 2.0 ± 0.307 | 2.453 ± 0.308 | 4.378 ± 0.53 | 3.774 ± 0.522 | 5.095 ± 0.389 | 0.906 ± 0.173 | 3.019 ± 0.325 | 0.0 ± 0.0
His | 0.981 ± 0.203 | 0.604 ± 0.155 | 1.17 ± 0.195 | 1.132 ± 0.216 | 0.906 ± 0.196 | 0.868 ± 0.18 | 0.528 ± 0.152 | 1.132 ± 0.2 | 1.283 ± 0.237 | 1.849 ± 0.24 | 0.604 ± 0.15 | 1.019 ± 0.211 | 0.755 ± 0.163 | 0.453 ± 0.14 | 0.868 ± 0.142 | 1.547 ± 0.376 | 1.434 ± 0.361 | 1.321 ± 0.236 | 0.264 ± 0.135 | 0.944 ± 0.208 | 0.0 ± 0.0
Ile | 4.152 ± 0.393 | 1.057 ± 0.217 | 4.227 ± 0.338 | 4.227 ± 0.469 | 2.491 ± 0.279 | 3.321 ± 0.359 | 1.057 ± 0.18 | 3.623 ± 0.35 | 5.36 ± 0.522 | 4.114 ± 0.419 | 1.812 ± 0.305 | 3.51 ± 0.331 | 2.076 ± 0.322 | 1.887 ± 0.263 | 2.642 ± 0.336 | 3.925 ± 0.371 | 4.152 ± 0.439 | 3.699 ± 0.399 | 0.717 ± 0.144 | 2.944 ± 0.376 | 0.0 ± 0.0
Lys | 5.963 ± 0.56 | 1.057 ± 0.205 | 4.982 ± 0.503 | 6.303 ± 0.532 | 2.567 ± 0.358 | 5.095 ± 0.425 | 1.359 ± 0.215 | 4.529 ± 0.382 | 5.737 ± 0.595 | 6.19 ± 0.435 | 2.567 ± 0.285 | 3.85 ± 0.369 | 2.68 ± 0.321 | 3.057 ± 0.356 | 3.812 ± 0.373 | 4.756 ± 0.465 | 5.322 ± 0.491 | 6.567 ± 0.463 | 0.642 ± 0.154 | 2.717 ± 0.419 | 0.0 ± 0.0
Leu | 5.661 ± 0.469 | 1.17 ± 0.186 | 5.435 ± 0.44 | 5.963 ± 0.455 | 2.868 ± 0.332 | 3.925 ± 0.393 | 1.698 ± 0.278 | 3.435 ± 0.309 | 6.567 ± 0.507 | 5.85 ± 0.435 | 2.642 ± 0.319 | 4.114 ± 0.348 | 3.246 ± 0.359 | 3.208 ± 0.369 | 3.85 ± 0.388 | 5.737 ± 0.46 | 5.36 ± 0.409 | 4.303 ± 0.457 | 0.906 ± 0.191 | 3.17 ± 0.337 | 0.0 ± 0.0
Met | 2.114 ± 0.292 | 0.377 ± 0.122 | 1.283 ± 0.202 | 1.774 ± 0.25 | 1.396 ± 0.248 | 1.623 ± 0.231 | 0.377 ± 0.11 | 1.812 ± 0.299 | 2.944 ± 0.363 | 2.416 ± 0.291 | 0.642 ± 0.177 | 1.434 ± 0.213 | 0.868 ± 0.209 | 1.547 ± 0.307 | 1.51 ± 0.239 | 2.189 ± 0.265 | 2.0 ± 0.269 | 1.321 ± 0.204 | 0.302 ± 0.094 | 0.944 ± 0.176 | 0.0 ± 0.0
Asn | 3.963 ± 0.417 | 0.793 ± 0.19 | 2.567 ± 0.322 | 2.567 ± 0.275 | 2.378 ± 0.289 | 4.189 ± 0.45 | 1.321 ± 0.211 | 3.774 ± 0.449 | 3.812 ± 0.431 | 4.907 ± 0.403 | 1.208 ± 0.241 | 3.057 ± 0.413 | 1.963 ± 0.256 | 1.51 ± 0.22 | 2.227 ± 0.262 | 3.51 ± 0.365 | 2.567 ± 0.499 | 3.586 ± 0.347 | 0.604 ± 0.137 | 2.114 ± 0.318 | 0.0 ± 0.0
Pro | 1.585 ± 0.218 | 0.415 ± 0.115 | 2.114 ± 0.318 | 2.831 ± 0.315 | 2.0 ± 0.284 | 0.226 ± 0.112 | 0.642 ± 0.194 | 1.359 ± 0.199 | 2.453 ± 0.319 | 2.227 ± 0.292 | 1.132 ± 0.211 | 1.547 ± 0.253 | 0.944 ± 0.208 | 0.981 ± 0.196 | 1.057 ± 0.189 | 2.416 ± 0.336 | 2.529 ± 0.281 | 2.453 ± 0.228 | 0.189 ± 0.085 | 1.208 ± 0.219 | 0.0 ± 0.0
Gln | 2.378 ± 0.352 | 0.453 ± 0.132 | 1.396 ± 0.206 | 2.34 ± 0.326 | 1.321 ± 0.2 | 1.774 ± 0.259 | 0.491 ± 0.135 | 2.491 ± 0.3 | 2.378 ± 0.214 | 3.019 ± 0.399 | 1.095 ± 0.237 | 1.849 ± 0.271 | 1.095 ± 0.174 | 1.472 ± 0.267 | 1.698 ± 0.246 | 2.189 ± 0.234 | 1.925 ± 0.255 | 2.604 ± 0.353 | 0.566 ± 0.175 | 1.774 ± 0.257 | 0.0 ± 0.0
Arg | 2.491 ± 0.292 | 0.83 ± 0.185 | 2.906 ± 0.347 | 2.453 ± 0.32 | 1.396 ± 0.3 | 2.793 ± 0.361 | 0.793 ± 0.166 | 2.944 ± 0.281 | 3.17 ± 0.346 | 3.435 ± 0.392 | 1.283 ± 0.242 | 2.378 ± 0.236 | 1.283 ± 0.213 | 1.547 ± 0.224 | 2.302 ± 0.312 | 2.0 ± 0.266 | 2.604 ± 0.334 | 2.755 ± 0.283 | 0.528 ± 0.16 | 2.114 ± 0.312 | 0.0 ± 0.0
Ser | 4.076 ± 0.559 | 0.906 ± 0.186 | 4.68 ± 0.49 | 4.491 ± 0.39 | 3.133 ± 0.373 | 3.963 ± 0.425 | 1.51 ± 0.266 | 3.623 ± 0.417 | 4.567 ± 0.381 | 5.661 ± 0.411 | 1.585 ± 0.202 | 3.435 ± 0.324 | 1.849 ± 0.289 | 2.604 ± 0.336 | 2.717 ± 0.29 | 4.378 ± 0.338 | 3.51 ± 0.457 | 5.36 ± 0.457 | 0.868 ± 0.171 | 3.321 ± 0.331 | 0.0 ± 0.0
Thr | 4.152 ± 0.418 | 0.642 ± 0.176 | 3.699 ± 0.341 | 2.944 ± 0.343 | 3.17 ± 0.343 | 5.322 ± 0.568 | 1.321 ± 0.284 | 3.85 ± 0.458 | 4.944 ± 0.576 | 4.944 ± 0.402 | 0.944 ± 0.196 | 2.755 ± 0.401 | 2.416 ± 0.303 | 2.529 ± 0.304 | 2.453 ± 0.29 | 4.227 ± 0.504 | 3.888 ± 0.613 | 5.51 ± 0.544 | 0.604 ± 0.148 | 3.057 ± 0.326 | 0.0 ± 0.0
Val | 5.51 ± 0.532 | 1.132 ± 0.187 | 4.642 ± 0.439 | 4.114 ± 0.422 | 2.416 ± 0.339 | 4.416 ± 0.444 | 1.17 ± 0.244 | 4.378 ± 0.425 | 6.19 ± 0.428 | 4.416 ± 0.405 | 2.265 ± 0.303 | 3.888 ± 0.376 | 2.302 ± 0.348 | 2.076 ± 0.317 | 2.793 ± 0.344 | 5.133 ± 0.444 | 4.869 ± 0.455 | 4.907 ± 0.429 | 0.566 ± 0.155 | 2.944 ± 0.34 | 0.0 ± 0.0
Trp | 0.415 ± 0.143 | 0.189 ± 0.085 | 0.868 ± 0.166 | 0.83 ± 0.154 | 0.528 ± 0.144 | 0.604 ± 0.151 | 0.038 ± 0.037 | 0.566 ± 0.153 | 1.057 ± 0.205 | 1.057 ± 0.201 | 0.415 ± 0.152 | 0.906 ± 0.205 | 0.264 ± 0.098 | 0.415 ± 0.132 | 0.528 ± 0.117 | 0.755 ± 0.178 | 0.642 ± 0.139 | 0.717 ± 0.151 | 0.226 ± 0.097 | 0.604 ± 0.142 | 0.0 ± 0.0
Tyr | 2.567 ± 0.342 | 0.906 ± 0.182 | 2.604 ± 0.289 | 2.831 ± 0.298 | 1.849 ± 0.244 | 3.057 ± 0.326 | 0.83 ± 0.186 | 2.491 ± 0.305 | 3.586 ± 0.36 | 3.623 ± 0.367 | 1.208 ± 0.215 | 1.887 ± 0.223 | 1.698 ± 0.283 | 1.887 ± 0.294 | 1.623 ± 0.251 | 2.604 ± 0.314 | 3.661 ± 0.387 | 2.302 ± 0.266 | 0.453 ± 0.13 | 1.849 ± 0.248 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 140 proteins (26496 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
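A protein-level bootstrap can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the statistic is recomputed for each of 100 replicates, and the error is taken to be the standard deviation across replicates. The page does not say which spread measure it reports, and the function names and toy sequences here are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not the phage data.
toy = ["MKVLAAKV", "MKKVGA", "MAVLKV", "MGGKA"]
point = dipeptide_permille(toy, "KV")
error = bootstrap_error(toy, "KV")
print(f"KV: {point:.3f} ± {error:.3f} per mille")
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster in a few proteins contribute a correspondingly larger error.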

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski