Amino acid dipepetide frequency for Gordonia phage Phendrix

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.781AlaAla: 9.781 ± 0.835
0.844AlaCys: 0.844 ± 0.177
5.094AlaAsp: 5.094 ± 0.371
5.327AlaGlu: 5.327 ± 0.589
2.766AlaPhe: 2.766 ± 0.316
6.346AlaGly: 6.346 ± 0.621
2.242AlaHis: 2.242 ± 0.266
5.327AlaIle: 5.327 ± 0.567
4.803AlaLys: 4.803 ± 0.531
6.87AlaLeu: 6.87 ± 0.72
2.882AlaMet: 2.882 ± 0.268
3.086AlaAsn: 3.086 ± 0.348
4.279AlaPro: 4.279 ± 0.377
3.231AlaGln: 3.231 ± 0.452
5.851AlaArg: 5.851 ± 0.409
5.764AlaSer: 5.764 ± 0.461
5.415AlaThr: 5.415 ± 0.612
5.851AlaVal: 5.851 ± 0.322
1.747AlaTrp: 1.747 ± 0.191
2.795AlaTyr: 2.795 ± 0.226
0.0AlaXaa: 0.0 ± 0.0
Cys
0.99CysAla: 0.99 ± 0.173
0.233CysCys: 0.233 ± 0.087
0.873CysAsp: 0.873 ± 0.181
0.64CysGlu: 0.64 ± 0.143
0.437CysPhe: 0.437 ± 0.124
1.892CysGly: 1.892 ± 0.266
0.204CysHis: 0.204 ± 0.082
0.437CysIle: 0.437 ± 0.113
0.32CysLys: 0.32 ± 0.108
0.611CysLeu: 0.611 ± 0.154
0.116CysMet: 0.116 ± 0.054
0.349CysAsn: 0.349 ± 0.097
0.728CysPro: 0.728 ± 0.188
0.233CysGln: 0.233 ± 0.072
0.64CysArg: 0.64 ± 0.134
0.757CysSer: 0.757 ± 0.182
0.99CysThr: 0.99 ± 0.209
0.873CysVal: 0.873 ± 0.174
0.116CysTrp: 0.116 ± 0.053
0.582CysTyr: 0.582 ± 0.149
0.0CysXaa: 0.0 ± 0.0
Asp
6.55AspAla: 6.55 ± 0.46
0.757AspCys: 0.757 ± 0.168
5.91AspAsp: 5.91 ± 0.666
4.949AspGlu: 4.949 ± 0.442
2.125AspPhe: 2.125 ± 0.262
4.6AspGly: 4.6 ± 0.375
1.718AspHis: 1.718 ± 0.238
3.173AspIle: 3.173 ± 0.327
3.581AspLys: 3.581 ± 0.364
5.036AspLeu: 5.036 ± 0.382
2.125AspMet: 2.125 ± 0.232
2.678AspAsn: 2.678 ± 0.312
4.25AspPro: 4.25 ± 0.281
1.863AspGln: 1.863 ± 0.231
3.784AspArg: 3.784 ± 0.298
3.319AspSer: 3.319 ± 0.35
3.93AspThr: 3.93 ± 0.414
5.269AspVal: 5.269 ± 0.458
1.659AspTrp: 1.659 ± 0.226
2.096AspTyr: 2.096 ± 0.298
0.0AspXaa: 0.0 ± 0.0
Glu
6.026GluAla: 6.026 ± 0.555
0.786GluCys: 0.786 ± 0.162
4.949GluAsp: 4.949 ± 0.444
5.618GluGlu: 5.618 ± 0.687
2.504GluPhe: 2.504 ± 0.279
4.046GluGly: 4.046 ± 0.442
1.456GluHis: 1.456 ± 0.223
3.435GluIle: 3.435 ± 0.287
3.348GluLys: 3.348 ± 0.359
5.211GluLeu: 5.211 ± 0.514
1.892GluMet: 1.892 ± 0.299
2.038GluAsn: 2.038 ± 0.258
1.98GluPro: 1.98 ± 0.244
2.387GluGln: 2.387 ± 0.281
4.338GluArg: 4.338 ± 0.351
3.464GluSer: 3.464 ± 0.296
3.843GluThr: 3.843 ± 0.366
4.105GluVal: 4.105 ± 0.375
1.863GluTrp: 1.863 ± 0.216
2.067GluTyr: 2.067 ± 0.265
0.0GluXaa: 0.0 ± 0.0
Phe
2.998PheAla: 2.998 ± 0.308
0.233PheCys: 0.233 ± 0.073
2.474PheAsp: 2.474 ± 0.295
1.776PheGlu: 1.776 ± 0.239
1.019PhePhe: 1.019 ± 0.167
2.358PheGly: 2.358 ± 0.269
0.64PheHis: 0.64 ± 0.162
1.543PheIle: 1.543 ± 0.221
1.95PheLys: 1.95 ± 0.304
2.3PheLeu: 2.3 ± 0.249
0.815PheMet: 0.815 ± 0.168
1.368PheAsn: 1.368 ± 0.225
1.456PhePro: 1.456 ± 0.175
0.67PheGln: 0.67 ± 0.125
1.194PheArg: 1.194 ± 0.201
1.514PheSer: 1.514 ± 0.192
2.096PheThr: 2.096 ± 0.21
2.474PheVal: 2.474 ± 0.26
0.408PheTrp: 0.408 ± 0.122
1.077PheTyr: 1.077 ± 0.176
0.0PheXaa: 0.0 ± 0.0
Gly
5.618GlyAla: 5.618 ± 0.471
0.699GlyCys: 0.699 ± 0.142
4.716GlyAsp: 4.716 ± 0.412
4.629GlyGlu: 4.629 ± 0.584
2.445GlyPhe: 2.445 ± 0.231
4.891GlyGly: 4.891 ± 0.634
1.776GlyHis: 1.776 ± 0.254
3.726GlyIle: 3.726 ± 0.399
4.046GlyLys: 4.046 ± 0.42
4.483GlyLeu: 4.483 ± 0.431
1.98GlyMet: 1.98 ± 0.233
2.242GlyAsn: 2.242 ± 0.274
2.969GlyPro: 2.969 ± 0.298
1.659GlyGln: 1.659 ± 0.211
4.308GlyArg: 4.308 ± 0.375
4.803GlySer: 4.803 ± 0.452
5.269GlyThr: 5.269 ± 0.628
6.084GlyVal: 6.084 ± 0.63
1.659GlyTrp: 1.659 ± 0.235
2.504GlyTyr: 2.504 ± 0.262
0.0GlyXaa: 0.0 ± 0.0
His
2.125HisAla: 2.125 ± 0.275
0.378HisCys: 0.378 ± 0.112
1.718HisAsp: 1.718 ± 0.283
1.659HisGlu: 1.659 ± 0.219
0.728HisPhe: 0.728 ± 0.143
2.038HisGly: 2.038 ± 0.263
0.699HisHis: 0.699 ± 0.148
1.077HisIle: 1.077 ± 0.195
1.281HisLys: 1.281 ± 0.231
1.543HisLeu: 1.543 ± 0.246
0.495HisMet: 0.495 ± 0.109
0.611HisAsn: 0.611 ± 0.13
1.31HisPro: 1.31 ± 0.221
0.786HisGln: 0.786 ± 0.148
2.387HisArg: 2.387 ± 0.301
1.106HisSer: 1.106 ± 0.186
1.63HisThr: 1.63 ± 0.224
1.747HisVal: 1.747 ± 0.26
0.582HisTrp: 0.582 ± 0.11
0.699HisTyr: 0.699 ± 0.167
0.0HisXaa: 0.0 ± 0.0
Ile
5.298IleAla: 5.298 ± 0.501
0.699IleCys: 0.699 ± 0.158
4.279IleAsp: 4.279 ± 0.415
3.755IleGlu: 3.755 ± 0.367
1.31IlePhe: 1.31 ± 0.186
3.697IleGly: 3.697 ± 0.451
1.106IleHis: 1.106 ± 0.193
2.183IleIle: 2.183 ± 0.266
2.853IleLys: 2.853 ± 0.437
2.998IleLeu: 2.998 ± 0.382
1.194IleMet: 1.194 ± 0.192
1.688IleAsn: 1.688 ± 0.199
1.863IlePro: 1.863 ± 0.221
1.601IleGln: 1.601 ± 0.199
3.26IleArg: 3.26 ± 0.286
3.144IleSer: 3.144 ± 0.43
2.998IleThr: 2.998 ± 0.308
3.784IleVal: 3.784 ± 0.317
0.873IleTrp: 0.873 ± 0.145
0.961IleTyr: 0.961 ± 0.166
0.0IleXaa: 0.0 ± 0.0
Lys
4.221LysAla: 4.221 ± 0.575
0.466LysCys: 0.466 ± 0.114
3.057LysAsp: 3.057 ± 0.333
3.348LysGlu: 3.348 ± 0.31
1.397LysPhe: 1.397 ± 0.167
2.474LysGly: 2.474 ± 0.322
1.543LysHis: 1.543 ± 0.214
2.911LysIle: 2.911 ± 0.245
4.832LysLys: 4.832 ± 0.637
3.843LysLeu: 3.843 ± 0.437
1.805LysMet: 1.805 ± 0.208
2.096LysAsn: 2.096 ± 0.359
3.115LysPro: 3.115 ± 0.295
1.747LysGln: 1.747 ± 0.232
2.998LysArg: 2.998 ± 0.329
3.377LysSer: 3.377 ± 0.328
3.61LysThr: 3.61 ± 0.29
3.522LysVal: 3.522 ± 0.34
1.485LysTrp: 1.485 ± 0.201
1.863LysTyr: 1.863 ± 0.307
0.0LysXaa: 0.0 ± 0.0
Leu
6.288LeuAla: 6.288 ± 0.492
0.815LeuCys: 0.815 ± 0.168
5.211LeuAsp: 5.211 ± 0.367
5.706LeuGlu: 5.706 ± 0.482
1.834LeuPhe: 1.834 ± 0.216
5.269LeuGly: 5.269 ± 0.508
1.63LeuHis: 1.63 ± 0.238
3.319LeuIle: 3.319 ± 0.318
3.435LeuLys: 3.435 ± 0.33
5.56LeuLeu: 5.56 ± 0.494
1.98LeuMet: 1.98 ± 0.228
2.387LeuAsn: 2.387 ± 0.297
3.319LeuPro: 3.319 ± 0.284
2.62LeuGln: 2.62 ± 0.263
4.192LeuArg: 4.192 ± 0.343
4.949LeuSer: 4.949 ± 0.423
4.832LeuThr: 4.832 ± 0.428
5.007LeuVal: 5.007 ± 0.338
1.339LeuTrp: 1.339 ± 0.204
2.707LeuTyr: 2.707 ± 0.306
0.0LeuXaa: 0.0 ± 0.0
Met
1.98MetAla: 1.98 ± 0.257
0.233MetCys: 0.233 ± 0.082
1.834MetAsp: 1.834 ± 0.239
1.397MetGlu: 1.397 ± 0.169
0.99MetPhe: 0.99 ± 0.174
1.514MetGly: 1.514 ± 0.201
1.281MetHis: 1.281 ± 0.23
1.863MetIle: 1.863 ± 0.245
1.252MetLys: 1.252 ± 0.179
1.892MetLeu: 1.892 ± 0.223
0.786MetMet: 0.786 ± 0.137
0.902MetAsn: 0.902 ± 0.137
1.397MetPro: 1.397 ± 0.172
0.902MetGln: 0.902 ± 0.161
2.183MetArg: 2.183 ± 0.263
2.358MetSer: 2.358 ± 0.295
2.067MetThr: 2.067 ± 0.244
1.514MetVal: 1.514 ± 0.225
0.495MetTrp: 0.495 ± 0.123
1.194MetTyr: 1.194 ± 0.18
0.0MetXaa: 0.0 ± 0.0
Asn
3.581AsnAla: 3.581 ± 0.318
0.262AsnCys: 0.262 ± 0.092
2.678AsnAsp: 2.678 ± 0.323
1.572AsnGlu: 1.572 ± 0.221
1.194AsnPhe: 1.194 ± 0.163
2.882AsnGly: 2.882 ± 0.29
0.932AsnHis: 0.932 ± 0.166
1.98AsnIle: 1.98 ± 0.244
2.009AsnLys: 2.009 ± 0.255
2.3AsnLeu: 2.3 ± 0.305
0.961AsnMet: 0.961 ± 0.167
1.252AsnAsn: 1.252 ± 0.195
2.562AsnPro: 2.562 ± 0.227
1.135AsnGln: 1.135 ± 0.171
2.94AsnArg: 2.94 ± 0.319
1.95AsnSer: 1.95 ± 0.257
2.504AsnThr: 2.504 ± 0.289
2.242AsnVal: 2.242 ± 0.372
0.495AsnTrp: 0.495 ± 0.103
0.873AsnTyr: 0.873 ± 0.15
0.0AsnXaa: 0.0 ± 0.0
Pro
4.367ProAla: 4.367 ± 0.408
0.495ProCys: 0.495 ± 0.12
3.726ProAsp: 3.726 ± 0.403
3.697ProGlu: 3.697 ± 0.392
1.514ProPhe: 1.514 ± 0.284
3.29ProGly: 3.29 ± 0.322
0.99ProHis: 0.99 ± 0.176
1.863ProIle: 1.863 ± 0.248
2.387ProLys: 2.387 ± 0.285
3.348ProLeu: 3.348 ± 0.282
1.281ProMet: 1.281 ± 0.193
1.718ProAsn: 1.718 ± 0.234
2.416ProPro: 2.416 ± 0.349
1.601ProGln: 1.601 ± 0.248
2.795ProArg: 2.795 ± 0.289
3.522ProSer: 3.522 ± 0.357
2.795ProThr: 2.795 ± 0.265
4.017ProVal: 4.017 ± 0.363
0.786ProTrp: 0.786 ± 0.155
1.31ProTyr: 1.31 ± 0.187
0.0ProXaa: 0.0 ± 0.0
Gln
2.795GlnAla: 2.795 ± 0.316
0.233GlnCys: 0.233 ± 0.082
1.31GlnAsp: 1.31 ± 0.162
1.485GlnGlu: 1.485 ± 0.207
0.961GlnPhe: 0.961 ± 0.143
1.921GlnGly: 1.921 ± 0.241
0.64GlnHis: 0.64 ± 0.135
1.776GlnIle: 1.776 ± 0.203
1.339GlnLys: 1.339 ± 0.191
2.329GlnLeu: 2.329 ± 0.225
0.99GlnMet: 0.99 ± 0.181
1.164GlnAsn: 1.164 ± 0.254
1.019GlnPro: 1.019 ± 0.176
1.164GlnGln: 1.164 ± 0.158
2.911GlnArg: 2.911 ± 0.296
1.776GlnSer: 1.776 ± 0.221
2.329GlnThr: 2.329 ± 0.22
2.591GlnVal: 2.591 ± 0.266
0.64GlnTrp: 0.64 ± 0.124
1.135GlnTyr: 1.135 ± 0.206
0.0GlnXaa: 0.0 ± 0.0
Arg
5.618ArgAla: 5.618 ± 0.454
1.194ArgCys: 1.194 ± 0.21
4.25ArgAsp: 4.25 ± 0.266
4.803ArgGlu: 4.803 ± 0.4
2.009ArgPhe: 2.009 ± 0.23
3.959ArgGly: 3.959 ± 0.299
1.601ArgHis: 1.601 ± 0.251
3.29ArgIle: 3.29 ± 0.3
3.581ArgLys: 3.581 ± 0.385
4.891ArgLeu: 4.891 ± 0.386
2.067ArgMet: 2.067 ± 0.307
2.387ArgAsn: 2.387 ± 0.303
3.144ArgPro: 3.144 ± 0.277
2.3ArgGln: 2.3 ± 0.26
4.308ArgArg: 4.308 ± 0.508
2.824ArgSer: 2.824 ± 0.329
3.988ArgThr: 3.988 ± 0.384
4.745ArgVal: 4.745 ± 0.353
1.368ArgTrp: 1.368 ± 0.22
2.212ArgTyr: 2.212 ± 0.237
0.0ArgXaa: 0.0 ± 0.0
Ser
5.444SerAla: 5.444 ± 0.455
0.815SerCys: 0.815 ± 0.184
4.192SerAsp: 4.192 ± 0.334
3.552SerGlu: 3.552 ± 0.338
1.747SerPhe: 1.747 ± 0.229
5.589SerGly: 5.589 ± 0.659
1.135SerHis: 1.135 ± 0.211
2.649SerIle: 2.649 ± 0.323
2.998SerLys: 2.998 ± 0.307
4.454SerLeu: 4.454 ± 0.273
1.863SerMet: 1.863 ± 0.241
2.562SerAsn: 2.562 ± 0.236
2.678SerPro: 2.678 ± 0.255
1.31SerGln: 1.31 ± 0.187
3.843SerArg: 3.843 ± 0.365
4.57SerSer: 4.57 ± 0.464
4.25SerThr: 4.25 ± 0.351
4.017SerVal: 4.017 ± 0.358
1.892SerTrp: 1.892 ± 0.207
1.543SerTyr: 1.543 ± 0.261
0.0SerXaa: 0.0 ± 0.0
Thr
5.356ThrAla: 5.356 ± 0.493
0.99ThrCys: 0.99 ± 0.186
3.959ThrAsp: 3.959 ± 0.354
3.639ThrGlu: 3.639 ± 0.396
1.834ThrPhe: 1.834 ± 0.24
5.356ThrGly: 5.356 ± 0.532
1.426ThrHis: 1.426 ± 0.207
3.086ThrIle: 3.086 ± 0.371
2.911ThrLys: 2.911 ± 0.246
5.356ThrLeu: 5.356 ± 0.408
1.921ThrMet: 1.921 ± 0.228
2.212ThrAsn: 2.212 ± 0.227
4.105ThrPro: 4.105 ± 0.478
1.368ThrGln: 1.368 ± 0.219
4.076ThrArg: 4.076 ± 0.401
4.163ThrSer: 4.163 ± 0.442
5.386ThrThr: 5.386 ± 0.527
5.56ThrVal: 5.56 ± 0.513
1.339ThrTrp: 1.339 ± 0.19
2.038ThrTyr: 2.038 ± 0.255
0.0ThrXaa: 0.0 ± 0.0
Val
6.958ValAla: 6.958 ± 0.697
1.019ValCys: 1.019 ± 0.218
5.473ValAsp: 5.473 ± 0.424
4.425ValGlu: 4.425 ± 0.376
2.504ValPhe: 2.504 ± 0.255
3.901ValGly: 3.901 ± 0.361
1.98ValHis: 1.98 ± 0.278
3.377ValIle: 3.377 ± 0.342
3.755ValLys: 3.755 ± 0.335
4.92ValLeu: 4.92 ± 0.369
1.776ValMet: 1.776 ± 0.234
3.086ValAsn: 3.086 ± 0.268
3.348ValPro: 3.348 ± 0.314
2.3ValGln: 2.3 ± 0.229
4.338ValArg: 4.338 ± 0.332
4.221ValSer: 4.221 ± 0.422
4.978ValThr: 4.978 ± 0.505
5.764ValVal: 5.764 ± 0.423
1.543ValTrp: 1.543 ± 0.232
3.057ValTyr: 3.057 ± 0.377
0.0ValXaa: 0.0 ± 0.0
Trp
1.31TrpAla: 1.31 ± 0.193
0.349TrpCys: 0.349 ± 0.103
1.252TrpAsp: 1.252 ± 0.177
1.368TrpGlu: 1.368 ± 0.215
0.553TrpPhe: 0.553 ± 0.112
1.718TrpGly: 1.718 ± 0.24
0.67TrpHis: 0.67 ± 0.135
1.164TrpIle: 1.164 ± 0.204
1.281TrpLys: 1.281 ± 0.16
2.096TrpLeu: 2.096 ± 0.225
0.408TrpMet: 0.408 ± 0.11
1.019TrpAsn: 1.019 ± 0.163
0.961TrpPro: 0.961 ± 0.186
0.611TrpGln: 0.611 ± 0.132
1.194TrpArg: 1.194 ± 0.172
1.339TrpSer: 1.339 ± 0.213
1.31TrpThr: 1.31 ± 0.204
1.339TrpVal: 1.339 ± 0.177
0.553TrpTrp: 0.553 ± 0.123
1.048TrpTyr: 1.048 ± 0.18
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.998TyrAla: 2.998 ± 0.302
0.524TyrCys: 0.524 ± 0.134
2.416TyrAsp: 2.416 ± 0.259
2.009TyrGlu: 2.009 ± 0.253
0.495TyrPhe: 0.495 ± 0.131
2.766TyrGly: 2.766 ± 0.302
0.932TyrHis: 0.932 ± 0.16
1.368TyrIle: 1.368 ± 0.246
1.747TyrLys: 1.747 ± 0.236
2.358TyrLeu: 2.358 ± 0.319
0.67TyrMet: 0.67 ± 0.134
1.456TyrAsn: 1.456 ± 0.183
1.019TyrPro: 1.019 ± 0.185
1.019TyrGln: 1.019 ± 0.181
3.028TyrArg: 3.028 ± 0.342
2.183TyrSer: 2.183 ± 0.252
1.863TyrThr: 1.863 ± 0.272
2.212TyrVal: 2.212 ± 0.238
0.699TyrTrp: 0.699 ± 0.164
1.31TyrTyr: 1.31 ± 0.198
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 188 proteins (34352 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski