Amino acid dipepetide frequency for Escherichia phage FFH2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.05AlaAla: 6.05 ± 0.496
1.244AlaCys: 1.244 ± 0.192
4.185AlaAsp: 4.185 ± 0.312
4.95AlaGlu: 4.95 ± 0.368
2.846AlaPhe: 2.846 ± 0.211
4.974AlaGly: 4.974 ± 0.417
1.1AlaHis: 1.1 ± 0.123
3.922AlaIle: 3.922 ± 0.31
5.333AlaLys: 5.333 ± 0.45
5.285AlaLeu: 5.285 ± 0.35
2.344AlaMet: 2.344 ± 0.238
3.731AlaAsn: 3.731 ± 0.307
2.152AlaPro: 2.152 ± 0.23
2.128AlaGln: 2.128 ± 0.283
3.181AlaArg: 3.181 ± 0.258
3.3AlaSer: 3.3 ± 0.308
3.898AlaThr: 3.898 ± 0.425
4.472AlaVal: 4.472 ± 0.288
1.363AlaTrp: 1.363 ± 0.183
3.372AlaTyr: 3.372 ± 0.279
0.0AlaXaa: 0.0 ± 0.0
Cys
0.933CysAla: 0.933 ± 0.179
0.383CysCys: 0.383 ± 0.092
0.933CysAsp: 0.933 ± 0.151
1.052CysGlu: 1.052 ± 0.171
0.598CysPhe: 0.598 ± 0.119
1.339CysGly: 1.339 ± 0.172
0.454CysHis: 0.454 ± 0.111
0.909CysIle: 0.909 ± 0.171
1.004CysLys: 1.004 ± 0.137
1.148CysLeu: 1.148 ± 0.168
0.478CysMet: 0.478 ± 0.1
0.837CysAsn: 0.837 ± 0.151
0.837CysPro: 0.837 ± 0.128
0.526CysGln: 0.526 ± 0.114
1.028CysArg: 1.028 ± 0.157
0.909CysSer: 0.909 ± 0.139
0.765CysThr: 0.765 ± 0.131
1.315CysVal: 1.315 ± 0.168
0.335CysTrp: 0.335 ± 0.095
0.694CysTyr: 0.694 ± 0.133
0.0CysXaa: 0.0 ± 0.0
Asp
4.281AspAla: 4.281 ± 0.255
1.148AspCys: 1.148 ± 0.162
4.209AspAsp: 4.209 ± 0.416
4.305AspGlu: 4.305 ± 0.317
3.276AspPhe: 3.276 ± 0.341
5.118AspGly: 5.118 ± 0.365
1.554AspHis: 1.554 ± 0.178
4.065AspIle: 4.065 ± 0.324
4.807AspLys: 4.807 ± 0.315
5.907AspLeu: 5.907 ± 0.364
1.961AspMet: 1.961 ± 0.222
3.563AspAsn: 3.563 ± 0.287
3.587AspPro: 3.587 ± 0.361
1.794AspGln: 1.794 ± 0.226
2.774AspArg: 2.774 ± 0.282
3.539AspSer: 3.539 ± 0.311
3.181AspThr: 3.181 ± 0.249
4.711AspVal: 4.711 ± 0.307
1.196AspTrp: 1.196 ± 0.16
2.822AspTyr: 2.822 ± 0.291
0.0AspXaa: 0.0 ± 0.0
Glu
4.831GluAla: 4.831 ± 0.327
0.813GluCys: 0.813 ± 0.153
4.95GluAsp: 4.95 ± 0.444
6.816GluGlu: 6.816 ± 0.681
2.918GluPhe: 2.918 ± 0.218
5.476GluGly: 5.476 ± 0.415
1.244GluHis: 1.244 ± 0.193
4.352GluIle: 4.352 ± 0.35
4.352GluLys: 4.352 ± 0.377
6.289GluLeu: 6.289 ± 0.407
2.439GluMet: 2.439 ± 0.253
3.205GluAsn: 3.205 ± 0.291
1.985GluPro: 1.985 ± 0.225
1.937GluGln: 1.937 ± 0.246
3.205GluArg: 3.205 ± 0.295
3.468GluSer: 3.468 ± 0.287
2.989GluThr: 2.989 ± 0.28
5.333GluVal: 5.333 ± 0.328
1.315GluTrp: 1.315 ± 0.197
2.918GluTyr: 2.918 ± 0.305
0.0GluXaa: 0.0 ± 0.0
Phe
2.368PheAla: 2.368 ± 0.215
0.67PheCys: 0.67 ± 0.139
3.396PheAsp: 3.396 ± 0.248
3.109PheGlu: 3.109 ± 0.299
1.554PhePhe: 1.554 ± 0.198
3.085PheGly: 3.085 ± 0.27
0.765PheHis: 0.765 ± 0.131
2.559PheIle: 2.559 ± 0.295
2.989PheLys: 2.989 ± 0.203
3.228PheLeu: 3.228 ± 0.297
1.435PheMet: 1.435 ± 0.202
2.391PheAsn: 2.391 ± 0.253
1.435PhePro: 1.435 ± 0.187
1.148PheGln: 1.148 ± 0.175
1.889PheArg: 1.889 ± 0.204
2.607PheSer: 2.607 ± 0.276
2.2PheThr: 2.2 ± 0.25
2.965PheVal: 2.965 ± 0.308
0.694PheTrp: 0.694 ± 0.114
2.176PheTyr: 2.176 ± 0.195
0.0PheXaa: 0.0 ± 0.0
Gly
4.687GlyAla: 4.687 ± 0.453
1.148GlyCys: 1.148 ± 0.164
5.022GlyAsp: 5.022 ± 0.308
4.95GlyGlu: 4.95 ± 0.302
3.348GlyPhe: 3.348 ± 0.334
5.165GlyGly: 5.165 ± 0.439
1.244GlyHis: 1.244 ± 0.175
4.065GlyIle: 4.065 ± 0.38
5.572GlyLys: 5.572 ± 0.355
4.998GlyLeu: 4.998 ± 0.34
1.817GlyMet: 1.817 ± 0.223
3.922GlyAsn: 3.922 ± 0.295
1.387GlyPro: 1.387 ± 0.208
2.128GlyGln: 2.128 ± 0.244
3.372GlyArg: 3.372 ± 0.264
3.707GlySer: 3.707 ± 0.316
4.137GlyThr: 4.137 ± 0.525
4.783GlyVal: 4.783 ± 0.3
1.363GlyTrp: 1.363 ± 0.174
3.778GlyTyr: 3.778 ± 0.291
0.0GlyXaa: 0.0 ± 0.0
His
0.98HisAla: 0.98 ± 0.15
0.478HisCys: 0.478 ± 0.104
1.1HisAsp: 1.1 ± 0.157
1.028HisGlu: 1.028 ± 0.16
0.861HisPhe: 0.861 ± 0.139
1.124HisGly: 1.124 ± 0.168
0.454HisHis: 0.454 ± 0.103
1.1HisIle: 1.1 ± 0.161
1.531HisLys: 1.531 ± 0.164
1.794HisLeu: 1.794 ± 0.218
0.502HisMet: 0.502 ± 0.111
0.957HisAsn: 0.957 ± 0.165
1.148HisPro: 1.148 ± 0.185
0.717HisGln: 0.717 ± 0.144
0.861HisArg: 0.861 ± 0.142
1.076HisSer: 1.076 ± 0.177
1.387HisThr: 1.387 ± 0.187
1.148HisVal: 1.148 ± 0.176
0.311HisTrp: 0.311 ± 0.097
1.124HisTyr: 1.124 ± 0.177
0.0HisXaa: 0.0 ± 0.0
Ile
4.065IleAla: 4.065 ± 0.291
0.98IleCys: 0.98 ± 0.164
4.065IleAsp: 4.065 ± 0.36
3.707IleGlu: 3.707 ± 0.271
2.081IlePhe: 2.081 ± 0.24
3.587IleGly: 3.587 ± 0.295
1.124IleHis: 1.124 ± 0.146
4.018IleIle: 4.018 ± 0.309
3.826IleLys: 3.826 ± 0.279
4.568IleLeu: 4.568 ± 0.421
1.578IleMet: 1.578 ± 0.178
3.109IleAsn: 3.109 ± 0.292
2.774IlePro: 2.774 ± 0.271
2.057IleGln: 2.057 ± 0.223
3.085IleArg: 3.085 ± 0.277
3.539IleSer: 3.539 ± 0.305
3.731IleThr: 3.731 ± 0.304
4.472IleVal: 4.472 ± 0.335
0.789IleTrp: 0.789 ± 0.138
2.391IleTyr: 2.391 ± 0.261
0.0IleXaa: 0.0 ± 0.0
Lys
5.668LysAla: 5.668 ± 0.474
1.172LysCys: 1.172 ± 0.158
4.902LysAsp: 4.902 ± 0.399
5.309LysGlu: 5.309 ± 0.435
2.822LysPhe: 2.822 ± 0.231
4.544LysGly: 4.544 ± 0.318
1.291LysHis: 1.291 ± 0.197
4.687LysIle: 4.687 ± 0.327
4.472LysLys: 4.472 ± 0.399
4.879LysLeu: 4.879 ± 0.33
2.272LysMet: 2.272 ± 0.231
3.037LysAsn: 3.037 ± 0.265
2.272LysPro: 2.272 ± 0.268
2.2LysGln: 2.2 ± 0.228
2.87LysArg: 2.87 ± 0.299
3.324LysSer: 3.324 ± 0.28
3.802LysThr: 3.802 ± 0.285
4.974LysVal: 4.974 ± 0.36
1.004LysTrp: 1.004 ± 0.148
2.918LysTyr: 2.918 ± 0.25
0.0LysXaa: 0.0 ± 0.0
Leu
5.524LeuAla: 5.524 ± 0.357
1.076LeuCys: 1.076 ± 0.154
5.787LeuAsp: 5.787 ± 0.374
6.457LeuGlu: 6.457 ± 0.541
2.702LeuPhe: 2.702 ± 0.3
4.496LeuGly: 4.496 ± 0.359
1.722LeuHis: 1.722 ± 0.21
4.281LeuIle: 4.281 ± 0.33
5.022LeuLys: 5.022 ± 0.36
5.118LeuLeu: 5.118 ± 0.449
2.272LeuMet: 2.272 ± 0.238
4.424LeuAsn: 4.424 ± 0.278
3.874LeuPro: 3.874 ± 0.291
2.678LeuGln: 2.678 ± 0.237
3.515LeuArg: 3.515 ± 0.318
4.113LeuSer: 4.113 ± 0.348
4.592LeuThr: 4.592 ± 0.361
5.046LeuVal: 5.046 ± 0.321
1.483LeuTrp: 1.483 ± 0.176
2.702LeuTyr: 2.702 ± 0.283
0.0LeuXaa: 0.0 ± 0.0
Met
2.535MetAla: 2.535 ± 0.235
0.454MetCys: 0.454 ± 0.122
1.435MetAsp: 1.435 ± 0.181
1.746MetGlu: 1.746 ± 0.229
1.1MetPhe: 1.1 ± 0.177
1.77MetGly: 1.77 ± 0.198
0.526MetHis: 0.526 ± 0.107
1.817MetIle: 1.817 ± 0.18
2.822MetLys: 2.822 ± 0.283
2.535MetLeu: 2.535 ± 0.279
0.933MetMet: 0.933 ± 0.16
1.028MetAsn: 1.028 ± 0.171
0.885MetPro: 0.885 ± 0.169
1.076MetGln: 1.076 ± 0.172
1.148MetArg: 1.148 ± 0.162
2.176MetSer: 2.176 ± 0.234
1.507MetThr: 1.507 ± 0.207
1.65MetVal: 1.65 ± 0.203
0.598MetTrp: 0.598 ± 0.109
0.861MetTyr: 0.861 ± 0.136
0.0MetXaa: 0.0 ± 0.0
Asn
3.826AsnAla: 3.826 ± 0.299
0.741AsnCys: 0.741 ± 0.123
3.037AsnAsp: 3.037 ± 0.21
2.607AsnGlu: 2.607 ± 0.293
2.487AsnPhe: 2.487 ± 0.246
4.615AsnGly: 4.615 ± 0.406
0.98AsnHis: 0.98 ± 0.136
3.109AsnIle: 3.109 ± 0.306
3.181AsnLys: 3.181 ± 0.274
4.137AsnLeu: 4.137 ± 0.324
1.267AsnMet: 1.267 ± 0.179
2.941AsnAsn: 2.941 ± 0.326
2.32AsnPro: 2.32 ± 0.226
1.483AsnGln: 1.483 ± 0.183
2.224AsnArg: 2.224 ± 0.233
2.654AsnSer: 2.654 ± 0.236
3.085AsnThr: 3.085 ± 0.301
2.726AsnVal: 2.726 ± 0.231
0.957AsnTrp: 0.957 ± 0.142
1.602AsnTyr: 1.602 ± 0.225
0.0AsnXaa: 0.0 ± 0.0
Pro
2.272ProAla: 2.272 ± 0.228
0.574ProCys: 0.574 ± 0.118
3.324ProAsp: 3.324 ± 0.308
3.874ProGlu: 3.874 ± 0.291
1.985ProPhe: 1.985 ± 0.217
2.439ProGly: 2.439 ± 0.267
0.717ProHis: 0.717 ± 0.13
1.698ProIle: 1.698 ± 0.188
2.391ProLys: 2.391 ± 0.282
2.822ProLeu: 2.822 ± 0.259
0.861ProMet: 0.861 ± 0.14
1.387ProAsn: 1.387 ± 0.193
1.028ProPro: 1.028 ± 0.17
1.315ProGln: 1.315 ± 0.185
1.578ProArg: 1.578 ± 0.22
2.798ProSer: 2.798 ± 0.257
2.152ProThr: 2.152 ± 0.216
3.133ProVal: 3.133 ± 0.274
0.407ProTrp: 0.407 ± 0.111
1.363ProTyr: 1.363 ± 0.209
0.0ProXaa: 0.0 ± 0.0
Gln
2.535GlnAla: 2.535 ± 0.208
0.407GlnCys: 0.407 ± 0.093
1.794GlnAsp: 1.794 ± 0.183
2.511GlnGlu: 2.511 ± 0.276
1.387GlnPhe: 1.387 ± 0.159
1.77GlnGly: 1.77 ± 0.186
0.502GlnHis: 0.502 ± 0.108
1.817GlnIle: 1.817 ± 0.214
1.913GlnLys: 1.913 ± 0.211
2.702GlnLeu: 2.702 ± 0.253
1.076GlnMet: 1.076 ± 0.166
1.22GlnAsn: 1.22 ± 0.154
1.387GlnPro: 1.387 ± 0.205
1.794GlnGln: 1.794 ± 0.253
1.507GlnArg: 1.507 ± 0.151
1.531GlnSer: 1.531 ± 0.186
1.698GlnThr: 1.698 ± 0.249
2.2GlnVal: 2.2 ± 0.273
0.765GlnTrp: 0.765 ± 0.137
1.411GlnTyr: 1.411 ± 0.209
0.0GlnXaa: 0.0 ± 0.0
Arg
2.798ArgAla: 2.798 ± 0.247
0.933ArgCys: 0.933 ± 0.147
2.989ArgAsp: 2.989 ± 0.271
2.846ArgGlu: 2.846 ± 0.295
1.746ArgPhe: 1.746 ± 0.213
2.941ArgGly: 2.941 ± 0.246
0.813ArgHis: 0.813 ± 0.138
2.87ArgIle: 2.87 ± 0.245
3.396ArgLys: 3.396 ± 0.294
3.659ArgLeu: 3.659 ± 0.324
1.554ArgMet: 1.554 ± 0.18
2.152ArgAsn: 2.152 ± 0.224
1.483ArgPro: 1.483 ± 0.207
1.578ArgGln: 1.578 ± 0.157
2.559ArgArg: 2.559 ± 0.276
2.583ArgSer: 2.583 ± 0.241
2.463ArgThr: 2.463 ± 0.261
3.515ArgVal: 3.515 ± 0.278
0.933ArgTrp: 0.933 ± 0.152
1.985ArgTyr: 1.985 ± 0.187
0.0ArgXaa: 0.0 ± 0.0
Ser
3.922SerAla: 3.922 ± 0.342
1.028SerCys: 1.028 ± 0.154
3.324SerAsp: 3.324 ± 0.313
3.3SerGlu: 3.3 ± 0.292
2.487SerPhe: 2.487 ± 0.289
4.998SerGly: 4.998 ± 0.412
1.22SerHis: 1.22 ± 0.167
3.205SerIle: 3.205 ± 0.282
3.994SerLys: 3.994 ± 0.372
3.515SerLeu: 3.515 ± 0.33
1.387SerMet: 1.387 ± 0.174
2.965SerAsn: 2.965 ± 0.238
2.296SerPro: 2.296 ± 0.2
1.554SerGln: 1.554 ± 0.184
2.583SerArg: 2.583 ± 0.213
2.989SerSer: 2.989 ± 0.339
3.515SerThr: 3.515 ± 0.346
3.778SerVal: 3.778 ± 0.291
0.813SerTrp: 0.813 ± 0.133
2.009SerTyr: 2.009 ± 0.246
0.0SerXaa: 0.0 ± 0.0
Thr
3.994ThrAla: 3.994 ± 0.402
0.909ThrCys: 0.909 ± 0.14
3.205ThrAsp: 3.205 ± 0.288
3.563ThrGlu: 3.563 ± 0.357
2.75ThrPhe: 2.75 ± 0.219
4.496ThrGly: 4.496 ± 0.37
1.267ThrHis: 1.267 ± 0.165
3.898ThrIle: 3.898 ± 0.329
3.396ThrLys: 3.396 ± 0.316
4.639ThrLeu: 4.639 ± 0.392
1.148ThrMet: 1.148 ± 0.169
2.344ThrAsn: 2.344 ± 0.266
2.463ThrPro: 2.463 ± 0.219
1.817ThrGln: 1.817 ± 0.19
2.152ThrArg: 2.152 ± 0.237
3.348ThrSer: 3.348 ± 0.381
3.635ThrThr: 3.635 ± 0.372
4.496ThrVal: 4.496 ± 0.396
1.076ThrTrp: 1.076 ± 0.173
2.248ThrTyr: 2.248 ± 0.3
0.0ThrXaa: 0.0 ± 0.0
Val
4.807ValAla: 4.807 ± 0.337
1.172ValCys: 1.172 ± 0.144
5.955ValAsp: 5.955 ± 0.419
4.759ValGlu: 4.759 ± 0.37
3.324ValPhe: 3.324 ± 0.276
4.687ValGly: 4.687 ± 0.31
1.339ValHis: 1.339 ± 0.178
4.281ValIle: 4.281 ± 0.347
4.639ValLys: 4.639 ± 0.329
4.4ValLeu: 4.4 ± 0.326
1.913ValMet: 1.913 ± 0.202
3.491ValAsn: 3.491 ± 0.328
2.081ValPro: 2.081 ± 0.221
1.937ValGln: 1.937 ± 0.207
3.085ValArg: 3.085 ± 0.245
3.755ValSer: 3.755 ± 0.331
4.209ValThr: 4.209 ± 0.381
6.433ValVal: 6.433 ± 0.451
1.411ValTrp: 1.411 ± 0.172
2.87ValTyr: 2.87 ± 0.239
0.0ValXaa: 0.0 ± 0.0
Trp
1.004TrpAla: 1.004 ± 0.188
0.407TrpCys: 0.407 ± 0.112
1.124TrpAsp: 1.124 ± 0.144
1.77TrpGlu: 1.77 ± 0.198
0.789TrpPhe: 0.789 ± 0.152
1.004TrpGly: 1.004 ± 0.168
0.335TrpHis: 0.335 ± 0.089
1.267TrpIle: 1.267 ± 0.178
1.267TrpLys: 1.267 ± 0.212
1.435TrpLeu: 1.435 ± 0.207
0.454TrpMet: 0.454 ± 0.101
0.909TrpAsn: 0.909 ± 0.15
0.526TrpPro: 0.526 ± 0.102
0.526TrpGln: 0.526 ± 0.114
0.861TrpArg: 0.861 ± 0.142
0.957TrpSer: 0.957 ± 0.125
1.1TrpThr: 1.1 ± 0.136
1.028TrpVal: 1.028 ± 0.133
0.478TrpTrp: 0.478 ± 0.113
0.933TrpTyr: 0.933 ± 0.132
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.75TyrAla: 2.75 ± 0.21
0.741TyrCys: 0.741 ± 0.144
3.061TyrAsp: 3.061 ± 0.299
2.2TyrGlu: 2.2 ± 0.244
1.602TyrPhe: 1.602 ± 0.171
2.965TyrGly: 2.965 ± 0.304
1.028TyrHis: 1.028 ± 0.137
1.626TyrIle: 1.626 ± 0.206
2.511TyrLys: 2.511 ± 0.245
3.826TyrLeu: 3.826 ± 0.336
0.885TyrMet: 0.885 ± 0.157
2.368TyrAsn: 2.368 ± 0.245
2.2TyrPro: 2.2 ± 0.219
1.531TyrGln: 1.531 ± 0.185
2.32TyrArg: 2.32 ± 0.256
2.511TyrSer: 2.511 ± 0.276
2.798TyrThr: 2.798 ± 0.232
2.368TyrVal: 2.368 ± 0.233
0.885TyrTrp: 0.885 ± 0.163
2.224TyrTyr: 2.224 ± 0.25
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 218 proteins (41817 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski