Amino acid dipepetide frequency for Tenacibaculum phage pT24

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.36AlaAla: 0.36 ± 0.129
0.305AlaCys: 0.305 ± 0.066
1.732AlaAsp: 1.732 ± 0.158
2.411AlaGlu: 2.411 ± 0.182
1.94AlaPhe: 1.94 ± 0.171
1.676AlaGly: 1.676 ± 0.327
0.651AlaHis: 0.651 ± 0.092
3.492AlaIle: 3.492 ± 0.254
3.672AlaLys: 3.672 ± 0.273
4.046AlaLeu: 4.046 ± 0.377
1.192AlaMet: 1.192 ± 0.139
2.425AlaAsn: 2.425 ± 0.322
0.97AlaPro: 0.97 ± 0.136
1.178AlaGln: 1.178 ± 0.136
1.469AlaArg: 1.469 ± 0.138
2.813AlaSer: 2.813 ± 0.282
2.355AlaThr: 2.355 ± 0.249
1.857AlaVal: 1.857 ± 0.164
0.319AlaTrp: 0.319 ± 0.068
1.773AlaTyr: 1.773 ± 0.158
0.0AlaXaa: 0.0 ± 0.0
Cys
0.346CysAla: 0.346 ± 0.083
0.097CysCys: 0.097 ± 0.043
0.61CysAsp: 0.61 ± 0.097
0.942CysGlu: 0.942 ± 0.141
0.499CysPhe: 0.499 ± 0.103
0.61CysGly: 0.61 ± 0.1
0.249CysHis: 0.249 ± 0.056
0.762CysIle: 0.762 ± 0.091
0.776CysLys: 0.776 ± 0.113
0.61CysLeu: 0.61 ± 0.102
0.236CysMet: 0.236 ± 0.056
0.485CysAsn: 0.485 ± 0.076
0.222CysPro: 0.222 ± 0.058
0.194CysGln: 0.194 ± 0.057
0.443CysArg: 0.443 ± 0.094
0.637CysSer: 0.637 ± 0.094
0.402CysThr: 0.402 ± 0.085
0.596CysVal: 0.596 ± 0.104
0.042CysTrp: 0.042 ± 0.026
0.457CysTyr: 0.457 ± 0.082
0.0CysXaa: 0.0 ± 0.0
Asp
2.993AspAla: 2.993 ± 0.278
0.804AspCys: 0.804 ± 0.106
4.877AspAsp: 4.877 ± 0.375
5.598AspGlu: 5.598 ± 0.388
4.738AspPhe: 4.738 ± 0.241
4.434AspGly: 4.434 ± 0.26
0.43AspHis: 0.43 ± 0.064
6.373AspIle: 6.373 ± 0.36
5.417AspLys: 5.417 ± 0.296
6.47AspLeu: 6.47 ± 0.335
1.704AspMet: 1.704 ± 0.166
4.17AspAsn: 4.17 ± 0.269
1.081AspPro: 1.081 ± 0.118
0.623AspGln: 0.623 ± 0.092
1.635AspArg: 1.635 ± 0.169
4.752AspSer: 4.752 ± 0.271
3.408AspThr: 3.408 ± 0.205
5.113AspVal: 5.113 ± 0.29
0.776AspTrp: 0.776 ± 0.105
3.713AspTyr: 3.713 ± 0.229
0.0AspXaa: 0.0 ± 0.0
Glu
2.494GluAla: 2.494 ± 0.197
0.887GluCys: 0.887 ± 0.115
4.06GluAsp: 4.06 ± 0.23
5.168GluGlu: 5.168 ± 0.643
4.032GluPhe: 4.032 ± 0.242
2.993GluGly: 2.993 ± 0.276
1.316GluHis: 1.316 ± 0.153
6.581GluIle: 6.581 ± 0.281
6.36GluLys: 6.36 ± 0.531
7.025GluLeu: 7.025 ± 0.444
2.217GluMet: 2.217 ± 0.201
5.653GluAsn: 5.653 ± 0.325
1.538GluPro: 1.538 ± 0.132
2.342GluGln: 2.342 ± 0.166
3.284GluArg: 3.284 ± 0.238
5.833GluSer: 5.833 ± 0.463
4.06GluThr: 4.06 ± 0.221
4.392GluVal: 4.392 ± 0.252
0.693GluTrp: 0.693 ± 0.108
4.073GluTyr: 4.073 ± 0.241
0.0GluXaa: 0.0 ± 0.0
Phe
1.801PheAla: 1.801 ± 0.153
0.499PheCys: 0.499 ± 0.085
4.351PheAsp: 4.351 ± 0.303
3.935PheGlu: 3.935 ± 0.261
1.676PhePhe: 1.676 ± 0.168
2.979PheGly: 2.979 ± 0.244
0.804PheHis: 0.804 ± 0.102
3.893PheIle: 3.893 ± 0.249
5.279PheLys: 5.279 ± 0.263
3.658PheLeu: 3.658 ± 0.222
1.386PheMet: 1.386 ± 0.162
3.963PheAsn: 3.963 ± 0.199
1.302PhePro: 1.302 ± 0.116
1.33PheGln: 1.33 ± 0.121
2.148PheArg: 2.148 ± 0.167
3.782PheSer: 3.782 ± 0.257
2.632PheThr: 2.632 ± 0.206
3.062PheVal: 3.062 ± 0.223
0.36PheTrp: 0.36 ± 0.073
2.314PheTyr: 2.314 ± 0.208
0.0PheXaa: 0.0 ± 0.0
Gly
2.217GlyAla: 2.217 ± 0.267
0.485GlyCys: 0.485 ± 0.087
4.323GlyAsp: 4.323 ± 0.287
3.949GlyGlu: 3.949 ± 0.234
3.284GlyPhe: 3.284 ± 0.197
2.896GlyGly: 2.896 ± 0.346
0.817GlyHis: 0.817 ± 0.162
4.032GlyIle: 4.032 ± 0.463
4.337GlyLys: 4.337 ± 0.308
4.96GlyLeu: 4.96 ± 0.366
1.205GlyMet: 1.205 ± 0.13
3.492GlyAsn: 3.492 ± 0.298
0.014GlyPro: 0.014 ± 0.013
1.372GlyGln: 1.372 ± 0.211
1.815GlyArg: 1.815 ± 0.164
3.533GlySer: 3.533 ± 0.57
3.201GlyThr: 3.201 ± 0.427
4.17GlyVal: 4.17 ± 0.267
0.54GlyTrp: 0.54 ± 0.104
2.619GlyTyr: 2.619 ± 0.372
0.0GlyXaa: 0.0 ± 0.0
His
0.679HisAla: 0.679 ± 0.109
0.208HisCys: 0.208 ± 0.062
1.261HisAsp: 1.261 ± 0.126
1.178HisGlu: 1.178 ± 0.113
0.637HisPhe: 0.637 ± 0.096
0.97HisGly: 0.97 ± 0.127
0.346HisHis: 0.346 ± 0.067
1.579HisIle: 1.579 ± 0.192
1.51HisLys: 1.51 ± 0.159
1.496HisLeu: 1.496 ± 0.125
0.222HisMet: 0.222 ± 0.06
1.219HisAsn: 1.219 ± 0.131
0.513HisPro: 0.513 ± 0.072
0.36HisGln: 0.36 ± 0.063
0.623HisArg: 0.623 ± 0.088
1.053HisSer: 1.053 ± 0.12
0.901HisThr: 0.901 ± 0.108
0.651HisVal: 0.651 ± 0.1
0.152HisTrp: 0.152 ± 0.043
0.859HisTyr: 0.859 ± 0.115
0.0HisXaa: 0.0 ± 0.0
Ile
2.563IleAla: 2.563 ± 0.234
0.762IleCys: 0.762 ± 0.104
6.9IleAsp: 6.9 ± 0.346
6.941IleGlu: 6.941 ± 0.275
3.256IlePhe: 3.256 ± 0.244
4.697IleGly: 4.697 ± 0.838
1.219IleHis: 1.219 ± 0.125
5.847IleIle: 5.847 ± 0.31
7.731IleLys: 7.731 ± 0.453
6.401IleLeu: 6.401 ± 0.295
2.092IleMet: 2.092 ± 0.192
6.346IleAsn: 6.346 ± 0.319
2.549IlePro: 2.549 ± 0.212
2.439IleGln: 2.439 ± 0.195
2.563IleArg: 2.563 ± 0.185
6.11IleSer: 6.11 ± 0.335
4.503IleThr: 4.503 ± 0.314
4.558IleVal: 4.558 ± 0.251
0.554IleTrp: 0.554 ± 0.089
3.256IleTyr: 3.256 ± 0.239
0.0IleXaa: 0.0 ± 0.0
Lys
3.727LysAla: 3.727 ± 0.277
0.817LysCys: 0.817 ± 0.132
6.498LysAsp: 6.498 ± 0.399
7.108LysGlu: 7.108 ± 0.476
4.351LysPhe: 4.351 ± 0.264
4.586LysGly: 4.586 ± 0.25
1.843LysHis: 1.843 ± 0.156
7.163LysIle: 7.163 ± 0.376
8.008LysLys: 8.008 ± 0.549
7.551LysLeu: 7.551 ± 0.371
2.979LysMet: 2.979 ± 0.268
6.457LysAsn: 6.457 ± 0.339
2.064LysPro: 2.064 ± 0.18
2.84LysGln: 2.84 ± 0.209
3.104LysArg: 3.104 ± 0.235
6.858LysSer: 6.858 ± 0.348
5.514LysThr: 5.514 ± 0.329
5.791LysVal: 5.791 ± 0.348
0.596LysTrp: 0.596 ± 0.087
4.42LysTyr: 4.42 ± 0.258
0.0LysXaa: 0.0 ± 0.0
Leu
2.868LeuAla: 2.868 ± 0.293
0.831LeuCys: 0.831 ± 0.119
5.598LeuAsp: 5.598 ± 0.278
6.651LeuGlu: 6.651 ± 0.373
4.073LeuPhe: 4.073 ± 0.248
4.822LeuGly: 4.822 ± 0.358
1.247LeuHis: 1.247 ± 0.132
5.875LeuIle: 5.875 ± 0.299
8.563LeuLys: 8.563 ± 0.47
5.778LeuLeu: 5.778 ± 0.332
2.245LeuMet: 2.245 ± 0.182
6.318LeuAsn: 6.318 ± 0.329
2.549LeuPro: 2.549 ± 0.209
2.632LeuGln: 2.632 ± 0.2
3.436LeuArg: 3.436 ± 0.199
6.346LeuSer: 6.346 ± 0.395
5.362LeuThr: 5.362 ± 0.261
4.669LeuVal: 4.669 ± 0.245
0.637LeuTrp: 0.637 ± 0.106
3.519LeuTyr: 3.519 ± 0.227
0.0LeuXaa: 0.0 ± 0.0
Met
1.122MetAla: 1.122 ± 0.132
0.305MetCys: 0.305 ± 0.07
1.441MetAsp: 1.441 ± 0.16
1.912MetGlu: 1.912 ± 0.189
1.067MetPhe: 1.067 ± 0.143
1.579MetGly: 1.579 ± 0.15
0.402MetHis: 0.402 ± 0.078
1.995MetIle: 1.995 ± 0.156
3.034MetLys: 3.034 ± 0.277
1.676MetLeu: 1.676 ± 0.16
0.776MetMet: 0.776 ± 0.124
1.912MetAsn: 1.912 ± 0.143
0.651MetPro: 0.651 ± 0.109
0.734MetGln: 0.734 ± 0.106
1.081MetArg: 1.081 ± 0.143
2.009MetSer: 2.009 ± 0.163
1.178MetThr: 1.178 ± 0.123
1.607MetVal: 1.607 ± 0.171
0.139MetTrp: 0.139 ± 0.044
1.219MetTyr: 1.219 ± 0.125
0.0MetXaa: 0.0 ± 0.0
Asn
2.799AsnAla: 2.799 ± 0.299
0.61AsnCys: 0.61 ± 0.094
5.307AsnAsp: 5.307 ± 0.325
5.694AsnGlu: 5.694 ± 0.308
3.658AsnPhe: 3.658 ± 0.236
4.697AsnGly: 4.697 ± 0.241
1.289AsnHis: 1.289 ± 0.116
6.415AsnIle: 6.415 ± 0.306
6.069AsnLys: 6.069 ± 0.294
5.113AsnLeu: 5.113 ± 0.253
1.413AsnMet: 1.413 ± 0.159
5.542AsnAsn: 5.542 ± 0.529
2.12AsnPro: 2.12 ± 0.18
1.884AsnGln: 1.884 ± 0.151
2.854AsnArg: 2.854 ± 0.198
4.988AsnSer: 4.988 ± 0.363
4.697AsnThr: 4.697 ± 0.731
4.503AsnVal: 4.503 ± 0.305
0.817AsnTrp: 0.817 ± 0.12
3.173AsnTyr: 3.173 ± 0.234
0.0AsnXaa: 0.0 ± 0.0
Pro
0.568ProAla: 0.568 ± 0.086
0.18ProCys: 0.18 ± 0.049
1.316ProAsp: 1.316 ± 0.134
1.607ProGlu: 1.607 ± 0.15
1.427ProPhe: 1.427 ± 0.125
0.042ProGly: 0.042 ± 0.022
0.499ProHis: 0.499 ± 0.093
1.746ProIle: 1.746 ± 0.133
2.868ProLys: 2.868 ± 0.212
2.203ProLeu: 2.203 ± 0.159
0.665ProMet: 0.665 ± 0.093
1.954ProAsn: 1.954 ± 0.158
0.554ProPro: 0.554 ± 0.117
0.568ProGln: 0.568 ± 0.1
0.859ProArg: 0.859 ± 0.106
2.148ProSer: 2.148 ± 0.174
1.663ProThr: 1.663 ± 0.177
1.455ProVal: 1.455 ± 0.142
0.236ProTrp: 0.236 ± 0.057
1.247ProTyr: 1.247 ± 0.13
0.0ProXaa: 0.0 ± 0.0
Gln
1.15GlnAla: 1.15 ± 0.132
0.139GlnCys: 0.139 ± 0.044
1.524GlnAsp: 1.524 ± 0.167
1.51GlnGlu: 1.51 ± 0.119
1.289GlnPhe: 1.289 ± 0.114
1.316GlnGly: 1.316 ± 0.125
0.665GlnHis: 0.665 ± 0.088
2.189GlnIle: 2.189 ± 0.161
2.439GlnLys: 2.439 ± 0.185
2.175GlnLeu: 2.175 ± 0.17
0.707GlnMet: 0.707 ± 0.091
2.134GlnAsn: 2.134 ± 0.166
0.804GlnPro: 0.804 ± 0.114
1.122GlnGln: 1.122 ± 0.137
1.053GlnArg: 1.053 ± 0.119
1.801GlnSer: 1.801 ± 0.25
1.483GlnThr: 1.483 ± 0.16
1.69GlnVal: 1.69 ± 0.149
0.194GlnTrp: 0.194 ± 0.055
1.496GlnTyr: 1.496 ± 0.147
0.0GlnXaa: 0.0 ± 0.0
Arg
1.316ArgAla: 1.316 ± 0.13
0.236ArgCys: 0.236 ± 0.06
1.704ArgAsp: 1.704 ± 0.131
2.425ArgGlu: 2.425 ± 0.186
2.148ArgPhe: 2.148 ± 0.186
1.607ArgGly: 1.607 ± 0.14
0.72ArgHis: 0.72 ± 0.096
3.076ArgIle: 3.076 ± 0.205
3.769ArgLys: 3.769 ± 0.28
3.45ArgLeu: 3.45 ± 0.234
1.122ArgMet: 1.122 ± 0.138
2.826ArgAsn: 2.826 ± 0.159
0.956ArgPro: 0.956 ± 0.124
0.97ArgGln: 0.97 ± 0.143
1.372ArgArg: 1.372 ± 0.179
1.483ArgSer: 1.483 ± 0.146
2.064ArgThr: 2.064 ± 0.154
2.328ArgVal: 2.328 ± 0.149
0.277ArgTrp: 0.277 ± 0.061
1.995ArgTyr: 1.995 ± 0.166
0.0ArgXaa: 0.0 ± 0.0
Ser
2.84SerAla: 2.84 ± 0.294
0.651SerCys: 0.651 ± 0.117
4.794SerAsp: 4.794 ± 0.297
5.085SerGlu: 5.085 ± 0.29
3.63SerPhe: 3.63 ± 0.245
3.81SerGly: 3.81 ± 0.499
1.122SerHis: 1.122 ± 0.136
6.581SerIle: 6.581 ± 0.303
6.941SerLys: 6.941 ± 0.358
6.318SerLeu: 6.318 ± 0.301
2.009SerMet: 2.009 ± 0.186
5.722SerAsn: 5.722 ± 0.542
1.483SerPro: 1.483 ± 0.134
1.69SerGln: 1.69 ± 0.173
2.245SerArg: 2.245 ± 0.167
4.919SerSer: 4.919 ± 0.42
3.866SerThr: 3.866 ± 0.281
4.406SerVal: 4.406 ± 0.257
0.651SerTrp: 0.651 ± 0.091
3.159SerTyr: 3.159 ± 0.217
0.0SerXaa: 0.0 ± 0.0
Thr
2.106ThrAla: 2.106 ± 0.307
0.346ThrCys: 0.346 ± 0.067
3.658ThrAsp: 3.658 ± 0.226
3.81ThrGlu: 3.81 ± 0.193
3.131ThrPhe: 3.131 ± 0.212
3.852ThrGly: 3.852 ± 0.604
1.122ThrHis: 1.122 ± 0.127
4.586ThrIle: 4.586 ± 0.214
5.182ThrLys: 5.182 ± 0.307
4.891ThrLeu: 4.891 ± 0.266
1.15ThrMet: 1.15 ± 0.132
4.115ThrAsn: 4.115 ± 0.377
1.732ThrPro: 1.732 ± 0.137
1.663ThrGln: 1.663 ± 0.148
1.912ThrArg: 1.912 ± 0.138
4.406ThrSer: 4.406 ± 0.233
3.575ThrThr: 3.575 ± 0.305
3.173ThrVal: 3.173 ± 0.246
0.499ThrTrp: 0.499 ± 0.097
2.66ThrTyr: 2.66 ± 0.189
0.0ThrXaa: 0.0 ± 0.0
Val
2.522ValAla: 2.522 ± 0.208
0.471ValCys: 0.471 ± 0.101
4.766ValAsp: 4.766 ± 0.286
5.029ValGlu: 5.029 ± 0.294
3.311ValPhe: 3.311 ± 0.229
3.02ValGly: 3.02 ± 0.245
0.707ValHis: 0.707 ± 0.096
4.794ValIle: 4.794 ± 0.21
5.196ValLys: 5.196 ± 0.305
4.932ValLeu: 4.932 ± 0.292
1.275ValMet: 1.275 ± 0.148
4.475ValAsn: 4.475 ± 0.318
1.566ValPro: 1.566 ± 0.16
1.469ValGln: 1.469 ± 0.141
2.023ValArg: 2.023 ± 0.172
4.503ValSer: 4.503 ± 0.224
3.644ValThr: 3.644 ± 0.284
3.367ValVal: 3.367 ± 0.201
0.291ValTrp: 0.291 ± 0.061
2.937ValTyr: 2.937 ± 0.214
0.0ValXaa: 0.0 ± 0.0
Trp
0.277TrpAla: 0.277 ± 0.053
0.152TrpCys: 0.152 ± 0.049
0.61TrpAsp: 0.61 ± 0.085
0.582TrpGlu: 0.582 ± 0.082
0.582TrpPhe: 0.582 ± 0.088
0.402TrpGly: 0.402 ± 0.06
0.194TrpHis: 0.194 ± 0.052
0.526TrpIle: 0.526 ± 0.083
0.748TrpLys: 0.748 ± 0.091
0.637TrpLeu: 0.637 ± 0.112
0.263TrpMet: 0.263 ± 0.06
0.831TrpAsn: 0.831 ± 0.088
0.0TrpPro: 0.0 ± 0.0
0.139TrpGln: 0.139 ± 0.053
0.277TrpArg: 0.277 ± 0.058
0.568TrpSer: 0.568 ± 0.093
0.443TrpThr: 0.443 ± 0.08
0.554TrpVal: 0.554 ± 0.099
0.166TrpTrp: 0.166 ± 0.046
0.485TrpTyr: 0.485 ± 0.072
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.898TyrAla: 1.898 ± 0.178
0.43TyrCys: 0.43 ± 0.085
3.782TyrAsp: 3.782 ± 0.187
3.464TyrGlu: 3.464 ± 0.227
2.425TyrPhe: 2.425 ± 0.196
2.272TyrGly: 2.272 ± 0.198
0.776TyrHis: 0.776 ± 0.101
3.782TyrIle: 3.782 ± 0.297
4.378TyrLys: 4.378 ± 0.222
4.628TyrLeu: 4.628 ± 0.242
0.97TyrMet: 0.97 ± 0.115
3.602TyrAsn: 3.602 ± 0.191
1.136TyrPro: 1.136 ± 0.132
1.33TyrGln: 1.33 ± 0.115
1.649TyrArg: 1.649 ± 0.161
3.381TyrSer: 3.381 ± 0.233
2.619TyrThr: 2.619 ± 0.215
2.355TyrVal: 2.355 ± 0.164
0.499TyrTrp: 0.499 ± 0.099
2.023TyrTyr: 2.023 ± 0.184
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 297 proteins (72176 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski