Amino acid dipepetide frequency for Vibrio phage nt-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.03AlaAla: 5.03 ± 0.311
0.698AlaCys: 0.698 ± 0.102
4.543AlaAsp: 4.543 ± 0.323
5.175AlaGlu: 5.175 ± 0.348
2.818AlaPhe: 2.818 ± 0.194
4.477AlaGly: 4.477 ± 0.332
1.528AlaHis: 1.528 ± 0.141
5.136AlaIle: 5.136 ± 0.242
4.359AlaLys: 4.359 ± 0.269
5.425AlaLeu: 5.425 ± 0.309
1.87AlaMet: 1.87 ± 0.194
3.424AlaAsn: 3.424 ± 0.237
2.015AlaPro: 2.015 ± 0.18
2.568AlaGln: 2.568 ± 0.207
3.371AlaArg: 3.371 ± 0.203
4.253AlaSer: 4.253 ± 0.26
4.095AlaThr: 4.095 ± 0.282
4.332AlaVal: 4.332 ± 0.233
0.843AlaTrp: 0.843 ± 0.086
2.884AlaTyr: 2.884 ± 0.179
0.0AlaXaa: 0.0 ± 0.0
Cys
0.737CysAla: 0.737 ± 0.095
0.237CysCys: 0.237 ± 0.053
0.935CysAsp: 0.935 ± 0.107
0.895CysGlu: 0.895 ± 0.117
0.514CysPhe: 0.514 ± 0.081
0.961CysGly: 0.961 ± 0.118
0.408CysHis: 0.408 ± 0.084
0.79CysIle: 0.79 ± 0.113
0.935CysLys: 0.935 ± 0.114
0.79CysLeu: 0.79 ± 0.111
0.29CysMet: 0.29 ± 0.06
0.724CysAsn: 0.724 ± 0.096
0.448CysPro: 0.448 ± 0.08
0.448CysGln: 0.448 ± 0.077
0.579CysArg: 0.579 ± 0.093
0.698CysSer: 0.698 ± 0.103
0.711CysThr: 0.711 ± 0.095
0.803CysVal: 0.803 ± 0.107
0.184CysTrp: 0.184 ± 0.051
0.421CysTyr: 0.421 ± 0.074
0.0CysXaa: 0.0 ± 0.0
Asp
4.438AspAla: 4.438 ± 0.263
0.948AspCys: 0.948 ± 0.128
5.083AspAsp: 5.083 ± 0.294
5.873AspGlu: 5.873 ± 0.298
3.411AspPhe: 3.411 ± 0.186
5.109AspGly: 5.109 ± 0.273
1.198AspHis: 1.198 ± 0.137
4.938AspIle: 4.938 ± 0.276
4.201AspLys: 4.201 ± 0.239
4.807AspLeu: 4.807 ± 0.285
2.002AspMet: 2.002 ± 0.182
3.49AspAsn: 3.49 ± 0.205
2.502AspPro: 2.502 ± 0.187
1.751AspGln: 1.751 ± 0.154
2.963AspArg: 2.963 ± 0.202
3.832AspSer: 3.832 ± 0.212
3.806AspThr: 3.806 ± 0.264
5.07AspVal: 5.07 ± 0.278
1.343AspTrp: 1.343 ± 0.116
3.358AspTyr: 3.358 ± 0.234
0.0AspXaa: 0.0 ± 0.0
Glu
5.557GluAla: 5.557 ± 0.277
0.843GluCys: 0.843 ± 0.116
4.306GluAsp: 4.306 ± 0.235
5.281GluGlu: 5.281 ± 0.308
3.937GluPhe: 3.937 ± 0.242
2.95GluGly: 2.95 ± 0.209
2.067GluHis: 2.067 ± 0.176
5.518GluIle: 5.518 ± 0.3
4.596GluLys: 4.596 ± 0.295
7.401GluLeu: 7.401 ± 0.339
2.305GluMet: 2.305 ± 0.209
3.384GluAsn: 3.384 ± 0.233
1.528GluPro: 1.528 ± 0.151
2.739GluGln: 2.739 ± 0.224
4.253GluArg: 4.253 ± 0.299
4.306GluSer: 4.306 ± 0.282
4.359GluThr: 4.359 ± 0.255
4.886GluVal: 4.886 ± 0.246
1.053GluTrp: 1.053 ± 0.118
3.358GluTyr: 3.358 ± 0.215
0.0GluXaa: 0.0 ± 0.0
Phe
2.937PheAla: 2.937 ± 0.18
0.461PheCys: 0.461 ± 0.076
3.977PheAsp: 3.977 ± 0.268
3.595PheGlu: 3.595 ± 0.201
1.475PhePhe: 1.475 ± 0.154
2.937PheGly: 2.937 ± 0.193
1.027PheHis: 1.027 ± 0.109
2.976PheIle: 2.976 ± 0.209
2.937PheLys: 2.937 ± 0.197
2.607PheLeu: 2.607 ± 0.193
1.251PheMet: 1.251 ± 0.133
2.305PheAsn: 2.305 ± 0.157
1.146PhePro: 1.146 ± 0.157
1.093PheGln: 1.093 ± 0.123
2.054PheArg: 2.054 ± 0.156
2.765PheSer: 2.765 ± 0.19
2.779PheThr: 2.779 ± 0.203
2.976PheVal: 2.976 ± 0.258
0.658PheTrp: 0.658 ± 0.1
1.514PheTyr: 1.514 ± 0.156
0.0PheXaa: 0.0 ± 0.0
Gly
3.753GlyAla: 3.753 ± 0.234
1.027GlyCys: 1.027 ± 0.114
4.622GlyAsp: 4.622 ± 0.324
4.491GlyGlu: 4.491 ± 0.253
2.594GlyPhe: 2.594 ± 0.178
4.003GlyGly: 4.003 ± 0.334
1.119GlyHis: 1.119 ± 0.14
3.819GlyIle: 3.819 ± 0.241
4.214GlyLys: 4.214 ± 0.263
4.332GlyLeu: 4.332 ± 0.241
1.462GlyMet: 1.462 ± 0.145
3.134GlyAsn: 3.134 ± 0.265
0.724GlyPro: 0.724 ± 0.1
1.804GlyGln: 1.804 ± 0.164
2.568GlyArg: 2.568 ± 0.207
4.056GlySer: 4.056 ± 0.358
4.201GlyThr: 4.201 ± 0.317
4.28GlyVal: 4.28 ± 0.277
1.001GlyTrp: 1.001 ± 0.112
2.805GlyTyr: 2.805 ± 0.195
0.0GlyXaa: 0.0 ± 0.0
His
1.672HisAla: 1.672 ± 0.138
0.303HisCys: 0.303 ± 0.062
1.804HisAsp: 1.804 ± 0.163
1.751HisGlu: 1.751 ± 0.149
1.027HisPhe: 1.027 ± 0.116
1.475HisGly: 1.475 ± 0.125
0.593HisHis: 0.593 ± 0.089
1.646HisIle: 1.646 ± 0.17
1.383HisLys: 1.383 ± 0.128
1.488HisLeu: 1.488 ± 0.134
0.527HisMet: 0.527 ± 0.087
1.014HisAsn: 1.014 ± 0.113
1.106HisPro: 1.106 ± 0.14
0.645HisGln: 0.645 ± 0.085
1.014HisArg: 1.014 ± 0.122
1.264HisSer: 1.264 ± 0.139
1.567HisThr: 1.567 ± 0.19
1.83HisVal: 1.83 ± 0.19
0.369HisTrp: 0.369 ± 0.066
1.04HisTyr: 1.04 ± 0.125
0.0HisXaa: 0.0 ± 0.0
Ile
5.136IleAla: 5.136 ± 0.288
0.724IleCys: 0.724 ± 0.103
5.676IleAsp: 5.676 ± 0.228
6.018IleGlu: 6.018 ± 0.292
2.344IlePhe: 2.344 ± 0.153
3.569IleGly: 3.569 ± 0.228
1.449IleHis: 1.449 ± 0.153
3.753IleIle: 3.753 ± 0.241
4.53IleLys: 4.53 ± 0.298
4.332IleLeu: 4.332 ± 0.275
1.659IleMet: 1.659 ± 0.158
3.556IleAsn: 3.556 ± 0.208
2.252IlePro: 2.252 ± 0.176
2.12IleGln: 2.12 ± 0.175
3.147IleArg: 3.147 ± 0.191
4.122IleSer: 4.122 ± 0.256
4.267IleThr: 4.267 ± 0.229
4.78IleVal: 4.78 ± 0.238
0.606IleTrp: 0.606 ± 0.079
2.146IleTyr: 2.146 ± 0.173
0.0IleXaa: 0.0 ± 0.0
Lys
4.451LysAla: 4.451 ± 0.308
0.685LysCys: 0.685 ± 0.099
3.766LysAsp: 3.766 ± 0.23
4.846LysGlu: 4.846 ± 0.306
3.213LysPhe: 3.213 ± 0.207
3.266LysGly: 3.266 ± 0.217
2.146LysHis: 2.146 ± 0.198
4.082LysIle: 4.082 ± 0.3
4.793LysLys: 4.793 ± 0.347
5.465LysLeu: 5.465 ± 0.309
2.37LysMet: 2.37 ± 0.188
2.831LysAsn: 2.831 ± 0.186
1.896LysPro: 1.896 ± 0.171
2.752LysGln: 2.752 ± 0.186
3.951LysArg: 3.951 ± 0.284
3.463LysSer: 3.463 ± 0.237
3.898LysThr: 3.898 ± 0.191
4.03LysVal: 4.03 ± 0.268
1.014LysTrp: 1.014 ± 0.095
2.818LysTyr: 2.818 ± 0.214
0.0LysXaa: 0.0 ± 0.0
Leu
5.399LeuAla: 5.399 ± 0.259
0.974LeuCys: 0.974 ± 0.125
5.61LeuAsp: 5.61 ± 0.295
5.294LeuGlu: 5.294 ± 0.291
2.937LeuPhe: 2.937 ± 0.191
4.161LeuGly: 4.161 ± 0.221
1.817LeuHis: 1.817 ± 0.155
4.293LeuIle: 4.293 ± 0.26
5.057LeuLys: 5.057 ± 0.275
5.412LeuLeu: 5.412 ± 0.286
2.239LeuMet: 2.239 ± 0.198
4.148LeuAsn: 4.148 ± 0.205
2.963LeuPro: 2.963 ± 0.209
2.502LeuGln: 2.502 ± 0.199
4.122LeuArg: 4.122 ± 0.232
5.36LeuSer: 5.36 ± 0.264
4.583LeuThr: 4.583 ± 0.261
4.767LeuVal: 4.767 ± 0.252
0.843LeuTrp: 0.843 ± 0.105
3.068LeuTyr: 3.068 ± 0.212
0.0LeuXaa: 0.0 ± 0.0
Met
1.475MetAla: 1.475 ± 0.149
0.316MetCys: 0.316 ± 0.066
1.027MetAsp: 1.027 ± 0.147
1.04MetGlu: 1.04 ± 0.167
1.238MetPhe: 1.238 ± 0.13
1.159MetGly: 1.159 ± 0.134
0.553MetHis: 0.553 ± 0.082
2.489MetIle: 2.489 ± 0.232
2.818MetLys: 2.818 ± 0.23
2.278MetLeu: 2.278 ± 0.173
0.922MetMet: 0.922 ± 0.14
1.909MetAsn: 1.909 ± 0.181
0.988MetPro: 0.988 ± 0.106
1.146MetGln: 1.146 ± 0.118
1.238MetArg: 1.238 ± 0.134
2.436MetSer: 2.436 ± 0.192
2.094MetThr: 2.094 ± 0.143
0.948MetVal: 0.948 ± 0.102
0.263MetTrp: 0.263 ± 0.054
1.119MetTyr: 1.119 ± 0.102
0.0MetXaa: 0.0 ± 0.0
Asn
3.793AsnAla: 3.793 ± 0.211
0.645AsnCys: 0.645 ± 0.094
3.411AsnAsp: 3.411 ± 0.265
3.582AsnGlu: 3.582 ± 0.23
2.028AsnPhe: 2.028 ± 0.149
4.227AsnGly: 4.227 ± 0.286
1.08AsnHis: 1.08 ± 0.11
3.371AsnIle: 3.371 ± 0.201
3.095AsnLys: 3.095 ± 0.197
3.74AsnLeu: 3.74 ± 0.25
1.251AsnMet: 1.251 ± 0.135
2.752AsnAsn: 2.752 ± 0.195
1.909AsnPro: 1.909 ± 0.186
1.435AsnGln: 1.435 ± 0.114
2.726AsnArg: 2.726 ± 0.181
2.95AsnSer: 2.95 ± 0.175
2.805AsnThr: 2.805 ± 0.208
3.463AsnVal: 3.463 ± 0.267
0.645AsnTrp: 0.645 ± 0.091
2.278AsnTyr: 2.278 ± 0.171
0.0AsnXaa: 0.0 ± 0.0
Pro
1.87ProAla: 1.87 ± 0.175
0.395ProCys: 0.395 ± 0.074
2.146ProAsp: 2.146 ± 0.165
2.765ProGlu: 2.765 ± 0.209
1.528ProPhe: 1.528 ± 0.121
1.58ProGly: 1.58 ± 0.145
0.751ProHis: 0.751 ± 0.101
1.62ProIle: 1.62 ± 0.139
1.646ProLys: 1.646 ± 0.188
2.278ProLeu: 2.278 ± 0.172
0.751ProMet: 0.751 ± 0.095
1.62ProAsn: 1.62 ± 0.154
0.527ProPro: 0.527 ± 0.075
1.08ProGln: 1.08 ± 0.119
1.185ProArg: 1.185 ± 0.128
2.067ProSer: 2.067 ± 0.174
1.936ProThr: 1.936 ± 0.189
2.107ProVal: 2.107 ± 0.169
0.329ProTrp: 0.329 ± 0.065
1.462ProTyr: 1.462 ± 0.15
0.0ProXaa: 0.0 ± 0.0
Gln
2.291GlnAla: 2.291 ± 0.189
0.435GlnCys: 0.435 ± 0.071
2.067GlnAsp: 2.067 ± 0.163
2.634GlnGlu: 2.634 ± 0.19
1.383GlnPhe: 1.383 ± 0.117
1.83GlnGly: 1.83 ± 0.185
0.619GlnHis: 0.619 ± 0.104
2.107GlnIle: 2.107 ± 0.17
2.16GlnLys: 2.16 ± 0.169
2.937GlnLeu: 2.937 ± 0.191
1.014GlnMet: 1.014 ± 0.126
1.593GlnAsn: 1.593 ± 0.151
0.909GlnPro: 0.909 ± 0.11
1.291GlnGln: 1.291 ± 0.179
1.923GlnArg: 1.923 ± 0.136
2.107GlnSer: 2.107 ± 0.183
1.778GlnThr: 1.778 ± 0.147
2.028GlnVal: 2.028 ± 0.158
0.579GlnTrp: 0.579 ± 0.097
1.462GlnTyr: 1.462 ± 0.128
0.0GlnXaa: 0.0 ± 0.0
Arg
3.371ArgAla: 3.371 ± 0.251
0.606ArgCys: 0.606 ± 0.086
3.384ArgAsp: 3.384 ± 0.258
3.45ArgGlu: 3.45 ± 0.249
2.067ArgPhe: 2.067 ± 0.156
2.897ArgGly: 2.897 ± 0.204
1.093ArgHis: 1.093 ± 0.121
3.371ArgIle: 3.371 ± 0.194
3.332ArgLys: 3.332 ± 0.248
4.056ArgLeu: 4.056 ± 0.215
1.396ArgMet: 1.396 ± 0.145
2.384ArgAsn: 2.384 ± 0.163
1.356ArgPro: 1.356 ± 0.117
1.37ArgGln: 1.37 ± 0.129
2.502ArgArg: 2.502 ± 0.156
3.174ArgSer: 3.174 ± 0.253
2.844ArgThr: 2.844 ± 0.175
3.621ArgVal: 3.621 ± 0.222
0.803ArgTrp: 0.803 ± 0.115
2.133ArgTyr: 2.133 ± 0.192
0.0ArgXaa: 0.0 ± 0.0
Ser
4.188SerAla: 4.188 ± 0.269
0.751SerCys: 0.751 ± 0.091
4.793SerAsp: 4.793 ± 0.263
4.214SerGlu: 4.214 ± 0.26
3.002SerPhe: 3.002 ± 0.189
4.411SerGly: 4.411 ± 0.404
1.238SerHis: 1.238 ± 0.119
4.214SerIle: 4.214 ± 0.227
4.03SerLys: 4.03 ± 0.261
4.556SerLeu: 4.556 ± 0.249
1.541SerMet: 1.541 ± 0.141
3.556SerAsn: 3.556 ± 0.285
1.87SerPro: 1.87 ± 0.144
1.804SerGln: 1.804 ± 0.156
2.897SerArg: 2.897 ± 0.185
4.201SerSer: 4.201 ± 0.333
3.74SerThr: 3.74 ± 0.287
4.543SerVal: 4.543 ± 0.252
0.711SerTrp: 0.711 ± 0.095
2.594SerTyr: 2.594 ± 0.205
0.0SerXaa: 0.0 ± 0.0
Thr
4.24ThrAla: 4.24 ± 0.285
0.685ThrCys: 0.685 ± 0.107
3.872ThrAsp: 3.872 ± 0.234
4.425ThrGlu: 4.425 ± 0.239
2.594ThrPhe: 2.594 ± 0.202
4.056ThrGly: 4.056 ± 0.337
1.317ThrHis: 1.317 ± 0.142
4.53ThrIle: 4.53 ± 0.258
3.503ThrLys: 3.503 ± 0.239
4.951ThrLeu: 4.951 ± 0.302
1.317ThrMet: 1.317 ± 0.13
2.884ThrAsn: 2.884 ± 0.237
2.423ThrPro: 2.423 ± 0.185
2.41ThrGln: 2.41 ± 0.206
2.449ThrArg: 2.449 ± 0.193
3.819ThrSer: 3.819 ± 0.23
3.648ThrThr: 3.648 ± 0.266
5.044ThrVal: 5.044 ± 0.328
0.974ThrTrp: 0.974 ± 0.133
2.212ThrTyr: 2.212 ± 0.184
0.0ThrXaa: 0.0 ± 0.0
Val
4.675ValAla: 4.675 ± 0.302
1.04ValCys: 1.04 ± 0.156
4.675ValAsp: 4.675 ± 0.274
5.267ValGlu: 5.267 ± 0.293
3.055ValPhe: 3.055 ± 0.211
3.608ValGly: 3.608 ± 0.23
1.804ValHis: 1.804 ± 0.157
4.306ValIle: 4.306 ± 0.239
4.411ValLys: 4.411 ± 0.281
4.951ValLeu: 4.951 ± 0.241
1.817ValMet: 1.817 ± 0.154
3.49ValAsn: 3.49 ± 0.218
1.87ValPro: 1.87 ± 0.172
2.265ValGln: 2.265 ± 0.181
3.569ValArg: 3.569 ± 0.23
4.583ValSer: 4.583 ± 0.256
4.714ValThr: 4.714 ± 0.325
4.938ValVal: 4.938 ± 0.252
0.869ValTrp: 0.869 ± 0.11
2.779ValTyr: 2.779 ± 0.199
0.0ValXaa: 0.0 ± 0.0
Trp
0.83TrpAla: 0.83 ± 0.103
0.158TrpCys: 0.158 ± 0.042
1.146TrpAsp: 1.146 ± 0.137
1.04TrpGlu: 1.04 ± 0.118
0.672TrpPhe: 0.672 ± 0.101
0.764TrpGly: 0.764 ± 0.108
0.5TrpHis: 0.5 ± 0.08
0.816TrpIle: 0.816 ± 0.104
0.816TrpLys: 0.816 ± 0.111
1.001TrpLeu: 1.001 ± 0.117
0.435TrpMet: 0.435 ± 0.082
0.777TrpAsn: 0.777 ± 0.101
0.092TrpPro: 0.092 ± 0.029
0.421TrpGln: 0.421 ± 0.08
0.724TrpArg: 0.724 ± 0.103
1.014TrpSer: 1.014 ± 0.125
0.724TrpThr: 0.724 ± 0.115
1.014TrpVal: 1.014 ± 0.109
0.184TrpTrp: 0.184 ± 0.05
0.803TrpTyr: 0.803 ± 0.105
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.002TyrAla: 3.002 ± 0.22
0.579TyrCys: 0.579 ± 0.088
3.279TyrAsp: 3.279 ± 0.22
2.989TyrGlu: 2.989 ± 0.228
1.633TyrPhe: 1.633 ± 0.151
2.568TyrGly: 2.568 ± 0.259
1.212TyrHis: 1.212 ± 0.139
2.423TyrIle: 2.423 ± 0.168
2.923TyrLys: 2.923 ± 0.228
2.739TyrLeu: 2.739 ± 0.195
1.027TyrMet: 1.027 ± 0.113
2.212TyrAsn: 2.212 ± 0.183
1.119TyrPro: 1.119 ± 0.133
1.501TyrGln: 1.501 ± 0.146
1.975TyrArg: 1.975 ± 0.137
2.384TyrSer: 2.384 ± 0.17
2.805TyrThr: 2.805 ± 0.197
3.226TyrVal: 3.226 ± 0.209
0.658TyrTrp: 0.658 ± 0.107
1.699TyrTyr: 1.699 ± 0.155
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 405 proteins (75939 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski