Amino acid dipeptide frequency for Cyanophage S-TIM4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred randomly in proteins, each one would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). This is not the case in nature, so for better visibility the dipeptides that are more frequent than expected are marked in red and those that are underrepresented are marked in blue in the table.
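The calculation described above can be sketched in Python. This is only an illustration under stated assumptions: the function names are hypothetical, dipeptides are counted as overlapping pairs within each protein, the result is normalized by the total number of dipeptides, and the expected value uses the 400-dipeptide alphabet. The exact counting and normalization used by Proteome-pI are not stated on this page.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"           # X stands for Xaa (non-standard or unknown)
EXPECTED_PER_MILLE = 1000.0 / 400   # uniform expectation: 0.25% = 2.5 per mille


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted inside each protein and
    pairs never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map every residue outside the 20 standard ones to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    freqs = {}
    for a, b in product(ALPHABET, repeat=2):
        pair = a + b
        freqs[pair] = 1000.0 * counts[pair] / total if total else 0.0
    return freqs


def classify(freq, expected=EXPECTED_PER_MILLE):
    """Label a frequency as over- or under-represented versus the expectation."""
    if freq > expected:
        return "over"    # would be marked red in the table
    if freq < expected:
        return "under"   # would be marked blue in the table
    return "equal"


# Toy input (not real S-TIM4 data): pairs are MA, AA, AG, AA, AK.
freqs = dipeptide_per_mille(["MAAG", "AAK"])
print(freqs["AA"], classify(freqs["AA"]))
```

Applied to the values in the table, `classify(6.272)` gives "over" for AlaAla and `classify(0.053)` gives "under" for CysCys.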

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
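As a small worked example of the unit conversion, the sketch below turns a per mille value from the table into a fraction and expresses the 0.25% uniform expectation in per mille so that it can be compared directly with the table entries. The helper names are hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def percent_to_per_mille(value):
    """Convert a percentage to per mille (1% equals 10 per mille)."""
    return value * 10


ala_ala = 6.272  # AlaAla frequency from the table, in per mille
print(f"{per_mille_to_fraction(ala_ala):.6f}")  # prints 0.006272
print(percent_to_per_mille(0.25))               # prints 2.5
```

On this scale AlaAla (6.272‰) lies well above the 2.5‰ uniform expectation.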

For more information, see the sequence space article on Wikipedia.

Columns (second amino acid): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.272 ± 0.437
AlaCys: 0.442 ± 0.092
AlaAsp: 4.188 ± 0.238
AlaGlu: 3.534 ± 0.284
AlaPhe: 2.986 ± 0.279
AlaGly: 6.131 ± 0.436
AlaHis: 1.095 ± 0.157
AlaIle: 4.453 ± 0.313
AlaLys: 4.294 ± 0.426
AlaLeu: 4.417 ± 0.271
AlaMet: 1.484 ± 0.206
AlaAsn: 4.081 ± 0.336
AlaPro: 2.332 ± 0.207
AlaGln: 2.756 ± 0.221
AlaArg: 2.456 ± 0.21
AlaSer: 5.142 ± 0.48
AlaThr: 6.078 ± 0.622
AlaVal: 4.523 ± 0.375
AlaTrp: 0.795 ± 0.113
AlaTyr: 2.562 ± 0.261
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.654 ± 0.117
CysCys: 0.053 ± 0.027
CysAsp: 0.689 ± 0.139
CysGlu: 0.53 ± 0.112
CysPhe: 0.459 ± 0.107
CysGly: 0.442 ± 0.082
CysHis: 0.212 ± 0.061
CysIle: 0.495 ± 0.104
CysLys: 0.601 ± 0.115
CysLeu: 0.742 ± 0.128
CysMet: 0.283 ± 0.096
CysAsn: 0.353 ± 0.09
CysPro: 0.318 ± 0.065
CysGln: 0.477 ± 0.111
CysArg: 0.353 ± 0.111
CysSer: 0.495 ± 0.092
CysThr: 0.583 ± 0.118
CysVal: 0.477 ± 0.098
CysTrp: 0.124 ± 0.052
CysTyr: 0.406 ± 0.083
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.877 ± 0.293
AspCys: 0.654 ± 0.11
AspAsp: 4.647 ± 0.365
AspGlu: 3.94 ± 0.311
AspPhe: 3.145 ± 0.241
AspGly: 5.53 ± 0.381
AspHis: 1.237 ± 0.187
AspIle: 4.276 ± 0.264
AspLys: 3.746 ± 0.348
AspLeu: 5.23 ± 0.339
AspMet: 1.431 ± 0.188
AspAsn: 3.693 ± 0.267
AspPro: 3.286 ± 0.244
AspGln: 1.961 ± 0.199
AspArg: 2.509 ± 0.217
AspSer: 3.922 ± 0.283
AspThr: 3.64 ± 0.337
AspVal: 3.905 ± 0.341
AspTrp: 0.972 ± 0.141
AspTyr: 3.269 ± 0.259
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.392 ± 0.28
GluCys: 0.565 ± 0.101
GluAsp: 3.781 ± 0.326
GluGlu: 4.417 ± 0.534
GluPhe: 3.004 ± 0.244
GluGly: 3.852 ± 0.305
GluHis: 1.025 ± 0.171
GluIle: 4.241 ± 0.277
GluLys: 3.675 ± 0.444
GluLeu: 5.177 ± 0.362
GluMet: 1.29 ± 0.216
GluAsn: 3.145 ± 0.262
GluPro: 1.643 ± 0.202
GluGln: 2.262 ± 0.174
GluArg: 2.315 ± 0.244
GluSer: 2.915 ± 0.314
GluThr: 3.622 ± 0.242
GluVal: 4.382 ± 0.28
GluTrp: 1.025 ± 0.17
GluTyr: 2.88 ± 0.261
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.898 ± 0.2
PheCys: 0.406 ± 0.087
PheAsp: 3.551 ± 0.271
PheGlu: 2.615 ± 0.202
PhePhe: 1.696 ± 0.22
PheGly: 3.18 ± 0.251
PheHis: 0.654 ± 0.139
PheIle: 2.739 ± 0.234
PheLys: 2.562 ± 0.233
PheLeu: 3.145 ± 0.274
PheMet: 1.042 ± 0.152
PheAsn: 2.562 ± 0.247
PhePro: 1.838 ± 0.211
PheGln: 1.626 ± 0.157
PheArg: 1.59 ± 0.165
PheSer: 2.862 ± 0.28
PheThr: 3.675 ± 0.413
PheVal: 2.65 ± 0.276
PheTrp: 0.336 ± 0.08
PheTyr: 1.767 ± 0.165
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.884 ± 0.506
GlyCys: 0.618 ± 0.126
GlyAsp: 5.089 ± 0.297
GlyGlu: 3.693 ± 0.261
GlyPhe: 3.198 ± 0.271
GlyGly: 7.951 ± 1.028
GlyHis: 1.166 ± 0.169
GlyIle: 4.258 ± 0.314
GlyLys: 4.046 ± 0.344
GlyLeu: 4.806 ± 0.333
GlyMet: 1.573 ± 0.278
GlyAsn: 4.612 ± 0.428
GlyPro: 1.944 ± 0.19
GlyGln: 2.421 ± 0.244
GlyArg: 2.862 ± 0.209
GlySer: 6.749 ± 0.673
GlyThr: 7.651 ± 0.668
GlyVal: 4.859 ± 0.357
GlyTrp: 1.095 ± 0.15
GlyTyr: 3.251 ± 0.283
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.866 ± 0.133
HisCys: 0.283 ± 0.079
HisAsp: 1.042 ± 0.173
HisGlu: 1.025 ± 0.191
HisPhe: 0.83 ± 0.155
HisGly: 1.219 ± 0.145
HisHis: 0.512 ± 0.133
HisIle: 0.936 ± 0.17
HisLys: 0.777 ± 0.132
HisLeu: 1.025 ± 0.114
HisMet: 0.389 ± 0.083
HisAsn: 0.707 ± 0.135
HisPro: 1.113 ± 0.16
HisGln: 0.53 ± 0.096
HisArg: 0.512 ± 0.095
HisSer: 1.025 ± 0.156
HisThr: 1.29 ± 0.161
HisVal: 0.883 ± 0.155
HisTrp: 0.336 ± 0.09
HisTyr: 0.848 ± 0.131
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.975 ± 0.21
IleCys: 0.636 ± 0.144
IleAsp: 4.559 ± 0.308
IleGlu: 4.205 ± 0.285
IlePhe: 2.562 ± 0.24
IleGly: 4.17 ± 0.311
IleHis: 0.795 ± 0.14
IleIle: 3.481 ± 0.258
IleLys: 4.17 ± 0.254
IleLeu: 3.975 ± 0.321
IleMet: 1.078 ± 0.147
IleAsn: 4.099 ± 0.28
IlePro: 2.792 ± 0.227
IleGln: 2.739 ± 0.222
IleArg: 2.191 ± 0.259
IleSer: 4.311 ± 0.336
IleThr: 5.583 ± 0.65
IleVal: 4.347 ± 0.359
IleTrp: 0.477 ± 0.082
IleTyr: 2.297 ± 0.288
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.869 ± 0.406
LysCys: 0.618 ± 0.112
LysAsp: 3.728 ± 0.317
LysGlu: 4.506 ± 0.535
LysPhe: 2.491 ± 0.245
LysGly: 3.922 ± 0.401
LysHis: 1.095 ± 0.205
LysIle: 4.294 ± 0.36
LysLys: 5.354 ± 0.698
LysLeu: 4.735 ± 0.282
LysMet: 1.59 ± 0.246
LysAsn: 3.428 ± 0.364
LysPro: 2.209 ± 0.273
LysGln: 2.279 ± 0.246
LysArg: 2.633 ± 0.3
LysSer: 3.551 ± 0.305
LysThr: 3.816 ± 0.239
LysVal: 4.311 ± 0.353
LysTrp: 0.707 ± 0.112
LysTyr: 2.933 ± 0.31
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.347 ± 0.307
LeuCys: 0.671 ± 0.123
LeuAsp: 5.513 ± 0.28
LeuGlu: 4.028 ± 0.313
LeuPhe: 2.951 ± 0.215
LeuGly: 4.523 ± 0.339
LeuHis: 1.113 ± 0.137
LeuIle: 4.011 ± 0.244
LeuLys: 4.859 ± 0.41
LeuLeu: 4.912 ± 0.347
LeuMet: 1.219 ± 0.209
LeuAsn: 4.841 ± 0.391
LeuPro: 2.968 ± 0.302
LeuGln: 2.544 ± 0.222
LeuArg: 2.898 ± 0.259
LeuSer: 5.407 ± 0.32
LeuThr: 6.078 ± 0.51
LeuVal: 4.47 ± 0.238
LeuTrp: 0.548 ± 0.114
LeuTyr: 3.322 ± 0.293
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.679 ± 0.257
MetCys: 0.124 ± 0.053
MetAsp: 1.025 ± 0.152
MetGlu: 1.325 ± 0.201
MetPhe: 0.972 ± 0.162
MetGly: 1.042 ± 0.176
MetHis: 0.477 ± 0.097
MetIle: 1.396 ± 0.197
MetLys: 2.191 ± 0.369
MetLeu: 1.219 ± 0.171
MetMet: 0.265 ± 0.067
MetAsn: 1.131 ± 0.19
MetPro: 0.795 ± 0.143
MetGln: 0.883 ± 0.149
MetArg: 1.007 ± 0.173
MetSer: 1.502 ± 0.229
MetThr: 1.555 ± 0.199
MetVal: 0.919 ± 0.135
MetTrp: 0.177 ± 0.061
MetTyr: 0.707 ± 0.112
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.329 ± 0.43
AsnCys: 0.601 ± 0.119
AsnAsp: 3.092 ± 0.266
AsnGlu: 2.933 ± 0.229
AsnPhe: 2.562 ± 0.274
AsnGly: 5.036 ± 0.502
AsnHis: 0.883 ± 0.136
AsnIle: 4.311 ± 0.42
AsnLys: 3.286 ± 0.333
AsnLeu: 4.576 ± 0.316
AsnMet: 0.83 ± 0.147
AsnAsn: 3.604 ± 0.316
AsnPro: 3.074 ± 0.252
AsnGln: 2.58 ± 0.189
AsnArg: 2.173 ± 0.191
AsnSer: 4.329 ± 0.278
AsnThr: 4.647 ± 0.431
AsnVal: 3.799 ± 0.278
AsnTrp: 0.777 ± 0.135
AsnTyr: 2.58 ± 0.181
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.244 ± 0.22
ProCys: 0.283 ± 0.08
ProAsp: 3.004 ± 0.25
ProGlu: 2.456 ± 0.249
ProPhe: 1.661 ± 0.183
ProGly: 2.898 ± 0.319
ProHis: 0.689 ± 0.114
ProIle: 2.138 ± 0.19
ProLys: 2.421 ± 0.317
ProLeu: 2.438 ± 0.204
ProMet: 0.53 ± 0.108
ProAsn: 2.491 ± 0.26
ProPro: 1.59 ± 0.246
ProGln: 0.972 ± 0.127
ProArg: 1.714 ± 0.177
ProSer: 3.021 ± 0.209
ProThr: 3.463 ± 0.314
ProVal: 2.633 ± 0.201
ProTrp: 0.495 ± 0.111
ProTyr: 1.696 ± 0.206
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.944 ± 0.17
GlnCys: 0.336 ± 0.1
GlnAsp: 1.908 ± 0.189
GlnGlu: 2.509 ± 0.193
GlnPhe: 1.608 ± 0.176
GlnGly: 2.509 ± 0.217
GlnHis: 0.583 ± 0.128
GlnIle: 2.332 ± 0.189
GlnLys: 2.368 ± 0.26
GlnLeu: 2.968 ± 0.207
GlnMet: 0.954 ± 0.131
GlnAsn: 2.156 ± 0.164
GlnPro: 1.431 ± 0.146
GlnGln: 1.254 ± 0.153
GlnArg: 1.714 ± 0.191
GlnSer: 2.191 ± 0.207
GlnThr: 2.385 ± 0.262
GlnVal: 2.456 ± 0.202
GlnTrp: 0.442 ± 0.105
GlnTyr: 1.82 ± 0.225
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.226 ± 0.214
ArgCys: 0.353 ± 0.081
ArgAsp: 2.262 ± 0.171
ArgGlu: 2.368 ± 0.303
ArgPhe: 2.032 ± 0.185
ArgGly: 2.615 ± 0.287
ArgHis: 0.742 ± 0.138
ArgIle: 2.668 ± 0.226
ArgLys: 2.898 ± 0.324
ArgLeu: 3.304 ± 0.227
ArgMet: 1.219 ± 0.173
ArgAsn: 2.05 ± 0.21
ArgPro: 1.343 ± 0.159
ArgGln: 1.537 ± 0.169
ArgArg: 1.749 ± 0.256
ArgSer: 2.279 ± 0.222
ArgThr: 2.332 ± 0.294
ArgVal: 2.915 ± 0.205
ArgTrp: 0.353 ± 0.093
ArgTyr: 1.961 ± 0.204
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.919 ± 0.49
SerCys: 0.459 ± 0.091
SerAsp: 4.382 ± 0.316
SerGlu: 3.233 ± 0.288
SerPhe: 2.915 ± 0.339
SerGly: 7.509 ± 0.678
SerHis: 0.936 ± 0.15
SerIle: 4.329 ± 0.435
SerLys: 3.675 ± 0.325
SerLeu: 4.329 ± 0.28
SerMet: 1.537 ± 0.175
SerAsn: 4.276 ± 0.351
SerPro: 2.527 ± 0.275
SerGln: 1.944 ± 0.19
SerArg: 2.403 ± 0.248
SerSer: 5.619 ± 0.603
SerThr: 5.707 ± 0.469
SerVal: 4.629 ± 0.334
SerTrp: 0.424 ± 0.108
SerTyr: 2.703 ± 0.198
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.961 ± 0.715
ThrCys: 0.654 ± 0.11
ThrAsp: 4.347 ± 0.381
ThrGlu: 3.622 ± 0.236
ThrPhe: 3.587 ± 0.353
ThrGly: 6.979 ± 0.798
ThrHis: 0.901 ± 0.122
ThrIle: 5.389 ± 0.536
ThrLys: 3.534 ± 0.28
ThrLeu: 5.831 ± 0.491
ThrMet: 1.184 ± 0.16
ThrAsn: 4.788 ± 0.453
ThrPro: 3.092 ± 0.231
ThrGln: 2.527 ± 0.22
ThrArg: 2.898 ± 0.189
ThrSer: 5.884 ± 0.511
ThrThr: 7.315 ± 0.737
ThrVal: 5.919 ± 0.702
ThrTrp: 0.866 ± 0.124
ThrTyr: 3.092 ± 0.237
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.665 ± 0.348
ValCys: 0.406 ± 0.097
ValAsp: 4.824 ± 0.391
ValGlu: 4.311 ± 0.302
ValPhe: 2.544 ± 0.197
ValGly: 4.806 ± 0.481
ValHis: 0.813 ± 0.113
ValIle: 3.516 ± 0.266
ValLys: 4.099 ± 0.312
ValLeu: 4.294 ± 0.265
ValMet: 1.343 ± 0.217
ValAsn: 4.329 ± 0.314
ValPro: 2.756 ± 0.222
ValGln: 2.209 ± 0.152
ValArg: 2.703 ± 0.205
ValSer: 5.159 ± 0.357
ValThr: 6.06 ± 0.655
ValVal: 4.364 ± 0.326
ValTrp: 0.724 ± 0.134
ValTyr: 2.385 ± 0.225
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.601 ± 0.099
TrpCys: 0.141 ± 0.041
TrpAsp: 0.707 ± 0.123
TrpGlu: 0.724 ± 0.135
TrpPhe: 0.512 ± 0.101
TrpGly: 0.636 ± 0.092
TrpHis: 0.371 ± 0.107
TrpIle: 0.742 ± 0.132
TrpLys: 0.742 ± 0.155
TrpLeu: 0.724 ± 0.128
TrpMet: 0.336 ± 0.089
TrpAsn: 0.795 ± 0.13
TrpPro: 0.212 ± 0.059
TrpGln: 0.336 ± 0.085
TrpArg: 0.459 ± 0.096
TrpSer: 0.76 ± 0.114
TrpThr: 0.883 ± 0.139
TrpVal: 0.936 ± 0.139
TrpTrp: 0.141 ± 0.051
TrpTyr: 0.477 ± 0.089
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.421 ± 0.185
TyrCys: 0.442 ± 0.101
TyrAsp: 3.534 ± 0.267
TyrGlu: 2.633 ± 0.281
TyrPhe: 1.679 ± 0.175
TyrGly: 2.792 ± 0.192
TyrHis: 0.848 ± 0.144
TyrIle: 2.438 ± 0.239
TyrLys: 2.774 ± 0.24
TyrLeu: 3.375 ± 0.252
TyrMet: 0.795 ± 0.128
TyrAsn: 2.862 ± 0.253
TyrPro: 1.573 ± 0.189
TyrGln: 2.032 ± 0.18
TyrArg: 2.067 ± 0.237
TyrSer: 2.332 ± 0.262
TyrThr: 3.004 ± 0.35
TyrVal: 2.933 ± 0.271
TyrTrp: 0.424 ± 0.101
TyrTyr: 2.12 ± 0.204
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 235 proteins (56598 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
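A protein-level bootstrap of this kind could look like the following minimal sketch. It assumes that whole proteins are resampled with replacement 100 times, that the frequency is recomputed on each resample, and that the reported error is the standard deviation across resamples. These details, the function names, and the fixed seed are assumptions for illustration, not the documented Proteome-pI procedure.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Proteins, not individual residues, are the resampling unit.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the spread is summarized as the sample standard deviation.
    return statistics.stdev(estimates)


# Toy input (not real S-TIM4 data).
toy_proteins = ["MAAGT", "MKAAL", "MGGSTV", "MTTAAG"]
print(bootstrap_error(toy_proteins, "AA"))
```

Resampling whole proteins keeps the dipeptides of one protein together, so the estimate reflects variation between proteins rather than between individual positions.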

The above dipeptide statistics (among other stats for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski