Amino acid dipepetide frequency for Shewanella sp. phage 1/4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.544AlaAla: 4.544 ± 0.51
0.539AlaCys: 0.539 ± 0.131
3.209AlaAsp: 3.209 ± 0.276
3.723AlaGlu: 3.723 ± 0.34
1.848AlaPhe: 1.848 ± 0.188
4.159AlaGly: 4.159 ± 0.374
1.001AlaHis: 1.001 ± 0.165
3.723AlaIle: 3.723 ± 0.295
3.62AlaLys: 3.62 ± 0.379
4.133AlaLeu: 4.133 ± 0.355
1.361AlaMet: 1.361 ± 0.21
3.106AlaAsn: 3.106 ± 0.292
1.412AlaPro: 1.412 ± 0.206
1.951AlaGln: 1.951 ± 0.266
1.951AlaArg: 1.951 ± 0.264
3.8AlaSer: 3.8 ± 0.322
3.491AlaThr: 3.491 ± 0.298
3.543AlaVal: 3.543 ± 0.345
0.719AlaTrp: 0.719 ± 0.14
1.977AlaTyr: 1.977 ± 0.243
0.0AlaXaa: 0.0 ± 0.0
Cys
0.616CysAla: 0.616 ± 0.157
0.205CysCys: 0.205 ± 0.075
1.155CysAsp: 1.155 ± 0.178
1.181CysGlu: 1.181 ± 0.195
0.77CysPhe: 0.77 ± 0.151
1.54CysGly: 1.54 ± 0.188
0.436CysHis: 0.436 ± 0.105
0.95CysIle: 0.95 ± 0.191
1.078CysLys: 1.078 ± 0.191
1.335CysLeu: 1.335 ± 0.239
0.385CysMet: 0.385 ± 0.091
0.899CysAsn: 0.899 ± 0.146
0.385CysPro: 0.385 ± 0.119
0.462CysGln: 0.462 ± 0.109
0.667CysArg: 0.667 ± 0.144
1.104CysSer: 1.104 ± 0.269
0.873CysThr: 0.873 ± 0.162
0.847CysVal: 0.847 ± 0.132
0.359CysTrp: 0.359 ± 0.088
0.436CysTyr: 0.436 ± 0.101
0.0CysXaa: 0.0 ± 0.0
Asp
3.132AspAla: 3.132 ± 0.304
1.284AspCys: 1.284 ± 0.197
3.954AspAsp: 3.954 ± 0.369
3.8AspGlu: 3.8 ± 0.341
2.85AspPhe: 2.85 ± 0.292
5.083AspGly: 5.083 ± 0.346
1.027AspHis: 1.027 ± 0.151
4.39AspIle: 4.39 ± 0.387
4.955AspLys: 4.955 ± 0.344
6.367AspLeu: 6.367 ± 0.456
1.797AspMet: 1.797 ± 0.218
4.236AspAsn: 4.236 ± 0.314
1.977AspPro: 1.977 ± 0.222
1.361AspGln: 1.361 ± 0.157
1.771AspArg: 1.771 ± 0.2
4.082AspSer: 4.082 ± 0.282
4.21AspThr: 4.21 ± 0.334
4.39AspVal: 4.39 ± 0.316
1.72AspTrp: 1.72 ± 0.238
3.106AspTyr: 3.106 ± 0.3
0.0AspXaa: 0.0 ± 0.0
Glu
3.389GluAla: 3.389 ± 0.395
1.207GluCys: 1.207 ± 0.197
4.595GluAsp: 4.595 ± 0.343
4.826GluGlu: 4.826 ± 0.391
2.542GluPhe: 2.542 ± 0.245
4.467GluGly: 4.467 ± 0.397
1.258GluHis: 1.258 ± 0.196
4.082GluIle: 4.082 ± 0.328
3.491GluLys: 3.491 ± 0.353
6.315GluLeu: 6.315 ± 0.521
1.771GluMet: 1.771 ± 0.189
3.055GluAsn: 3.055 ± 0.328
1.463GluPro: 1.463 ± 0.209
2.054GluGln: 2.054 ± 0.257
2.696GluArg: 2.696 ± 0.294
3.954GluSer: 3.954 ± 0.332
2.747GluThr: 2.747 ± 0.261
5.853GluVal: 5.853 ± 0.417
0.924GluTrp: 0.924 ± 0.165
2.952GluTyr: 2.952 ± 0.318
0.0GluXaa: 0.0 ± 0.0
Phe
1.72PheAla: 1.72 ± 0.21
0.539PheCys: 0.539 ± 0.101
2.798PheAsp: 2.798 ± 0.275
2.234PheGlu: 2.234 ± 0.239
1.232PhePhe: 1.232 ± 0.203
2.157PheGly: 2.157 ± 0.297
0.745PheHis: 0.745 ± 0.137
3.055PheIle: 3.055 ± 0.375
3.466PheLys: 3.466 ± 0.348
3.106PheLeu: 3.106 ± 0.339
1.617PheMet: 1.617 ± 0.216
3.081PheAsn: 3.081 ± 0.265
1.258PhePro: 1.258 ± 0.195
0.77PheGln: 0.77 ± 0.13
1.412PheArg: 1.412 ± 0.218
3.106PheSer: 3.106 ± 0.339
3.029PheThr: 3.029 ± 0.265
2.259PheVal: 2.259 ± 0.231
0.411PheTrp: 0.411 ± 0.094
1.361PheTyr: 1.361 ± 0.178
0.0PheXaa: 0.0 ± 0.0
Gly
3.286GlyAla: 3.286 ± 0.335
1.284GlyCys: 1.284 ± 0.203
4.647GlyAsp: 4.647 ± 0.44
3.723GlyGlu: 3.723 ± 0.296
3.183GlyPhe: 3.183 ± 0.275
5.674GlyGly: 5.674 ± 0.454
1.258GlyHis: 1.258 ± 0.184
3.517GlyIle: 3.517 ± 0.306
4.595GlyLys: 4.595 ± 0.381
5.263GlyLeu: 5.263 ± 0.376
1.592GlyMet: 1.592 ± 0.189
3.928GlyAsn: 3.928 ± 0.344
0.308GlyPro: 0.308 ± 0.19
1.592GlyGln: 1.592 ± 0.208
2.362GlyArg: 2.362 ± 0.225
5.597GlySer: 5.597 ± 0.438
4.159GlyThr: 4.159 ± 0.383
5.956GlyVal: 5.956 ± 0.394
1.309GlyTrp: 1.309 ± 0.164
3.029GlyTyr: 3.029 ± 0.32
0.0GlyXaa: 0.0 ± 0.0
His
1.001HisAla: 1.001 ± 0.151
0.334HisCys: 0.334 ± 0.097
1.361HisAsp: 1.361 ± 0.195
1.027HisGlu: 1.027 ± 0.162
0.77HisPhe: 0.77 ± 0.136
1.309HisGly: 1.309 ± 0.171
0.822HisHis: 0.822 ± 0.177
1.309HisIle: 1.309 ± 0.184
1.54HisLys: 1.54 ± 0.179
1.386HisLeu: 1.386 ± 0.196
0.539HisMet: 0.539 ± 0.116
1.284HisAsn: 1.284 ± 0.191
0.616HisPro: 0.616 ± 0.127
0.693HisGln: 0.693 ± 0.133
0.693HisArg: 0.693 ± 0.144
1.361HisSer: 1.361 ± 0.173
1.643HisThr: 1.643 ± 0.252
1.335HisVal: 1.335 ± 0.201
0.411HisTrp: 0.411 ± 0.112
0.976HisTyr: 0.976 ± 0.142
0.0HisXaa: 0.0 ± 0.0
Ile
3.44IleAla: 3.44 ± 0.287
0.642IleCys: 0.642 ± 0.159
5.109IleAsp: 5.109 ± 0.348
4.929IleGlu: 4.929 ± 0.361
1.797IlePhe: 1.797 ± 0.197
3.877IleGly: 3.877 ± 0.355
1.155IleHis: 1.155 ± 0.153
4.441IleIle: 4.441 ± 0.376
5.545IleLys: 5.545 ± 0.391
4.749IleLeu: 4.749 ± 0.41
2.079IleMet: 2.079 ± 0.228
4.313IleAsn: 4.313 ± 0.375
2.439IlePro: 2.439 ± 0.222
2.208IleGln: 2.208 ± 0.279
2.336IleArg: 2.336 ± 0.204
4.185IleSer: 4.185 ± 0.297
4.955IleThr: 4.955 ± 0.339
3.62IleVal: 3.62 ± 0.338
0.873IleTrp: 0.873 ± 0.127
2.002IleTyr: 2.002 ± 0.206
0.0IleXaa: 0.0 ± 0.0
Lys
3.44LysAla: 3.44 ± 0.341
0.822LysCys: 0.822 ± 0.172
4.749LysAsp: 4.749 ± 0.381
4.749LysGlu: 4.749 ± 0.389
2.567LysPhe: 2.567 ± 0.24
4.313LysGly: 4.313 ± 0.377
1.643LysHis: 1.643 ± 0.219
4.339LysIle: 4.339 ± 0.378
4.082LysLys: 4.082 ± 0.438
6.572LysLeu: 6.572 ± 0.486
2.619LysMet: 2.619 ± 0.31
2.824LysAsn: 2.824 ± 0.298
2.49LysPro: 2.49 ± 0.302
2.927LysGln: 2.927 ± 0.32
3.183LysArg: 3.183 ± 0.333
4.493LysSer: 4.493 ± 0.329
4.57LysThr: 4.57 ± 0.358
6.136LysVal: 6.136 ± 0.454
1.284LysTrp: 1.284 ± 0.194
3.723LysTyr: 3.723 ± 0.358
0.0LysXaa: 0.0 ± 0.0
Leu
4.878LeuAla: 4.878 ± 0.34
1.848LeuCys: 1.848 ± 0.227
6.264LeuAsp: 6.264 ± 0.367
5.417LeuGlu: 5.417 ± 0.348
3.132LeuPhe: 3.132 ± 0.274
4.929LeuGly: 4.929 ± 0.31
2.157LeuHis: 2.157 ± 0.208
5.186LeuIle: 5.186 ± 0.389
6.213LeuLys: 6.213 ± 0.459
7.522LeuLeu: 7.522 ± 0.489
2.105LeuMet: 2.105 ± 0.22
4.903LeuAsn: 4.903 ± 0.382
3.183LeuPro: 3.183 ± 0.254
2.696LeuGln: 2.696 ± 0.295
3.209LeuArg: 3.209 ± 0.302
6.11LeuSer: 6.11 ± 0.399
6.983LeuThr: 6.983 ± 0.423
4.826LeuVal: 4.826 ± 0.347
1.078LeuTrp: 1.078 ± 0.165
3.363LeuTyr: 3.363 ± 0.29
0.0LeuXaa: 0.0 ± 0.0
Met
1.669MetAla: 1.669 ± 0.231
0.205MetCys: 0.205 ± 0.08
1.335MetAsp: 1.335 ± 0.202
1.592MetGlu: 1.592 ± 0.205
1.258MetPhe: 1.258 ± 0.175
1.515MetGly: 1.515 ± 0.196
0.359MetHis: 0.359 ± 0.106
1.515MetIle: 1.515 ± 0.195
2.593MetLys: 2.593 ± 0.238
3.029MetLeu: 3.029 ± 0.267
1.053MetMet: 1.053 ± 0.2
1.489MetAsn: 1.489 ± 0.204
0.847MetPro: 0.847 ± 0.123
1.053MetGln: 1.053 ± 0.145
1.104MetArg: 1.104 ± 0.161
2.542MetSer: 2.542 ± 0.253
1.771MetThr: 1.771 ± 0.206
2.157MetVal: 2.157 ± 0.251
0.488MetTrp: 0.488 ± 0.116
0.77MetTyr: 0.77 ± 0.151
0.0MetXaa: 0.0 ± 0.0
Asn
2.619AsnAla: 2.619 ± 0.345
0.693AsnCys: 0.693 ± 0.141
2.927AsnAsp: 2.927 ± 0.272
2.875AsnGlu: 2.875 ± 0.276
1.977AsnPhe: 1.977 ± 0.211
3.8AsnGly: 3.8 ± 0.359
1.207AsnHis: 1.207 ± 0.168
4.287AsnIle: 4.287 ± 0.363
5.032AsnLys: 5.032 ± 0.352
4.416AsnLeu: 4.416 ± 0.375
1.848AsnMet: 1.848 ± 0.211
3.543AsnAsn: 3.543 ± 0.317
2.285AsnPro: 2.285 ± 0.235
1.797AsnGln: 1.797 ± 0.203
2.465AsnArg: 2.465 ± 0.249
3.62AsnSer: 3.62 ± 0.31
4.801AsnThr: 4.801 ± 0.268
3.055AsnVal: 3.055 ± 0.356
0.95AsnTrp: 0.95 ± 0.178
2.593AsnTyr: 2.593 ± 0.252
0.0AsnXaa: 0.0 ± 0.0
Pro
1.746ProAla: 1.746 ± 0.275
0.539ProCys: 0.539 ± 0.137
2.259ProAsp: 2.259 ± 0.26
2.157ProGlu: 2.157 ± 0.223
1.361ProPhe: 1.361 ± 0.196
0.0ProGly: 0.0 ± 0.0
0.642ProHis: 0.642 ± 0.132
1.72ProIle: 1.72 ± 0.232
1.951ProLys: 1.951 ± 0.278
2.567ProLeu: 2.567 ± 0.246
0.796ProMet: 0.796 ± 0.155
1.746ProAsn: 1.746 ± 0.229
1.001ProPro: 1.001 ± 0.154
1.463ProGln: 1.463 ± 0.179
0.976ProArg: 0.976 ± 0.165
2.567ProSer: 2.567 ± 0.282
2.388ProThr: 2.388 ± 0.344
2.567ProVal: 2.567 ± 0.229
0.462ProTrp: 0.462 ± 0.113
1.386ProTyr: 1.386 ± 0.217
0.0ProXaa: 0.0 ± 0.0
Gln
1.951GlnAla: 1.951 ± 0.232
0.565GlnCys: 0.565 ± 0.122
1.823GlnAsp: 1.823 ± 0.259
2.285GlnGlu: 2.285 ± 0.286
1.309GlnPhe: 1.309 ± 0.164
2.079GlnGly: 2.079 ± 0.202
0.745GlnHis: 0.745 ± 0.142
2.079GlnIle: 2.079 ± 0.253
2.105GlnLys: 2.105 ± 0.268
2.696GlnLeu: 2.696 ± 0.265
0.796GlnMet: 0.796 ± 0.147
1.489GlnAsn: 1.489 ± 0.183
1.078GlnPro: 1.078 ± 0.191
1.694GlnGln: 1.694 ± 0.225
1.566GlnArg: 1.566 ± 0.202
2.054GlnSer: 2.054 ± 0.202
1.848GlnThr: 1.848 ± 0.271
2.028GlnVal: 2.028 ± 0.236
0.411GlnTrp: 0.411 ± 0.099
1.669GlnTyr: 1.669 ± 0.22
0.0GlnXaa: 0.0 ± 0.0
Arg
1.669ArgAla: 1.669 ± 0.219
0.847ArgCys: 0.847 ± 0.182
2.362ArgAsp: 2.362 ± 0.269
2.516ArgGlu: 2.516 ± 0.266
1.771ArgPhe: 1.771 ± 0.189
2.696ArgGly: 2.696 ± 0.27
0.796ArgHis: 0.796 ± 0.144
2.49ArgIle: 2.49 ± 0.239
3.081ArgLys: 3.081 ± 0.364
3.902ArgLeu: 3.902 ± 0.288
0.976ArgMet: 0.976 ± 0.147
2.105ArgAsn: 2.105 ± 0.262
1.001ArgPro: 1.001 ± 0.167
1.489ArgGln: 1.489 ± 0.207
1.438ArgArg: 1.438 ± 0.164
2.67ArgSer: 2.67 ± 0.261
1.823ArgThr: 1.823 ± 0.192
3.414ArgVal: 3.414 ± 0.312
0.642ArgTrp: 0.642 ± 0.152
1.438ArgTyr: 1.438 ± 0.21
0.0ArgXaa: 0.0 ± 0.0
Ser
4.005SerAla: 4.005 ± 0.333
0.924SerCys: 0.924 ± 0.174
4.416SerAsp: 4.416 ± 0.394
3.774SerGlu: 3.774 ± 0.283
3.081SerPhe: 3.081 ± 0.295
5.135SerGly: 5.135 ± 0.488
1.592SerHis: 1.592 ± 0.225
4.826SerIle: 4.826 ± 0.433
4.801SerLys: 4.801 ± 0.382
5.776SerLeu: 5.776 ± 0.41
1.746SerMet: 1.746 ± 0.223
3.44SerAsn: 3.44 ± 0.268
2.465SerPro: 2.465 ± 0.252
2.028SerGln: 2.028 ± 0.28
3.337SerArg: 3.337 ± 0.313
4.518SerSer: 4.518 ± 0.405
4.39SerThr: 4.39 ± 0.392
5.058SerVal: 5.058 ± 0.397
0.976SerTrp: 0.976 ± 0.17
2.696SerTyr: 2.696 ± 0.221
0.0SerXaa: 0.0 ± 0.0
Thr
3.748ThrAla: 3.748 ± 0.376
0.745ThrCys: 0.745 ± 0.143
3.389ThrAsp: 3.389 ± 0.279
3.697ThrGlu: 3.697 ± 0.306
3.081ThrPhe: 3.081 ± 0.334
4.775ThrGly: 4.775 ± 0.503
1.566ThrHis: 1.566 ± 0.236
4.724ThrIle: 4.724 ± 0.394
4.416ThrLys: 4.416 ± 0.394
5.956ThrLeu: 5.956 ± 0.395
1.335ThrMet: 1.335 ± 0.185
3.748ThrAsn: 3.748 ± 0.33
3.312ThrPro: 3.312 ± 0.326
2.157ThrGln: 2.157 ± 0.271
2.644ThrArg: 2.644 ± 0.244
4.724ThrSer: 4.724 ± 0.397
5.699ThrThr: 5.699 ± 0.491
4.621ThrVal: 4.621 ± 0.394
0.873ThrTrp: 0.873 ± 0.152
2.773ThrTyr: 2.773 ± 0.286
0.0ThrXaa: 0.0 ± 0.0
Val
4.082ValAla: 4.082 ± 0.331
1.232ValCys: 1.232 ± 0.166
5.237ValAsp: 5.237 ± 0.357
5.237ValGlu: 5.237 ± 0.38
1.951ValPhe: 1.951 ± 0.265
5.212ValGly: 5.212 ± 0.441
1.104ValHis: 1.104 ± 0.146
4.595ValIle: 4.595 ± 0.345
5.058ValLys: 5.058 ± 0.474
5.391ValLeu: 5.391 ± 0.402
2.208ValMet: 2.208 ± 0.23
4.262ValAsn: 4.262 ± 0.32
1.797ValPro: 1.797 ± 0.208
2.285ValGln: 2.285 ± 0.229
2.67ValArg: 2.67 ± 0.248
4.364ValSer: 4.364 ± 0.341
4.929ValThr: 4.929 ± 0.33
5.212ValVal: 5.212 ± 0.369
1.027ValTrp: 1.027 ± 0.179
3.286ValTyr: 3.286 ± 0.309
0.0ValXaa: 0.0 ± 0.0
Trp
0.719TrpAla: 0.719 ± 0.138
0.385TrpCys: 0.385 ± 0.107
1.053TrpAsp: 1.053 ± 0.188
1.232TrpGlu: 1.232 ± 0.173
1.053TrpPhe: 1.053 ± 0.199
1.078TrpGly: 1.078 ± 0.165
0.103TrpHis: 0.103 ± 0.049
0.719TrpIle: 0.719 ± 0.147
1.053TrpLys: 1.053 ± 0.178
1.566TrpLeu: 1.566 ± 0.181
0.513TrpMet: 0.513 ± 0.127
0.693TrpAsn: 0.693 ± 0.146
0.154TrpPro: 0.154 ± 0.075
0.411TrpGln: 0.411 ± 0.11
0.873TrpArg: 0.873 ± 0.159
0.899TrpSer: 0.899 ± 0.147
0.745TrpThr: 0.745 ± 0.146
1.72TrpVal: 1.72 ± 0.223
0.308TrpTrp: 0.308 ± 0.083
0.642TrpTyr: 0.642 ± 0.122
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.157TyrAla: 2.157 ± 0.262
0.847TyrCys: 0.847 ± 0.147
2.85TyrAsp: 2.85 ± 0.284
2.67TyrGlu: 2.67 ± 0.289
1.823TyrPhe: 1.823 ± 0.187
2.465TyrGly: 2.465 ± 0.221
0.796TyrHis: 0.796 ± 0.123
2.747TyrIle: 2.747 ± 0.285
2.85TyrLys: 2.85 ± 0.278
3.979TyrLeu: 3.979 ± 0.293
1.181TyrMet: 1.181 ± 0.163
2.644TyrAsn: 2.644 ± 0.274
0.822TyrPro: 0.822 ± 0.14
1.181TyrGln: 1.181 ± 0.172
1.771TyrArg: 1.771 ± 0.199
3.235TyrSer: 3.235 ± 0.316
2.927TyrThr: 2.927 ± 0.31
2.542TyrVal: 2.542 ± 0.282
0.693TyrTrp: 0.693 ± 0.148
2.105TyrTyr: 2.105 ± 0.223
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 235 proteins (38953 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski