Amino acid dipeptide frequency for Pseudomonas phage JG004

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
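The counting and the comparison with the uniform expectation can be sketched as follows. This is a minimal illustration, not the database's own code: the function names, the toy sequences, and the choice to normalise by the total number of counted dipeptides are assumptions.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs inside one protein; pairs never span two proteins.
        for a, b in zip(seq, seq[1:]):
            counts[a + b] += 1
    total = sum(counts.values())
    # Assumption: normalise by the number of dipeptides counted.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation: 1 / (20 * 20) = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / len(STANDARD) ** 2

def classify(freq_permille):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"   # would be marked red in the table
    if freq_permille < EXPECTED_PERMILLE:
        return "under"  # would be marked blue in the table
    return "expected"

toy = ["MAAKL", "MKAA"]  # made-up sequences, for illustration only
freqs = dipeptide_permille(toy)
```

With the table values, `classify(6.767)` (AlaAla) gives `"over"` and `classify(0.294)` (CysCys) gives `"under"`.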

All values are presented in per mille (‰); to obtain a fraction, multiply them by 10^-3.
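As a small worked example of the unit conversion, the snippet below turns the AlaAla value from the table into a fraction and compares it with the uniform expectation of 1/400. The helper name is hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction."""
    return value_permille * 1e-3

ala_ala = permille_to_fraction(6.767)  # AlaAla value from the table
uniform = 1 / 400                      # 0.25% expressed as a fraction
ratio = ala_ala / uniform              # enrichment over the uniform expectation
```

Here `ala_ala` is about 0.006767 and `ratio` is about 2.7, i.e. AlaAla is roughly 2.7 times more frequent than a uniform distribution over 400 dipeptides would give.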

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
6.767AlaAla: 6.767 ± 0.638
1.324AlaCys: 1.324 ± 0.227
4.156AlaAsp: 4.156 ± 0.42
5.554AlaGlu: 5.554 ± 0.457
3.089AlaPhe: 3.089 ± 0.371
5.517AlaGly: 5.517 ± 0.503
1.912AlaHis: 1.912 ± 0.281
5.37AlaIle: 5.37 ± 0.396
4.892AlaLys: 4.892 ± 0.493
6.51AlaLeu: 6.51 ± 0.564
3.053AlaMet: 3.053 ± 0.306
2.979AlaAsn: 2.979 ± 0.38
2.133AlaPro: 2.133 ± 0.269
3.126AlaGln: 3.126 ± 0.425
5.002AlaArg: 5.002 ± 0.431
4.303AlaSer: 4.303 ± 0.405
5.259AlaThr: 5.259 ± 0.596
5.517AlaVal: 5.517 ± 0.462
1.581AlaTrp: 1.581 ± 0.224
3.2AlaTyr: 3.2 ± 0.316
0.0AlaXaa: 0.0 ± 0.0
Cys
0.956CysAla: 0.956 ± 0.225
0.294CysCys: 0.294 ± 0.105
0.772CysAsp: 0.772 ± 0.155
1.25CysGlu: 1.25 ± 0.217
0.441CysPhe: 0.441 ± 0.11
1.25CysGly: 1.25 ± 0.22
0.257CysHis: 0.257 ± 0.095
0.919CysIle: 0.919 ± 0.175
0.846CysLys: 0.846 ± 0.175
0.736CysLeu: 0.736 ± 0.159
0.478CysMet: 0.478 ± 0.122
0.956CysAsn: 0.956 ± 0.205
0.515CysPro: 0.515 ± 0.168
0.515CysGln: 0.515 ± 0.134
0.883CysArg: 0.883 ± 0.212
0.809CysSer: 0.809 ± 0.165
0.294CysThr: 0.294 ± 0.113
0.846CysVal: 0.846 ± 0.208
0.368CysTrp: 0.368 ± 0.101
0.662CysTyr: 0.662 ± 0.157
0.0CysXaa: 0.0 ± 0.0
Asp
4.487AspAla: 4.487 ± 0.384
0.993AspCys: 0.993 ± 0.199
3.126AspAsp: 3.126 ± 0.321
3.678AspGlu: 3.678 ± 0.373
2.722AspPhe: 2.722 ± 0.345
4.634AspGly: 4.634 ± 0.39
1.398AspHis: 1.398 ± 0.215
4.009AspIle: 4.009 ± 0.449
3.2AspLys: 3.2 ± 0.354
4.781AspLeu: 4.781 ± 0.371
1.876AspMet: 1.876 ± 0.252
2.574AspAsn: 2.574 ± 0.281
2.942AspPro: 2.942 ± 0.353
1.912AspGln: 1.912 ± 0.303
3.678AspArg: 3.678 ± 0.405
3.126AspSer: 3.126 ± 0.365
3.053AspThr: 3.053 ± 0.396
3.788AspVal: 3.788 ± 0.334
1.324AspTrp: 1.324 ± 0.246
2.538AspTyr: 2.538 ± 0.277
0.0AspXaa: 0.0 ± 0.0
Glu
7.172GluAla: 7.172 ± 0.463
1.067GluCys: 1.067 ± 0.166
5.112GluAsp: 5.112 ± 0.422
7.319GluGlu: 7.319 ± 0.55
3.126GluPhe: 3.126 ± 0.378
5.333GluGly: 5.333 ± 0.455
1.324GluHis: 1.324 ± 0.254
3.898GluIle: 3.898 ± 0.352
3.788GluLys: 3.788 ± 0.47
6.399GluLeu: 6.399 ± 0.521
1.949GluMet: 1.949 ± 0.301
2.685GluAsn: 2.685 ± 0.368
2.133GluPro: 2.133 ± 0.31
3.016GluGln: 3.016 ± 0.315
4.082GluArg: 4.082 ± 0.402
3.567GluSer: 3.567 ± 0.336
3.604GluThr: 3.604 ± 0.298
5.554GluVal: 5.554 ± 0.467
1.729GluTrp: 1.729 ± 0.25
3.163GluTyr: 3.163 ± 0.382
0.0GluXaa: 0.0 ± 0.0
Phe
2.869PheAla: 2.869 ± 0.267
0.588PheCys: 0.588 ± 0.16
2.574PheAsp: 2.574 ± 0.355
3.163PheGlu: 3.163 ± 0.311
1.655PhePhe: 1.655 ± 0.269
2.722PheGly: 2.722 ± 0.262
0.846PheHis: 0.846 ± 0.182
1.839PheIle: 1.839 ± 0.252
3.31PheLys: 3.31 ± 0.352
2.905PheLeu: 2.905 ± 0.312
1.214PheMet: 1.214 ± 0.177
2.207PheAsn: 2.207 ± 0.301
1.581PhePro: 1.581 ± 0.193
1.581PheGln: 1.581 ± 0.24
2.023PheArg: 2.023 ± 0.237
2.501PheSer: 2.501 ± 0.281
2.427PheThr: 2.427 ± 0.326
2.648PheVal: 2.648 ± 0.282
0.846PheTrp: 0.846 ± 0.184
1.287PheTyr: 1.287 ± 0.206
0.0PheXaa: 0.0 ± 0.0
Gly
4.303GlyAla: 4.303 ± 0.413
1.25GlyCys: 1.25 ± 0.229
4.193GlyAsp: 4.193 ± 0.404
5.37GlyGlu: 5.37 ± 0.493
3.531GlyPhe: 3.531 ± 0.359
5.149GlyGly: 5.149 ± 0.546
1.287GlyHis: 1.287 ± 0.234
3.604GlyIle: 3.604 ± 0.413
4.413GlyLys: 4.413 ± 0.502
5.186GlyLeu: 5.186 ± 0.481
2.096GlyMet: 2.096 ± 0.242
3.163GlyAsn: 3.163 ± 0.393
1.618GlyPro: 1.618 ± 0.261
2.758GlyGln: 2.758 ± 0.342
4.046GlyArg: 4.046 ± 0.409
4.892GlySer: 4.892 ± 0.401
3.678GlyThr: 3.678 ± 0.372
5.664GlyVal: 5.664 ± 0.439
1.361GlyTrp: 1.361 ± 0.205
3.31GlyTyr: 3.31 ± 0.4
0.0GlyXaa: 0.0 ± 0.0
His
1.692HisAla: 1.692 ± 0.222
0.368HisCys: 0.368 ± 0.106
1.214HisAsp: 1.214 ± 0.205
0.772HisGlu: 0.772 ± 0.183
1.03HisPhe: 1.03 ± 0.239
1.398HisGly: 1.398 ± 0.227
0.405HisHis: 0.405 ± 0.144
1.214HisIle: 1.214 ± 0.251
1.14HisLys: 1.14 ± 0.251
1.655HisLeu: 1.655 ± 0.283
0.588HisMet: 0.588 ± 0.133
0.846HisAsn: 0.846 ± 0.216
0.809HisPro: 0.809 ± 0.157
0.625HisGln: 0.625 ± 0.137
1.214HisArg: 1.214 ± 0.21
1.067HisSer: 1.067 ± 0.202
1.25HisThr: 1.25 ± 0.218
1.103HisVal: 1.103 ± 0.193
0.441HisTrp: 0.441 ± 0.142
0.699HisTyr: 0.699 ± 0.175
0.0HisXaa: 0.0 ± 0.0
Ile
4.156IleAla: 4.156 ± 0.386
0.478IleCys: 0.478 ± 0.12
4.193IleAsp: 4.193 ± 0.431
3.862IleGlu: 3.862 ± 0.399
1.581IlePhe: 1.581 ± 0.236
3.825IleGly: 3.825 ± 0.375
1.471IleHis: 1.471 ± 0.273
2.648IleIle: 2.648 ± 0.321
3.788IleLys: 3.788 ± 0.382
4.193IleLeu: 4.193 ± 0.379
1.581IleMet: 1.581 ± 0.258
2.391IleAsn: 2.391 ± 0.305
2.391IlePro: 2.391 ± 0.292
2.06IleGln: 2.06 ± 0.247
3.788IleArg: 3.788 ± 0.408
3.715IleSer: 3.715 ± 0.369
2.28IleThr: 2.28 ± 0.299
3.862IleVal: 3.862 ± 0.353
0.809IleTrp: 0.809 ± 0.207
1.398IleTyr: 1.398 ± 0.216
0.0IleXaa: 0.0 ± 0.0
Lys
6.032LysAla: 6.032 ± 0.494
0.662LysCys: 0.662 ± 0.159
4.082LysAsp: 4.082 ± 0.432
5.627LysGlu: 5.627 ± 0.554
1.802LysPhe: 1.802 ± 0.265
4.524LysGly: 4.524 ± 0.397
1.14LysHis: 1.14 ± 0.265
3.457LysIle: 3.457 ± 0.332
3.42LysLys: 3.42 ± 0.394
4.855LysLeu: 4.855 ± 0.393
1.986LysMet: 1.986 ± 0.241
2.28LysAsn: 2.28 ± 0.327
1.986LysPro: 1.986 ± 0.319
1.912LysGln: 1.912 ± 0.252
3.384LysArg: 3.384 ± 0.43
3.567LysSer: 3.567 ± 0.357
3.089LysThr: 3.089 ± 0.329
3.935LysVal: 3.935 ± 0.312
1.067LysTrp: 1.067 ± 0.212
2.207LysTyr: 2.207 ± 0.271
0.0LysXaa: 0.0 ± 0.0
Leu
6.068LeuAla: 6.068 ± 0.483
1.03LeuCys: 1.03 ± 0.216
5.737LeuAsp: 5.737 ± 0.455
5.921LeuGlu: 5.921 ± 0.498
2.317LeuPhe: 2.317 ± 0.342
5.737LeuGly: 5.737 ± 0.508
1.729LeuHis: 1.729 ± 0.266
4.156LeuIle: 4.156 ± 0.396
5.443LeuLys: 5.443 ± 0.468
5.664LeuLeu: 5.664 ± 0.545
2.28LeuMet: 2.28 ± 0.238
2.869LeuAsn: 2.869 ± 0.377
4.009LeuPro: 4.009 ± 0.369
2.832LeuGln: 2.832 ± 0.283
4.561LeuArg: 4.561 ± 0.391
5.37LeuSer: 5.37 ± 0.47
4.561LeuThr: 4.561 ± 0.46
4.634LeuVal: 4.634 ± 0.401
1.287LeuTrp: 1.287 ± 0.234
3.053LeuTyr: 3.053 ± 0.314
0.0LeuXaa: 0.0 ± 0.0
Met
3.531MetAla: 3.531 ± 0.355
0.478MetCys: 0.478 ± 0.146
1.434MetAsp: 1.434 ± 0.23
3.126MetGlu: 3.126 ± 0.309
1.14MetPhe: 1.14 ± 0.17
1.839MetGly: 1.839 ± 0.273
0.221MetHis: 0.221 ± 0.095
1.14MetIle: 1.14 ± 0.195
1.912MetLys: 1.912 ± 0.262
1.986MetLeu: 1.986 ± 0.238
1.214MetMet: 1.214 ± 0.245
1.471MetAsn: 1.471 ± 0.242
0.956MetPro: 0.956 ± 0.201
1.581MetGln: 1.581 ± 0.259
1.581MetArg: 1.581 ± 0.215
1.949MetSer: 1.949 ± 0.222
2.17MetThr: 2.17 ± 0.258
1.324MetVal: 1.324 ± 0.213
0.478MetTrp: 0.478 ± 0.115
0.993MetTyr: 0.993 ± 0.182
0.0MetXaa: 0.0 ± 0.0
Asn
3.126AsnAla: 3.126 ± 0.45
0.368AsnCys: 0.368 ± 0.127
2.243AsnAsp: 2.243 ± 0.26
2.317AsnGlu: 2.317 ± 0.26
1.655AsnPhe: 1.655 ± 0.18
3.567AsnGly: 3.567 ± 0.379
0.919AsnHis: 0.919 ± 0.201
2.538AsnIle: 2.538 ± 0.294
2.501AsnLys: 2.501 ± 0.304
3.898AsnLeu: 3.898 ± 0.482
0.993AsnMet: 0.993 ± 0.208
1.839AsnAsn: 1.839 ± 0.314
2.096AsnPro: 2.096 ± 0.295
1.14AsnGln: 1.14 ± 0.224
1.876AsnArg: 1.876 ± 0.248
2.905AsnSer: 2.905 ± 0.396
2.317AsnThr: 2.317 ± 0.292
3.163AsnVal: 3.163 ± 0.352
0.699AsnTrp: 0.699 ± 0.199
1.177AsnTyr: 1.177 ± 0.207
0.0AsnXaa: 0.0 ± 0.0
Pro
3.494ProAla: 3.494 ± 0.378
0.478ProCys: 0.478 ± 0.141
2.096ProAsp: 2.096 ± 0.36
3.898ProGlu: 3.898 ± 0.357
1.545ProPhe: 1.545 ± 0.214
2.574ProGly: 2.574 ± 0.323
0.956ProHis: 0.956 ± 0.193
1.912ProIle: 1.912 ± 0.274
1.802ProLys: 1.802 ± 0.281
2.648ProLeu: 2.648 ± 0.293
1.214ProMet: 1.214 ± 0.243
1.214ProAsn: 1.214 ± 0.26
1.287ProPro: 1.287 ± 0.289
1.067ProGln: 1.067 ± 0.212
1.729ProArg: 1.729 ± 0.235
2.133ProSer: 2.133 ± 0.251
2.427ProThr: 2.427 ± 0.309
3.457ProVal: 3.457 ± 0.317
0.405ProTrp: 0.405 ± 0.108
1.214ProTyr: 1.214 ± 0.24
0.0ProXaa: 0.0 ± 0.0
Gln
3.494GlnAla: 3.494 ± 0.292
0.588GlnCys: 0.588 ± 0.151
1.655GlnAsp: 1.655 ± 0.243
2.427GlnGlu: 2.427 ± 0.313
1.839GlnPhe: 1.839 ± 0.216
2.207GlnGly: 2.207 ± 0.321
0.772GlnHis: 0.772 ± 0.151
2.023GlnIle: 2.023 ± 0.308
1.508GlnLys: 1.508 ± 0.225
3.273GlnLeu: 3.273 ± 0.297
1.434GlnMet: 1.434 ± 0.251
1.177GlnAsn: 1.177 ± 0.152
1.03GlnPro: 1.03 ± 0.2
1.03GlnGln: 1.03 ± 0.199
2.17GlnArg: 2.17 ± 0.269
2.17GlnSer: 2.17 ± 0.29
1.839GlnThr: 1.839 ± 0.247
2.685GlnVal: 2.685 ± 0.269
0.736GlnTrp: 0.736 ± 0.144
1.508GlnTyr: 1.508 ± 0.206
0.0GlnXaa: 0.0 ± 0.0
Arg
4.45ArgAla: 4.45 ± 0.412
0.699ArgCys: 0.699 ± 0.151
3.126ArgAsp: 3.126 ± 0.319
4.266ArgGlu: 4.266 ± 0.352
2.354ArgPhe: 2.354 ± 0.332
3.751ArgGly: 3.751 ± 0.446
1.177ArgHis: 1.177 ± 0.188
2.832ArgIle: 2.832 ± 0.307
4.524ArgLys: 4.524 ± 0.373
5.039ArgLeu: 5.039 ± 0.439
1.471ArgMet: 1.471 ± 0.228
1.839ArgAsn: 1.839 ± 0.29
2.538ArgPro: 2.538 ± 0.314
2.28ArgGln: 2.28 ± 0.282
4.046ArgArg: 4.046 ± 0.525
3.236ArgSer: 3.236 ± 0.346
2.942ArgThr: 2.942 ± 0.282
4.377ArgVal: 4.377 ± 0.347
1.214ArgTrp: 1.214 ± 0.245
1.802ArgTyr: 1.802 ± 0.26
0.0ArgXaa: 0.0 ± 0.0
Ser
4.634SerAla: 4.634 ± 0.425
0.919SerCys: 0.919 ± 0.205
3.494SerAsp: 3.494 ± 0.367
3.972SerGlu: 3.972 ± 0.351
2.538SerPhe: 2.538 ± 0.248
4.082SerGly: 4.082 ± 0.458
0.699SerHis: 0.699 ± 0.163
2.905SerIle: 2.905 ± 0.305
3.273SerLys: 3.273 ± 0.303
5.186SerLeu: 5.186 ± 0.496
1.729SerMet: 1.729 ± 0.219
2.869SerAsn: 2.869 ± 0.401
2.464SerPro: 2.464 ± 0.277
1.802SerGln: 1.802 ± 0.249
3.825SerArg: 3.825 ± 0.408
3.089SerSer: 3.089 ± 0.4
3.126SerThr: 3.126 ± 0.305
5.149SerVal: 5.149 ± 0.427
0.956SerTrp: 0.956 ± 0.163
2.096SerTyr: 2.096 ± 0.22
0.0SerXaa: 0.0 ± 0.0
Thr
4.046ThrAla: 4.046 ± 0.474
0.772ThrCys: 0.772 ± 0.177
2.722ThrAsp: 2.722 ± 0.346
4.119ThrGlu: 4.119 ± 0.376
2.611ThrPhe: 2.611 ± 0.252
4.708ThrGly: 4.708 ± 0.47
1.103ThrHis: 1.103 ± 0.164
3.016ThrIle: 3.016 ± 0.251
2.758ThrLys: 2.758 ± 0.398
5.186ThrLeu: 5.186 ± 0.389
1.214ThrMet: 1.214 ± 0.235
2.243ThrAsn: 2.243 ± 0.312
2.096ThrPro: 2.096 ± 0.283
1.876ThrGln: 1.876 ± 0.301
2.942ThrArg: 2.942 ± 0.333
2.832ThrSer: 2.832 ± 0.35
2.832ThrThr: 2.832 ± 0.31
4.708ThrVal: 4.708 ± 0.398
0.846ThrTrp: 0.846 ± 0.175
2.023ThrTyr: 2.023 ± 0.276
0.0ThrXaa: 0.0 ± 0.0
Val
6.068ValAla: 6.068 ± 0.445
0.588ValCys: 0.588 ± 0.16
4.229ValAsp: 4.229 ± 0.398
5.296ValGlu: 5.296 ± 0.472
3.236ValPhe: 3.236 ± 0.334
4.229ValGly: 4.229 ± 0.379
1.067ValHis: 1.067 ± 0.205
3.862ValIle: 3.862 ± 0.354
4.928ValLys: 4.928 ± 0.41
5.48ValLeu: 5.48 ± 0.426
2.354ValMet: 2.354 ± 0.283
3.163ValAsn: 3.163 ± 0.344
2.905ValPro: 2.905 ± 0.373
2.317ValGln: 2.317 ± 0.223
4.046ValArg: 4.046 ± 0.297
4.266ValSer: 4.266 ± 0.402
4.193ValThr: 4.193 ± 0.317
5.701ValVal: 5.701 ± 0.485
1.214ValTrp: 1.214 ± 0.234
2.574ValTyr: 2.574 ± 0.3
0.0ValXaa: 0.0 ± 0.0
Trp
1.14TrpAla: 1.14 ± 0.208
0.441TrpCys: 0.441 ± 0.138
1.25TrpAsp: 1.25 ± 0.195
1.545TrpGlu: 1.545 ± 0.238
0.772TrpPhe: 0.772 ± 0.16
0.919TrpGly: 0.919 ± 0.19
0.478TrpHis: 0.478 ± 0.122
0.809TrpIle: 0.809 ± 0.165
1.214TrpLys: 1.214 ± 0.235
1.471TrpLeu: 1.471 ± 0.265
0.772TrpMet: 0.772 ± 0.163
0.809TrpAsn: 0.809 ± 0.162
0.625TrpPro: 0.625 ± 0.153
0.588TrpGln: 0.588 ± 0.159
1.14TrpArg: 1.14 ± 0.207
0.919TrpSer: 0.919 ± 0.191
1.03TrpThr: 1.03 ± 0.198
1.177TrpVal: 1.177 ± 0.256
0.257TrpTrp: 0.257 ± 0.095
0.662TrpTyr: 0.662 ± 0.149
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.758TyrAla: 2.758 ± 0.345
0.662TyrCys: 0.662 ± 0.155
2.427TyrAsp: 2.427 ± 0.283
2.538TyrGlu: 2.538 ± 0.359
1.802TyrPhe: 1.802 ± 0.229
2.905TyrGly: 2.905 ± 0.357
0.368TyrHis: 0.368 ± 0.114
2.317TyrIle: 2.317 ± 0.326
2.391TyrLys: 2.391 ± 0.311
2.207TyrLeu: 2.207 ± 0.264
0.993TyrMet: 0.993 ± 0.197
1.765TyrAsn: 1.765 ± 0.229
1.434TyrPro: 1.434 ± 0.233
1.545TyrGln: 1.545 ± 0.229
1.949TyrArg: 1.949 ± 0.234
2.391TyrSer: 2.391 ± 0.359
2.28TyrThr: 2.28 ± 0.317
2.427TyrVal: 2.427 ± 0.284
0.405TyrTrp: 0.405 ± 0.108
1.214TyrTyr: 1.214 ± 0.198
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 161 proteins (27191 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
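A protein-level bootstrap of this kind can be sketched as below. The exact procedure used for the table is not described here, so this is only one plausible reading: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the standard deviation across replicates is reported as the error. All names and the toy sequences are assumptions.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides within a single protein."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation (per mille) of one dipeptide's frequency
    across bootstrap replicates drawn at the protein level."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, same number as the input.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)

toy = ["MAAKL", "MKAA", "AAAA", "MKL"]  # made-up sequences
err_aa = bootstrap_error(toy, "AA")
err_ww = bootstrap_error(toy, "WW")     # pair absent from every protein
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, which matters if dipeptide usage differs between proteins.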

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski