Amino acid dipeptide frequency for Gordonia phage TimTam

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
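As a rough illustration of the calculation, here is a minimal Python sketch that counts overlapping residue pairs in a set of protein sequences and reports them in per mille. The function name `dipeptide_permille`, the toy sequences, the use of one-letter codes and the choice to normalize by the total number of dipeptides are all assumptions made for this example; the page does not state how Proteome-pI computes its values.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to 'X' (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs within one protein: positions (0,1), (1,2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalize by the total number of dipeptides counted.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy input, not taken from the TimTam proteome.
toy = ["MAAAK", "MAGK"]
freqs = dipeptide_permille(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

Pairs are never counted across protein boundaries in this sketch, since each sequence is scanned separately.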

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely in proteins, each would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
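The comparison against the uniform expectation can be sketched in a few lines of Python. The helper names `uniform_expectation_permille` and `classify` are hypothetical, and the rule used here (a plain comparison with 1000/400 = 2.5‰) is an assumption; the page does not say which threshold it uses for the red and blue marking.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected per-mille frequency if every dipeptide were equally likely."""
    return 1000.0 / alphabet_size ** 2

def classify(freq_permille, expected=None):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if expected is None:
        expected = uniform_expectation_permille()
    if freq_permille > expected:
        return "over"    # shown in red on the page
    if freq_permille < expected:
        return "under"   # shown in blue on the page
    return "as expected"

print(uniform_expectation_permille())    # 20 amino acids, 400 dipeptides
print(uniform_expectation_permille(21))  # with Xaa, 441 dipeptides
print(classify(18.503))                  # AlaAla value from the table
print(classify(0.116))                   # CysCys value from the table
```

A stricter comparison would use the product of the two single amino acid frequencies as the expectation instead of a flat 2.5‰, but that goes beyond what the page describes.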

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions; for example, 18.503‰ corresponds to a fraction of 0.018503 (about 1.85%).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 18.503 ± 2.22
AlaCys: 1.164 ± 0.339
AlaAsp: 8.67 ± 0.697
AlaGlu: 7.739 ± 0.687
AlaPhe: 2.502 ± 0.458
AlaGly: 9.31 ± 0.699
AlaHis: 1.687 ± 0.335
AlaIle: 4.771 ± 0.644
AlaLys: 3.899 ± 0.615
AlaLeu: 8.263 ± 0.824
AlaMet: 3.491 ± 0.618
AlaAsn: 3.375 ± 0.429
AlaPro: 5.004 ± 0.376
AlaGln: 4.364 ± 0.707
AlaArg: 7.972 ± 0.727
AlaSer: 6.691 ± 0.885
AlaThr: 8.03 ± 1.066
AlaVal: 9.252 ± 0.777
AlaTrp: 2.037 ± 0.283
AlaTyr: 2.56 ± 0.317
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.931 ± 0.216
CysCys: 0.116 ± 0.077
CysAsp: 0.756 ± 0.272
CysGlu: 0.524 ± 0.19
CysPhe: 0.233 ± 0.119
CysGly: 1.164 ± 0.339
CysHis: 0.175 ± 0.112
CysIle: 0.116 ± 0.098
CysLys: 0.058 ± 0.07
CysLeu: 0.233 ± 0.112
CysMet: 0.175 ± 0.099
CysAsn: 0.349 ± 0.151
CysPro: 0.407 ± 0.199
CysGln: 0.291 ± 0.156
CysArg: 0.815 ± 0.244
CysSer: 0.524 ± 0.176
CysThr: 0.524 ± 0.194
CysVal: 0.349 ± 0.134
CysTrp: 0.291 ± 0.144
CysTyr: 0.233 ± 0.124
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.041 ± 0.872
AspCys: 0.524 ± 0.23
AspAsp: 6.342 ± 0.921
AspGlu: 4.422 ± 0.516
AspPhe: 1.978 ± 0.385
AspGly: 6.459 ± 0.744
AspHis: 2.037 ± 0.352
AspIle: 2.56 ± 0.362
AspLys: 1.746 ± 0.338
AspLeu: 6.808 ± 0.583
AspMet: 1.106 ± 0.266
AspAsn: 2.386 ± 0.407
AspPro: 5.295 ± 0.525
AspGln: 2.56 ± 0.427
AspArg: 5.295 ± 0.618
AspSer: 2.735 ± 0.408
AspThr: 3.433 ± 0.53
AspVal: 4.597 ± 0.457
AspTrp: 1.455 ± 0.312
AspTyr: 1.28 ± 0.303
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.041 ± 0.794
GluCys: 0.582 ± 0.2
GluAsp: 2.909 ± 0.457
GluGlu: 3.2 ± 0.519
GluPhe: 1.978 ± 0.349
GluGly: 4.015 ± 0.55
GluHis: 1.047 ± 0.281
GluIle: 2.968 ± 0.526
GluLys: 2.153 ± 0.345
GluLeu: 4.713 ± 0.589
GluMet: 0.873 ± 0.196
GluAsn: 1.455 ± 0.313
GluPro: 2.851 ± 0.47
GluGln: 2.851 ± 0.462
GluArg: 4.946 ± 0.628
GluSer: 3.026 ± 0.435
GluThr: 3.724 ± 0.546
GluVal: 4.597 ± 0.539
GluTrp: 1.106 ± 0.269
GluTyr: 0.931 ± 0.231
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.968 ± 0.337
PheCys: 0.175 ± 0.088
PheAsp: 2.444 ± 0.358
PheGlu: 1.629 ± 0.275
PhePhe: 0.756 ± 0.189
PheGly: 2.851 ± 0.378
PheHis: 0.465 ± 0.164
PheIle: 0.873 ± 0.227
PheLys: 1.106 ± 0.273
PheLeu: 1.047 ± 0.274
PheMet: 0.407 ± 0.153
PheAsn: 1.047 ± 0.274
PhePro: 1.396 ± 0.241
PheGln: 0.582 ± 0.179
PheArg: 2.211 ± 0.455
PheSer: 1.455 ± 0.324
PheThr: 2.095 ± 0.332
PheVal: 1.513 ± 0.292
PheTrp: 1.047 ± 0.231
PheTyr: 0.465 ± 0.172
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.146 ± 0.806
GlyCys: 0.815 ± 0.336
GlyAsp: 4.946 ± 0.466
GlyGlu: 4.48 ± 0.442
GlyPhe: 2.153 ± 0.316
GlyGly: 8.204 ± 1.177
GlyHis: 1.396 ± 0.279
GlyIle: 3.782 ± 0.402
GlyLys: 3.258 ± 0.529
GlyLeu: 6.284 ± 0.693
GlyMet: 2.269 ± 0.396
GlyAsn: 3.258 ± 0.403
GlyPro: 3.84 ± 0.465
GlyGln: 3.899 ± 0.532
GlyArg: 6.517 ± 0.644
GlySer: 4.771 ± 0.635
GlyThr: 7.041 ± 1.212
GlyVal: 6.051 ± 0.635
GlyTrp: 1.804 ± 0.372
GlyTyr: 1.92 ± 0.259
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.978 ± 0.299
HisCys: 0.116 ± 0.086
HisAsp: 1.455 ± 0.355
HisGlu: 1.338 ± 0.396
HisPhe: 0.524 ± 0.166
HisGly: 1.338 ± 0.23
HisHis: 0.756 ± 0.243
HisIle: 0.931 ± 0.219
HisLys: 0.582 ± 0.19
HisLeu: 1.687 ± 0.469
HisMet: 0.175 ± 0.095
HisAsn: 0.291 ± 0.131
HisPro: 1.455 ± 0.324
HisGln: 0.815 ± 0.22
HisArg: 1.571 ± 0.282
HisSer: 0.64 ± 0.172
HisThr: 1.338 ± 0.276
HisVal: 1.338 ± 0.311
HisTrp: 0.465 ± 0.155
HisTyr: 0.989 ± 0.244
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.586 ± 0.615
IleCys: 0.407 ± 0.187
IleAsp: 4.364 ± 0.364
IleGlu: 3.84 ± 0.567
IlePhe: 0.64 ± 0.171
IleGly: 4.306 ± 0.497
IleHis: 1.164 ± 0.304
IleIle: 1.455 ± 0.276
IleLys: 0.989 ± 0.233
IleLeu: 1.92 ± 0.338
IleMet: 0.291 ± 0.112
IleAsn: 1.164 ± 0.282
IlePro: 2.851 ± 0.354
IleGln: 2.153 ± 0.41
IleArg: 3.899 ± 0.438
IleSer: 2.386 ± 0.339
IleThr: 2.677 ± 0.512
IleVal: 3.433 ± 0.459
IleTrp: 0.349 ± 0.135
IleTyr: 0.989 ± 0.211
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.491 ± 0.746
LysCys: 0.116 ± 0.088
LysAsp: 1.455 ± 0.308
LysGlu: 1.571 ± 0.367
LysPhe: 0.931 ± 0.239
LysGly: 2.386 ± 0.418
LysHis: 0.756 ± 0.186
LysIle: 1.862 ± 0.401
LysLys: 0.989 ± 0.304
LysLeu: 2.851 ± 0.397
LysMet: 0.349 ± 0.163
LysAsn: 0.873 ± 0.262
LysPro: 2.444 ± 0.386
LysGln: 1.164 ± 0.309
LysArg: 2.269 ± 0.412
LysSer: 1.687 ± 0.298
LysThr: 2.386 ± 0.392
LysVal: 2.386 ± 0.319
LysTrp: 0.989 ± 0.22
LysTyr: 0.873 ± 0.216
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.765 ± 0.733
LeuCys: 0.756 ± 0.234
LeuAsp: 6.459 ± 0.762
LeuGlu: 4.073 ± 0.584
LeuPhe: 2.502 ± 0.52
LeuGly: 5.644 ± 0.459
LeuHis: 1.338 ± 0.431
LeuIle: 3.899 ± 0.459
LeuLys: 2.095 ± 0.401
LeuLeu: 4.655 ± 0.475
LeuMet: 1.338 ± 0.255
LeuAsn: 2.153 ± 0.417
LeuPro: 4.364 ± 0.615
LeuGln: 2.269 ± 0.288
LeuArg: 6.342 ± 0.769
LeuSer: 4.015 ± 0.477
LeuThr: 5.237 ± 0.668
LeuVal: 5.295 ± 0.837
LeuTrp: 1.455 ± 0.289
LeuTyr: 1.047 ± 0.254
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.269 ± 0.541
MetCys: 0.233 ± 0.112
MetAsp: 0.291 ± 0.132
MetGlu: 0.582 ± 0.192
MetPhe: 0.291 ± 0.138
MetGly: 1.804 ± 0.411
MetHis: 0.524 ± 0.165
MetIle: 0.873 ± 0.204
MetLys: 0.989 ± 0.258
MetLeu: 1.28 ± 0.284
MetMet: 0.407 ± 0.155
MetAsn: 0.756 ± 0.225
MetPro: 1.222 ± 0.229
MetGln: 0.756 ± 0.18
MetArg: 2.037 ± 0.367
MetSer: 1.455 ± 0.3
MetThr: 3.142 ± 0.451
MetVal: 1.164 ± 0.274
MetTrp: 0.465 ± 0.176
MetTyr: 0.291 ± 0.139
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.84 ± 0.397
AsnCys: 0.058 ± 0.058
AsnAsp: 2.211 ± 0.305
AsnGlu: 0.756 ± 0.246
AsnPhe: 0.349 ± 0.142
AsnGly: 2.618 ± 0.39
AsnHis: 0.931 ± 0.26
AsnIle: 0.815 ± 0.249
AsnLys: 0.698 ± 0.229
AsnLeu: 2.269 ± 0.283
AsnMet: 0.407 ± 0.125
AsnAsn: 0.931 ± 0.263
AsnPro: 2.386 ± 0.382
AsnGln: 0.815 ± 0.214
AsnArg: 2.502 ± 0.354
AsnSer: 1.92 ± 0.249
AsnThr: 2.502 ± 0.533
AsnVal: 2.037 ± 0.346
AsnTrp: 0.524 ± 0.155
AsnTyr: 0.931 ± 0.255
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.75 ± 0.629
ProCys: 0.524 ± 0.237
ProAsp: 4.946 ± 0.695
ProGlu: 3.724 ± 0.469
ProPhe: 1.571 ± 0.308
ProGly: 6.517 ± 0.629
ProHis: 0.931 ± 0.238
ProIle: 2.56 ± 0.331
ProLys: 2.037 ± 0.411
ProLeu: 3.608 ± 0.492
ProMet: 1.513 ± 0.324
ProAsn: 1.687 ± 0.287
ProPro: 2.735 ± 0.531
ProGln: 1.396 ± 0.292
ProArg: 3.666 ± 0.575
ProSer: 2.793 ± 0.401
ProThr: 3.84 ± 0.466
ProVal: 3.84 ± 0.395
ProTrp: 1.164 ± 0.192
ProTyr: 0.931 ± 0.233
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.073 ± 0.693
GlnCys: 0.349 ± 0.152
GlnAsp: 1.455 ± 0.326
GlnGlu: 1.571 ± 0.312
GlnPhe: 0.931 ± 0.188
GlnGly: 2.56 ± 0.639
GlnHis: 0.989 ± 0.257
GlnIle: 2.327 ± 0.384
GlnLys: 1.28 ± 0.353
GlnLeu: 4.539 ± 0.497
GlnMet: 1.455 ± 0.232
GlnAsn: 0.873 ± 0.236
GlnPro: 2.037 ± 0.306
GlnGln: 1.513 ± 0.319
GlnArg: 3.608 ± 0.481
GlnSer: 1.338 ± 0.36
GlnThr: 1.687 ± 0.424
GlnVal: 2.735 ± 0.444
GlnTrp: 0.873 ± 0.212
GlnTyr: 0.407 ± 0.126
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.972 ± 0.724
ArgCys: 0.698 ± 0.289
ArgAsp: 4.946 ± 0.515
ArgGlu: 5.004 ± 0.577
ArgPhe: 2.153 ± 0.334
ArgGly: 4.713 ± 0.531
ArgHis: 1.164 ± 0.288
ArgIle: 3.724 ± 0.549
ArgLys: 3.491 ± 0.482
ArgLeu: 7.448 ± 0.771
ArgMet: 1.978 ± 0.375
ArgAsn: 1.978 ± 0.403
ArgPro: 4.539 ± 0.719
ArgGln: 2.677 ± 0.452
ArgArg: 6.691 ± 1.02
ArgSer: 4.306 ± 0.56
ArgThr: 5.12 ± 0.605
ArgVal: 5.993 ± 0.693
ArgTrp: 1.455 ± 0.337
ArgTyr: 2.56 ± 0.37
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.644 ± 0.822
SerCys: 0.233 ± 0.117
SerAsp: 3.782 ± 0.539
SerGlu: 3.2 ± 0.378
SerPhe: 1.396 ± 0.256
SerGly: 5.644 ± 0.567
SerHis: 0.698 ± 0.211
SerIle: 2.793 ± 0.473
SerLys: 1.92 ± 0.331
SerLeu: 4.015 ± 0.405
SerMet: 1.164 ± 0.35
SerAsn: 2.153 ± 0.362
SerPro: 2.327 ± 0.428
SerGln: 2.269 ± 0.463
SerArg: 3.724 ± 0.54
SerSer: 2.211 ± 0.356
SerThr: 3.782 ± 0.6
SerVal: 4.189 ± 0.566
SerTrp: 1.396 ± 0.309
SerTyr: 1.28 ± 0.267
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 9.834 ± 1.411
ThrCys: 0.465 ± 0.168
ThrAsp: 4.655 ± 0.497
ThrGlu: 2.793 ± 0.47
ThrPhe: 1.687 ± 0.301
ThrGly: 6.633 ± 0.762
ThrHis: 1.164 ± 0.241
ThrIle: 3.84 ± 0.499
ThrLys: 1.513 ± 0.322
ThrLeu: 5.179 ± 0.557
ThrMet: 1.513 ± 0.278
ThrAsn: 1.396 ± 0.271
ThrPro: 4.946 ± 0.643
ThrGln: 1.862 ± 0.351
ThrArg: 4.946 ± 0.46
ThrSer: 5.062 ± 0.76
ThrThr: 5.819 ± 0.666
ThrVal: 5.935 ± 0.53
ThrTrp: 1.106 ± 0.258
ThrTyr: 1.455 ± 0.304
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.263 ± 0.631
ValCys: 0.64 ± 0.217
ValAsp: 5.702 ± 0.656
ValGlu: 4.189 ± 0.394
ValPhe: 2.618 ± 0.367
ValGly: 5.179 ± 0.761
ValHis: 1.396 ± 0.28
ValIle: 3.782 ± 0.377
ValLys: 1.92 ± 0.414
ValLeu: 5.761 ± 0.899
ValMet: 0.989 ± 0.191
ValAsn: 2.269 ± 0.418
ValPro: 4.189 ± 0.513
ValGln: 2.677 ± 0.518
ValArg: 5.295 ± 0.589
ValSer: 4.364 ± 0.683
ValThr: 6.517 ± 0.609
ValVal: 5.586 ± 0.582
ValTrp: 0.989 ± 0.278
ValTyr: 1.047 ± 0.175
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.92 ± 0.332
TrpCys: 0.116 ± 0.082
TrpAsp: 0.989 ± 0.238
TrpGlu: 1.28 ± 0.317
TrpPhe: 0.873 ± 0.214
TrpGly: 1.047 ± 0.258
TrpHis: 0.756 ± 0.255
TrpIle: 0.465 ± 0.147
TrpLys: 0.465 ± 0.172
TrpLeu: 1.687 ± 0.361
TrpMet: 0.407 ± 0.155
TrpAsn: 0.64 ± 0.161
TrpPro: 1.106 ± 0.214
TrpGln: 0.931 ± 0.234
TrpArg: 2.502 ± 0.436
TrpSer: 0.873 ± 0.231
TrpThr: 1.396 ± 0.295
TrpVal: 1.455 ± 0.281
TrpTrp: 0.233 ± 0.121
TrpTyr: 0.407 ± 0.128
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.269 ± 0.377
TyrCys: 0.175 ± 0.11
TyrAsp: 1.28 ± 0.309
TyrGlu: 1.047 ± 0.353
TyrPhe: 0.465 ± 0.152
TyrGly: 2.444 ± 0.352
TyrHis: 0.407 ± 0.163
TyrIle: 0.465 ± 0.176
TyrLys: 0.64 ± 0.209
TyrLeu: 1.455 ± 0.352
TyrMet: 0.465 ± 0.177
TyrAsn: 0.465 ± 0.146
TyrPro: 1.338 ± 0.275
TyrGln: 0.756 ± 0.214
TyrArg: 1.862 ± 0.382
TyrSer: 1.571 ± 0.236
TyrThr: 1.513 ± 0.265
TyrVal: 1.629 ± 0.254
TyrTrp: 0.349 ± 0.14
TyrTyr: 0.465 ± 0.163
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 82 proteins (17187 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
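To make the note concrete, here is a minimal Python sketch of a protein-level bootstrap: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is taken as the error. The function names, the toy sequences, the fixed seed and the use of the sample standard deviation are assumptions for illustration; the page only states that bootstrapping was run 100 times at the protein level.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to keep the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)

# Hypothetical toy proteome, written in one-letter code.
toy_proteome = ["MAAAK", "MAGK", "MKKAA", "MGGA"]
print(pair_permille(toy_proteome, "AA"), "±", bootstrap_error(toy_proteome, "AA"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the error reflects how much the estimate depends on which proteins happen to be in the proteome.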

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski