Amino acid dipeptide frequency for Mycobacterium phage Ekdilam

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
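The counting described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build this page: the function name `dipeptide_permille`, the toy sequences, the use of overlapping pairs, and the normalization by the total number of dipeptides are all assumptions, since the page does not state how the frequencies were normalized.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code; anything else is treated as X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs.

    Assumption: overlapping pairs are counted within each protein and the
    counts are divided by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Hypothetical toy input, not taken from the phage proteome.
freqs = dipeptide_permille(["MAAAL", "MAGX"])

# Uniform expectation for the 400 standard dipeptides: 1/400 = 2.5 per mille.
expected = 1000.0 / 400
over = sorted(dp for dp, f in freqs.items() if f > expected)
print(over)
```

With real data, the list `over` would correspond to the cells marked in red and the remaining non-zero cells below 2.5‰ to those marked in blue, under the same uniform-expectation assumption.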

All values are given in per mille (‰) and must therefore be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
20.709AlaAla: 20.709 ± 1.658
1.101AlaCys: 1.101 ± 0.287
9.647AlaAsp: 9.647 ± 0.779
9.437AlaGlu: 9.437 ± 0.855
3.198AlaPhe: 3.198 ± 0.538
9.227AlaGly: 9.227 ± 1.158
2.517AlaHis: 2.517 ± 0.351
4.928AlaIle: 4.928 ± 0.651
3.88AlaLys: 3.88 ± 0.443
12.425AlaLeu: 12.425 ± 0.791
3.722AlaMet: 3.722 ± 0.509
2.936AlaAsn: 2.936 ± 0.552
6.606AlaPro: 6.606 ± 0.722
4.456AlaGln: 4.456 ± 0.676
9.175AlaArg: 9.175 ± 0.93
5.085AlaSer: 5.085 ± 0.681
6.763AlaThr: 6.763 ± 0.58
10.328AlaVal: 10.328 ± 0.833
2.307AlaTrp: 2.307 ± 0.306
2.726AlaTyr: 2.726 ± 0.486
0.0AlaXaa: 0.0 ± 0.0
Cys
1.101CysAla: 1.101 ± 0.284
0.157CysCys: 0.157 ± 0.084
0.629CysAsp: 0.629 ± 0.224
0.944CysGlu: 0.944 ± 0.272
0.21CysPhe: 0.21 ± 0.105
1.416CysGly: 1.416 ± 0.308
0.315CysHis: 0.315 ± 0.146
0.419CysIle: 0.419 ± 0.135
0.734CysLys: 0.734 ± 0.193
0.839CysLeu: 0.839 ± 0.2
0.105CysMet: 0.105 ± 0.065
0.315CysAsn: 0.315 ± 0.147
0.682CysPro: 0.682 ± 0.207
0.315CysGln: 0.315 ± 0.133
0.996CysArg: 0.996 ± 0.215
1.101CysSer: 1.101 ± 0.255
0.577CysThr: 0.577 ± 0.202
0.839CysVal: 0.839 ± 0.223
0.367CysTrp: 0.367 ± 0.146
0.052CysTyr: 0.052 ± 0.049
0.0CysXaa: 0.0 ± 0.0
Asp
9.175AspAla: 9.175 ± 0.758
0.472AspCys: 0.472 ± 0.207
6.658AspAsp: 6.658 ± 0.816
5.138AspGlu: 5.138 ± 0.682
1.258AspPhe: 1.258 ± 0.284
5.61AspGly: 5.61 ± 0.51
1.258AspHis: 1.258 ± 0.267
1.258AspIle: 1.258 ± 0.256
1.835AspLys: 1.835 ± 0.387
5.505AspLeu: 5.505 ± 0.673
1.783AspMet: 1.783 ± 0.291
1.94AspAsn: 1.94 ± 0.347
3.827AspPro: 3.827 ± 0.43
1.835AspGln: 1.835 ± 0.32
5.715AspArg: 5.715 ± 0.606
3.093AspSer: 3.093 ± 0.432
3.355AspThr: 3.355 ± 0.393
5.4AspVal: 5.4 ± 0.49
1.049AspTrp: 1.049 ± 0.214
1.363AspTyr: 1.363 ± 0.289
0.0AspXaa: 0.0 ± 0.0
Glu
8.703GluAla: 8.703 ± 0.894
0.839GluCys: 0.839 ± 0.202
2.517GluAsp: 2.517 ± 0.442
1.311GluGlu: 1.311 ± 0.316
1.416GluPhe: 1.416 ± 0.256
4.037GluGly: 4.037 ± 0.518
1.73GluHis: 1.73 ± 0.352
1.835GluIle: 1.835 ± 0.349
1.573GluLys: 1.573 ± 0.3
4.981GluLeu: 4.981 ± 0.493
0.996GluMet: 0.996 ± 0.242
1.101GluAsn: 1.101 ± 0.222
3.198GluPro: 3.198 ± 0.535
2.988GluGln: 2.988 ± 0.317
4.666GluArg: 4.666 ± 0.728
2.831GluSer: 2.831 ± 0.362
1.992GluThr: 1.992 ± 0.288
5.715GluVal: 5.715 ± 0.717
1.101GluTrp: 1.101 ± 0.279
1.783GluTyr: 1.783 ± 0.362
0.0GluXaa: 0.0 ± 0.0
Phe
2.674PheAla: 2.674 ± 0.445
0.472PheCys: 0.472 ± 0.17
3.198PheAsp: 3.198 ± 0.398
0.944PheGlu: 0.944 ± 0.229
0.577PhePhe: 0.577 ± 0.169
2.831PheGly: 2.831 ± 0.437
0.367PheHis: 0.367 ± 0.144
0.786PheIle: 0.786 ± 0.211
0.944PheLys: 0.944 ± 0.293
2.097PheLeu: 2.097 ± 0.306
0.577PheMet: 0.577 ± 0.192
0.682PheAsn: 0.682 ± 0.192
1.52PhePro: 1.52 ± 0.276
0.629PheGln: 0.629 ± 0.151
1.678PheArg: 1.678 ± 0.337
0.996PheSer: 0.996 ± 0.216
1.625PheThr: 1.625 ± 0.284
2.307PheVal: 2.307 ± 0.316
0.367PheTrp: 0.367 ± 0.116
0.419PheTyr: 0.419 ± 0.163
0.0PheXaa: 0.0 ± 0.0
Gly
9.804GlyAla: 9.804 ± 0.999
1.52GlyCys: 1.52 ± 0.3
5.085GlyAsp: 5.085 ± 0.474
4.561GlyGlu: 4.561 ± 0.379
2.412GlyPhe: 2.412 ± 0.357
9.437GlyGly: 9.437 ± 1.402
1.363GlyHis: 1.363 ± 0.327
2.464GlyIle: 2.464 ± 0.55
4.404GlyLys: 4.404 ± 0.517
7.392GlyLeu: 7.392 ± 0.754
1.363GlyMet: 1.363 ± 0.287
3.46GlyAsn: 3.46 ± 0.574
3.722GlyPro: 3.722 ± 0.541
2.464GlyGln: 2.464 ± 0.446
5.505GlyArg: 5.505 ± 0.6
5.715GlySer: 5.715 ± 0.666
5.085GlyThr: 5.085 ± 0.466
7.917GlyVal: 7.917 ± 0.621
2.517GlyTrp: 2.517 ± 0.357
2.097GlyTyr: 2.097 ± 0.357
0.0GlyXaa: 0.0 ± 0.0
His
1.992HisAla: 1.992 ± 0.402
0.472HisCys: 0.472 ± 0.146
1.363HisAsp: 1.363 ± 0.328
0.996HisGlu: 0.996 ± 0.243
0.839HisPhe: 0.839 ± 0.191
2.307HisGly: 2.307 ± 0.504
0.472HisHis: 0.472 ± 0.157
0.577HisIle: 0.577 ± 0.178
0.419HisLys: 0.419 ± 0.139
2.254HisLeu: 2.254 ± 0.37
0.21HisMet: 0.21 ± 0.09
0.734HisAsn: 0.734 ± 0.176
0.891HisPro: 0.891 ± 0.205
0.577HisGln: 0.577 ± 0.224
1.887HisArg: 1.887 ± 0.315
0.786HisSer: 0.786 ± 0.232
1.153HisThr: 1.153 ± 0.252
2.307HisVal: 2.307 ± 0.354
0.315HisTrp: 0.315 ± 0.116
0.839HisTyr: 0.839 ± 0.188
0.0HisXaa: 0.0 ± 0.0
Ile
5.348IleAla: 5.348 ± 0.49
0.21IleCys: 0.21 ± 0.112
3.355IleAsp: 3.355 ± 0.401
2.988IleGlu: 2.988 ± 0.496
0.786IlePhe: 0.786 ± 0.189
3.565IleGly: 3.565 ± 0.736
0.315IleHis: 0.315 ± 0.149
0.734IleIle: 0.734 ± 0.215
1.52IleLys: 1.52 ± 0.257
2.569IleLeu: 2.569 ± 0.415
0.472IleMet: 0.472 ± 0.212
1.258IleAsn: 1.258 ± 0.235
1.625IlePro: 1.625 ± 0.332
0.734IleGln: 0.734 ± 0.187
2.464IleArg: 2.464 ± 0.394
1.678IleSer: 1.678 ± 0.322
2.045IleThr: 2.045 ± 0.248
2.884IleVal: 2.884 ± 0.361
0.786IleTrp: 0.786 ± 0.217
0.315IleTyr: 0.315 ± 0.154
0.0IleXaa: 0.0 ± 0.0
Lys
4.666LysAla: 4.666 ± 0.695
0.577LysCys: 0.577 ± 0.175
1.363LysAsp: 1.363 ± 0.292
0.524LysGlu: 0.524 ± 0.164
0.944LysPhe: 0.944 ± 0.215
3.093LysGly: 3.093 ± 0.462
1.206LysHis: 1.206 ± 0.322
1.363LysIle: 1.363 ± 0.284
0.682LysLys: 0.682 ± 0.166
3.827LysLeu: 3.827 ± 0.425
0.996LysMet: 0.996 ± 0.191
0.577LysAsn: 0.577 ± 0.184
2.936LysPro: 2.936 ± 0.422
1.363LysGln: 1.363 ± 0.231
2.569LysArg: 2.569 ± 0.52
1.311LysSer: 1.311 ± 0.272
1.678LysThr: 1.678 ± 0.326
3.355LysVal: 3.355 ± 0.475
0.629LysTrp: 0.629 ± 0.163
1.153LysTyr: 1.153 ± 0.345
0.0LysXaa: 0.0 ± 0.0
Leu
11.953LeuAla: 11.953 ± 0.753
0.524LeuCys: 0.524 ± 0.171
7.602LeuAsp: 7.602 ± 0.62
2.307LeuGlu: 2.307 ± 0.354
2.202LeuPhe: 2.202 ± 0.376
7.183LeuGly: 7.183 ± 0.586
1.835LeuHis: 1.835 ± 0.337
3.565LeuIle: 3.565 ± 0.457
2.884LeuLys: 2.884 ± 0.449
6.291LeuLeu: 6.291 ± 0.741
1.992LeuMet: 1.992 ± 0.355
2.307LeuAsn: 2.307 ± 0.342
4.876LeuPro: 4.876 ± 0.584
3.041LeuGln: 3.041 ± 0.45
6.501LeuArg: 6.501 ± 0.676
5.977LeuSer: 5.977 ± 0.55
4.718LeuThr: 4.718 ± 0.557
6.396LeuVal: 6.396 ± 0.47
1.678LeuTrp: 1.678 ± 0.319
1.678LeuTyr: 1.678 ± 0.286
0.0LeuXaa: 0.0 ± 0.0
Met
2.831MetAla: 2.831 ± 0.419
0.105MetCys: 0.105 ± 0.078
0.944MetAsp: 0.944 ± 0.24
0.682MetGlu: 0.682 ± 0.142
0.524MetPhe: 0.524 ± 0.177
1.468MetGly: 1.468 ± 0.294
0.367MetHis: 0.367 ± 0.147
0.944MetIle: 0.944 ± 0.224
0.315MetLys: 0.315 ± 0.116
1.468MetLeu: 1.468 ± 0.248
0.419MetMet: 0.419 ± 0.119
0.734MetAsn: 0.734 ± 0.196
1.206MetPro: 1.206 ± 0.29
0.734MetGln: 0.734 ± 0.185
1.416MetArg: 1.416 ± 0.272
2.464MetSer: 2.464 ± 0.324
1.678MetThr: 1.678 ± 0.291
1.573MetVal: 1.573 ± 0.309
0.367MetTrp: 0.367 ± 0.124
0.682MetTyr: 0.682 ± 0.211
0.0MetXaa: 0.0 ± 0.0
Asn
3.88AsnAla: 3.88 ± 0.623
0.472AsnCys: 0.472 ± 0.163
1.206AsnAsp: 1.206 ± 0.242
1.153AsnGlu: 1.153 ± 0.194
0.367AsnPhe: 0.367 ± 0.139
3.565AsnGly: 3.565 ± 0.572
0.367AsnHis: 0.367 ± 0.127
0.944AsnIle: 0.944 ± 0.258
1.416AsnLys: 1.416 ± 0.299
2.15AsnLeu: 2.15 ± 0.399
0.524AsnMet: 0.524 ± 0.163
0.524AsnAsn: 0.524 ± 0.165
2.412AsnPro: 2.412 ± 0.264
0.734AsnGln: 0.734 ± 0.159
1.311AsnArg: 1.311 ± 0.287
1.153AsnSer: 1.153 ± 0.283
1.94AsnThr: 1.94 ± 0.332
2.097AsnVal: 2.097 ± 0.354
0.419AsnTrp: 0.419 ± 0.183
0.524AsnTyr: 0.524 ± 0.17
0.0AsnXaa: 0.0 ± 0.0
Pro
8.126ProAla: 8.126 ± 0.688
0.682ProCys: 0.682 ± 0.225
3.984ProAsp: 3.984 ± 0.427
4.194ProGlu: 4.194 ± 0.553
1.573ProPhe: 1.573 ± 0.253
4.981ProGly: 4.981 ± 0.418
1.416ProHis: 1.416 ± 0.323
2.045ProIle: 2.045 ± 0.281
2.097ProLys: 2.097 ± 0.272
3.775ProLeu: 3.775 ± 0.377
1.101ProMet: 1.101 ± 0.25
1.363ProAsn: 1.363 ± 0.296
3.198ProPro: 3.198 ± 0.447
1.363ProGln: 1.363 ± 0.272
3.146ProArg: 3.146 ± 0.485
2.988ProSer: 2.988 ± 0.461
3.25ProThr: 3.25 ± 0.478
4.299ProVal: 4.299 ± 0.58
0.839ProTrp: 0.839 ± 0.184
1.311ProTyr: 1.311 ± 0.254
0.0ProXaa: 0.0 ± 0.0
Gln
5.085GlnAla: 5.085 ± 0.585
0.472GlnCys: 0.472 ± 0.193
0.944GlnAsp: 0.944 ± 0.19
1.049GlnGlu: 1.049 ± 0.171
0.682GlnPhe: 0.682 ± 0.156
2.464GlnGly: 2.464 ± 0.287
1.049GlnHis: 1.049 ± 0.187
2.254GlnIle: 2.254 ± 0.374
0.839GlnLys: 0.839 ± 0.234
2.307GlnLeu: 2.307 ± 0.417
0.577GlnMet: 0.577 ± 0.184
0.786GlnAsn: 0.786 ± 0.222
2.045GlnPro: 2.045 ± 0.324
1.468GlnGln: 1.468 ± 0.262
3.093GlnArg: 3.093 ± 0.563
1.311GlnSer: 1.311 ± 0.298
1.625GlnThr: 1.625 ± 0.332
2.936GlnVal: 2.936 ± 0.406
0.891GlnTrp: 0.891 ± 0.204
0.682GlnTyr: 0.682 ± 0.186
0.0GlnXaa: 0.0 ± 0.0
Arg
7.287ArgAla: 7.287 ± 0.772
1.101ArgCys: 1.101 ± 0.274
3.932ArgAsp: 3.932 ± 0.477
4.089ArgGlu: 4.089 ± 0.482
1.783ArgPhe: 1.783 ± 0.341
5.61ArgGly: 5.61 ± 0.532
1.416ArgHis: 1.416 ± 0.322
2.674ArgIle: 2.674 ± 0.312
3.565ArgLys: 3.565 ± 0.492
7.078ArgLeu: 7.078 ± 0.666
1.625ArgMet: 1.625 ± 0.37
2.569ArgAsn: 2.569 ± 0.335
3.88ArgPro: 3.88 ± 0.571
2.517ArgGln: 2.517 ± 0.306
5.243ArgArg: 5.243 ± 0.615
3.513ArgSer: 3.513 ± 0.387
4.456ArgThr: 4.456 ± 0.528
5.243ArgVal: 5.243 ± 0.633
1.94ArgTrp: 1.94 ± 0.344
1.468ArgTyr: 1.468 ± 0.318
0.0ArgXaa: 0.0 ± 0.0
Ser
6.449SerAla: 6.449 ± 0.672
0.472SerCys: 0.472 ± 0.182
2.412SerAsp: 2.412 ± 0.333
3.565SerGlu: 3.565 ± 0.392
1.573SerPhe: 1.573 ± 0.283
4.876SerGly: 4.876 ± 0.787
1.258SerHis: 1.258 ± 0.333
2.726SerIle: 2.726 ± 0.509
1.573SerLys: 1.573 ± 0.25
4.089SerLeu: 4.089 ± 0.411
1.468SerMet: 1.468 ± 0.214
1.573SerAsn: 1.573 ± 0.224
2.674SerPro: 2.674 ± 0.335
1.468SerGln: 1.468 ± 0.232
2.884SerArg: 2.884 ± 0.352
3.146SerSer: 3.146 ± 0.572
3.88SerThr: 3.88 ± 0.462
3.932SerVal: 3.932 ± 0.418
1.049SerTrp: 1.049 ± 0.253
1.258SerTyr: 1.258 ± 0.201
0.0SerXaa: 0.0 ± 0.0
Thr
7.235ThrAla: 7.235 ± 1.04
0.944ThrCys: 0.944 ± 0.252
3.46ThrAsp: 3.46 ± 0.432
2.779ThrGlu: 2.779 ± 0.519
1.94ThrPhe: 1.94 ± 0.238
6.082ThrGly: 6.082 ± 0.485
1.363ThrHis: 1.363 ± 0.315
2.412ThrIle: 2.412 ± 0.413
2.15ThrLys: 2.15 ± 0.324
4.718ThrLeu: 4.718 ± 0.499
0.786ThrMet: 0.786 ± 0.177
1.311ThrAsn: 1.311 ± 0.265
3.775ThrPro: 3.775 ± 0.378
1.416ThrGln: 1.416 ± 0.291
3.46ThrArg: 3.46 ± 0.428
2.831ThrSer: 2.831 ± 0.41
3.722ThrThr: 3.722 ± 0.458
4.142ThrVal: 4.142 ± 0.436
1.363ThrTrp: 1.363 ± 0.223
1.783ThrTyr: 1.783 ± 0.339
0.0ThrXaa: 0.0 ± 0.0
Val
9.489ValAla: 9.489 ± 0.921
0.839ValCys: 0.839 ± 0.189
6.606ValAsp: 6.606 ± 0.571
6.658ValGlu: 6.658 ± 0.593
2.097ValPhe: 2.097 ± 0.404
6.816ValGly: 6.816 ± 0.77
1.625ValHis: 1.625 ± 0.34
2.412ValIle: 2.412 ± 0.425
2.831ValLys: 2.831 ± 0.394
6.711ValLeu: 6.711 ± 0.717
1.101ValMet: 1.101 ± 0.213
1.835ValAsn: 1.835 ± 0.279
4.823ValPro: 4.823 ± 0.452
2.621ValGln: 2.621 ± 0.432
4.928ValArg: 4.928 ± 0.545
4.142ValSer: 4.142 ± 0.416
5.348ValThr: 5.348 ± 0.588
7.759ValVal: 7.759 ± 0.79
1.783ValTrp: 1.783 ± 0.337
1.52ValTyr: 1.52 ± 0.281
0.0ValXaa: 0.0 ± 0.0
Trp
1.94TrpAla: 1.94 ± 0.356
0.367TrpCys: 0.367 ± 0.121
1.101TrpAsp: 1.101 ± 0.225
0.682TrpGlu: 0.682 ± 0.179
0.682TrpPhe: 0.682 ± 0.186
1.52TrpGly: 1.52 ± 0.335
0.682TrpHis: 0.682 ± 0.195
0.839TrpIle: 0.839 ± 0.216
0.472TrpLys: 0.472 ± 0.138
2.831TrpLeu: 2.831 ± 0.421
0.419TrpMet: 0.419 ± 0.166
0.472TrpAsn: 0.472 ± 0.175
0.891TrpPro: 0.891 ± 0.209
1.049TrpGln: 1.049 ± 0.232
2.621TrpArg: 2.621 ± 0.366
1.101TrpSer: 1.101 ± 0.245
1.153TrpThr: 1.153 ± 0.252
0.996TrpVal: 0.996 ± 0.211
0.419TrpTrp: 0.419 ± 0.132
0.472TrpTyr: 0.472 ± 0.172
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.726TyrAla: 2.726 ± 0.28
0.315TyrCys: 0.315 ± 0.144
1.52TyrAsp: 1.52 ± 0.288
1.311TyrGlu: 1.311 ± 0.246
0.629TyrPhe: 0.629 ± 0.172
2.202TyrGly: 2.202 ± 0.359
0.472TyrHis: 0.472 ± 0.18
0.472TyrIle: 0.472 ± 0.167
0.682TyrLys: 0.682 ± 0.205
2.202TyrLeu: 2.202 ± 0.32
0.629TyrMet: 0.629 ± 0.164
0.682TyrAsn: 0.682 ± 0.193
0.944TyrPro: 0.944 ± 0.19
0.786TyrGln: 0.786 ± 0.258
1.783TyrArg: 1.783 ± 0.368
1.101TyrSer: 1.101 ± 0.24
1.625TyrThr: 1.625 ± 0.254
1.52TyrVal: 1.52 ± 0.307
0.524TyrTrp: 0.524 ± 0.172
0.419TyrTyr: 0.419 ± 0.119
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 92 proteins (19075 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
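A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's own procedure: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the reported error is taken to be the standard deviation across replicates. The names `pair_counts` and `bootstrap_error`, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides in a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: the resampling unit is the whole protein, drawn with
    replacement, and the error is the spread across the replicates.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy proteome, not the 92 phage proteins.
mean, err = bootstrap_error(["MAAAL", "MAGX", "AAGG"], "AA")
print(f"AA: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual residues keeps the dipeptides of one protein together, so the error reflects variation between proteins.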

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski