Amino acid dipepetide frequency for Mycobacterium phage Oline

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
18.072AlaAla: 18.072 ± 1.282
1.383AlaCys: 1.383 ± 0.325
7.192AlaAsp: 7.192 ± 0.554
7.469AlaGlu: 7.469 ± 0.631
2.49AlaPhe: 2.49 ± 0.365
10.788AlaGly: 10.788 ± 1.337
2.49AlaHis: 2.49 ± 0.354
4.702AlaIle: 4.702 ± 0.537
3.919AlaLys: 3.919 ± 0.436
10.65AlaLeu: 10.65 ± 0.903
3.227AlaMet: 3.227 ± 0.344
2.951AlaAsn: 2.951 ± 0.316
6.685AlaPro: 6.685 ± 0.507
4.287AlaGln: 4.287 ± 0.397
8.805AlaArg: 8.805 ± 0.806
5.302AlaSer: 5.302 ± 0.501
7.699AlaThr: 7.699 ± 0.558
8.898AlaVal: 8.898 ± 0.961
1.844AlaTrp: 1.844 ± 0.313
2.904AlaTyr: 2.904 ± 0.357
0.0AlaXaa: 0.0 ± 0.0
Cys
0.83CysAla: 0.83 ± 0.255
0.046CysCys: 0.046 ± 0.041
1.014CysAsp: 1.014 ± 0.235
0.83CysGlu: 0.83 ± 0.228
0.323CysPhe: 0.323 ± 0.118
1.752CysGly: 1.752 ± 0.374
0.415CysHis: 0.415 ± 0.13
0.369CysIle: 0.369 ± 0.131
0.231CysLys: 0.231 ± 0.099
0.553CysLeu: 0.553 ± 0.181
0.277CysMet: 0.277 ± 0.117
0.369CysAsn: 0.369 ± 0.133
1.014CysPro: 1.014 ± 0.258
0.323CysGln: 0.323 ± 0.128
0.83CysArg: 0.83 ± 0.2
0.553CysSer: 0.553 ± 0.155
0.599CysThr: 0.599 ± 0.178
0.83CysVal: 0.83 ± 0.196
0.231CysTrp: 0.231 ± 0.127
0.323CysTyr: 0.323 ± 0.119
0.0CysXaa: 0.0 ± 0.0
Asp
6.869AspAla: 6.869 ± 0.586
1.106AspCys: 1.106 ± 0.301
4.656AspAsp: 4.656 ± 0.569
5.163AspGlu: 5.163 ± 0.591
1.706AspPhe: 1.706 ± 0.24
6.039AspGly: 6.039 ± 0.642
1.291AspHis: 1.291 ± 0.309
3.135AspIle: 3.135 ± 0.401
1.982AspLys: 1.982 ± 0.374
5.671AspLeu: 5.671 ± 0.483
1.06AspMet: 1.06 ± 0.242
2.167AspAsn: 2.167 ± 0.305
5.163AspPro: 5.163 ± 0.516
1.429AspGln: 1.429 ± 0.267
3.688AspArg: 3.688 ± 0.465
2.858AspSer: 2.858 ± 0.388
3.965AspThr: 3.965 ± 0.475
3.734AspVal: 3.734 ± 0.349
1.199AspTrp: 1.199 ± 0.228
1.798AspTyr: 1.798 ± 0.252
0.0AspXaa: 0.0 ± 0.0
Glu
7.837GluAla: 7.837 ± 0.849
0.692GluCys: 0.692 ± 0.209
3.596GluAsp: 3.596 ± 0.528
2.167GluGlu: 2.167 ± 0.33
2.121GluPhe: 2.121 ± 0.317
4.518GluGly: 4.518 ± 0.556
1.429GluHis: 1.429 ± 0.251
3.78GluIle: 3.78 ± 0.462
1.752GluLys: 1.752 ± 0.301
6.132GluLeu: 6.132 ± 0.561
0.876GluMet: 0.876 ± 0.199
0.968GluAsn: 0.968 ± 0.177
3.227GluPro: 3.227 ± 0.488
2.582GluGln: 2.582 ± 0.357
4.38GluArg: 4.38 ± 0.439
1.66GluSer: 1.66 ± 0.298
3.596GluThr: 3.596 ± 0.427
6.085GluVal: 6.085 ± 0.562
0.876GluTrp: 0.876 ± 0.2
1.106GluTyr: 1.106 ± 0.22
0.0GluXaa: 0.0 ± 0.0
Phe
2.997PheAla: 2.997 ± 0.326
0.277PheCys: 0.277 ± 0.11
1.475PheAsp: 1.475 ± 0.265
1.383PheGlu: 1.383 ± 0.226
0.507PhePhe: 0.507 ± 0.129
2.997PheGly: 2.997 ± 0.347
0.461PheHis: 0.461 ± 0.125
0.738PheIle: 0.738 ± 0.17
1.014PheLys: 1.014 ± 0.206
1.475PheLeu: 1.475 ± 0.284
0.323PheMet: 0.323 ± 0.129
0.922PheAsn: 0.922 ± 0.193
1.383PhePro: 1.383 ± 0.247
0.553PheGln: 0.553 ± 0.14
1.567PheArg: 1.567 ± 0.252
1.153PheSer: 1.153 ± 0.232
1.475PheThr: 1.475 ± 0.252
1.383PheVal: 1.383 ± 0.223
0.415PheTrp: 0.415 ± 0.155
0.645PheTyr: 0.645 ± 0.173
0.0PheXaa: 0.0 ± 0.0
Gly
9.359GlyAla: 9.359 ± 1.448
1.337GlyCys: 1.337 ± 0.286
5.44GlyAsp: 5.44 ± 0.539
5.486GlyGlu: 5.486 ± 0.65
1.982GlyPhe: 1.982 ± 0.317
13.185GlyGly: 13.185 ± 2.488
1.936GlyHis: 1.936 ± 0.397
4.795GlyIle: 4.795 ± 0.362
3.55GlyLys: 3.55 ± 0.404
7.837GlyLeu: 7.837 ± 0.908
2.075GlyMet: 2.075 ± 0.293
2.536GlyAsn: 2.536 ± 0.453
4.426GlyPro: 4.426 ± 0.464
3.412GlyGln: 3.412 ± 0.523
5.763GlyArg: 5.763 ± 0.605
5.256GlySer: 5.256 ± 0.448
6.869GlyThr: 6.869 ± 0.68
6.777GlyVal: 6.777 ± 0.603
1.936GlyTrp: 1.936 ± 0.324
2.812GlyTyr: 2.812 ± 0.343
0.0GlyXaa: 0.0 ± 0.0
His
1.982HisAla: 1.982 ± 0.337
0.277HisCys: 0.277 ± 0.123
1.153HisAsp: 1.153 ± 0.282
1.475HisGlu: 1.475 ± 0.291
0.461HisPhe: 0.461 ± 0.136
1.66HisGly: 1.66 ± 0.238
0.876HisHis: 0.876 ± 0.276
1.521HisIle: 1.521 ± 0.306
0.369HisLys: 0.369 ± 0.139
1.337HisLeu: 1.337 ± 0.313
0.553HisMet: 0.553 ± 0.161
0.415HisAsn: 0.415 ± 0.126
1.199HisPro: 1.199 ± 0.29
0.323HisGln: 0.323 ± 0.123
2.213HisArg: 2.213 ± 0.434
0.83HisSer: 0.83 ± 0.176
2.028HisThr: 2.028 ± 0.369
1.199HisVal: 1.199 ± 0.335
0.231HisTrp: 0.231 ± 0.093
0.738HisTyr: 0.738 ± 0.24
0.0HisXaa: 0.0 ± 0.0
Ile
6.039IleAla: 6.039 ± 0.466
0.553IleCys: 0.553 ± 0.175
3.688IleAsp: 3.688 ± 0.388
4.887IleGlu: 4.887 ± 0.594
0.83IlePhe: 0.83 ± 0.146
5.44IleGly: 5.44 ± 0.5
0.645IleHis: 0.645 ± 0.18
1.567IleIle: 1.567 ± 0.302
1.291IleLys: 1.291 ± 0.306
3.135IleLeu: 3.135 ± 0.365
0.83IleMet: 0.83 ± 0.178
1.521IleAsn: 1.521 ± 0.246
2.582IlePro: 2.582 ± 0.298
1.014IleGln: 1.014 ± 0.227
2.443IleArg: 2.443 ± 0.44
2.351IleSer: 2.351 ± 0.388
3.55IleThr: 3.55 ± 0.37
3.089IleVal: 3.089 ± 0.377
0.323IleTrp: 0.323 ± 0.119
0.83IleTyr: 0.83 ± 0.184
0.0IleXaa: 0.0 ± 0.0
Lys
4.149LysAla: 4.149 ± 0.546
0.231LysCys: 0.231 ± 0.094
1.567LysAsp: 1.567 ± 0.218
0.784LysGlu: 0.784 ± 0.252
0.599LysPhe: 0.599 ± 0.163
2.397LysGly: 2.397 ± 0.298
0.599LysHis: 0.599 ± 0.168
1.706LysIle: 1.706 ± 0.234
0.922LysLys: 0.922 ± 0.239
3.043LysLeu: 3.043 ± 0.343
0.507LysMet: 0.507 ± 0.164
0.784LysAsn: 0.784 ± 0.204
1.475LysPro: 1.475 ± 0.343
1.337LysGln: 1.337 ± 0.229
2.121LysArg: 2.121 ± 0.314
1.475LysSer: 1.475 ± 0.264
2.213LysThr: 2.213 ± 0.291
2.628LysVal: 2.628 ± 0.351
0.369LysTrp: 0.369 ± 0.135
0.645LysTyr: 0.645 ± 0.173
0.0LysXaa: 0.0 ± 0.0
Leu
11.295LeuAla: 11.295 ± 0.817
1.014LeuCys: 1.014 ± 0.221
6.27LeuAsp: 6.27 ± 0.564
3.043LeuGlu: 3.043 ± 0.288
1.752LeuPhe: 1.752 ± 0.352
7.33LeuGly: 7.33 ± 0.954
1.199LeuHis: 1.199 ± 0.246
4.241LeuIle: 4.241 ± 0.418
2.49LeuLys: 2.49 ± 0.442
5.117LeuLeu: 5.117 ± 0.513
1.752LeuMet: 1.752 ± 0.249
3.273LeuAsn: 3.273 ± 0.49
5.809LeuPro: 5.809 ± 0.58
1.521LeuGln: 1.521 ± 0.296
4.103LeuArg: 4.103 ± 0.55
4.103LeuSer: 4.103 ± 0.466
7.054LeuThr: 7.054 ± 0.548
4.518LeuVal: 4.518 ± 0.517
1.291LeuTrp: 1.291 ± 0.27
1.66LeuTyr: 1.66 ± 0.268
0.0LeuXaa: 0.0 ± 0.0
Met
2.72MetAla: 2.72 ± 0.325
0.231MetCys: 0.231 ± 0.088
0.645MetAsp: 0.645 ± 0.161
0.692MetGlu: 0.692 ± 0.153
0.83MetPhe: 0.83 ± 0.172
1.475MetGly: 1.475 ± 0.229
0.461MetHis: 0.461 ± 0.16
0.876MetIle: 0.876 ± 0.195
0.692MetLys: 0.692 ± 0.179
1.429MetLeu: 1.429 ± 0.213
0.369MetMet: 0.369 ± 0.134
0.415MetAsn: 0.415 ± 0.154
1.429MetPro: 1.429 ± 0.25
0.507MetGln: 0.507 ± 0.151
1.291MetArg: 1.291 ± 0.261
1.752MetSer: 1.752 ± 0.315
2.628MetThr: 2.628 ± 0.287
1.383MetVal: 1.383 ± 0.262
0.553MetTrp: 0.553 ± 0.156
0.507MetTyr: 0.507 ± 0.169
0.0MetXaa: 0.0 ± 0.0
Asn
3.319AsnAla: 3.319 ± 0.387
0.231AsnCys: 0.231 ± 0.086
1.844AsnAsp: 1.844 ± 0.317
1.337AsnGlu: 1.337 ± 0.259
0.369AsnPhe: 0.369 ± 0.128
3.504AsnGly: 3.504 ± 0.447
0.645AsnHis: 0.645 ± 0.184
1.521AsnIle: 1.521 ± 0.34
1.106AsnLys: 1.106 ± 0.242
1.798AsnLeu: 1.798 ± 0.286
0.507AsnMet: 0.507 ± 0.114
1.245AsnAsn: 1.245 ± 0.287
1.844AsnPro: 1.844 ± 0.276
0.369AsnGln: 0.369 ± 0.134
2.028AsnArg: 2.028 ± 0.314
1.752AsnSer: 1.752 ± 0.305
2.674AsnThr: 2.674 ± 0.413
1.798AsnVal: 1.798 ± 0.264
0.415AsnTrp: 0.415 ± 0.159
0.83AsnTyr: 0.83 ± 0.218
0.0AsnXaa: 0.0 ± 0.0
Pro
7.515ProAla: 7.515 ± 0.577
0.507ProCys: 0.507 ± 0.165
4.103ProAsp: 4.103 ± 0.504
5.671ProGlu: 5.671 ± 0.727
1.429ProPhe: 1.429 ± 0.248
6.085ProGly: 6.085 ± 0.658
1.521ProHis: 1.521 ± 0.341
2.582ProIle: 2.582 ± 0.416
1.245ProLys: 1.245 ± 0.217
3.504ProLeu: 3.504 ± 0.39
1.199ProMet: 1.199 ± 0.227
2.075ProAsn: 2.075 ± 0.243
4.472ProPro: 4.472 ± 0.618
1.66ProGln: 1.66 ± 0.23
3.55ProArg: 3.55 ± 0.477
2.443ProSer: 2.443 ± 0.377
3.734ProThr: 3.734 ± 0.315
4.979ProVal: 4.979 ± 0.493
1.291ProTrp: 1.291 ± 0.197
1.014ProTyr: 1.014 ± 0.238
0.0ProXaa: 0.0 ± 0.0
Gln
4.057GlnAla: 4.057 ± 0.642
0.231GlnCys: 0.231 ± 0.12
1.245GlnAsp: 1.245 ± 0.259
0.876GlnGlu: 0.876 ± 0.156
1.153GlnPhe: 1.153 ± 0.2
2.167GlnGly: 2.167 ± 0.355
0.968GlnHis: 0.968 ± 0.2
2.075GlnIle: 2.075 ± 0.261
0.876GlnLys: 0.876 ± 0.214
2.351GlnLeu: 2.351 ± 0.301
0.553GlnMet: 0.553 ± 0.132
0.784GlnAsn: 0.784 ± 0.194
1.614GlnPro: 1.614 ± 0.263
1.614GlnGln: 1.614 ± 0.25
2.858GlnArg: 2.858 ± 0.405
1.614GlnSer: 1.614 ± 0.281
2.167GlnThr: 2.167 ± 0.296
1.936GlnVal: 1.936 ± 0.285
0.692GlnTrp: 0.692 ± 0.172
0.645GlnTyr: 0.645 ± 0.183
0.0GlnXaa: 0.0 ± 0.0
Arg
6.961ArgAla: 6.961 ± 0.665
0.784ArgCys: 0.784 ± 0.195
4.241ArgAsp: 4.241 ± 0.495
4.795ArgGlu: 4.795 ± 0.574
1.567ArgPhe: 1.567 ± 0.266
4.241ArgGly: 4.241 ± 0.501
1.706ArgHis: 1.706 ± 0.336
2.351ArgIle: 2.351 ± 0.297
1.752ArgLys: 1.752 ± 0.284
5.947ArgLeu: 5.947 ± 0.453
1.66ArgMet: 1.66 ± 0.327
2.351ArgAsn: 2.351 ± 0.393
4.149ArgPro: 4.149 ± 0.481
2.536ArgGln: 2.536 ± 0.377
6.132ArgArg: 6.132 ± 0.703
3.227ArgSer: 3.227 ± 0.39
3.826ArgThr: 3.826 ± 0.398
5.486ArgVal: 5.486 ± 0.697
1.66ArgTrp: 1.66 ± 0.293
2.259ArgTyr: 2.259 ± 0.364
0.0ArgXaa: 0.0 ± 0.0
Ser
5.256SerAla: 5.256 ± 0.52
0.553SerCys: 0.553 ± 0.16
3.504SerAsp: 3.504 ± 0.31
2.259SerGlu: 2.259 ± 0.372
0.784SerPhe: 0.784 ± 0.213
5.763SerGly: 5.763 ± 0.699
0.645SerHis: 0.645 ± 0.158
1.66SerIle: 1.66 ± 0.312
1.245SerLys: 1.245 ± 0.231
3.826SerLeu: 3.826 ± 0.671
1.337SerMet: 1.337 ± 0.275
1.291SerAsn: 1.291 ± 0.278
3.365SerPro: 3.365 ± 0.405
1.844SerGln: 1.844 ± 0.281
3.78SerArg: 3.78 ± 0.379
2.536SerSer: 2.536 ± 0.429
3.78SerThr: 3.78 ± 0.485
3.873SerVal: 3.873 ± 0.448
1.106SerTrp: 1.106 ± 0.194
1.429SerTyr: 1.429 ± 0.223
0.0SerXaa: 0.0 ± 0.0
Thr
8.667ThrAla: 8.667 ± 0.494
0.553ThrCys: 0.553 ± 0.182
5.21ThrAsp: 5.21 ± 0.584
3.734ThrGlu: 3.734 ± 0.445
1.475ThrPhe: 1.475 ± 0.241
7.054ThrGly: 7.054 ± 0.469
0.876ThrHis: 0.876 ± 0.192
3.688ThrIle: 3.688 ± 0.39
2.305ThrLys: 2.305 ± 0.317
5.348ThrLeu: 5.348 ± 0.501
1.66ThrMet: 1.66 ± 0.277
1.89ThrAsn: 1.89 ± 0.37
4.933ThrPro: 4.933 ± 0.513
1.89ThrGln: 1.89 ± 0.307
3.965ThrArg: 3.965 ± 0.484
4.334ThrSer: 4.334 ± 0.378
4.149ThrThr: 4.149 ± 0.523
6.178ThrVal: 6.178 ± 0.484
1.106ThrTrp: 1.106 ± 0.266
1.614ThrTyr: 1.614 ± 0.272
0.0ThrXaa: 0.0 ± 0.0
Val
8.298ValAla: 8.298 ± 0.496
0.968ValCys: 0.968 ± 0.25
4.61ValAsp: 4.61 ± 0.435
5.532ValGlu: 5.532 ± 0.575
1.521ValPhe: 1.521 ± 0.21
7.238ValGly: 7.238 ± 0.645
1.706ValHis: 1.706 ± 0.305
3.319ValIle: 3.319 ± 0.341
1.982ValLys: 1.982 ± 0.264
6.362ValLeu: 6.362 ± 0.485
1.383ValMet: 1.383 ± 0.267
2.121ValAsn: 2.121 ± 0.279
3.78ValPro: 3.78 ± 0.339
1.844ValGln: 1.844 ± 0.257
4.702ValArg: 4.702 ± 0.443
4.195ValSer: 4.195 ± 0.396
5.44ValThr: 5.44 ± 0.617
6.5ValVal: 6.5 ± 0.656
1.798ValTrp: 1.798 ± 0.311
1.798ValTyr: 1.798 ± 0.36
0.0ValXaa: 0.0 ± 0.0
Trp
2.305TrpAla: 2.305 ± 0.389
0.415TrpCys: 0.415 ± 0.134
1.199TrpAsp: 1.199 ± 0.243
0.415TrpGlu: 0.415 ± 0.135
0.645TrpPhe: 0.645 ± 0.222
1.014TrpGly: 1.014 ± 0.224
0.461TrpHis: 0.461 ± 0.16
0.922TrpIle: 0.922 ± 0.24
0.277TrpLys: 0.277 ± 0.108
1.475TrpLeu: 1.475 ± 0.277
0.461TrpMet: 0.461 ± 0.164
0.369TrpAsn: 0.369 ± 0.126
0.876TrpPro: 0.876 ± 0.219
0.599TrpGln: 0.599 ± 0.18
1.291TrpArg: 1.291 ± 0.256
1.337TrpSer: 1.337 ± 0.243
1.521TrpThr: 1.521 ± 0.272
1.521TrpVal: 1.521 ± 0.325
0.461TrpTrp: 0.461 ± 0.152
0.415TrpTyr: 0.415 ± 0.146
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.227TyrAla: 3.227 ± 0.408
0.323TyrCys: 0.323 ± 0.117
2.028TyrAsp: 2.028 ± 0.279
1.383TyrGlu: 1.383 ± 0.244
0.599TyrPhe: 0.599 ± 0.172
2.121TyrGly: 2.121 ± 0.392
0.507TyrHis: 0.507 ± 0.161
1.06TyrIle: 1.06 ± 0.196
0.553TyrLys: 0.553 ± 0.176
1.844TyrLeu: 1.844 ± 0.347
0.231TyrMet: 0.231 ± 0.112
0.692TyrAsn: 0.692 ± 0.164
1.153TyrPro: 1.153 ± 0.263
0.83TyrGln: 0.83 ± 0.194
2.167TyrArg: 2.167 ± 0.347
1.106TyrSer: 1.106 ± 0.196
1.66TyrThr: 1.66 ± 0.267
2.305TyrVal: 2.305 ± 0.299
0.184TyrTrp: 0.184 ± 0.085
0.507TyrTyr: 0.507 ± 0.154
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 101 proteins (21692 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski