Amino acid dipepetide frequency for Arthrobacter phage TripleJ

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
19.151AlaAla: 19.151 ± 2.583
0.413AlaCys: 0.413 ± 0.164
7.853AlaAsp: 7.853 ± 0.724
8.336AlaGlu: 8.336 ± 0.884
3.169AlaPhe: 3.169 ± 0.409
10.127AlaGly: 10.127 ± 1.259
2.618AlaHis: 2.618 ± 0.458
4.064AlaIle: 4.064 ± 0.481
5.924AlaLys: 5.924 ± 0.616
11.436AlaLeu: 11.436 ± 0.894
3.238AlaMet: 3.238 ± 0.499
4.753AlaAsn: 4.753 ± 0.77
5.511AlaPro: 5.511 ± 0.745
5.856AlaGln: 5.856 ± 0.686
8.129AlaArg: 8.129 ± 0.697
6.2AlaSer: 6.2 ± 0.78
8.129AlaThr: 8.129 ± 0.916
8.887AlaVal: 8.887 ± 0.638
2.342AlaTrp: 2.342 ± 0.413
2.549AlaTyr: 2.549 ± 0.473
0.0AlaXaa: 0.0 ± 0.0
Cys
0.758CysAla: 0.758 ± 0.205
0.0CysCys: 0.0 ± 0.0
0.344CysAsp: 0.344 ± 0.212
0.413CysGlu: 0.413 ± 0.164
0.069CysPhe: 0.069 ± 0.063
1.24CysGly: 1.24 ± 0.356
0.138CysHis: 0.138 ± 0.097
0.069CysIle: 0.069 ± 0.064
0.344CysLys: 0.344 ± 0.153
0.482CysLeu: 0.482 ± 0.184
0.138CysMet: 0.138 ± 0.094
0.276CysAsn: 0.276 ± 0.147
1.102CysPro: 1.102 ± 0.264
0.276CysGln: 0.276 ± 0.14
0.827CysArg: 0.827 ± 0.268
0.207CysSer: 0.207 ± 0.122
1.033CysThr: 1.033 ± 0.341
0.207CysVal: 0.207 ± 0.122
0.138CysTrp: 0.138 ± 0.105
0.207CysTyr: 0.207 ± 0.142
0.0CysXaa: 0.0 ± 0.0
Asp
7.096AspAla: 7.096 ± 0.77
0.62AspCys: 0.62 ± 0.273
3.858AspAsp: 3.858 ± 0.707
4.064AspGlu: 4.064 ± 0.642
1.791AspPhe: 1.791 ± 0.301
5.856AspGly: 5.856 ± 0.63
1.791AspHis: 1.791 ± 0.38
3.513AspIle: 3.513 ± 0.515
2.204AspLys: 2.204 ± 0.375
5.649AspLeu: 5.649 ± 0.649
1.86AspMet: 1.86 ± 0.417
1.102AspAsn: 1.102 ± 0.282
3.651AspPro: 3.651 ± 0.505
2.411AspGln: 2.411 ± 0.412
3.513AspArg: 3.513 ± 0.444
3.376AspSer: 3.376 ± 0.494
3.1AspThr: 3.1 ± 0.421
3.72AspVal: 3.72 ± 0.877
1.24AspTrp: 1.24 ± 0.24
1.653AspTyr: 1.653 ± 0.331
0.0AspXaa: 0.0 ± 0.0
Glu
8.405GluAla: 8.405 ± 0.78
0.827GluCys: 0.827 ± 0.287
3.1GluAsp: 3.1 ± 0.501
2.756GluGlu: 2.756 ± 0.492
1.722GluPhe: 1.722 ± 0.359
3.307GluGly: 3.307 ± 0.561
1.033GluHis: 1.033 ± 0.219
3.169GluIle: 3.169 ± 0.543
2.549GluLys: 2.549 ± 0.381
5.442GluLeu: 5.442 ± 0.658
0.827GluMet: 0.827 ± 0.217
1.791GluAsn: 1.791 ± 0.41
2.824GluPro: 2.824 ± 0.582
3.031GluGln: 3.031 ± 0.47
4.064GluArg: 4.064 ± 0.513
2.549GluSer: 2.549 ± 0.516
3.307GluThr: 3.307 ± 0.472
4.684GluVal: 4.684 ± 0.451
1.791GluTrp: 1.791 ± 0.371
1.447GluTyr: 1.447 ± 0.337
0.0GluXaa: 0.0 ± 0.0
Phe
2.756PheAla: 2.756 ± 0.561
0.207PheCys: 0.207 ± 0.108
1.791PheAsp: 1.791 ± 0.409
1.929PheGlu: 1.929 ± 0.415
1.171PhePhe: 1.171 ± 0.326
2.756PheGly: 2.756 ± 0.544
0.413PheHis: 0.413 ± 0.206
1.516PheIle: 1.516 ± 0.346
1.309PheLys: 1.309 ± 0.262
1.653PheLeu: 1.653 ± 0.349
0.276PheMet: 0.276 ± 0.118
0.964PheAsn: 0.964 ± 0.291
1.722PhePro: 1.722 ± 0.289
1.102PheGln: 1.102 ± 0.249
1.447PheArg: 1.447 ± 0.342
1.378PheSer: 1.378 ± 0.325
2.687PheThr: 2.687 ± 0.496
0.62PheVal: 0.62 ± 0.183
0.413PheTrp: 0.413 ± 0.162
0.758PheTyr: 0.758 ± 0.205
0.0PheXaa: 0.0 ± 0.0
Gly
8.818GlyAla: 8.818 ± 0.856
0.758GlyCys: 0.758 ± 0.247
3.996GlyAsp: 3.996 ± 0.52
3.789GlyGlu: 3.789 ± 0.552
2.824GlyPhe: 2.824 ± 0.537
7.578GlyGly: 7.578 ± 1.343
2.067GlyHis: 2.067 ± 0.366
3.376GlyIle: 3.376 ± 0.376
4.616GlyLys: 4.616 ± 0.585
7.096GlyLeu: 7.096 ± 0.716
2.067GlyMet: 2.067 ± 0.378
3.238GlyAsn: 3.238 ± 0.456
2.893GlyPro: 2.893 ± 0.397
3.72GlyGln: 3.72 ± 0.606
4.409GlyArg: 4.409 ± 0.505
4.822GlySer: 4.822 ± 0.592
6.751GlyThr: 6.751 ± 0.744
5.236GlyVal: 5.236 ± 0.697
2.273GlyTrp: 2.273 ± 0.393
2.48GlyTyr: 2.48 ± 0.457
0.0GlyXaa: 0.0 ± 0.0
His
2.962HisAla: 2.962 ± 0.388
0.207HisCys: 0.207 ± 0.124
0.964HisAsp: 0.964 ± 0.255
2.204HisGlu: 2.204 ± 0.397
0.758HisPhe: 0.758 ± 0.231
1.722HisGly: 1.722 ± 0.393
0.896HisHis: 0.896 ± 0.241
0.758HisIle: 0.758 ± 0.232
0.62HisLys: 0.62 ± 0.183
0.827HisLeu: 0.827 ± 0.251
0.276HisMet: 0.276 ± 0.137
0.551HisAsn: 0.551 ± 0.179
0.964HisPro: 0.964 ± 0.292
0.964HisGln: 0.964 ± 0.252
1.584HisArg: 1.584 ± 0.346
0.62HisSer: 0.62 ± 0.195
0.964HisThr: 0.964 ± 0.267
1.378HisVal: 1.378 ± 0.257
0.689HisTrp: 0.689 ± 0.215
0.758HisTyr: 0.758 ± 0.194
0.0HisXaa: 0.0 ± 0.0
Ile
4.891IleAla: 4.891 ± 0.521
0.207IleCys: 0.207 ± 0.114
3.376IleAsp: 3.376 ± 0.487
2.48IleGlu: 2.48 ± 0.436
0.758IlePhe: 0.758 ± 0.231
3.789IleGly: 3.789 ± 0.443
0.827IleHis: 0.827 ± 0.232
1.998IleIle: 1.998 ± 0.362
1.516IleLys: 1.516 ± 0.338
2.549IleLeu: 2.549 ± 0.42
0.964IleMet: 0.964 ± 0.325
1.033IleAsn: 1.033 ± 0.265
2.824IlePro: 2.824 ± 0.46
1.929IleGln: 1.929 ± 0.367
3.582IleArg: 3.582 ± 0.544
2.824IleSer: 2.824 ± 0.508
4.753IleThr: 4.753 ± 0.718
2.549IleVal: 2.549 ± 0.435
0.413IleTrp: 0.413 ± 0.157
0.827IleTyr: 0.827 ± 0.281
0.0IleXaa: 0.0 ± 0.0
Lys
5.718LysAla: 5.718 ± 0.671
0.551LysCys: 0.551 ± 0.222
2.824LysAsp: 2.824 ± 0.447
2.618LysGlu: 2.618 ± 0.474
0.964LysPhe: 0.964 ± 0.195
2.756LysGly: 2.756 ± 0.38
0.896LysHis: 0.896 ± 0.296
1.171LysIle: 1.171 ± 0.325
1.791LysLys: 1.791 ± 0.416
3.238LysLeu: 3.238 ± 0.51
1.24LysMet: 1.24 ± 0.272
1.378LysAsn: 1.378 ± 0.26
2.893LysPro: 2.893 ± 0.473
1.86LysGln: 1.86 ± 0.325
2.273LysArg: 2.273 ± 0.356
2.48LysSer: 2.48 ± 0.444
2.962LysThr: 2.962 ± 0.451
2.48LysVal: 2.48 ± 0.398
0.689LysTrp: 0.689 ± 0.234
1.516LysTyr: 1.516 ± 0.284
0.0LysXaa: 0.0 ± 0.0
Leu
10.265LeuAla: 10.265 ± 0.826
0.276LeuCys: 0.276 ± 0.139
5.442LeuAsp: 5.442 ± 0.612
5.098LeuGlu: 5.098 ± 0.677
1.791LeuPhe: 1.791 ± 0.33
6.476LeuGly: 6.476 ± 0.999
1.516LeuHis: 1.516 ± 0.336
4.064LeuIle: 4.064 ± 0.503
3.169LeuLys: 3.169 ± 0.519
5.924LeuLeu: 5.924 ± 0.708
1.171LeuMet: 1.171 ± 0.262
3.513LeuAsn: 3.513 ± 0.483
4.271LeuPro: 4.271 ± 0.609
2.687LeuGln: 2.687 ± 0.543
4.96LeuArg: 4.96 ± 0.541
4.064LeuSer: 4.064 ± 0.556
6.889LeuThr: 6.889 ± 0.738
5.236LeuVal: 5.236 ± 0.722
0.758LeuTrp: 0.758 ± 0.219
2.136LeuTyr: 2.136 ± 0.385
0.0LeuXaa: 0.0 ± 0.0
Met
3.927MetAla: 3.927 ± 0.518
0.138MetCys: 0.138 ± 0.105
0.964MetAsp: 0.964 ± 0.19
0.827MetGlu: 0.827 ± 0.222
0.413MetPhe: 0.413 ± 0.18
2.136MetGly: 2.136 ± 0.623
0.207MetHis: 0.207 ± 0.099
0.896MetIle: 0.896 ± 0.25
1.033MetLys: 1.033 ± 0.235
1.24MetLeu: 1.24 ± 0.281
0.207MetMet: 0.207 ± 0.112
0.827MetAsn: 0.827 ± 0.248
1.102MetPro: 1.102 ± 0.259
0.758MetGln: 0.758 ± 0.235
1.033MetArg: 1.033 ± 0.257
2.136MetSer: 2.136 ± 0.339
2.411MetThr: 2.411 ± 0.414
1.791MetVal: 1.791 ± 0.33
0.276MetTrp: 0.276 ± 0.109
0.138MetTyr: 0.138 ± 0.083
0.0MetXaa: 0.0 ± 0.0
Asn
5.167AsnAla: 5.167 ± 0.541
0.413AsnCys: 0.413 ± 0.174
2.687AsnAsp: 2.687 ± 0.452
1.447AsnGlu: 1.447 ± 0.343
1.102AsnPhe: 1.102 ± 0.281
3.307AsnGly: 3.307 ± 0.526
0.413AsnHis: 0.413 ± 0.21
1.309AsnIle: 1.309 ± 0.235
1.171AsnLys: 1.171 ± 0.311
2.48AsnLeu: 2.48 ± 0.59
0.551AsnMet: 0.551 ± 0.17
0.689AsnAsn: 0.689 ± 0.214
1.929AsnPro: 1.929 ± 0.348
0.964AsnGln: 0.964 ± 0.24
1.998AsnArg: 1.998 ± 0.339
1.102AsnSer: 1.102 ± 0.257
2.273AsnThr: 2.273 ± 0.516
1.791AsnVal: 1.791 ± 0.347
0.551AsnTrp: 0.551 ± 0.239
0.827AsnTyr: 0.827 ± 0.242
0.0AsnXaa: 0.0 ± 0.0
Pro
6.958ProAla: 6.958 ± 0.747
0.413ProCys: 0.413 ± 0.163
3.376ProAsp: 3.376 ± 0.582
4.133ProGlu: 4.133 ± 0.669
1.033ProPhe: 1.033 ± 0.295
4.478ProGly: 4.478 ± 0.457
1.033ProHis: 1.033 ± 0.228
1.791ProIle: 1.791 ± 0.396
2.687ProLys: 2.687 ± 0.371
2.893ProLeu: 2.893 ± 0.561
1.24ProMet: 1.24 ± 0.273
1.378ProAsn: 1.378 ± 0.335
2.756ProPro: 2.756 ± 0.524
2.067ProGln: 2.067 ± 0.433
2.411ProArg: 2.411 ± 0.452
3.858ProSer: 3.858 ± 0.586
2.756ProThr: 2.756 ± 0.425
3.996ProVal: 3.996 ± 0.615
0.413ProTrp: 0.413 ± 0.19
1.378ProTyr: 1.378 ± 0.286
0.0ProXaa: 0.0 ± 0.0
Gln
7.165GlnAla: 7.165 ± 0.853
0.344GlnCys: 0.344 ± 0.168
1.998GlnAsp: 1.998 ± 0.297
2.136GlnGlu: 2.136 ± 0.374
1.171GlnPhe: 1.171 ± 0.335
2.687GlnGly: 2.687 ± 0.557
0.689GlnHis: 0.689 ± 0.209
1.653GlnIle: 1.653 ± 0.281
1.24GlnLys: 1.24 ± 0.264
3.927GlnLeu: 3.927 ± 0.618
1.171GlnMet: 1.171 ± 0.251
1.102GlnAsn: 1.102 ± 0.282
1.653GlnPro: 1.653 ± 0.38
2.067GlnGln: 2.067 ± 0.38
2.48GlnArg: 2.48 ± 0.4
2.411GlnSer: 2.411 ± 0.322
2.411GlnThr: 2.411 ± 0.418
3.513GlnVal: 3.513 ± 0.623
1.033GlnTrp: 1.033 ± 0.291
0.689GlnTyr: 0.689 ± 0.25
0.0GlnXaa: 0.0 ± 0.0
Arg
7.302ArgAla: 7.302 ± 0.736
0.62ArgCys: 0.62 ± 0.223
4.133ArgAsp: 4.133 ± 0.579
3.513ArgGlu: 3.513 ± 0.497
1.791ArgPhe: 1.791 ± 0.262
5.236ArgGly: 5.236 ± 0.617
2.342ArgHis: 2.342 ± 0.524
3.376ArgIle: 3.376 ± 0.458
2.342ArgLys: 2.342 ± 0.371
4.822ArgLeu: 4.822 ± 0.403
1.929ArgMet: 1.929 ± 0.336
2.411ArgAsn: 2.411 ± 0.345
2.687ArgPro: 2.687 ± 0.425
2.411ArgGln: 2.411 ± 0.404
3.789ArgArg: 3.789 ± 0.513
2.48ArgSer: 2.48 ± 0.35
3.513ArgThr: 3.513 ± 0.494
3.996ArgVal: 3.996 ± 0.446
0.689ArgTrp: 0.689 ± 0.256
1.24ArgTyr: 1.24 ± 0.333
0.0ArgXaa: 0.0 ± 0.0
Ser
6.269SerAla: 6.269 ± 0.761
0.482SerCys: 0.482 ± 0.184
4.064SerAsp: 4.064 ± 0.547
2.687SerGlu: 2.687 ± 0.391
1.378SerPhe: 1.378 ± 0.311
5.373SerGly: 5.373 ± 0.78
0.758SerHis: 0.758 ± 0.215
2.549SerIle: 2.549 ± 0.433
2.549SerLys: 2.549 ± 0.381
4.96SerLeu: 4.96 ± 0.648
1.584SerMet: 1.584 ± 0.354
1.378SerAsn: 1.378 ± 0.182
2.342SerPro: 2.342 ± 0.436
2.067SerGln: 2.067 ± 0.406
3.513SerArg: 3.513 ± 0.512
3.582SerSer: 3.582 ± 0.405
3.651SerThr: 3.651 ± 0.492
4.271SerVal: 4.271 ± 0.483
0.482SerTrp: 0.482 ± 0.173
1.447SerTyr: 1.447 ± 0.266
0.0SerXaa: 0.0 ± 0.0
Thr
9.507ThrAla: 9.507 ± 0.92
0.758ThrCys: 0.758 ± 0.226
3.927ThrAsp: 3.927 ± 0.522
3.789ThrGlu: 3.789 ± 0.575
1.791ThrPhe: 1.791 ± 0.3
5.787ThrGly: 5.787 ± 0.519
1.378ThrHis: 1.378 ± 0.326
3.996ThrIle: 3.996 ± 0.537
2.411ThrLys: 2.411 ± 0.413
6.613ThrLeu: 6.613 ± 0.686
1.653ThrMet: 1.653 ± 0.34
2.687ThrAsn: 2.687 ± 0.547
4.478ThrPro: 4.478 ± 0.611
2.962ThrGln: 2.962 ± 0.448
3.651ThrArg: 3.651 ± 0.625
3.996ThrSer: 3.996 ± 0.656
5.511ThrThr: 5.511 ± 0.709
4.753ThrVal: 4.753 ± 0.5
1.24ThrTrp: 1.24 ± 0.293
1.929ThrTyr: 1.929 ± 0.451
0.0ThrXaa: 0.0 ± 0.0
Val
7.302ValAla: 7.302 ± 0.667
0.413ValCys: 0.413 ± 0.176
4.753ValAsp: 4.753 ± 0.53
3.996ValGlu: 3.996 ± 0.599
2.204ValPhe: 2.204 ± 0.304
4.753ValGly: 4.753 ± 0.607
0.62ValHis: 0.62 ± 0.187
2.962ValIle: 2.962 ± 0.529
2.824ValLys: 2.824 ± 0.357
5.442ValLeu: 5.442 ± 0.706
1.309ValMet: 1.309 ± 0.28
2.136ValAsn: 2.136 ± 0.354
3.376ValPro: 3.376 ± 0.478
2.618ValGln: 2.618 ± 0.369
3.996ValArg: 3.996 ± 0.537
4.547ValSer: 4.547 ± 0.547
6.545ValThr: 6.545 ± 0.791
4.96ValVal: 4.96 ± 0.56
1.171ValTrp: 1.171 ± 0.384
1.929ValTyr: 1.929 ± 0.267
0.0ValXaa: 0.0 ± 0.0
Trp
1.791TrpAla: 1.791 ± 0.356
0.138TrpCys: 0.138 ± 0.089
1.24TrpAsp: 1.24 ± 0.261
1.102TrpGlu: 1.102 ± 0.272
0.276TrpPhe: 0.276 ± 0.127
1.171TrpGly: 1.171 ± 0.301
0.551TrpHis: 0.551 ± 0.24
0.689TrpIle: 0.689 ± 0.207
1.102TrpLys: 1.102 ± 0.263
1.447TrpLeu: 1.447 ± 0.266
0.138TrpMet: 0.138 ± 0.087
0.482TrpAsn: 0.482 ± 0.151
0.896TrpPro: 0.896 ± 0.225
0.551TrpGln: 0.551 ± 0.188
1.24TrpArg: 1.24 ± 0.25
0.964TrpSer: 0.964 ± 0.246
1.447TrpThr: 1.447 ± 0.273
1.516TrpVal: 1.516 ± 0.323
0.482TrpTrp: 0.482 ± 0.181
0.207TrpTyr: 0.207 ± 0.105
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.48TyrAla: 2.48 ± 0.349
0.482TyrCys: 0.482 ± 0.222
1.378TyrAsp: 1.378 ± 0.297
1.309TyrGlu: 1.309 ± 0.31
0.689TyrPhe: 0.689 ± 0.252
2.342TyrGly: 2.342 ± 0.4
0.482TyrHis: 0.482 ± 0.153
1.171TyrIle: 1.171 ± 0.326
0.896TyrLys: 0.896 ± 0.238
1.584TyrLeu: 1.584 ± 0.299
0.551TyrMet: 0.551 ± 0.221
0.482TyrAsn: 0.482 ± 0.154
1.171TyrPro: 1.171 ± 0.311
1.24TyrGln: 1.24 ± 0.28
1.722TyrArg: 1.722 ± 0.243
1.791TyrSer: 1.791 ± 0.352
1.722TyrThr: 1.722 ± 0.394
2.273TyrVal: 2.273 ± 0.384
0.344TyrTrp: 0.344 ± 0.174
0.482TyrTyr: 0.482 ± 0.2
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 82 proteins (14517 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski