Amino acid dipepetide frequency for Mycobacterium virus Che9d

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.499AlaAla: 13.499 ± 1.672
1.162AlaCys: 1.162 ± 0.272
6.528AlaAsp: 6.528 ± 0.568
7.911AlaGlu: 7.911 ± 0.871
3.209AlaPhe: 3.209 ± 0.414
8.741AlaGly: 8.741 ± 1.209
1.936AlaHis: 1.936 ± 0.327
4.481AlaIle: 4.481 ± 0.468
5.367AlaLys: 5.367 ± 0.523
9.959AlaLeu: 9.959 ± 0.858
3.043AlaMet: 3.043 ± 0.367
3.209AlaAsn: 3.209 ± 0.479
3.983AlaPro: 3.983 ± 0.447
4.592AlaGln: 4.592 ± 0.522
6.252AlaArg: 6.252 ± 0.687
4.592AlaSer: 4.592 ± 0.486
5.145AlaThr: 5.145 ± 0.583
7.856AlaVal: 7.856 ± 0.68
2.213AlaTrp: 2.213 ± 0.385
1.936AlaTyr: 1.936 ± 0.343
0.0AlaXaa: 0.0 ± 0.0
Cys
1.051CysAla: 1.051 ± 0.282
0.387CysCys: 0.387 ± 0.185
0.941CysAsp: 0.941 ± 0.229
1.217CysGlu: 1.217 ± 0.224
0.166CysPhe: 0.166 ± 0.093
1.881CysGly: 1.881 ± 0.421
0.498CysHis: 0.498 ± 0.167
0.277CysIle: 0.277 ± 0.108
0.553CysLys: 0.553 ± 0.183
0.885CysLeu: 0.885 ± 0.253
0.0CysMet: 0.0 ± 0.0
0.387CysAsn: 0.387 ± 0.121
0.664CysPro: 0.664 ± 0.198
0.498CysGln: 0.498 ± 0.175
0.996CysArg: 0.996 ± 0.299
1.107CysSer: 1.107 ± 0.322
0.553CysThr: 0.553 ± 0.174
0.553CysVal: 0.553 ± 0.183
0.332CysTrp: 0.332 ± 0.148
0.221CysTyr: 0.221 ± 0.093
0.0CysXaa: 0.0 ± 0.0
Asp
6.971AspAla: 6.971 ± 0.579
0.664AspCys: 0.664 ± 0.177
4.537AspAsp: 4.537 ± 0.642
4.647AspGlu: 4.647 ± 0.646
2.268AspPhe: 2.268 ± 0.37
5.533AspGly: 5.533 ± 0.629
0.996AspHis: 0.996 ± 0.233
3.209AspIle: 3.209 ± 0.419
1.992AspLys: 1.992 ± 0.333
4.149AspLeu: 4.149 ± 0.529
1.162AspMet: 1.162 ± 0.259
1.826AspAsn: 1.826 ± 0.361
4.094AspPro: 4.094 ± 0.554
2.268AspGln: 2.268 ± 0.341
4.537AspArg: 4.537 ± 0.491
3.098AspSer: 3.098 ± 0.327
3.264AspThr: 3.264 ± 0.406
4.758AspVal: 4.758 ± 0.481
1.383AspTrp: 1.383 ± 0.244
1.604AspTyr: 1.604 ± 0.324
0.0AspXaa: 0.0 ± 0.0
Glu
6.971GluAla: 6.971 ± 0.588
1.162GluCys: 1.162 ± 0.316
3.43GluAsp: 3.43 ± 0.528
3.32GluGlu: 3.32 ± 0.577
2.213GluPhe: 2.213 ± 0.325
4.647GluGly: 4.647 ± 0.601
1.66GluHis: 1.66 ± 0.309
2.434GluIle: 2.434 ± 0.353
2.324GluLys: 2.324 ± 0.367
6.196GluLeu: 6.196 ± 0.548
1.051GluMet: 1.051 ± 0.215
1.66GluAsn: 1.66 ± 0.263
3.651GluPro: 3.651 ± 0.457
2.656GluGln: 2.656 ± 0.349
4.481GluArg: 4.481 ± 0.712
3.541GluSer: 3.541 ± 0.412
2.877GluThr: 2.877 ± 0.426
4.426GluVal: 4.426 ± 0.474
2.158GluTrp: 2.158 ± 0.339
1.549GluTyr: 1.549 ± 0.282
0.0GluXaa: 0.0 ± 0.0
Phe
2.49PheAla: 2.49 ± 0.301
0.277PheCys: 0.277 ± 0.113
2.434PheAsp: 2.434 ± 0.35
1.715PheGlu: 1.715 ± 0.327
0.664PhePhe: 0.664 ± 0.174
2.766PheGly: 2.766 ± 0.427
0.719PheHis: 0.719 ± 0.184
1.604PheIle: 1.604 ± 0.266
0.941PheLys: 0.941 ± 0.259
1.936PheLeu: 1.936 ± 0.32
0.885PheMet: 0.885 ± 0.2
0.885PheAsn: 0.885 ± 0.206
1.272PhePro: 1.272 ± 0.232
0.775PheGln: 0.775 ± 0.219
1.494PheArg: 1.494 ± 0.287
1.992PheSer: 1.992 ± 0.312
2.545PheThr: 2.545 ± 0.33
1.66PheVal: 1.66 ± 0.304
0.719PheTrp: 0.719 ± 0.193
0.775PheTyr: 0.775 ± 0.193
0.0PheXaa: 0.0 ± 0.0
Gly
8.797GlyAla: 8.797 ± 1.266
1.107GlyCys: 1.107 ± 0.294
5.145GlyAsp: 5.145 ± 0.507
4.979GlyGlu: 4.979 ± 0.53
2.434GlyPhe: 2.434 ± 0.385
10.456GlyGly: 10.456 ± 2.049
2.268GlyHis: 2.268 ± 0.327
3.873GlyIle: 3.873 ± 0.468
2.766GlyLys: 2.766 ± 0.464
6.086GlyLeu: 6.086 ± 0.655
1.715GlyMet: 1.715 ± 0.289
2.822GlyAsn: 2.822 ± 0.414
4.426GlyPro: 4.426 ± 0.608
3.928GlyGln: 3.928 ± 0.499
5.809GlyArg: 5.809 ± 0.587
5.477GlySer: 5.477 ± 0.979
6.086GlyThr: 6.086 ± 0.61
7.248GlyVal: 7.248 ± 0.643
2.268GlyTrp: 2.268 ± 0.358
1.881GlyTyr: 1.881 ± 0.368
0.0GlyXaa: 0.0 ± 0.0
His
1.383HisAla: 1.383 ± 0.233
0.498HisCys: 0.498 ± 0.157
1.272HisAsp: 1.272 ± 0.281
1.383HisGlu: 1.383 ± 0.273
0.664HisPhe: 0.664 ± 0.173
1.992HisGly: 1.992 ± 0.302
0.664HisHis: 0.664 ± 0.185
0.941HisIle: 0.941 ± 0.222
0.83HisLys: 0.83 ± 0.222
1.438HisLeu: 1.438 ± 0.269
0.498HisMet: 0.498 ± 0.16
0.719HisAsn: 0.719 ± 0.221
1.107HisPro: 1.107 ± 0.183
0.553HisGln: 0.553 ± 0.202
1.66HisArg: 1.66 ± 0.316
1.272HisSer: 1.272 ± 0.253
1.272HisThr: 1.272 ± 0.288
1.715HisVal: 1.715 ± 0.336
0.664HisTrp: 0.664 ± 0.189
0.719HisTyr: 0.719 ± 0.226
0.0HisXaa: 0.0 ± 0.0
Ile
5.035IleAla: 5.035 ± 0.515
0.553IleCys: 0.553 ± 0.189
3.707IleAsp: 3.707 ± 0.497
3.43IleGlu: 3.43 ± 0.409
0.775IlePhe: 0.775 ± 0.187
4.813IleGly: 4.813 ± 0.517
0.996IleHis: 0.996 ± 0.215
2.268IleIle: 2.268 ± 0.341
1.494IleLys: 1.494 ± 0.28
2.379IleLeu: 2.379 ± 0.402
0.719IleMet: 0.719 ± 0.196
1.77IleAsn: 1.77 ± 0.44
2.6IlePro: 2.6 ± 0.356
1.494IleGln: 1.494 ± 0.321
3.485IleArg: 3.485 ± 0.368
2.268IleSer: 2.268 ± 0.397
3.651IleThr: 3.651 ± 0.565
3.043IleVal: 3.043 ± 0.312
0.941IleTrp: 0.941 ± 0.211
0.775IleTyr: 0.775 ± 0.194
0.0IleXaa: 0.0 ± 0.0
Lys
4.26LysAla: 4.26 ± 0.532
0.443LysCys: 0.443 ± 0.164
1.826LysAsp: 1.826 ± 0.293
1.715LysGlu: 1.715 ± 0.358
0.885LysPhe: 0.885 ± 0.194
2.213LysGly: 2.213 ± 0.321
1.162LysHis: 1.162 ± 0.254
1.77LysIle: 1.77 ± 0.311
1.494LysLys: 1.494 ± 0.316
2.6LysLeu: 2.6 ± 0.35
0.83LysMet: 0.83 ± 0.204
1.107LysAsn: 1.107 ± 0.298
2.932LysPro: 2.932 ± 0.43
1.77LysGln: 1.77 ± 0.416
3.32LysArg: 3.32 ± 0.508
2.158LysSer: 2.158 ± 0.366
1.936LysThr: 1.936 ± 0.339
2.158LysVal: 2.158 ± 0.335
1.051LysTrp: 1.051 ± 0.195
0.885LysTyr: 0.885 ± 0.245
0.0LysXaa: 0.0 ± 0.0
Leu
9.239LeuAla: 9.239 ± 0.948
0.609LeuCys: 0.609 ± 0.182
5.588LeuAsp: 5.588 ± 0.583
3.928LeuGlu: 3.928 ± 0.531
2.324LeuPhe: 2.324 ± 0.364
7.026LeuGly: 7.026 ± 0.524
0.996LeuHis: 0.996 ± 0.246
3.983LeuIle: 3.983 ± 0.547
3.32LeuLys: 3.32 ± 0.411
5.643LeuLeu: 5.643 ± 0.728
1.107LeuMet: 1.107 ± 0.306
2.324LeuAsn: 2.324 ± 0.36
5.145LeuPro: 5.145 ± 0.562
2.766LeuGln: 2.766 ± 0.418
4.647LeuArg: 4.647 ± 0.514
4.979LeuSer: 4.979 ± 0.539
5.256LeuThr: 5.256 ± 0.524
5.09LeuVal: 5.09 ± 0.528
1.162LeuTrp: 1.162 ± 0.27
1.936LeuTyr: 1.936 ± 0.242
0.0LeuXaa: 0.0 ± 0.0
Met
2.822MetAla: 2.822 ± 0.367
0.498MetCys: 0.498 ± 0.167
0.719MetAsp: 0.719 ± 0.186
0.885MetGlu: 0.885 ± 0.233
0.719MetPhe: 0.719 ± 0.291
1.438MetGly: 1.438 ± 0.367
0.221MetHis: 0.221 ± 0.094
1.051MetIle: 1.051 ± 0.224
0.719MetLys: 0.719 ± 0.246
1.438MetLeu: 1.438 ± 0.226
0.387MetMet: 0.387 ± 0.151
0.719MetAsn: 0.719 ± 0.21
1.051MetPro: 1.051 ± 0.217
0.332MetGln: 0.332 ± 0.106
1.604MetArg: 1.604 ± 0.319
2.268MetSer: 2.268 ± 0.402
1.992MetThr: 1.992 ± 0.289
1.162MetVal: 1.162 ± 0.198
0.498MetTrp: 0.498 ± 0.171
0.443MetTyr: 0.443 ± 0.13
0.0MetXaa: 0.0 ± 0.0
Asn
3.707AsnAla: 3.707 ± 0.55
0.387AsnCys: 0.387 ± 0.144
1.272AsnAsp: 1.272 ± 0.204
1.715AsnGlu: 1.715 ± 0.3
0.553AsnPhe: 0.553 ± 0.186
4.205AsnGly: 4.205 ± 0.523
0.664AsnHis: 0.664 ± 0.205
1.494AsnIle: 1.494 ± 0.263
0.83AsnLys: 0.83 ± 0.177
2.877AsnLeu: 2.877 ± 0.353
0.553AsnMet: 0.553 ± 0.143
1.494AsnAsn: 1.494 ± 0.251
2.379AsnPro: 2.379 ± 0.463
0.609AsnGln: 0.609 ± 0.193
1.936AsnArg: 1.936 ± 0.387
1.549AsnSer: 1.549 ± 0.272
2.434AsnThr: 2.434 ± 0.342
2.102AsnVal: 2.102 ± 0.276
0.443AsnTrp: 0.443 ± 0.128
0.664AsnTyr: 0.664 ± 0.177
0.0AsnXaa: 0.0 ± 0.0
Pro
5.145ProAla: 5.145 ± 0.489
0.609ProCys: 0.609 ± 0.192
4.426ProAsp: 4.426 ± 0.526
4.979ProGlu: 4.979 ± 0.476
1.549ProPhe: 1.549 ± 0.283
5.698ProGly: 5.698 ± 0.711
1.383ProHis: 1.383 ± 0.268
2.268ProIle: 2.268 ± 0.363
2.213ProLys: 2.213 ± 0.348
2.932ProLeu: 2.932 ± 0.445
1.051ProMet: 1.051 ± 0.204
2.158ProAsn: 2.158 ± 0.363
3.651ProPro: 3.651 ± 0.636
1.992ProGln: 1.992 ± 0.357
3.209ProArg: 3.209 ± 0.428
2.49ProSer: 2.49 ± 0.411
3.873ProThr: 3.873 ± 0.453
4.094ProVal: 4.094 ± 0.562
1.217ProTrp: 1.217 ± 0.258
1.826ProTyr: 1.826 ± 0.264
0.0ProXaa: 0.0 ± 0.0
Gln
3.817GlnAla: 3.817 ± 0.545
0.609GlnCys: 0.609 ± 0.225
1.383GlnAsp: 1.383 ± 0.226
1.715GlnGlu: 1.715 ± 0.321
0.996GlnPhe: 0.996 ± 0.251
2.877GlnGly: 2.877 ± 0.4
1.107GlnHis: 1.107 ± 0.228
2.324GlnIle: 2.324 ± 0.365
1.438GlnLys: 1.438 ± 0.439
3.209GlnLeu: 3.209 ± 0.388
1.328GlnMet: 1.328 ± 0.28
1.383GlnAsn: 1.383 ± 0.305
2.545GlnPro: 2.545 ± 0.392
1.66GlnGln: 1.66 ± 0.348
2.932GlnArg: 2.932 ± 0.408
2.102GlnSer: 2.102 ± 0.266
1.438GlnThr: 1.438 ± 0.25
2.047GlnVal: 2.047 ± 0.334
0.885GlnTrp: 0.885 ± 0.237
0.941GlnTyr: 0.941 ± 0.245
0.0GlnXaa: 0.0 ± 0.0
Arg
6.362ArgAla: 6.362 ± 0.661
1.217ArgCys: 1.217 ± 0.285
4.26ArgAsp: 4.26 ± 0.467
4.315ArgGlu: 4.315 ± 0.539
1.881ArgPhe: 1.881 ± 0.313
4.537ArgGly: 4.537 ± 0.447
1.494ArgHis: 1.494 ± 0.283
2.988ArgIle: 2.988 ± 0.446
2.711ArgLys: 2.711 ± 0.485
5.367ArgLeu: 5.367 ± 0.617
2.047ArgMet: 2.047 ± 0.428
1.549ArgAsn: 1.549 ± 0.271
3.817ArgPro: 3.817 ± 0.53
2.877ArgGln: 2.877 ± 0.406
7.082ArgArg: 7.082 ± 0.885
4.094ArgSer: 4.094 ± 0.42
3.541ArgThr: 3.541 ± 0.404
5.367ArgVal: 5.367 ± 0.65
2.158ArgTrp: 2.158 ± 0.378
2.047ArgTyr: 2.047 ± 0.347
0.0ArgXaa: 0.0 ± 0.0
Ser
5.533SerAla: 5.533 ± 0.628
0.498SerCys: 0.498 ± 0.2
3.762SerAsp: 3.762 ± 0.43
3.375SerGlu: 3.375 ± 0.493
1.826SerPhe: 1.826 ± 0.358
6.362SerGly: 6.362 ± 0.971
0.996SerHis: 0.996 ± 0.261
2.988SerIle: 2.988 ± 0.362
1.715SerLys: 1.715 ± 0.364
4.426SerLeu: 4.426 ± 0.548
1.66SerMet: 1.66 ± 0.256
2.545SerAsn: 2.545 ± 0.375
3.098SerPro: 3.098 ± 0.409
2.158SerGln: 2.158 ± 0.261
3.928SerArg: 3.928 ± 0.513
4.205SerSer: 4.205 ± 0.704
3.485SerThr: 3.485 ± 0.442
4.371SerVal: 4.371 ± 0.642
1.604SerTrp: 1.604 ± 0.316
0.996SerTyr: 0.996 ± 0.236
0.0SerXaa: 0.0 ± 0.0
Thr
6.805ThrAla: 6.805 ± 0.72
0.553ThrCys: 0.553 ± 0.196
3.707ThrAsp: 3.707 ± 0.472
3.209ThrGlu: 3.209 ± 0.423
1.604ThrPhe: 1.604 ± 0.357
5.92ThrGly: 5.92 ± 0.752
0.83ThrHis: 0.83 ± 0.228
3.043ThrIle: 3.043 ± 0.459
1.494ThrLys: 1.494 ± 0.249
5.367ThrLeu: 5.367 ± 0.497
0.719ThrMet: 0.719 ± 0.195
2.047ThrAsn: 2.047 ± 0.354
4.537ThrPro: 4.537 ± 0.516
1.826ThrGln: 1.826 ± 0.336
3.762ThrArg: 3.762 ± 0.418
4.371ThrSer: 4.371 ± 0.505
3.651ThrThr: 3.651 ± 0.511
4.869ThrVal: 4.869 ± 0.636
1.936ThrTrp: 1.936 ± 0.333
0.775ThrTyr: 0.775 ± 0.268
0.0ThrXaa: 0.0 ± 0.0
Val
7.524ValAla: 7.524 ± 0.573
0.996ValCys: 0.996 ± 0.229
4.481ValAsp: 4.481 ± 0.483
5.643ValGlu: 5.643 ± 0.594
2.545ValPhe: 2.545 ± 0.421
4.592ValGly: 4.592 ± 0.594
1.715ValHis: 1.715 ± 0.34
3.098ValIle: 3.098 ± 0.499
2.434ValLys: 2.434 ± 0.376
5.533ValLeu: 5.533 ± 0.647
1.107ValMet: 1.107 ± 0.235
1.77ValAsn: 1.77 ± 0.361
3.651ValPro: 3.651 ± 0.345
2.324ValGln: 2.324 ± 0.37
4.205ValArg: 4.205 ± 0.555
5.311ValSer: 5.311 ± 0.718
4.813ValThr: 4.813 ± 0.732
6.252ValVal: 6.252 ± 0.713
1.881ValTrp: 1.881 ± 0.353
2.158ValTyr: 2.158 ± 0.356
0.0ValXaa: 0.0 ± 0.0
Trp
1.881TrpAla: 1.881 ± 0.258
0.443TrpCys: 0.443 ± 0.152
1.77TrpAsp: 1.77 ± 0.34
1.162TrpGlu: 1.162 ± 0.256
0.609TrpPhe: 0.609 ± 0.178
1.438TrpGly: 1.438 ± 0.341
0.443TrpHis: 0.443 ± 0.169
1.217TrpIle: 1.217 ± 0.276
1.272TrpLys: 1.272 ± 0.227
2.988TrpLeu: 2.988 ± 0.391
0.719TrpMet: 0.719 ± 0.195
0.609TrpAsn: 0.609 ± 0.165
1.107TrpPro: 1.107 ± 0.254
0.83TrpGln: 0.83 ± 0.228
1.826TrpArg: 1.826 ± 0.332
1.604TrpSer: 1.604 ± 0.328
1.604TrpThr: 1.604 ± 0.305
1.77TrpVal: 1.77 ± 0.387
0.719TrpTrp: 0.719 ± 0.199
0.719TrpTyr: 0.719 ± 0.2
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.324TyrAla: 2.324 ± 0.384
0.498TyrCys: 0.498 ± 0.17
1.77TyrAsp: 1.77 ± 0.252
1.162TyrGlu: 1.162 ± 0.24
0.664TyrPhe: 0.664 ± 0.233
2.434TyrGly: 2.434 ± 0.437
0.498TyrHis: 0.498 ± 0.179
0.885TyrIle: 0.885 ± 0.187
0.609TyrLys: 0.609 ± 0.17
1.66TyrLeu: 1.66 ± 0.3
0.332TyrMet: 0.332 ± 0.135
0.885TyrAsn: 0.885 ± 0.21
1.051TyrPro: 1.051 ± 0.242
0.83TyrGln: 0.83 ± 0.207
2.545TyrArg: 2.545 ± 0.415
1.051TyrSer: 1.051 ± 0.229
1.604TyrThr: 1.604 ± 0.267
1.549TyrVal: 1.549 ± 0.238
0.498TyrTrp: 0.498 ± 0.202
0.498TyrTyr: 0.498 ± 0.143
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 111 proteins (18076 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski