Amino acid dipeptide frequency for Mycobacterium virus Bron

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
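As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping adjacent pairs within each protein and reports them in per mille. It is only a minimal example under stated assumptions: the function name `dipeptide_frequencies`, the toy sequences, the one-letter codes, and the normalization by the total number of counted dipeptides are all hypothetical choices, not the procedure documented for this database.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille for a list of protein sequences.

    Assumption: pairs are counted with overlap inside each protein and are
    never formed across the boundary between two proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Normalize by the total number of counted dipeptides (an assumption).
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, one-letter amino acid codes).
toy = ["MAAK", "AAG"]
freqs = dipeptide_frequencies(toy)
```

With the toy input, five dipeptides are counted in total (MA, AA, AK, AA, AG), so `freqs["AA"]` corresponds to two out of five occurrences.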

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
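The arithmetic behind the expected value and the per mille scale can be sketched as follows. This is an illustrative example only: the helper names are hypothetical, and comparing each value against the uniform expectation (1/400) is an assumption about how "more than expected" is decided, since the exact colouring rule is not spelled out here. The two sample values are taken from the table.

```python
def uniform_expectation_per_mille(n_amino_acids=20):
    """Expected per mille frequency if all dipeptides were equally likely."""
    return 1000.0 / (n_amino_acids ** 2)


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected_per_mille):
    """Label a dipeptide relative to the expectation (assumed rule)."""
    if value_per_mille > expected_per_mille:
        return "over"
    if value_per_mille < expected_per_mille:
        return "under"
    return "as expected"


expected = uniform_expectation_per_mille()  # 1000 / 400
examples = {"AlaAla": 11.792, "CysCys": 0.269}  # values from the table
labels = {name: classify(v, expected) for name, v in examples.items()}
```

Passing `n_amino_acids=21` gives the corresponding expectation when Xaa is treated as a 21st amino acid (1000/441 per mille).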

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.792 ± 0.96
AlaCys: 1.3 ± 0.314
AlaAsp: 6.098 ± 0.541
AlaGlu: 7.308 ± 0.656
AlaPhe: 3.138 ± 0.373
AlaGly: 8.025 ± 0.7
AlaHis: 1.973 ± 0.324
AlaIle: 4.753 ± 0.539
AlaLys: 7.846 ± 0.576
AlaLeu: 9.012 ± 0.651
AlaMet: 2.78 ± 0.338
AlaAsn: 3.542 ± 0.485
AlaPro: 4.349 ± 0.478
AlaGln: 4.349 ± 0.798
AlaArg: 5.291 ± 0.434
AlaSer: 5.066 ± 0.636
AlaThr: 5.739 ± 0.425
AlaVal: 6.322 ± 0.578
AlaTrp: 1.749 ± 0.254
AlaTyr: 2.825 ± 0.369
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.121 ± 0.255
CysCys: 0.269 ± 0.131
CysAsp: 0.897 ± 0.227
CysGlu: 0.628 ± 0.144
CysPhe: 0.224 ± 0.09
CysGly: 1.345 ± 0.279
CysHis: 0.359 ± 0.139
CysIle: 0.404 ± 0.135
CysLys: 0.314 ± 0.094
CysLeu: 0.942 ± 0.196
CysMet: 0.404 ± 0.15
CysAsn: 0.404 ± 0.144
CysPro: 0.762 ± 0.194
CysGln: 0.717 ± 0.202
CysArg: 0.986 ± 0.271
CysSer: 0.314 ± 0.097
CysThr: 0.493 ± 0.157
CysVal: 0.673 ± 0.204
CysTrp: 0.045 ± 0.047
CysTyr: 0.314 ± 0.13
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.604 ± 0.509
AspCys: 0.538 ± 0.182
AspAsp: 3.945 ± 0.494
AspGlu: 4.932 ± 0.568
AspPhe: 1.838 ± 0.247
AspGly: 5.335 ± 0.441
AspHis: 1.345 ± 0.264
AspIle: 2.735 ± 0.347
AspLys: 3.542 ± 0.478
AspLeu: 5.515 ± 0.542
AspMet: 1.48 ± 0.309
AspAsn: 1.928 ± 0.244
AspPro: 3.318 ± 0.309
AspGln: 2.018 ± 0.315
AspArg: 3.542 ± 0.435
AspSer: 3.587 ± 0.376
AspThr: 3.004 ± 0.389
AspVal: 3.901 ± 0.483
AspTrp: 1.48 ± 0.26
AspTyr: 2.152 ± 0.366
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.591 ± 0.55
GluCys: 0.942 ± 0.24
GluAsp: 3.587 ± 0.345
GluGlu: 2.869 ± 0.4
GluPhe: 2.376 ± 0.393
GluGly: 4.618 ± 0.443
GluHis: 2.152 ± 0.294
GluIle: 2.78 ± 0.33
GluLys: 2.645 ± 0.4
GluLeu: 7.622 ± 0.65
GluMet: 1.704 ± 0.315
GluAsn: 1.883 ± 0.339
GluPro: 2.78 ± 0.454
GluGln: 1.883 ± 0.363
GluArg: 5.246 ± 0.565
GluSer: 2.69 ± 0.386
GluThr: 4.08 ± 0.467
GluVal: 3.721 ± 0.345
GluTrp: 1.076 ± 0.204
GluTyr: 2.376 ± 0.335
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.004 ± 0.351
PheCys: 0.448 ± 0.144
PheAsp: 1.749 ± 0.321
PheGlu: 1.973 ± 0.347
PhePhe: 1.121 ± 0.212
PheGly: 2.556 ± 0.315
PheHis: 0.583 ± 0.172
PheIle: 1.569 ± 0.268
PheLys: 1.166 ± 0.296
PheLeu: 2.376 ± 0.374
PheMet: 0.807 ± 0.154
PheAsn: 1.345 ± 0.216
PhePro: 1.569 ± 0.278
PheGln: 0.897 ± 0.181
PheArg: 2.062 ± 0.275
PheSer: 1.704 ± 0.255
PheThr: 1.838 ± 0.254
PheVal: 2.242 ± 0.269
PheTrp: 0.673 ± 0.148
PheTyr: 1.3 ± 0.253
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.905 ± 0.793
GlyCys: 0.852 ± 0.216
GlyAsp: 4.932 ± 0.446
GlyGlu: 5.425 ± 0.366
GlyPhe: 2.78 ± 0.411
GlyGly: 8.07 ± 1.377
GlyHis: 1.614 ± 0.274
GlyIle: 3.587 ± 0.43
GlyLys: 5.156 ± 0.438
GlyLeu: 6.411 ± 0.575
GlyMet: 2.197 ± 0.32
GlyAsn: 3.183 ± 0.466
GlyPro: 3.901 ± 0.601
GlyGln: 2.69 ± 0.431
GlyArg: 4.753 ± 0.472
GlySer: 4.304 ± 0.482
GlyThr: 5.156 ± 0.633
GlyVal: 5.56 ± 0.492
GlyTrp: 1.793 ± 0.306
GlyTyr: 3.318 ± 0.403
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.524 ± 0.258
HisCys: 0.269 ± 0.112
HisAsp: 1.211 ± 0.251
HisGlu: 1.435 ± 0.275
HisPhe: 0.717 ± 0.171
HisGly: 1.659 ± 0.323
HisHis: 0.269 ± 0.106
HisIle: 0.807 ± 0.176
HisLys: 0.762 ± 0.192
HisLeu: 1.749 ± 0.252
HisMet: 0.583 ± 0.185
HisAsn: 0.538 ± 0.136
HisPro: 1.3 ± 0.258
HisGln: 0.09 ± 0.056
HisArg: 1.435 ± 0.27
HisSer: 1.121 ± 0.217
HisThr: 1.48 ± 0.263
HisVal: 1.793 ± 0.309
HisTrp: 0.404 ± 0.138
HisTyr: 0.942 ± 0.247
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.439 ± 0.499
IleCys: 0.493 ± 0.167
IleAsp: 3.004 ± 0.313
IleGlu: 3.452 ± 0.426
IlePhe: 1.076 ± 0.202
IleGly: 3.318 ± 0.432
IleHis: 0.628 ± 0.196
IleIle: 1.704 ± 0.31
IleLys: 1.524 ± 0.218
IleLeu: 3.273 ± 0.431
IleMet: 0.807 ± 0.175
IleAsn: 1.838 ± 0.274
IlePro: 2.78 ± 0.36
IleGln: 1.883 ± 0.225
IleArg: 3.676 ± 0.442
IleSer: 2.107 ± 0.348
IleThr: 2.287 ± 0.296
IleVal: 3.452 ± 0.432
IleTrp: 0.762 ± 0.196
IleTyr: 0.897 ± 0.222
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.636 ± 0.549
LysCys: 0.673 ± 0.195
LysAsp: 2.556 ± 0.405
LysGlu: 2.735 ± 0.401
LysPhe: 1.569 ± 0.258
LysGly: 3.856 ± 0.389
LysHis: 1.031 ± 0.223
LysIle: 1.928 ± 0.281
LysLys: 2.511 ± 0.351
LysLeu: 3.99 ± 0.41
LysMet: 1.211 ± 0.237
LysAsn: 2.242 ± 0.308
LysPro: 3.452 ± 0.522
LysGln: 2.018 ± 0.425
LysArg: 4.663 ± 0.492
LysSer: 2.242 ± 0.398
LysThr: 1.883 ± 0.315
LysVal: 3.049 ± 0.374
LysTrp: 1.39 ± 0.267
LysTyr: 1.793 ± 0.326
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.967 ± 0.594
LeuCys: 1.166 ± 0.2
LeuAsp: 5.694 ± 0.499
LeuGlu: 4.394 ± 0.525
LeuPhe: 2.78 ± 0.352
LeuGly: 5.604 ± 0.814
LeuHis: 1.749 ± 0.28
LeuIle: 3.183 ± 0.424
LeuLys: 4.618 ± 0.465
LeuLeu: 6.68 ± 0.533
LeuMet: 2.376 ± 0.336
LeuAsn: 3.721 ± 0.434
LeuPro: 5.111 ± 0.464
LeuGln: 2.331 ± 0.294
LeuArg: 4.797 ± 0.495
LeuSer: 5.649 ± 0.578
LeuThr: 5.918 ± 0.508
LeuVal: 5.739 ± 0.517
LeuTrp: 1.704 ± 0.343
LeuTyr: 2.376 ± 0.347
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.049 ± 0.373
MetCys: 0.224 ± 0.097
MetAsp: 1.255 ± 0.271
MetGlu: 1.48 ± 0.27
MetPhe: 0.583 ± 0.177
MetGly: 1.48 ± 0.237
MetHis: 0.314 ± 0.101
MetIle: 0.628 ± 0.192
MetLys: 0.852 ± 0.172
MetLeu: 1.659 ± 0.296
MetMet: 0.538 ± 0.166
MetAsn: 0.717 ± 0.155
MetPro: 1.211 ± 0.248
MetGln: 0.717 ± 0.208
MetArg: 1.749 ± 0.261
MetSer: 2.242 ± 0.328
MetThr: 1.659 ± 0.308
MetVal: 1.704 ± 0.317
MetTrp: 0.717 ± 0.223
MetTyr: 0.717 ± 0.178
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.676 ± 0.569
AsnCys: 0.179 ± 0.089
AsnAsp: 1.928 ± 0.319
AsnGlu: 2.152 ± 0.3
AsnPhe: 0.942 ± 0.168
AsnGly: 4.259 ± 0.398
AsnHis: 0.628 ± 0.149
AsnIle: 1.569 ± 0.29
AsnLys: 1.166 ± 0.217
AsnLeu: 3.497 ± 0.467
AsnMet: 0.942 ± 0.241
AsnAsn: 1.48 ± 0.307
AsnPro: 2.421 ± 0.331
AsnGln: 1.614 ± 0.23
AsnArg: 2.466 ± 0.334
AsnSer: 2.107 ± 0.315
AsnThr: 2.376 ± 0.319
AsnVal: 2.331 ± 0.387
AsnTrp: 0.538 ± 0.138
AsnTyr: 1.076 ± 0.226
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.035 ± 0.538
ProCys: 0.628 ± 0.199
ProAsp: 4.125 ± 0.444
ProGlu: 4.17 ± 0.581
ProPhe: 1.48 ± 0.273
ProGly: 4.663 ± 0.567
ProHis: 0.673 ± 0.202
ProIle: 2.511 ± 0.263
ProLys: 3.183 ± 0.441
ProLeu: 3.228 ± 0.361
ProMet: 1.076 ± 0.213
ProAsn: 2.376 ± 0.295
ProPro: 2.556 ± 0.294
ProGln: 1.345 ± 0.248
ProArg: 3.138 ± 0.37
ProSer: 3.318 ± 0.386
ProThr: 3.183 ± 0.41
ProVal: 4.17 ± 0.382
ProTrp: 1.255 ± 0.228
ProTyr: 1.345 ± 0.229
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.977 ± 0.537
GlnCys: 0.314 ± 0.119
GlnAsp: 1.076 ± 0.306
GlnGlu: 2.331 ± 0.328
GlnPhe: 1.255 ± 0.24
GlnGly: 2.735 ± 0.34
GlnHis: 0.583 ± 0.158
GlnIle: 1.928 ± 0.29
GlnLys: 1.435 ± 0.241
GlnLeu: 3.318 ± 0.509
GlnMet: 0.628 ± 0.166
GlnAsn: 0.852 ± 0.216
GlnPro: 1.524 ± 0.298
GlnGln: 1.704 ± 0.373
GlnArg: 2.466 ± 0.316
GlnSer: 1.524 ± 0.326
GlnThr: 1.121 ± 0.203
GlnVal: 2.825 ± 0.349
GlnTrp: 0.583 ± 0.141
GlnTyr: 1.031 ± 0.248
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.084 ± 0.562
ArgCys: 1.076 ± 0.273
ArgAsp: 4.304 ± 0.437
ArgGlu: 5.156 ± 0.536
ArgPhe: 1.883 ± 0.283
ArgGly: 5.56 ± 0.469
ArgHis: 1.749 ± 0.351
ArgIle: 2.825 ± 0.419
ArgLys: 3.766 ± 0.57
ArgLeu: 4.618 ± 0.449
ArgMet: 1.3 ± 0.221
ArgAsn: 2.69 ± 0.268
ArgPro: 2.511 ± 0.365
ArgGln: 1.928 ± 0.3
ArgArg: 4.125 ± 0.417
ArgSer: 3.228 ± 0.377
ArgThr: 3.273 ± 0.368
ArgVal: 4.349 ± 0.474
ArgTrp: 1.614 ± 0.288
ArgTyr: 2.018 ± 0.361
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.38 ± 0.558
SerCys: 0.179 ± 0.121
SerAsp: 3.721 ± 0.383
SerGlu: 3.407 ± 0.348
SerPhe: 2.018 ± 0.312
SerGly: 5.291 ± 0.59
SerHis: 0.807 ± 0.139
SerIle: 2.6 ± 0.325
SerLys: 2.331 ± 0.349
SerLeu: 4.573 ± 0.505
SerMet: 0.986 ± 0.19
SerAsn: 1.749 ± 0.195
SerPro: 2.869 ± 0.4
SerGln: 1.793 ± 0.238
SerArg: 3.049 ± 0.341
SerSer: 2.735 ± 0.348
SerThr: 3.228 ± 0.358
SerVal: 4.618 ± 0.511
SerTrp: 1.524 ± 0.318
SerTyr: 1.39 ± 0.215
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.322 ± 0.572
ThrCys: 0.538 ± 0.179
ThrAsp: 3.273 ± 0.306
ThrGlu: 2.914 ± 0.305
ThrPhe: 2.197 ± 0.388
ThrGly: 4.663 ± 0.576
ThrHis: 1.121 ± 0.275
ThrIle: 2.959 ± 0.356
ThrLys: 2.645 ± 0.362
ThrLeu: 5.873 ± 0.587
ThrMet: 1.031 ± 0.239
ThrAsn: 2.287 ± 0.396
ThrPro: 3.228 ± 0.328
ThrGln: 1.883 ± 0.288
ThrArg: 3.004 ± 0.388
ThrSer: 2.914 ± 0.535
ThrThr: 3.049 ± 0.348
ThrVal: 4.977 ± 0.517
ThrTrp: 1.121 ± 0.219
ThrTyr: 1.524 ± 0.267
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.398 ± 0.593
ValCys: 0.897 ± 0.193
ValAsp: 4.887 ± 0.46
ValGlu: 3.497 ± 0.385
ValPhe: 1.48 ± 0.32
ValGly: 5.56 ± 0.499
ValHis: 1.255 ± 0.215
ValIle: 3.318 ± 0.386
ValLys: 4.125 ± 0.471
ValLeu: 5.604 ± 0.553
ValMet: 1.793 ± 0.272
ValAsn: 2.645 ± 0.438
ValPro: 3.811 ± 0.466
ValGln: 2.376 ± 0.356
ValArg: 4.394 ± 0.4
ValSer: 4.618 ± 0.53
ValThr: 4.394 ± 0.457
ValVal: 5.829 ± 0.555
ValTrp: 1.211 ± 0.194
ValTyr: 1.883 ± 0.327
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.838 ± 0.284
TrpCys: 0.179 ± 0.087
TrpAsp: 1.524 ± 0.278
TrpGlu: 1.211 ± 0.225
TrpPhe: 0.673 ± 0.168
TrpGly: 1.076 ± 0.202
TrpHis: 0.628 ± 0.193
TrpIle: 0.583 ± 0.149
TrpLys: 0.583 ± 0.154
TrpLeu: 2.018 ± 0.305
TrpMet: 0.583 ± 0.164
TrpAsn: 0.986 ± 0.177
TrpPro: 1.121 ± 0.232
TrpGln: 1.031 ± 0.243
TrpArg: 1.569 ± 0.223
TrpSer: 1.211 ± 0.264
TrpThr: 1.614 ± 0.265
TrpVal: 1.614 ± 0.24
TrpTrp: 0.583 ± 0.188
TrpTyr: 0.448 ± 0.151
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.959 ± 0.439
TyrCys: 0.448 ± 0.182
TyrAsp: 1.973 ± 0.237
TyrGlu: 2.152 ± 0.374
TyrPhe: 0.852 ± 0.22
TyrGly: 2.914 ± 0.289
TyrHis: 0.628 ± 0.154
TyrIle: 1.031 ± 0.266
TyrLys: 1.255 ± 0.235
TyrLeu: 2.6 ± 0.357
TyrMet: 0.224 ± 0.125
TyrAsn: 0.942 ± 0.212
TyrPro: 1.883 ± 0.319
TyrGln: 0.942 ± 0.188
TyrArg: 2.645 ± 0.335
TyrSer: 1.569 ± 0.248
TyrThr: 1.749 ± 0.313
TyrVal: 2.107 ± 0.323
TyrTrp: 0.852 ± 0.193
TyrTyr: 0.628 ± 0.147
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 123 proteins (22305 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
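Bootstrapping at the protein level can be pictured roughly as below: whole proteins are resampled with replacement, the frequency is recomputed for each resampled set, and the spread of those estimates is reported. This is a hedged sketch rather than the database's actual code; the function names, the toy sequences, the fixed seed, and the use of the standard deviation as the reported error are all assumptions.

```python
import random
import statistics


def pair_frequency(proteins, pair):
    """Per mille frequency of one dipeptide over a list of sequences."""
    count = 0
    total = 0
    for seq in proteins:
        total += max(len(seq) - 1, 0)
        count += sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == pair)
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Spread of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the set size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    # Assumption: the reported error is the standard deviation of the estimates.
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, one-letter amino acid codes).
proteins = ["MAAK", "AAG", "MKV", "GAAAL"]
error_aa = bootstrap_error(proteins, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that tend to co-occur within the same protein remain together in every replicate.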

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski