Amino acid dipepetide frequency for Mycobacterium virus Deadp

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.238AlaAla: 16.238 ± 2.406
0.728AlaCys: 0.728 ± 0.219
7.503AlaAsp: 7.503 ± 0.648
6.887AlaGlu: 6.887 ± 0.73
2.744AlaPhe: 2.744 ± 0.361
11.871AlaGly: 11.871 ± 1.833
2.352AlaHis: 2.352 ± 0.386
4.592AlaIle: 4.592 ± 0.471
3.752AlaLys: 3.752 ± 0.42
8.287AlaLeu: 8.287 ± 0.78
2.912AlaMet: 2.912 ± 0.465
2.352AlaAsn: 2.352 ± 0.385
5.879AlaPro: 5.879 ± 0.601
3.416AlaGln: 3.416 ± 0.469
7.335AlaArg: 7.335 ± 0.695
5.767AlaSer: 5.767 ± 0.562
5.375AlaThr: 5.375 ± 0.609
7.839AlaVal: 7.839 ± 0.741
2.744AlaTrp: 2.744 ± 0.48
1.904AlaTyr: 1.904 ± 0.33
0.0AlaXaa: 0.0 ± 0.0
Cys
0.952CysAla: 0.952 ± 0.302
0.0CysCys: 0.0 ± 0.0
1.232CysAsp: 1.232 ± 0.338
0.616CysGlu: 0.616 ± 0.215
0.392CysPhe: 0.392 ± 0.131
1.512CysGly: 1.512 ± 0.362
0.224CysHis: 0.224 ± 0.116
0.168CysIle: 0.168 ± 0.105
0.448CysLys: 0.448 ± 0.156
0.56CysLeu: 0.56 ± 0.202
0.224CysMet: 0.224 ± 0.11
0.392CysAsn: 0.392 ± 0.133
0.896CysPro: 0.896 ± 0.282
0.224CysGln: 0.224 ± 0.109
0.672CysArg: 0.672 ± 0.236
0.728CysSer: 0.728 ± 0.234
0.784CysThr: 0.784 ± 0.255
0.728CysVal: 0.728 ± 0.179
0.336CysTrp: 0.336 ± 0.119
0.056CysTyr: 0.056 ± 0.057
0.0CysXaa: 0.0 ± 0.0
Asp
6.663AspAla: 6.663 ± 0.592
1.064AspCys: 1.064 ± 0.276
4.815AspAsp: 4.815 ± 0.555
3.472AspGlu: 3.472 ± 0.424
2.128AspPhe: 2.128 ± 0.302
7.111AspGly: 7.111 ± 0.615
1.288AspHis: 1.288 ± 0.28
2.464AspIle: 2.464 ± 0.36
1.68AspLys: 1.68 ± 0.284
6.383AspLeu: 6.383 ± 0.519
1.456AspMet: 1.456 ± 0.3
1.232AspAsn: 1.232 ± 0.27
4.648AspPro: 4.648 ± 0.608
2.352AspGln: 2.352 ± 0.365
4.704AspArg: 4.704 ± 0.593
3.92AspSer: 3.92 ± 0.589
4.871AspThr: 4.871 ± 0.489
4.256AspVal: 4.256 ± 0.51
1.456AspTrp: 1.456 ± 0.274
2.352AspTyr: 2.352 ± 0.368
0.0AspXaa: 0.0 ± 0.0
Glu
6.439GluAla: 6.439 ± 0.702
1.064GluCys: 1.064 ± 0.306
2.52GluAsp: 2.52 ± 0.372
2.968GluGlu: 2.968 ± 0.473
2.576GluPhe: 2.576 ± 0.405
2.856GluGly: 2.856 ± 0.404
1.4GluHis: 1.4 ± 0.312
2.576GluIle: 2.576 ± 0.385
1.792GluLys: 1.792 ± 0.365
5.711GluLeu: 5.711 ± 0.714
1.792GluMet: 1.792 ± 0.341
2.128GluAsn: 2.128 ± 0.288
2.576GluPro: 2.576 ± 0.391
2.912GluGln: 2.912 ± 0.45
4.536GluArg: 4.536 ± 0.688
3.08GluSer: 3.08 ± 0.44
4.424GluThr: 4.424 ± 0.568
3.472GluVal: 3.472 ± 0.495
1.68GluTrp: 1.68 ± 0.275
2.016GluTyr: 2.016 ± 0.405
0.0GluXaa: 0.0 ± 0.0
Phe
3.472PheAla: 3.472 ± 0.495
0.168PheCys: 0.168 ± 0.101
2.8PheAsp: 2.8 ± 0.496
1.568PheGlu: 1.568 ± 0.365
0.952PhePhe: 0.952 ± 0.272
3.08PheGly: 3.08 ± 0.536
0.336PheHis: 0.336 ± 0.135
1.344PheIle: 1.344 ± 0.341
0.84PheLys: 0.84 ± 0.222
2.072PheLeu: 2.072 ± 0.327
1.008PheMet: 1.008 ± 0.248
1.176PheAsn: 1.176 ± 0.34
1.736PhePro: 1.736 ± 0.305
1.008PheGln: 1.008 ± 0.322
1.568PheArg: 1.568 ± 0.289
1.568PheSer: 1.568 ± 0.284
2.408PheThr: 2.408 ± 0.436
2.016PheVal: 2.016 ± 0.287
0.672PheTrp: 0.672 ± 0.211
0.952PheTyr: 0.952 ± 0.237
0.0PheXaa: 0.0 ± 0.0
Gly
9.351GlyAla: 9.351 ± 1.346
0.896GlyCys: 0.896 ± 0.265
6.271GlyAsp: 6.271 ± 0.561
3.976GlyGlu: 3.976 ± 0.56
3.024GlyPhe: 3.024 ± 0.423
10.471GlyGly: 10.471 ± 2.742
1.848GlyHis: 1.848 ± 0.26
4.648GlyIle: 4.648 ± 0.609
2.52GlyLys: 2.52 ± 0.346
5.823GlyLeu: 5.823 ± 0.573
2.52GlyMet: 2.52 ± 0.451
2.968GlyAsn: 2.968 ± 0.46
3.92GlyPro: 3.92 ± 0.598
2.24GlyGln: 2.24 ± 0.575
4.927GlyArg: 4.927 ± 0.645
7.111GlySer: 7.111 ± 1.473
6.047GlyThr: 6.047 ± 0.723
5.263GlyVal: 5.263 ± 0.45
2.408GlyTrp: 2.408 ± 0.526
2.408GlyTyr: 2.408 ± 0.451
0.0GlyXaa: 0.0 ± 0.0
His
1.904HisAla: 1.904 ± 0.46
0.392HisCys: 0.392 ± 0.186
1.624HisAsp: 1.624 ± 0.388
1.12HisGlu: 1.12 ± 0.241
0.28HisPhe: 0.28 ± 0.12
1.456HisGly: 1.456 ± 0.293
0.952HisHis: 0.952 ± 0.246
1.008HisIle: 1.008 ± 0.225
0.728HisLys: 0.728 ± 0.182
1.008HisLeu: 1.008 ± 0.254
0.448HisMet: 0.448 ± 0.133
0.84HisAsn: 0.84 ± 0.211
2.184HisPro: 2.184 ± 0.372
0.784HisGln: 0.784 ± 0.226
2.016HisArg: 2.016 ± 0.393
0.84HisSer: 0.84 ± 0.2
1.568HisThr: 1.568 ± 0.335
1.176HisVal: 1.176 ± 0.253
0.28HisTrp: 0.28 ± 0.12
0.616HisTyr: 0.616 ± 0.197
0.0HisXaa: 0.0 ± 0.0
Ile
5.599IleAla: 5.599 ± 0.567
0.616IleCys: 0.616 ± 0.221
4.144IleAsp: 4.144 ± 0.454
3.696IleGlu: 3.696 ± 0.352
0.728IlePhe: 0.728 ± 0.246
3.36IleGly: 3.36 ± 0.523
1.456IleHis: 1.456 ± 0.32
1.4IleIle: 1.4 ± 0.305
1.232IleLys: 1.232 ± 0.267
2.352IleLeu: 2.352 ± 0.412
0.392IleMet: 0.392 ± 0.148
1.904IleAsn: 1.904 ± 0.326
2.856IlePro: 2.856 ± 0.326
1.512IleGln: 1.512 ± 0.238
3.08IleArg: 3.08 ± 0.54
2.352IleSer: 2.352 ± 0.451
3.584IleThr: 3.584 ± 0.412
3.024IleVal: 3.024 ± 0.368
0.84IleTrp: 0.84 ± 0.227
0.672IleTyr: 0.672 ± 0.214
0.0IleXaa: 0.0 ± 0.0
Lys
3.696LysAla: 3.696 ± 0.558
0.28LysCys: 0.28 ± 0.147
1.512LysAsp: 1.512 ± 0.318
1.456LysGlu: 1.456 ± 0.31
1.288LysPhe: 1.288 ± 0.227
2.576LysGly: 2.576 ± 0.369
0.896LysHis: 0.896 ± 0.25
1.064LysIle: 1.064 ± 0.248
1.4LysLys: 1.4 ± 0.472
2.184LysLeu: 2.184 ± 0.46
0.448LysMet: 0.448 ± 0.167
0.84LysAsn: 0.84 ± 0.232
2.296LysPro: 2.296 ± 0.448
1.624LysGln: 1.624 ± 0.249
2.576LysArg: 2.576 ± 0.384
1.792LysSer: 1.792 ± 0.342
2.464LysThr: 2.464 ± 0.383
2.352LysVal: 2.352 ± 0.459
0.952LysTrp: 0.952 ± 0.236
0.84LysTyr: 0.84 ± 0.245
0.0LysXaa: 0.0 ± 0.0
Leu
7.783LeuAla: 7.783 ± 0.837
0.672LeuCys: 0.672 ± 0.185
5.263LeuAsp: 5.263 ± 0.48
3.584LeuGlu: 3.584 ± 0.448
2.408LeuPhe: 2.408 ± 0.31
5.767LeuGly: 5.767 ± 0.621
1.4LeuHis: 1.4 ± 0.287
3.528LeuIle: 3.528 ± 0.498
2.072LeuLys: 2.072 ± 0.329
4.256LeuLeu: 4.256 ± 0.534
1.568LeuMet: 1.568 ± 0.324
2.184LeuAsn: 2.184 ± 0.363
5.823LeuPro: 5.823 ± 0.754
2.744LeuGln: 2.744 ± 0.403
4.592LeuArg: 4.592 ± 0.636
5.543LeuSer: 5.543 ± 0.497
5.207LeuThr: 5.207 ± 0.473
5.039LeuVal: 5.039 ± 0.515
1.344LeuTrp: 1.344 ± 0.3
2.128LeuTyr: 2.128 ± 0.333
0.0LeuXaa: 0.0 ± 0.0
Met
2.296MetAla: 2.296 ± 0.35
0.28MetCys: 0.28 ± 0.146
1.456MetAsp: 1.456 ± 0.319
1.064MetGlu: 1.064 ± 0.237
0.784MetPhe: 0.784 ± 0.234
1.792MetGly: 1.792 ± 0.268
0.224MetHis: 0.224 ± 0.122
0.952MetIle: 0.952 ± 0.253
0.784MetLys: 0.784 ± 0.257
1.904MetLeu: 1.904 ± 0.288
0.504MetMet: 0.504 ± 0.198
0.896MetAsn: 0.896 ± 0.235
1.344MetPro: 1.344 ± 0.284
0.504MetGln: 0.504 ± 0.14
1.736MetArg: 1.736 ± 0.273
2.968MetSer: 2.968 ± 0.374
2.296MetThr: 2.296 ± 0.372
1.568MetVal: 1.568 ± 0.342
0.28MetTrp: 0.28 ± 0.125
0.224MetTyr: 0.224 ± 0.115
0.0MetXaa: 0.0 ± 0.0
Asn
3.304AsnAla: 3.304 ± 0.403
0.28AsnCys: 0.28 ± 0.118
1.736AsnAsp: 1.736 ± 0.314
1.848AsnGlu: 1.848 ± 0.32
0.896AsnPhe: 0.896 ± 0.3
3.752AsnGly: 3.752 ± 0.5
0.728AsnHis: 0.728 ± 0.179
1.568AsnIle: 1.568 ± 0.436
1.064AsnLys: 1.064 ± 0.226
2.408AsnLeu: 2.408 ± 0.432
0.448AsnMet: 0.448 ± 0.133
1.624AsnAsn: 1.624 ± 0.347
2.128AsnPro: 2.128 ± 0.35
1.008AsnGln: 1.008 ± 0.353
1.736AsnArg: 1.736 ± 0.33
1.96AsnSer: 1.96 ± 0.297
2.016AsnThr: 2.016 ± 0.303
1.792AsnVal: 1.792 ± 0.353
0.672AsnTrp: 0.672 ± 0.218
0.616AsnTyr: 0.616 ± 0.163
0.0AsnXaa: 0.0 ± 0.0
Pro
5.319ProAla: 5.319 ± 0.644
0.672ProCys: 0.672 ± 0.194
4.312ProAsp: 4.312 ± 0.491
4.76ProGlu: 4.76 ± 0.607
1.68ProPhe: 1.68 ± 0.302
6.775ProGly: 6.775 ± 0.607
1.288ProHis: 1.288 ± 0.299
1.96ProIle: 1.96 ± 0.346
2.072ProLys: 2.072 ± 0.358
4.256ProLeu: 4.256 ± 0.531
1.568ProMet: 1.568 ± 0.263
2.072ProAsn: 2.072 ± 0.307
3.36ProPro: 3.36 ± 0.551
1.904ProGln: 1.904 ± 0.444
3.416ProArg: 3.416 ± 0.609
3.08ProSer: 3.08 ± 0.406
3.304ProThr: 3.304 ± 0.403
4.983ProVal: 4.983 ± 0.514
0.84ProTrp: 0.84 ± 0.205
1.064ProTyr: 1.064 ± 0.234
0.0ProXaa: 0.0 ± 0.0
Gln
4.815GlnAla: 4.815 ± 0.615
0.392GlnCys: 0.392 ± 0.213
1.624GlnAsp: 1.624 ± 0.279
1.456GlnGlu: 1.456 ± 0.294
1.064GlnPhe: 1.064 ± 0.224
2.352GlnGly: 2.352 ± 0.513
0.504GlnHis: 0.504 ± 0.222
1.848GlnIle: 1.848 ± 0.331
1.568GlnLys: 1.568 ± 0.249
2.856GlnLeu: 2.856 ± 0.422
0.784GlnMet: 0.784 ± 0.224
0.84GlnAsn: 0.84 ± 0.304
2.688GlnPro: 2.688 ± 0.57
1.064GlnGln: 1.064 ± 0.209
2.24GlnArg: 2.24 ± 0.307
2.24GlnSer: 2.24 ± 0.402
2.072GlnThr: 2.072 ± 0.338
2.24GlnVal: 2.24 ± 0.338
0.504GlnTrp: 0.504 ± 0.146
0.952GlnTyr: 0.952 ± 0.275
0.0GlnXaa: 0.0 ± 0.0
Arg
6.103ArgAla: 6.103 ± 0.585
1.064ArgCys: 1.064 ± 0.296
4.424ArgAsp: 4.424 ± 0.62
4.592ArgGlu: 4.592 ± 0.55
2.296ArgPhe: 2.296 ± 0.381
3.976ArgGly: 3.976 ± 0.492
1.512ArgHis: 1.512 ± 0.394
4.032ArgIle: 4.032 ± 0.585
2.352ArgLys: 2.352 ± 0.446
4.76ArgLeu: 4.76 ± 0.636
2.352ArgMet: 2.352 ± 0.416
2.128ArgAsn: 2.128 ± 0.309
3.64ArgPro: 3.64 ± 0.365
2.128ArgGln: 2.128 ± 0.368
5.431ArgArg: 5.431 ± 0.778
3.584ArgSer: 3.584 ± 0.509
3.416ArgThr: 3.416 ± 0.64
4.983ArgVal: 4.983 ± 0.668
1.792ArgTrp: 1.792 ± 0.319
1.624ArgTyr: 1.624 ± 0.312
0.0ArgXaa: 0.0 ± 0.0
Ser
7.727SerAla: 7.727 ± 1.58
0.504SerCys: 0.504 ± 0.199
3.92SerAsp: 3.92 ± 0.458
3.528SerGlu: 3.528 ± 0.527
2.184SerPhe: 2.184 ± 0.45
6.327SerGly: 6.327 ± 0.811
1.008SerHis: 1.008 ± 0.25
3.08SerIle: 3.08 ± 0.499
2.576SerLys: 2.576 ± 0.428
4.648SerLeu: 4.648 ± 0.438
1.512SerMet: 1.512 ± 0.292
2.128SerAsn: 2.128 ± 0.403
3.08SerPro: 3.08 ± 0.394
1.848SerGln: 1.848 ± 0.325
3.416SerArg: 3.416 ± 0.388
3.472SerSer: 3.472 ± 0.535
3.528SerThr: 3.528 ± 0.404
4.312SerVal: 4.312 ± 0.516
1.4SerTrp: 1.4 ± 0.247
1.288SerTyr: 1.288 ± 0.23
0.0SerXaa: 0.0 ± 0.0
Thr
6.943ThrAla: 6.943 ± 0.707
0.56ThrCys: 0.56 ± 0.19
4.312ThrAsp: 4.312 ± 0.564
3.976ThrGlu: 3.976 ± 0.425
1.792ThrPhe: 1.792 ± 0.307
5.431ThrGly: 5.431 ± 0.65
1.904ThrHis: 1.904 ± 0.32
3.416ThrIle: 3.416 ± 0.364
2.24ThrLys: 2.24 ± 0.359
4.368ThrLeu: 4.368 ± 0.465
1.4ThrMet: 1.4 ± 0.28
2.408ThrAsn: 2.408 ± 0.445
3.808ThrPro: 3.808 ± 0.448
2.184ThrGln: 2.184 ± 0.392
3.696ThrArg: 3.696 ± 0.408
4.424ThrSer: 4.424 ± 0.466
5.151ThrThr: 5.151 ± 0.67
5.823ThrVal: 5.823 ± 0.707
0.952ThrTrp: 0.952 ± 0.224
1.96ThrTyr: 1.96 ± 0.356
0.0ThrXaa: 0.0 ± 0.0
Val
7.111ValAla: 7.111 ± 0.625
1.064ValCys: 1.064 ± 0.278
5.431ValAsp: 5.431 ± 0.559
4.815ValGlu: 4.815 ± 0.527
2.128ValPhe: 2.128 ± 0.39
5.431ValGly: 5.431 ± 0.549
1.344ValHis: 1.344 ± 0.283
2.968ValIle: 2.968 ± 0.474
2.016ValLys: 2.016 ± 0.347
5.039ValLeu: 5.039 ± 0.644
1.4ValMet: 1.4 ± 0.26
2.24ValAsn: 2.24 ± 0.358
3.976ValPro: 3.976 ± 0.412
2.8ValGln: 2.8 ± 0.371
4.368ValArg: 4.368 ± 0.638
4.704ValSer: 4.704 ± 0.541
4.983ValThr: 4.983 ± 0.531
6.439ValVal: 6.439 ± 0.777
1.792ValTrp: 1.792 ± 0.371
1.4ValTyr: 1.4 ± 0.258
0.0ValXaa: 0.0 ± 0.0
Trp
2.072TrpAla: 2.072 ± 0.3
0.168TrpCys: 0.168 ± 0.123
1.624TrpAsp: 1.624 ± 0.305
1.176TrpGlu: 1.176 ± 0.281
0.672TrpPhe: 0.672 ± 0.209
0.896TrpGly: 0.896 ± 0.207
0.336TrpHis: 0.336 ± 0.138
1.12TrpIle: 1.12 ± 0.242
0.784TrpLys: 0.784 ± 0.179
2.072TrpLeu: 2.072 ± 0.308
0.896TrpMet: 0.896 ± 0.228
0.504TrpAsn: 0.504 ± 0.206
0.784TrpPro: 0.784 ± 0.259
1.064TrpGln: 1.064 ± 0.308
2.128TrpArg: 2.128 ± 0.409
1.4TrpSer: 1.4 ± 0.403
1.624TrpThr: 1.624 ± 0.294
1.904TrpVal: 1.904 ± 0.4
0.784TrpTrp: 0.784 ± 0.183
0.504TrpTyr: 0.504 ± 0.186
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.688TyrAla: 2.688 ± 0.462
0.28TyrCys: 0.28 ± 0.14
1.792TyrAsp: 1.792 ± 0.37
2.016TyrGlu: 2.016 ± 0.323
0.728TyrPhe: 0.728 ± 0.219
1.848TyrGly: 1.848 ± 0.334
0.224TyrHis: 0.224 ± 0.097
1.064TyrIle: 1.064 ± 0.253
0.616TyrLys: 0.616 ± 0.209
1.96TyrLeu: 1.96 ± 0.295
0.168TyrMet: 0.168 ± 0.091
0.728TyrAsn: 0.728 ± 0.171
1.064TyrPro: 1.064 ± 0.198
0.784TyrGln: 0.784 ± 0.194
2.016TyrArg: 2.016 ± 0.433
0.952TyrSer: 0.952 ± 0.215
1.624TyrThr: 1.624 ± 0.325
2.24TyrVal: 2.24 ± 0.331
0.784TyrTrp: 0.784 ± 0.229
0.56TyrTyr: 0.56 ± 0.166
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 106 proteins (17860 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski