Amino acid dipepetide frequency for Mycobacterium virus Goose

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.861AlaAla: 10.861 ± 1.226
0.816AlaCys: 0.816 ± 0.22
4.96AlaAsp: 4.96 ± 0.703
6.781AlaGlu: 6.781 ± 0.751
2.951AlaPhe: 2.951 ± 0.404
7.911AlaGly: 7.911 ± 0.994
1.946AlaHis: 1.946 ± 0.355
4.583AlaIle: 4.583 ± 0.421
4.96AlaLys: 4.96 ± 0.581
9.166AlaLeu: 9.166 ± 0.847
2.449AlaMet: 2.449 ± 0.375
2.637AlaAsn: 2.637 ± 0.502
6.655AlaPro: 6.655 ± 0.925
4.081AlaGln: 4.081 ± 0.505
5.776AlaArg: 5.776 ± 0.635
4.52AlaSer: 4.52 ± 0.561
5.902AlaThr: 5.902 ± 0.585
7.659AlaVal: 7.659 ± 0.726
2.7AlaTrp: 2.7 ± 0.38
2.26AlaTyr: 2.26 ± 0.419
0.0AlaXaa: 0.0 ± 0.0
Cys
0.691CysAla: 0.691 ± 0.201
0.126CysCys: 0.126 ± 0.129
0.565CysAsp: 0.565 ± 0.212
0.816CysGlu: 0.816 ± 0.247
0.126CysPhe: 0.126 ± 0.083
0.753CysGly: 0.753 ± 0.229
0.126CysHis: 0.126 ± 0.086
0.314CysIle: 0.314 ± 0.167
0.439CysLys: 0.439 ± 0.173
0.753CysLeu: 0.753 ± 0.23
0.126CysMet: 0.126 ± 0.093
0.314CysAsn: 0.314 ± 0.144
0.565CysPro: 0.565 ± 0.196
0.377CysGln: 0.377 ± 0.155
0.691CysArg: 0.691 ± 0.229
0.188CysSer: 0.188 ± 0.118
0.691CysThr: 0.691 ± 0.242
0.502CysVal: 0.502 ± 0.168
0.126CysTrp: 0.126 ± 0.075
0.439CysTyr: 0.439 ± 0.183
0.0CysXaa: 0.0 ± 0.0
Asp
6.341AspAla: 6.341 ± 0.698
0.691AspCys: 0.691 ± 0.212
3.955AspAsp: 3.955 ± 0.467
4.458AspGlu: 4.458 ± 0.738
2.009AspPhe: 2.009 ± 0.319
5.902AspGly: 5.902 ± 0.645
1.695AspHis: 1.695 ± 0.365
2.197AspIle: 2.197 ± 0.391
2.009AspLys: 2.009 ± 0.45
6.153AspLeu: 6.153 ± 0.731
1.507AspMet: 1.507 ± 0.312
1.758AspAsn: 1.758 ± 0.306
4.771AspPro: 4.771 ± 0.623
2.511AspGln: 2.511 ± 0.397
4.646AspArg: 4.646 ± 0.665
2.825AspSer: 2.825 ± 0.361
3.327AspThr: 3.327 ± 0.464
4.018AspVal: 4.018 ± 0.439
1.13AspTrp: 1.13 ± 0.247
2.323AspTyr: 2.323 ± 0.362
0.0AspXaa: 0.0 ± 0.0
Glu
7.346GluAla: 7.346 ± 0.605
0.251GluCys: 0.251 ± 0.113
5.148GluAsp: 5.148 ± 0.578
4.269GluGlu: 4.269 ± 0.387
2.449GluPhe: 2.449 ± 0.393
5.023GluGly: 5.023 ± 0.528
1.381GluHis: 1.381 ± 0.346
2.888GluIle: 2.888 ± 0.502
1.883GluLys: 1.883 ± 0.324
5.964GluLeu: 5.964 ± 0.641
1.444GluMet: 1.444 ± 0.291
2.637GluAsn: 2.637 ± 0.433
3.014GluPro: 3.014 ± 0.485
2.449GluGln: 2.449 ± 0.38
5.588GluArg: 5.588 ± 0.627
3.327GluSer: 3.327 ± 0.475
3.955GluThr: 3.955 ± 0.408
4.96GluVal: 4.96 ± 0.555
1.57GluTrp: 1.57 ± 0.307
1.883GluTyr: 1.883 ± 0.382
0.0GluXaa: 0.0 ± 0.0
Phe
3.265PheAla: 3.265 ± 0.501
0.377PheCys: 0.377 ± 0.191
2.511PheAsp: 2.511 ± 0.413
2.637PheGlu: 2.637 ± 0.413
0.314PhePhe: 0.314 ± 0.139
3.202PheGly: 3.202 ± 0.483
0.502PheHis: 0.502 ± 0.214
1.13PheIle: 1.13 ± 0.276
0.628PheLys: 0.628 ± 0.202
2.7PheLeu: 2.7 ± 0.414
0.691PheMet: 0.691 ± 0.194
1.13PheAsn: 1.13 ± 0.289
2.072PhePro: 2.072 ± 0.422
0.879PheGln: 0.879 ± 0.25
1.758PheArg: 1.758 ± 0.347
1.758PheSer: 1.758 ± 0.288
1.695PheThr: 1.695 ± 0.34
2.386PheVal: 2.386 ± 0.332
0.377PheTrp: 0.377 ± 0.178
0.565PheTyr: 0.565 ± 0.171
0.0PheXaa: 0.0 ± 0.0
Gly
6.09GlyAla: 6.09 ± 0.972
0.753GlyCys: 0.753 ± 0.2
6.278GlyAsp: 6.278 ± 0.827
5.023GlyGlu: 5.023 ± 0.56
3.453GlyPhe: 3.453 ± 0.55
7.973GlyGly: 7.973 ± 1.117
2.7GlyHis: 2.7 ± 0.477
4.583GlyIle: 4.583 ± 0.62
4.081GlyLys: 4.081 ± 0.5
6.153GlyLeu: 6.153 ± 0.595
2.072GlyMet: 2.072 ± 0.335
3.076GlyAsn: 3.076 ± 0.521
7.032GlyPro: 7.032 ± 2.563
3.139GlyGln: 3.139 ± 0.476
4.897GlyArg: 4.897 ± 0.572
4.458GlySer: 4.458 ± 0.644
5.65GlyThr: 5.65 ± 0.752
6.027GlyVal: 6.027 ± 0.677
1.632GlyTrp: 1.632 ± 0.35
3.265GlyTyr: 3.265 ± 0.498
0.0GlyXaa: 0.0 ± 0.0
His
1.381HisAla: 1.381 ± 0.269
0.063HisCys: 0.063 ± 0.067
1.381HisAsp: 1.381 ± 0.275
1.632HisGlu: 1.632 ± 0.341
1.13HisPhe: 1.13 ± 0.287
2.135HisGly: 2.135 ± 0.538
0.502HisHis: 0.502 ± 0.172
1.256HisIle: 1.256 ± 0.3
0.942HisLys: 0.942 ± 0.24
1.695HisLeu: 1.695 ± 0.336
0.188HisMet: 0.188 ± 0.102
0.502HisAsn: 0.502 ± 0.148
1.381HisPro: 1.381 ± 0.268
0.502HisGln: 0.502 ± 0.203
1.695HisArg: 1.695 ± 0.394
0.691HisSer: 0.691 ± 0.21
1.13HisThr: 1.13 ± 0.308
0.879HisVal: 0.879 ± 0.26
0.502HisTrp: 0.502 ± 0.199
0.816HisTyr: 0.816 ± 0.211
0.0HisXaa: 0.0 ± 0.0
Ile
5.211IleAla: 5.211 ± 0.628
0.377IleCys: 0.377 ± 0.144
3.83IleAsp: 3.83 ± 0.438
3.014IleGlu: 3.014 ± 0.373
0.942IlePhe: 0.942 ± 0.23
3.955IleGly: 3.955 ± 0.441
1.067IleHis: 1.067 ± 0.26
1.318IleIle: 1.318 ± 0.263
2.009IleLys: 2.009 ± 0.285
3.265IleLeu: 3.265 ± 0.493
0.691IleMet: 0.691 ± 0.17
1.758IleAsn: 1.758 ± 0.268
3.014IlePro: 3.014 ± 0.386
1.444IleGln: 1.444 ± 0.265
3.202IleArg: 3.202 ± 0.519
2.135IleSer: 2.135 ± 0.413
2.574IleThr: 2.574 ± 0.388
3.327IleVal: 3.327 ± 0.523
0.691IleTrp: 0.691 ± 0.191
0.942IleTyr: 0.942 ± 0.209
0.0IleXaa: 0.0 ± 0.0
Lys
4.96LysAla: 4.96 ± 0.553
0.188LysCys: 0.188 ± 0.111
2.762LysAsp: 2.762 ± 0.421
2.26LysGlu: 2.26 ± 0.392
0.628LysPhe: 0.628 ± 0.187
3.955LysGly: 3.955 ± 0.672
1.318LysHis: 1.318 ± 0.262
1.821LysIle: 1.821 ± 0.351
1.695LysLys: 1.695 ± 0.371
3.767LysLeu: 3.767 ± 0.411
1.067LysMet: 1.067 ± 0.214
1.318LysAsn: 1.318 ± 0.352
2.323LysPro: 2.323 ± 0.419
1.632LysGln: 1.632 ± 0.283
3.202LysArg: 3.202 ± 0.555
2.072LysSer: 2.072 ± 0.343
2.135LysThr: 2.135 ± 0.352
3.327LysVal: 3.327 ± 0.409
1.067LysTrp: 1.067 ± 0.279
0.753LysTyr: 0.753 ± 0.197
0.0LysXaa: 0.0 ± 0.0
Leu
8.915LeuAla: 8.915 ± 0.811
0.565LeuCys: 0.565 ± 0.196
5.085LeuAsp: 5.085 ± 0.49
5.525LeuGlu: 5.525 ± 0.565
2.26LeuPhe: 2.26 ± 0.394
6.278LeuGly: 6.278 ± 0.627
1.507LeuHis: 1.507 ± 0.327
3.893LeuIle: 3.893 ± 0.586
3.453LeuLys: 3.453 ± 0.532
5.713LeuLeu: 5.713 ± 0.589
2.574LeuMet: 2.574 ± 0.364
2.323LeuAsn: 2.323 ± 0.369
5.148LeuPro: 5.148 ± 0.493
1.946LeuGln: 1.946 ± 0.472
6.529LeuArg: 6.529 ± 0.649
4.96LeuSer: 4.96 ± 0.703
5.274LeuThr: 5.274 ± 0.604
5.337LeuVal: 5.337 ± 0.508
1.381LeuTrp: 1.381 ± 0.292
2.386LeuTyr: 2.386 ± 0.485
0.0LeuXaa: 0.0 ± 0.0
Met
3.076MetAla: 3.076 ± 0.513
0.0MetCys: 0.0 ± 0.0
1.256MetAsp: 1.256 ± 0.273
0.942MetGlu: 0.942 ± 0.205
0.816MetPhe: 0.816 ± 0.205
2.009MetGly: 2.009 ± 0.375
0.502MetHis: 0.502 ± 0.162
1.318MetIle: 1.318 ± 0.294
1.507MetLys: 1.507 ± 0.349
1.13MetLeu: 1.13 ± 0.256
0.377MetMet: 0.377 ± 0.161
1.005MetAsn: 1.005 ± 0.258
1.57MetPro: 1.57 ± 0.369
0.691MetGln: 0.691 ± 0.205
1.695MetArg: 1.695 ± 0.322
1.318MetSer: 1.318 ± 0.29
1.758MetThr: 1.758 ± 0.332
1.507MetVal: 1.507 ± 0.303
0.251MetTrp: 0.251 ± 0.11
0.565MetTyr: 0.565 ± 0.182
0.0MetXaa: 0.0 ± 0.0
Asn
3.453AsnAla: 3.453 ± 0.437
0.251AsnCys: 0.251 ± 0.164
1.946AsnAsp: 1.946 ± 0.328
1.318AsnGlu: 1.318 ± 0.241
0.816AsnPhe: 0.816 ± 0.194
3.893AsnGly: 3.893 ± 0.587
0.691AsnHis: 0.691 ± 0.208
1.758AsnIle: 1.758 ± 0.396
1.067AsnLys: 1.067 ± 0.219
2.449AsnLeu: 2.449 ± 0.488
0.502AsnMet: 0.502 ± 0.153
0.502AsnAsn: 0.502 ± 0.186
2.7AsnPro: 2.7 ± 0.486
0.879AsnGln: 0.879 ± 0.232
1.883AsnArg: 1.883 ± 0.401
1.381AsnSer: 1.381 ± 0.314
1.758AsnThr: 1.758 ± 0.326
2.511AsnVal: 2.511 ± 0.321
1.13AsnTrp: 1.13 ± 0.278
1.13AsnTyr: 1.13 ± 0.253
0.0AsnXaa: 0.0 ± 0.0
Pro
6.09ProAla: 6.09 ± 0.798
0.439ProCys: 0.439 ± 0.184
4.081ProAsp: 4.081 ± 0.455
5.211ProGlu: 5.211 ± 0.542
1.821ProPhe: 1.821 ± 0.317
5.65ProGly: 5.65 ± 0.761
1.13ProHis: 1.13 ± 0.251
2.323ProIle: 2.323 ± 0.391
2.574ProLys: 2.574 ± 0.528
4.018ProLeu: 4.018 ± 0.562
1.318ProMet: 1.318 ± 0.294
2.26ProAsn: 2.26 ± 0.394
3.579ProPro: 3.579 ± 0.516
3.265ProGln: 3.265 ± 1.082
3.265ProArg: 3.265 ± 0.523
2.951ProSer: 2.951 ± 0.43
4.269ProThr: 4.269 ± 0.799
4.646ProVal: 4.646 ± 0.548
1.507ProTrp: 1.507 ± 0.406
1.946ProTyr: 1.946 ± 0.366
0.0ProXaa: 0.0 ± 0.0
Gln
3.014GlnAla: 3.014 ± 0.393
0.377GlnCys: 0.377 ± 0.162
1.758GlnAsp: 1.758 ± 0.433
1.946GlnGlu: 1.946 ± 0.409
1.13GlnPhe: 1.13 ± 0.292
4.771GlnGly: 4.771 ± 1.607
0.439GlnHis: 0.439 ± 0.16
2.323GlnIle: 2.323 ± 0.399
1.507GlnLys: 1.507 ± 0.309
3.579GlnLeu: 3.579 ± 0.527
0.753GlnMet: 0.753 ± 0.218
0.691GlnAsn: 0.691 ± 0.224
2.135GlnPro: 2.135 ± 0.404
1.256GlnGln: 1.256 ± 0.269
2.762GlnArg: 2.762 ± 0.41
1.444GlnSer: 1.444 ± 0.383
1.632GlnThr: 1.632 ± 0.251
2.951GlnVal: 2.951 ± 0.401
0.816GlnTrp: 0.816 ± 0.212
1.005GlnTyr: 1.005 ± 0.21
0.0GlnXaa: 0.0 ± 0.0
Arg
6.341ArgAla: 6.341 ± 0.702
1.13ArgCys: 1.13 ± 0.394
3.893ArgAsp: 3.893 ± 0.557
5.085ArgGlu: 5.085 ± 0.641
2.386ArgPhe: 2.386 ± 0.365
5.274ArgGly: 5.274 ± 0.7
1.005ArgHis: 1.005 ± 0.244
3.202ArgIle: 3.202 ± 0.453
3.893ArgLys: 3.893 ± 0.537
5.65ArgLeu: 5.65 ± 0.795
1.695ArgMet: 1.695 ± 0.341
2.762ArgAsn: 2.762 ± 0.381
2.951ArgPro: 2.951 ± 0.499
2.449ArgGln: 2.449 ± 0.487
5.902ArgArg: 5.902 ± 0.682
3.076ArgSer: 3.076 ± 0.47
3.014ArgThr: 3.014 ± 0.35
4.897ArgVal: 4.897 ± 0.509
1.695ArgTrp: 1.695 ± 0.399
2.511ArgTyr: 2.511 ± 0.42
0.0ArgXaa: 0.0 ± 0.0
Ser
4.834SerAla: 4.834 ± 0.5
0.188SerCys: 0.188 ± 0.117
3.076SerAsp: 3.076 ± 0.451
3.955SerGlu: 3.955 ± 0.514
1.883SerPhe: 1.883 ± 0.274
4.96SerGly: 4.96 ± 0.631
0.816SerHis: 0.816 ± 0.208
1.758SerIle: 1.758 ± 0.459
2.072SerLys: 2.072 ± 0.392
4.018SerLeu: 4.018 ± 0.559
1.381SerMet: 1.381 ± 0.238
1.256SerAsn: 1.256 ± 0.279
2.7SerPro: 2.7 ± 0.384
1.758SerGln: 1.758 ± 0.319
3.83SerArg: 3.83 ± 0.597
2.511SerSer: 2.511 ± 0.369
2.135SerThr: 2.135 ± 0.343
3.955SerVal: 3.955 ± 0.508
0.942SerTrp: 0.942 ± 0.206
1.13SerTyr: 1.13 ± 0.276
0.0SerXaa: 0.0 ± 0.0
Thr
6.215ThrAla: 6.215 ± 0.646
0.565ThrCys: 0.565 ± 0.221
3.014ThrAsp: 3.014 ± 0.536
3.893ThrGlu: 3.893 ± 0.571
1.758ThrPhe: 1.758 ± 0.335
4.96ThrGly: 4.96 ± 0.654
0.942ThrHis: 0.942 ± 0.264
2.26ThrIle: 2.26 ± 0.434
2.574ThrLys: 2.574 ± 0.386
5.525ThrLeu: 5.525 ± 0.595
1.695ThrMet: 1.695 ± 0.311
1.632ThrAsn: 1.632 ± 0.305
4.269ThrPro: 4.269 ± 0.563
2.323ThrGln: 2.323 ± 0.392
3.516ThrArg: 3.516 ± 0.525
2.449ThrSer: 2.449 ± 0.443
3.265ThrThr: 3.265 ± 0.452
4.081ThrVal: 4.081 ± 0.467
0.942ThrTrp: 0.942 ± 0.258
1.632ThrTyr: 1.632 ± 0.296
0.0ThrXaa: 0.0 ± 0.0
Val
7.534ValAla: 7.534 ± 0.739
0.816ValCys: 0.816 ± 0.244
4.583ValAsp: 4.583 ± 0.576
5.525ValGlu: 5.525 ± 0.55
2.26ValPhe: 2.26 ± 0.449
6.027ValGly: 6.027 ± 0.677
1.13ValHis: 1.13 ± 0.297
3.076ValIle: 3.076 ± 0.48
3.579ValLys: 3.579 ± 0.437
5.776ValLeu: 5.776 ± 0.62
1.067ValMet: 1.067 ± 0.261
2.574ValAsn: 2.574 ± 0.423
3.704ValPro: 3.704 ± 0.546
2.449ValGln: 2.449 ± 0.475
4.332ValArg: 4.332 ± 0.528
4.583ValSer: 4.583 ± 0.531
4.458ValThr: 4.458 ± 0.52
5.776ValVal: 5.776 ± 0.481
1.318ValTrp: 1.318 ± 0.261
2.197ValTyr: 2.197 ± 0.38
0.0ValXaa: 0.0 ± 0.0
Trp
1.946TrpAla: 1.946 ± 0.359
0.251TrpCys: 0.251 ± 0.133
1.57TrpAsp: 1.57 ± 0.298
1.318TrpGlu: 1.318 ± 0.287
0.691TrpPhe: 0.691 ± 0.219
1.632TrpGly: 1.632 ± 0.291
0.439TrpHis: 0.439 ± 0.188
1.507TrpIle: 1.507 ± 0.288
0.691TrpLys: 0.691 ± 0.219
1.193TrpLeu: 1.193 ± 0.294
0.753TrpMet: 0.753 ± 0.199
0.879TrpAsn: 0.879 ± 0.241
1.13TrpPro: 1.13 ± 0.285
1.067TrpGln: 1.067 ± 0.208
1.067TrpArg: 1.067 ± 0.215
1.005TrpSer: 1.005 ± 0.259
1.005TrpThr: 1.005 ± 0.253
1.821TrpVal: 1.821 ± 0.365
0.753TrpTrp: 0.753 ± 0.25
0.377TrpTyr: 0.377 ± 0.158
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.009TyrAla: 2.009 ± 0.409
0.502TyrCys: 0.502 ± 0.212
2.574TyrAsp: 2.574 ± 0.465
1.821TyrGlu: 1.821 ± 0.371
0.753TyrPhe: 0.753 ± 0.194
2.072TyrGly: 2.072 ± 0.34
0.565TyrHis: 0.565 ± 0.184
1.256TyrIle: 1.256 ± 0.296
0.691TyrLys: 0.691 ± 0.221
2.386TyrLeu: 2.386 ± 0.319
0.879TyrMet: 0.879 ± 0.271
1.005TyrAsn: 1.005 ± 0.23
1.632TyrPro: 1.632 ± 0.28
1.193TyrGln: 1.193 ± 0.261
2.637TyrArg: 2.637 ± 0.466
1.507TyrSer: 1.507 ± 0.33
1.883TyrThr: 1.883 ± 0.358
2.197TyrVal: 2.197 ± 0.36
0.565TyrTrp: 0.565 ± 0.193
1.256TyrTyr: 1.256 ± 0.299
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 86 proteins (15929 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski