Amino acid dipeptide frequency for Mycobacterium phage Bones

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
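The uniform expectation described above can be checked with a short script. This is a minimal sketch, not the Proteome-pI implementation: the function names, the use of overlapping windows that never span two proteins, the normalization by the total dipeptide count, and the 1/400 threshold for "over" and "under" are all assumptions made for illustration.

```python
from collections import Counter

# The 20 standard residues in one-letter code; anything else is mapped to X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins.

    Assumption: overlapping windows of length 2 that never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_per_mille, n_letters=20):
    """Label a frequency relative to the uniform expectation 1000 / n_letters**2."""
    expected = 1000.0 / n_letters ** 2  # 2.5 per mille for 20 letters
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Toy input for illustration only, not taken from the Bones proteome.
toy = ["MAAGLA", "AAV"]
freqs = dipeptide_per_mille(toy)
```

With the table values, `classify(12.883)` labels AlaAla as overrepresented and `classify(0.611)` labels AlaCys as underrepresented. Passing `n_letters=21` switches to the 441-dipeptide alphabet that includes Xaa.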

All values are given in per mille (‰), so they must be multiplied by 10^-3 to obtain fractions.
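As a worked example of the unit conversion, the AlaAla entry of the table (12.883‰) can be compared with the uniform expectation of 1/400. The choice of AlaAla is only for illustration.

```latex
12.883\,\text{‰} = 12.883 \times 10^{-3} = 0.012883 = 1.2883\,\%
\qquad \text{versus} \qquad
\frac{1}{400} = 0.0025 = 0.25\,\% = 2.5\,\text{‰}
```

AlaAla is therefore found about 5.2 times more often than the uniform expectation.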

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 12.883 ± 0.962
AlaCys: 0.611 ± 0.17
AlaAsp: 6.594 ± 0.631
AlaGlu: 6.533 ± 0.716
AlaPhe: 3.358 ± 0.385
AlaGly: 7.937 ± 0.889
AlaHis: 1.465 ± 0.358
AlaIle: 4.579 ± 0.562
AlaLys: 4.213 ± 0.545
AlaLeu: 9.037 ± 1.036
AlaMet: 2.137 ± 0.393
AlaAsn: 2.564 ± 0.397
AlaPro: 4.885 ± 0.675
AlaGln: 3.48 ± 0.47
AlaArg: 6.228 ± 0.595
AlaSer: 4.701 ± 0.514
AlaThr: 5.495 ± 0.615
AlaVal: 8.548 ± 0.759
AlaTrp: 1.832 ± 0.329
AlaTyr: 2.87 ± 0.425
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.855 ± 0.236
CysCys: 0.0 ± 0.0
CysAsp: 0.427 ± 0.142
CysGlu: 0.55 ± 0.162
CysPhe: 0.305 ± 0.138
CysGly: 0.55 ± 0.202
CysHis: 0.183 ± 0.11
CysIle: 0.366 ± 0.135
CysLys: 0.305 ± 0.139
CysLeu: 0.366 ± 0.166
CysMet: 0.122 ± 0.076
CysAsn: 0.305 ± 0.144
CysPro: 0.611 ± 0.217
CysGln: 0.244 ± 0.145
CysArg: 0.305 ± 0.13
CysSer: 0.427 ± 0.186
CysThr: 0.183 ± 0.119
CysVal: 0.427 ± 0.157
CysTrp: 0.183 ± 0.096
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.716 ± 0.538
AspCys: 0.55 ± 0.191
AspAsp: 5.068 ± 0.496
AspGlu: 3.358 ± 0.508
AspPhe: 2.259 ± 0.346
AspGly: 6.472 ± 0.564
AspHis: 1.221 ± 0.305
AspIle: 2.381 ± 0.401
AspLys: 2.809 ± 0.443
AspLeu: 7.327 ± 0.672
AspMet: 1.099 ± 0.21
AspAsn: 2.015 ± 0.373
AspPro: 4.396 ± 0.495
AspGln: 1.587 ± 0.304
AspArg: 3.786 ± 0.35
AspSer: 2.809 ± 0.453
AspThr: 4.335 ± 0.43
AspVal: 5.373 ± 0.584
AspTrp: 1.771 ± 0.298
AspTyr: 1.649 ± 0.332
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.228 ± 0.765
GluCys: 0.305 ± 0.155
GluAsp: 4.152 ± 0.476
GluGlu: 4.885 ± 0.596
GluPhe: 2.198 ± 0.386
GluGly: 4.396 ± 0.437
GluHis: 1.221 ± 0.274
GluIle: 3.175 ± 0.446
GluLys: 2.687 ± 0.425
GluLeu: 6.411 ± 0.572
GluMet: 1.404 ± 0.238
GluAsn: 1.893 ± 0.325
GluPro: 2.748 ± 0.459
GluGln: 2.748 ± 0.412
GluArg: 3.602 ± 0.516
GluSer: 3.358 ± 0.465
GluThr: 3.541 ± 0.468
GluVal: 5.434 ± 0.608
GluTrp: 1.771 ± 0.337
GluTyr: 2.381 ± 0.472
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.442 ± 0.352
PheCys: 0.305 ± 0.155
PheAsp: 2.809 ± 0.373
PheGlu: 2.137 ± 0.359
PhePhe: 0.488 ± 0.16
PheGly: 3.48 ± 0.476
PheHis: 0.55 ± 0.239
PheIle: 1.587 ± 0.331
PheLys: 1.649 ± 0.308
PheLeu: 2.564 ± 0.508
PheMet: 0.55 ± 0.205
PheAsn: 1.282 ± 0.274
PhePro: 1.404 ± 0.301
PheGln: 1.16 ± 0.297
PheArg: 1.954 ± 0.338
PheSer: 2.198 ± 0.368
PheThr: 1.893 ± 0.328
PheVal: 2.442 ± 0.416
PheTrp: 0.611 ± 0.186
PheTyr: 0.855 ± 0.286
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.961 ± 0.967
GlyCys: 1.038 ± 0.302
GlyAsp: 5.373 ± 0.519
GlyGlu: 4.701 ± 0.469
GlyPhe: 3.053 ± 0.455
GlyGly: 9.586 ± 2.275
GlyHis: 1.893 ± 0.375
GlyIle: 4.579 ± 0.597
GlyLys: 3.969 ± 0.526
GlyLeu: 7.815 ± 0.864
GlyMet: 1.587 ± 0.28
GlyAsn: 3.358 ± 0.444
GlyPro: 4.213 ± 0.519
GlyGln: 2.198 ± 0.358
GlyArg: 4.213 ± 0.513
GlySer: 6.289 ± 0.914
GlyThr: 4.824 ± 0.565
GlyVal: 5.8 ± 0.646
GlyTrp: 2.198 ± 0.449
GlyTyr: 2.992 ± 0.403
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.71 ± 0.344
HisCys: 0.183 ± 0.149
HisAsp: 1.099 ± 0.227
HisGlu: 1.16 ± 0.287
HisPhe: 0.855 ± 0.195
HisGly: 1.649 ± 0.349
HisHis: 0.55 ± 0.184
HisIle: 0.855 ± 0.164
HisLys: 1.221 ± 0.344
HisLeu: 1.16 ± 0.297
HisMet: 0.183 ± 0.154
HisAsn: 0.427 ± 0.167
HisPro: 1.099 ± 0.251
HisGln: 0.977 ± 0.279
HisArg: 1.343 ± 0.286
HisSer: 0.488 ± 0.169
HisThr: 1.099 ± 0.223
HisVal: 1.649 ± 0.329
HisTrp: 0.55 ± 0.178
HisTyr: 0.733 ± 0.256
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.533 ± 0.632
IleCys: 0.244 ± 0.119
IleAsp: 3.48 ± 0.409
IleGlu: 3.602 ± 0.404
IlePhe: 0.733 ± 0.23
IleGly: 3.908 ± 0.425
IleHis: 0.855 ± 0.215
IleIle: 1.832 ± 0.303
IleLys: 1.649 ± 0.313
IleLeu: 3.236 ± 0.346
IleMet: 0.672 ± 0.18
IleAsn: 1.832 ± 0.313
IlePro: 3.236 ± 0.442
IleGln: 1.404 ± 0.271
IleArg: 3.663 ± 0.493
IleSer: 3.114 ± 0.411
IleThr: 3.175 ± 0.456
IleVal: 2.931 ± 0.566
IleTrp: 0.672 ± 0.193
IleTyr: 1.343 ± 0.23
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.541 ± 0.484
LysCys: 0.366 ± 0.166
LysAsp: 2.748 ± 0.474
LysGlu: 2.259 ± 0.386
LysPhe: 1.649 ± 0.324
LysGly: 2.503 ± 0.418
LysHis: 1.038 ± 0.299
LysIle: 2.931 ± 0.519
LysLys: 2.32 ± 0.419
LysLeu: 3.358 ± 0.397
LysMet: 0.977 ± 0.228
LysAsn: 1.587 ± 0.269
LysPro: 2.687 ± 0.389
LysGln: 1.771 ± 0.417
LysArg: 3.297 ± 0.495
LysSer: 2.381 ± 0.526
LysThr: 2.259 ± 0.377
LysVal: 3.602 ± 0.606
LysTrp: 0.733 ± 0.26
LysTyr: 1.038 ± 0.248
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.342 ± 0.816
LeuCys: 0.244 ± 0.109
LeuAsp: 6.411 ± 0.612
LeuGlu: 5.739 ± 0.706
LeuPhe: 2.137 ± 0.376
LeuGly: 7.632 ± 0.722
LeuHis: 1.465 ± 0.368
LeuIle: 4.396 ± 0.52
LeuLys: 3.908 ± 0.42
LeuLeu: 5.068 ± 0.522
LeuMet: 1.893 ± 0.353
LeuAsn: 2.381 ± 0.364
LeuPro: 5.495 ± 0.565
LeuGln: 2.625 ± 0.497
LeuArg: 6.228 ± 0.548
LeuSer: 6.106 ± 0.614
LeuThr: 5.862 ± 0.527
LeuVal: 4.518 ± 0.606
LeuTrp: 1.099 ± 0.311
LeuTyr: 2.32 ± 0.436
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.259 ± 0.352
MetCys: 0.0 ± 0.0
MetAsp: 1.099 ± 0.24
MetGlu: 1.343 ± 0.339
MetPhe: 0.55 ± 0.171
MetGly: 1.16 ± 0.238
MetHis: 0.427 ± 0.218
MetIle: 0.55 ± 0.192
MetLys: 1.038 ± 0.233
MetLeu: 1.282 ± 0.317
MetMet: 0.122 ± 0.094
MetAsn: 1.099 ± 0.222
MetPro: 1.282 ± 0.298
MetGln: 0.488 ± 0.16
MetArg: 1.404 ± 0.284
MetSer: 2.076 ± 0.398
MetThr: 1.893 ± 0.274
MetVal: 1.221 ± 0.321
MetTrp: 0.244 ± 0.118
MetTyr: 0.427 ± 0.164
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.809 ± 0.543
AsnCys: 0.366 ± 0.171
AsnAsp: 1.954 ± 0.393
AsnGlu: 1.771 ± 0.328
AsnPhe: 1.099 ± 0.332
AsnGly: 3.48 ± 0.532
AsnHis: 0.611 ± 0.192
AsnIle: 1.526 ± 0.342
AsnLys: 0.977 ± 0.273
AsnLeu: 2.503 ± 0.428
AsnMet: 0.733 ± 0.196
AsnAsn: 1.038 ± 0.354
AsnPro: 2.992 ± 0.394
AsnGln: 1.099 ± 0.249
AsnArg: 1.587 ± 0.299
AsnSer: 1.893 ± 0.4
AsnThr: 1.832 ± 0.321
AsnVal: 2.564 ± 0.432
AsnTrp: 0.794 ± 0.189
AsnTyr: 1.16 ± 0.298
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.129 ± 0.439
ProCys: 0.305 ± 0.127
ProAsp: 4.213 ± 0.439
ProGlu: 4.457 ± 0.504
ProPhe: 2.015 ± 0.398
ProGly: 5.129 ± 0.695
ProHis: 1.038 ± 0.25
ProIle: 2.259 ± 0.366
ProLys: 2.137 ± 0.363
ProLeu: 4.885 ± 0.668
ProMet: 1.16 ± 0.296
ProAsn: 1.649 ± 0.351
ProPro: 2.809 ± 0.444
ProGln: 1.587 ± 0.325
ProArg: 2.625 ± 0.456
ProSer: 4.091 ± 0.548
ProThr: 3.663 ± 0.468
ProVal: 4.152 ± 0.507
ProTrp: 1.099 ± 0.28
ProTyr: 1.771 ± 0.356
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.663 ± 0.682
GlnCys: 0.122 ± 0.092
GlnAsp: 1.465 ± 0.327
GlnGlu: 1.954 ± 0.318
GlnPhe: 1.221 ± 0.229
GlnGly: 2.564 ± 0.354
GlnHis: 0.488 ± 0.145
GlnIle: 2.32 ± 0.489
GlnLys: 1.587 ± 0.378
GlnLeu: 3.541 ± 0.462
GlnMet: 0.672 ± 0.191
GlnAsn: 0.733 ± 0.184
GlnPro: 1.954 ± 0.353
GlnGln: 1.771 ± 0.377
GlnArg: 1.649 ± 0.341
GlnSer: 1.71 ± 0.281
GlnThr: 1.587 ± 0.286
GlnVal: 2.503 ± 0.378
GlnTrp: 0.672 ± 0.171
GlnTyr: 0.733 ± 0.204
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.434 ± 0.632
ArgCys: 0.611 ± 0.188
ArgAsp: 3.236 ± 0.557
ArgGlu: 4.396 ± 0.595
ArgPhe: 1.893 ± 0.365
ArgGly: 5.068 ± 0.616
ArgHis: 0.855 ± 0.222
ArgIle: 3.175 ± 0.503
ArgLys: 2.625 ± 0.415
ArgLeu: 6.045 ± 0.725
ArgMet: 1.71 ± 0.302
ArgAsn: 2.503 ± 0.454
ArgPro: 2.748 ± 0.52
ArgGln: 2.015 ± 0.35
ArgArg: 5.129 ± 0.669
ArgSer: 4.03 ± 0.617
ArgThr: 3.236 ± 0.585
ArgVal: 5.19 ± 0.619
ArgTrp: 0.977 ± 0.227
ArgTyr: 1.832 ± 0.322
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.045 ± 0.668
SerCys: 0.305 ± 0.141
SerAsp: 3.786 ± 0.43
SerGlu: 3.908 ± 0.469
SerPhe: 2.015 ± 0.29
SerGly: 6.655 ± 1.02
SerHis: 1.343 ± 0.276
SerIle: 2.381 ± 0.345
SerLys: 2.381 ± 0.35
SerLeu: 5.556 ± 0.672
SerMet: 1.282 ± 0.269
SerAsn: 2.32 ± 0.44
SerPro: 2.992 ± 0.457
SerGln: 1.771 ± 0.308
SerArg: 3.053 ± 0.436
SerSer: 3.602 ± 0.704
SerThr: 3.358 ± 0.477
SerVal: 3.48 ± 0.458
SerTrp: 1.099 ± 0.27
SerTyr: 1.404 ± 0.291
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.472 ± 0.695
ThrCys: 0.366 ± 0.22
ThrAsp: 4.579 ± 0.518
ThrGlu: 4.457 ± 0.434
ThrPhe: 2.137 ± 0.368
ThrGly: 6.411 ± 0.535
ThrHis: 1.099 ± 0.295
ThrIle: 2.87 ± 0.524
ThrLys: 2.32 ± 0.346
ThrLeu: 5.251 ± 0.657
ThrMet: 0.916 ± 0.189
ThrAsn: 1.771 ± 0.336
ThrPro: 3.541 ± 0.45
ThrGln: 1.771 ± 0.296
ThrArg: 3.297 ± 0.524
ThrSer: 2.809 ± 0.397
ThrThr: 4.152 ± 0.548
ThrVal: 5.007 ± 0.584
ThrTrp: 1.221 ± 0.301
ThrTyr: 1.954 ± 0.362
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.961 ± 0.764
ValCys: 0.366 ± 0.138
ValAsp: 5.739 ± 0.672
ValGlu: 4.213 ± 0.536
ValPhe: 2.748 ± 0.313
ValGly: 3.969 ± 0.613
ValHis: 1.465 ± 0.313
ValIle: 3.541 ± 0.385
ValLys: 3.48 ± 0.45
ValLeu: 5.19 ± 0.498
ValMet: 1.587 ± 0.396
ValAsn: 2.381 ± 0.378
ValPro: 5.068 ± 0.558
ValGln: 2.259 ± 0.443
ValArg: 5.556 ± 0.739
ValSer: 4.579 ± 0.565
ValThr: 6.411 ± 0.563
ValVal: 5.19 ± 0.682
ValTrp: 0.916 ± 0.214
ValTyr: 2.259 ± 0.341
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.404 ± 0.3
TrpCys: 0.122 ± 0.077
TrpAsp: 1.465 ± 0.292
TrpGlu: 1.038 ± 0.232
TrpPhe: 0.977 ± 0.229
TrpGly: 1.587 ± 0.284
TrpHis: 0.488 ± 0.176
TrpIle: 1.282 ± 0.228
TrpLys: 0.305 ± 0.136
TrpLeu: 1.893 ± 0.278
TrpMet: 0.488 ± 0.17
TrpAsn: 0.366 ± 0.156
TrpPro: 0.733 ± 0.274
TrpGln: 0.794 ± 0.19
TrpArg: 1.282 ± 0.344
TrpSer: 0.855 ± 0.273
TrpThr: 1.526 ± 0.41
TrpVal: 1.832 ± 0.319
TrpTrp: 0.672 ± 0.27
TrpTyr: 0.305 ± 0.122
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.564 ± 0.448
TyrCys: 0.305 ± 0.132
TyrAsp: 1.282 ± 0.317
TyrGlu: 2.076 ± 0.314
TyrPhe: 0.672 ± 0.174
TyrGly: 2.503 ± 0.451
TyrHis: 0.733 ± 0.199
TyrIle: 1.526 ± 0.312
TyrLys: 1.465 ± 0.294
TyrLeu: 2.381 ± 0.425
TyrMet: 0.611 ± 0.183
TyrAsn: 1.282 ± 0.293
TyrPro: 1.282 ± 0.302
TyrGln: 1.16 ± 0.241
TyrArg: 2.503 ± 0.346
TyrSer: 1.282 ± 0.256
TyrThr: 2.076 ± 0.358
TyrVal: 2.015 ± 0.329
TyrTrp: 0.305 ± 0.137
TyrTyr: 0.794 ± 0.23
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 89 proteins (16379 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
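Protein-level bootstrapping of this kind can be sketched as follows. The source states only that 100 bootstrap rounds were run at the protein level; the function names, the resampling of whole proteins with replacement at the original sample size, and the use of the standard deviation across replicates as the reported error are assumptions made for illustration.

```python
import random
import statistics
from collections import Counter


def dipeptide_freq(proteins, dipeptide):
    """Per-mille frequency of one dipeptide, pooled over the given proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_rounds):
        # Draw whole proteins with replacement, keeping the sample size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(dipeptide_freq(sample, dipeptide))
    return statistics.pstdev(replicates)


# Toy input for illustration only, not taken from the Bones proteome.
toy = ["MAAGLA", "AAV", "MKLV", "GAAS"]
err = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual dipeptides keeps residues from the same protein together, so the error reflects variation between proteins.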

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski