Amino acid dipepetide frequency for Mycobacterium phage D29 (Mycobacteriophage D29)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.556AlaAla: 11.556 ± 1.098
0.605AlaCys: 0.605 ± 0.175
6.383AlaAsp: 6.383 ± 0.762
7.794AlaGlu: 7.794 ± 0.788
4.501AlaPhe: 4.501 ± 0.589
7.592AlaGly: 7.592 ± 0.92
1.344AlaHis: 1.344 ± 0.311
4.77AlaIle: 4.77 ± 0.623
4.636AlaLys: 4.636 ± 0.569
8.465AlaLeu: 8.465 ± 0.87
3.292AlaMet: 3.292 ± 0.458
3.494AlaAsn: 3.494 ± 0.411
4.77AlaPro: 4.77 ± 0.582
3.964AlaGln: 3.964 ± 0.514
6.316AlaArg: 6.316 ± 0.746
4.031AlaSer: 4.031 ± 0.545
4.636AlaThr: 4.636 ± 0.536
7.525AlaVal: 7.525 ± 0.697
2.083AlaTrp: 2.083 ± 0.445
2.822AlaTyr: 2.822 ± 0.551
0.0AlaXaa: 0.0 ± 0.0
Cys
0.739CysAla: 0.739 ± 0.26
0.0CysCys: 0.0 ± 0.0
0.537CysAsp: 0.537 ± 0.185
0.403CysGlu: 0.403 ± 0.162
0.336CysPhe: 0.336 ± 0.145
0.605CysGly: 0.605 ± 0.19
0.202CysHis: 0.202 ± 0.135
0.269CysIle: 0.269 ± 0.187
0.537CysLys: 0.537 ± 0.158
0.739CysLeu: 0.739 ± 0.229
0.067CysMet: 0.067 ± 0.067
0.336CysAsn: 0.336 ± 0.147
0.672CysPro: 0.672 ± 0.238
0.067CysGln: 0.067 ± 0.074
0.672CysArg: 0.672 ± 0.177
0.605CysSer: 0.605 ± 0.21
0.403CysThr: 0.403 ± 0.169
0.605CysVal: 0.605 ± 0.183
0.336CysTrp: 0.336 ± 0.152
0.537CysTyr: 0.537 ± 0.181
0.0CysXaa: 0.0 ± 0.0
Asp
6.383AspAla: 6.383 ± 0.65
0.806AspCys: 0.806 ± 0.265
3.494AspAsp: 3.494 ± 0.406
4.501AspGlu: 4.501 ± 0.622
2.687AspPhe: 2.687 ± 0.47
6.248AspGly: 6.248 ± 0.57
1.747AspHis: 1.747 ± 0.359
3.292AspIle: 3.292 ± 0.421
2.284AspLys: 2.284 ± 0.412
5.106AspLeu: 5.106 ± 0.661
1.344AspMet: 1.344 ± 0.252
1.948AspAsn: 1.948 ± 0.414
5.173AspPro: 5.173 ± 0.622
2.016AspGln: 2.016 ± 0.37
3.292AspArg: 3.292 ± 0.534
2.687AspSer: 2.687 ± 0.483
3.426AspThr: 3.426 ± 0.39
4.501AspVal: 4.501 ± 0.556
1.411AspTrp: 1.411 ± 0.347
2.352AspTyr: 2.352 ± 0.359
0.0AspXaa: 0.0 ± 0.0
Glu
7.726GluAla: 7.726 ± 0.841
0.202GluCys: 0.202 ± 0.114
4.367GluAsp: 4.367 ± 0.577
5.308GluGlu: 5.308 ± 0.861
2.956GluPhe: 2.956 ± 0.355
4.636GluGly: 4.636 ± 0.681
1.277GluHis: 1.277 ± 0.352
3.897GluIle: 3.897 ± 0.491
2.419GluLys: 2.419 ± 0.387
7.055GluLeu: 7.055 ± 0.802
2.083GluMet: 2.083 ± 0.402
2.016GluAsn: 2.016 ± 0.358
2.419GluPro: 2.419 ± 0.4
2.217GluGln: 2.217 ± 0.349
4.166GluArg: 4.166 ± 0.606
3.023GluSer: 3.023 ± 0.49
3.83GluThr: 3.83 ± 0.381
4.636GluVal: 4.636 ± 0.501
1.411GluTrp: 1.411 ± 0.331
2.15GluTyr: 2.15 ± 0.344
0.0GluXaa: 0.0 ± 0.0
Phe
2.889PheAla: 2.889 ± 0.513
0.403PheCys: 0.403 ± 0.172
2.755PheAsp: 2.755 ± 0.518
2.217PheGlu: 2.217 ± 0.349
0.873PhePhe: 0.873 ± 0.237
3.158PheGly: 3.158 ± 0.589
0.672PheHis: 0.672 ± 0.226
1.344PheIle: 1.344 ± 0.266
1.478PheLys: 1.478 ± 0.292
2.889PheLeu: 2.889 ± 0.463
0.47PheMet: 0.47 ± 0.179
1.209PheAsn: 1.209 ± 0.274
1.948PhePro: 1.948 ± 0.353
1.008PheGln: 1.008 ± 0.342
2.62PheArg: 2.62 ± 0.4
2.687PheSer: 2.687 ± 0.514
2.822PheThr: 2.822 ± 0.37
2.217PheVal: 2.217 ± 0.365
0.605PheTrp: 0.605 ± 0.201
0.941PheTyr: 0.941 ± 0.217
0.0PheXaa: 0.0 ± 0.0
Gly
7.189GlyAla: 7.189 ± 1.02
0.739GlyCys: 0.739 ± 0.217
5.778GlyAsp: 5.778 ± 0.643
4.233GlyGlu: 4.233 ± 0.64
3.83GlyPhe: 3.83 ± 0.613
8.6GlyGly: 8.6 ± 1.495
2.419GlyHis: 2.419 ± 0.399
4.77GlyIle: 4.77 ± 0.838
3.494GlyLys: 3.494 ± 0.419
6.987GlyLeu: 6.987 ± 0.87
1.612GlyMet: 1.612 ± 0.302
2.755GlyAsn: 2.755 ± 0.38
3.091GlyPro: 3.091 ± 0.384
3.897GlyGln: 3.897 ± 0.548
4.031GlyArg: 4.031 ± 0.45
3.628GlySer: 3.628 ± 0.536
4.837GlyThr: 4.837 ± 0.636
6.248GlyVal: 6.248 ± 0.722
1.612GlyTrp: 1.612 ± 0.293
2.352GlyTyr: 2.352 ± 0.422
0.0GlyXaa: 0.0 ± 0.0
His
1.411HisAla: 1.411 ± 0.33
0.202HisCys: 0.202 ± 0.124
1.68HisAsp: 1.68 ± 0.341
1.277HisGlu: 1.277 ± 0.307
0.739HisPhe: 0.739 ± 0.215
1.948HisGly: 1.948 ± 0.446
0.537HisHis: 0.537 ± 0.177
1.142HisIle: 1.142 ± 0.244
1.142HisLys: 1.142 ± 0.318
1.747HisLeu: 1.747 ± 0.321
0.336HisMet: 0.336 ± 0.162
0.403HisAsn: 0.403 ± 0.138
1.277HisPro: 1.277 ± 0.255
0.941HisGln: 0.941 ± 0.241
1.881HisArg: 1.881 ± 0.396
1.075HisSer: 1.075 ± 0.232
1.411HisThr: 1.411 ± 0.345
1.478HisVal: 1.478 ± 0.328
0.336HisTrp: 0.336 ± 0.173
0.605HisTyr: 0.605 ± 0.249
0.0HisXaa: 0.0 ± 0.0
Ile
5.778IleAla: 5.778 ± 0.496
0.47IleCys: 0.47 ± 0.179
3.494IleAsp: 3.494 ± 0.461
4.098IleGlu: 4.098 ± 0.437
1.277IlePhe: 1.277 ± 0.267
4.3IleGly: 4.3 ± 0.608
1.209IleHis: 1.209 ± 0.232
1.545IleIle: 1.545 ± 0.281
2.553IleLys: 2.553 ± 0.407
4.3IleLeu: 4.3 ± 0.453
0.605IleMet: 0.605 ± 0.169
1.948IleAsn: 1.948 ± 0.302
3.628IlePro: 3.628 ± 0.49
1.747IleGln: 1.747 ± 0.364
3.158IleArg: 3.158 ± 0.443
2.687IleSer: 2.687 ± 0.538
3.225IleThr: 3.225 ± 0.342
3.023IleVal: 3.023 ± 0.498
0.672IleTrp: 0.672 ± 0.221
0.873IleTyr: 0.873 ± 0.208
0.0IleXaa: 0.0 ± 0.0
Lys
5.039LysAla: 5.039 ± 0.615
0.336LysCys: 0.336 ± 0.182
2.62LysAsp: 2.62 ± 0.316
2.62LysGlu: 2.62 ± 0.349
0.739LysPhe: 0.739 ± 0.244
3.023LysGly: 3.023 ± 0.429
0.739LysHis: 0.739 ± 0.219
2.083LysIle: 2.083 ± 0.382
3.292LysLys: 3.292 ± 0.661
3.897LysLeu: 3.897 ± 0.505
1.209LysMet: 1.209 ± 0.33
1.948LysAsn: 1.948 ± 0.386
3.494LysPro: 3.494 ± 0.657
1.814LysGln: 1.814 ± 0.33
2.553LysArg: 2.553 ± 0.466
1.68LysSer: 1.68 ± 0.346
2.822LysThr: 2.822 ± 0.433
3.897LysVal: 3.897 ± 0.464
0.941LysTrp: 0.941 ± 0.28
1.075LysTyr: 1.075 ± 0.242
0.0LysXaa: 0.0 ± 0.0
Leu
8.801LeuAla: 8.801 ± 0.71
0.605LeuCys: 0.605 ± 0.189
4.972LeuAsp: 4.972 ± 0.522
5.241LeuGlu: 5.241 ± 0.633
2.553LeuPhe: 2.553 ± 0.361
7.525LeuGly: 7.525 ± 0.991
2.553LeuHis: 2.553 ± 0.522
4.434LeuIle: 4.434 ± 0.428
3.225LeuLys: 3.225 ± 0.416
5.644LeuLeu: 5.644 ± 0.607
2.755LeuMet: 2.755 ± 0.452
2.352LeuAsn: 2.352 ± 0.475
4.233LeuPro: 4.233 ± 0.509
2.553LeuGln: 2.553 ± 0.61
5.442LeuArg: 5.442 ± 0.7
5.442LeuSer: 5.442 ± 0.838
5.106LeuThr: 5.106 ± 0.719
5.106LeuVal: 5.106 ± 0.531
1.545LeuTrp: 1.545 ± 0.317
2.486LeuTyr: 2.486 ± 0.431
0.0LeuXaa: 0.0 ± 0.0
Met
2.553MetAla: 2.553 ± 0.43
0.269MetCys: 0.269 ± 0.135
1.008MetAsp: 1.008 ± 0.23
1.008MetGlu: 1.008 ± 0.217
0.403MetPhe: 0.403 ± 0.206
1.68MetGly: 1.68 ± 0.313
0.403MetHis: 0.403 ± 0.178
1.344MetIle: 1.344 ± 0.277
1.814MetLys: 1.814 ± 0.38
1.344MetLeu: 1.344 ± 0.315
0.47MetMet: 0.47 ± 0.147
0.806MetAsn: 0.806 ± 0.27
1.344MetPro: 1.344 ± 0.334
0.873MetGln: 0.873 ± 0.264
1.344MetArg: 1.344 ± 0.333
1.948MetSer: 1.948 ± 0.321
2.016MetThr: 2.016 ± 0.345
1.411MetVal: 1.411 ± 0.264
0.269MetTrp: 0.269 ± 0.113
0.806MetTyr: 0.806 ± 0.228
0.0MetXaa: 0.0 ± 0.0
Asn
3.561AsnAla: 3.561 ± 0.576
0.47AsnCys: 0.47 ± 0.151
1.948AsnAsp: 1.948 ± 0.375
1.612AsnGlu: 1.612 ± 0.401
0.941AsnPhe: 0.941 ± 0.258
3.292AsnGly: 3.292 ± 0.532
0.806AsnHis: 0.806 ± 0.24
1.545AsnIle: 1.545 ± 0.313
1.008AsnLys: 1.008 ± 0.255
3.225AsnLeu: 3.225 ± 0.371
0.806AsnMet: 0.806 ± 0.213
0.403AsnAsn: 0.403 ± 0.162
2.822AsnPro: 2.822 ± 0.423
0.806AsnGln: 0.806 ± 0.291
2.284AsnArg: 2.284 ± 0.433
0.873AsnSer: 0.873 ± 0.249
1.881AsnThr: 1.881 ± 0.405
2.62AsnVal: 2.62 ± 0.399
0.47AsnTrp: 0.47 ± 0.179
1.008AsnTyr: 1.008 ± 0.219
0.0AsnXaa: 0.0 ± 0.0
Pro
5.173ProAla: 5.173 ± 0.569
0.403ProCys: 0.403 ± 0.195
4.098ProAsp: 4.098 ± 0.576
3.695ProGlu: 3.695 ± 0.643
1.747ProPhe: 1.747 ± 0.417
4.569ProGly: 4.569 ± 0.606
1.344ProHis: 1.344 ± 0.291
2.419ProIle: 2.419 ± 0.364
3.023ProLys: 3.023 ± 0.579
3.628ProLeu: 3.628 ± 0.45
1.277ProMet: 1.277 ± 0.321
2.352ProAsn: 2.352 ± 0.433
2.083ProPro: 2.083 ± 0.403
1.142ProGln: 1.142 ± 0.293
3.494ProArg: 3.494 ± 0.498
2.62ProSer: 2.62 ± 0.444
3.762ProThr: 3.762 ± 0.515
4.3ProVal: 4.3 ± 0.484
1.478ProTrp: 1.478 ± 0.391
1.478ProTyr: 1.478 ± 0.294
0.0ProXaa: 0.0 ± 0.0
Gln
3.964GlnAla: 3.964 ± 0.559
0.134GlnCys: 0.134 ± 0.086
0.873GlnAsp: 0.873 ± 0.306
1.881GlnGlu: 1.881 ± 0.349
1.209GlnPhe: 1.209 ± 0.265
2.284GlnGly: 2.284 ± 0.403
0.672GlnHis: 0.672 ± 0.178
3.091GlnIle: 3.091 ± 0.518
1.814GlnLys: 1.814 ± 0.412
3.494GlnLeu: 3.494 ± 0.662
0.403GlnMet: 0.403 ± 0.154
1.008GlnAsn: 1.008 ± 0.26
1.68GlnPro: 1.68 ± 0.402
1.68GlnGln: 1.68 ± 0.424
2.687GlnArg: 2.687 ± 0.5
1.411GlnSer: 1.411 ± 0.251
2.083GlnThr: 2.083 ± 0.452
2.822GlnVal: 2.822 ± 0.346
0.739GlnTrp: 0.739 ± 0.24
1.209GlnTyr: 1.209 ± 0.306
0.0GlnXaa: 0.0 ± 0.0
Arg
5.039ArgAla: 5.039 ± 0.668
1.209ArgCys: 1.209 ± 0.349
5.039ArgAsp: 5.039 ± 0.67
5.509ArgGlu: 5.509 ± 0.599
2.419ArgPhe: 2.419 ± 0.523
3.561ArgGly: 3.561 ± 0.51
1.344ArgHis: 1.344 ± 0.349
3.628ArgIle: 3.628 ± 0.492
3.292ArgLys: 3.292 ± 0.479
5.98ArgLeu: 5.98 ± 0.693
1.478ArgMet: 1.478 ± 0.328
2.016ArgAsn: 2.016 ± 0.344
2.822ArgPro: 2.822 ± 0.395
1.478ArgGln: 1.478 ± 0.329
5.576ArgArg: 5.576 ± 0.899
3.292ArgSer: 3.292 ± 0.44
2.352ArgThr: 2.352 ± 0.358
4.501ArgVal: 4.501 ± 0.459
1.411ArgTrp: 1.411 ± 0.281
2.62ArgTyr: 2.62 ± 0.42
0.0ArgXaa: 0.0 ± 0.0
Ser
4.905SerAla: 4.905 ± 0.504
0.336SerCys: 0.336 ± 0.143
3.292SerAsp: 3.292 ± 0.483
2.956SerGlu: 2.956 ± 0.439
2.016SerPhe: 2.016 ± 0.321
4.703SerGly: 4.703 ± 0.513
0.739SerHis: 0.739 ± 0.197
2.217SerIle: 2.217 ± 0.443
2.016SerLys: 2.016 ± 0.413
3.561SerLeu: 3.561 ± 0.486
1.277SerMet: 1.277 ± 0.299
1.142SerAsn: 1.142 ± 0.269
2.822SerPro: 2.822 ± 0.427
2.419SerGln: 2.419 ± 0.375
4.031SerArg: 4.031 ± 0.517
3.091SerSer: 3.091 ± 0.684
3.158SerThr: 3.158 ± 0.396
3.091SerVal: 3.091 ± 0.499
1.545SerTrp: 1.545 ± 0.356
1.142SerTyr: 1.142 ± 0.269
0.0SerXaa: 0.0 ± 0.0
Thr
6.114ThrAla: 6.114 ± 0.534
0.403ThrCys: 0.403 ± 0.145
3.359ThrAsp: 3.359 ± 0.515
3.225ThrGlu: 3.225 ± 0.525
1.814ThrPhe: 1.814 ± 0.338
5.442ThrGly: 5.442 ± 0.632
1.075ThrHis: 1.075 ± 0.307
2.822ThrIle: 2.822 ± 0.504
3.762ThrLys: 3.762 ± 0.549
4.703ThrLeu: 4.703 ± 0.661
1.612ThrMet: 1.612 ± 0.334
1.545ThrAsn: 1.545 ± 0.311
4.031ThrPro: 4.031 ± 0.585
2.083ThrGln: 2.083 ± 0.364
3.225ThrArg: 3.225 ± 0.44
2.553ThrSer: 2.553 ± 0.499
2.687ThrThr: 2.687 ± 0.5
5.375ThrVal: 5.375 ± 0.609
0.941ThrTrp: 0.941 ± 0.225
1.814ThrTyr: 1.814 ± 0.335
0.0ThrXaa: 0.0 ± 0.0
Val
6.92ValAla: 6.92 ± 0.856
0.605ValCys: 0.605 ± 0.185
5.644ValAsp: 5.644 ± 0.601
6.45ValGlu: 6.45 ± 0.7
2.352ValPhe: 2.352 ± 0.456
5.308ValGly: 5.308 ± 0.686
1.545ValHis: 1.545 ± 0.287
3.158ValIle: 3.158 ± 0.501
3.091ValLys: 3.091 ± 0.467
5.442ValLeu: 5.442 ± 0.632
0.873ValMet: 0.873 ± 0.239
2.956ValAsn: 2.956 ± 0.597
3.561ValPro: 3.561 ± 0.476
2.284ValGln: 2.284 ± 0.526
4.569ValArg: 4.569 ± 0.58
3.83ValSer: 3.83 ± 0.548
4.972ValThr: 4.972 ± 0.55
5.912ValVal: 5.912 ± 0.505
1.478ValTrp: 1.478 ± 0.365
1.747ValTyr: 1.747 ± 0.315
0.0ValXaa: 0.0 ± 0.0
Trp
2.016TrpAla: 2.016 ± 0.452
0.269TrpCys: 0.269 ± 0.16
1.142TrpAsp: 1.142 ± 0.268
1.612TrpGlu: 1.612 ± 0.323
0.806TrpPhe: 0.806 ± 0.232
1.344TrpGly: 1.344 ± 0.345
0.537TrpHis: 0.537 ± 0.208
1.209TrpIle: 1.209 ± 0.23
0.47TrpLys: 0.47 ± 0.175
1.478TrpLeu: 1.478 ± 0.291
0.47TrpMet: 0.47 ± 0.139
0.806TrpAsn: 0.806 ± 0.213
0.739TrpPro: 0.739 ± 0.248
1.209TrpGln: 1.209 ± 0.249
1.075TrpArg: 1.075 ± 0.24
1.411TrpSer: 1.411 ± 0.298
1.545TrpThr: 1.545 ± 0.307
1.209TrpVal: 1.209 ± 0.261
0.336TrpTrp: 0.336 ± 0.177
0.537TrpTyr: 0.537 ± 0.228
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.889TyrAla: 2.889 ± 0.408
0.202TyrCys: 0.202 ± 0.106
2.217TyrAsp: 2.217 ± 0.317
2.15TyrGlu: 2.15 ± 0.328
1.008TyrPhe: 1.008 ± 0.269
2.284TyrGly: 2.284 ± 0.384
0.403TyrHis: 0.403 ± 0.148
1.612TyrIle: 1.612 ± 0.283
0.47TyrLys: 0.47 ± 0.175
2.889TyrLeu: 2.889 ± 0.371
0.605TyrMet: 0.605 ± 0.201
0.941TyrAsn: 0.941 ± 0.213
1.478TyrPro: 1.478 ± 0.326
0.941TyrGln: 0.941 ± 0.255
2.15TyrArg: 2.15 ± 0.357
1.881TyrSer: 1.881 ± 0.311
1.612TyrThr: 1.612 ± 0.316
2.217TyrVal: 2.217 ± 0.495
0.605TyrTrp: 0.605 ± 0.235
0.605TyrTyr: 0.605 ± 0.242
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 77 proteins (14885 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski