Amino acid dipepetide frequency for Streptomyces phage Toma

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.873AlaAla: 10.873 ± 0.943
0.506AlaCys: 0.506 ± 0.18
6.511AlaAsp: 6.511 ± 0.659
7.459AlaGlu: 7.459 ± 0.895
2.908AlaPhe: 2.908 ± 0.558
7.459AlaGly: 7.459 ± 0.904
2.023AlaHis: 2.023 ± 0.514
4.994AlaIle: 4.994 ± 0.607
4.994AlaLys: 4.994 ± 0.545
10.43AlaLeu: 10.43 ± 1.165
3.098AlaMet: 3.098 ± 0.371
3.224AlaAsn: 3.224 ± 0.565
4.741AlaPro: 4.741 ± 0.501
3.856AlaGln: 3.856 ± 0.503
6.701AlaArg: 6.701 ± 0.778
4.299AlaSer: 4.299 ± 0.685
6.322AlaThr: 6.322 ± 0.687
8.092AlaVal: 8.092 ± 0.84
1.77AlaTrp: 1.77 ± 0.366
3.034AlaTyr: 3.034 ± 0.453
0.0AlaXaa: 0.0 ± 0.0
Cys
0.759CysAla: 0.759 ± 0.199
0.063CysCys: 0.063 ± 0.067
0.443CysAsp: 0.443 ± 0.174
0.759CysGlu: 0.759 ± 0.213
0.253CysPhe: 0.253 ± 0.159
0.632CysGly: 0.632 ± 0.194
0.19CysHis: 0.19 ± 0.093
0.253CysIle: 0.253 ± 0.119
0.316CysLys: 0.316 ± 0.145
0.253CysLeu: 0.253 ± 0.137
0.063CysMet: 0.063 ± 0.069
0.19CysAsn: 0.19 ± 0.099
0.506CysPro: 0.506 ± 0.179
0.19CysGln: 0.19 ± 0.139
0.506CysArg: 0.506 ± 0.198
0.569CysSer: 0.569 ± 0.217
0.316CysThr: 0.316 ± 0.185
0.506CysVal: 0.506 ± 0.181
0.19CysTrp: 0.19 ± 0.106
0.253CysTyr: 0.253 ± 0.127
0.0CysXaa: 0.0 ± 0.0
Asp
7.27AspAla: 7.27 ± 0.647
0.253AspCys: 0.253 ± 0.155
3.919AspAsp: 3.919 ± 0.652
4.868AspGlu: 4.868 ± 0.609
2.276AspPhe: 2.276 ± 0.451
6.764AspGly: 6.764 ± 0.643
1.264AspHis: 1.264 ± 0.351
2.971AspIle: 2.971 ± 0.436
2.213AspLys: 2.213 ± 0.374
6.069AspLeu: 6.069 ± 0.673
1.77AspMet: 1.77 ± 0.308
1.517AspAsn: 1.517 ± 0.331
4.235AspPro: 4.235 ± 0.533
1.833AspGln: 1.833 ± 0.338
2.908AspArg: 2.908 ± 0.458
3.477AspSer: 3.477 ± 0.452
3.477AspThr: 3.477 ± 0.535
3.73AspVal: 3.73 ± 0.549
1.77AspTrp: 1.77 ± 0.233
1.454AspTyr: 1.454 ± 0.346
0.0AspXaa: 0.0 ± 0.0
Glu
7.459GluAla: 7.459 ± 0.913
0.822GluCys: 0.822 ± 0.264
4.172GluAsp: 4.172 ± 0.534
5.753GluGlu: 5.753 ± 1.028
1.896GluPhe: 1.896 ± 0.297
6.638GluGly: 6.638 ± 0.658
1.707GluHis: 1.707 ± 0.443
3.856GluIle: 3.856 ± 0.453
2.023GluLys: 2.023 ± 0.396
6.385GluLeu: 6.385 ± 0.853
1.328GluMet: 1.328 ± 0.265
2.149GluAsn: 2.149 ± 0.343
3.034GluPro: 3.034 ± 0.42
2.276GluGln: 2.276 ± 0.41
4.109GluArg: 4.109 ± 0.625
4.741GluSer: 4.741 ± 0.749
4.172GluThr: 4.172 ± 0.66
5.247GluVal: 5.247 ± 0.738
1.264GluTrp: 1.264 ± 0.285
2.402GluTyr: 2.402 ± 0.48
0.0GluXaa: 0.0 ± 0.0
Phe
3.098PheAla: 3.098 ± 0.502
0.253PheCys: 0.253 ± 0.134
2.213PheAsp: 2.213 ± 0.436
2.781PheGlu: 2.781 ± 0.477
0.948PhePhe: 0.948 ± 0.223
2.971PheGly: 2.971 ± 0.466
0.379PheHis: 0.379 ± 0.133
1.454PheIle: 1.454 ± 0.324
1.454PheLys: 1.454 ± 0.345
2.465PheLeu: 2.465 ± 0.394
0.885PheMet: 0.885 ± 0.249
1.011PheAsn: 1.011 ± 0.266
1.264PhePro: 1.264 ± 0.323
1.328PheGln: 1.328 ± 0.316
2.276PheArg: 2.276 ± 0.441
1.896PheSer: 1.896 ± 0.393
2.023PheThr: 2.023 ± 0.364
1.58PheVal: 1.58 ± 0.318
0.759PheTrp: 0.759 ± 0.2
0.948PheTyr: 0.948 ± 0.273
0.0PheXaa: 0.0 ± 0.0
Gly
7.459GlyAla: 7.459 ± 0.887
0.506GlyCys: 0.506 ± 0.21
6.195GlyAsp: 6.195 ± 0.854
5.12GlyGlu: 5.12 ± 0.586
2.845GlyPhe: 2.845 ± 0.486
7.965GlyGly: 7.965 ± 1.033
1.896GlyHis: 1.896 ± 0.401
3.73GlyIle: 3.73 ± 0.687
4.741GlyLys: 4.741 ± 0.631
6.764GlyLeu: 6.764 ± 0.814
1.58GlyMet: 1.58 ± 0.319
2.465GlyAsn: 2.465 ± 0.374
3.54GlyPro: 3.54 ± 0.623
3.477GlyGln: 3.477 ± 0.487
4.994GlyArg: 4.994 ± 0.565
5.31GlySer: 5.31 ± 0.768
6.258GlyThr: 6.258 ± 1.027
6.511GlyVal: 6.511 ± 0.561
2.402GlyTrp: 2.402 ± 0.395
2.908GlyTyr: 2.908 ± 0.428
0.0GlyXaa: 0.0 ± 0.0
His
2.023HisAla: 2.023 ± 0.376
0.19HisCys: 0.19 ± 0.108
1.138HisAsp: 1.138 ± 0.28
1.264HisGlu: 1.264 ± 0.31
0.885HisPhe: 0.885 ± 0.236
2.149HisGly: 2.149 ± 0.483
0.569HisHis: 0.569 ± 0.198
0.695HisIle: 0.695 ± 0.237
0.506HisLys: 0.506 ± 0.172
1.77HisLeu: 1.77 ± 0.347
0.19HisMet: 0.19 ± 0.1
0.506HisAsn: 0.506 ± 0.187
1.011HisPro: 1.011 ± 0.276
0.569HisGln: 0.569 ± 0.177
1.075HisArg: 1.075 ± 0.253
1.454HisSer: 1.454 ± 0.37
1.58HisThr: 1.58 ± 0.301
1.328HisVal: 1.328 ± 0.323
0.759HisTrp: 0.759 ± 0.266
0.759HisTyr: 0.759 ± 0.242
0.0HisXaa: 0.0 ± 0.0
Ile
4.994IleAla: 4.994 ± 0.564
0.063IleCys: 0.063 ± 0.082
3.477IleAsp: 3.477 ± 0.465
4.741IleGlu: 4.741 ± 0.624
1.328IlePhe: 1.328 ± 0.264
3.224IleGly: 3.224 ± 0.695
0.759IleHis: 0.759 ± 0.22
2.402IleIle: 2.402 ± 0.677
2.276IleLys: 2.276 ± 0.564
2.971IleLeu: 2.971 ± 0.692
0.632IleMet: 0.632 ± 0.243
1.454IleAsn: 1.454 ± 0.303
2.465IlePro: 2.465 ± 0.37
1.391IleGln: 1.391 ± 0.309
3.287IleArg: 3.287 ± 0.484
2.402IleSer: 2.402 ± 0.466
2.592IleThr: 2.592 ± 0.404
2.845IleVal: 2.845 ± 0.522
0.759IleTrp: 0.759 ± 0.197
1.707IleTyr: 1.707 ± 0.379
0.0IleXaa: 0.0 ± 0.0
Lys
4.678LysAla: 4.678 ± 0.609
0.316LysCys: 0.316 ± 0.168
2.655LysAsp: 2.655 ± 0.465
2.529LysGlu: 2.529 ± 0.417
1.075LysPhe: 1.075 ± 0.268
4.425LysGly: 4.425 ± 0.62
0.695LysHis: 0.695 ± 0.209
2.213LysIle: 2.213 ± 0.491
2.023LysLys: 2.023 ± 0.473
4.046LysLeu: 4.046 ± 0.491
0.885LysMet: 0.885 ± 0.196
1.454LysAsn: 1.454 ± 0.304
2.781LysPro: 2.781 ± 0.592
1.707LysGln: 1.707 ± 0.377
3.73LysArg: 3.73 ± 0.552
2.023LysSer: 2.023 ± 0.397
2.845LysThr: 2.845 ± 0.447
2.781LysVal: 2.781 ± 0.421
0.759LysTrp: 0.759 ± 0.274
1.075LysTyr: 1.075 ± 0.307
0.0LysXaa: 0.0 ± 0.0
Leu
10.683LeuAla: 10.683 ± 0.836
0.569LeuCys: 0.569 ± 0.197
6.322LeuAsp: 6.322 ± 0.604
5.626LeuGlu: 5.626 ± 0.624
1.517LeuPhe: 1.517 ± 0.321
6.448LeuGly: 6.448 ± 0.707
1.77LeuHis: 1.77 ± 0.386
4.046LeuIle: 4.046 ± 0.542
3.603LeuLys: 3.603 ± 0.496
6.701LeuLeu: 6.701 ± 0.734
2.086LeuMet: 2.086 ± 0.417
3.414LeuAsn: 3.414 ± 0.481
4.172LeuPro: 4.172 ± 0.605
1.96LeuGln: 1.96 ± 0.401
5.816LeuArg: 5.816 ± 0.588
5.31LeuSer: 5.31 ± 0.701
5.31LeuThr: 5.31 ± 0.67
5.942LeuVal: 5.942 ± 0.682
1.138LeuTrp: 1.138 ± 0.252
1.644LeuTyr: 1.644 ± 0.317
0.0LeuXaa: 0.0 ± 0.0
Met
2.781MetAla: 2.781 ± 0.451
0.126MetCys: 0.126 ± 0.093
0.695MetAsp: 0.695 ± 0.235
0.885MetGlu: 0.885 ± 0.202
0.632MetPhe: 0.632 ± 0.247
1.77MetGly: 1.77 ± 0.534
0.379MetHis: 0.379 ± 0.116
1.328MetIle: 1.328 ± 0.252
1.328MetLys: 1.328 ± 0.292
1.644MetLeu: 1.644 ± 0.306
0.443MetMet: 0.443 ± 0.184
0.569MetAsn: 0.569 ± 0.157
1.201MetPro: 1.201 ± 0.266
0.695MetGln: 0.695 ± 0.185
1.517MetArg: 1.517 ± 0.328
2.339MetSer: 2.339 ± 0.373
1.833MetThr: 1.833 ± 0.337
1.391MetVal: 1.391 ± 0.313
0.316MetTrp: 0.316 ± 0.14
0.253MetTyr: 0.253 ± 0.145
0.0MetXaa: 0.0 ± 0.0
Asn
2.781AsnAla: 2.781 ± 0.399
0.443AsnCys: 0.443 ± 0.224
1.77AsnAsp: 1.77 ± 0.35
2.023AsnGlu: 2.023 ± 0.352
1.138AsnPhe: 1.138 ± 0.268
3.35AsnGly: 3.35 ± 0.402
0.759AsnHis: 0.759 ± 0.243
1.011AsnIle: 1.011 ± 0.22
0.885AsnLys: 0.885 ± 0.285
3.034AsnLeu: 3.034 ± 0.458
0.379AsnMet: 0.379 ± 0.152
0.885AsnAsn: 0.885 ± 0.222
1.96AsnPro: 1.96 ± 0.374
1.011AsnGln: 1.011 ± 0.291
2.149AsnArg: 2.149 ± 0.344
1.454AsnSer: 1.454 ± 0.303
2.339AsnThr: 2.339 ± 0.376
1.833AsnVal: 1.833 ± 0.374
0.695AsnTrp: 0.695 ± 0.202
1.138AsnTyr: 1.138 ± 0.262
0.0AsnXaa: 0.0 ± 0.0
Pro
4.804ProAla: 4.804 ± 0.544
0.695ProCys: 0.695 ± 0.202
3.983ProAsp: 3.983 ± 0.446
3.983ProGlu: 3.983 ± 0.477
1.644ProPhe: 1.644 ± 0.351
4.551ProGly: 4.551 ± 0.442
0.506ProHis: 0.506 ± 0.174
2.529ProIle: 2.529 ± 0.542
2.718ProLys: 2.718 ± 0.483
2.971ProLeu: 2.971 ± 0.479
0.759ProMet: 0.759 ± 0.237
1.264ProAsn: 1.264 ± 0.28
1.896ProPro: 1.896 ± 0.382
2.149ProGln: 2.149 ± 0.324
2.465ProArg: 2.465 ± 0.412
3.224ProSer: 3.224 ± 0.556
3.161ProThr: 3.161 ± 0.499
3.603ProVal: 3.603 ± 0.409
0.632ProTrp: 0.632 ± 0.174
0.759ProTyr: 0.759 ± 0.241
0.0ProXaa: 0.0 ± 0.0
Gln
3.73GlnAla: 3.73 ± 0.415
0.253GlnCys: 0.253 ± 0.19
1.833GlnAsp: 1.833 ± 0.367
1.833GlnGlu: 1.833 ± 0.387
1.454GlnPhe: 1.454 ± 0.241
2.023GlnGly: 2.023 ± 0.351
0.569GlnHis: 0.569 ± 0.217
2.086GlnIle: 2.086 ± 0.424
1.707GlnLys: 1.707 ± 0.362
2.718GlnLeu: 2.718 ± 0.469
1.201GlnMet: 1.201 ± 0.306
0.948GlnAsn: 0.948 ± 0.242
1.454GlnPro: 1.454 ± 0.342
0.822GlnGln: 0.822 ± 0.206
2.465GlnArg: 2.465 ± 0.405
1.58GlnSer: 1.58 ± 0.324
1.644GlnThr: 1.644 ± 0.33
2.908GlnVal: 2.908 ± 0.438
0.569GlnTrp: 0.569 ± 0.198
0.948GlnTyr: 0.948 ± 0.274
0.0GlnXaa: 0.0 ± 0.0
Arg
5.563ArgAla: 5.563 ± 0.62
0.569ArgCys: 0.569 ± 0.236
4.046ArgAsp: 4.046 ± 0.478
4.046ArgGlu: 4.046 ± 0.735
3.35ArgPhe: 3.35 ± 0.435
4.488ArgGly: 4.488 ± 0.576
1.58ArgHis: 1.58 ± 0.352
2.592ArgIle: 2.592 ± 0.437
3.287ArgLys: 3.287 ± 0.539
5.31ArgLeu: 5.31 ± 0.658
1.96ArgMet: 1.96 ± 0.339
1.96ArgAsn: 1.96 ± 0.313
2.149ArgPro: 2.149 ± 0.394
2.529ArgGln: 2.529 ± 0.429
5.12ArgArg: 5.12 ± 0.889
3.793ArgSer: 3.793 ± 0.649
3.477ArgThr: 3.477 ± 0.461
4.994ArgVal: 4.994 ± 0.687
1.138ArgTrp: 1.138 ± 0.277
1.96ArgTyr: 1.96 ± 0.318
0.0ArgXaa: 0.0 ± 0.0
Ser
5.942SerAla: 5.942 ± 0.635
0.316SerCys: 0.316 ± 0.127
3.35SerAsp: 3.35 ± 0.473
4.615SerGlu: 4.615 ± 0.614
2.023SerPhe: 2.023 ± 0.401
5.12SerGly: 5.12 ± 0.663
1.138SerHis: 1.138 ± 0.235
2.592SerIle: 2.592 ± 0.644
1.96SerLys: 1.96 ± 0.374
6.005SerLeu: 6.005 ± 0.588
1.011SerMet: 1.011 ± 0.289
1.896SerAsn: 1.896 ± 0.428
2.718SerPro: 2.718 ± 0.434
1.77SerGln: 1.77 ± 0.392
3.919SerArg: 3.919 ± 0.542
3.54SerSer: 3.54 ± 0.52
3.603SerThr: 3.603 ± 0.515
4.172SerVal: 4.172 ± 0.529
1.075SerTrp: 1.075 ± 0.243
2.086SerTyr: 2.086 ± 0.436
0.0SerXaa: 0.0 ± 0.0
Thr
6.132ThrAla: 6.132 ± 0.797
0.316ThrCys: 0.316 ± 0.147
3.098ThrAsp: 3.098 ± 0.388
4.172ThrGlu: 4.172 ± 0.581
2.339ThrPhe: 2.339 ± 0.557
5.5ThrGly: 5.5 ± 0.965
1.833ThrHis: 1.833 ± 0.407
2.845ThrIle: 2.845 ± 0.538
2.529ThrLys: 2.529 ± 0.441
4.994ThrLeu: 4.994 ± 0.587
1.011ThrMet: 1.011 ± 0.276
2.213ThrAsn: 2.213 ± 0.384
4.046ThrPro: 4.046 ± 0.591
1.644ThrGln: 1.644 ± 0.288
3.161ThrArg: 3.161 ± 0.606
3.983ThrSer: 3.983 ± 0.733
5.057ThrThr: 5.057 ± 0.689
5.626ThrVal: 5.626 ± 0.726
0.948ThrTrp: 0.948 ± 0.26
2.465ThrTyr: 2.465 ± 0.339
0.0ThrXaa: 0.0 ± 0.0
Val
7.586ValAla: 7.586 ± 0.733
0.443ValCys: 0.443 ± 0.184
4.172ValAsp: 4.172 ± 0.461
4.994ValGlu: 4.994 ± 0.595
2.149ValPhe: 2.149 ± 0.329
6.574ValGly: 6.574 ± 0.7
1.707ValHis: 1.707 ± 0.372
2.971ValIle: 2.971 ± 0.415
4.172ValLys: 4.172 ± 0.533
6.069ValLeu: 6.069 ± 0.705
1.58ValMet: 1.58 ± 0.33
2.276ValAsn: 2.276 ± 0.389
3.161ValPro: 3.161 ± 0.358
2.213ValGln: 2.213 ± 0.423
4.425ValArg: 4.425 ± 0.585
3.919ValSer: 3.919 ± 0.473
4.678ValThr: 4.678 ± 0.642
5.057ValVal: 5.057 ± 0.647
1.707ValTrp: 1.707 ± 0.379
1.644ValTyr: 1.644 ± 0.377
0.0ValXaa: 0.0 ± 0.0
Trp
2.086TrpAla: 2.086 ± 0.368
0.316TrpCys: 0.316 ± 0.131
1.58TrpAsp: 1.58 ± 0.346
1.517TrpGlu: 1.517 ± 0.403
0.443TrpPhe: 0.443 ± 0.163
1.264TrpGly: 1.264 ± 0.284
0.253TrpHis: 0.253 ± 0.121
0.695TrpIle: 0.695 ± 0.243
1.201TrpLys: 1.201 ± 0.279
1.264TrpLeu: 1.264 ± 0.306
0.569TrpMet: 0.569 ± 0.186
0.822TrpAsn: 0.822 ± 0.239
0.379TrpPro: 0.379 ± 0.163
0.443TrpGln: 0.443 ± 0.179
1.517TrpArg: 1.517 ± 0.387
1.517TrpSer: 1.517 ± 0.258
1.454TrpThr: 1.454 ± 0.302
1.58TrpVal: 1.58 ± 0.321
0.126TrpTrp: 0.126 ± 0.145
0.316TrpTyr: 0.316 ± 0.15
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.655TyrAla: 2.655 ± 0.364
0.19TyrCys: 0.19 ± 0.111
2.465TyrAsp: 2.465 ± 0.444
2.465TyrGlu: 2.465 ± 0.446
0.885TyrPhe: 0.885 ± 0.204
3.35TyrGly: 3.35 ± 0.513
0.443TyrHis: 0.443 ± 0.169
0.506TyrIle: 0.506 ± 0.195
0.759TyrLys: 0.759 ± 0.234
2.213TyrLeu: 2.213 ± 0.376
0.632TyrMet: 0.632 ± 0.197
0.822TyrAsn: 0.822 ± 0.232
1.58TyrPro: 1.58 ± 0.372
0.885TyrGln: 0.885 ± 0.259
1.77TyrArg: 1.77 ± 0.387
2.023TyrSer: 2.023 ± 0.422
1.77TyrThr: 1.77 ± 0.29
1.77TyrVal: 1.77 ± 0.349
0.569TyrTrp: 0.569 ± 0.214
0.948TyrTyr: 0.948 ± 0.187
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 76 proteins (15820 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski