Amino acid dipeptide frequency for Streptomyces phage Asten

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
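As a rough illustration of what such a calculation involves, the sketch below counts overlapping adjacent residue pairs within each protein and reports them in per mille. It is only a minimal sketch under assumptions: the function name `dipeptide_per_mille`, the use of one-letter codes (the table on this page uses three-letter codes), and the mapping of every non-standard letter to `X` (Xaa) are illustrative choices, not the procedure documented for this database.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins.

    Assumption: dipeptides are overlapping adjacent pairs counted inside
    each protein, so a pair never spans two proteins. Any letter outside
    the 20 standard amino acids is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the phage proteome).
toy = ["MAAL", "MALU"]
freqs = dipeptide_per_mille(toy)
for dp in sorted(freqs):
    print(dp, round(freqs[dp], 3))
```

The toy input yields six dipeptides in total, so each single occurrence contributes one sixth of 1000 ‰.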

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each would be present with a frequency of 1/400 = 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
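The expected value and the over/under comparison can be sketched as follows. This is a hedged illustration: the function `classify` is hypothetical, and whether the page's coloring uses exactly this threshold (or takes the error estimate into account) is not stated in the text. The three example values are copied from the table on this page.

```python
N_STANDARD = 20  # standard amino acids; 20 * 20 = 400 possible dipeptides

# Uniform expectation: 1/400 = 0.25 % = 2.5 per mille.
expected_per_mille = 1000.0 / N_STANDARD ** 2


def classify(freq_per_mille, expected=expected_per_mille):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


# Values in per mille, taken from the table.
examples = {"AlaAla": 11.004, "CysCys": 0.063, "LeuLeu": 6.893}
labels = {dp: classify(value) for dp, value in examples.items()}
for dp, label in labels.items():
    print(dp, examples[dp], label)
```

Because the table is in per mille, the 0.25% expectation corresponds to 2.5 in the table's units.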

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions (or divided by 10 to obtain percentages).

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second. Each cell is the frequency ± estimated error, in ‰.

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 11.004 ± 1.011 | 0.443 ± 0.174 | 6.45 ± 0.602 | 8.158 ± 0.928 | 3.415 ± 0.571 | 7.02 ± 0.7 | 1.644 ± 0.413 | 4.743 ± 0.607 | 4.553 ± 0.566 | 10.498 ± 1.182 | 3.099 ± 0.382 | 3.541 ± 0.547 | 4.806 ± 0.636 | 3.794 ± 0.481 | 6.008 ± 0.668 | 5.059 ± 0.807 | 5.502 ± 0.73 | 8.158 ± 0.812 | 2.15 ± 0.426 | 3.352 ± 0.569 | 0.0 ± 0.0
Cys | 0.759 ± 0.209 | 0.063 ± 0.059 | 0.506 ± 0.155 | 0.632 ± 0.181 | 0.19 ± 0.116 | 0.759 ± 0.182 | 0.19 ± 0.113 | 0.379 ± 0.154 | 0.316 ± 0.151 | 0.379 ± 0.152 | 0.126 ± 0.082 | 0.253 ± 0.141 | 0.696 ± 0.306 | 0.063 ± 0.06 | 0.316 ± 0.179 | 0.506 ± 0.197 | 0.253 ± 0.139 | 0.443 ± 0.145 | 0.126 ± 0.097 | 0.316 ± 0.171 | 0.0 ± 0.0
Asp | 7.272 ± 0.743 | 0.253 ± 0.119 | 3.984 ± 0.587 | 4.68 ± 0.617 | 1.96 ± 0.358 | 6.83 ± 0.689 | 1.138 ± 0.372 | 2.783 ± 0.462 | 2.087 ± 0.378 | 6.008 ± 0.757 | 1.834 ± 0.257 | 1.581 ± 0.313 | 4.427 ± 0.537 | 1.644 ± 0.291 | 3.668 ± 0.58 | 3.352 ± 0.394 | 4.047 ± 0.451 | 4.047 ± 0.53 | 1.771 ± 0.331 | 1.581 ± 0.304 | 0.0 ± 0.0
Glu | 7.842 ± 0.772 | 0.885 ± 0.259 | 3.731 ± 0.48 | 5.312 ± 0.961 | 1.96 ± 0.285 | 5.565 ± 0.653 | 1.454 ± 0.42 | 4.111 ± 0.5 | 2.466 ± 0.441 | 6.45 ± 0.776 | 1.202 ± 0.245 | 2.213 ± 0.412 | 2.846 ± 0.382 | 2.213 ± 0.389 | 3.921 ± 0.557 | 4.111 ± 0.581 | 4.3 ± 0.783 | 6.071 ± 0.732 | 1.328 ± 0.288 | 2.466 ± 0.484 | 0.0 ± 0.0
Phe | 2.909 ± 0.427 | 0.316 ± 0.136 | 2.719 ± 0.437 | 2.403 ± 0.491 | 0.949 ± 0.241 | 3.225 ± 0.453 | 0.506 ± 0.19 | 1.328 ± 0.339 | 1.391 ± 0.31 | 1.96 ± 0.431 | 0.696 ± 0.237 | 1.012 ± 0.273 | 1.454 ± 0.301 | 1.391 ± 0.379 | 2.403 ± 0.395 | 1.454 ± 0.378 | 2.403 ± 0.376 | 1.771 ± 0.342 | 0.696 ± 0.212 | 1.202 ± 0.381 | 0.0 ± 0.0
Gly | 7.652 ± 0.868 | 0.506 ± 0.201 | 6.514 ± 0.841 | 5.186 ± 0.656 | 2.909 ± 0.452 | 7.525 ± 1.02 | 1.96 ± 0.364 | 3.541 ± 0.588 | 5.122 ± 0.494 | 6.514 ± 0.877 | 1.644 ± 0.33 | 2.277 ± 0.349 | 3.478 ± 0.636 | 3.099 ± 0.396 | 4.363 ± 0.54 | 5.881 ± 0.889 | 5.565 ± 0.864 | 6.514 ± 0.652 | 2.087 ± 0.407 | 3.162 ± 0.495 | 0.0 ± 0.0
His | 1.96 ± 0.424 | 0.19 ± 0.109 | 1.202 ± 0.254 | 1.202 ± 0.325 | 0.696 ± 0.19 | 1.834 ± 0.374 | 0.569 ± 0.197 | 0.632 ± 0.202 | 0.443 ± 0.149 | 1.771 ± 0.402 | 0.126 ± 0.096 | 0.632 ± 0.258 | 0.949 ± 0.26 | 0.506 ± 0.164 | 0.949 ± 0.279 | 1.328 ± 0.326 | 1.328 ± 0.27 | 1.644 ± 0.332 | 0.632 ± 0.25 | 0.885 ± 0.255 | 0.0 ± 0.0
Ile | 4.806 ± 0.43 | 0.063 ± 0.066 | 3.668 ± 0.502 | 3.794 ± 0.521 | 1.138 ± 0.21 | 3.225 ± 0.503 | 0.822 ± 0.203 | 1.96 ± 0.482 | 2.277 ± 0.508 | 3.478 ± 0.66 | 0.632 ± 0.199 | 0.949 ± 0.262 | 2.719 ± 0.349 | 1.202 ± 0.314 | 3.541 ± 0.425 | 2.024 ± 0.33 | 2.593 ± 0.383 | 3.162 ± 0.375 | 0.632 ± 0.209 | 1.454 ± 0.418 | 0.0 ± 0.0
Lys | 3.731 ± 0.563 | 0.19 ± 0.117 | 2.719 ± 0.418 | 2.466 ± 0.376 | 1.265 ± 0.298 | 5.059 ± 0.618 | 0.569 ± 0.187 | 2.087 ± 0.419 | 1.771 ± 0.407 | 3.794 ± 0.463 | 0.949 ± 0.2 | 1.391 ± 0.323 | 3.099 ± 0.53 | 1.518 ± 0.268 | 3.415 ± 0.548 | 2.277 ± 0.338 | 2.783 ± 0.495 | 2.783 ± 0.423 | 0.885 ± 0.241 | 0.949 ± 0.271 | 0.0 ± 0.0
Leu | 10.181 ± 0.851 | 0.506 ± 0.178 | 6.134 ± 0.779 | 4.743 ± 0.608 | 2.087 ± 0.346 | 6.64 ± 0.799 | 1.834 ± 0.378 | 4.174 ± 0.521 | 3.541 ± 0.507 | 6.893 ± 0.704 | 1.897 ± 0.365 | 3.288 ± 0.427 | 4.363 ± 0.578 | 2.403 ± 0.381 | 6.008 ± 0.655 | 5.059 ± 0.626 | 5.439 ± 0.644 | 6.64 ± 0.615 | 1.328 ± 0.298 | 1.897 ± 0.36 | 0.0 ± 0.0
Met | 3.225 ± 0.39 | 0.063 ± 0.07 | 0.632 ± 0.236 | 1.138 ± 0.238 | 0.379 ± 0.192 | 1.834 ± 0.518 | 0.253 ± 0.134 | 1.012 ± 0.247 | 1.012 ± 0.293 | 1.644 ± 0.317 | 0.506 ± 0.234 | 0.569 ± 0.193 | 1.518 ± 0.278 | 0.569 ± 0.168 | 1.518 ± 0.338 | 2.403 ± 0.406 | 2.087 ± 0.38 | 1.391 ± 0.306 | 0.19 ± 0.109 | 0.126 ± 0.098 | 0.0 ± 0.0
Asn | 3.035 ± 0.48 | 0.696 ± 0.287 | 1.581 ± 0.308 | 2.024 ± 0.329 | 1.138 ± 0.246 | 3.225 ± 0.478 | 1.012 ± 0.315 | 1.265 ± 0.222 | 1.202 ± 0.336 | 3.099 ± 0.472 | 0.443 ± 0.161 | 0.696 ± 0.247 | 1.644 ± 0.293 | 0.822 ± 0.251 | 2.213 ± 0.324 | 1.581 ± 0.363 | 2.213 ± 0.477 | 1.834 ± 0.346 | 0.506 ± 0.171 | 0.822 ± 0.191 | 0.0 ± 0.0
Pro | 5.059 ± 0.61 | 0.632 ± 0.197 | 3.541 ± 0.478 | 4.363 ± 0.473 | 1.518 ± 0.333 | 4.363 ± 0.426 | 0.632 ± 0.222 | 2.34 ± 0.519 | 2.846 ± 0.493 | 3.099 ± 0.396 | 0.949 ± 0.246 | 1.707 ± 0.45 | 1.834 ± 0.393 | 1.581 ± 0.289 | 2.34 ± 0.406 | 2.909 ± 0.492 | 3.731 ± 0.597 | 4.174 ± 0.512 | 0.569 ± 0.192 | 1.012 ± 0.275 | 0.0 ± 0.0
Gln | 3.794 ± 0.5 | 0.063 ± 0.068 | 1.707 ± 0.324 | 2.213 ± 0.424 | 1.138 ± 0.28 | 1.771 ± 0.334 | 0.569 ± 0.184 | 2.213 ± 0.42 | 1.771 ± 0.336 | 2.972 ± 0.503 | 0.949 ± 0.22 | 1.075 ± 0.263 | 1.265 ± 0.283 | 0.569 ± 0.178 | 3.099 ± 0.577 | 1.834 ± 0.334 | 1.644 ± 0.389 | 2.213 ± 0.345 | 0.379 ± 0.159 | 0.822 ± 0.297 | 0.0 ± 0.0
Arg | 5.375 ± 0.652 | 0.822 ± 0.36 | 4.427 ± 0.615 | 4.49 ± 0.775 | 2.909 ± 0.323 | 3.921 ± 0.519 | 1.518 ± 0.386 | 1.96 ± 0.29 | 3.225 ± 0.503 | 5.692 ± 0.661 | 1.644 ± 0.318 | 1.834 ± 0.321 | 2.34 ± 0.41 | 2.719 ± 0.424 | 5.944 ± 1.029 | 4.3 ± 0.749 | 3.605 ± 0.418 | 5.122 ± 0.674 | 1.265 ± 0.301 | 2.213 ± 0.41 | 0.0 ± 0.0
Ser | 5.755 ± 0.663 | 0.19 ± 0.103 | 3.478 ± 0.579 | 3.921 ± 0.528 | 2.15 ± 0.367 | 5.565 ± 0.745 | 0.885 ± 0.225 | 2.593 ± 0.466 | 1.96 ± 0.348 | 5.755 ± 0.662 | 1.012 ± 0.228 | 1.96 ± 0.469 | 2.783 ± 0.365 | 2.024 ± 0.41 | 3.984 ± 0.619 | 3.415 ± 0.591 | 3.415 ± 0.653 | 4.427 ± 0.525 | 1.138 ± 0.264 | 1.644 ± 0.37 | 0.0 ± 0.0
Thr | 6.071 ± 0.887 | 0.316 ± 0.139 | 3.541 ± 0.607 | 4.237 ± 0.487 | 2.593 ± 0.522 | 5.881 ± 1.184 | 1.707 ± 0.391 | 2.277 ± 0.439 | 2.277 ± 0.442 | 5.186 ± 0.567 | 0.949 ± 0.251 | 2.213 ± 0.361 | 3.921 ± 0.82 | 2.213 ± 0.355 | 3.225 ± 0.549 | 3.352 ± 0.578 | 4.996 ± 0.634 | 4.806 ± 0.54 | 1.265 ± 0.262 | 2.972 ± 0.457 | 0.0 ± 0.0
Val | 8.348 ± 0.633 | 0.632 ± 0.225 | 4.616 ± 0.499 | 5.439 ± 0.671 | 2.15 ± 0.385 | 6.261 ± 0.664 | 1.644 ± 0.438 | 3.352 ± 0.485 | 3.921 ± 0.497 | 6.197 ± 0.734 | 1.897 ± 0.351 | 2.277 ± 0.425 | 3.162 ± 0.389 | 2.593 ± 0.463 | 4.68 ± 0.61 | 3.731 ± 0.532 | 4.743 ± 0.501 | 4.49 ± 0.562 | 1.581 ± 0.303 | 1.518 ± 0.296 | 0.0 ± 0.0
Trp | 1.96 ± 0.324 | 0.379 ± 0.129 | 1.518 ± 0.308 | 1.454 ± 0.376 | 0.569 ± 0.183 | 1.391 ± 0.314 | 0.19 ± 0.113 | 0.759 ± 0.257 | 1.012 ± 0.251 | 1.454 ± 0.407 | 0.569 ± 0.151 | 0.632 ± 0.256 | 0.379 ± 0.176 | 0.379 ± 0.172 | 1.771 ± 0.39 | 1.454 ± 0.327 | 1.391 ± 0.301 | 1.581 ± 0.348 | 0.126 ± 0.121 | 0.316 ± 0.158 | 0.0 ± 0.0
Tyr | 2.719 ± 0.329 | 0.063 ± 0.074 | 2.277 ± 0.536 | 2.593 ± 0.528 | 1.075 ± 0.251 | 3.605 ± 0.528 | 0.379 ± 0.182 | 0.632 ± 0.239 | 0.506 ± 0.17 | 2.213 ± 0.404 | 0.885 ± 0.202 | 0.949 ± 0.266 | 1.518 ± 0.403 | 0.759 ± 0.208 | 1.96 ± 0.396 | 2.024 ± 0.392 | 1.96 ± 0.366 | 1.96 ± 0.385 | 0.632 ± 0.289 | 0.885 ± 0.206 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 73 proteins (15814 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski