Amino acid dipepetide frequency for Streptomyces phage phiSAJS1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
22.77AlaAla: 22.77 ± 1.974
1.597AlaCys: 1.597 ± 0.429
11.001AlaAsp: 11.001 ± 0.983
6.979AlaGlu: 6.979 ± 0.78
3.726AlaPhe: 3.726 ± 0.5
11.651AlaGly: 11.651 ± 0.738
2.188AlaHis: 2.188 ± 0.369
3.844AlaIle: 3.844 ± 0.885
4.968AlaLys: 4.968 ± 0.751
11.651AlaLeu: 11.651 ± 1.377
2.78AlaMet: 2.78 ± 0.51
3.43AlaAsn: 3.43 ± 0.497
6.861AlaPro: 6.861 ± 0.72
5.145AlaGln: 5.145 ± 0.573
7.452AlaArg: 7.452 ± 0.638
8.458AlaSer: 8.458 ± 0.812
8.753AlaThr: 8.753 ± 0.843
10.705AlaVal: 10.705 ± 0.842
1.774AlaTrp: 1.774 ± 0.333
3.194AlaTyr: 3.194 ± 0.378
0.0AlaXaa: 0.0 ± 0.0
Cys
1.597CysAla: 1.597 ± 0.398
0.355CysCys: 0.355 ± 0.17
0.769CysAsp: 0.769 ± 0.209
0.473CysGlu: 0.473 ± 0.146
0.473CysPhe: 0.473 ± 0.163
1.597CysGly: 1.597 ± 0.562
0.473CysHis: 0.473 ± 0.165
0.355CysIle: 0.355 ± 0.183
0.237CysLys: 0.237 ± 0.11
0.887CysLeu: 0.887 ± 0.254
0.414CysMet: 0.414 ± 0.15
0.414CysAsn: 0.414 ± 0.19
1.301CysPro: 1.301 ± 0.333
0.71CysGln: 0.71 ± 0.199
1.065CysArg: 1.065 ± 0.298
0.828CysSer: 0.828 ± 0.243
0.651CysThr: 0.651 ± 0.182
1.183CysVal: 1.183 ± 0.298
0.296CysTrp: 0.296 ± 0.129
0.473CysTyr: 0.473 ± 0.165
0.0CysXaa: 0.0 ± 0.0
Asp
9.759AspAla: 9.759 ± 0.921
1.479AspCys: 1.479 ± 0.355
5.264AspAsp: 5.264 ± 0.936
2.602AspGlu: 2.602 ± 0.385
1.774AspPhe: 1.774 ± 0.284
8.162AspGly: 8.162 ± 0.777
1.479AspHis: 1.479 ± 0.383
1.597AspIle: 1.597 ± 0.366
1.715AspLys: 1.715 ± 0.364
5.323AspLeu: 5.323 ± 0.509
2.011AspMet: 2.011 ± 0.466
1.656AspAsn: 1.656 ± 0.32
3.371AspPro: 3.371 ± 0.551
3.016AspGln: 3.016 ± 0.433
6.033AspArg: 6.033 ± 0.617
2.78AspSer: 2.78 ± 0.37
5.027AspThr: 5.027 ± 0.565
5.145AspVal: 5.145 ± 0.484
1.538AspTrp: 1.538 ± 0.466
1.242AspTyr: 1.242 ± 0.31
0.0AspXaa: 0.0 ± 0.0
Glu
8.458GluAla: 8.458 ± 0.856
0.591GluCys: 0.591 ± 0.204
2.661GluAsp: 2.661 ± 0.446
1.774GluGlu: 1.774 ± 0.343
0.887GluPhe: 0.887 ± 0.229
4.258GluGly: 4.258 ± 0.568
0.946GluHis: 0.946 ± 0.243
1.124GluIle: 1.124 ± 0.265
0.237GluLys: 0.237 ± 0.11
4.672GluLeu: 4.672 ± 0.545
0.651GluMet: 0.651 ± 0.195
0.591GluAsn: 0.591 ± 0.163
1.715GluPro: 1.715 ± 0.323
2.307GluGln: 2.307 ± 0.316
3.194GluArg: 3.194 ± 0.482
1.005GluSer: 1.005 ± 0.279
1.242GluThr: 1.242 ± 0.265
3.194GluVal: 3.194 ± 0.51
0.828GluTrp: 0.828 ± 0.181
0.946GluTyr: 0.946 ± 0.264
0.0GluXaa: 0.0 ± 0.0
Phe
2.721PheAla: 2.721 ± 0.488
0.237PheCys: 0.237 ± 0.122
2.188PheAsp: 2.188 ± 0.442
0.828PheGlu: 0.828 ± 0.241
0.828PhePhe: 0.828 ± 0.213
3.016PheGly: 3.016 ± 0.405
0.473PheHis: 0.473 ± 0.226
0.591PheIle: 0.591 ± 0.175
0.769PheLys: 0.769 ± 0.24
1.893PheLeu: 1.893 ± 0.327
1.005PheMet: 1.005 ± 0.254
1.005PheAsn: 1.005 ± 0.24
1.183PhePro: 1.183 ± 0.254
1.183PheGln: 1.183 ± 0.232
1.893PheArg: 1.893 ± 0.305
1.715PheSer: 1.715 ± 0.366
2.307PheThr: 2.307 ± 0.373
2.07PheVal: 2.07 ± 0.365
0.296PheTrp: 0.296 ± 0.131
0.355PheTyr: 0.355 ± 0.126
0.0PheXaa: 0.0 ± 0.0
Gly
10.528GlyAla: 10.528 ± 0.716
1.833GlyCys: 1.833 ± 0.497
5.796GlyAsp: 5.796 ± 0.763
3.549GlyGlu: 3.549 ± 0.458
3.549GlyPhe: 3.549 ± 0.56
9.404GlyGly: 9.404 ± 1.545
2.425GlyHis: 2.425 ± 0.495
3.371GlyIle: 3.371 ± 0.399
3.903GlyLys: 3.903 ± 0.513
6.269GlyLeu: 6.269 ± 0.996
2.721GlyMet: 2.721 ± 0.401
2.602GlyAsn: 2.602 ± 0.461
4.495GlyPro: 4.495 ± 0.561
4.317GlyGln: 4.317 ± 0.546
5.737GlyArg: 5.737 ± 0.486
5.205GlySer: 5.205 ± 0.791
7.097GlyThr: 7.097 ± 0.723
6.683GlyVal: 6.683 ± 0.826
1.597GlyTrp: 1.597 ± 0.375
3.135GlyTyr: 3.135 ± 0.517
0.0GlyXaa: 0.0 ± 0.0
His
2.07HisAla: 2.07 ± 0.409
0.296HisCys: 0.296 ± 0.132
0.887HisAsp: 0.887 ± 0.261
0.71HisGlu: 0.71 ± 0.193
0.414HisPhe: 0.414 ± 0.145
1.774HisGly: 1.774 ± 0.419
0.769HisHis: 0.769 ± 0.21
0.887HisIle: 0.887 ± 0.228
0.414HisLys: 0.414 ± 0.16
1.893HisLeu: 1.893 ± 0.42
0.414HisMet: 0.414 ± 0.147
0.71HisAsn: 0.71 ± 0.188
1.242HisPro: 1.242 ± 0.283
1.124HisGln: 1.124 ± 0.262
1.656HisArg: 1.656 ± 0.282
0.946HisSer: 0.946 ± 0.185
1.479HisThr: 1.479 ± 0.308
2.011HisVal: 2.011 ± 0.37
0.473HisTrp: 0.473 ± 0.208
0.414HisTyr: 0.414 ± 0.137
0.0HisXaa: 0.0 ± 0.0
Ile
3.489IleAla: 3.489 ± 0.6
0.473IleCys: 0.473 ± 0.139
2.129IleAsp: 2.129 ± 0.318
1.183IleGlu: 1.183 ± 0.22
0.828IlePhe: 0.828 ± 0.223
3.016IleGly: 3.016 ± 0.626
0.828IleHis: 0.828 ± 0.23
1.301IleIle: 1.301 ± 0.304
1.419IleLys: 1.419 ± 0.396
2.78IleLeu: 2.78 ± 0.437
0.473IleMet: 0.473 ± 0.141
1.479IleAsn: 1.479 ± 0.319
1.656IlePro: 1.656 ± 0.246
1.715IleGln: 1.715 ± 0.295
3.135IleArg: 3.135 ± 0.394
2.07IleSer: 2.07 ± 0.31
3.667IleThr: 3.667 ± 0.562
1.656IleVal: 1.656 ± 0.299
0.059IleTrp: 0.059 ± 0.07
0.414IleTyr: 0.414 ± 0.162
0.0IleXaa: 0.0 ± 0.0
Lys
6.033LysAla: 6.033 ± 0.621
0.237LysCys: 0.237 ± 0.114
2.425LysAsp: 2.425 ± 0.465
1.183LysGlu: 1.183 ± 0.272
0.651LysPhe: 0.651 ± 0.188
3.43LysGly: 3.43 ± 0.5
0.769LysHis: 0.769 ± 0.197
0.71LysIle: 0.71 ± 0.179
1.656LysLys: 1.656 ± 0.49
2.957LysLeu: 2.957 ± 0.387
0.828LysMet: 0.828 ± 0.245
0.237LysAsn: 0.237 ± 0.107
2.011LysPro: 2.011 ± 0.52
1.774LysGln: 1.774 ± 0.345
2.188LysArg: 2.188 ± 0.459
2.011LysSer: 2.011 ± 0.389
1.774LysThr: 1.774 ± 0.344
2.129LysVal: 2.129 ± 0.348
0.532LysTrp: 0.532 ± 0.15
0.946LysTyr: 0.946 ± 0.275
0.0LysXaa: 0.0 ± 0.0
Leu
13.485LeuAla: 13.485 ± 1.112
0.651LeuCys: 0.651 ± 0.163
6.033LeuAsp: 6.033 ± 0.676
4.909LeuGlu: 4.909 ± 0.73
1.597LeuPhe: 1.597 ± 0.356
6.269LeuGly: 6.269 ± 0.678
1.538LeuHis: 1.538 ± 0.296
2.957LeuIle: 2.957 ± 0.522
3.903LeuLys: 3.903 ± 0.44
7.452LeuLeu: 7.452 ± 0.645
1.479LeuMet: 1.479 ± 0.279
2.78LeuAsn: 2.78 ± 0.468
4.14LeuPro: 4.14 ± 0.498
1.656LeuGln: 1.656 ± 0.382
5.086LeuArg: 5.086 ± 0.638
4.022LeuSer: 4.022 ± 0.537
5.145LeuThr: 5.145 ± 0.522
5.974LeuVal: 5.974 ± 0.581
1.065LeuTrp: 1.065 ± 0.231
1.597LeuTyr: 1.597 ± 0.26
0.0LeuXaa: 0.0 ± 0.0
Met
2.898MetAla: 2.898 ± 0.407
0.296MetCys: 0.296 ± 0.131
1.597MetAsp: 1.597 ± 0.282
1.183MetGlu: 1.183 ± 0.258
0.532MetPhe: 0.532 ± 0.154
1.893MetGly: 1.893 ± 0.369
0.414MetHis: 0.414 ± 0.178
0.887MetIle: 0.887 ± 0.219
1.301MetLys: 1.301 ± 0.333
1.538MetLeu: 1.538 ± 0.367
0.414MetMet: 0.414 ± 0.155
0.828MetAsn: 0.828 ± 0.21
1.242MetPro: 1.242 ± 0.283
0.473MetGln: 0.473 ± 0.165
1.242MetArg: 1.242 ± 0.29
0.946MetSer: 0.946 ± 0.239
2.188MetThr: 2.188 ± 0.331
1.833MetVal: 1.833 ± 0.361
0.355MetTrp: 0.355 ± 0.146
0.591MetTyr: 0.591 ± 0.186
0.0MetXaa: 0.0 ± 0.0
Asn
3.963AsnAla: 3.963 ± 0.573
0.71AsnCys: 0.71 ± 0.18
1.301AsnAsp: 1.301 ± 0.314
0.769AsnGlu: 0.769 ± 0.193
0.828AsnPhe: 0.828 ± 0.222
3.844AsnGly: 3.844 ± 0.728
0.769AsnHis: 0.769 ± 0.198
1.479AsnIle: 1.479 ± 0.309
0.473AsnLys: 0.473 ± 0.163
1.833AsnLeu: 1.833 ± 0.369
0.887AsnMet: 0.887 ± 0.24
1.242AsnAsn: 1.242 ± 0.333
2.307AsnPro: 2.307 ± 0.422
0.946AsnGln: 0.946 ± 0.224
1.656AsnArg: 1.656 ± 0.306
1.419AsnSer: 1.419 ± 0.345
1.597AsnThr: 1.597 ± 0.275
2.484AsnVal: 2.484 ± 0.408
0.355AsnTrp: 0.355 ± 0.121
0.296AsnTyr: 0.296 ± 0.136
0.0AsnXaa: 0.0 ± 0.0
Pro
6.624ProAla: 6.624 ± 0.726
0.828ProCys: 0.828 ± 0.276
3.844ProAsp: 3.844 ± 0.547
3.016ProGlu: 3.016 ± 0.45
1.183ProPhe: 1.183 ± 0.248
4.85ProGly: 4.85 ± 0.575
1.065ProHis: 1.065 ± 0.217
1.833ProIle: 1.833 ± 0.361
1.656ProLys: 1.656 ± 0.431
3.194ProLeu: 3.194 ± 0.547
1.005ProMet: 1.005 ± 0.23
0.828ProAsn: 0.828 ± 0.233
1.656ProPro: 1.656 ± 0.374
1.833ProGln: 1.833 ± 0.375
2.957ProArg: 2.957 ± 0.49
2.898ProSer: 2.898 ± 0.401
3.489ProThr: 3.489 ± 0.545
5.205ProVal: 5.205 ± 0.51
1.36ProTrp: 1.36 ± 0.328
1.065ProTyr: 1.065 ± 0.281
0.0ProXaa: 0.0 ± 0.0
Gln
4.436GlnAla: 4.436 ± 0.526
0.591GlnCys: 0.591 ± 0.224
3.371GlnAsp: 3.371 ± 0.362
1.065GlnGlu: 1.065 ± 0.258
1.301GlnPhe: 1.301 ± 0.225
3.075GlnGly: 3.075 ± 0.468
0.651GlnHis: 0.651 ± 0.217
1.242GlnIle: 1.242 ± 0.327
1.065GlnLys: 1.065 ± 0.255
4.081GlnLeu: 4.081 ± 0.505
0.887GlnMet: 0.887 ± 0.229
0.828GlnAsn: 0.828 ± 0.212
2.366GlnPro: 2.366 ± 0.382
2.188GlnGln: 2.188 ± 0.435
3.016GlnArg: 3.016 ± 0.451
1.715GlnSer: 1.715 ± 0.314
2.484GlnThr: 2.484 ± 0.334
2.425GlnVal: 2.425 ± 0.348
0.769GlnTrp: 0.769 ± 0.217
0.532GlnTyr: 0.532 ± 0.177
0.0GlnXaa: 0.0 ± 0.0
Arg
7.63ArgAla: 7.63 ± 0.613
1.005ArgCys: 1.005 ± 0.291
4.317ArgAsp: 4.317 ± 0.53
2.543ArgGlu: 2.543 ± 0.429
2.011ArgPhe: 2.011 ± 0.345
4.968ArgGly: 4.968 ± 0.573
1.833ArgHis: 1.833 ± 0.346
3.194ArgIle: 3.194 ± 0.45
2.366ArgLys: 2.366 ± 0.39
6.328ArgLeu: 6.328 ± 0.589
1.952ArgMet: 1.952 ± 0.311
2.78ArgAsn: 2.78 ± 0.373
2.661ArgPro: 2.661 ± 0.483
2.307ArgGln: 2.307 ± 0.373
5.205ArgArg: 5.205 ± 0.553
2.425ArgSer: 2.425 ± 0.33
3.903ArgThr: 3.903 ± 0.539
5.619ArgVal: 5.619 ± 0.664
0.946ArgTrp: 0.946 ± 0.266
2.129ArgTyr: 2.129 ± 0.419
0.0ArgXaa: 0.0 ± 0.0
Ser
5.678SerAla: 5.678 ± 0.821
0.769SerCys: 0.769 ± 0.254
3.253SerAsp: 3.253 ± 0.376
1.833SerGlu: 1.833 ± 0.342
2.011SerPhe: 2.011 ± 0.274
5.855SerGly: 5.855 ± 0.79
0.651SerHis: 0.651 ± 0.204
2.011SerIle: 2.011 ± 0.331
1.952SerLys: 1.952 ± 0.383
3.785SerLeu: 3.785 ± 0.402
1.419SerMet: 1.419 ± 0.289
1.656SerAsn: 1.656 ± 0.335
2.011SerPro: 2.011 ± 0.356
1.656SerGln: 1.656 ± 0.395
3.253SerArg: 3.253 ± 0.394
2.78SerSer: 2.78 ± 0.439
3.903SerThr: 3.903 ± 0.373
4.613SerVal: 4.613 ± 0.617
1.242SerTrp: 1.242 ± 0.364
0.828SerTyr: 0.828 ± 0.211
0.0SerXaa: 0.0 ± 0.0
Thr
9.877ThrAla: 9.877 ± 0.99
1.065ThrCys: 1.065 ± 0.332
4.436ThrAsp: 4.436 ± 0.434
2.661ThrGlu: 2.661 ± 0.464
1.715ThrPhe: 1.715 ± 0.344
7.63ThrGly: 7.63 ± 0.585
1.065ThrHis: 1.065 ± 0.297
2.839ThrIle: 2.839 ± 0.397
1.893ThrLys: 1.893 ± 0.361
5.441ThrLeu: 5.441 ± 0.473
1.301ThrMet: 1.301 ± 0.325
2.188ThrAsn: 2.188 ± 0.399
4.022ThrPro: 4.022 ± 0.441
1.538ThrGln: 1.538 ± 0.357
3.135ThrArg: 3.135 ± 0.407
3.312ThrSer: 3.312 ± 0.49
4.495ThrThr: 4.495 ± 0.483
6.92ThrVal: 6.92 ± 0.648
1.183ThrTrp: 1.183 ± 0.277
1.36ThrTyr: 1.36 ± 0.297
0.0ThrXaa: 0.0 ± 0.0
Val
12.716ValAla: 12.716 ± 0.876
0.887ValCys: 0.887 ± 0.221
6.21ValAsp: 6.21 ± 0.547
2.602ValGlu: 2.602 ± 0.377
1.538ValPhe: 1.538 ± 0.3
5.796ValGly: 5.796 ± 0.625
1.301ValHis: 1.301 ± 0.314
2.484ValIle: 2.484 ± 0.418
3.371ValLys: 3.371 ± 0.475
6.21ValLeu: 6.21 ± 0.761
1.419ValMet: 1.419 ± 0.306
2.721ValAsn: 2.721 ± 0.319
4.14ValPro: 4.14 ± 0.574
3.253ValGln: 3.253 ± 0.442
5.145ValArg: 5.145 ± 0.501
4.436ValSer: 4.436 ± 0.702
6.269ValThr: 6.269 ± 0.606
5.796ValVal: 5.796 ± 0.624
1.005ValTrp: 1.005 ± 0.245
1.774ValTyr: 1.774 ± 0.364
0.0ValXaa: 0.0 ± 0.0
Trp
1.479TrpAla: 1.479 ± 0.319
0.237TrpCys: 0.237 ± 0.105
1.479TrpAsp: 1.479 ± 0.343
0.532TrpGlu: 0.532 ± 0.159
0.414TrpPhe: 0.414 ± 0.169
0.946TrpGly: 0.946 ± 0.241
0.532TrpHis: 0.532 ± 0.197
0.355TrpIle: 0.355 ± 0.159
0.828TrpLys: 0.828 ± 0.199
1.597TrpLeu: 1.597 ± 0.374
0.237TrpMet: 0.237 ± 0.12
0.532TrpAsn: 0.532 ± 0.156
1.242TrpPro: 1.242 ± 0.294
0.532TrpGln: 0.532 ± 0.178
1.005TrpArg: 1.005 ± 0.292
1.124TrpSer: 1.124 ± 0.214
1.36TrpThr: 1.36 ± 0.324
1.419TrpVal: 1.419 ± 0.322
0.237TrpTrp: 0.237 ± 0.124
0.237TrpTyr: 0.237 ± 0.119
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.721TyrAla: 2.721 ± 0.382
0.414TyrCys: 0.414 ± 0.155
2.011TyrAsp: 2.011 ± 0.438
0.887TyrGlu: 0.887 ± 0.22
0.296TyrPhe: 0.296 ± 0.13
2.839TyrGly: 2.839 ± 0.486
0.414TyrHis: 0.414 ± 0.138
0.71TyrIle: 0.71 ± 0.263
0.355TyrLys: 0.355 ± 0.137
1.952TyrLeu: 1.952 ± 0.31
0.296TyrMet: 0.296 ± 0.118
0.651TyrAsn: 0.651 ± 0.146
0.769TyrPro: 0.769 ± 0.199
0.414TyrGln: 0.414 ± 0.158
2.07TyrArg: 2.07 ± 0.353
1.065TyrSer: 1.065 ± 0.303
1.242TyrThr: 1.242 ± 0.286
2.011TyrVal: 2.011 ± 0.36
0.414TyrTrp: 0.414 ± 0.173
0.355TyrTyr: 0.355 ± 0.113
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 76 proteins (16909 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski